Normalizing Input for Machine Learning Algorithm

Question

I would like to normalize (z-score, min-max, etc.) my predictor variables for a number of machine learning algorithms (a neural network and a logistic regression), and I am wondering:

1) Should I normalize all of the predictor data, that is, both the training AND the test data?

2) Should I normalize my predicted (target) variable, y?

user9389968 · Accepted Answer · 2018-03-21 06:23:16Z


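1) The correct procedure is to compute the normalization parameters (for example, the minimum and maximum, or the mean and standard deviation) on the training data only, and then apply that same transformation to the test data. This keeps information from the test set out of the fitting step. Here is an example of min-max normalization with one feature: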

training = [1, 2, 3]
test = [0, 4]

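With the training minimum (1) and maximum (3), each value x maps to (x - 1) / (3 - 1). Test values may therefore fall outside [0, 1]. The normalized data are the following: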

training_normalized = [0.0, 0.5, 1.0]
test_normalized = [-0.5, 1.5]
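
The example above can be reproduced with a minimal sketch in plain Python. The helper names `fit_minmax` and `apply_minmax` are made up for illustration and are not from any library; the point is only that the parameters are learned from the training data and then reused for the test data.

```python
def fit_minmax(values):
    """Learn the min-max parameters from the training data only."""
    return min(values), max(values)


def apply_minmax(values, lo, hi):
    """Scale values using previously learned parameters (hypothetical helper)."""
    span = hi - lo
    if span == 0:
        raise ValueError("feature is constant; min-max scaling is undefined")
    return [(v - lo) / span for v in values]


training = [1, 2, 3]
test = [0, 4]

# Fit on the training data, then reuse the same parameters for the test data.
lo, hi = fit_minmax(training)
training_normalized = apply_minmax(training, lo, hi)
test_normalized = apply_minmax(test, lo, hi)

print(training_normalized)  # [0.0, 0.5, 1.0]
print(test_normalized)      # [-0.5, 1.5]
```

Libraries such as scikit-learn follow the same pattern with separate fit and transform steps, so the test data never influences the learned parameters.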

2) Generally the answer is no, but there are cases where transforming the target variable may help. In any case, make sure the output of your model can match the range of the target variable (for example, a sigmoid output unit can only produce values between 0 and 1).
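
If you do transform the target, one possible approach is to scale y with parameters learned from the training targets and map the model's predictions back to the original units afterwards. The sketch below assumes a min-max transformation; the target values, the `predictions_scaled` list, and the helper names are all invented for illustration and do not come from a real model.

```python
def fit_minmax(values):
    """Learn the min-max parameters from the training targets only."""
    return min(values), max(values)


def scale(values, lo, hi):
    """Map values from the original units into the scaled space."""
    return [(v - lo) / (hi - lo) for v in values]


def unscale(values, lo, hi):
    """Map scaled values back to the original units."""
    return [v * (hi - lo) + lo for v in values]


y_train = [10.0, 20.0, 30.0]  # made-up training targets
y_lo, y_hi = fit_minmax(y_train)
y_train_scaled = scale(y_train, y_lo, y_hi)

# Stand-in for what a model with a [0, 1] output range might return.
predictions_scaled = [0.25, 0.75]
predictions = unscale(predictions_scaled, y_lo, y_hi)

print(y_train_scaled)  # [0.0, 0.5, 1.0]
print(predictions)     # [15.0, 25.0]
```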


1 Comment

Niccola Tartaglia Over a year ago

Perfect, got it! Thank you so much Georgios!!
