I would like to normalize (z-score, minmax, etc.) my predictor variables for several machine learning algorithms (a neural network and a logistic regression), and I am wondering:

1) Should I normalize all of the predictor data, that is, both the training AND the test data?

2) Should I normalize my target variable, y?

1 Answer

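1) The correct procedure is to compute the normalization parameters from the training data only (for minmax, the training minimum and maximum) and then apply those same parameters to the test data. Computing them from the test data as well would leak information about the test set into training. Here is an example of a minmax normalization with one feature: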

training = [1, 2, 3]
test = [0, 4]

Using the training minimum (1) and maximum (3), the normalized data are the following. Note that test values can fall outside [0, 1]:

training_normalized = [0.0, 0.5, 1.0]
test_normalized = [-0.5, 1.5]
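
The numbers above can be reproduced with a minimal sketch in plain Python. The helper names `fit_minmax` and `apply_minmax` are made up for this illustration and are not part of any library:

```python
def fit_minmax(values):
    """Learn the minmax parameters (min, max) from the training data only."""
    return min(values), max(values)


def apply_minmax(values, params):
    """Scale values with previously learned parameters."""
    lo, hi = params
    return [(v - lo) / (hi - lo) for v in values]


training = [1, 2, 3]
test = [0, 4]

# Parameters come from the training data and are reused for the test data.
params = fit_minmax(training)
training_normalized = apply_minmax(training, params)
test_normalized = apply_minmax(test, params)

print(training_normalized)  # [0.0, 0.5, 1.0]
print(test_normalized)      # [-0.5, 1.5]
```

If you use a library scaler instead, the same pattern applies: fit it on the training data and only transform the test data.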

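2) Generally the answer is no, but there are cases where transforming the target variable helps. In any case, make sure the output of your model can match the range of the target variable. For example, a sigmoid output unit only produces values between 0 and 1, so it cannot fit a target that lies outside that range.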
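
If you do decide to transform the target, one possible approach is to z-score it with statistics from the training targets and invert the transformation on the predictions. This is only a sketch: the data and the helper names `scale_target` and `unscale_target` are assumptions made for the illustration.

```python
from statistics import mean, pstdev

# Hypothetical training targets, used only for illustration.
y_train = [10.0, 20.0, 30.0]

# Parameters are learned from the training targets only.
y_mean = mean(y_train)
y_std = pstdev(y_train)


def scale_target(y):
    """Z-score the targets with the training mean and standard deviation."""
    return [(v - y_mean) / y_std for v in y]


def unscale_target(y_scaled):
    """Map scaled values (e.g. model predictions) back to the original units."""
    return [v * y_std + y_mean for v in y_scaled]


y_scaled = scale_target(y_train)
y_restored = unscale_target(y_scaled)
```

The model is then trained on `y_scaled`, and its predictions are passed through `unscale_target` before they are reported or evaluated in the original units.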

1 Comment

Perfect, got it! Thank you so much Georgios!!
