I am trying to learn Scikit_Learn and build an ML model. I am learning from "Hands-On Machine Learning with SciKit-Learn, Keras & TensorFlow".
In Chapter 2, there is a review of an entire ML project from the beginning until completion. One of the steps is to transform or prepare the data for the model. There are numerous transformation steps and each output is stored in a new object (mostly Dataframe).
Why is the model built with the original dataset? Wouldn't these transformations be ineffective?
I haven't cited the example from the book as it is too lengthy. If you have the book, please review and respond to my question.
Another similar instance would be from https://machinelearningmastery.com/python-machine-learning-mini-course/
In Lesson 7, the standardized data is stored in "rescaledX" and the model is built from X which is the original dataset. How is the standardized data used in the model that comes afterwards?