How to preprocess the data?

hmantv_83 · January 13, 2021, 8:40am

Hi! Would normalizing the values be helpful in this case for X_train? Also how to choose what kind of preprocessing to do for each type of data (e.g. normalizing, label encoding, one hot encoding etc.)? Thanks in advance.

prashant_ml · January 13, 2021, 9:00am

hey @hmantv_83 ,
preprocessing is not such decidable , it basically depends on what kind of data are you working on.
So , like if you are working with linear models like linear or logistic regression then , normalizing or standardizing the data would be helpful.
If you are using tree structures then feature extraction would be much helpful

So just try various different things as much as you can to improve the score.

I hope this helps.

hmantv_83 · January 13, 2021, 10:23am

The maximum accuracy I am able to get is 79% at k=26. How can I increase this further?

prashant_ml · January 13, 2021, 10:30am

to further increase it
what you can do is ,

Feature engineering
or try working on another model

prashant_ml · January 18, 2021, 5:37pm

I hope I’ve cleared your doubt. I ask you to please rate your experience here
Your feedback is very important. It helps us improve our platform and hence provide you
the learning experience you deserve.

On the off chance, you still have some questions or not find the answers satisfactory, you may reopen
the doubt.