How to preprocess the data?

Hi! Would normalizing the values be helpful in this case for X_train? Also how to choose what kind of preprocessing to do for each type of data (e.g. normalizing, label encoding, one hot encoding etc.)? Thanks in advance.

hey @hmantv_83 ,
preprocessing is not such decidable , it basically depends on what kind of data are you working on.
So , like if you are working with linear models like linear or logistic regression then , normalizing or standardizing the data would be helpful.
If you are using tree structures then feature extraction would be much helpful

So just try various different things as much as you can to improve the score.

I hope this helps.

The maximum accuracy I am able to get is 79% at k=26. How can I increase this further?

to further increase it
what you can do is ,

  1. Feature engineering
  2. or try working on another model

I hope I’ve cleared your doubt. I ask you to please rate your experience here
Your feedback is very important. It helps us improve our platform and hence provide you
the learning experience you deserve.

On the off chance, you still have some questions or not find the answers satisfactory, you may reopen
the doubt.