I am struggling a bit in figuring out which columns to remove and which columns to keep even after reading the description of the dataset.
- There are times when the description of a certain column doesn’t make any sense even after googling.
- There are times when a certain column can be useful in predicting the output but maybe it really isn’t. Example in Titanic dataset, number of children can actually affect the chances of that person’s survival but maybe it doesn’t really matter. Idk.
Can anyone suggest me any tips on how to further improve my analysis?