In previous videos, the dataset used had features which had 3 categories like outlook, temp, humidity. Here the decision tree is a binary tree, which works fine in this case since no feature here has more than 2 categories. So, how do we tackle situations with features having categories more than 2?
Also are the decision trees using sklearn also binary? Or do they have branches at each node according to the number of categories of the feature used in that node?