Why did Sir multiplied the terms while calculating conditional probabilities of features.
If that word/feature doesn’t belong to that class/label, then a very big factor is multiplied in the equation !!
That formula stands correct for a single feature probability calculation and then we decide and multiply with the suitable factor.
For instance :
Sir used that this word doesn’t belong to the class and assumed it’s prob. to be very less( =0.1)
Then, he multiplied with a even bigger factor ( =0.9) thus increasing the probability for the feature that doesn’t belong to the class.
I am not able to realise this formula !!