What does P(A|S) depict?

Does it give the probability of taking a particular action a ∈ A in a given state s ∈ S that will result in the maximum reward at that particular time step or state s? And do we find that ‘a’ using the deep Q-learning algorithm by checking for which ‘a’ P(A|S) is maximum?
Correct me if I am wrong.

Hey @bihan,
I don’t actually know how this is used in reinforcement learning,
but as I understand it, P(A|S) is the probability that action a happens given that state s has occurred.

Based on that, we can decide which action is more likely to happen.
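
To make this a bit more concrete, here is a small Python sketch. It assumes P(A|S) is read as a policy, i.e. a probability distribution over actions for one fixed state. The Q-values, the action names, and the softmax used to turn values into probabilities are all made up for illustration and are not taken from the question. Keep in mind that Q-learning methods, deep Q-learning included, usually pick the action with the largest estimated Q-value rather than maximizing P(A|S) directly; in this toy setup the two choices happen to agree because softmax preserves the ordering of the values.

```python
import math

# Hypothetical Q-values for one state s, one entry per action (made-up numbers).
q_values = {"left": 1.0, "right": 2.0, "stay": 0.5}

def softmax_policy(q, temperature=1.0):
    """Turn action scores into a distribution P(a|s) that sums to 1."""
    m = max(q.values())  # subtract the max for numerical stability
    exps = {a: math.exp((v - m) / temperature) for a, v in q.items()}
    total = sum(exps.values())
    return {a: e / total for a, e in exps.items()}

def greedy_action(q):
    """Pick the action with the highest estimated value (greedy choice)."""
    return max(q, key=q.get)

# P(a|s) for every action a in this one state s.
p_a_given_s = softmax_policy(q_values)

# The most probable action under the policy, and the greedy action by value.
best_by_probability = max(p_a_given_s, key=p_a_given_s.get)
best_by_value = greedy_action(q_values)

print(p_a_given_s)
print(best_by_probability, best_by_value)
```

Note that P(a|s) only says how likely each action is to be chosen in state s. It does not by itself say which action gives the maximum reward; that depends on how the policy or the Q-values were learned.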

I hope I’ve cleared your doubt. Please rate your experience here.
Your feedback is very important. It helps us improve our platform and hence provide you
with the learning experience you deserve.

If you still have questions or do not find the answer satisfactory, you may reopen
the doubt.