Report copyright - An Alternative Softmax Operator for Reinforcement Learning · An Alternative Softmax Operator for Reinforcement Learning!!!max a Q 1 (s,a) a 2!!! "a!!! 1 2(s,a) Figure 3. max is a
Please pass captcha verification before submit form