reinforcement learning - Why do value iteration and policy iteration obtain similar policies even though they have different value functions? - Artificial Intelligence Stack Exchange
What are the advantages of using Q-value iteration versus value iteration in reinforcement learning? - Quora