The best decision sequence of Markov decision process is solved by Bellman equation, and the value of each state is determined not only by the current state but also by the later state.( )
选项:
A:对
B:错
发布时间:2024-05-14 11:47:55