CS294-112 Deep Reinforcement Learning, Fall Semester (Berkeley), No. 7: Optimal control and planning

 

The transition probabilities are unknown, and we do not even need to estimate them.

Reposted from www.cnblogs.com/ecoflex/p/9094689.html