Actor-Critic
Actor-Critic Model
Actor-Critic Model
State Value
Policy
| 方法 | 策略评估 | 策略改进 | 特点 |
Paradigms
智能体 (Agent) 与环境 (Environment) 的交互模型:
Estimation
Gradient Metrics
Foundations
Mean Estimation
Regression
State Value
Clustering
Objective Function