HF Reinforcement Learning - Page 2 - 微草录 AIGC Navigation

araffin/dqn-LunarLander-v2

DQN Agent playing LunarLander-v2. This is a trained model of a DQN agent playing LunarLander-v2 using the stable-baselines3 library. Usage (wi...

1 year ago (2024)

PPO Agent playing BreakoutNoFrameskip-v4. This is a trained model of a PPO agent playing BreakoutNoFrameskip-v4 using the stable-baselines3 librar...

1 year ago (2024)

poca Agent playing SoccerTwos. This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...

1 year ago (2024)

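RL Zoo is a training framework for Stable Baselines3 reinforcement learning agents, including hyperparameter optimization and pre-trained agents. Listing notes: 1. This page is not HumanCompatibleAI/ppo-seals-CartPole-v0...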

1 year ago (2024)

poca Agent playing SoccerTwos. This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...

1 year ago (2024)

Decision Transformer model trained on expert trajectories sampled from the Gym Hopper environment. This is a trained Decision Transformer model tra...

1 year ago (2024)

PPO Agent playing CartPole-v1. This is a trained model of a PPO agent playing CartPole-v1 using the stable-baselines3 library and the RL Zoo. The ...

1 year ago (2024)

PPO Agent playing LunarLander-v2. This is a trained model of a PPO agent playing LunarLander-v2 using the stable-baselines3 library. Usage (with ...

1 year ago (2024)

poca Agent playing SoccerTwos. This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...

1 year ago (2024)

Decision Transformer model trained on expert trajectories sampled from the Gym HalfCheetah environment. This is a trained Decision Transformer mode...

1 year ago (2024)