标签:Reinforcement Learning
araffin/dqn-LunarLander-v2
DQN Agent playing LunarLander-v2 This is a trained model of a DQN agent playing LunarLander-v2 using the stable-baselines3 library. Usage (wi...
sb3/ppo-BreakoutNoFrameskip-v4
PPO Agent playing BreakoutNoFrameskip-v4 This is a trained model of a PPO agent playing BreakoutNoFrameskip-v4 using the stable-baselines3 librar...
QYHcrossover/poca-test2
poca Agent playing SoccerTwos This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...
HumanCompatibleAI/ppo-seals-CartPole-v0
RL Zoo 是 Stable Baselines3 强化学习代理的训练框架,包括超参数优化和预训练代理。 收录说明: 1、本网页并非 HumanCompatibleAI/ppo-seals-CartPole-v0...
ahmad-alismail/poca-SoccerTwos
poca Agent playing SoccerTwos This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...
edbeeching/decision-transformer-gym-hopper-expert
Decision Transformer model trained on expert trajectories sampled from the Gym Hopper environment This is a trained Decision Transformer model tra...
sb3/ppo-CartPole-v1
PPO Agent playing CartPole-v1 This is a trained model of a PPO agent playing CartPole-v1 using the stable-baselines3 library and the RL Zoo. The ...
Classroom-workshop/assignment2-omar
PPO Agent playing LunarLander-v2 This is a trained model of a PPO agent playing LunarLander-v2 using the stable-baselines3 library. Usage (with ...
Raiden-1001/poca-Soccerv8
poca Agent playing SoccerTwos This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...
edbeeching/decision-transformer-gym-halfcheetah-expert
Decision Transformer model trained on expert trajectories sampled from the Gym HalfCheetah environment This is a trained Decision Transformer mode...