Tag: HF Reinforcement Learning

araffin/dqn-LunarLander-v2

DQN Agent playing LunarLander-v2 This is a trained model of a DQN agent playing LunarLander-v2 using the stable-baselines3 library. Usage (wi...

sb3/ppo-BreakoutNoFrameskip-v4

PPO Agent playing BreakoutNoFrameskip-v4 This is a trained model of a PPO agent playing BreakoutNoFrameskip-v4 using the stable-baselines3 librar...

QYHcrossover/poca-test2

poca Agent playing SoccerTwos This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...

HumanCompatibleAI/ppo-seals-CartPole-v0

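RL Zoo is a training framework for Stable Baselines3 reinforcement learning agents, including hyperparameter optimization and pre-trained agents. Listing notes: 1. This page is not HumanCompatibleAI/ppo-seals-CartPole-v0...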

ahmad-alismail/poca-SoccerTwos

poca Agent playing SoccerTwos This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...

edbeeching/decision-transformer-gym-hopper-expert

Decision Transformer model trained on expert trajectories sampled from the Gym Hopper environment This is a trained Decision Transformer model tra...

sb3/ppo-CartPole-v1

PPO Agent playing CartPole-v1 This is a trained model of a PPO agent playing CartPole-v1 using the stable-baselines3 library and the RL Zoo. The ...

Classroom-workshop/assignment2-omar

PPO Agent playing LunarLander-v2 This is a trained model of a PPO agent playing LunarLander-v2 using the stable-baselines3 library. Usage (with ...

Raiden-1001/poca-Soccerv8

poca Agent playing SoccerTwos This is a trained model of a poca agent playing SoccerTwos using the Unity ML-Agents Library. Usage (with ML-Ag...

edbeeching/decision-transformer-gym-halfcheetah-expert

Decision Transformer model trained on expert trajectories sampled from the Gym HalfCheetah environment This is a trained Decision Transformer mode...