--- env_name: Walker2d-v5 tags: - Walker2d-v5 - td3 - reinforcement-learning - custom-implementation - policy-gradient - pytorch - ddpg model-index: - name: TD3-Walker2dV5 results: - task: type: reinforcement-learning name: reinforcement-learning dataset: name: Walker2d-v5 type: Walker2d-v5 metrics: - type: mean_reward value: 4348.91 +/- 73.32 name: mean_reward verified: false --- # **TD3** Agent playing **Walker2d-v5** This is a trained model of a **TD3** agent playing **Walker2d-v5**. ## Usage ### create the conda env in https://github.com/GeneHit/drl_practice ```bash conda create -n drl python=3.10 conda activate drl python -m pip install -r requirements.txt ``` ### play with full model ```python # load the full model model = load_from_hub(repo_id="winkin119/TD3-Walker2dV5", filename="full_model.pt") # Create the environment. env = gym.make("Walker2d-v5") state, _ = env.reset() action = model.action(state) ... ``` There is also a state dict version of the model, you can check the corresponding definition in the repo.