-
Notifications
You must be signed in to change notification settings - Fork 0
Open
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers
Description
To enhance the capabilities and benchmarking potential of this repository, it would be beneficial to implement additional policy gradient algorithms. Suggested additions include:
- PPO (Proximal Policy Optimization)
- RPO (Robust Policy Optimization)
- DDPG (Deep Deterministic Policy Gradient)
- TD3 (Twin Delayed Deep Deterministic Policy Gradient)
These algorithms are widely used in reinforcement learning research and would broaden the range of experiments possible with this codebase. Contributions should ensure modularity and maintainability, ideally following the design patterns already present in the repo.
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or requestgood first issueGood for newcomersGood for newcomers