Live Open Source Explorer
Explore live open-source projects and AI models.
Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.
Live Search
Search live open-source data
Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.
Live Results
GitHub Open Source Repositories
Search: Policy-Gradient-Methods
Page 1
Showing 10 results from 24
Khrylx/PyTorch-RL
GitHub Python MIT License
PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
External source
GitHub
MrSyee/pg-is-all-you-need
GitHub Jupyter Notebook MIT License
Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.
External source
GitHub
cyoon1729/Policy-Gradient-Methods
GitHub Jupyter Notebook
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
External source
GitHub
ollebompa/PGA-MAP-Elites
GitHub Python MIT License
Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy domains. It uses Neuroevolution driven by a Genetic Algorithm (GA) coupled with Policy Gradients (PG) derived from an off-policy Deep Reinforcement Learning met...
External source
GitHub
chingyaoc/photo-editing-tensorflow
GitHub Python MIT License
Photo Optimizing Adversarial Net with Policy Gradient Method
External source
GitHub
EnnaSachdeva/Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards
GitHub Python
Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single agent tasks. These methods have further been extended to multiagent domains in cooperative, competitive or mixed environments. This paper primarily focuses on ...
External source
GitHub
traderben00/RL-Trader
GitHub Python Creative Commons Zero v1.0 Universal
This reinforcement learning agent uses a policy-gradient method to trade the market
External source
GitHub
zwhong714/PSFT
GitHub Python
[ICLR 2026] PSFT is a trust-region-inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, constraining policy drift to stabilize training and improve generalization.
External source
GitHub
ozekri/SEPO
GitHub Jupyter Notebook MIT License
Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"
External source
GitHub
chenxinpeng/Optimization_of_image_description_metrics_using_policy_gradient_methods
GitHub Python
TensorFlow implementation of the paper: Optimization of image description metrics using policy gradient methods
External source
GitHub
10 results on this page · 24 total found
Showing first 24 accessible GitHub results.