🌱 Open Source β–Ύ

🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

πŸ”Ž Live Search

Search live open-source data

Search GitHub repositories and Hugging Face models directly, then explore stars, downloads, source links and project details.

Reset Search ↻
πŸ”Ž
🌐

Try keywords like automation, CRM, analytics, chatbot, llama or workflow.

Choose where to search live data.

Live Results

GitHub Open Source Repositories

Search: Policy-Gradient-Methods

Page 1

Showing 10 results from 24

P

Khrylx/PyTorch-RL

GitHub Python MIT License

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

β˜… 1,285 Forks 192 Khrylx Updated 05 Jun 2026
P

MrSyee/pg-is-all-you-need

GitHub Jupyter Notebook MIT License

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

β˜… 1,032 Forks 127 MrSyee Updated 08 Jun 2026
P

cyoon1729/Policy-Gradient-Methods

GitHub Jupyter Notebook

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

β˜… 100 Forks 30 cyoon1729 Updated 04 Aug 2025
P

ollebompa/PGA-MAP-Elites

GitHub Python MIT License

Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy domains. It uses Neuroevolution driven by a Genetic Algorithm (GA) coupled with Policy Gradients (PG) derived from an off-policy Deep Reinforcement Learning met... Read more

β˜… 60 Forks 3 ollebompa Updated 28 Apr 2026
P

chingyaoc/photo-editing-tensorflow

GitHub Python MIT License

Photo Optimizing Adversarial Net with Policy Gradient Method

β˜… 54 Forks 12 chingyaoc Updated 14 Mar 2023
R

EnnaSachdeva/Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards

GitHub Python

Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single agent tasks. These methods have further been extended to multiagent domains in cooperative, competitive or mixed environments. This paper primarily focuses on ... Read more

β˜… 53 Forks 10 EnnaSachdeva Updated 08 Dec 2025
R

traderben00/RL-Trader

GitHub Python Creative Commons Zero v1.0 Universal

This Reinforcement learning agent uses Policy-Gradient method to trade the market

β˜… 49 Forks 3 traderben00 Updated 07 Feb 2025
P

zwhong714/PSFT

GitHub Python

[ICLR 2026] PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, constraining policy drift to stabilize training and improve generalization.

β˜… 38 Forks 1 zwhong714 Updated 17 Apr 2026
S

ozekri/SEPO

GitHub Jupyter Notebook MIT License

Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"

β˜… 31 Forks 2 ozekri Updated 05 Feb 2026
O

chenxinpeng/Optimization_of_image_description_metrics_using_policy_gradient_methods

GitHub Python

Tensorflow implement of paper: Optimization of image description metrics using policy gradient methods

β˜… 29 Forks 9 chenxinpeng Updated 02 Aug 2023
Pagination Page 1 of 3

10 results on this page Β· 24 total found

Showing first 24 accessible GitHub results.