🌍 Live Open Source Explorer

Explore live open-source projects and AI models.

Search public open-source repositories from GitHub and AI models from Hugging Face. Every page shows 10 results with clean pagination.

Live Results

GitHub Open Source Repositories

Search: Policy-Gradient-Methods

Page 1

Showing 10 of 24 results

Khrylx/PyTorch-RL

GitHub Python MIT License

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

★ 1,285 Forks 192 Khrylx Updated 05 Jun 2026

MrSyee/pg-is-all-you-need

GitHub Jupyter Notebook MIT License

Policy Gradient is all you need! A step-by-step tutorial for well-known PG methods.

★ 1,032 Forks 127 MrSyee Updated 08 Jun 2026

cyoon1729/Policy-Gradient-Methods

GitHub Jupyter Notebook

Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC

★ 100 Forks 30 cyoon1729 Updated 04 Aug 2025

ollebompa/PGA-MAP-Elites

GitHub Python MIT License

Repository for the PGA-MAP-Elites algorithm. PGA-MAP-Elites was developed to efficiently scale MAP-Elites to large genotypes and noisy domains. It uses Neuroevolution driven by a Genetic Algorithm (GA) coupled with Policy Gradients (PG) derived from an off-policy Deep Reinforcement Learning met...

★ 60 Forks 3 ollebompa Updated 28 Apr 2026

chingyaoc/photo-editing-tensorflow

GitHub Python MIT License

Photo Optimizing Adversarial Net with Policy Gradient Method

★ 54 Forks 12 chingyaoc Updated 14 Mar 2023

EnnaSachdeva/Recurrent-Multiagent-Deep-Deterministic-Policy-Gradient-with-Difference-Rewards

GitHub Python

Deep Reinforcement Learning (DRL) algorithms have been successfully applied to a range of challenging simulated continuous control single agent tasks. These methods have further been extended to multiagent domains in cooperative, competitive or mixed environments. This paper primarily focuses on ...

★ 53 Forks 10 EnnaSachdeva Updated 08 Dec 2025

traderben00/RL-Trader

GitHub Python Creative Commons Zero v1.0 Universal

This reinforcement learning agent uses a policy-gradient method to trade the market.

★ 49 Forks 3 traderben00 Updated 07 Feb 2025

zwhong714/PSFT

GitHub Python

[ICLR 2026] PSFT is a trust-region–inspired fine-tuning objective that views SFT as a policy gradient method with constant advantages, constraining policy drift to stabilize training and improve generalization.

★ 38 Forks 1 zwhong714 Updated 17 Apr 2026

ozekri/SEPO

GitHub Jupyter Notebook MIT License

Code for the paper: "Fine-Tuning Discrete Diffusion Models with Policy Gradient Methods"

★ 31 Forks 2 ozekri Updated 05 Feb 2026

chenxinpeng/Optimization_of_image_description_metrics_using_policy_gradient_methods

GitHub Python

TensorFlow implementation of the paper: Optimization of image description metrics using policy gradient methods

★ 29 Forks 9 chenxinpeng Updated 02 Aug 2023

Page 1 of 3

10 results on this page · 24 total found

Showing the first 24 accessible GitHub results.
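
The page counts reported here (10 results per page, 24 found, 3 pages) are consistent with plain ceiling division. A minimal sketch in Python follows; `page_count` and `page_bounds` are hypothetical helper names used only for illustration and are not taken from the site's actual code.

```python
import math


def page_count(total_results: int, per_page: int = 10) -> int:
    """Number of pages needed to show total_results at per_page each."""
    if total_results <= 0:
        return 0
    return math.ceil(total_results / per_page)


def page_bounds(total_results: int, page: int, per_page: int = 10) -> tuple[int, int]:
    """Return (start, end) zero-based indices for a 1-based page; end is exclusive."""
    pages = page_count(total_results, per_page)
    if page < 1 or page > pages:
        raise ValueError(f"page must be between 1 and {pages}")
    start = (page - 1) * per_page
    end = min(start + per_page, total_results)
    return start, end


# 24 results at 10 per page: three pages, the last one holding 4 results.
print(page_count(24))
print(page_bounds(24, 1))
print(page_bounds(24, 3))
```

Under these assumptions the last page is shorter than the others, so a listing like this one would show only 4 results on page 3.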