| Rank | Repo | Stars | Language | Developer |
| --- | --- | --- | --- | --- |
| 1 | ml-agents | 14,334 | C# | Unity-Technologies |
 | 2 | reinforcement-learning-an-introduction | 12,208 | Python | ShangtongZhang |
 | 3 | amazon-sagemaker-examples | 8,063 | Jupyter Notebook | aws |
 | 4 | Reinforcement-learning-with-tensorflow | 7,884 | Python | MorvanZhou |
 | 5 | pysc2 | 7,708 | Python | deepmind |
 | 6 | machine_learning_examples | 7,340 | Python | lazyprogrammer |
 | 7 | tensorpack | 6,257 | Python | tensorpack |
 | 8 | easy-rl | 6,108 | Jupyter Notebook | datawhalechina |
 | 9 | Practical_RL | 5,279 | Jupyter Notebook | yandexdataschool |
 | 10 | stable-baselines3 | 5,249 | Python | DLR-RM |
 | 11 | deep-reinforcement-learning | 4,435 | Jupyter Notebook | udacity |
 | 12 | open_spiel | 3,591 | C++ | deepmind |
 | 13 | Reinforcement-Learning | 3,503 | Jupyter Notebook | andri27-ts |
 | 14 | ELF | 3,300 | C++ | pytorch |
 | 15 | tensorforce | 3,231 | Python | tensorforce |
 | 16 | Deep-Learning-Roadmap | 3,140 | Python | astorfi |
 | 17 | deep-rl-class | 2,768 | Jupyter Notebook | huggingface |
 | 18 | awesome-deeplearning-resources | 2,632 | unknown | endymecy |
 | 19 | DeepRL-Agents | 2,194 | Jupyter Notebook | awjuliani |
 | 20 | chess-alpha-zero | 2,008 | Jupyter Notebook | Zeta36 |
 | 21 | brax | 1,573 | Jupyter Notebook | google |
 | 22 | Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions | 1,517 | Jupyter Notebook | LyWangPX |
 | 23 | rainbow-is-all-you-need | 1,481 | Jupyter Notebook | Curt-Park |
 | 24 | | | | |
 | 25 | awesome-deep-rl | 1,242 | HTML | tigerneil |
 | 26 | TextWorld | 1,008 | Jupyter Notebook | microsoft |
 | 27 | DeepRL-Tutorials | 968 | Jupyter Notebook | qfettes |
 | 28 | basic_reinforcement_learning | 948 | Jupyter Notebook | vmayoral |
 | 29 | reinforcement_learning_course_materials | 811 | Jupyter Notebook | upb-lea |
 | 30 | Hands-On-Reinforcement-Learning-With-Python | 759 | Jupyter Notebook | sudharsan13296 |
 | 31 | rl-book | 718 | HTML | ZhiqingXiao |
 | 32 | David-Silver-Reinforcement-learning | 711 | Jupyter Notebook | dalmia |
 | 33 | Popular-RL-Algorithms | 708 | Jupyter Notebook | quantumiracle |
 | 34 | Reinforcement_learning_tutorial_with_demo | 615 | Jupyter Notebook | omerbsezer |
 | 35 | ReinforcementLearning.jl | 485 | Julia | JuliaReinforcementLearning |
 | 36 | QLearning_Trading | 475 | Jupyter Notebook | ucaiado |
 | 37 | godot_rl_agents | 400 | Python | edbeeching |
 | 38 | rl-cheatsheet | 269 | TeX | udacity |
 | 39 | ReinforcementLearningAnIntroduction.jl | 261 | Julia | JuliaReinforcementLearning |
 | 40 | RL-Theory-book | 217 | TeX | FortsAndMills |
 | 41 | Reinforce.jl | 202 | Julia | JuliaML |
 | 42 | arena | 200 | Python | diambra |
 | 43 | gb | 156 | Makefile | krocki |
 | 44 | automata | 142 | Elixir | upstarter |
 | 45 | openmodelica-microgrid-gym | 135 | Modelica | upb-lea |
 | 46 | Reinforcement-Learning-Cheat-Sheet | 131 | TeX | FrancescoSaverioZuppichini |
 | 47 | SeaPearl.jl | 130 | Julia | corail-research |
 | 48 | reinforcement_learning_financial_trading | 116 | MATLAB | matlab-deep-learning |
 | 49 | OpenAIGym.jl | 103 | Julia | JuliaML |
 | 50 | ultimate-volleyball | 64 | C# | CoderOneHQ |
 | 51 | ml-in-action | 64 | MATLAB | huiwenzhang |
 | 52 | commonsense-rl | 63 | Inform 7 | IBM |
 | 53 | DeepQLearning.jl | 63 | Julia | JuliaPOMDP |
 | 54 | ReinforcementLearningEnvironments.jl | 56 | Julia | JuliaReinforcementLearning |
 | 55 | Reinforment-Implementation-on-a-Quadruped | 56 | OpenEdge ABL | YunjaeChoi |
 | 56 | symbolic-rl | 54 | Lasso | 921kiyo |
 | 57 | JuliaRL | 45 | Julia | fabio-4 |
 | 58 | POMDPGallery.jl | 44 | Julia | JuliaPOMDP |
 | 59 | Pensieve-PPO | 40 | DIGITAL Command Language | godka |
 | 60 | practical-rl | 40 | Julia | dmitrijsc |
 | 61 | AtariAlgos.jl | 40 | Julia | JuliaML |
 | 62 | the_mayan_adventure | 33 | ShaderLab | simoninithomas |
 | 63 | cytonRL | 30 | Cuda | arthurxlw |
 | 64 | walk_the_blocks | 27 | ASP | xwhan |
 | 65 | MultiAgent-PPO | 23 | ASP | jsztompka |
 | 66 | DeepLaetitia | 15 | Mathematica | adendek |
 | 67 | Soccer-PPO | 13 | ASP | marcelloaborges |
 | 68 | Zombie-Shooter-Neural-Network | 13 | Processing | Daporan |
 | 69 | gace | 7 | Hy | AugustUnderground |
 | 70 | NS2-Reinforced-Distance-Vector-Routing-Protocol | 6 | Tcl | StanyMwamba |
 | 71 | P2_Continuous_Control | 5 | ASP | dalmia |
 | 72 | meetup_pythonbq_deeplearning | 4 | OpenEdge ABL | waybarrios |
 | 73 | BoatAttack-with-ML-Agents-build-versions | 2 | ASP.NET | dhyeythumar |
 | 74 | Drifting-with-RL | 2 | ASP.NET | defrag-bambino |
 | 75 | DialogPolicy | 2 | OpenEdge ABL | iTharindu |
 | 76 | multi-armed-bandit | 2 | NetLogo | ygrayson |
 | 77 | AI-AutoPark | 1 | ASP.NET | LDH0094 |
 | 78 | MultiAgent-SAC-Tennis | 1 | ASP.NET | Oreoluwa-Se |
 | 79 | RSE-RL | 1 | Jupyter Notebook | yunhaoyang234 |
 | 80 | autonomous-systems | 1 | ASP.NET | charlola |
 | 81 | deepladder | 1 | DIGITAL Command Language | thu-media |
 | 82 | Retro-Hackathon | 1 | ASP.NET | valerija-h |
 | 83 | gym_corewar | 1 | Red | CreeperLin |
 | 84 | master-thesis | 1 | ASP.NET | rsaenzi |
 | 85 | producton_system | 1 | Logtalk | vsraptor |
 | 86 | wazuhl | 1 | LLVM | SavchenkoValeriy |
 | 87 | UoMThesis | 0 | Inform 7 | JohnnySun8 |