A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
Role in this project: ML Engineer
Contributions: 24 commits, 17 PRs, 1 branch in 11 months
Contributions summary: Ian's primary contribution was refactoring the existing TD3 and DQN trainers in the ReAgent framework to use PyTorch Lightning. They introduced new reporter classes and modified existing code to integrate with the new Lightning modules. Further contributions include adding a CRR trainer, modifying the model registration process, fixing bugs related to the Evaluator and reward boosts in the DQN and QR-DQN trainers, and adding unit tests.
reinforcement-learning, contextualbandits, contextual-bandits, reinforcement
Contributions: 9 commits, 5 pushes, 1 branch in 3 years 6 months