Tianhe Yu

Palo Alto, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
Tianhe Yu is a Research Scientist at Google DeepMind in Palo Alto with a decade of experience building and scaling reinforcement learning systems, currently driving Gemini RL, Thinking, post‑training work and leading efforts on Gemini 2.5. He combines academic rigor as a PhD candidate in Computer Science at Stanford with practical robotics and multi‑task RL experience from collaborations at Google Brain and the Robot Learning Lab. His NeurIPS 2021 work on multi‑task offline RL reflects a strong track record in publishing, while his open‑source contributions to Metaworld show hands‑on implementation of 6DOF robotics environments, reward functions, and observation spaces. Comfortable bridging simulation and real‑world tasks, he brings both research depth and engineering execution to large RL systems.
code10 years of coding experience
github-logo-circle

Github Skills (9)

gymnasium10
openai-gym10
open-ai-gym10
environmental10
robotics10
dev-environment10
environ10
python10
enviroment10

Programming languages (4)

PHPHTMLCythonPython

Github contributions (5)

github-logo-circle
Farama-Foundation/Metaworld

Feb 2019 - Jan 2020

Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Role in this project:
userBack-end Developer
Contributions:60 commits, 5 PRs, 3 pushes in 10 months
Contributions summary:Tianhe primarily contributed by adding new environments within the repository, specifically focusing on 6DOF environments like SawyerPickAndPlace6DOFEnv, SawyerStickPull6DOFEnv, SawyerReachPushPickPlace6DOFEnv, SawyerButtonPressTopdownWall6DOFEnv. The user made changes to existing code for the new environments, including implementation of reward functions and observation spaces. The work involved modifying code related to environment setup and interaction, suggesting a focus on the functionality and behavior of the robotics environments.
manipulationmulti-taskroboticsbenchmarkingmultiagent-reinforcement-learning
tianheyu927/mil

Nov 2017 - Oct 2018

Code for "One-Shot Visual Imitation Learning via Meta-Learning"
Contributions:24 commits, 3 PRs, 18 pushes in 11 months
pytorchmeta-learningone-shotrobustnessgeneralization
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Tianhe Yu