Ke Wang is a Senior Software Engineer in the San Francisco Bay Area with 10 years of experience building large-scale ML and cloud systems, currently focused on generative AI serving infrastructure at Google. He blends deep research instincts—from publications and prior work in nonlinear optics and bioinformatics—with practical engineering, having led delivery of enterprise CRM and order-management platforms early in his career. Ke contributes to open-source reinforcement learning (notably improvements to TensorFlow Agents’ PPO implementation and TD-lambda returns), reflecting a hands-on interest in scalable RL libraries. Equally comfortable in research and production, he pairs strong academic credentials across Europe and the U.S. with proven ability to ship robust, well-tested ML infrastructure at scale.
10 years of coding experience
4 years of employment as a software developer
Foreign Semester, Electrical and Electronics Engineering, 92/100, Foreign Semester, Electrical and Electronics Engineering, 92/100 at City University London
Master's degree, Microelectronics and Microsystems, 1.1/1.0 (A+), Master's degree, Microelectronics and Microsystems, 1.1/1.0 (A+) at Technische Universität Hamburg-Harburg
Master's degree, Business Administration and Management, General, 1.7/1.0 (A-), Master's degree, Business Administration and Management, General, 1.7/1.0 (A-) at Northern Institute of Technology Management
Bachelor's degree, Electrical Engineering and Automation, 92/100, Bachelor's degree, Electrical Engineering and Automation, 92/100 at Nanjing University of Aeronautics and Astronautics
Master of Engineering (MEng), Biomedical/Medical Engineering, 4.00/4.00 (A), Master of Engineering (MEng), Biomedical/Medical Engineering, 4.00/4.00 (A) at Cornell University
TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.
Role in this project:
ML Engineer
Contributions:5 commits in 8 months
Contributions summary:Ke contributed to the TensorFlow Agents library, specifically focusing on the Proximal Policy Optimization (PPO) agent. Their work included refactoring advantage and return calculations and fixing shape mismatches in TD lambda return calculations. Furthermore, they improved documentation and added an integration test using a driver. These changes directly impact the functionality and efficiency of reinforcement learning algorithms within the library.
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.