Ge Zhang is a Data Scientist II based in San Jose with 12 years of experience building production-grade AI and data systems, currently engineering agentic AI at TikTok. He has deep hands-on experience in privacy-focused monitoring and LLM-driven review systems, having built TikTok’s privacy detection pipeline, RAG-enabled models for Data Transfer Review, and metrics-driven dashboards that operationalize privacy controls. Prior roles span data platform engineering and applied ML—designing ETL pipelines on AWS, launching data models for analytics, and deploying recommender systems using transformer models and computer vision. An active polyglot programmer and Lisp enthusiast, he has contributed to the well-known swagger-codegen project by improving Objective-C client generation for OpenAPI, showing a penchant for improving developer tooling. Academically strong with an Operations Research master’s from Columbia and top-tier undergraduate performance, he combines rigorous quantitative skills with product-minded engineering. He’s notable for bridging privacy, production ML, and data infrastructure to turn regulatory needs into measurable, automated systems.
12 years of coding experience
3 years of employment as a software developer
semester exchange, Economics, A, semester exchange, Economics, A at University of California, Berkeley
Bachelor’s Degree, Economics, 3.94 / 4.0, Bachelor’s Degree, Economics, 3.94 / 4.0 at Nankai University
Master’s Degree, Operations Research, 4.04 /4.0, Master’s Degree, Operations Research, 4.04 /4.0 at Columbia University in the City of New York
swagger-codegen contains a template-driven engine to generate documentation, API clients and server stubs in different languages by parsing your OpenAPI / Swagger definition.
Role in this project:
Back-end Developer
Contributions:185 commits, 69 PRs, 47 comments in 6 months
Contributions summary:Ge's contributions focused on enhancing the Objective-C client generator for the Swagger/OpenAPI specification tool. They added support for JSONModel, incorporating it into client code generation to handle model deserialization and serialization. Furthermore, they updated the API client's generated code to use the toDictionary method and the initWithDictionary method, improving data handling and object initialization within the client. These changes aimed at improving the Objective-C client code generation for the Swagger/OpenAPI specification tool.
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.