Ge Zhang

Data Scientist II

San Jose, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

👤
Senior
🎓
Top School
Ge Zhang is a Data Scientist II based in San Jose with 12 years of experience building production-grade AI and data systems, currently engineering agentic AI at TikTok. He has deep hands-on experience in privacy-focused monitoring and LLM-driven review systems, having built TikTok’s privacy detection pipeline, RAG-enabled models for Data Transfer Review, and metrics-driven dashboards that operationalize privacy controls. Prior roles span data platform engineering and applied ML—designing ETL pipelines on AWS, launching data models for analytics, and deploying recommender systems using transformer models and computer vision. An active polyglot programmer and Lisp enthusiast, he has contributed to the well-known swagger-codegen project by improving Objective-C client generation for OpenAPI, showing a penchant for improving developer tooling. Academically strong with an Operations Research master’s from Columbia and top-tier undergraduate performance, he combines rigorous quantitative skills with product-minded engineering. He’s notable for bridging privacy, production ML, and data infrastructure to turn regulatory needs into measurable, automated systems.
code12 years of coding experience
job3 years of employment as a software developer
booksemester exchange, Economics, A, semester exchange, Economics, A at University of California, Berkeley
bookBachelor’s Degree, Economics, 3.94 / 4.0, Bachelor’s Degree, Economics, 3.94 / 4.0 at Nankai University
bookMaster’s Degree, Operations Research, 4.04 /4.0, Master’s Degree, Operations Research, 4.04 /4.0 at Columbia University in the City of New York
github-logo-circle

Github Skills (8)

objective-c10
swagger10
code-generation10
openapi10
apidoc9
api8
apim8
api-design8

Programming languages (9)

JavaMustacheObjective-CSwiftRubyElixirEmacs LispPython

Github contributions (5)

github-logo-circle
swagger-api/swagger-codegen

Mar 2015 - Sep 2015

swagger-codegen contains a template-driven engine to generate documentation, API clients and server stubs in different languages by parsing your OpenAPI / Swagger definition.
Role in this project:
userBack-end Developer
Contributions:185 commits, 69 PRs, 47 comments in 6 months
Contributions summary:Ge's contributions focused on enhancing the Objective-C client generator for the Swagger/OpenAPI specification tool. They added support for JSONModel, incorporating it into client code generation to handle model deserialization and serialization. Furthermore, they updated the API client's generated code to use the toDictionary method and the initWithDictionary method, improving data handling and object initialization within the client. These changes aimed at improving the Objective-C client code generation for the Swagger/OpenAPI specification tool.
openapi-codegenredocopenapi-specificationswagger-openapiopenapi
geekerzp/ActiveService

Jul 2013 - Nov 2013

Contributions:21 commits in 4 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Ge Zhang - Data Scientist II