Xiang Si - Software Engineer at Google

San Francisco Bay Area United States

Summary

🤩 Rockstar

🎓 Top School

Xiang Si is a software engineer based in the San Francisco Bay Area who specializes in high-performance ML inference and distributed systems. He currently works on Cloud TPU inference at Google. He holds an MS in Computer Science from Carnegie Mellon University, previously worked as an engineer at AWS on Neuron frameworks, and interned at Apple. His work focuses on optimizing LLM inference performance and enabling disaggregated, scalable model serving. His background combines research in statistical machine learning with production work on accelerating distributed training and inference across hardware and framework stacks. Having worked in both research settings and hyperscale cloud teams, he is practiced at turning algorithmic ideas into production-grade, hardware-aware systems.

1 year of coding experience

4 years of employment as a software developer

Master of Science (MS), Computer Science at Carnegie Mellon University

Chinese, English

Stack Overflow

Stats

26 reputation

92 reached

1 answer

0 questions

GitHub Skills (17)

large-language-models: 10

llm: 10

gpt: 10

gpu: 8

inference: 8

pytorch: 8

batching: 8

tpu: 7

llm-inference: 7

llama: 7

transformer: 7

gemma: 6

mlops: 6

android: 6

java: 6

Programming languages (2)

Shell, Python

GitHub contributions (5)

sixiang-google/ml-auto-solutions

Jun 2024 - Oct 2024

Contributions: 48 pushes in 4 months

sixiang-google/maxtext

Nov 2024 - Jan 2025

A simple, performant and scalable Jax LLM!

Contributions: 22 pushes, 1 branch in 2 months
