A Z is a Senior Staff Software Engineer based in Haidian District, Beijing, with 13 years of experience focused on large-scale ML systems, recommendation engines, and distributed training infrastructure. He has led end-to-end GPU training projects (including 128 A100 deployments) and architected PyTorch-based training/serving frameworks for deep learning recommendation models and foundation-scale MoE models at ByteDance, Meituan, Baidu, and Huawei. His work spans productionizing LLM and multi-modal training/serving infra for recommender systems and building NVMe-optimized TB-scale training nodes that cut hardware costs substantially. Known for delivering near-linear scaling across large GPU clusters, he combines hands-on systems engineering with algorithmic expertise in CTR and recommendation models. A Peking University master's graduate, he uniquely pairs large-scale systems optimization with direct impact on revenue-driving ML products.
13 years of coding experience
4 years of employment as a software developer
Master's Degree, Computer Software Engineering, Master's Degree, Computer Software Engineering at Peking University
Software Engineering, Software Engineering at Northeastern University (CN)
Contributions:8 releases, 82 commits, 20 PRs in 3 years 3 months
bigpipewebglpageletrenderingperformance
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.