Pucheng Yang

Software Engineer at Pinterest

Seattle, Washington, United States

Summary

🤩 Rockstar
🎓 Top School
Pucheng Yang is a software engineer with 11 years of experience in big data, back-end systems, machine learning, and blockchain, currently building Pinterest’s Big Data Query Platform in Seattle. He combines strong academic training from Sun Yat-sen University and Carnegie Mellon with hands-on engineering to design scalable batch processing and data warehouse solutions. An active open-source contributor, he has improved Apache Iceberg’s Python codepath and Hive Metastore interoperability, adding features and robustness used by the wider data ecosystem. Known for pragmatic problem solving, he often focuses on making core methods interoperable across engines and streamlining planning with process pools to improve performance. Passionate about using data technologies to make the world better, he blends deep systems work with a clear product-minded perspective.
10 years of coding experience
Bachelor's degree, Computer Software Engineering at Sun Yat-sen University
Master's degree, Electrical and Computer Engineering at Carnegie Mellon University

Github Skills (8)

ice: 10
apache: 10
python: 10
apache-iceberg: 10
hive: 9
spark: 8
java: 5
javas: 5

Programming languages (9)

TypeScript, Java, C++, CoffeeScript, Scala, JavaScript, HTML, Groovy

Github contributions (5)

apache/iceberg

Sep 2021 - Oct 2022

Apache Iceberg
Role in this project:
Back-end Developer
Contributions: 20 reviews, 7 commits, 28 PRs in 1 year 1 month
Contributions summary: Pucheng contributed primarily to the legacy Python codebase, fixing bugs and adding features related to Apache Iceberg's interaction with Hive Metastore (HMS). He implemented changes to handle quote parsing and protocol customization, added a process pool for planning, and made core methods public for interoperability with Trino. Several commits modified the testing infrastructure.
Tags: apache-iceberg, apache, big-data, datastream, java
puchengy/prestosql

Oct 2019 - Sep 2021

Official home of the community managed version of Presto, the distributed SQL query engine for big data, under the auspices of the Presto Software Foundation.
Contributions: 4 PRs, 53 pushes, 13 branches in 2 years
Tags: query, query-engine, sql, big-data, spark