Abby Zhang is a Senior Lead Database Engineer in Cupertino with over 20 years of database programming experience and a two-decade track record designing high-throughput ETL and data warehousing systems. She architects and optimizes large-scale pipelines—most notably a multi-version Blackarrow data loader that processes 10M–18M TV viewing records per hour into Vertica—and has engineered Spark pipelines handling 100TB/month with sophisticated skew-mitigation and tuning strategies. Proficient across Vertica, Redshift, ClickHouse, Databricks, AWS S3, and PL/SQL, she combines deep SQL and Python expertise with OO Perl and shell scripting to stabilize production ETL, reporting, and forecasting platforms. Abby led migrations between cloud providers, validated parity across Redshift and BigQuery, and improved downstream analytics by standardizing complex healthcare and audience datasets. Beyond engineering, she blends quantitative modeling (Holt-Winters, NumPy/SciPy) with pragmatic performance engineering to deliver both predictive insights and reliably performant pipelines. Her background in computational science and advanced coursework in political science gives her a rare mix of technical depth and analytical perspective when solving messy, real-world data problems.
8 years of coding experience
8 years of employment as a software developer
University of Southern California
Computer Science, Computer Science at San José State University
Bachelor of Arts, American/U.S. Law/Legal Studies/Jurisprudence, Bachelor of Arts, American/U.S. Law/Legal Studies/Jurisprudence at East China Normal University
Contributions:51 pushes, 1 branch in 1 year 7 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.