Derek Ma

Data Engineer

San Francisco Bay Area United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

👤
Senior
🎓
Top School
Derek Ma is a data engineer with a decade of experience who blends rigorous academic training (M.S. in Computer Science from UC San Diego, 4.0 GPA) with hands-on production work at Meta and internships across Qualcomm, Kneron, and other engineering teams. He builds and scales data pipelines and warehouses—having implemented Snowflake and Informatica IICS solutions that processed hundreds of millions of records and automated high-throughput ETL—while also contributing to data quality as a QA/test automation engineer on the widely used pandas library. Derek pairs applied machine learning and computer vision experience with practical ops skills (Delta tables, PySpark, DVC, GitLab CI) and entrepreneurial drive demonstrated by founding a scheduling startup integrating Stripe and AWS. He’s currently focused on computational methods for imperfect-information games using heuristics and reinforcement learning, bringing both research curiosity and production-grade engineering to data-driven problems.
code10 years of coding experience
job2 years of employment as a software developer
bookMaster's degree, Computer Science, 4.00/4.00, Master's degree, Computer Science, 4.00/4.00 at University of California, San Diego - Jacobs School of Engineering
bookHigh School Diploma, 4.5/4.0 (Weighted), High School Diploma, 4.5/4.0 (Weighted) at Canyon Crest Academy
stackoverflow-logo

Stackoverflow

Stats
1reputation
0reached
0answers
0questions
github-logo-circle

Github Skills (7)

pandas10
pytest10
python10
testing10
data-analysis9
text-align7
text-alignment7

Programming languages (16)

C#JavaC++RustCMakefileHTMLProtocol Buffer

Github contributions (5)

github-logo-circle
pandas-dev/pandas

Mar 2020 - Mar 2020

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Role in this project:
userQA Engineer / Test Automation Engineer
Contributions:7 commits, 9 PRs, 12 comments in 14 days
Contributions summary:Derek's contributions primarily involve modifying existing tests within the pandas library. Their work focused on improving test coverage and addressing potential errors in various modules. The commits demonstrate a focus on correcting test cases and updating test setup to avoid bare `pytest.raises` usage. The changes span a range of areas, including dtypes, frame, and indexing tests, indicating a broad understanding of the library's functionality.
pythondatalabeled-datamanipulationdataframes
Vlek/vlek.github.io

May 2022 - Dec 2022

Personal blog
Contributions:7 commits, 2 PRs, 15 pushes in 7 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Derek Ma - Data Engineer