Summary
Tracy Shen is a Staff ML Engineer with 11 years of broad data and ML experience and 7 years focused on applied ML/AI, specializing in LLM/VLM development, finetuning, deployment, and monitoring. She has led production projects that materially improved RAG systems and table extraction, including finetuning Qwen2VL to surpass Claude-3.5 and GPT-4o on table tasks, and she authored a public Model Context Protocol (MCP) server and numerous data connectors. Tracy combines deep learning expertise (PyTorch, TensorFlow, BERT-like models) with solid MLOps skills (Docker, Kubernetes, CI/CD, SageMaker) to take models from research to reliable services. Her AI engineering work spans agentic workflows (document analysis, SQL↔NLP, Q&A) and model evaluation tooling, with a track record of reducing retrieval failures and boosting recall in production. Multilingual and analytically versatile, she also brings SQL/warehouse experience (Snowflake, BigQuery, Teradata), BI visualization skills, and hands-on API and platform development. Outside work, she is a triathlete and rock climber, which mirrors her endurance for long, complex model tuning and system-level problem solving.
11 years of coding experience
11 years of employment as a software developer
Doctor of Philosophy (PhD), Machine Learning, Penn State University
The University of Hong Kong (HKU)
Study abroad, Law and Finance, Université Jean Moulin Lyon 3
Data Science Specialization in R, Data Science, Johns Hopkins Bloomberg School of Public Health
Applied Data Science in Python, Data Science, University of Michigan
B.A., English Literature, Zhejiang University
French, English, Chinese