Tuan Doan is a Staff Data Scientist and team lead at Quora with nine years of experience applying statistical and machine learning techniques to product growth and user-facing systems. A Yale-trained statistician, he has progressed from hands-on data scientist to leading the Poe Growth team and serving as Poe Data Science Tech Lead, blending technical ownership with cross-functional leadership. His work spans healthcare, customer behavior, sports analytics, and financial modeling, and his public repos demonstrate practical expertise in R and Python for Bayesian analysis, clustering, and sports analytics. Early career research in deep learning and applied internships at Deloitte and Spring Health show a mix of production-focused innovation and research grounding. Based in Seattle, he brings an interest in leveraging modern computational tools to convert complex data into actionable product improvements and reproducible analysis pipelines.
9 years of coding experience
4 years of employment as a software developer
Light Fellows Korean Language Summer Program, Light Fellows Korean Language Summer Program at Seoul National University
International Baccalaureate Diploma , International Baccalaureate Diploma at St. Joseph’s Institution International
Bachelor of Science Statistics and Data Science, Bachelor of Science Statistics and Data Science at Yale University
The code repository for projects and tutorials in R and Python that covers a variety of topics in data visualization, statistics sports analytics and general application of probability theory.
Role in this project:
Data Scientist
Contributions:91 commits, 4 PRs, 151 pushes in 3 years 3 months
Contributions summary:Tuan primarily contributed to data science projects involving statistical analysis, data visualization, and predictive modeling. Their work included creating and updating R scripts for Bayesian analysis, exploring penalty kick conversion rates using beta distributions, and implementing clustering and factor analysis techniques for team evaluation. They also contributed to data cleaning and prediction tasks within a sports analytics context, and added code for applying the "rule of three" concept. The user demonstrated proficiency in R and Python, working with libraries like `tidyverse`, `MASS`, `betareg`, `fpc`, and `scikit-surprise` in their analyses.
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.