Summary
Hoang Pham is a data engineer and aspiring machine learning/AI engineer with a decade of experience building end-to-end analytics and ML pipelines across startups, research labs, and national archives. He combines industrial expertise in data integration, ETL, and analytics with recent advanced training in Language Technology and hands-on research in OCR, NLP and visual-language models. At Holistics he led the internal analytics stack and mentored a small data team, while academic projects at Uppsala and the Swedish National Archives focused on multimodal pipelines for transcribing and evaluating historical manuscripts. He is experienced with production tools from BigQuery and Airflow to Whisper, pyannote and TrOCR, and has translated that stack into tangible corpora and reproducible pipelines for policy and archival research. Comfortable switching between product-facing analytics and research-grade ML, he brings both operational rigor and a curiosity for improving language and vision systems. Based in Ho Chi Minh City, he blends practical data engineering leadership with a specialty in document and speech processing that few data engineers maintain.
10 years of coding experience
8 years of employment as a software developer
Bachelor of Arts (B.A.), Business English, Bachelor of Arts (B.A.), Business English at Vietnam National University, Hanoi
Master's degree, Language Technology, Master's degree, Language Technology at Uppsala University
Japanese, English, Vietnamese