Summary
Benjamin Thérien is a PhD student at Université de Montréal and Mila, currently interning as a Research Scientist at Meta where he focuses on distributed optimization for cross-datacenter training of large language models. With seven years of industry and research experience spanning Meta, Capital One, Morgan Stanley, and multiple Mila internships, he brings a rare blend of systems engineering and scalable ML research—especially in LLM conversion, MoE pre-training, and high‑performance parallelism. His prior work includes converting transformers to GPT‑NeoX formats, optimizing 3D parallelism on supercomputers for 9B-parameter models, and creating a paired LiDAR-image dataset that underpinned a WACV publication. Comfortable shipping production tooling and large-scale experiments, he pairs strong software design instincts (API/automation and data pipelines) with rigorous academic output. A US and Canadian citizen based in Montreal, he’s as likely to optimize GPU TFLOPs on Summit as to lead end-to-end research projects from idea to publication.
7 years of coding experience
3 years of employment as a software developer
MMath, Computer Science, MMath, Computer Science at University of Waterloo
College Degree, Business/Commerce, General, College Degree, Business/Commerce, General at Marianopolis College
Bachelor of Science - BS, Computer Science, Bachelor of Science - BS, Computer Science at Concordia University
Doctor of Philosophy - PhD, Computer Science, Doctor of Philosophy - PhD, Computer Science at Universite de Montreal
English, French