Rhys Williams is a Lead AI Engineer with nine years of cross-disciplinary experience specialising in large-scale computation, model quantization, and edge deployment. He has led production-grade RAG systems, multi-agent toolsets and distributed training pipelines using DDP, ZeRO, Ray/Kuberay and Triton optimisations. His background spans embedded NVIDIA Jetson/Orin work, microcontroller-driven creative tech for live events, and practical model compression for CPU and edge-first products. Rhys combines hands-on kernel and inference tuning (flash/sparse attention, fused softmax, Triton kernels) with team leadership, curriculum development and advisory roles. Based in Old Toronto, he’s equally at home reconstructing foundation model architectures as he is orchestrating hyperparameter and reinforcement learning at scale, often bridging research ideas into deployable, resource-efficient systems.
9 years of coding experience
9 years of employment as a software developer
Ba (Hons) Political Sociology, Political Science and Government, 2:1, Ba (Hons) Political Sociology, Political Science and Government, 2:1 at Lancaster University
Contributions:4 PRs, 31 pushes, 3 branches in 2 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.