Xuan-son Nguyen is a software engineer based in the Greater Paris area with 12 years of experience building performant, low-level systems and on-device ML runtimes. Currently at Hugging Face, he maintains GGUF and integrations for llama.cpp and contributes optimizations that make large-language-model inference practical in constrained environments. His open-source work includes performance and portability improvements to whisper.cpp and the ggml tensor library: optimizing SIMD for WASM, adding LoRA adapter support, and fixing platform-specific issues. Comfortable across C++, Node.js, and systems-level networking, he has migrated codebases from Python to C++ and shipped cross-platform SDKs and apps. A security-trained engineer (INSA) with hands-on cybersecurity and backend experience, he pairs a pragmatic systems mindset with a flair for low-level optimization.
11 years of coding experience
3 years of employment as a software developer
Diplôme d'ingénieur (engineering degree), Computer Security and Technologies, INSA Centre Val de Loire
Computer Science, Aix-Marseille University
Computer Science, Vietnam National University, Hanoi
Contributions: 1 PR, 5 comments, 3 issues in 8 months
Contributions summary: Xuan-son primarily contributed to the development and improvement of the `ggml` tensor library for machine learning. Their work included implementing new features for LoRA adapter support, optimizing SIMD for WASM, and fixing issues related to unique tensor names and platform compatibility. Key contributions involved refactoring LoRA adapter functionalities, enhancing conversion scripts, and optimizing the library for improved performance in various environments.
Contributions summary: Xuan-son primarily focused on low-level optimizations and refactoring within the `ggml` library of the whisper.cpp project. Their contributions include enforcing unique tensor names, adding features for handling LoRA adapters, and significantly improving performance for WASM builds by optimizing SIMD operations. The commits also involved fixing platform-specific issues, such as those related to `cpu_set_t` on Emscripten, and adding functionality to the `gguf-split` utility.
Xuan-son Nguyen - Software Engineer at Hugging Face