Szymon Ożóg is a software engineer with seven years of experience bridging game development and machine learning, currently serving as Member of Engineering Staff at Poolside after a stint as Senior AI Inference Engineer at Aleph Alpha. He combines production game programming expertise from Carbon Studio with hands-on ML systems work—contributing to high-throughput LLM tooling such as vllm (GGUF support and quantized model optimizations) and low-level accelerators in tinygrad (CUDA/PTX and Triton support). Comfortable across full-stack and low-level performance engineering, he focuses on model loading, memory efficiency, and inference robustness. Based in Silesia, Poland, he holds a Master’s in Computer Science and brings a practical, systems-minded approach that surfaces in both game engines and ML inference pipelines.
7 years of coding experience
Master's degree, Computer Science, Master's degree, Computer Science at The Silesian University of Technology
You like pytorch? You like micrograd? You love tinygrad! ❤️
Role in this project:
Full-stack Developer
Contributions:40 reviews, 44 PRs, 87 comments in 1 year
Contributions summary:Szymon contributed significantly to the `tinygrad/tinygrad` repository, focusing on enhancing and maintaining the project's functionality. Their work involved debugging and improving the HIP error messages, making substantial updates to the codebase, and implementing features to support Triton compilation. They were also responsible for enabling, testing, and cleaning up the Triton project. The user demonstrated strong skills in the CUDA and PTX backends.
A high-throughput and memory-efficient inference and serving engine for LLMs
Role in this project:
ML Engineer
Contributions:14 reviews, 10 PRs, 40 comments in 1 month
Contributions summary:Szymon primarily contributes to the integration and improvement of GGUF (GGML Unified Format) support within the VLLM project. Their commits involve bug fixes related to GGUF weight initialization, the addition of new GGUF kernels, and implementation of GGUF support for Deepseek models. These changes demonstrate a focus on expanding the project's capabilities for quantized models and optimizing model loading processes. The user has also fixed issues related to chunked prefill operations.
amdcudadeepseekgpthpu
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Szymon Ożóg - Member Of Engineering Staff at Poolside