Iman Tabrizian is a Senior Deep Learning Software Engineer with 11 years of experience, currently advancing production inference at NVIDIA as a core contributor to the Triton Inference Server. With an MASc from the University of Toronto and a strong background in distributed ML and network automation, she blends research rigor from roles at Vector Institute and UofT with hands-on systems engineering in C++, Python, and PyTorch. Her open-source work on Triton’s Python backend, client libraries, and testing/CI improvements demonstrates a focus on robust, high-performance inference pipelines and GPU/shared-memory integrations. She’s as comfortable optimizing low-level C-API and CUDA paths as she is adding reliability to test automation—an uncommon mix that accelerates ML from prototype to scalable deployment.
11 years of coding experience
2 years of employment as a software developer
Amirkabir University of Technology
Master of Applied Science (MASc), 3.94/4.0, Master of Applied Science (MASc), 3.94/4.0 at University of Toronto
Triton backend that enables pre-process, post-processing and other logic to be implemented in Python.
Role in this project:
Back-end Developer
Contributions:568 reviews, 123 commits, 247 PRs in 2 years 4 months
Contributions summary:Iman's commits primarily involve the development and maintenance of the Python backend for Triton Inference Server. They are responsible for the initial setup and subsequent improvements to the Python backend, implementing core functionalities such as handling requests, managing input/output tensors, and integrating with the underlying Triton server. The user's work spans both the core backend logic and enhancements, including features for GPU tensor support and interaction with the shared memory. This user is focused on building the core inference pipeline.
Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.
Role in this project:
Back-end Developer & Performance Engineer
Contributions:303 reviews, 37 commits, 81 PRs in 2 years 5 months
Contributions summary:Iman primarily contributed to the client libraries, specifically the Python client, for the Triton Inference Server. Their work involved fixing bugs related to data serialization, particularly for the BYTES data type, and addressing issues with trailing zeros. Additionally, the user introduced and improved performance measurement features, including count-based stabilization, and made modifications to the C-API to support CUDA and system memory, which likely involved performance profiling and optimization.
golangpythongrpcscalaclient-libraries
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Iman Tabrizian - Senior Deep Learning Software Engineer at NVIDIA