Ofir Zafrir is an applied deep learning researcher at Intel Labs with a decade of engineering experience and a BS in Computer Engineering from Technion. He specializes in NLP model optimization and quantization, having contributed quantization-aware training, quantized embeddings and linear layers, and test coverage for quantized BERTs in the widely used IntelLabs/nlp-architect library. At Intel he progressed from intern to research engineer, bridging state-of-the-art models with hardware-aware algorithms to make production-ready NLP more efficient. Earlier embedded-systems work at Rafael gave him practical real-time C and MATLAB skills that inform his system-minded approach to model deployment.
10 years of coding experience
2 years of employment as a software developer
Bachelor of Science - BS, Computer Engineering at Technion - Israel Institute of Technology
A model library for exploring state-of-the-art deep learning topologies and techniques for optimizing Natural Language Processing neural networks
Role in this project:
ML Engineer
Contributions: 25 commits, 10 PRs, 4 pushes in 1 year 2 months
Contributions summary: Ofir primarily focused on integrating and developing quantized BERT models within the repository, specifically for tasks such as sequence classification and token classification. His contributions include implementing quantization-aware training, building modules for quantized embeddings and linear layers, and creating task-specific quantized BERT models. These changes involved modifying the BERT model architecture and adding test cases to validate the quantization process. He also worked on documentation for the quantized BERT implementation.