Ita Zaporozhets

Machine Learning Engineer at Hugging Face

Paris, Ile-de-France
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Ita Zaporozhets is a Machine Learning Engineer based in Paris, combining seven years of industry experience with rigorous AI training from Université Paris-Saclay and a BASc in Industrial Engineering from the University of Toronto. She has shipped production-grade systems at HelloFresh and contributed to tokenization features in the high-profile Hugging Face Transformers library, improving support for Llama and TikToken workflows. Currently pursuing a Master of Artificial Intelligence, she blends applied ML, data science, and automation expertise to build pragmatic solutions with social impact. Notably, her open-source work involved nuanced tokenizer fixes and integration tests—skills that translate to robust, reliable model pipelines in production.
code7 years of coding experience
job6 years of employment as a software developer
bookMVA Artificial Intelligence, MVA Artificial Intelligence at ENS Paris-Saclay
bookMinor Artificial Intelligence, Minor Artificial Intelligence at University of Toronto
bookMaster's degree Artificial Intelligence, Master's degree Artificial Intelligence at Université Paris-Saclay
languagesRussian, French, English
stackoverflow-logo

Stackoverflow

Stats
1reputation
0reached
0answers
0questions
github-logo-circle

Github Skills (12)

transformers10
llama10
pytorch10
machine-learning10
tokenizer10
nlp10
language-model10
python10
pre-trained-model9
testing9
transformer8
hub7

Programming languages (4)

TypeScriptC++RustPython

Github contributions (5)

github-logo-circle
huggingface/transformers

May 2024 - Apr 2025

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Role in this project:
userML Engineer
Contributions:128 reviews, 55 PRs, 348 pushes in 10 months
Contributions summary:Ita primarily contributed to the tokenization aspects of the Hugging Face Transformers library, with a focus on supporting and improving tokenizers for various language models, particularly Llama and models leveraging TikToken. Their work involved adding features like split special tokens and user-defined symbols, fixing issues related to spaces, and optimizing tokenization processes. The contributions included updating existing tests and creating new integration tests to validate the correct functionality of different tokenizer implementations.
pythonbertspeech-recognitionstate-of-the-artflax
itazap/APS360Project

Mar 2019 - Apr 2019

APS360Project
Contributions:5 PRs, 100 pushes, 1 branch in 26 days
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Ita Zaporozhets - Machine Learning Engineer at Hugging Face