Artem Chumachenko

AI Engineer at Together AI

Amsterdam, North Holland, Netherlands

Summary

🤩 Rockstar
🎓 Top School
Artem Chumachenko is an AI Engineer based in Amsterdam with 9 years of experience building and deploying large language models and dialog systems. He drove model training and fine-tuning work at Yandex (YaLM, up to 14B) and improved chatbot personalization at Neiro.ai, and now applies that expertise at Together AI. His open-source contributions to the BigScience "petals" project added distributed generation, beam search, sampling, and adapter/prefix-tuning support, enabling BitTorrent-style LLM inference and faster fine-tuning. With a research background from MIPT in model compression for transformers, he combines efficiency-focused research with production-grade generative systems.
10 years of coding experience
4 years of employment as a software developer
Bachelor's degree, Applied Physics and Mathematics, Moscow Institute of Physics and Technology (State University) (MIPT)

Github Skills (15)

pytorch: 10
machine-learning: 10
transformer: 10
language-models: 10
nlp: 10
language-model: 10
python: 10
distributed-system: 9
large-language-model: 9
large-language-models: 9
distributed-systems: 9
gpt: 9
deep-learning: 9
mixture: 7
mixtures: 7

Programming languages (4)

TypeScript, Vue, Jupyter Notebook, Python

Github contributions (5)

bigscience-workshop/petals

Jun 2022 - Jan 2023

🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
Role in this project: ML Engineer
Contributions: 30 reviews, 97 commits, 40 PRs in 6 months
Contributions summary: Artem primarily contributed to the development of a distributed language model platform, focusing on integrating generation capabilities into the `petals` framework. His contributions include implementing a `RemoteGenerationMixin` class for auto-regressive text generation, adding support for decoding algorithms such as greedy search and sampling, and integrating prefix-tuned inference. He also introduced the `beam_search` algorithm and designed functionality for using adapters. This work directly extends the model's capabilities for generation tasks.
llama2, finetuning, bittorrent, llama, huggingface-transformers

catalyst-team/dl-course

Sep 2020 - Dec 2020

Deep Learning with Catalyst
Contributions: 68 commits, 47 PRs, 47 pushes in 3 months
deep-learning, catalyst, machine-learning