Dmitry Petrov

Co-Founder, CEO at DataChain, Inc

San Francisco, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Dmitry Petrov is a Machine Learning Engineer in San Francisco with nine years of experience, a PhD in computer science, and prior work as a Data Scientist at Microsoft. He is the creator of dvc.org (Git for machine learning) and a core contributor to open-source tooling that powers data versioning and ML experiment management. Dmitry’s hands-on strengths are in Python back-end and CLI development—building robust command-line tools, configuration and environment handling—and in algorithmic work like perceptual image hashing where he improved wavelet-based hashing and added unit tests to harden edge cases. Equally comfortable in research and production, he pairs meticulous engineering with a dry sense of humor, often making jokes with a serious face.
code9 years of coding experience
job15 years of employment as a software developer
bookSaint Petersburg State Electrotechnical University "LETI"
languagesEnglish, Russian, komi
stackoverflow-logo

Stackoverflow

Stats
1,519reputation
292kreached
15answers
36questions
Badges
hadoop
top-5%
hiveql
top-5%
github-logo-circle

Github Skills (24)

unit-testing10
python10
configuration-management10
command-line-interface10
pi10
numpy10
command-line10
hadoop9
image-processing9
data-versioning9
hiveql9
test-automation9
versioning9
version-control9
assert8

Programming languages (8)

TypeScriptC++ShellJavaScriptGoHTMLJupyter NotebookPython

Github contributions (5)

github-logo-circle
iterative/dvc

Mar 2017 - Jun 2021

🦉 Data Versioning and ML Experiments
Role in this project:
userBack-end Developer
Contributions:4 releases, 3 reviews, 436 commits in 4 years 3 months
Contributions summary:Dmitry's commits primarily involve the creation of a base command class and its implementation in the init command for a data versioning tool. Their work focuses on setting up the configuration and structure for command-line tools, including parsing configuration files, managing environment variables, and defining command-line arguments. The user demonstrates proficiency in Python, utilizing the argparse, configparser, and subprocess libraries, indicating a strong understanding of back-end development principles for command-line applications.
data-version-controlml-modelspythonversion-controldata-science
JohannesBuchner/imagehash

Jun 2016 - Feb 2017

A Python Perceptual Image Hashing Module
Role in this project:
userBack-end Developer & QA Engineer
Contributions:5 commits, 5 PRs, 16 comments in 7 months
Contributions summary:Dmitry primarily focused on implementing and refining the `whash` function, a core component for wavelet-based image hashing. Their contributions include the initial implementation, iterative bug fixes, and the addition of unit tests to ensure the reliability and functionality of the image hashing algorithm. Furthermore, they made adjustments to scale handling for smaller images, making the algorithm more robust. They also addressed documentation inconsistencies.
hashingpythonperceptualimage-processingimage-hashing
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Dmitry Petrov - Co-Founder, CEO at DataChain, Inc