Henry Ehrenberg is a co-founder of Snorkel AI and a San Francisco–based software engineer with about 10 years of experience building data-first ML tooling that bridges research and production. He has been a hands-on backend and database engineer on the widely used open-source Snorkel project, adding document/HTML parsers, sentence extraction, and holdout capabilities that support robust model evaluation in weak supervision pipelines. Prior to Snorkel he was a senior applied research scientist at Facebook and a research assistant in Stanford’s Hazy Research group with Chris Ré, helping translate cutting-edge ML research into practical systems. He holds an MS in Computational and Mathematical Engineering from Stanford and a BS in Applied Mathematics from Yale, and contributes across the stack from build tooling and tutorial infrastructure to core ML platform features.
A system for quickly generating training data with weak supervision
Role in this project:
Back-end Developer & Database Engineer
Contributions:3 releases, 17 reviews, 678 commits in 5 years 10 months
Contributions summary:Henry added generic code parsers and HTML parsers, fixing parser installation scripts and HTML parser issues. They also implemented features for document parsing and sentence processing, including methods for extracting sentences and text from parsed documents. Furthermore, the user added support for holdout capabilities, which are crucial for model evaluation and feature analysis.
Contributions:1 review, 52 commits, 96 PRs in 6 months
Contributions summary:Henry contributed to the initial setup and configuration of the tutorial repository, including adding instructions and addressing code formatting. Their work involved setting up essential components of the project. In addition, the user implemented functionalities for checking and syncing notebooks within the repository, including developing scripts for verifying notebook consistency. This indicates a focus on build processes and ensuring the integrity of the tutorial content.
snorkel
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.