Henry Ehrenberg

Founder

San Francisco, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
Henry Ehrenberg is a co-founder of Snorkel AI and a San Francisco–based software engineer with about 10 years of experience building data-first ML tooling that bridges research and production. He has been a hands-on backend and database engineer on the widely used open-source Snorkel project, adding document/HTML parsers, sentence extraction, and holdout capabilities that support robust model evaluation in weak supervision pipelines. Prior to Snorkel he was a senior applied research scientist at Facebook and a research assistant in Stanford’s Hazy Research group with Chris Ré, helping translate cutting-edge ML research into practical systems. He holds an MS in Computational and Mathematical Engineering from Stanford and a BS in Applied Mathematics from Yale, and contributes across the stack from build tooling and tutorial infrastructure to core ML platform features.
code11 years of coding experience
github-logo-circle

Github Skills (19)

python10
scikit10
numpy10
parsing10
html-parsing10
parse10
build-automation10
document-processing10
scikit-learn10
jupyter-notebook10
database-design9
markdown-it8
markdown8
tensorflow7
tensorflow27

Programming languages (4)

ShellHTMLJupyter NotebookPython

Github contributions (5)

github-logo-circle
snorkel-team/snorkel

Mar 2016 - Dec 2021

A system for quickly generating training data with weak supervision
Role in this project:
userBack-end Developer & Database Engineer
Contributions:3 releases, 17 reviews, 678 commits in 5 years 10 months
Contributions summary:Henry added generic code parsers and HTML parsers, fixing parser installation scripts and HTML parser issues. They also implemented features for document parsing and sentence processing, including methods for extracting sentences and text from parsed documents. Furthermore, the user added support for holdout capabilities, which are crucial for model evaluation and feature analysis.
weak-supervisionpythondata-sciencemachine-learninglabeling
A collection of tutorials for Snorkel
Role in this project:
userFull-stack Developer
Contributions:1 review, 52 commits, 96 PRs in 6 months
Contributions summary:Henry contributed to the initial setup and configuration of the tutorial repository, including adding instructions and addressing code formatting. Their work involved setting up essential components of the project. In addition, the user implemented functionalities for checking and syncing notebooks within the repository, including developing scripts for verifying notebook consistency. This indicates a focus on build processes and ensuring the integrity of the tutorial content.
snorkel
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial