Pablo Hoffman

CPO CTO Co-Founder at Zyte (formerly Scrapinghub)

Madrid, Community of Madrid, Spain
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
award
Top expert inPython Development Ecosystem
Pablo Hoffman is a seasoned founder and technology leader with 20+ years building products and teams at the intersection of open source and web data extraction. He co-created Scrapy in 2008 and grew Scrapinghub (now Zyte) from a three-person startup into a scalable platform and 70-person company, combining hands-on engineering, DevOps, and product leadership. Currently CPO/CTO and co-founder of News To You, he focuses on turning crawling and scraping expertise into commercial media-monitoring solutions. A prolific contributor to core projects like Scrapy, Scrapyd, Splash and Parsel, he pairs low-level protocol and scheduler improvements with pragmatic developer tooling and documentation. Based in Madrid and an early Python evangelist in Uruguay, he brings rare institutional knowledge of the web-scraping stack plus a founder’s knack for shipping resilient, production-grade systems. He’s also an active mentor and lifelong learner who balances technical depth with people leadership.
code20 years of coding experience
job11 years of employment as a software developer
bookBachelor's degree, Electrical and Electronics Engineering, Bachelor's degree, Electrical and Electronics Engineering at Universidad ORT Uruguay
bookBachelor's degree, Computer Science, Bachelor's degree, Computer Science at Universidad de la República
languagesEnglish, Spanish
stackoverflow-logo

Stackoverflow

Stats
1,540reputation
90kreached
25answers
0questions
Badges
scrapy
top-5%
python
top-5%
github-logo-circle

Github Skills (41)

json10
lib10
python10
apidoc10
testing10
command-line-interface10
webscraping10
headless-browser10
scrapy10
pyqt10
api10
twisted10
html-parsing10
read-me10
lxml10

Programming languages (7)

TypeScriptCSSMakefileJavaScriptGoHTMLPython

Github contributions (5)

github-logo-circle
scrapy/scrapy

Jun 2008 - Nov 2022

Scrapy, a fast high-level web crawling & scraping framework for Python.
Role in this project:
userBack-end Developer
Contributions:2205 commits, 26 PRs, 39 pushes in 14 years 7 months
Contributions summary:Pablo primarily worked on implementing and improving the Scrapy framework, specifically the IBL (Instance Based Learning) extraction library. Their commits involved adding and documenting new features, improving existing functionality, and fixing bugs. The user demonstrated involvement with the core engine, downloader, scheduler, and web service components of the Scrapy framework.
pythonscrapyscraping-frameworkhigh-levelweb-crawling
scrapy/scrapyd

Sep 2010 - Apr 2019

A service daemon to run Scrapy spiders
Role in this project:
userBack-end Developer & DevOps Engineer
Contributions:155 commits, 6 PRs, 6 pushes in 8 years 8 months
Contributions summary:Pablo made significant contributions to the Scrapyd service, including enhancing its configuration capabilities to support multiple configuration files and documenting various options. They also added tests for core components like the scheduler, poller, and egg storage, improving the system's reliability. Furthermore, the user refactored the code to address bugs related to running projects without eggs and made enhancements to the web interface. They also implemented a new API for canceling pending/running jobs and added support for deferred spider queues, demonstrating a focus on improving the Scrapyd infrastructure and features.
pythonscrapydaemonscrapy-spidersspiders
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Pablo Hoffman - CPO CTO Co-Founder at Zyte (formerly Scrapinghub)