Pablo Hoffman is a seasoned founder and technology leader with 20+ years building products and teams at the intersection of open source and web data extraction. He co-created Scrapy in 2008 and grew Scrapinghub (now Zyte) from a three-person startup into a scalable platform and 70-person company, combining hands-on engineering, DevOps, and product leadership. Currently CPO/CTO and co-founder of News To You, he focuses on turning crawling and scraping expertise into commercial media-monitoring solutions. A prolific contributor to core projects like Scrapy, Scrapyd, Splash and Parsel, he pairs low-level protocol and scheduler improvements with pragmatic developer tooling and documentation. Based in Madrid and an early Python evangelist in Uruguay, he brings rare institutional knowledge of the web-scraping stack plus a founder’s knack for shipping resilient, production-grade systems. He’s also an active mentor and lifelong learner who balances technical depth with people leadership.
20 years of coding experience
11 years of employment as a software developer
Bachelor's degree, Electrical and Electronics Engineering, Bachelor's degree, Electrical and Electronics Engineering at Universidad ORT Uruguay
Bachelor's degree, Computer Science, Bachelor's degree, Computer Science at Universidad de la República
Scrapy, a fast high-level web crawling & scraping framework for Python.
Role in this project:
Back-end Developer
Contributions:2205 commits, 26 PRs, 39 pushes in 14 years 7 months
Contributions summary:Pablo primarily worked on implementing and improving the Scrapy framework, specifically the IBL (Instance Based Learning) extraction library. Their commits involved adding and documenting new features, improving existing functionality, and fixing bugs. The user demonstrated involvement with the core engine, downloader, scheduler, and web service components of the Scrapy framework.
Contributions:155 commits, 6 PRs, 6 pushes in 8 years 8 months
Contributions summary:Pablo made significant contributions to the Scrapyd service, including enhancing its configuration capabilities to support multiple configuration files and documenting various options. They also added tests for core components like the scheduler, poller, and egg storage, improving the system's reliability. Furthermore, the user refactored the code to address bugs related to running projects without eggs and made enhancements to the web interface. They also implemented a new API for canceling pending/running jobs and added support for deferred spider queues, demonstrating a focus on improving the Scrapyd infrastructure and features.
pythonscrapydaemonscrapy-spidersspiders
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Pablo Hoffman - CPO CTO Co-Founder at Zyte (formerly Scrapinghub)