Jonathan Vexler is a New York–based software engineer with eight years of experience, currently building product at Onehouse. He combines performance-focused backend and distributed-systems work—profiling and optimizing C++ at Rockset—with full‑stack and clinical software delivery at Epic. Jonathan has practical autonomous-systems experience from Uber and nuTonomy, where he built fault‑tolerant map-copying and coordinate-transform libraries that shipped into vehicle stacks. An active open-source contributor, he added PySpark examples to the widely used Apache Hudi project to make upserts, deletes and time-travel queries more accessible to data engineers. With BS and MS degrees from Brown, he’s fluent in Python, C++, cloud services and database internals, and excels at turning complex requirements into production-ready systems through cross-team collaboration.
Upserts, Deletes And Incremental Processing on Big Data.
Role in this project:
Data Engineer
Contributions:772 reviews, 52 commits, 455 PRs in 4 months
Contributions summary:Jonathan contributed to the development of PySpark examples, adding them to the repository to demonstrate and document the use of Apache Hudi with Apache Spark's Python API. The user's work involved creating and integrating example code for core Hudi functionalities like upserts, deletes, and time travel queries, which enhances the usability of Hudi for data ingestion. The user was responsible for showcasing how Hudi can be used with the PySpark, making it easier for developers to work with Hudi within a big data ecosystem.
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.