Senior Data Engineer - Behaviour Data Products - Global Data Products
Berlin, Germany
Join Prog.AI to see contacts
Join Prog.AI to see contacts
Summary
👤
Senior
🎓
Top School
Prabeesh Keezhathra is a Senior Data Engineer based in Berlin with 12 years of experience building and modernizing large-scale data platforms across e-commerce and mobility. At HelloFresh he has led migrations from on-prem Cloudera to AWS, modernized legacy Python/Spark pipelines with Airflow, and stabilized profitability reporting—raising pipeline success from 60% to 95%. His background spans Apache Spark, Beam, Airflow and cloud data stacks (AWS/GCP), and he actively mentors teams and runs knowledge-sharing forums to elevate engineering practices. An early contributor to streaming integrations in major OSS projects, he added MQTT support to Apache Spark and Bahir and updated Twitter integrations in Apache Storm, demonstrating a knack for bridging real-time ingestion with analytics. He blends hands-on backend development with product-oriented data governance, delivering reliable, scalable data products that accelerate business decision-making. Notably, his open-source work enabled Spark to consume MQTT streams—an uncommon but impactful capability for IoT and real-time use cases.
12 years of coding experience
7 years of employment as a software developer
Bachelor of Technology (B.Tech.) Electronics and Communications Engineering, Bachelor of Technology (B.Tech.) Electronics and Communications Engineering at Government Engineering College Palakkad, Kerala
Contributions:13 commits, 1 PR, 1 comment in 3 years 4 months
Contributions summary:Prabeesh primarily contributed to the development of an MQTT streaming adapter for the Apache Bahir project. Their work involved creating an MQTTInputDStream class and associated receiver, enabling the project to subscribe to messages from an MQTT broker. The user also provided an example MQTT word count application. Further contributions involved code refactoring and adhering to project coding standards.
Apache Spark - A unified analytics engine for large-scale data processing
Role in this project:
Back-end Developer
Contributions:9 PRs, 46 comments in 1 year 5 months
Contributions summary:Prabeesh primarily contributed to integrating the MQTT protocol into the Apache Spark streaming framework. This involved creating an `MQTTInputDStream` to receive messages from an MQTT broker and integrating it into the `StreamingContext`. They added necessary dependencies, implemented example code showcasing word count functionality with MQTT, and made subsequent code improvements. The user's work enabled Spark streaming to ingest data from MQTT sources.
analyticspythondata-processingsqlapache
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Prabeesh Keezhathra - Senior Data Engineer - Behaviour Data Products - Global Data Products