Huaijin Hao

Software Engineer at OpenObserve

Shenzhen, Guangdong Province, China
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Huaijin Hao is a software engineer based in Shenzhen with six years of experience specializing in observability, distributed systems, and database systems. Currently at OpenObserve, he works on backend and database engineering for a petabyte-scale observability platform that aims to be a high-performance, cost-effective alternative to Elasticsearch/Splunk/Datadog. His open-source contributions include performance optimizations, query cache FIFO, and Parquet merge improvements in OpenObserve, as well as core planner and feature work in the Apache DataFusion SQL engine. Known for pragmatic refactors and careful correctness fixes (e.g., count(distinct) and explain-plan improvements), he blends deep systems thinking with hands-on performance tuning. He holds education from Harbin Institute of Technology, Shenzhen, and quietly focuses on making large-scale data pipelines faster and more reliable.
code6 years of coding experience
book哈尔滨工业大学(深圳)
github-logo-circle

Github Skills (15)

fusion10
apache-arrow10
query-engine10
rust10
sql10
arrows10
arrow-js10
compress9
data-engineering9
performance-optimization9
database-design9
parquet9
compression9
big-data8
query-optimization8

Programming languages (3)

JavaRustPython

Github contributions (5)

github-logo-circle
openobserve/openobserve

Jan 2024 - Mar 2025

🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
Role in this project:
userBack-end Developer & Database Engineer
Contributions:156 reviews, 132 PRs, 485 pushes in 1 year 2 months
Contributions summary:Huaijin contributed to the backend functionality of the OpenObserve project. They implemented SQL mode support for alert URLs and refactored the code for count(distinct) to get the correct results. Their work also included optimizing ingester speed, enhancing field search by using hash maps, and improving the performance of merging Parquet files when dealing with a high number of fields. They also worked on the implementation of FIFO for query cache and resolving issues for the metric related features.
analyticsapmdatadogelasticsearchgrafana
apache/datafusion

May 2023 - Jan 2025

Apache DataFusion SQL Query Engine
Role in this project:
userBack-end Developer
Contributions:79 reviews, 30 PRs, 115 comments in 1 year 8 months
Contributions summary:Huaijin contributed to the Apache DataFusion SQL query engine by implementing and modifying core functionalities. Their work involved refactoring code, moving and updating file compression types, and adding new features like `join_on` to the logical plan builder. They also addressed bugs and implemented features to improve the explain plan and support for custom sources.
querypythonquery-enginedataframerust
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Huaijin Hao - Software Engineer at OpenObserve