Summary
Gyuhang Shim is a Senior Data Engineer with 13 years of experience designing and operating large-scale, real-time and batch data platforms across major Korean tech companies including Kakao and NAVER. He architects distributed pipelines and event-sourced logistics systems using Hadoop, Trino, Kafka, Airflow, Spark, and Iceberg, and pairs that with backend engineering in Kotlin/Java to deliver low-latency indexing and reporting features. His work emphasizes rigorous temporal validation, deduplication, partition-aware SQL/ETL optimization, and JVM and cluster tuning to meet strict SLAs at billion-record scale. Notably, he built automated cluster management and installer tooling (Barista) and introduced Akka-driven parallel processing patterns that cut multi-day migrations to hours. He blends platform-level systems thinking with hands-on performance engineering, enabling teams to ship resilient, high-throughput data services.
13 years of coding experience
1 year of employment as a software developer
Bachelor of Science (BS) Computer Science Electronic Engineering, Bachelor of Science (BS) Computer Science Electronic Engineering at Handong Global University
English, Korean