Ed Seidl

Computer Scientist at LLNL

Livermore, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
Ed Seidl is a seasoned computer scientist with 32 years of experience who blends deep systems-level expertise with hands-on open-source contributions in high-performance data tooling. Based in Livermore, CA and formerly a quantum mechanic at LLNL, he focuses on back-end engineering for data formats like Parquet and Apache Arrow, improving memory efficiency, encoding support, and metadata robustness. His work on prominent projects such as NVIDIA’s cuDF and the Rust implementation of Apache Arrow demonstrates a knack for performant file I/O, byte-stream encodings, and practical refactors that prevent data corruption. Colleagues rely on him for pragmatic solutions to complex data-parsing problems and for squeezing performance out of both C++/GPU and Rust stacks.
code32 years of coding experience
github-logo-circle

Github Skills (21)

c-language10
cudf10
data-structure10
apache-arrow10
fileio10
file-processing10
parquet10
cuda10
data-structures10
file-handling10
rust10
file-access10
cprogramming-language10
data-engineering9
memory-management9

Programming languages (7)

JavaC++ShellRustScalaHTMLThrift

Github contributions (5)

github-logo-circle
apache/arrow-rs

Feb 2023 - Apr 2025

Official Rust implementation of Apache Arrow
Role in this project:
userBack-end Developer
Contributions:219 reviews, 47 PRs, 350 comments in 2 years 2 months
Contributions summary:Ed primarily contributed to the official Rust implementation of Apache Arrow, focusing on the Parquet file format. Their work involved implementing support for level histograms, and refactoring. They also worked on byte stream split encoding for various data types like INT32, INT64, and FIXED_LEN_BYTE_ARRAY, which improved performance. Additionally, they contributed to file metadata processing, improving efficiency and addressing potential issues related to incomplete metadata.
rustolapapachearrowapache-arrow
rapidsai/cudf

May 2022 - Jan 2023

cuDF - GPU DataFrame Library
Role in this project:
userBack-end Developer
Contributions:691 reviews, 19 commits, 90 PRs in 7 months
Contributions summary:Ed primarily focused on enhancing the Parquet writer functionality within the cuDF library. Their contributions include adding features like page size control parameters and handling of fixed-length byte array data, which improved memory efficiency and file compatibility. Furthermore, the user implemented features for supporting additional encoding formats, such as DELTA_BINARY_PACKED and DELTA_BYTE_ARRAY, improving the versatility of the Parquet writer. Their work also involved refactoring aspects of the reader and addressing issues related to incorrect data handling and parsing, ensuring data integrity.
cudadataframe-librarydata-analysiscppcudf
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial
Ed Seidl - Computer Scientist at LLNL