Ted Dunning

Fellow And CTO For Data Fabric

Mountain View, California, United States
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts
email-iconphone-icongithub-logolinkedin-logotwitter-logostackoverflow-logofacebook-logo
Join Prog.AI to see contacts

Summary

🤩
Rockstar
🎓
Top School
Ted Dunning is Fellow and CTO for Data Fabric at HPE in Mountain View, bringing 15 years of experience building scalable data platforms and production analytics. He is an active Apache PMC member and ex-board member who bridges open-source stewarding with hands-on engine work. His contributions include implementing core pieces of Apache Drill’s planner and interpreter and authoring the widely used t-digest data structure for accurate streaming quantiles. He also serves on PMC projects such as Zookeeper and Mahout, underscoring deep expertise in distributed systems and machine-learning infrastructure. With a PhD from Sheffield and an MSc from New Mexico State, he specializes in turning research-grade algorithms into robust, tested production code.
code15 years of coding experience
bookPhD, PhD at Sheffield University
bookMSc, MSc at New Mexico State University
languagesGerman
stackoverflow-logo

Stackoverflow

Stats
1,897reputation
162kreached
67answers
0questions
Badges
parallel-processing
top-5%
hadoop
top-5%
github-logo-circle

Github Skills (25)

algorithm10
algorithms10
apache-drill10
data-structure10
java10
javas10
query-parser10
antlr10
drilldown10
query-engine10
data-structures10
querying10
hadoop9
big-data9
sql9

Programming languages (18)

JavaC++CRustScalaGoHTMLJupyter Notebook

Github contributions (5)

github-logo-circle
tdunning/t-digest

Nov 2013 - Feb 2022

A new data structure for accurate on-line accumulation of rank-based statistics such as quantiles and trimmed means
Role in this project:
userBack-end Developer
Contributions:7 reviews, 364 commits, 54 PRs in 8 years 4 months
Contributions summary:Ted primarily focused on the development of the `t-digest` data structure, contributing to the core source code. They added source code files and made changes to existing ones, which involved the implementation of the core functionality of this data structure, which accumulates rank-based statistics like quantiles. They also refactored and added tests for verifying the accuracy and functionality of the `t-digest` data structure.
statisticsquantileson-linequantileaccumulation
apache/drill

Sep 2012 - Jan 2018

Apache Drill is a distributed MPP query layer for self describing data
Role in this project:
userBack-end Developer
Contributions:34 commits, 4 PRs, 26 comments in 5 years 5 months
Contributions summary:Ted contributed to the Apache Drill project by implementing and improving the plan parser, which is essential for query execution. Their work involved developing a plan parser using ANTLR, and then improving error messages, debugging, and fixing tests. They also began working on the physical plan interpreter, which indicates a focus on the core query processing engine of Drill.
jdbcdatastreambig-datadatabasehadoop
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.
Request Free Trial