Thomas Collins is a Senior Data Analyst and Data Scientist with eight-plus years of experience building scalable PySpark workflows on Databricks and migrating legacy systems to cloud-native pipelines. He has driven NLP and AI-led solutions for scientific text classification and retrieval, contributing analyses cited by leadership and partners including Harvard and the NIH. At Elsevier he managed large, complex datasets (~100M rows, nested schemas), implemented secure AWS S3 processes compliant with GDPR, and translated research needs into production analytics. A former physics educator and PhD-trained physicist, he brings rigorous quantitative thinking and clear communication to cross-functional projects. Now at Soostone, he continues to blend production engineering, data mining, and prompt-based methods to turn messy data into actionable insight. Off-hours he describes himself as “plumber by day, programmer by night,” signaling a practical, hands-on approach to problem solving.
8 years of coding experience
11 years of employment as a software developer
Bachelor’s Degree Intensitve BA in Physics with a minor in Mathematics, Bachelor’s Degree Intensitve BA in Physics with a minor in Mathematics at New York University
PhD Physics, PhD Physics at Stevens Institute of Technology
High School Diploma High School/Secondary Diplomas and Certificates, High School Diploma High School/Secondary Diplomas and Certificates at Massapequa High School
Contributions:3 pushes, 1 branch, 1 tag in 1 year 7 months
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.