Summary
Thomas Thomas is a Senior Data and ML Platform Engineer based in Boston with nine years of experience building scalable data architectures and production ML pipelines. He has designed and operated large AWS-based frameworks processing hundreds of billions of records, migrated 112 TB from on-prem Hadoop to the cloud, and led dbt-driven analytics and CI/CD implementations using GitHub Actions. His background spans healthcare and life sciences, where he modeled chemoinformatic datasets and assembled a 100 TB data lake, and his research work delivered real-time smart-home analytics with 98% uptime. Known for cost- and performance-driven engineering, he has cut cloud costs by 30%, halved SQL response times, and automated testing to improve code quality. He combines hands-on Python, Spark, EMR, Step Functions, and serverless event-driven design with practical DataOps and mentoring experience. Academic training in data analytics from Northeastern complements a pragmatic focus on turning complex scientific and operational requirements into robust, auditable pipelines.
8 years of coding experience
6 years of employment as a software developer
Bachelor of Technology (B.Tech.), Computer Science and Engineering, Manipal Institute of Technology
Master's degree, Data Analytics Engineering, Northeastern University
Indian School Muscat
English, Hindi, Malayalam