Patrick Xia is a researcher and PhD candidate at Johns Hopkins CLSP, now at Microsoft, specializing in entity-level information extraction, document representations, and efficient NLP systems working with noisy web data. He combines deep academic training (PhD and MS from JHU; dual BS in CS and Math from CMU) with practical engineering experience from internships at Google, Meta, Semantic Machines, and Microsoft. His contributions to open-source NLP toolkits—such as integrating FastText, masked transformers, and ELMo support into the widely used jiant toolkit—reflect a focus on improving representation and encoder tooling for real-world tasks. With over a decade of research and teaching experience, he bridges rigorous modeling and production-minded system design, often optimizing for noisy inputs and scalability.
13 years of coding experience
Johns Hopkins University
Livingston High School
Bachelor of Science - BS, Mathematics, Bachelor of Science - BS, Mathematics at Carnegie Mellon University
Contributions:16 commits, 2 PRs, 10 pushes in 1 year 4 months
Contributions summary:Patrick primarily contributed to the implementation and integration of FastText embeddings for word representation within the NLP toolkit. They developed functions to load and utilize pre-trained embeddings, including handling model loading and path configurations. The user also introduced masked transformer components, and made changes to Elmo integration. These changes suggest a focus on enhancing the toolkit's capabilities for various NLP tasks, including encoding and model performance.
Find and Hire Top DevelopersWe’ve analyzed the programming source code of over 60 million software developers on GitHub and scored them by 50,000 skills. Sign-up on Prog,AI to search for software developers.