tiny vision language model
Role in this project:
ML Engineer Contributions:23 reviews, 79 PRs, 274 pushes in 1 year 3 months
Contributions summary:Vik implemented core functionalities for a vision language model. They added a vision encoder module with preprocessing steps and integrated it within the project's architecture, demonstrating a focus on computer vision tasks. Subsequent commits focused on building a simple text generation interface, showcasing the user's capability to integrate vision models with text-based functionalities. Further commits show the incorporation of features like loading and using pre-trained models, and streaming outputs to stdout, suggesting an active role in model development and usability enhancement.
Contributions:55 commits in 7 months
deployingansibleansible-scriptshummingbird