This repository contains code to quickly train visual-language models from pre-trained models (e.g., ~300 lines of code for model, dataset, and training).
openai/clip-vit-base-patch32google/siglip-base-patch16-224
allenai/OLMo-1B-hfgoogle/gemma-2b-itmistralai/Mistral-7B-Instruct-v0.3