Design, understand, and evaluate vision-language models.
Retrieval-augmented generation in multimodal and text-only settings.
We are developing language models that can process any written language.