We are the Language and Multimodal Processing Group at the Department of Computer Science at the University of Copenhagen. Our recent work includes representation learning for multilingual and multimodal data, resource creation, and language modelling.
Design, understand, and evaluate vision-language models.
Retrieval-augmented generation in multimodal and text-only settings.
We are developing language models that can process any written language.
2026. Efficient Test-Time Scaling for Small Vision-Language Models. Proceedings of ICLR.
2026. Token Distillation: Attention-Aware Input Embeddings for New Tokens. Proceedings of ICLR.
2026. Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation. Proceedings of ICLR.
2026. ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models. Proceedings of WACV.