We are the Language and Multimodal Processing Group at the Department of Computer Science at the University of Copenhagen. Our recent work includes representation learning for multilingual and multimodal data, resource creation, and language modelling.
Design, understand, and evaluate vision-language models.
Retrieval-augmented generation in multimodal and text-only settings.
We are developing language models that can process any written language.
|
2026 |
ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models.
Proceedings of WACV. |
|
2025 |
Multilingual Pretraining for Pixel Language Models.
Proceedings of EMNLP. |
|
2025 |
CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation.
Transactions of the ACL. |
|
2025 |
LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks.
Proceedings of ACL. |
|
2025 |
Can Community Notes Replace Professional Fact-Checkers?
Proceedings of ACL. |
|
2025 |
Seeing What Tastes Good: Revisiting Multimodal Distributional Semantics in the Billion Parameter Era.
Findings of ACL. |
|
2025 |
How Do Multilingual Language Models Remember Facts?
Findings of ACL. |
|
2025 |
Effective Machine Learning Techniques for Non-English Radiology Report Classification: A Danish Case Study.
AI 6(2). |