We are the Language and Multimodal Processing Group at the Department of Computer Science at the University of Copenhagen. Our recent work includes representation learning for multilingual and multimodal data, resource creation, and language modelling.
We are developing language models that can process any written language.
Retrieval augmentation of multimodal models for image captioning.
Design, understand, and evaluate multimodality in multilingual models
2023 |
LMCap: Few-shot Multilingual Image Captioning by Retrieval Augmented Language Model Prompting.
Findings of ACL. |
2023 |
SmallCap: Lightweight Image Captioning Prompted With Retrieval Augmentation.
Proceedings of CVPR. |
2023 |
Language Modelling with Pixels.
Proceedings of ICLR. Notable Top 5% Paper. |
2023 |
Cleaner Categories Improve Object Detection and Visual-Textual Grounding.
Image Analysis. |
2023 |
Retrieval-augmented Image Captioning.
Proceedings of EACL. |
2023 |
MultiFin: A Dataset for Multilingual Financial NLP.
Findings of EACL. |