1st WMDQS to take place at COLM 2025
The First Workshop on Multilingual Data Quality Signals will happen at COLM 2025 in Montreal.
The 1st Workshop on Multilingual Data Quality Signals (WMDQS will take place on October 10 in Montreal. We have an exciting agenda with 3 invited speakers, long and short research papers, and a shared task on Language ID. The first shared task is focused on community annotation to get broader langauge coverage. The second shared task is focused on language ID. Both are important tasks that are very early in a pipeline for building foundational models (aka VLMs and LLMs) but are often overlooked.
We look forward to seeing you in Canada soon.