Maieutic Lab

Hackerman Hall

Johns Hopkins University

Baltimore, MD 21218

Welcome to the Multilingual Artificial Intelligence for Eliciting Understanding Through Intermodal Content (MAIEUTIC) Lab website. We are a team of researchers at Johns Hopkins University who work on making AI systems perform better in languages beyond English.

We work on a wide variety of multilingual aspects of artificial intelligence across a range of modalities. This includes Machine Translation, Cross-Lingual Retrieval (and Retrieval Augmented Generation), Preference Optimization, Dataset Creation, Hyperparameter Optimization, Robust Evaluation, and core Machine Learning. For instance, in Machine Translation, we work not only on text translation but also on Optical Character Recognition (OCR) Translation and Speech (Audio) Translation. Similarly, in retrieval, we work on finding events in large collections of videos in multiple languages rather than working only with text. This is only a small subset of our ongoing projects and interests. You can find out more in the repositories tab, and the websites of our individual researchers go into even more depth.

We are always looking to work on fun problems and collaborate with many other centers and labs both within JHU and beyond. If you are a student currently affiliated with Hopkins and are interested in joining the lab, please fill out this form.

news

Sep 17, 2025 1st WMDQS to take place at COLM 2025
Aug 08, 2025 1st MAGMaR Workshop takes place at ACL 2025
Aug 06, 2025 Welcome to the new MAIEUTIC Lab Website.

selected publications

  1. ICML 2024
    Contrastive preference optimization: pushing the boundaries of LLM performance in machine translation
    Haoran Xu, Amr Sharaf, Yunmo Chen, and 5 more authors
    In Proceedings of the 41st International Conference on Machine Learning, 2024
  2. NAACL 2024
    Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
    Nathaniel Robinson, Raj Dabre, Ammon Shurtz, and 14 more authors
    In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jun 2024
  3. LREC-COLING 2024
    Exploring Geometric Representational Disparities between Multilingual and Bilingual Translation Models
    Neha Verma, Kenton Murray, and Kevin Duh
    In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
  4. ICLR 2024
    Error norm truncation: Robust training in the presence of data noise for text generation models
    Tianjian Li, Haoran Xu, Philipp Koehn, and 2 more authors
    In Proceedings of the 12th International Conference on Learning Representations, May 2024