Julia Kreutzer

Research Scientist at Cohere for AI

View My GitHub Profile

About Me

I’m a senior research scientist at Cohere for Labs, where I conduct research on large language models, centered around multilinguality, reinforcement learning, and evaluation. I am also an associate industry member at MILA. I am based in Canada, in the suburbs of Montreal. Previously I worked at Google Translate (Research), Montreal, with a focus on machine translation. Very broadly speaking, I am interested in the intersection of natural language processing (NLP) and machine learning, especially where multiple languages come into play. I obtained my PhD from Heidelberg University, Germany, under supervision of Prof. Stefan Riezler at the StatNLP group.

My Research

RLHF before it was cool

In my PhD, I investigated how reinforcement learning algorithms can be used to turn weak supervision signals from users into meaningful updates for a machine translation system. If you’re into RLHF, check out my early works on learning from simulated human feedback (ACL 2016) - we called it “bandit structured prediction” back then, and learning from actual user feedback (NAACL 2018) for sequence-to-sequence models. You can find many elements (and challenges) of today’s leading RL algorithms there, just at smaller scale and before LLMs and the RLHF branding. A few years later we went back to these basics by taking apart PPO and bringing back good old REINFORCE.

Accessibility & Inclusion

🎯 My long-term goal for NLP research is to make it more accessible and inclusive, along multiple dimensions:

  1. Underresourced NLP: Foster research for underresourced languages and by underrepresented groups, such that not only English-speaking users can benefit from the progress we’re making in NLP. I’m particularly interested in helping grassroots communities, such as Masakhane, grow and mature.
  2. Novices: Reduce the entry burdens (in terms of coding and research practices) for novices in the field, especially for new students or researchers from other related areas. You can often find me at mentoring events, and I am generally generous with my time when it comes to helping out newcomers find their direction and resources to get started with research.
  3. Women: NLP and ML research is still a male-dominated field, and it can be challenging to navigate for gender minorities. It takes time to grow a network of support - and as a woman it often requires finding your own unique path because there are not many footsteps to follow. I am mom of a two toddlers, so if you’d like to connect to chat about balancing family and research, reach out, and I am motivated to make research a more supportive place for young families.

Check out our lab’s scholar program (sub-PhD internships, note that I do not host PhD interns) and Grant calls, these are unique opportunities for getting started with LLM research. Another great point of entry is the Cohere Labs community, our open science initiative.

Multilinguality

Most leading LLMs today are multilingual, more or less intentionally. I don’t see it as optional to evaluate techniques and models across languages, but rather as an obligation for responsible research with a global perspective (most critical for safety). Besides adding one dimension to every evaluation, it is a good reality check for robustness of developed techniques. While multilinguality can seen overwhelming and daunting, it is the perfect opportunity for collaboration, as we can put our global network of researchers into action, e.g. in data audits, or for open LLM or data development.

Evaluation

The faster model development and releases, the more important become evaluations. They represent our compass and proxies to real-world impact. It is incredibly important that we make sure to constantly evolve them, and maintain rigor and the will to look beyond a single score or ranking. I have written about this in two blogposts (Elo ratings, fair and comprehensive multilingual LLM evaluation practices) in collaboration with Singapore, and in a recent COLM paper, where we draw the connections between LLM and MT evaluations.

⏳ Last updated: 2 Nov 2025. If there’s no recent news below, it means I have been too busy with life and research.

News

Publications

Google scholar

Contact

Email: <lowercase first + last name>.@cohere.com