CV
Research CV focused on multilingual LLMs, machine translation, reward modeling, and evaluation.
General Information
| Name | Shaomu Tan |
| s.tan@uva.nl | |
| Research areas | Machine translation, multilingual LLMs, reasoning systems, translation evaluation, reward modeling, and LLM post-training. |
| Status | PhD candidate at the University of Amsterdam; expected completion in 2026. |
Education
-
2022-2026 Amsterdam, NL
PhD in Computer Science
University of Amsterdam, Amsterdam, Netherlands - Focus on LLMs and machine translation.
- Supervised by Christof Monz.
-
2020-2022 Utrecht, NL
-
2016-2020 Shandong, CN
Bachelor of Science in Information System
Shandong University, Shandong, China
Professional Experience
-
2025-10 - 2026-01 Berlin, DE
Applied Scientist Research Intern
Amazon AGI, Berlin, Germany - Conducted research on test-time scaling for long-context, book-level machine translation, bridging research and production LLM systems.
- Designed and led a large-scale human evaluation pipeline with 1M annotated words and around $90K USD in annotation cost.
- Research output accepted to ACL 2026.
-
2025-06 - 2025-09 Tokyo, JP
Research Intern
Sony Speech & Language AI Lab, Tokyo, Japan - Developed reasoning-based LLM evaluation models for translation quality estimation and refinement.
- Applied RLVR to improve evaluation reliability and alignment with human preferences using 7B-32B LLMs.
- Built a multi-role agentic LLM framework for evaluation and refinement pipelines.
- Research output accepted to ACL 2026.
Selected Publications
-
2026 What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation
- ACL 2026
- Shaomu Tan, Dawei Zhu, Ke Tran, Michael Denkowski, Sony Trenous, Bill Byrne, Leonardo Ribeiro, Felix Hieber
-
2026 Remedy-R: Generative Reasoning for Machine Translation Evaluation without Error Annotations
- ACL 2026
- Shaomu Tan, Ryosuke Mitani, Ritvik Choudhary, Qiyu Wu, Toshiyuki Sekiya, Christof Monz
-
2025 ReMedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling
- EMNLP 2025
- Shaomu Tan, Christof Monz
-
2024 Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation
- EMNLP 2024
- Shaomu Tan, Di Wu, Christof Monz
-
2024 How Far Can 100 Samples Go? Unlocking Zero-Shot Translation with Tiny Multi-Parallel Data
- ACL Findings 2024
- Di Wu, Shaomu Tan, Yan Meng, David Stap, Christof Monz
-
2023 Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance
- EMNLP 2023
- Shaomu Tan, Christof Monz
-
2023 Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
- NeurIPS 2023
- Baohao Liao, Shaomu Tan, Christof Monz
Competitions
-
2025 WSDM Cup - Multilingual Chatbot Arena
- Silver Medal - Ranked 19/950 teams
- Trained 7B-14B reward models for multilingual human preference prediction with DeepSpeed, vLLM, quantization, pseudo-labeling, and model merging.
-
2023 Google American Sign Language Fingerspelling Recognition
- Gold Medal - Ranked 11/1,315 teams
- Designed and implemented a Conformer-Transformer architecture over sign-language landmarks with CTC and cross-entropy objectives.
-
2023 WMT 2023 General Machine Translation Shared Task
- First Place on the English-Hebrew constrained track
- Built a compact encoder-decoder MT system that performed on par with GPT-4 5-shot prompting.
Teaching & Service
-
2024-10 Interference and Knowledge Transfer in Multilingual Translation Models
University of Amsterdam, AI Master course Deep Learning for NLP - Co-presented with Prof. Christof Monz.
-
2024-06 A Journey on Multilingual Neural Machine Translation
Utrecht University, AI Bachelor course Models for Language Processing - Co-presented with Prof. Denis Paperno.
-
2024-2026 Program Committee / Reviewing
ACL Rolling Review (ARR), TASLP, WMT, and MRL - Reviewed for ACL, EMNLP, EACL, NAACL, TASLP, WMT, and Multilingual Representation Learning.
Invited Talks
-
Aug 2025 Remedy-R: Large Reasoning Models for Machine Translation Evaluation
University of Tokyo, invited by Prof. Yoshimasa Tsuruoka -
Jul 2025 The Second Half of Machine Translation
Nara Institute of Science and Technology, invited by Prof. Taro Watanabe
Technical Skills
-
Programming & Systems
- Python, Bash/Linux, Git, Docker, Slurm, AWS.
-
Deep Learning & NLP
- PyTorch, Hugging Face Transformers, TRL, PEFT, SentencePiece, Fairseq.
-
LLM Training & Post-Training
- Verl, OpenRLHF, vLLM, Megatron-LM, Llama-Recipes, NeMo.
- DeepSpeed, FSDP, Flash-Attention, distributed training and inference up to 72B LLMs.
-
Languages
- Chinese native speaker.
- English full professional proficiency.