CV | Shaomu Tan

Name	Shaomu Tan
Email	s.tan@uva.nl
Research areas	Machine translation, multilingual LLMs, reasoning systems, translation evaluation, reward modeling, and LLM post-training.
Status	PhD candidate at the University of Amsterdam; expected completion in 2026.

2026
What Does LLM Refinement Actually Improve? A Systematic Study on Document-Level Literary Translation
- ACL 2026
- Shaomu Tan, Dawei Zhu, Ke Tran, Michael Denkowski, Sony Trenous, Bill Byrne, Leonardo Ribeiro, Felix Hieber
2026
Remedy-R: Generative Reasoning for Machine Translation Evaluation without Error Annotations
- ACL 2026
- Shaomu Tan, Ryosuke Mitani, Ritvik Choudhary, Qiyu Wu, Toshiyuki Sekiya, Christof Monz
2025
ReMedy: Learning Machine Translation Evaluation from Human Preferences with Reward Modeling
- EMNLP 2025
- Shaomu Tan, Christof Monz
2024
Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation
- EMNLP 2024
- Shaomu Tan, Di Wu, Christof Monz
2024
How Far Can 100 Samples Go? Unlocking Zero-Shot Translation with Tiny Multi-Parallel Data
- ACL Findings 2024
- Di Wu, Shaomu Tan, Yan Meng, David Stap, Christof Monz
2023
Towards a Better Understanding of Variations in Zero-Shot Neural Machine Translation Performance
- EMNLP 2023
- Shaomu Tan, Christof Monz
2023
Make Pre-trained Model Reversible: From Parameter to Memory Efficient Fine-Tuning
- NeurIPS 2023
- Baohao Liao, Shaomu Tan, Christof Monz

2025
WSDM Cup - Multilingual Chatbot Arena
- Silver Medal - Ranked 19/950 teams
- Trained 7B-14B reward models for multilingual human preference prediction with DeepSpeed, vLLM, quantization, pseudo-labeling, and model merging.
2023
Google American Sign Language Fingerspelling Recognition
- Gold Medal - Ranked 11/1,315 teams
- Designed and implemented a Conformer-Transformer architecture over sign-language landmarks with CTC and cross-entropy objectives.
2023
WMT 2023 General Machine Translation Shared Task
- First Place on the English-Hebrew constrained track
- Built a compact encoder-decoder MT system that performed on par with GPT-4 5-shot prompting.

2024-10
Interference and Knowledge Transfer in Multilingual Translation Models

University of Amsterdam, AI Master course Deep Learning for NLP
- Co-presented with Prof. Christof Monz.
2024-06
A Journey on Multilingual Neural Machine Translation

Utrecht University, AI Bachelor course Models for Language Processing
- Co-presented with Prof. Denis Paperno.
2024-2026
Program Committee / Reviewing

ACL Rolling Review (ARR), TASLP, WMT, and MRL
- Reviewed for ACL, EMNLP, EACL, NAACL, TASLP, WMT, and Multilingual Representation Learning.

Mar 2026

Toward Explainable, Robust, and Actionable Translation Quality Estimation

EACL 2026, Multilingual Multicultural Evaluation (MME) workshop, Rabat, Morocco
Aug 2025

Remedy-R: Large Reasoning Models for Machine Translation Evaluation

University of Tokyo, invited by Prof. Yoshimasa Tsuruoka
Jul 2025

The Second Half of Machine Translation

Nara Institute of Science and Technology, invited by Prof. Taro Watanabe

Programming & Systems
- Python, Bash/Linux, Git, Docker, Slurm, AWS.
Deep Learning & NLP
- PyTorch, Hugging Face Transformers, TRL, PEFT, SentencePiece, Fairseq.
LLM Training & Post-Training
- Verl, OpenRLHF, vLLM, Megatron-LM, Llama-Recipes, NeMo.
- DeepSpeed, FSDP, Flash-Attention, distributed training and inference up to 72B LLMs.
Languages
- Chinese native speaker.
- English full professional proficiency.