CV

Research CV focused on multilingual LLMs, machine translation, reward modeling, and evaluation.

General Information

Name Shaomu Tan
Email s.tan@uva.nl
Research areas Machine translation, multilingual LLMs, reasoning systems, translation evaluation, reward modeling, and LLM post-training.
Status PhD candidate at the University of Amsterdam; expected completion in 2026.

Education

  • 2022-2026

    Amsterdam, NL

    PhD in Computer Science
    University of Amsterdam, Amsterdam, Netherlands
    • Focus on LLMs and machine translation.
    • Supervised by Christof Monz.
  • 2020-2022

    Utrecht, NL

    Master of Science in Artificial Intelligence
    Utrecht University, Utrecht, Netherlands
    • GPA: 4.0/4.0.
  • 2016-2020

    Shandong, CN

    Bachelor of Science in Information System
    Shandong University, Shandong, China

Professional Experience

  • 2025-10 - 2026-01

    Berlin, DE

    Applied Scientist Research Intern
    Amazon AGI, Berlin, Germany
    • Conducted research on test-time scaling for long-context, book-level machine translation, bridging research and production LLM systems.
    • Designed and led a large-scale human evaluation pipeline with 1M annotated words and around $90K USD in annotation cost.
    • Research output accepted to ACL 2026.
  • 2025-06 - 2025-09

    Tokyo, JP

    Research Intern
    Sony Speech & Language AI Lab, Tokyo, Japan
    • Developed reasoning-based LLM evaluation models for translation quality estimation and refinement.
    • Applied RLVR to improve evaluation reliability and alignment with human preferences using 7B-32B LLMs.
    • Built a multi-role agentic LLM framework for evaluation and refinement pipelines.
    • Research output accepted to ACL 2026.

Selected Publications

Competitions

Teaching & Service

Invited Talks

Technical Skills

  • Programming & Systems
    • Python, Bash/Linux, Git, Docker, Slurm, AWS.
  • Deep Learning & NLP
    • PyTorch, Hugging Face Transformers, TRL, PEFT, SentencePiece, Fairseq.
  • LLM Training & Post-Training
    • Verl, OpenRLHF, vLLM, Megatron-LM, Llama-Recipes, NeMo.
    • DeepSpeed, FSDP, Flash-Attention, distributed training and inference up to 72B LLMs.
  • Languages
    • Chinese native speaker.
    • English full professional proficiency.