Team & Collaborators
Collaborators
I am very grateful to a number of academics, researchers and PhD students
with whom I am working or have previously worked:
- The Cambridge PicoLM Team: Richard Diehl Martinez
(and his MPhil/Part III students: Yuval Weiss and David Demitri Africa),
Ryan Daniels (a Machine Learning Engineer with the
Accelerate Programme for Scientific Discovery), and Prof Paula Buttery.
- Cambridge NLIP Group: Zeb Goriely,
Pietro Lesci,
Julius Cheng,
Dr Guy Emerson,
Dr Fermin Moscoso del Prado Martin (we are working on information-theoretic models of diachronic phonological typology),
Paul Siewert (he and I have also been thinking about category-theoretic approaches in linguistics),
Mila Marcheva,
Xiaochen (Neo) Zhu.
- ALTA Institute (Computer Science & Technology):
Gabrielle Gaudeau,
Dr Diana Galvan Sosa,
Dr Zheng Yuan,
Yuan Gao
- XFACT (KAIST AI) and NAVER Cloud: Jiwoo Hong (KAIST/Amazon Rufus),
Noah Lee (KAIST/Naver Cloud),
Dr James Thorne,
Jeonghoon Kim (NAVER Cloud, KAIST AI),
Woojin Chung.
- Cambridge Theoretical and Applied Linguistics (TAL): Nuria Bosch Masip
- Dr Donya Rooin (MilaNLP, Bocconi University) and Hongyi (Sam) Gu (KCL/NetMind.AI)
- Dr Konstantinos Voudouris (Institute for Human-Centered AI at Helmholtz Munich)
- Cambridge Language Technology Lab (LTL): Yijie Zhou (EJ),
Fangyu Liu,
Prof Nigel Collier
- Marek Masiak (Oxford University)
- Dr Thiemo Wambsganss (University of St Gallen, now Bern University – UROP Supervisor 2021)
Students
I have had the pleasure of supervising the following students for their research projects!
MPhil Advanced Computer Science (ACS)/Part III CST
- Bianca-Mihaela Ganescu – my MPhil student, who completed a thesis on small multimodal (vision-language) models for the MPhil in Advanced Computer Science. Co-supervised with Dr Andrew Caines and Prof Paula Buttery.
Undergraduate
- Ellie Polyakova Reed (ep757), Shivan Arora (sa2200)
- Jacy To (cyt33)
- Ali Kheirkhah (PicoLM Research Intern, co-supervised by Richard Diehl Martinez)
Supervised Theses and Dissertations
- Integrating Cognitively-Inspired Selective Attention Cues in Small Vision-Language Models (Bianca Ganescu, 2025, MPhil Advanced Computer Science Thesis)
- Evaluating Typological Effects of L2 English in Language Model Benchmarks (Ellie P. Reed, 2025, Supervised UROP Report)
- Controlled Text Generation: a teacher-student model paradigm for CEFR-levelled complexity control (Shivan Arora, 2025, Supervised UROP Report)