August 2025 – I'm co-chairing the poster session at the Cambridge Language Sciences Annual Symposium 2025: Ambitions for language science in 2050 (Thursday 27 November 2025 | 13:00 – 19:30, Cripps Court, Magdalene College, Cambridge).
August 2025 – I've joined the Organising Committee of the 23rd Old-World Conference in Phonology (OCP23), which will take place at Gonville & Caius College, Cambridge (United Kingdom), 14–16 January 2026. I'm supporting lead organiser Yury Makarov and programme chair Prof Bert Vaux.
July 2025 – Measuring Grammatical Diversity from Small Corpora: Derivational Entropy Rates, Mean Length of Utterances, and Annotation Invariance (Fermín Moscoso del Prado Martín, Suchir Salhan) @ ACL Main Conference (Computational Linguistics Journal Accepted Paper) (Vienna, Austria).
July 2025 – ByteSpan @ ICML Tokenisation Workshop (Vancouver, Canada).
July – August 2025 – Co-supervising two Small Language Model Undergraduate Research Opportunity (UROP) students, based on Pico, our learning dynamics framework. Contact: sas245@cam.ac.uk, pjb48@cam.ac.uk.
July – August 2025 – Google DeepMind Research Ready Mentor with Richard Diehl Martinez and Prof Paula Buttery, supported by Google DeepMind, the Hg Foundation, and the Royal Academy of Engineering.
June 2025 – Invited Keynote Talk at The 13th International Conference on the Mental Lexicon, Montréal, Québec, Canada. The Distribution of Phonemes across Languages: Chance, costs, and integration across linguistic tiers (Dr Fermín Moscoso del Prado Martín & Suchir Salhan).
June 2025 – Position Paper in Cambridge Occasional Papers in Linguistics (CoPiL): Linguistics in the Age of Language Models: What Can Cognitively-Inspired Language Models Offer to Linguistic Theory?
June 2025 – Posters at the Cambridge Learning & Human Intelligence (LHI) Expo, Department of Computer Science & Technology, Cambridge Centre of Human-Inspired AI.
Easter 2025 – Dr Weiwei Sun and I are co-organising a reading group on Computational Models of Language. To join, email sas245@cam.ac.uk or ws390@cam.ac.uk.
March 2025 – We released PicoLM, the Cambridge Small Language Model & Learning Dynamics Framework. YouTube: Introducing PicoLM.
March 2025 – Poster at HumanCLAIM (Göttingen, Germany).
March 2025 – Talk in Tübingen: “Human Validated Grammar Profiles for Language Models”, Colloquium organised by Prof Detmar Meurers (Tübingen, Germany).
Lent 2025 – Teaching Assistant for CST IA Machine Learning & Real World Data; Supervisor for Machine Learning & Bayesian Inference (MLBI) [CST Part II].
I organise the Natural Language & Information Processing (NLIP) Seminars, Department of Computer Science & Technology, University of Cambridge.
November 2024 – Guest lecture for MPhil course, University of Cambridge, with Prof Paula Buttery & Dr Fermín Moscoso del Prado Martín: Language Model Evaluation.
November 2024 – MEng Thesis @ 2nd BabyLM Workshop (CoNLL, EMNLP) (Miami, Florida, U.S.). ACL paper: https://aclanthology.org/2024.conll-babylm.15/.