Course notes
Writing and summarizing help me process and remember. Here I collect longer write-ups, mostly created as I work through a course.
01 Introduction to AI Evaluation 2 lectures
by José Hernández-Orallo, Leverhulme Centre for the Future of Intelligence, University of Cambridge
3 Feb 2026
by Jin-Dong Wang, William & Mary, Department of Data Science
8 Feb 2026
03 Metrics and Experimental Methodology 5 lectures
by Line Clemmensen, Technical University of Denmark (DTU)
10 Feb 2026
by Peter Flach, University of Bristol
11 Feb 2026
by Line Clemmensen, Technical University of Denmark (DTU)
12 Feb 2026
by Cèsar Ferri, Universitat Politècnica de València (UPV)
12 Feb 2026
by Thomas Dietterich, Oregon State University
24 Feb 2026
04 Benchmarks, Leaderboards and Competitions 5 lectures
by Joaquin Vanschoren, Eindhoven University of Technology
17 Feb 2026
by Lorenzo Pacchiardi, University of Cambridge
17 Feb 2026
by Lorenzo Pacchiardi, University of Cambridge
17 Feb 2026
by Manuel Cebrián, Spanish National Research Council (CSIC)
24 Feb 2026
by Joel Z Leibo, Google DeepMind
27 Feb 2026
05 Adversarial Evaluations 4 lectures
by Cozmin Ududec, UK AI Security Institute
3 Mar 2026
by Laura Weidinger, Google DeepMind
3 Mar 2026
by Adam Gleave, FAR.AI
5 Mar 2026
by Mohammad Taufeeque, FAR.AI
5 Mar 2026
06 Construct-Based Evaluation 4 lectures
by Liming Jiang, Microsoft Research Asia
6 Mar 2026
by Lucy Cheke, University of Cambridge
11 Mar 2026
by Marko Tešić, UK Department for Science, Innovation and Technology (DSIT)
12 Mar 2026
by Sanmi Koyejo, Stanford University
18 Mar 2026
07 Interpretability 2 lectures
by Callum McDougall, ARENA
20 Mar 2026
by Fazl Barez, University of Oxford
21 Mar 2026
08 Agentic, Alignment and Control Evaluations 6 lectures
by Tyler Tracy, Redwood Research
24 Mar 2026
by Xiaoyuan Yi, Microsoft Research Asia
24 Mar 2026
by Cozmin Ududec, UK AI Security Institute
25 Mar 2026
by Xiaoyuan Yi, Microsoft Research Asia
25 Mar 2026
by Jérémy Scheurer, Apollo Research
26 Mar 2026
by Xiaoyuan Yi, Microsoft Research Asia
26 Mar 2026
09 Real-World Evaluation: Societal Impacts of AI 6 lectures
by Katie Collins, MIT (Computational Cognitive Science) & Princeton AI Lab
7 Apr 2026
by Jonathan Prunty, Leverhulme Centre for the Future of Intelligence, University of Cambridge
7 Apr 2026
by Marko Tešić, UK Department for Science, Innovation and Technology (DSIT)
7 Apr 2026
by Laura Weidinger, Google DeepMind
8 Apr 2026
by Angelina Wang, Cornell Tech & Cornell University
9 Apr 2026
by Tom Cunningham, METR
21 Apr 2026
10 Governance, Policy and Regulation 5 lectures
by Patricia Paskov, RAND Corporation & Oxford Martin AI Governance Initiative
14 Apr 2026
by Seán Ó hÉigeartaigh, Leverhulme Centre for the Future of Intelligence, University of Cambridge
14 Apr 2026
by Seán Ó hÉigeartaigh, Leverhulme Centre for the Future of Intelligence, University of Cambridge
16 Apr 2026
by Stuart Elliott, OECD
22 Apr 2026
by Patricia Paskov, RAND Corporation & Oxford Martin AI Governance Initiative
23 Apr 2026