•  272
    Continual Learning Requires Evaluating Trajectories
    with Lorenzo Pacchiardi, Patricia Paskov, Seán Ó hÉigeartaigh, Fernando Martínez-Plumed, Katherine M. Collins, Fazl Barez, Matteo Gabriel Mecattaf, Zafeirios Fountas, Risto Uuk, Sanmi Koyejo, Cozmin Ududec, and José Hernández-Orallo
    AI systems increasingly incorporate continual learning mechanisms allowing their behaviour to adapt after deployment, from (1) in-context learning and (2) memory features already in wide use to (3) post-deployment weight modification under research. We argue that, by treating AI systems as frozen artefacts whose performance and safety are assessed at release, current evaluation practices structurally ignore the behavioural trajectory of a system that continues to learn from experience. Our posit…Read more
  •  37
    A cognitive template for human face detection
    with Rob Jenkins, Rana Qarooni, and Markus Bindemann
    Cognition 249 (C): 105792. 2024.
  •  190
    Reverse Turing Tests for Human-Machine Task Suitability Assessments Should be Profile-Driven
    with Marko Tešić, John Burden, Ben Slater, Zachary Tidler, Paul Clothier, Luning Sun, Katherine Collins, Bernardo Gonçalves, Giulio Corsi, Seán Ó hÉigeartaigh, Lucy Cheke, and Jose Hernandez-Orallo
    As AI is integrated into the workplace, organisations increasingly face allocation decisions between human and machine workers. These decisions are increasingly made or assisted by algorithms, creating a Reverse Turing Test dynamic wherein the machine is now the judge. In addition, human and machine workers may ``compete'' for a given task, reproducing aspects of adversarial games. This raises new methodological questions about assessing task suitability between humans and machines. The criteria…Read more
  •  44
    Capacity limits in face detection
    with Rana Qarooni, Markus Bindemann, and Rob Jenkins
    Cognition 228 (C): 105227. 2022.