-
250AI systems increasingly incorporate continual learning mechanisms allowing their behaviour to adapt after deployment, from (1) in-context learning and (2) memory features already in wide use to (3) post-deployment weight modification under research. We argue that, by treating AI systems as frozen artefacts whose performance and safety are assessed at release, current evaluation practices structurally ignore the behavioural trajectory of a system that continues to learn from experience. Our posit…Read more
-
179As AI is integrated into the workplace, organisations increasingly face allocation decisions between human and machine workers. These decisions are increasingly made or assisted by algorithms, creating a Reverse Turing Test dynamic wherein the machine is now the judge. In addition, human and machine workers may ``compete'' for a given task, reproducing aspects of adversarial games. This raises new methodological questions about assessing task suitability between humans and machines. The criteria…Read more
-
41Mapping Intelligence: Requirements and PossibilitiesIn Vincent C. Müller (ed.), Philosophy and theory of artificial intelligence 2017, Springer Verlag. pp. 117-135. 2017.New types of artificial intelligence (AI), from cognitive assistants to social robots, are challenging meaningful comparison with other kinds of intelligence. How can such intelligent systems be catalogued, evaluated, and contrasted, with representations and projections that offer meaningful insights? To catalyse the research in AI and the future of cognition, we present the motivation, requirements and possibilities for an atlas of intelligence: an integrated framework and collaborative open re…Read more
-
626Your Prompt is my command: On Assessing the Human-Centred Generality of Multimodal ModelsJournal of Artificial Intelligence Research 77. 2023.Even with obvious deficiencies, large prompt-commanded multimodal models are proving to be flexible cognitive tools representing an unprecedented generality. But the directness, diversity, and degree of user interaction create a distinctive “human-centred generality” (HCG), rather than a fully autonomous one. HCG implies that —for a specific user— a system is only as general as it is effective for the user’s relevant tasks and their prevalent ways of prompting. A human-centred evaluation of gene…Read more
-
74The Facets of Artificial Intelligence: A Framework to Track the Evolution of AIIn Fernando Martínez-Plumed, Bao Sheng Loe, Peter Flach, Sean O. O. HEigeartaigh, Karina Vold & José Hernández-Orallo (eds.), Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence Evolution of the contours of AI, . pp. 5180-5187. 2018.We present nine facets for the analysis of the past and future evolution of AI. Each facet has also a set of edges that can summarise different trends and contours in AI. With them, we first conduct a quantitative analysis using the information from two decades of AAAI/IJCAI conferences and around 50 years of documents from AI topics, an official database from the AAAI, illustrated by several plots. We then perform a qualitative analysis using the facets and edges, locating AI systems in the int…Read more
-
Cambridge UniversityResearcher
Cambridge, United Kingdom of Great Britain and Northern Ireland