-
127Social Choice Should Guide AI Alignment in Dealing with Diverse Human FeedbackProceedings of the 41St International Conference on Machine Learning 41 9346-9360. 2024.Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How can we aggregate the in…Read more
-
9Artificial IntelligenceIn S. Matthew Liao (ed.), Ethics of Artificial Intelligence, Oxford University Press. pp. 327-341. 2020.This chapter argues that there is very little chance that we humans can specify our objectives completely and correctly, in such a way that the pursuit of those objectives by more capable machines is guaranteed to result in beneficial outcomes for humans. Consequently, this chapter defends and further articulates the need for “provably beneficial AI,” which is the idea that to the extent that human values are revealed in our behavior, we should be able to get machines to learn underlying human p…Read more
-
6Rationality and IntelligenceIn Renee Elio (ed.), Common sense, reasoning, & rationality, Oxford University Press. pp. 37-59. 2002.This chapter considers how to formalize intelligence or rationality in a way that has value for the development of agents built for a specific application and of general theories of intelligence. It presents three candidates that traditionally have stood as formalizations of intelligence: perfect rationality, calculative rationality, and meta-level rationality. Perfect rationality is an abstraction that does not correspond to any physical reasoner. Calculative rationality fails to scale up to pr…Read more
-
39Artificial Intelligence: A Modern ApproachPearson. 2020."Updated edition of popular textbook on Artificial Intelligence. This edition specific looks at ways of keeping artificial intelligence under control"--
-
67AI content detection in the emerging information ecosystem: new obligations for media and tech companiesEthics and Information Technology 26 (4): 1-14. 2024.The world is about to be swamped by an unprecedented wave of AI-generated content. We need reliable ways of identifying such content, to supplement the many existing social institutions that enable trust between people and organisations and ensure social resilience. In this paper, we begin by highlighting an important new development: providers of AI content generators have new obligations to support the creation of reliable detectors for the content they generate. These new obligations arise ma…Read more
-
52Correction: AI content detection in the emerging information ecosystem: new obligations for media and tech companiesEthics and Information Technology 26 (4): 1-2. 2024.
-
153Generative AI models should include detection mechanisms as a condition for public releaseEthics and Information Technology 25 (4): 1-7. 2023.The new wave of ‘foundation models’—general-purpose generative AI models, for production of text (e.g., ChatGPT) or images (e.g., MidJourney)—represent a dramatic advance in the state of the art for AI. But their use also introduces a range of new risks, which has prompted an ongoing conversation about possible regulatory mechanisms. Here we propose a specific principle that should be incorporated into legislation: that any organization developing a foundation model intended for public use must …Read more
-
35Object identification: a Bayesian analysis with application to traffic surveillanceArtificial Intelligence 103 (1-2): 77-93. 1998.
-
1785A Logical Approach to Reasoning by AnalogyIn John P. McDermott (ed.), Proceedings of the 10th International Joint Conference on Artificial Intelligence (IJCAI'87), Morgan Kaufmann Publishers. pp. 264-270. 1987.We analyze the logical form of the domain knowledge that grounds analogical inferences and generalizations from a single instance. The form of the assumptions which justify analogies is given schematically as the "determination rule", so called because it expresses the relation of one set of variables determining the values of another set. The determination relation is a logical generalization of the different types of dependency relations defined in database theory. Specifically, we define dete…Read more
-
24Rationality and Intelligence: A Brief UpdateIn Vincent C. Müller (ed.), Fundamental Issues of Artificial Intelligence, Springer. pp. 7-28. 2016.The long-term goal of AI is the creation and understanding of intelligence. This requires a notion of intelligence that is precise enough to allow the cumulative development of robust systems and general results. The concept of rational agency has long been considered a leading candidate to fulfill this role. This paper, which updates a much earlier version (Russell, Artif Intell 94:57–77, 1997), reviews the sequence of conceptual shifts leading to a different candidate, bounded optimality, that…Read more