•  127
    Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback
    with Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mosse, Eric Pacuit, Hailey Schoelkopf, Emanuel Tewolde, and William S. Zwicker
    Proceedings of the 41St International Conference on Machine Learning 41 9346-9360. 2024.
    Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How can we aggregate the in…Read more
  •  9
    Artificial Intelligence
    In S. Matthew Liao (ed.), Ethics of Artificial Intelligence, Oxford University Press. pp. 327-341. 2020.
    This chapter argues that there is very little chance that we humans can specify our objectives completely and correctly, in such a way that the pursuit of those objectives by more capable machines is guaranteed to result in beneficial outcomes for humans. Consequently, this chapter defends and further articulates the need for “provably beneficial AI,” which is the idea that to the extent that human values are revealed in our behavior, we should be able to get machines to learn underlying human p…Read more
  •  6
    Rationality and Intelligence
    In Renee Elio (ed.), Common sense, reasoning, & rationality, Oxford University Press. pp. 37-59. 2002.
    This chapter considers how to formalize intelligence or rationality in a way that has value for the development of agents built for a specific application and of general theories of intelligence. It presents three candidates that traditionally have stood as formalizations of intelligence: perfect rationality, calculative rationality, and meta-level rationality. Perfect rationality is an abstraction that does not correspond to any physical reasoner. Calculative rationality fails to scale up to pr…Read more
  •  39
    Artificial Intelligence: A Modern Approach
    with Peter Norvig
    Pearson. 2020.
    "Updated edition of popular textbook on Artificial Intelligence. This edition specific looks at ways of keeping artificial intelligence under control"--
  •  67
    AI content detection in the emerging information ecosystem: new obligations for media and tech companies
    with Alistair Knott, Dino Pedreschi, Toshiya Jitsuzumi, Susan Leavy, David Eyers, Tapabrata Chakraborti, Andrew Trotman, Sundar Sundareswaran, Ricardo Baeza-Yates, Przemyslaw Biecek, Adrian Weller, Paul D. Teal, Subhadip Basu, Mehmet Haklidir, Virginia Morini, and Yoshua Bengio
    Ethics and Information Technology 26 (4): 1-14. 2024.
    The world is about to be swamped by an unprecedented wave of AI-generated content. We need reliable ways of identifying such content, to supplement the many existing social institutions that enable trust between people and organisations and ensure social resilience. In this paper, we begin by highlighting an important new development: providers of AI content generators have new obligations to support the creation of reliable detectors for the content they generate. These new obligations arise ma…Read more
  •  52
    Correction: AI content detection in the emerging information ecosystem: new obligations for media and tech companies
    with Alistair Knott, Dino Pedreschi, Toshiya Jitsuzumi, Susan Leavy, David Eyers, Tapabrata Chakraborti, Andrew Trotman, Sundar Sundareswaran, Ricardo Baeza-Yates, Przemyslaw Biecek, Adrian Weller, Paul D. Teal, Subhadip Basu, Mehmet Haklidir, Virginia Morini, and Yoshua Bengio
    Ethics and Information Technology 26 (4): 1-2. 2024.
  •  153
    Generative AI models should include detection mechanisms as a condition for public release
    with Alistair Knott, Dino Pedreschi, Raja Chatila, Tapabrata Chakraborti, Susan Leavy, Ricardo Baeza-Yates, David Eyers, Andrew Trotman, Paul D. Teal, Przemyslaw Biecek, and Yoshua Bengio
    Ethics and Information Technology 25 (4): 1-7. 2023.
    The new wave of ‘foundation models’—general-purpose generative AI models, for production of text (e.g., ChatGPT) or images (e.g., MidJourney)—represent a dramatic advance in the state of the art for AI. But their use also introduces a range of new risks, which has prompted an ongoing conversation about possible regulatory mechanisms. Here we propose a specific principle that should be incorporated into legislation: that any organization developing a foundation model intended for public use must …Read more
  •  35
    Object identification: a Bayesian analysis with application to traffic surveillance
    with Timothy Huang
    Artificial Intelligence 103 (1-2): 77-93. 1998.
  •  50
    Optimal composition of real-time systems
    with Shlomo Zilberstein
    Artificial Intelligence 82 (1-2): 181-213. 1996.
  •  68
    Principles of metareasoning
    with Eric Wefald
    Artificial Intelligence 49 (1-3): 361-395. 1991.
  •  1785
    We analyze the logical form of the domain knowledge that grounds analogical inferences and generalizations from a single instance. The form of the assumptions which justify analogies is given schematically as the "determination rule", so called because it expresses the relation of one set of variables determining the values of another set. The determination relation is a logical generalization of the different types of dependency relations defined in database theory. Specifically, we define dete…Read more
  •  59
    Rationality and intelligence
    Artificial Intelligence 94 (1-2): 57-77. 1997.
  •  24
    Rationality and Intelligence: A Brief Update
    In Vincent C. Müller (ed.), Fundamental Issues of Artificial Intelligence, Springer. pp. 7-28. 2016.
    The long-term goal of AI is the creation and understanding of intelligence. This requires a notion of intelligence that is precise enough to allow the cumulative development of robust systems and general results. The concept of rational agency has long been considered a leading candidate to fulfill this role. This paper, which updates a much earlier version (Russell, Artif Intell 94:57–77, 1997), reviews the sequence of conceptual shifts leading to a different candidate, bounded optimality, that…Read more