•  2
    Social Choice for AI Alignment: Dealing with Diverse Human Feedback
    with Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Bob M. Jacobs, Nathan Lambert, Milan Mosse, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, and William S. Zwicker
    Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, so that, for example, they refuse to comply with requests for help with committing crimes or with producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potent…Read more
  •  4
    May's Theorem [K. O. May, Econometrica 20 (1952) 680-684] characterizes majority voting on two alternatives as the unique preferential voting method satisfying several simple axioms. Here we show that by adding some desirable axioms to May's axioms, we can uniquely determine how to vote on three alternatives. In particular, we add two axioms stating that the voting method should mitigate spoiler effects and avoid the so-called strong no show paradox. We prove a theorem stating that any preferent…Read more
  •  13
    By classic results in social choice theory, any reasonable preferential voting method sometimes gives individuals an incentive to report an insincere preference. The extent to which different voting methods are more or less resistant to such strategic manipulation has become a key consideration for comparing voting methods. Here we measure resistance to manipulation by whether neural networks of varying sizes can learn to profitably manipulate a given voting method in expectation, given differen…Read more
  •  5
    We propose a Condorcet-consistent voting method that we call Split Cycle. Split Cycle belongs to the small family of known voting methods satisfying the anti-vote-splitting criterion of independence of clones. In this family, only Split Cycle satisfies a new criterion we call immunity to spoilers, which concerns adding candidates to elections, as well as the known criteria of positive involvement and negative involvement, which concern adding voters to elections. Thus, in contrast to other clone…Read more
  •  27
    Stable Voting
    Constitutional Political Economy. forthcoming.
    We propose a new single-winner voting system using ranked ballots: Stable Voting. The motivating principle of Stable Voting is that if a candidate A would win without another candidate B in the election, and A beats B in a head-to-head majority comparison, then A should still win in the election with B included (unless there is another candidate A' who has the same kind of claim to winning, in which case a tiebreaker may choose between such candidates). We call this principle Stability for Winne…Read more
  •  19
    There is a long tradition of fruitful interaction between logic and social choice theory. In recent years, much of this interaction has focused on computer-aided methods such as SAT solving and interactive theorem proving. In this paper, we report on the development of a framework for formalizing voting theory in the Lean theorem prover, which we have applied to verify properties of a recently studied voting method. While previous applications of interactive theorem proving to social choice have…Read more
  •  5
    Games, Norms and Reasons: Logic at the Crossroads (edited book)
    with Johan van Benthem and Amitabha Gupta
    Springer. 2011.
    Games, Norms, and Reasons: Logic at the Crossroads provides an overview of modern logic focusing on its relationships with other disciplines, including new interfaces with rational choice theory, epistemology, game theory and informatics. This book continues a series called "Logic at the Crossroads" whose title reflects a view that the deep insights from the classical phase of mathematical logic can form a harmonious mixture with a new, more ambitious research agenda of understanding and enhanci…Read more
  •  25
    This book offers a state-of-the-art introduction to the basic techniques and results of neighborhood semantics for modal logic. In addition to presenting the relevant technical background, it highlights both the pitfalls and potential uses of neighborhood models – an interesting class of mathematical structures that were originally introduced to provide a semantics for weak systems of modal logic. In addition, the book discusses a broad range of topics, including standard modal logic results ; b…Read more
  •  26
    Choice-Driven Counterfactuals
    Journal of Philosophical Logic 51 (2): 297-345. 2021.
    In this paper, we investigate the semantics and logic of choice-driven counterfactuals, that is, of counterfactuals whose evaluation relies on auxiliary premises about how agents are expected to act, i.e., about their default choice behavior. To do this, we merge one of the most prominent logics of agency in the philosophical literature, namely stit logic, with the well-known logic of counterfactuals due to Stalnaker and Lewis. A key component of our semantics for counterfactuals is to distingui…Read more
  •  21
    Axioms for Defeat in Democratic Elections
    Journal of Theoretical Politics 33 (4). 2021.
    We propose six axioms concerning when one candidate should defeat another in a democratic election involving two or more candidates. Five of the axioms are widely satisfied by known voting procedures. The sixth axiom is a weakening of Kenneth Arrow's famous condition of the Independence of Irrelevant Alternatives (IIA). We call this weakening Coherent IIA. We prove that the five axioms plus Coherent IIA single out a method of determining defeats studied in our recent work: Split Cycle. In partic…Read more
  •  19
    Measuring Violations of Positive Involvement in Voting
    Electronic Proceedings in Theoretical Computer Science 335 189-209. 2021.
    In the context of computational social choice, we study voting methods that assign a set of winners to each profile of voter preferences. A voting method satisfies the property of positive involvement (PI) if for any election in which a candidate x would be among the winners, adding another voter to the election who ranks x first does not cause x to lose. Surprisingly, a number of standard voting methods violate this natural property. In this paper, we investigate different ways of measuring the…Read more
  •  38
    Intention as commitment toward time
    with Marc van Zee, Dragan Doder, Leendert van der Torre, Mehdi Dastani, and Thomas Icard
    Artificial Intelligence 283 (C): 103270. 2020.
  •  6
    The Tree of Knowledge in Action: Towards a Common Perspective
    In Marcus Kracht, Maarten de Rijke, Heinrich Wansing & Michael Zakharyaschev (eds.), Advances in Modal Logic, Csli Publications. pp. 87-106. 1998.
  •  26
    Evidence Logic: A New Look at Neighborhood Structures
    In Marcus Kracht, Maarten de Rijke, Heinrich Wansing & Michael Zakharyaschev (eds.), Advances in Modal Logic, Csli Publications. pp. 97-118. 1998.
  •  34
    Arrow's Decisive Coalitions
    Social Choice and Welfare 54. 2020.
    In his classic monograph, Social Choice and Individual Values, Arrow introduced the notion of a decisive coalition of voters as part of his mathematical framework for social choice theory. The subsequent literature on Arrow’s Impossibility Theorem has shown the importance for social choice theory of reasoning about coalitions of voters with different grades of decisiveness. The goal of this paper is a fine-grained analysis of reasoning about decisive coalitions, formalizing how the concept of a …Read more
  •  34
    Strategic Voting Under Uncertainty About the Voting Method
    Electronic Proceedings in Theoretical Computer Science 297. 2019.
    Much of the theoretical work on strategic voting makes strong assumptions about what voters know about the voting situation. A strategizing voter is typically assumed to know how other voters will vote and to know the rules of the voting method. A growing body of literature explores strategic voting when there is uncertainty about how others will vote. In this paper, we study strategic voting when there is uncertainty about the voting method. We introduce three notions of manipulability for a se…Read more
  • Focusing on Campaigns
    In Ramaswamy Ramanujam, Lawrence Moss & Can Başkent (eds.), Rohit Parikh on Logic, Language and Society, Springer Verlag. 2017.
    One of the important lessons to take away from Rohit Parikh’s impressive body of work is that logicians and computer scientists have much to gain by focusing their attention on the intricacies of political campaigns. Drawing on recent work developing a theory of expressive voting, we study the dynamics of voters’ opinions during an election. In this paper, we develop a model in which the relative importance of the different issues that concern a voter may change either in response to candidates’…Read more
  •  28
    Ecidence Logic: A New Look at Neighborhood Structures
    In Marcus Kracht, Maarten de Rijke, Heinrich Wansing & Michael Zakharyaschev (eds.), Advances in Modal Logic, Csli Publications. pp. 97-118. 1998.
  •  14
    The Tree of Knowledge in Action: Towards a Common Perspective
    In Marcus Kracht, Maarten de Rijke, Heinrich Wansing & Michael Zakharyaschev (eds.), Advances in Modal Logic, Csli Publications. pp. 87-106. 1998.
  •  37
    Action types in stit semantics
    with John Horty
    Review of Symbolic Logic 10 (4): 617-637. 2017.
  •  15
    Temporal Logics of Agency
    Journal of Logic, Language and Information 19 (4): 389-393. 2010.
  •  13
    Logic and intelligent interaction
    with Thomas Ågotnes and Johan Benthem
    Synthese 169 (2): 219-221. 2009.
  •  9
    Merging Frameworks for Interaction
    with Johan Benthem, Jelle Gerbrandy, and Tomohiro Hoshi
    Journal of Philosophical Logic 38 (5): 491-526. 2009.
    A variety of logical frameworks have been developed to study rational agents interacting over time. This paper takes a closer look at one particular interface, between two systems that both address the dynamics of knowledge and information flow. The first is Epistemic Temporal Logic (ETL) which uses linear or branching time models with added epistemic structure induced by agents’ different capabilities for observing events. The second framework is Dynamic Epistemic Logic (DEL) that describes int…Read more
  •  24
    An abstract approach to reasoning about games with mistaken and changing beliefs
    Australasian Journal of Logic 6 (5): 162-181. 2008.
    We do not believe that logic is the sole answer to deep and intriguing questions about human behaviour, but we think that it might be a useful tool in simulating and understanding it to a certain degree and in specifically restricted areas of application. We do not aim to resolve the question of what rational behaviour in games with mistaken and changing beliefs is. Rather, we develop a formal and abstract framework that allows us to reason about behaviour in games with mistaken and changing bel…Read more
  •  17
    First-Order Classical Modal Logic
    Studia Logica 84 (2): 171-210. 2006.
    The paper focuses on extending to the first order case the semantical program for modalities first introduced by Dana Scott and Richard Montague. We focus on the study of neighborhood frames with constant domains and we offer in the first part of the paper a series of new completeness results for salient classical systems of first order modal logic. Among other results we show that it is possible to prove strong completeness results for normal systems without the Barcan Formula (like FOL + K)in …Read more
  •  364
    The combination of logic and game theory provides a fine-grained perspective on information and interaction dynamics, a Theory of Play. In this paper we lay down the main components of such a theory, drawing on recent advances in the logical dynamics of actions, preferences, and information. We then show how this fine-grained perspective has already shed new light on the long-term dynamics of information exchange, as well as on the much-discussed question of extensive game rationality.
  •  77
    The Logic of Knowledge Based Obligation
    with Rohit Parikh and Eva Cogan
    Synthese 149 (2): 311-341. 2006.
    Deontic Logic goes back to Ernst Mally’s 1926 work, Grundgesetze des Sollens: Elemente der Logik des Willens [Mally. E.: 1926, Grundgesetze des Sollens: Elemente der Logik des Willens, Leuschner & Lubensky, Graz], where he presented axioms for the notion ‘p ought to be the case’. Some difficulties were found in Mally’s axioms, and the field has much developed. Logic of Knowledge goes back to Hintikka’s work Knowledge and Belief [Hintikka, J.: 1962, Knowledge and Belief: An Introduction to the Lo…Read more