•  127
    Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback
    with Vincent Conitzer, Rachel Freedman, Jobst Heitzig, Wesley H. Holliday, Nathan Lambert, Milan Mosse, Eric Pacuit, Stuart Russell, Hailey Schoelkopf, Emanuel Tewolde, and William S. Zwicker
    Proceedings of the 41st International Conference on Machine Learning 41 9346-9360. 2024.
    Foundation models such as GPT-4 are fine-tuned to avoid unsafe or otherwise problematic behavior, such as helping to commit crimes or producing racist text. One approach to fine-tuning, called reinforcement learning from human feedback, learns from humans' expressed preferences over multiple outputs. Another approach is constitutional AI, in which the input from humans is a list of high-level principles. But how do we deal with potentially diverging input from humans? How can we aggregate the in…
  •  616
    Should We Vote in Non-Deterministic Elections?
    with Jobst Heitzig
    Philosophies 9 (4): 107. 2024.
    This article investigates reasons to participate in non-deterministic elections, where the outcomes incorporate elements of chance beyond mere tie-breaking. The background context situates this inquiry within democratic theory, specifically non-deterministic voting systems, which promise to re-evaluate fairness and power distribution among voting blocs. This study aims to explore the normative implications of such electoral systems and their impact on our moral duty to vote. We analyze instrumen…