- Resource‐Rational Virtual Bargaining for Moral Judgment: Toward a Probabilistic Cognitive Model. Topics in Cognitive Science 17 (3): 713-738. 2025. Recent theoretical work has argued that moral psychology can be understood through the lens of “resource rational contractualism.” The view posits that the best way of making a decision that affects other people is to get everyone together to negotiate under idealized conditions. The outcome of that negotiation is an arrangement (or “contract”) that would lead to mutual benefit. However, this ideal is seldom (if ever) practical given the resource demands (time, information, computational process…
- Probabilistic programming versus meta-learning as models of cognition. Behavioral and Brain Sciences 47. 2024. We summarize the recent progress made by probabilistic programming as a unifying formalism for the probabilistic, symbolic, and data-driven aspects of human cognition. We highlight differences with meta-learning in flexibility, statistical assumptions, and inferences about cognition. We suggest that the meta-learning approach could be further strengthened by considering Connectionist and Bayesian approaches, rather than exclusively one or the other.
- Beyond Preferences in AI Alignment. Philosophical Studies 182 (7): 1813-1863. 2025. The dominant practice of AI alignment assumes (1) that preferences are an adequate representation of human values, (2) that human rationality can be understood in terms of maximizing the satisfaction of preferences, and (3) that AI systems should be aligned with the preferences of one or more humans to ensure that they behave safely and in accordance with our values. Whether implicitly followed or explicitly endorsed, these commitments constitute what we term a preferentist approach to AI alignm…
Massachusetts Institute of Technology
Department of Electrical Engineering & Computer Science
PhD, 2025
Singapore, Singapore
Areas of Specialization
| Philosophy of Artificial Intelligence |
| Philosophy of Cognitive Science |
| Value Theory |
| Decision Theory |
| Game Theory |
| Machine Ethics |
Areas of Interest
| Value Theory |
| Philosophy of Artificial Intelligence |
| Philosophy of Cognitive Science |
| Decision Theory |
| Game Theory |
| Machine Ethics |