•  1160
    AI Welfare is Bullshit
    with Yunze Xiao, Gordon Dai, Shahan Ali Memon, Maarten Sap, and Mona Diab
    International Conference on Machine Learning. forthcoming.
    Recent proposals urge AI labs to prepare for “AI welfare” under uncertainty about whether AI systems have morally relevant inner states. We do not argue for or against the possibility of AI welfare. Instead, we argue that current AI welfare assessment fails for two linked structural reasons absent from other evaluation targets. First, AI welfare indicators are co-engineered with the systems they evaluate: ordinary development decisions that shape model behavior can also manufacture or suppress w…Read more