Program equivalence and context-free grammars
Barry K. Rosen
SWAT 1972
Human feedback on conversations with language models is central to how these systems learn about the world, improve their capabilities and are steered towards desirable and safe behaviours. However, this feedback is mostly collected by frontier artificial intelligence labs and kept behind closed doors. Here we bring together interdisciplinary experts to assess the opportunities and challenges to realizing an open ecosystem of human feedback for artificial intelligence. We first look for successful practices in the peer-production, open-source and citizen-science communities. We then characterize the main challenges for open human feedback. For each, we survey current approaches and offer recommendations. We end by envisioning the components needed to underpin a sustainable and open human feedback ecosystem. In the centre of this ecosystem are mutually beneficial feedback loops, between users and specialized models, incentivizing a diverse stakeholder community of model trainers and feedback providers to support a general open feedback pool.