Our project aims to improve how AI models learn from human feedback. Current methods assume that human preferences can be reduced to a single “reward” value, but research shows this isn’t always true. We will investigate whether an AI algorithm can learn from human preferences without relying on rewards. If so, we’ll design and test algorithms […]