top of page
제목 없음 - 2022년 4월 22일 02_edited_edited_edited.jpg

Measuring Goodhart’s Law

Article's Summary:
But it’s important to keep track of how well the true objective is being optimized. We’ll focus on a setting that is particularly clean to analyze, in which we have access to the true objective. In addition, best-of-$n$ sampling has reliable performance and is straightforward to analyze mathematically, making it well-suited to empirical studies of Goodhart’s law and related phenomena. Together, these estimators allow us to easily analyze how the true objective varies with the amount of optimization applied to the proxy objective. In the settings we’ve studied so far, such as summarization, we’ve typically been able to reach a KL of around 10 nats using reinforcement learning before the true objective starts to decrease due to Goodhart’s law.

Article's Keywords: 'proxy', 'learning', 'objective', 'sampling', 'goodharts', 'reinforcement', 'samples', 'law', 'kl', 'bestofn', 'measuring', 'true'

Screenshot 2022-04-22 at 12.19.13 PM.png

DALL·E 2

Article's Summary:
Phased Deployment Based on LearningWe’ve been working with external experts and are previewing DALL·E 2 to a limited number of trusted users who will help us learn about the technology’s capabilities and limitations. We plan to invite more people to preview this research over time as we learn and iteratively improve our safety system.

Article's Keywords: 'research', 'preview', 'technologys', 'previewing', 'trusted', 'safety', 'learn', 'users', 'working', 'system', 'dalle'

Screenshot 2022-04-22 at 12.23.16 PM.png
bottom of page