top of page
source: https://openai.com/
Measuring Goodhart’s Law
Article's Summary:
But it’s important to keep track of how well the true objective is being optimized. We’ll focus on a setting that is particularly clean to analyze, in which we have access to the true objective. In addition, best-of-$n$ sampling has reliable performance and is straightforward to analyze mathematically, making it well-suited to empirical studies of Goodhart’s law and related phenomena. Together, these estimators allow us to easily analyze how the true objective varies with the amount of optimization applied to the proxy objective. In the settings we’ve studied so far, such as summarization, we’ve typically been able to reach a KL of around 10 nats using reinforcement learning before the true objective starts to decrease due to Goodhart’s law.
Article's Keywords: 'proxy', 'learning', 'objective', 'sampling', 'goodharts', 'reinforcement', 'samples', 'law', 'kl', 'bestofn', 'measuring', 'true'
DALL·E 2
Article's Summary:
Phased Deployment Based on LearningWe’ve been working with external experts and are previewing DALL·E 2 to a limited number of trusted users who will help us learn about the technology’s capabilities and limitations. We plan to invite more people to preview this research over time as we learn and iteratively improve our safety system.
Article's Keywords: 'research', 'preview', 'technologys', 'previewing', 'trusted', 'safety', 'learn', 'users', 'working', 'system', 'dalle'
source: https://openai.com/dall-e-2/
bottom of page