Learning to Reason with LLMs

Learning to Reason with LLMs - OpenAI

settembre 12, 2024

We are introducing OpenAI o1, a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding to the user.

OpenAI o1 ranks in the 89th percentile on competitive programming questions (Codeforces), places among the top 500 students in the US in a qualifier for the USA Math Olympiad (AIME), and exceeds human PhD-level accuracy on a benchmark of physics, biology, and chemistry problems (GPQA). While the work needed to make this new model as easy to use as current models is still ongoing, we are releasing an early version of this model, OpenAI o1-preview, for immediate use in ChatGPT and to trusted API users(opens in a new window).

Cerca nel blog

My Cookie Mix

Learning to Reason with LLMs - OpenAI

We are introducing OpenAI o1, a new large language model trained with reinforcement learning to perform complex reasoning. o1 thinks before it answers—it can produce a long internal chain of thought before responding to the user.

Commenti

Posta un commento

Post popolari in questo blog

Dove trovare raccolte di dati (dataset) utilizzabili gratuitamente

Alternative a Yahoo Finance per scaricare i dati di borsa

Data visualization (Cosa si intende per data visualization?)