Artwork

Contenu fourni par Whitehat SEO and Whitehat Inbound Marketing Agency. Tout le contenu du podcast, y compris les épisodes, les graphiques et les descriptions de podcast, est téléchargé et fourni directement par Whitehat SEO and Whitehat Inbound Marketing Agency ou son partenaire de plateforme de podcast. Si vous pensez que quelqu'un utilise votre œuvre protégée sans votre autorisation, vous pouvez suivre le processus décrit ici https://fr.player.fm/legal.
Player FM - Application Podcast
Mettez-vous hors ligne avec l'application Player FM !

Unpacking OpenAI's Latest Reasoning Models

11:32
 
Partager
 

Manage episode 442461526 series 2330470
Contenu fourni par Whitehat SEO and Whitehat Inbound Marketing Agency. Tout le contenu du podcast, y compris les épisodes, les graphiques et les descriptions de podcast, est téléchargé et fourni directement par Whitehat SEO and Whitehat Inbound Marketing Agency ou son partenaire de plateforme de podcast. Si vous pensez que quelqu'un utilise votre œuvre protégée sans votre autorisation, vous pouvez suivre le processus décrit ici https://fr.player.fm/legal.
Comparing the reasoning capabilities of two new OpenAI models, o1-mini and o1-preview, through a series of tests. The first test involved a classic children's game, the Tower of London, which assesses the ability to plan and reason about future states. Both models struggled with the game's rules, suggesting they still lack fundamental reasoning skills. The second test involved a hypothetical business scenario, where the models were tasked with analyzing risks, opportunities, and strategic paths forward based on provided information. The models performed poorly, often simply regurgitating information without providing valuable insights or critical analysis. Finally, the video concluded that, despite the initial hype surrounding the models, they don’t represent a significant leap in reasoning capabilities compared to older models like GPT-3. Although the authors acknowledge that the models are still under development, they express disappointment that they are not yet able to perform complex reasoning tasks in a way that would be useful for real-world applications.
  continue reading

90 episodes

Artwork
iconPartager
 
Manage episode 442461526 series 2330470
Contenu fourni par Whitehat SEO and Whitehat Inbound Marketing Agency. Tout le contenu du podcast, y compris les épisodes, les graphiques et les descriptions de podcast, est téléchargé et fourni directement par Whitehat SEO and Whitehat Inbound Marketing Agency ou son partenaire de plateforme de podcast. Si vous pensez que quelqu'un utilise votre œuvre protégée sans votre autorisation, vous pouvez suivre le processus décrit ici https://fr.player.fm/legal.
Comparing the reasoning capabilities of two new OpenAI models, o1-mini and o1-preview, through a series of tests. The first test involved a classic children's game, the Tower of London, which assesses the ability to plan and reason about future states. Both models struggled with the game's rules, suggesting they still lack fundamental reasoning skills. The second test involved a hypothetical business scenario, where the models were tasked with analyzing risks, opportunities, and strategic paths forward based on provided information. The models performed poorly, often simply regurgitating information without providing valuable insights or critical analysis. Finally, the video concluded that, despite the initial hype surrounding the models, they don’t represent a significant leap in reasoning capabilities compared to older models like GPT-3. Although the authors acknowledge that the models are still under development, they express disappointment that they are not yet able to perform complex reasoning tasks in a way that would be useful for real-world applications.
  continue reading

90 episodes

Semua episod

×
 
Loading …

Bienvenue sur Lecteur FM!

Lecteur FM recherche sur Internet des podcasts de haute qualité que vous pourrez apprécier dès maintenant. C'est la meilleure application de podcast et fonctionne sur Android, iPhone et le Web. Inscrivez-vous pour synchroniser les abonnements sur tous les appareils.

 

Guide de référence rapide