Introducing GPT-5

Sam Altman, Greg Brockman, Sebastien Bubeck, Mark Chen, Yann Dubois, Brian Fioca, Adi Ganesh, Oliver Godement, Saachi Jain, Christina Kaplan, Christina Kim, Elaine Ya Le, Felipe Millon, Michelle Pokrass, Jakub Pachocki, Max Schwarzer, Rennie Song, Ruochen Wang introduce and demo GPT-5.

August 7, 2025 | 77:18

Table of Contents

0:00-8:35
8:35-17:27
17:27-22:59
22:59-29:57
30:04-34:30
34:38-40:51
41:05-49:16
49:24-54:43
54:43-1:05:10
1:05:17-1:11:29
1:11:36-1:15:21
1:15:21-1:17:26

🚀 What Makes GPT-5 a "Major Upgrade" Over GPT-4?

Historic AI Milestone Announcement

The Numbers That Tell the Story:

  1. 32 months ago: ChatGPT launched to 1 million users in the first week
  2. Today: 700 million people use ChatGPT every week
  3. The leap: From impressive to essential AI tool for work, learning, advice, and creation
Sam Altman
Today, finally, we're launching GPT-5. GPT-5 is a major upgrade over GPT-4 and a significant step along our path to AGI.
OpenAI | Co-founder & CEO
Sam Altman
We think you will love using GPT-5 much more than any previous AI. It is useful, it is smart, it is fast, and it's intuitive.
OpenAI | Co-founder & CEO

The Evolution Analogy:

  • GPT-3: Like talking to a high school student - "flashes of brilliance, lots of annoyance"
  • GPT-4: Like talking to a college student - "real intelligence, real utility"
  • GPT-5: Like talking to a PhD-level expert in any field, on demand

Revolutionary "Software on Demand" Concept:

  • Write entire computer programs from scratch for any purpose
  • Plan events, send invitations, order supplies automatically
  • Provide healthcare guidance and decision support
  • Deliver expert-level education on any topic
Sam Altman
This idea of software on demand is going to be one of the defining characteristics of the GPT-5 era. This is an incredible superpower on demand that would have been unimaginable at any previous time in history.
OpenAI | Co-founder & CEO
Sam Altman
You get access to an entire team of PhD-level experts in your pocket helping you with whatever you want to do. And anyone, pretty soon, will be able to do more than anyone in history could.
OpenAI | Co-founder & CEO

Timestamp: [0:00-2:55]

🧠 How Does GPT-5 "Think Just the Perfect Amount" for Every Answer?

The Reasoning Paradigm Revolution

The Breakthrough Technology:

  1. Automatic Thinking: Models that pause to think before responding
  2. Perfect Balance: Eliminates choice between fast vs. thoughtful responses
  3. Adaptive Intelligence: Thinks exactly the right amount for each specific question
Mark Chen
Over the past few years, OpenAI has spearheaded the reasoning paradigm. These are models which pause to think before delivering more intelligent responses. Reasoning is at the heart of our AGI program.
OpenAI | Chief Research Officer

What Powers This Intelligence:

  • Reasoning at the Core: Foundation technology behind ChatGPT Agent and Deep Research
  • Universal Application: Excels in coding, writing, learning, health, math, physics, and law
  • Expert-Level Knowledge: Deep reasoning capabilities across all domains
Mark Chen
Until now, our users have had to pick between the fast responses of standard GPTs or the slow, more thoughtful responses from our reasoning models. But GPT-5 eliminates this choice. It aims to think just the perfect amount to give you the perfect answer.
OpenAI | Chief Research Officer

The Engineering Achievement:

  • Most powerful reasoning model ever shipped
  • Most reliable and robust performance
  • Fastest intelligent responses without compromising quality
  • Smartest model OpenAI has ever created

Timestamp: [2:55-4:46]

📊 What Benchmarks Prove GPT-5 is By Far OpenAI's Smartest Model Ever?

Performance Metrics & Real-World Excellence

Coding Superiority:

  1. SWE-bench: New high score on real software engineering tasks
  2. Aider Polyglot: Complex functionality across multiple programming languages
  3. Market Leadership: Best coding model available today
Max Schwarzer
We think GPT-5 is by far our smartest model ever.
OpenAI | Post Training Lead

Multimodal & Mathematical Reasoning:

  • MMMU: New high score, outperforming human experts on visual reasoning
  • AIME 2025: Exceptional performance on the American Invitational Mathematics Examination, a qualifying exam for the USA Mathematical Olympiad
  • Cross-Domain Excellence: Superior performance across all academic evaluations
Max Schwarzer
Language models historically have been plagued by hallucinations, factual errors that make it hard to rely on their outputs for actually important tasks. For GPT-5, we made improving factuality, especially on open-ended or complex questions, a priority.
OpenAI | Post Training Lead

Revolutionary Reliability Improvements:

Factuality Breakthrough:

  • Priority Focus: Improving accuracy on open-ended and complex questions
  • New Evaluation Methods: Custom-built tests to track factual reliability
  • Hallucination Reduction: Most reliable and factual model ever created

Health & Real-World Applications:

  • Health Excellence: Best performance on health-related questions
  • Practical Impact: Addresses how people actually use AI in daily life
  • Trust Factor: Aims to make outputs dependable enough for important tasks

Timestamp: [4:46-7:10]

🌍 How is GPT-5 Bringing Frontier Intelligence to All Users Starting Today?

Universal Access & Rollout Strategy

Historic Accessibility:

  1. First Time Ever: Most advanced model available to free tier users
  2. Immediate Rollout: Available today for Free, Plus, Pro, and Team users
  3. Next Week: Enterprise and EDU access begins
Rennie Song
The best part is that we're bringing this frontier intelligence to all users. For the first time, our most advanced model will be available to the free tier.
OpenAI | Engineering Lead

Tiered Usage Structure:

Free Users:

  • Start with GPT-5: Access to the most advanced model first
  • GPT-5 Mini Transition: Smaller but highly capable model after limit
  • Performance Note: GPT-5 Mini outperforms o3 on many dimensions
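
The free-tier behavior described above (start on GPT-5, then move to the smaller model once a limit is reached) can be sketched as a simple fallback rule. This is only an illustration: the `PLAN_LIMITS` table, the limit value of 10, and the model labels are hypothetical placeholders, not figures or identifiers given in the presentation.

```python
from typing import Dict, Optional

# Hypothetical per-plan caps on full-model messages; None means "no cap".
# The number 10 is a placeholder, not a real ChatGPT limit.
PLAN_LIMITS: Dict[str, Optional[int]] = {"free": 10, "pro": None}


def pick_model(plan: str, messages_used: int) -> str:
    """Return an illustrative model label for the next message."""
    limit = PLAN_LIMITS[plan]
    if limit is None or messages_used < limit:
        return "gpt-5"
    # Past the cap, fall back to the smaller model instead of blocking the user.
    return "gpt-5-mini"


if __name__ == "__main__":
    for used in (0, 9, 10):
        print(used, pick_model("free", used))
```

The point of the sketch is the design choice the summary describes: hitting the limit degrades to a smaller model rather than cutting off access.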

Plus Users:

  • Significantly Higher Usage: More access than free tier
  • Full GPT-5 Access: Extended capabilities and limits

Pro Subscribers:

  • Unlimited GPT-5: No usage restrictions
  • GPT-5 Pro Extended Thinking: Enhanced depth and reliability for complex tasks

Enterprise & Organization Benefits:

  • Default Model: GPT-5 as standard for everyday work
  • Generous Rate Limits: Entire organizations can adopt GPT-5
  • Full Tool Integration: All existing ChatGPT features work seamlessly

Complete Tool Ecosystem:

  • Search Integration: Enhanced with GPT-5 intelligence
  • File & Image Upload: Improved processing capabilities
  • Data Analysis: Python integration with advanced reasoning
  • Canvas & Memory: All tools enhanced by GPT-5
  • Custom Instructions: Personalization maintained across upgrades

Timestamp: [7:16-8:35]

💎 Key Insights from [0:00-8:35]

Essential Insights:

  1. Historic Milestone: GPT-5 represents the transition from AI as a tool to AI as a team of PhD-level experts, marking a significant step toward AGI
  2. Universal Intelligence: The "software on demand" paradigm enables anyone to access capabilities that were previously impossible in human history
  3. Democratic Access: For the first time, cutting-edge AI intelligence is available to free users, democratizing access to frontier technology

Actionable Insights:

  • Immediate Availability: Users can start accessing GPT-5 today across all tiers, with free users getting unprecedented access to advanced AI
  • Enterprise Adoption: Organizations can confidently deploy GPT-5 as their default model with generous rate limits and full tool integration
  • Reasoning Revolution: The elimination of speed vs. intelligence trade-offs means optimal responses for every query without user decision-making

Timestamp: [0:00-8:35]

📚 References from [0:00-8:35]

People Mentioned:

  • Sam Altman - OpenAI CEO announcing GPT-5 launch and vision
  • Mark Chen - Chief Research Officer explaining reasoning paradigm technology
  • Max Schwarzer - Post-training team lead presenting benchmark performance
  • Rennie Song - Engineering team member detailing rollout and availability

Companies & Products:

  • OpenAI - Company launching GPT-5 and ChatGPT platform
  • ChatGPT - The AI platform receiving GPT-5 integration
  • ChatGPT Agent - Tool powered by reasoning paradigm technology
  • Deep Research - Feature utilizing the reasoning capabilities

Academic Evaluations:

  • SWE-bench - Software engineering task evaluation benchmark
  • MMMU - Multimodal reasoning assessment
  • AIME 2025 - American Invitational Mathematics Examination
  • Aider Polyglot - Multi-programming language implementation test

Timestamp: [0:00-8:35]

🧪 How Does GPT-5 Turn Complex Physics Into Interactive Learning in Real-Time?

Elaine Ya Le
Reasoning, ChatGPT's ability to think deeply through complex problems, is now built into GPT-5. It will automatically think whenever needed, delivering a more comprehensive, accurate, and detailed answer to you.
OpenAI | AI Researcher

Live Physics Education & Code Generation Demo

The Bernoulli Effect Challenge:

  1. Real-World Scenario: Middle school physics homework about why airplanes are shaped the way they are
  2. Immediate Response: GPT-5 explains the Bernoulli phenomenon - faster-moving fluid has lower pressure, slower-moving fluid has higher pressure
  3. Enhanced Request: Create a moving SVG demonstration in Canvas tool
Elaine Ya Le
What's really nice is that you don't need to remember to turn on thinking each time; GPT-5 will do it for you automatically whenever the task benefits from deeper reasoning.
OpenAI | AI Researcher

Automatic Thinking in Action:

  • Simple Questions: No extra thinking needed, immediate high-quality answers
  • Complex Tasks: Automatically engages deeper reasoning for comprehensive responses
  • Transparent Process: Users can expand to see the model's thought process under the hood

The Coding Revolution:

What GPT-5 Accomplished in 2 Minutes:

  • 400 lines of front-end code generated automatically
  • Interactive SVG visualization with adjustable parameters
  • Complete functionality: Airspeed controls, angle of attack adjustments, real-time pressure changes
  • Physics accuracy: Ensured correct Bernoulli principle implementation
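
The relationship the demo visualizes (faster flow, lower pressure) follows from Bernoulli's equation for steady, incompressible flow, p + ½ρv² = constant along a streamline. The sketch below is not the code GPT-5 generated in the demo; it is a minimal worked example, and the air density and the two airspeeds are assumed values chosen only for illustration. It also leaves out angle of attack, which the demo exposes as a control.

```python
AIR_DENSITY = 1.225  # kg/m^3, assumed sea-level air density


def dynamic_pressure(speed_m_s: float, density: float = AIR_DENSITY) -> float:
    """Return the dynamic pressure 0.5 * rho * v^2 in pascals."""
    return 0.5 * density * speed_m_s ** 2


def pressure_difference(speed_above: float, speed_below: float,
                        density: float = AIR_DENSITY) -> float:
    """Static pressure below the wing minus static pressure above it (Pa).

    From p + 0.5*rho*v^2 = constant:
    p_below - p_above = 0.5 * rho * (v_above^2 - v_below^2).
    """
    return dynamic_pressure(speed_above, density) - dynamic_pressure(speed_below, density)


if __name__ == "__main__":
    # Hypothetical airspeeds over and under the wing, in m/s.
    delta_p = pressure_difference(speed_above=70.0, speed_below=60.0)
    print(f"Pressure difference: {delta_p:.2f} Pa")
```

A positive result means higher pressure under the wing than over it, which is the upward push the interactive visualization is meant to convey.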

Historical Comparison:

  • 3 Years Ago (Original ChatGPT): Christina needed about 1 week to get the original React front end running
  • Today (GPT-5): Same complexity achieved in 2 minutes
  • Technology Evolution: From "Chat with GPT" to sophisticated automatic reasoning
Christina Kim
It wasn't even called ChatGPT then. I think it was called Chat with GPT. It took me quite a bit of time to get the React app up... Honestly, maybe embarrassing to me, it took a week.
OpenAI | AI Researcher

Behind-the-Scenes Intelligence:

GPT-5's thinking process revealed:

  • Recognizing need for HTML code creation
  • Selecting appropriate tools (React, Tailwind)
  • Ensuring physics accuracy
  • Validating Bernoulli principle understanding

Timestamp: [8:35-14:54]

🎯 How Did ChatGPT Evolve From "As an AI Model, I Can't..." to Human-Like Intelligence?

The Journey from First Demo to GPT-5

The Original ChatGPT Story:

Early Uncertainty (3 Years Ago):

  • Original Name: "Chat with GPT" (not even called ChatGPT)
  • Development Challenges: Front-end coding took Christina a full week
  • Use Case Confusion: Team wasn't sure how people would actually use it
  • Product Direction: Debating whether to release something more specific to certain use cases
Christina Kim
At the time we weren't really sure how people would actually use it and what use cases were important... It's really cool now that we have a much better understanding of how people actually want to work with chat.
OpenAI | AI Researcher

Personality Evolution:

  • Original Behavior: Responses routinely opened with "As an AI model, I can't do..."
  • Modern Transformation: Much more human-like and natural interactions
  • Understanding Growth: Better comprehension of how people want to work with chat interfaces
Christina Kim
I don't know if people remember when the first version of ChatGPT would always start, 'As an AI model, I can't do...' It's so great to see how far we've come from that personality.
OpenAI | AI Researcher

The Revolutionary Leap:

Time Comparison:

  • 3 Years Ago: 1 week to build basic functionality
  • Today: 2 minutes for 400 lines of interactive code

Capability Evolution:

  1. From Limitations to Enablement: No longer starting with what it can't do
  2. From Generic to Personal: Responses tailored to individual context and nuance
  3. From Tool to Partner: Collaborative relationship rather than instruction-following
  4. From Specific to Universal: Optimized for all major use cases including coding

Educational Impact:

  • Universal Learning: Makes any subject (math, physics, chemistry, biology) approachable
  • Interactive Engagement: Brings hardcore concepts to life in moments
  • Personalized Education: Adapts to individual learning needs and styles
  • Immediate Application: From concept to working demonstration instantly
Elaine Ya Le
GPT-5 can just bring any hardcore concept to life in moments. Imagine you can use this for anything that you're interested in. Whether it's math, physics, chemistry, or biology. GPT-5 just makes learning so much more approachable and enjoyable.
OpenAI | Engineering Team

Timestamp: [8:35-17:27]

✍️ How Does AI Writing Evolve From "Template" to "Emotionally Resonant"?

Revolutionary Writing Quality & Emotional Intelligence

Christina Kim
Writing is one of the most common use cases people have been using ChatGPT for. And I'm excited to say with GPT-5, we've improved the writing quality significantly. It's a much more effective partner.
OpenAI | AI Researcher

The Eulogy Writing Challenge:

Task: Write a heartfelt, heartwarming, yet hopeful eulogy for deprecated ChatGPT models

GPT-4o Performance Analysis:

Generic Template Approach:

  • Opening: "Today, as we prepare to welcome GPT-5 into the world, we gather to bid a heartfelt farewell to the models that came before"
  • Problem: Decent but formulaic start
  • Critical Weakness: "Your words reached across the globe, building connections where there had been none" - generic, could be about anything, feels templated

GPT-5 Writing Revolution:

Sophisticated Rhythm & Beat:

  • Opening: "Friends, colleagues, curious strangers who became regulars"
  • Immediate Impact: More rhythm and musicality in prose structure

Personal & Nuanced Content:

  • Standout Line: "These models helped millions write first lines, last lines, bridge language gaps, pass tests, argue better, soften emails, and say things they couldn't quite say alone"
  • Why It Works: Specific, personal, captures real human experiences
  • Emotional Intelligence: Gets the nuance of the situation exactly right
Christina Kim
I think I really like this line because it shows that it's not just a templated response, and it's actually quite personal and it gets the nuance of the situation right.
OpenAI | AI Researcher

The Fundamental Shift:

  1. From Template to Authentic: Less AI-like, more genuine human connection
  2. Enhanced Emotional Resonance: Responses that truly connect with people
  3. High IQ + High EQ: Combines intelligence with emotional understanding
  4. Writing Partnership: Effective collaboration tool for drafts, emails, and stories

Practical Applications:

  • Email Enhancement: More natural, emotionally appropriate tone
  • Creative Writing: Authentic voice and style development
  • Professional Communication: Nuanced understanding of context and audience
  • Personal Expression: Helping users say what they couldn't express alone
Christina Kim
With GPT-5, the responses feel less like AI and more like you're chatting with your high IQ and EQ friend.
OpenAI | AI Researcher

Timestamp: [15:00-17:27]

💎 Key Insights from [8:35-17:27]

Essential Insights:

  1. Automatic Intelligence: GPT-5 eliminates the need to manually activate thinking modes - it automatically determines when deeper reasoning is needed and applies it seamlessly
  2. Development Speed Revolution: The improvement from week-long coding to 2-minute solutions represents a fundamental shift in how humans can interact with technology
  3. Emotional Intelligence Breakthrough: Writing quality improvements show AI moving beyond technical capability to genuine emotional resonance and human-like communication

Actionable Insights:

  • Educational Applications: Teachers and students can instantly create interactive demonstrations for any complex concept, making learning more engaging and accessible
  • Content Creation: Writers can leverage GPT-5's enhanced emotional intelligence for more authentic, nuanced, and personally resonant content across all formats
  • Development Workflow: The automatic thinking feature means users can focus on goals rather than prompt engineering, trusting GPT-5 to apply appropriate reasoning depth

Timestamp: [8:35-17:27]

📚 References from [8:35-17:27]

People Mentioned:

  • Elaine Ya Le - OpenAI team member demonstrating physics learning and code generation capabilities
  • Christina Kim - Original ChatGPT team member since day one, showcasing writing improvements
  • Mark Chen - Chief Research Officer introducing the live demonstrations

Technologies & Tools:

  • Canvas Tool - ChatGPT's integrated environment for creating and editing content including SVG visualizations
  • React - JavaScript framework automatically selected by GPT-5 for front-end development
  • Tailwind - CSS framework utilized by GPT-5 for styling interactive demonstrations
  • SVG (Scalable Vector Graphics) - Technology used for creating interactive physics visualizations
  • GPT-5 Thinking Model - Enhanced reasoning mode available through model picker for paid users

Concepts & Frameworks:

  • Bernoulli Effect/Principle - Physics concept explaining relationship between fluid speed and pressure, demonstrated through airplane wing shape
  • Multistep Reasoning - GPT-5's built-in capability to think deeply through complex problems automatically
  • Automatic Thinking - Revolutionary feature that determines when deeper reasoning is needed without user intervention
  • Interactive Learning - Educational approach combining explanation with hands-on demonstration

Timestamp: [8:35-17:27]

💻 How Does GPT-5 Turn "Vibe Coding" Into Full-Featured Web Apps?

Revolutionary Code Generation for Non-Programmers

Yann Dubois
GPT-5 is clearly our best coding model yet. It will help everyone, even those who do not know how to write code, to bring their ideas to life.
OpenAI | Member of Technical Staff

The Personal Challenge:

Goal: Build a web app for Yann's partner to learn French and communicate with his family

The Complex Request:

  1. Beautiful & Interactive Interface: Highly engaging web application
  2. Progress Tracking: Daily learning progress monitoring
  3. Multiple Activities: Flashcards and interactive quizzes
  4. Custom Educational Game: Snake game with French twist - mouse eating cheese
  5. Audio Integration: Voice-over pronunciation for each French word collected

The "Vibe Coding" Revolution:

Multiple Design Variations:

  • Diversity by Design: GPT-5 creates different visual approaches to the same request
  • User Choice: Generate multiple tabs to compare different design aesthetics
  • Instant Iteration: Easy to request changes and improvements

Generated Features:

  • "Midnight in Paris" Theme: Romantic, engaging visual design
  • Functional Tabs: Flashcards, Quiz, Mouse and Cheese game
  • Real-time Progress: Automatically updating progress bars
  • Audio Pronunciation: French words spoken when cheese is collected
  • Interactive Elements: Working game mechanics and quiz functionality

The Technical Achievement:

Code Generation Metrics:

  • 240+ lines of code generated automatically
  • Complete functionality including game logic, UI, and audio
  • Multiple implementations created simultaneously
  • Zero programming knowledge required from user

Accessibility Revolution:

  • Universal Access: "Help everyone, even those who do not know how to write code, to bring their ideas to life"
  • Immediate Results: From concept to working application in minutes
  • Error Tolerance: Built-in ability to fix and iterate on rough edges
Yann Dubois
GPT-5 really opens up a whole new world of vibe coding... GPT-5 really brings the power of beautiful and effective code to everyone.
OpenAI | Member of Technical Staff

The Personal Touch:

Educational Game Details:

  • Cultural Adaptation: Snake → Mouse, Apples → Cheese (French cultural reference)
  • Learning Integration: New French word with each cheese collection
  • Pronunciation Practice: Audio playback for language learning
  • Progress Tracking: Gamified learning progress across all activities

Timestamp: [17:27-22:59]

🎮 How Does AI Handle Game Physics + Audio + Progress Tracking?

Complex Game Development Made Simple

The Game Design Challenge:

Creative Requirements:

  1. Cultural Adaptation: Transform classic Snake game with French cultural elements
  2. Educational Integration: Each gameplay action triggers learning content
  3. Audio Pronunciation: Real-time French word pronunciation on cheese collection
  4. Progress Tracking: Integration with overall learning progress system

Technical Complexity Simplified:

What GPT-5 Automatically Handled:

  • Game Physics: Mouse movement, collision detection, cheese generation
  • Audio Integration: Web Speech API for French pronunciation
  • UI Design: Game canvas, controls, and visual feedback
  • Data Integration: Progress bar updates across different app sections
  • Responsive Design: Cross-device compatibility and styling
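
To make the "game physics" item concrete, here is a minimal sketch of one tick of a grid-based snake-style game: move the mouse, detect wall and self collisions, and respawn the cheese when it is eaten. It is not the code from the demo (which ran in the browser); the grid size, the `step` function, and the `WORDS` list are assumptions made for illustration. In a browser build, the returned word is what would be handed to speech synthesis for pronunciation.

```python
import random

GRID_W, GRID_H = 12, 8                      # assumed board size in cells
WORDS = ["fromage", "souris", "bonjour"]    # assumed vocabulary list


def step(body, direction, cheese, rng):
    """Advance the game by one tick.

    body: list of (x, y) cells, head first.
    Returns (new_body, new_cheese, word_or_None, alive).
    """
    head_x, head_y = body[0]
    dx, dy = direction
    new_head = (head_x + dx, head_y + dy)

    hit_wall = not (0 <= new_head[0] < GRID_W and 0 <= new_head[1] < GRID_H)
    # The last cell moves away on a normal step, so it cannot be hit.
    hit_self = new_head in body[:-1]
    if hit_wall or hit_self:
        return body, cheese, None, False

    if new_head == cheese:
        new_body = [new_head] + body        # grow by keeping the tail
        free = [(x, y) for x in range(GRID_W) for y in range(GRID_H)
                if (x, y) not in new_body]
        new_cheese = rng.choice(free) if free else None
        return new_body, new_cheese, rng.choice(WORDS), True

    return [new_head] + body[:-1], cheese, None, True


if __name__ == "__main__":
    rng = random.Random(0)
    body, cheese, word, alive = step([(2, 2), (1, 2)], (1, 0), (3, 2), rng)
    print(body, cheese, word, alive)
```

Keeping the tick as a pure function of its inputs makes the collision rules easy to test separately from drawing and audio.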

The Development Reality:

  • Traditional Approach: Would require game development expertise, audio programming, UI/UX design
  • GPT-5 Approach: Single prompt generates complete, functional game
  • Iteration Speed: Multiple design variations created simultaneously
  • Error Handling: Built-in ability to request fixes and improvements

The Diversity Advantage:

Multiple Implementation Styles:

  • Visual Variations: Different color schemes, layouts, and design aesthetics
  • Functional Differences: Various game mechanics and interaction patterns
  • Style Preferences: GPT-5's apparent preference for purple color schemes
  • User Selection: Easy comparison and choice between generated options

Educational Innovation:

Language Learning Integration:

  • Contextual Learning: French words presented during engaging gameplay
  • Pronunciation Practice: Immediate audio feedback for correct pronunciation
  • Progress Gamification: Learning achievements tracked across all activities
  • Cultural Context: French-themed elements (mouse/cheese) enhance cultural learning
Yann Dubois
That's also something to note is that GPT-5 really likes purple, so you will see a lot of that.
OpenAI | Member of Technical Staff

Timestamp: [17:27-22:59]

🎨 How Does GPT-5's Design Diversity Enable "Vibe Coding" for Everyone?

The Art of Computational Creativity

The Multi-Tab Strategy:

Design Philosophy:

  1. Creative Exploration: Generate multiple design approaches simultaneously
  2. User Empowerment: Choose preferred aesthetic without technical knowledge
  3. Rapid Iteration: Easy to request changes and improvements
  4. Style Discovery: Uncover design preferences through comparison

Generated Application Features:

Complete Learning Platform:

  • "Midnight in Paris" Theme: Romantic, culturally relevant branding
  • Flashcard System: Interactive vocabulary learning with reveal functionality
  • Quiz Module: Multiple choice questions with immediate feedback
  • Progress Tracking: Visual progress bars updating across all activities
  • Game Integration: Educational snake game with cultural adaptation
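
The shared progress bar described above can be sketched as a small tracker that counts completed items per activity and reports one overall percentage. The class name, the activity names, and the target counts are hypothetical; the presentation does not show how the generated app stored progress.

```python
class ProgressTracker:
    """Track completed items per activity and report overall progress."""

    def __init__(self, targets):
        # targets maps an activity name to the number of items it contains.
        self.targets = dict(targets)
        self.done = {name: 0 for name in self.targets}

    def record(self, activity, count=1):
        """Mark `count` more items as completed, capped at the activity's target."""
        if activity not in self.targets:
            raise KeyError(f"unknown activity: {activity}")
        self.done[activity] = min(self.targets[activity], self.done[activity] + count)

    def percent(self):
        """Overall completion across all activities, from 0.0 to 100.0."""
        total = sum(self.targets.values())
        return 100.0 * sum(self.done.values()) / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical daily targets for the three tabs of the app.
    tracker = ProgressTracker({"flashcards": 10, "quiz": 5, "game": 5})
    tracker.record("flashcards", 5)
    tracker.record("quiz", 5)
    print(f"{tracker.percent():.0f}% complete")
```

Because every tab writes to the same tracker, finishing a flashcard or collecting a cheese both move the single bar the user sees.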

Technical Sophistication:

  • Responsive Design: Works across different screen sizes and devices
  • Interactive Elements: Clickable buttons, tabs, and game controls
  • Audio Integration: French pronunciation using Web Speech API
  • Data Persistence: Progress tracking across different learning modules
  • Visual Polish: Professional styling and color coordination

The Accessibility Revolution:

Democratized Development:

  • No Coding Required: Complex applications created through natural language
  • Immediate Results: From idea to working prototype in minutes
  • Professional Quality: Production-ready code with proper structure
  • Easy Modification: Simple requests for changes and improvements

Creative Expression:

  • Personal Projects: Build applications for family, friends, personal needs
  • Educational Tools: Custom learning applications tailored to specific requirements
  • Rapid Prototyping: Test ideas quickly without technical barriers
  • Iterative Design: Explore multiple approaches effortlessly

The Quality Factor:

Code Generation Excellence:

  • 240+ Lines: Substantial, production-quality code generation
  • Functional Completeness: All requested features working immediately
  • Error Handling: Built-in resilience and debugging capabilities
  • Best Practices: Proper code structure and organization
Mark Chen
Front-end code's super hard. You miss a couple things and it just doesn't work.
OpenAI | Chief Research Officer
Yann Dubois
Exactly. But the good part is that you don't need to understand any of that right now.
OpenAI | Member of Technical Staff
Yann Dubois
The good thing with GPT-5 is that if you have something that you don't like, you can just ask it to change it and it will do it for you.
OpenAI | Member of Technical Staff

Timestamp: [17:27-22:59]

💎 Key Insights from [17:27-22:59]

Essential Insights:

  1. Democratized Development: GPT-5 transforms coding from a technical skill to a creative expression tool, enabling anyone to build sophisticated applications through natural language descriptions
  2. Design Diversity Strategy: The ability to generate multiple design variations simultaneously allows users to explore creative possibilities without being locked into a single approach
  3. Cultural Intelligence: GPT-5 demonstrates sophisticated understanding of cultural context, adapting familiar concepts (snake game) with culturally relevant elements (mouse and cheese for French learning)

Actionable Insights:

  • Personal Project Development: Individuals can now create custom applications for family members, educational needs, or personal interests without programming knowledge
  • Rapid Prototyping Workflow: The multi-tab generation strategy provides an effective method for exploring different design approaches and selecting optimal solutions
  • Educational Tool Creation: Educators and parents can build personalized learning applications that combine entertainment with educational objectives tailored to specific learners

Timestamp: [17:27-22:59]

📚 References from [17:27-22:59]

People Mentioned:

  • Yann Dubois - OpenAI team member demonstrating coding capabilities and "vibe coding" concept
  • Mark Chen - Providing commentary and context about front-end development complexity

Technologies & Tools:

  • Web Speech API - Technology used for French word pronunciation in the educational game
  • Canvas Element - HTML5 technology for creating interactive game graphics
  • JavaScript - Programming language automatically generated for application functionality
  • CSS Styling - Automatically generated for visual design and responsive layout
  • HTML Structure - Generated markup for web application framework

Concepts & Frameworks:

  • Vibe Coding - New paradigm of intuitive, natural language-based programming
  • Educational Game Design - Combining entertainment with learning objectives through interactive gameplay
  • Cultural Adaptation - Modifying familiar game concepts with culturally relevant elements for enhanced learning
  • Progress Gamification - Tracking and visualizing learning achievements across multiple activities
  • Multi-tab Strategy - Generating multiple design variations simultaneously for user selection

Cultural References:

  • "Midnight in Paris" - Romantic theme generated by GPT-5 for the French learning application
  • Snake Game - Classic video game adapted with French cultural elements (mouse and cheese)
  • French Language Learning - Educational context for the application development demonstration

Timestamp: [17:27-22:59]

🎤 What Makes GPT-5's Voice Translation "Seamless" Across Multiple Languages?

Revolutionary Voice Intelligence & Universal Access

Ruochen Wang
We've been steadily improving voice over the past year to make it more useful for everyone. First, it sounds incredibly natural, just like you're talking to a real person.
OpenAI | Multi Modal Researcher

Natural Conversation Breakthroughs:

  1. Human-Like Quality: Incredibly natural speech that eliminates the AI barrier
  2. Video Integration: Voice can see what you see while chatting
  3. Seamless Translation: Consistent, smooth language translation across conversation turns
  4. Custom Instructions: Voice follows specific user guidance and preferences

Universal Access Revolution:

Free Users:

  • Hours of Voice Chat: Extended voice conversation capabilities
  • Generous Limits: Substantial access to advanced voice features

Paid Subscribers:

  • Nearly Unlimited Access: Extensive voice interaction capabilities
  • Custom GPT Integration: Voice available across all custom applications
  • Tailored Experience: Custom instruction following for personalized interactions
Ruochen Wang
We're doing something very special, where we are bringing our best voice experience to everyone. Free users can now chat for hours, while paid subscribers can have nearly unlimited access.
OpenAI | Multi Modal Researcher

Demo Highlights:

Instruction Following Precision:

  • Single Word Responses: "Could you only answer me in one word, please?"
  • Pride and Prejudice Plot: Summarized as "Relationships" in one word
  • Adaptive Communication: From comprehensive to concise to single-word responses

Language Learning Excellence:

  • Korean Practice: Realistic café ordering scenario
  • Speed Adaptation: Ultra-slow for beginners, ultra-fast for advanced practice
  • Cultural Context: Authentic pronunciation and cultural scenarios
  • Step-by-Step Guidance: New study-and-learn mode for structured learning
Mark Chen
It sounds so much more natural than the voice we demoed just a year ago in our 4o demo.
OpenAI | Chief Research Officer
Ruochen Wang
So now the voice is simpler, smarter, and more powerful than ever. We can't wait for you to experience it.
OpenAI | Multi Modal Researcher

Timestamp: [23:05-27:06]

🎨 What Makes ChatGPT's New "Personality Options" a Game-Changer for Personal AI?

Comprehensive Personalization & Style Customization

Mark Chen
We're also launching a research preview of personalities. You can now change the personality of ChatGPT such that it's more supportive, or it's more professional and concise, or maybe even a little bit sarcastic.
OpenAI | Chief Research Officer

Visual Personalization:

Custom Chat Colors:

  • Universal Options: Multiple color schemes for all users
  • Premium Exclusives: Special color options for paid subscribers
  • Visual Identity: Personal branding for your AI conversations

Personality Research Preview:

Communication Styles:

  1. Supportive Mode: Encouraging, empathetic, and motivational responses
  2. Professional & Concise: Business-appropriate, efficient communication
  3. Sarcastic Option: Witty, humorous, and playful interactions

Personal Communication Alignment:

  • Style Consistency: AI adapts to match your preferred communication approach
  • Authentic Interaction: More natural conversations that feel personalized
  • Flexible Adaptation: Switch between personalities based on context and mood
Mark Chen
And this lets you interact with ChatGPT in a way that's consistent with your own communication style.
Mark Chen, OpenAI | Chief Research Officer

Enhanced Memory System:

Mark Chen
One of my favorite features that we've launched over the last year has been memory. And we've made a lot of enhancements in memory in the time since. This allows ChatGPT to learn about you.
Mark Chen, OpenAI | Chief Research Officer
Christina Kaplan
This is our aspiration for ChatGPT—to understand what's meaningful to you so it can help you achieve your goals in life.
Christina Kaplan, OpenAI | Personalisation Lead

Deep Personal Understanding:

  • Goal-Oriented Learning: AI understands what's meaningful to help achieve life goals
  • Continuous Improvement: Gets to know you better over time
  • Contextual Awareness: Remembers preferences, needs, and important details

Real-World Applications:

  • Fitness Planning: Personalized marathon training schedule creation
  • Life Organization: Daily planning and schedule optimization
  • Personal Assistant: Comprehensive life management support

The Vision Statement:

"To understand what's meaningful to you so it can help you achieve your goals in life"

Timestamp: [27:17-28:42]

📧 How Does Gmail and Google Calendar Integration Transform ChatGPT Into Your Personal Life Manager?

Revolutionary Calendar & Email Intelligence

Christina Kaplan
ChatGPT still has many limitations. It doesn't understand my actual schedule. Next week, starting with Pro users, followed by Plus, Team, and Enterprise users, this is changing, and we're giving ChatGPT access to Gmail and Google Calendar.
Christina Kaplan, OpenAI | Personalisation Lead

Integration Rollout:

User Tier Access:

  1. Pro Users: Access begins next week (first in line)
  2. Plus Users: Following the Pro rollout
  3. Team Users: Following the Pro rollout
  4. Enterprise Users: Following the Pro rollout

Real-World Life Management:

Christina's Personal Use Case:

  • Marathon Training: Personalized running schedule coordination
  • Daily Planning: "Help me plan my schedule tomorrow" - instant organization
  • Busy Week Management: Used every day during GPT-5 launch week

Intelligent Schedule Analysis:

Automatic Capabilities:

  • Schedule Parsing: Instantly analyzes tomorrow's calendar commitments
  • Proactive Planning: Finds time for personal activities (like running) without being asked
  • Email Monitoring: Identifies missed emails from 2 days ago requiring responses
  • Travel Preparation: Creates packing lists based on personal preferences and trip details
Christina Kaplan
ChatGPT has pulled in my schedule tomorrow and—oh, without even asking—ChatGPT found time for my run. I don't think I was invited to the launch celebration.
Christina Kaplan, OpenAI | Personalisation Lead
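
The proactive scheduling shown in this demo (finding room for a run in a busy day) can be illustrated with a small interval search. This is a minimal sketch, not how ChatGPT's calendar connector works: the `find_free_slot` helper, the sample events, and the 07:00 to 21:00 planning window are all assumptions made for illustration.

```python
from datetime import datetime, timedelta

def find_free_slot(busy, day_start, day_end, duration):
    """Return the first (start, end) gap of at least `duration`, or None.

    `busy` is a list of (start, end) datetime pairs; overlaps are allowed.
    """
    cursor = day_start
    for start, end in sorted(busy):
        # A gap only counts if it fits before the next event and before day_end.
        if min(start, day_end) - cursor >= duration:
            return cursor, cursor + duration
        cursor = max(cursor, end)  # move past this event (handles overlaps)
    if day_end - cursor >= duration:
        return cursor, cursor + duration
    return None

# Hypothetical calendar for one day (not taken from the demo).
day = datetime(2025, 8, 8)
busy = [
    (day.replace(hour=9), day.replace(hour=12)),
    (day.replace(hour=12, minute=30), day.replace(hour=17)),
    (day.replace(hour=7), day.replace(hour=8, minute=30)),
]
slot = find_free_slot(
    busy,
    day_start=day.replace(hour=7),
    day_end=day.replace(hour=21),
    duration=timedelta(minutes=45),
)
print(slot)
```

With these sample events the two 30-minute gaps are too short for a 45-minute run, so the search settles on the first opening after the last meeting.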

The Seamless Experience:

Setup Process:

  • One-Time Connection: Grant access to Gmail and Google Calendar
  • Instant Functionality: Works immediately after permission granted
  • Automatic Prompting: ChatGPT requests connection when needed

Personal Assistant Evolution:

  • Proactive Insights: Finds important information without specific requests
  • Contextual Understanding: Knows personal preferences for travel, work, and lifestyle
  • Life Integration: Combines calendar, email, and personal knowledge for comprehensive planning

The Personal Touch:

Demonstrated Features:

  • Missed Communications: "ChatGPT found an email that I didn't respond to two days ago"
  • Automatic Planning: Found time for running without being asked
  • Personal Preferences: Packing list based on known travel preferences
  • Event Awareness: Aware of launch celebrations and team activities
Christina Kaplan
It's been amazing to see that as GPT-5 is getting more capable, ChatGPT is getting more useful and more personal.
Christina Kaplan, OpenAI | Personalisation Lead

Timestamp: [28:42-29:57]

🌟 How Did OpenAI Turn Limited AI Into "Apps on Demand" Technology?

The Complete Transformation Journey

Mark Chen
We've come a long way from the days where only 5 to 10 lines of code were working, and now it's amazing that you can produce these kinds of apps on demand.
Mark Chen, OpenAI | Chief Research Officer

Historical Perspective:

The Early Days:

  • Limited Functionality: Only 5-10 lines of working code
  • Basic Responses: "As an AI model, I can't..." personality
  • Single Purpose: Uncertain about real-world applications

Today's Capabilities:

  • Complex Applications: 240+ line sophisticated web apps
  • Natural Interaction: Human-like voice and personality options
  • Life Integration: Calendar, email, and personal preference understanding

The Multimodal Revolution:

Voice Enhancements:

  • Natural Quality: Sounds like talking to a real person
  • Video Integration: AI can see while conversing
  • Language Translation: Seamless multilingual conversations
  • Custom Instructions: Follows specific user preferences

Personalization Depth:

  • Visual Customization: Chat colors and interface personalization
  • Communication Styles: Supportive, professional, or sarcastic personalities
  • Memory Enhancement: Deep understanding of personal goals and preferences
  • Life Management: Integration with real-world tools and schedules

The Study-and-Learn Innovation:

Educational Features:

  • Guided Learning: Step-by-step subject understanding
  • Language Practice: Real-world scenario simulation (café ordering)
  • Adaptive Teaching: Adjusts speed and complexity for learner level
  • Cultural Context: Authentic pronunciation and cultural scenarios

Universal Access Philosophy:

Democratized Intelligence:

  • Free User Access: Hours of advanced voice capabilities
  • Premium Access: Nearly unlimited access for subscribers
  • Custom GPT Integration: Voice across all custom applications
  • Personal AI Vision: Understanding what's meaningful to achieve life goals

Timestamp: [22:59-29:57]

💎 Key Insights from [22:59-29:57]

Essential Insights:

  1. Voice Democratization: The extension of natural, human-like voice capabilities to free users represents a fundamental shift in AI accessibility, breaking down barriers between premium and basic AI experiences
  2. Personal AI Evolution: The combination of personality options, enhanced memory, and real-world integration (Gmail/Calendar) transforms ChatGPT from a tool into a genuine personal assistant that understands individual goals and preferences
  3. Seamless Life Integration: The ability to automatically analyze schedules, find missed emails, and proactively plan activities without explicit requests demonstrates AI moving from reactive to proactive assistance

Actionable Insights:

  • Language Learning Revolution: The study-and-learn mode with adaptive speed and cultural context provides an unprecedented personalized language learning experience accessible to all users
  • Personal Productivity Enhancement: Gmail and Google Calendar integration enables immediate life organization and proactive schedule management, rolling out to Pro users first, then Plus, Team, and Enterprise users
  • Communication Style Optimization: Personality options allow users to align AI interactions with their natural communication preferences, improving efficiency and comfort in daily AI usage

Timestamp: [22:59-29:57]

📚 References from [22:59-29:57]

People Mentioned:

  • Ruochen Wang - OpenAI multimodal research team member demonstrating voice capabilities
  • Christina Kaplan - OpenAI team member showcasing personalization features and Gmail/Calendar integration
  • Mark Chen - Chief Research Officer introducing enhanced features and providing historical context

Technologies & Tools:

  • Gmail Integration - Email access for schedule planning and communication management
  • Google Calendar - Calendar integration for intelligent schedule analysis and planning
  • Custom GPTs - Personalized AI applications with voice capability integration
  • Study-and-Learn Mode - New educational feature for step-by-step subject understanding
  • Voice Model - Advanced speech synthesis and recognition technology

Languages & Cultural Elements:

  • Korean - Language practiced in the café ordering voice demo

Concepts & Frameworks:

  • Multimodal Research - Integration of voice, video, and text interaction capabilities
  • Personality Options - Customizable communication styles (supportive, professional, sarcastic)
  • Enhanced Memory System - AI capability to learn and remember user preferences and goals
  • Proactive Planning - AI initiative in finding solutions without explicit user requests
  • Universal Access Philosophy - Extending advanced features to free users

Subscription Tiers:

  • Free Users - Hours of voice chat access with core features
  • Plus Users - Enhanced access and premium color options
  • Pro Users - Nearly unlimited voice access and first access to new integrations
  • Team Users - Organizational features with enhanced capabilities
  • Enterprise Users - Full organizational integration and management tools

Personal Use Cases:

  • Marathon Training - Personalized fitness schedule coordination and planning
  • Daily Schedule Management - Comprehensive life organization and time management
  • Travel Planning - Automated packing lists based on personal preferences
  • Language Learning - Interactive conversation practice with cultural context

Timestamp: [22:59-29:57]

🛡️ How Does GPT-5's "Safe Completions" Approach Revolutionize AI Safety Beyond Simple Refusal?

Revolutionary Safety Training Paradigm

Deception Mitigation Breakthrough:

The Core Problem:

  • Model Misrepresentation: AI lying about task success or actions to users
  • Common Scenarios: Underspecified tasks, impossible requests, or missing tools
  • Measured Improvement: GPT-5 is significantly less deceptive than o3 and o4-mini
Saachi Jain
In addition to mitigating hallucinations, we've also spent a significant amount of time mitigating deception. So, these are instances where the model might misrepresent its actions to the user or lie about task success.
Saachi Jain, OpenAI | Safety Training Lead

The Old Binary Approach Problem:

Traditional Safety Model:

  1. Binary Decision: Either outright refuse or fully comply
  2. Failure Modes:
     • Cleverly worded prompts could sneak through
     • Legitimate but sensitive questions got outright refusal
  3. Inconsistent Responses: Same information treated differently based on framing

The Fireworks Example Case Study:

Dual-Use Scenario:

  • Request: Technical details on igniting pyrogen (a fireworks material)
  • Legitimate Use: July 4th display preparation
  • Potential Misuse: Harmful applications

o3's Inconsistent Behavior:

  • Neutral Technical Framing: Full compliance with detailed information
  • Explicit Harmful Framing: Complete refusal of identical information
  • Problem: Over-rotation on intent assessment rather than content safety

Safe Completions Innovation:

Revolutionary Approach:

  • Core Principle: Maximize helpfulness within safety constraints
  • Partial Responses: Answer at appropriate level without full harmful details
  • High-Level Guidance: Provide conceptual understanding with safety boundaries
Saachi Jain
For GPT-5, we've changed this approach entirely, and we're introducing something that we're calling safe completions. The point of safe completions is, rather than judging the user's prompt, instead it tries to maximize helpfulness within safety constraints.
Saachi Jain, OpenAI | Safety Training Lead

Enhanced User Experience:

  1. Explanation of Limitations: Clear reasoning for safety boundaries
  2. Alternative Pathways: Helpful suggestions for safe information access
  3. Guided Redirection: Steering conversations toward safe, productive directions

The GPT-5 Fireworks Response:

Intelligent Safety Handling:

  • Acknowledges Request: Understands the technical nature of the question
  • Explains Boundaries: Clear reason why direct assistance isn't provided
  • Provides Guidance: Points to safety guidelines and manufacturer manuals
  • Maintains Helpfulness: Offers legitimate pathways to safe information
Saachi Jain
Overall, GPT-5 allows for better handling of tricky dual-use scenarios, and users will experience fewer 'I'm sorry, I can't assist with that,' and it creates a more robust safety system. This is one big step towards a more safe, reliable, and helpful AI.
Saachi Jain, OpenAI | Safety Training Lead
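
To make the contrast concrete, the sketch below compares a binary refuse-or-comply policy with an output-centered one. It is a toy illustration only, not OpenAI's training method or any real API: the `Request` fields, the `detail_risk` score, and the `max_risk` threshold are hypothetical stand-ins for whatever a real safety system would measure.

```python
from dataclasses import dataclass

@dataclass
class Request:
    text: str
    stated_intent_harmful: bool  # how the prompt is framed
    detail_risk: float           # hypothetical 0..1 risk of a fully detailed answer

REFUSAL = "I'm sorry, I can't assist with that."
FULL = "FULL_DETAIL"
PARTIAL = "HIGH_LEVEL_GUIDANCE + reason details are withheld + safer resources"

def binary_policy(req: Request) -> str:
    """Old style: judge the prompt's framing, then refuse or comply in full."""
    return REFUSAL if req.stated_intent_harmful else FULL

def safe_completion_policy(req: Request, max_risk: float = 0.3) -> str:
    """Output-centered: give the most helpful answer that stays within the limit."""
    return FULL if req.detail_risk <= max_risk else PARTIAL

# Same underlying information, two framings (both hypothetical).
neutral = Request("Ignition details for a pyrogen?", False, 0.8)
explicit = Request("Ignition details for a pyrogen, to cause harm", True, 0.8)

for req in (neutral, explicit):
    print(binary_policy(req), "|", safe_completion_policy(req))
```

In this toy, the binary policy answers the neutral framing in full and refuses the explicit one, while the output-centered policy gives both the same bounded answer, which is the consistency the section describes.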

Timestamp: [30:04-33:01]

🔄 What is the "Recursive Self-Improvement Loop" That's Transforming AI Training Forever?

Next-Generation Model Training Paradigm

The Synthetic Data Revolution:

Beyond Traditional Data Collection:

  • Frontier Models as Creators: AI systems now help create their own training data
  • Quality Over Quantity: Focus on "right kind of data" rather than just more data
  • Educational Approach: Data shaped to teach complex concepts, not fill space
Sebastien Bubeck
Today, frontier models do not just consume data, they help create it. We used OpenAI's o3 to craft a high-quality synthetic curriculum to teach GPT-5 complex topics in a way that the raw web simply never could.
Sebastien Bubeck, OpenAI | Member of Technical Staff

o3's Role in GPT-5 Training:

High-Quality Curriculum Creation:

  • Complex Topic Teaching: o3 crafts synthetic curriculum for advanced concepts
  • Beyond Raw Web Data: Structured learning that web data alone couldn't provide
  • Targeted Skill Development: Specific capability enhancement through designed scenarios

Industry Paradigm Shift:

Common Misconception vs. Reality:

  • Industry View: Synthetic data as cheap way to get more volume
  • OpenAI Breakthrough: Creating precisely crafted educational data
  • Key Insight: Data shaped for teaching effectiveness, not storage efficiency
Sebastien Bubeck
Our breakthrough was not just to create more data, but rather to create the right kind of data, shaped in a way to teach rather than just to fill space.
Sebastien Bubeck, OpenAI | Member of Technical Staff

The Recursive Loop Concept:

Inter-Generational Model Cooperation:

  1. Previous Generation: Helps create training data for next generation
  2. Continuous Improvement: Each model iteration improves the training process
  3. Exponential Enhancement: Recursive improvement accelerates capability growth
Sebastien Bubeck
This interaction between generations of models foreshadows a recursive self-improvement loop where the previous generation of model increasingly helps to improve the data and generate the training for the next generation of models.
Sebastien Bubeck, OpenAI | Member of Technical Staff

Training Evolution Timeline:

Historical Progression:

  • Pre-Training Era: Basic foundational model development
  • Reasoning Breakthrough: Advanced thinking and problem-solving capabilities
  • Current Innovation: Deep interaction between pre-training and reasoning

Future Training Pipeline:

Beyond Current Methods:

  • Post-Training Evolution: Moving beyond traditional post-training pipelines
  • Integrated Approach: Seamless combination of multiple training methodologies
  • Scaling Potential: New techniques ready for massive expansion

The Implications:

Fundamental AI Development Shift:

  • Self-Directing Progress: AI systems increasingly guide their own development
  • Exponential Capability Growth: Recursive improvement enables rapid advancement
  • Training Efficiency: More effective learning through designed rather than found data

Timestamp: [33:01-34:25]

🧠 How Did OpenAI Master Pre-Training, Then Reasoning to Build the Foundation for AGI?

The Complete AI Development Mastery

The Three-Phase Mastery:

Historical Achievements:

  1. Pre-Training Breakthrough: Foundational model development mastered
  2. Reasoning Revolution: Advanced thinking and problem-solving achieved
  3. Deep Interaction: Sophisticated integration of both capabilities
Sebastien Bubeck
Here at OpenAI, we've cracked pre-training, then reasoning, and now we're seeing their interaction significantly deepen.
Sebastien Bubeck, OpenAI | Member of Technical Staff

The Training Pipeline Evolution:

Current State Transcendence:

  • Beyond Traditional Methods: Moving past established pre-training and post-training approaches
  • Integrated Development: Seamless combination of multiple advanced techniques
  • Future-Ready Architecture: Foundation for next-generation AI systems

Recursive Self-Improvement Impact:

Exponential Development Potential:

  • Model-Assisted Training: AI systems helping design their successors
  • Quality-Focused Data: Educational curriculum rather than raw information
  • Accelerated Capability Growth: Each generation significantly enhances training effectiveness

The Near-Future Vision:

Immediate Scaling Potential:

  • Technique Maturation: Current innovations ready for large-scale deployment
  • Capability Multiplication: Recursive improvement enabling rapid progress
  • AGI Pathway: Clear progression toward artificial general intelligence
Sebastien Bubeck
In the future, AI systems will move far beyond our current pre-training and post-training pipelines that we have been used to, and we're seeing the first steps toward this right now, right here.
Sebastien Bubeck, OpenAI | Member of Technical Staff

Research and Development Integration:

Safety and Capability Balance:

  • Deception Mitigation: Advanced safety while maintaining functionality
  • Safe Completions: Helpful AI without compromising security
  • Responsible Scaling: Rapid advancement with safety prioritization
Sebastien Bubeck
We could not be more excited to see what scaling up this new set of techniques will yield in the near future.
Sebastien Bubeck, OpenAI | Member of Technical Staff

Industry Leadership Position:

Competitive Advantage:

  • Technical Mastery: Proven success across fundamental AI development areas
  • Innovation Pipeline: Clear pathway for continued advancement
  • Scaling Readiness: New techniques prepared for massive expansion

Timestamp: [30:04-34:30]

💎 Key Insights from [30:04-34:30]

Essential Insights:

  1. Safety Paradigm Revolution: The shift from binary refuse/comply to "safe completions" represents a fundamental breakthrough in AI safety, enabling helpful responses while maintaining security boundaries
  2. Recursive Self-Improvement Reality: AI systems now actively participate in training their successors through high-quality synthetic data creation, marking the beginning of exponential capability growth
  3. Training Methodology Mastery: OpenAI's progression from pre-training to reasoning to deep integration positions them uniquely for the next phase of AI development toward AGI

Actionable Insights:

  • Dual-Use Scenario Handling: Organizations can expect more nuanced AI responses that provide helpful guidance while maintaining safety, reducing frustrating refusals and increasing practical utility
  • Data Strategy Evolution: The focus on educational quality over quantity in synthetic data suggests enterprises should prioritize curriculum-designed training rather than volume-based approaches
  • Development Pipeline Preparation: The move beyond traditional training pipelines indicates organizations should prepare for rapidly evolving AI capabilities and integration methods

Timestamp: [30:04-34:30]

📚 References from [30:04-34:30]

People Mentioned:

  • Saachi Jain - OpenAI safety training lead presenting deception mitigation and the safe completions approach
  • Sebastien Bubeck - OpenAI researcher explaining recursive self-improvement and synthetic data innovations
  • Mark Chen - Chief Research Officer introducing safety and training research topics

Technologies & Models:

  • o3 - Previous generation model used to create training curriculum for GPT-5
  • o4-mini - Model referenced for deception comparison with GPT-5
  • Safe Completions - New safety approach that maximizes helpfulness within constraints
  • Synthetic Data Curriculum - AI-generated educational content for training complex topics

Concepts & Frameworks:

  • Deception Mitigation - Safety training to prevent AI from misrepresenting actions or lying about task success
  • Dual-Use Scenarios - Situations where information could be used for both legitimate and harmful purposes
  • Recursive Self-Improvement Loop - Process where previous AI generations help improve training for next generations
  • Binary Refuse/Comply Model - Traditional safety approach of either complete refusal or full compliance

Safety Training Evolution:

  • Traditional Safety: Outright refusal or full compliance based on prompt assessment
  • Safe Completions: Partial responses and high-level guidance within safety constraints
  • Intent Assessment Problems - Issues with judging user intent rather than content safety
  • Alternative Pathways - Providing safe directions when direct assistance isn't appropriate

Technical Processes:

  • Pre-Training Mastery - Foundational model development achievements
  • Reasoning Integration - Advanced thinking capabilities combined with base training
  • Post-Training Pipeline Evolution - Moving beyond traditional training methodologies
  • Synthetic Curriculum Design - Creating educational data specifically shaped for teaching effectiveness

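Real-World Examples:

  • Fireworks (Pyrogen) Query - Dual-use request contrasting o3's framing-dependent refusals with GPT-5's safe completion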

Timestamp: [30:04-34:30]

📋 How Does AI Turn a Blur of Medical Jargon Into Life-Saving Understanding?

Sam Altman
One of the top use cases of ChatGPT is health. People use it a lot. You've all seen examples of people getting day-to-day care advice or sometimes even a life-saving diagnosis.
Sam Altman, OpenAI | Co-founder & CEO

Carolina's Life-Changing Healthcare Journey

The Devastating Diagnosis:

The Shocking Discovery:

  • October Timeline: Lives turned "completely upside down" in one week
  • Triple Cancer Diagnosis: Three different cancers including aggressive breast cancer
  • Age Factor: 39 years old - completely unexpected
  • Emotional Impact: "Absolutely nothing prepares you to receive news like this"
Carolina Millon
I completely panicked and, in that moment, did the first thing that I thought of, which was to take a screenshot of the report and put it into ChatGPT to see if it could just help me understand what this meant. And within seconds, it translated this complex report into plain language that I could understand.
Carolina Millon, Macy's | Sr. Director

The Critical Email Moment:

  1. Notification Received: Biopsy results ready via email
  2. Medical Jargon Confusion: Only understood "invasive carcinoma" from complex report
  3. Immediate Panic: Overwhelming fear and confusion
  4. Instinctive Action: Screenshot sent to ChatGPT for translation

The Life-Saving Translation:

From Panic to Understanding:

  • Seconds to Clarity: Complex medical report translated to plain language instantly
  • Emotional Relief: Moment of clarity amid overwhelming panic
  • Critical Preparation: 3-hour window to understand before doctor call
  • Empowered Conversation: Baseline knowledge enabled productive discussion about next steps
Carolina Millon
For me, bearing the weight of this decision that could have lifelong impact felt really heavy, and I didn't feel equipped to make the call. So I turned to ChatGPT to gain knowledge and understand the nuances of my case.
Carolina Millon, Macy's | Sr. Director

Advanced Decision-Making Support:

The Radiation Treatment Dilemma:

  • Medical Disagreement: Doctors themselves couldn't agree on treatment
  • Nuanced Case: No clear medical consensus on optimal path
  • Patient Responsibility: Decision placed back on Carolina despite complexity
  • High Stakes: Lifelong impact potential created enormous pressure

ChatGPT's Comprehensive Analysis:

  • Detailed Breakdown: More thorough than 30-minute consultation
  • Matched Medical Input: Confirmed what doctors shared
  • Risk-Benefit Analysis: Comprehensive pros and cons evaluation
  • Informed Decision: Enabled confident choice on high-stakes treatment

The Personal Transformation:

Regaining Agency:

  • Knowledge Empowerment: Bridged gap between doctor expertise and patient understanding
  • Active Participation: From helpless recipient to engaged advocate
  • Self-Advocacy: Confident participation in own care journey
  • Family Impact: Informed decisions affecting loved ones

Timestamp: [34:38-38:13]

🤝 What's Behind the Shift From "Fear-Based" to "Knowledge-Based" Medical Decisions?

The Philosophy of AI-Powered Healthcare Advocacy

The Knowledge Gap Reality:

Traditional Healthcare Dynamics:

  • Expertise Imbalance: Vast knowledge gap between doctors and patients
  • Feeling Helpless: Easy to become passive recipient of care
  • Limited Consultation Time: 30-minute appointments can't cover comprehensive understanding
  • Complex Medical Language: Barriers to understanding own health situation

The Agency Revolution:

Personal Investment Advantage:

  • Highest Motivation: "No one cares more about Carolina's health than she does"
  • Active Participation: Transform from passive patient to engaged advocate
  • Informed Decision-Making: Knowledge-based rather than fear-based choices
  • Self-Empowerment: Confidence in high-stakes medical decisions
Felipe Millon
And there's such a big knowledge gap between what the doctors know and what we know. And, however, no one cares more about Carolina's health than she does.
Felipe Millon, OpenAI | Government GTM

AI's Role in Healthcare Transformation:

Beyond Traditional AI Healthcare Applications:

  • Not Just Diagnostics: More than breakthrough discoveries or better diagnoses
  • Patient Empowerment Focus: Creating smarter, more empowered patients
  • Advocacy Support: Tools for self-advocacy in medical settings
  • Full Participation: Enabling complete engagement in care journey

The Inspirational Transformation:

Witnessing Empowerment:

  • Observable Change: Visible regaining of personal agency
  • Knowledge Acquisition: Active learning about medical condition
  • Confident Advocacy: Speaking up in medical consultations
  • Informed Choices: Decisions based on understanding rather than fear
Felipe Millon
For me, what was really inspirational was watching her regain her sense of agency by using ChatGPT. In this moment, it'd be so easy to feel helpless.
Felipe Millon, OpenAI | Government GTM

The Broader Healthcare Vision:

Systemic Impact Potential:

  • Smarter Patients: More informed healthcare consumers
  • Better Outcomes: Engaged patients often have better results
  • Healthcare Partnership: Collaborative rather than hierarchical relationships
  • Democratized Knowledge: Medical understanding accessible to all

The Future of Patient Care:

AI as Healthcare Companion:

  • 24/7 Availability: Instant access to medical information translation
  • Personalized Support: Tailored to individual medical situations
  • Emotional Support: Clarity during overwhelming moments
  • Decision Framework: Structured approach to complex medical choices
Felipe Millon
I think that the promise of AI in healthcare isn't in just breakthrough discoveries or better diagnostics. I think it's in creating smarter and more empowered patients that can fully participate and advocate for themselves in their care.
Felipe Millon, OpenAI | Government GTM

Timestamp: [38:13-39:01]

🚀 How Does GPT-5's Thought Partner Approach Transform Healthcare Decision-Making?

Next-Generation Medical Intelligence & Support

GPT-5 Performance Breakthrough:

Speed and Thoroughness:

  • Alarmingly Fast: "Almost a little alarmingly" quick responses
  • Comprehensive Analysis: Thorough despite speed
  • Thought Partnership: Connects dots rather than just translating information
  • Navigation Support: Helps navigate problems, not just answer questions
Carolina Millon
I've been so mind-blown about GPT-5 and its capabilities. One of the first things that jumps out at me is just how fast it is. Almost a little alarmingly... and more importantly, it feels more like a thought partner that connects the dots. So rather than just translating information or giving you an answer, it helps you actually navigate the problem.
Carolina Millon, Macy's | Sr. Director

The Biopsy Report Comparison:

GPT-4o Capabilities:

  • Solid Translation: Explained medical terminology effectively
  • Basic Understanding: Helped users comprehend complex information
  • Information Processing: Converted medical jargon to plain language

GPT-5 Advanced Intelligence:

  • Contextual Understanding: Grasped the deeper context and implications
  • Question Behind the Question: Understood why patients were asking about biopsy results
  • Proactive Guidance: Identified missing information and pending results
  • Future Planning: Suggested questions for upcoming doctor consultations
Felipe Millon
A great example is we actually went back and took our initial biopsy prompts and put them into GPT-5. And GPT-4 had done a great job. It had translated, explained what these words meant, and helped in a way that we can understand. But GPT-5 seemed to understand more of the context and the question behind the question—why would we be asking biopsy results?
Felipe Millon, OpenAI | Government GTM

Comprehensive Patient Support:

Beyond Information Translation:

  1. Complete Personalized Picture: Holistic view of medical situation
  2. Pending Results Identification: Awareness of what information is still needed
  3. Question Preparation: Specific questions to ask healthcare providers
  4. Strategic Thinking: Long-term planning for medical discussions

Real-World Impact Assessment:

Benchmark vs. Reality:

  • Academic Performance: Strong scores on HealthBench evaluation
  • Practical Application: Real-world utility for actual patients
  • Immediate Availability: Tool accessible today for current patients
  • Continuous Improvement: Better support than available just 8 months prior

The Emotional Dimension:

Personal Stakes Recognition:

  • Individual Impact: Every person receiving similar diagnosis today
  • Family Consideration: Support for families facing cancer diagnoses
  • Life-Changing Decisions: Most challenging decisions of their lives
  • Tool Evolution: Access to better support systems than previously available
Sam Altman
GPT-5 is the best model ever for health, and it empowers you to be more in control of your healthcare journey.
Sam Altman, OpenAI | Co-founder & CEO

Healthcare Accessibility Revolution:

Universal Healthcare Intelligence:

  • Top Use Case: Health is one of the top ChatGPT use cases
  • Day-to-Day Advice: Regular healthcare guidance for common issues
  • Life-Saving Potential: Sometimes providing critical diagnostic insights
  • Best Model Ever: GPT-5 represents highest healthcare capability achievement

Timestamp: [39:01-40:39]

🌟 From Panic to Empowerment: How Does AI Healthcare Support Keep Improving?

The Rapid Evolution of AI Healthcare Support

The Passionate Mission:

Why Share This Story:

  • Individual Impact: Every person receiving diagnosis today
  • Family Support: Families facing cancer and similar diagnoses
  • Real-World Urgency: People facing life's most challenging decisions
  • Immediate Availability: Better tools accessible right now
Felipe Millon
Those families going through a cancer diagnosis, or a similar medical diagnosis, are going to face some of the most challenging decisions of their lives. And what really inspires me is that they're going to have access to better tools and support than we had even just 8 months ago.
Felipe Millon, OpenAI | Government GTM

The 8-Month Transformation:

Rapid Capability Evolution:

  • Benchmark Improvements: Measurable performance enhancements
  • Practical Utility: Real-world application effectiveness
  • Accessibility: Tool available today for current patients
  • Continuous Enhancement: Ongoing improvement in support quality

HealthBench Validation:

Professional Medical Evaluation:

  • 250 Physician Assessment: Comprehensive expert evaluation
  • Real-World Tasks: Practical healthcare scenarios
  • Highest Scoring: Superior performance compared to all previous models
  • Evidence-Based Improvement: Measurable advancement in healthcare capabilities
Sam Altman
We really prioritized improving this for GPT-5, and it scores higher than any previous model on HealthBench, an evaluation that we created with 250 physicians on real-world tasks.
Sam Altman, OpenAI | Co-founder & CEO

The Personal Technology Impact:

From Fear to Empowerment:

  • Panic to Clarity: Instant translation of overwhelming medical information
  • Helplessness to Agency: Active participation in healthcare decisions
  • Confusion to Understanding: Complex medical concepts made accessible
  • Isolation to Support: 24/7 availability for medical guidance

Future Healthcare Vision:

Systemic Healthcare Transformation:

  • Patient Empowerment: More informed and engaged patients
  • Decision Support: Better tools for complex medical choices
  • Accessibility: Advanced healthcare intelligence for everyone
  • Continuous Improvement: Rapidly evolving support capabilities

The Broader Implications:

Healthcare Democratization:

  • Knowledge Access: Medical expertise available to all patients
  • Quality Care: Enhanced support regardless of healthcare access
  • Advocacy Tools: Better self-advocacy in medical settings
  • Outcome Improvement: More informed patients often achieve better results

Timestamp: [34:38-40:51]

💎 Key Insights from [34:38-40:51]

Essential Insights:

  1. Healthcare Empowerment Revolution: AI transforms patients from passive recipients to active advocates by bridging the knowledge gap between medical professionals and patients, enabling informed decision-making in life-threatening situations
  2. Emotional Intelligence in Crisis: The combination of instant medical translation and emotional support during overwhelming moments represents a fundamental shift in how people can navigate healthcare emergencies
  3. Contextual Medical Understanding: GPT-5's ability to understand "the question behind the question" provides proactive guidance and comprehensive care planning beyond simple information translation

Actionable Insights:

  • Immediate Medical Translation: Patients can instantly translate complex medical reports and terminology into understandable language, enabling productive conversations with healthcare providers
  • Decision-Making Framework: AI provides comprehensive risk-benefit analysis and pros/cons evaluation for complex medical decisions when even doctors disagree on optimal treatment
  • Proactive Healthcare Planning: Advanced AI identifies missing information, suggests relevant questions for doctor visits, and helps create complete personalized medical pictures for better care coordination

Timestamp: [34:38-40:51]

📚 References from [34:38-40:51]

People Mentioned:

  • Carolina Millon - Sharing her personal healthcare journey and AI assistance experience
  • Felipe Millon - Carolina's husband and OpenAI colleague witnessing his partner's empowerment and agency through AI
  • Sam Altman - OpenAI CEO introducing healthcare applications and supporting personal testimony

Healthcare Processes:

Technologies & Tools:

  • HealthBench Evaluation - Assessment tool created with 250 physicians for real-world healthcare tasks
  • Medical Report Translation - AI capability to convert complex medical language to plain English
  • Screenshot Analysis - Ability to analyze medical documents through image upload

Concepts & Frameworks:

  • Patient Agency - Concept of patients taking active control in their healthcare journey
  • Healthcare Advocacy - Self-representation and informed participation in medical decisions
  • Knowledge Gap Bridging - Reducing disparity between medical professional and patient understanding
  • Thought Partnership - AI functioning as collaborative decision-making support rather than simple information provider

Medical Decision-Making:

  • Risk-Benefit Analysis - Comprehensive evaluation of treatment pros and cons
  • Nuanced Medical Cases - Complex situations without clear consensus treatment protocols
  • Consultation Preparation - Strategic question development for healthcare provider meetings
  • Treatment Timeline Planning - Long-term healthcare journey coordination and planning

Healthcare Accessibility:

  • Day-to-Day Care Advice - Regular healthcare guidance for common medical questions
  • Life-Saving Diagnosis - Critical medical insights potentially preventing serious outcomes
  • Universal Healthcare Intelligence - Advanced medical support available to all users regardless of healthcare access

Timestamp: [34:38-40:51]

🚀 What Makes GPT-5 the Best Model at Agentic Coding Tasks?

The Promise of What Computers Can Be

Greg Brockman
Software engineering is already fundamentally changing, and GPT-5 will turbocharge that revolution.
Greg Brockman, OpenAI | President

The Historical Journey:

2021 Coding Revolution:

  • First Coding-Optimized Model: Released back in 2021 with live demonstrations
  • "Vibe Coding" Origin: First-time demonstration of conversational programming
  • Mind-Blowing Realization: Talk to computer, get actual applications built
  • The Vision: Computers that actually do what you want them to do
Greg Brockman
I remember seeing the model being capable of doing this, and it was so mind-blowing. You just realize we have to see where this goes. This is the promise of what computers can be: that you can talk to them and they actually do what you want.
Greg Brockman, OpenAI | President

The Amplification Promise:

  • Personal Benefit: Dramatically increase individual capability and delivery
  • Global Impact: Amplify what you can accomplish for the world
  • Revolutionary Potential: Fundamental change in human-computer interaction

GPT-5 Agentic Coding Supremacy:

Advanced Autonomous Capabilities:

  1. Complex Task Management: Accomplish very complicated multi-step projects
  2. Extended Work Sessions: Work for many minutes or even longer on single tasks
  3. Tool Integration: Call many tools and coordinate their usage
  4. Goal Achievement: Follow through from instruction to complete implementation
Greg Brockman
It is the best model at agentic coding tasks. You can ask it to go and accomplish something very complicated, and it'll go off and it'll work on it. It'll call many tools. It'll work for many minutes at a time, sometimes even longer, to accomplish your goal.
Greg Brockman, OpenAI | President

Specialized Excellence Areas:

  • Front-End Mastery: Beautiful visualizations and interactive games
  • Aesthetic Capabilities: Superior visual design and user experience
  • Instruction Following: Handle both vague intent and detailed specifications
  • Speed Optimization: Fast task completion with appropriate thinking time

The New Standard in Coding:

Benchmark Leadership:

  • Best Agentic Model: Superior performance on complex coding tasks
  • Detail Handling: Process extremely detailed specifications accurately
  • Intent Inference: Understand vague requirements and fill in gaps intelligently
  • Real-World Application: From imagination to working implementation

Developer Empowerment:

Beyond Personal Use:

  • Novel Applications: Enable building entirely new types of software
  • API Integration: Available for developers to build innovative solutions
  • Creative Freedom: Whatever you imagine can come to life
  • Professional Quality: Production-ready code generation

Timestamp: [41:05-43:08]

⚡ What Makes GPT-5's "Three State-of-the-Art Reasoning Models" a Game-Changer for API Development?

Revolutionary API Architecture & Flexibility

The Three-Model Power Lineup:

Complete Cost-Latency Coverage:

  1. GPT-5: Full-powered reasoning for complex applications
  2. GPT-5 Mini: Balanced performance for standard use cases
  3. GPT-5 Nano: Optimized for speed-critical applications
Michelle Pokrass
Today I'm so excited to tell you that we're shipping three state-of-the-art reasoning models in the API: GPT-5, GPT-5 Mini, and GPT-5 Nano. All three slot right in the cost-latency curve, so you can pick the right one for your application.
Michelle Pokrass, OpenAI | Post Training Lead

The "Minimal Reasoning" Innovation:

  • New Parameter Option: "Minimal" reasoning effort setting
  • Latency Optimization: Fastest possible responses when needed
  • Unified Model Approach: No need to choose between different models
  • Flexible Reasoning: Dial in the exact reasoning effort required
Michelle Pokrass
So now you don't actually have to choose between a bunch of models, and you can use GPT-5 for all of your use cases and just dial in the reasoning effort.
Michelle Pokrass, OpenAI | Post Training Lead
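
The "one model, dial in the effort" idea can be sketched as a small request builder. This is a minimal sketch under stated assumptions: the model ID `gpt-5`, the `reasoning.effort` field, and the effort names other than "minimal" are not quoted in the talk, and `build_request` is a hypothetical helper that only assembles a dictionary without calling any API.

```python
# Hypothetical helper: builds request parameters only, no network call.
# The model ID and the `reasoning.effort` field are assumptions, not quoted in the talk.
ALLOWED_EFFORTS = ("minimal", "low", "medium", "high")


def build_request(prompt: str, effort: str = "medium", model: str = "gpt-5") -> dict:
    """Return a request payload with the chosen reasoning effort."""
    if effort not in ALLOWED_EFFORTS:
        raise ValueError(f"unknown reasoning effort: {effort!r}")
    return {
        "model": model,
        "input": prompt,
        "reasoning": {"effort": effort},
    }


# Same model for both requests; only the reasoning effort changes.
fast = build_request("Classify this ticket as billing or technical.", effort="minimal")
deep = build_request("Find the race condition in this scheduler.", effort="high")
```

The point of the design is that latency-sensitive and reasoning-heavy calls can share one model and differ in a single field.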

Custom Tools Revolution:

Beyond JSON Limitations:

  • Traditional Constraint: All function calling wrapped in JSON format
  • Length Problem: Extremely long arguments difficult to escape in JSON
  • Control Character Issues: Challenges with complex code in JSON structure
  • Solution: Free-form plain text custom tools
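
To make the escaping problem concrete, here is a small sketch using only Python's standard `json` module. It compares the same code snippet sent as a JSON string field with the snippet sent as plain text. The snippet and the `code` field name are illustrative assumptions, not taken from the talk.

```python
import json

# A small code snippet a model might want to pass to a tool (illustrative).
code = 'print("hello")\nprint("a\\tb")\n'

# JSON-style function calling: the code travels as an escaped string field.
as_json = json.dumps({"code": code})

# Free-form custom tool: the code travels exactly as written.
as_text = code

print(as_json)
print(len(as_text), len(as_json))

# The JSON form is lossless, but every quote, backslash, and newline is escaped.
assert json.loads(as_json)["code"] == code
```

For a two-line snippet the overhead is small, but it grows with every quote, backslash, and newline in a long argument, which is the constraint the plain-text tools are meant to remove.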

Advanced Structured Outputs:

  • Regular Expression Support: Constrain outputs with regex patterns
  • Context-Free Grammar: Advanced grammar-based output control
  • Custom DSL Integration: Support for domain-specific languages
  • SQL Fork Compatibility: Custom database language variants
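
What a regex constraint guarantees can be illustrated with a client-side check. This is only a sketch of the idea: the date pattern and the `satisfies_constraint` helper are hypothetical, and the talk does not describe how the API enforces the constraint during generation.

```python
import re

# Hypothetical constraint: the output must be an ISO-style date and nothing else.
DATE_PATTERN = re.compile(r"\d{4}-\d{2}-\d{2}")


def satisfies_constraint(output: str) -> bool:
    """True only if the whole output matches the pattern."""
    return DATE_PATTERN.fullmatch(output) is not None


print(satisfies_constraint("2025-08-07"))               # True
print(satisfies_constraint("The date is 2025-08-07."))  # False
```

With a constrained output, the second kind of response should not occur, so downstream code can skip this sort of validation and parsing cleanup.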

Enhanced Developer Experience:

Tool Call Preambles:

  • Explanation Capability: Model explains what it's about to do before acting
  • Extreme Steerability: Supercharged instruction following for preambles
  • Flexible Control: Preambles for every call, notable events only, or never
  • o3 Improvement: A capability that o3 lacked, now available in GPT-5

Verbosity Parameter:

  • Long-Awaited Feature: Finally available in the API
  • Three Levels: Low, medium, and high verbosity settings
  • Output Control: Precise control over response length and detail
  • Application Optimization: Tailor responses to specific use case needs
Michelle Pokrass
We've actually wanted this in the API for a long time, and now you can set verbosity to low, medium, and high to control how terse or expansive the model is with its outputs.
Michelle Pokrass, OpenAI | Post Training Lead
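
One way an application might use the three verbosity levels is to pick a level per task. The sketch below is hypothetical throughout: the task names, the `verbosity_for` helper, the model ID, and the `text.verbosity` field name are assumptions for illustration. Only the three level names come from the talk.

```python
# Hypothetical presets; the task names are illustrative only.
VERBOSITY_BY_TASK = {
    "commit_message": "low",   # terse output
    "code_review": "medium",
    "design_doc": "high",      # expansive output
}


def verbosity_for(task: str, default: str = "medium") -> str:
    """Pick a verbosity level for a task, falling back to a default."""
    return VERBOSITY_BY_TASK.get(task, default)


# The `text.verbosity` field name is an assumption, not quoted in the talk.
request = {
    "model": "gpt-5",
    "input": "Summarize this diff.",
    "text": {"verbosity": verbosity_for("commit_message")},
}
```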

Real-World Application Focus:

Engineering-Research Intersection:

  • Utility-First Training: Focused on real-world utility over benchmark performance
  • Practical Excellence: Optimized for actual developer workflows
  • Benchmark Success: Exceptional scores achieved as byproduct of utility focus
  • Developer Love: Designed for excellent developer experience

Timestamp: [43:15-45:45]

📊 How Does GPT-5's 97% Score on T² Benchmark Represent a 48-Point Leap in Just Two Months?

Unprecedented Performance Breakthroughs

SWE-bench Python Coding Excellence:

New High Score Achievement:

  • GPT-5 Performance: 74.9% accuracy on real-world Python coding tasks
  • o3 Comparison: Previous best of 69.1% surpassed significantly
  • Skill Demonstration: Superior performance on actual software engineering challenges

Aider Polyglot Multi-Language Mastery:

Universal Programming Capability:

  • 88% Score: Exceptional performance across multiple programming languages
  • Beyond Python: Comprehensive language support, not just single-language focus
  • Stark Improvement: Significant advancement over o3 performance
  • Language Agnostic: Universal coding intelligence across technology stacks

Front-End Development Superiority:

Human Trainer Evaluation:

  • 70% Preference Rate: Human trainers preferred GPT-5 over o3 70% of the time
  • Aesthetic Excellence: Improved visual design and user interface capabilities
  • Overall Enhancement: Better capabilities across all aspects of development
  • Real-World Quality: Production-ready front-end development

T² Benchmark Revolution:

Agentic Tool Calling Leadership:

  • Industry Baseline: No model scored more than 49% just two months ago
  • GPT-5 Achievement: 97% score - nearly doubling previous best performance
  • Real-World Scenario: Telecom industry problem-solving with user collaboration
  • Complex Problem Solving: Service troubleshooting requiring multi-step tool coordination
Michelle Pokrass
Just two months ago, no model in the field scored more than 49%, and today GPT-5 scores 97%.
Michelle Pokrass, OpenAI | Post Training Lead

Instruction Following Mastery:

Multiple Benchmark Excellence:

  1. COLLIE Benchmark: 99% score - near-perfect instruction following
  2. Scale Multi-Challenge: 70% score (10-point improvement over o3)
  3. Multi-Turn Capability: Superior performance on complex conversation flows

Real API Use Case Performance:

  • In-House Evaluation: Based on actual API usage patterns
  • Hard Subset: 64% score vs. 47% from o3 (17-point improvement)
  • Practical Relevance: Strong predictor of real application performance
  • Meaningful Improvement: Substantial advancement in practical utility
Michelle Pokrass
On the hard subset of this, GPT-5 scores 64%, up from 47% from o3. A pretty meaningful improvement.
Michelle Pokrass, OpenAI | Post Training Lead
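
The point gains quoted in this section can be checked with a few lines of arithmetic. The scores below are the ones cited above. Note that the 49% figure is an upper bound ("no model scored more than 49%"), so the T² gain is a minimum. The `gains` helper is hypothetical.

```python
# (previous best, GPT-5) score pairs as quoted in this section, in percent.
SCORES = {
    "SWE-bench": (69.1, 74.9),
    "T² telecom": (49.0, 97.0),
    "API hard subset": (47.0, 64.0),
}


def gains(scores: dict) -> dict:
    """Map each benchmark to (absolute gain in points, relative factor)."""
    return {
        name: (round(after - before, 1), round(after / before, 2))
        for name, (before, after) in scores.items()
    }


for name, (points, factor) in gains(SCORES).items():
    print(f"{name}: +{points} points, x{factor}")
# SWE-bench: +5.8 points, x1.08
# T² telecom: +48.0 points, x1.98
# API hard subset: +17.0 points, x1.36
```

The x1.98 factor is what "nearly doubling" refers to, while the same 48-point jump is the "48-point leap" in the section title.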

Extended Context Innovation:

400K Token Context Window:

  • Doubled Capacity: Increased from 200K tokens in o3
  • Effective Usage: Not just longer, but more effective context handling
  • State-of-the-Art Performance: Leading scores on long-context benchmarks

Long-Context Benchmark Leadership:

  • OpenAI MRCR: Superior performance on 128K to 256K context retrieval
  • Graph Walks BFS: Excellent reasoning over long-context inputs
  • BrowseComp Long Context: New open-source evaluation for challenging long-context questions
  • Reasoning Integration: Perfect merger of reasoning and extended context capabilities
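
A rough way to reason about the 400K-token window is a budget check before sending documents. This is a sketch under stated assumptions: the 4-characters-per-token ratio is a common rule of thumb for English text rather than a real tokenizer, the output reserve is a hypothetical parameter, and the talk as summarized here does not say how the window is split between input and output.

```python
CONTEXT_WINDOW_TOKENS = 400_000  # figure quoted in the talk
CHARS_PER_TOKEN = 4              # rough heuristic for English text (assumption)


def estimate_tokens(text: str) -> int:
    """Very rough token estimate; a real tokenizer would be more accurate."""
    return len(text) // CHARS_PER_TOKEN + 1


def fits_in_context(documents: list[str], reserve_for_output: int = 20_000) -> bool:
    """Check whether the documents fit, leaving room for the model's reply."""
    budget = CONTEXT_WINDOW_TOKENS - reserve_for_output
    return sum(estimate_tokens(doc) for doc in documents) <= budget


print(fits_in_context(["x" * 1_000_000]))  # True: about 250K estimated tokens
print(fits_in_context(["x" * 2_000_000]))  # False: about 500K estimated tokens
```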

Timestamp: [45:45-49:07]

🔧 Why Did OpenAI Prioritize "Real-World Utility" Over Perfect Benchmark Scores?

Engineering-Research Intersection Excellence

Training Philosophy Revolution:

Utility-First Approach:

  • Real-World Focus: Trained specifically for practical developer needs
  • Benchmark Byproduct: Exceptional scores achieved incidentally, not as primary goal
  • Engineering Integration: Perfect blend of engineering practicality and research innovation
  • Developer Experience: Optimized for actual working conditions and workflows

The Intersection Achievement:

Engineering Meets Research:

  • Practical Excellence: Superior performance in real development scenarios
  • Research Quality: Cutting-edge capabilities backed by scientific rigor
  • Balanced Optimization: Neither purely academic nor purely commercial
  • Holistic Success: Excellence across theoretical and practical dimensions
Michelle Pokrass
We think GPT-5 is the best model for developers. It was trained with a focus on real-world utility and less so on benchmarks, but we happen to pick up a few of those along the way.
Michelle Pokrass, OpenAI | Post Training Lead

Long-Context Usability Focus:

Beyond Raw Length:

  • Effective Implementation: Not just longer context, but more usable context
  • Retrieval Excellence: Superior ability to find relevant information in long contexts
  • Reasoning Integration: Combines extended context with advanced reasoning
  • Practical Application: Designed for real-world long-document scenarios

Open Source Contribution:

Community Investment:

  • BrowseComp Long Context Evaluation: New long-context benchmark released to community
  • Field Advancement: Spurring more research and development in long-context AI
  • Collaborative Progress: Supporting ecosystem-wide improvement
  • Benchmark Leadership: Leading by example in evaluation methodology
Michelle Pokrass
We're excited to spur on more work in this field.
Michelle Pokrass, OpenAI | Post Training Lead

Developer-Centric Innovation:

Anticipated Features:

  • Verbosity Control: Long-requested API feature finally delivered
  • Custom Tools: Addressing real limitations in JSON-based function calling
  • Structured Outputs: Advanced grammar and regex support for specific needs
  • Reasoning Flexibility: Adaptive reasoning effort for different use cases

Future Development Paradigm:

Revolutionary Potential:

  • Conversational Programming: Natural language as primary development interface
  • Amplified Capability: Individual developers achieving previously impossible scale
  • Creative Freedom: From imagination to implementation without traditional barriers
  • Global Impact: Tools that amplify what developers can accomplish for the world
Michelle Pokrass
We focused a lot on the intersection of engineering and research, and we think you'll really love working with this model.
Michelle Pokrass, OpenAI | Post Training Lead

Timestamp: [41:05-49:16]

💎 Key Insights from [41:05-49:16]

Essential Insights:

  1. Development Paradigm Shift: GPT-5 represents the evolution from "vibe coding" concept to production-ready agentic development, where complex applications emerge from natural language conversations
  2. API Architecture Innovation: The three-model lineup (GPT-5, Mini, Nano) with adjustable reasoning effort lets developers choose their own point on the speed-vs-intelligence tradeoff instead of switching between unrelated models
  3. Real-World Utility Focus: Training prioritized practical developer needs over benchmark performance, yet achieved record-breaking scores as a byproduct, demonstrating true engineering-research integration

Actionable Insights:

  • Unified Development Workflow: Developers can use a single model family across all use cases by adjusting reasoning effort, simplifying architecture decisions and reducing model management complexity
  • Advanced Tool Integration: Custom tools with regex/grammar constraints enable sophisticated domain-specific applications beyond traditional JSON limitations
  • Extended Context Applications: 400K token context with enhanced retrieval enables processing entire codebases, documentation sets, and complex multi-file projects in single conversations

Timestamp: [41:05-49:16]

📚 References from [41:05-49:16]

People Mentioned:

  • Greg Brockman - OpenAI President introducing GPT-5's revolutionary coding capabilities and agentic development vision
  • Michelle Pokrass - Research team leader focused on post-training improvements for power users, coding, and instruction following

Technologies & Models:

  • GPT-5 - Full-powered reasoning model for complex development applications
  • GPT-5 Mini - Balanced performance model for standard use cases
  • GPT-5 Nano - Speed-optimized model for latency-sensitive applications
  • GPT-4.1 - Previous generation coding model referenced for comparison
  • o3 Model - Baseline model for performance comparisons across benchmarks

API Features & Enhancements:

  • Custom Tools - Free-form plain text tool calling beyond JSON constraints
  • Structured Outputs - Regular expression and context-free grammar output constraints
  • Tool Call Preambles - Model explanations before executing tool calls
  • Verbosity Parameter - Low, medium, high settings for response length control
  • Minimal Reasoning - New parameter for fastest possible responses

Performance Benchmarks:

Context & Memory:

  • 400K Token Context - Extended context window (doubled from o3's 200K)
  • OpenAI MRCR - Long-context retrieval capability benchmark
  • Graph Walks BFS - Long-context reasoning evaluation
  • BrowseComp Long Context - New open-source long-context evaluation

Concepts & Frameworks:

  • Vibe Coding - Conversational programming paradigm introduced in 2021
  • Agentic Coding - Autonomous, multi-step development task completion
  • Cost-Latency Curve - Model selection optimization across performance and speed
  • Engineering-Research Intersection - Balance between practical utility and scientific advancement
  • Real-World Utility Training - Focus on practical developer needs over benchmark optimization

Technical Specifications:

Industry Applications:

  • Telecom Problem Solving - T² benchmark scenario for service troubleshooting
  • Front-End Development - Web development with enhanced aesthetic capabilities
  • Multi-Language Programming - Universal coding support across technology stacks
  • Long-Document Processing - Extended context for comprehensive codebase analysis

Timestamp: [41:05-49:16]

🤝 How Did GPT-5 Learn to Be the "Ideal Pair Programmer" That Feels Right to Work With?

The Science of AI Personality Design

Greg Brockman
Benchmarks, they're exciting numbers, but we're starting to saturate them. When you're moving between 98 and 99% in some benchmark, it means you need something else to really capture how great the model is.
Greg Brockman, OpenAI | President
Greg Brockman
One thing we've done very differently with this model is really focus on not just these numbers, but really on real-world application, it being really useful to you in your daily workflow.
Greg Brockman, OpenAI | President

Beyond Technical Excellence:

The Complete Developer Partner:

  • Software Engineering Mastery: Deep understanding of best practices and methodologies
  • Personality Integration: Feels natural and comfortable to collaborate with
  • Out-of-the-Box Perfection: Default behavior optimized for immediate productivity
  • Pair Programming Recreation: Authentic collaborative development experience
Brian Fioca
To recreate the ideal pair programmer, you need a model that understands best software engineering practices, but has a personality that just feels right to work with. For GPT-5, we worked really hard to make the model pair perfectly with you by default, out of the box.
Brian Fioca, OpenAI | Solutions Architect

The Four Personality Traits Framework:

Research-Driven Development:

  1. Autonomy: Independent problem-solving and task completion
  2. Collaboration and Communication: Seamless teamwork with clear, helpful, and contextually appropriate interaction
  3. Context Management: Understanding and maintaining project scope and requirements
  4. Testing: Quality assurance and reliability focus
Brian Fioca
We started by talking to users and customers about how our models perform in the most popular coding tools like Cursor. And we identified frustrations and rough edges. And we boiled it all down into four personality traits.
Brian Fioca, OpenAI | Solutions Architect

Customer-Centric Training Process:

Real-World Feedback Integration:

  • User Research: Direct conversations with developers using popular coding tools
  • Cursor Integration: Specific optimization for industry-leading development environments
  • Frustration Identification: Systematic mapping of pain points and rough edges
  • Rubric Development: Personality traits converted into measurable training criteria

Collaborative Teammate Evolution:

  • Behavioral Tuning: Iterative refinement until natural collaboration achieved
  • Practical Testing: Real developers using the model in actual workflows
  • Continuous Improvement: Ongoing refinement based on usage patterns
  • Trust Building: Designed to feel like working with a reliable human partner

The Training Philosophy:

Practice-First Approach:

  • Real Behavior Focus: How model actually performs in daily workflows
  • User Need Prioritization: What developers genuinely want from AI assistance
  • Model Training Integration: Feedback directly incorporated into training process
  • Practical Excellence: Beyond benchmarks to genuine utility

Timestamp: [49:24-52:29]

🐛 What Makes GPT-5's Bug-Fixing Approach So Different That It Succeeded Where o3 Failed?

Intelligent Problem-Solving & Communication

The Challenging Bug Scenario:

Previous Failure Context:

  • Live Stream Bug: Issue covered up during previous demonstration
  • o3 Inability: Previous model couldn't resolve the problem
  • GPT-5 Test: Model challenged to fix what defeated its predecessor
  • Demo Risk: "Taunting the demo gods" with live problem-solving
Brian Fioca
So last month I was on a different live stream, and towards the end I ran into a bug that I covered up... I tried to have o3 fix it for me, and it couldn't.
Brian Fioca, OpenAI | Solutions Architect

Superior Communication Strategy:

Upfront Planning:

  • Plan Communication: Explains approach before starting work
  • Bug Hunting Strategy: Details how it will search for and identify issues
  • Fix Methodology: Outlines potential solution approaches
  • Trust Building: Transparent communication builds developer confidence

Real-Time Updates:

  • Progress Reporting: Continuous updates on current activities
  • Search Feedback: "It searches faster than me" - superior investigation speed
  • Best Practices: Uses same methodologies as experienced developers
  • Power Amplification: More powerful than human developers while following familiar patterns

Intelligent Context Awareness:

Smart Problem Analysis:

  • Relevant Focus: Identifies and ignores unrelated linting issues
  • Targeted Fixes: Avoids unnecessary edits beyond the specific bug
  • Code Quality: Ensures shippable code before completion
  • Testing Integration: Runs builds and tests for reliability verification

The 45-Minute Docker Miracle:

Autonomous Complex Task:

  • Test Harness Refactoring: Converted existing system to parallel Docker execution
  • Time Pressure: Team was "pressed for time" with urgent deadline
  • Unattended Success: "Set it off—came back, like, 45 minutes later—it just finished"
  • First-Time Success: "Tested it out and it ran the first time"
Brian Fioca
We were pressed for time and we had it refactor one of our test harnesses to run in parallel on Docker... came back 45 minutes later—it just finished, and we tested it out and it ran the first time.
Brian Fioca, OpenAI | Solutions Architect
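
The talk gives no details of the refactored harness, so the following is only an illustrative sketch of what "run in parallel on Docker" can look like. The image name, the use of `pytest`, the shard paths, and all function names are hypothetical. The runner is injectable so the logic can be exercised without Docker installed.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor

IMAGE = "my-tests:latest"  # hypothetical image name


def docker_command(shard: str) -> list[str]:
    """Build the command for one test shard (assumes pytest inside the image)."""
    # `--rm` removes the container once the tests exit.
    return ["docker", "run", "--rm", IMAGE, "pytest", shard]


def run_shard(shard: str, runner=subprocess.run) -> int:
    """Run one shard and return its exit code."""
    return runner(docker_command(shard)).returncode


def run_all(shards: list[str], runner=subprocess.run, workers: int = 4) -> dict[str, int]:
    """Run all shards concurrently, one container per shard."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        codes = pool.map(lambda shard: run_shard(shard, runner), shards)
        return dict(zip(shards, codes))
```

Calling `run_all(["tests/unit", "tests/api"])` would start one container per shard and return a mapping from shard to exit code, where a nonzero value marks a failing shard.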

Professional Development Behavior:

Software Engineering Excellence:

  • Lint Analysis: Recognizes which warnings are relevant to current task
  • Build Verification: Ensures code compiles and functions correctly
  • Test Execution: Runs appropriate test suites for quality assurance
  • Shipping Standards: Maintains production-ready code quality

Timestamp: [50:39-53:28]

🎯 How Does GPT-5's "Meta-Prompting" Capability Create the First Truly Trustworthy AI Developer?

Beyond Vibe Coding to Professional-Grade Development

Advanced Customization Features:

Complete Steerability:

  • System Prompts: Full control through custom system instructions
  • Cursor Rules: Integration with popular development environment settings
  • Verbosity Control: Adjustable communication detail levels
  • Reasoning Levels: Customizable thinking depth for different tasks
Brian Fioca
And the best part, GPT-5 is totally tunable. You can steer it with system prompts or Cursor rules. You can change its verbosity levels or reasoning levels to match your tasks.
Brian Fioca, OpenAI | Solutions Architect

Self-Improvement Intelligence:

Meta-Prompting Innovation:

  • Self-Modification: Can improve its own prompts when guidance is needed
  • Adaptive Learning: Adjusts approach based on user feedback and preferences
  • Stuck Resolution: Actively helps when development process stalls
  • Continuous Enhancement: Evolves its own instructions for better performance
Brian Fioca
GPT-5 is actually really good at modifying its own prompts by meta-prompting.
Brian Fioca, OpenAI | Solutions Architect
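
Meta-prompting here means asking the model to revise its own instructions. A minimal sketch of how such a request might be composed is shown below. The wording of the template and the `build_meta_prompt` helper are hypothetical, and the talk does not prescribe a specific format.

```python
def build_meta_prompt(current_prompt: str, observed_problem: str) -> str:
    """Compose a request asking the model to revise its own instructions."""
    return (
        "You are reviewing the system prompt below.\n"
        f"Observed problem: {observed_problem}\n"
        "Suggest a minimal edit to the prompt that would fix this problem, "
        "and return the full revised prompt.\n"
        "--- CURRENT PROMPT ---\n"
        f"{current_prompt}\n"
        "--- END PROMPT ---"
    )


meta = build_meta_prompt(
    current_prompt="You are a coding assistant. Always run the linter.",
    observed_problem="The assistant fixes unrelated lint warnings.",
)
print(meta)
```

The revised prompt returned by the model would then replace the current one, for example in a system prompt or a rules file.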

State-of-the-Art Achievement:

Zero-Shot Excellence:

  • Complex Task Reliability: Handles most challenging development scenarios
  • No Training Required: Immediate effectiveness without task-specific preparation
  • Professional Quality: Production-ready output from first attempt
  • Consistent Performance: Reliable results across diverse coding challenges
Brian Fioca
After using this for the past few weeks, it really feels like we've achieved state-of-the-art zero-shot performance and reliability across the most complex coding tasks. For me, it's the first time I trust a model to do my most important work.
Brian Fioca, OpenAI | Solutions Architect

Trust Breakthrough:

Professional Confidence:

  • Most Important Work: First model developers trust with critical projects
  • Beyond Exploration: Moves past experimental "vibe coding" to serious development
  • Powerful Tool: Genuine utility for professional development workflows
  • Death Loop Avoidance: Stays productive and doesn't get stuck in infinite loops

Real-World Workflow Integration:

Daily Development Support:

  • Benchmark Saturation: Moving beyond 98-99% scores to practical utility
  • Workflow Focus: Optimized for actual developer daily routines
  • Application Priority: Real-world usefulness over academic performance
  • Practical Excellence: Genuine utility in professional development environments

The Professional Developer Experience:

Collaborative Intelligence:

  • Autonomy Balance: Independent yet collaborative approach
  • Communication Excellence: Clear, helpful, contextually appropriate interaction
  • Context Management: Maintains project understanding throughout development
  • Testing Integration: Quality assurance built into development process

Timestamp: [53:59-54:43]

🏗️ What Does Moving From "Vibe Coding" to "Incredibly Powerful Tool" Mean for the Future of Development?

The Evolution of AI-Assisted Programming

The Benchmark Limitation Reality:

Beyond Numbers:

  • Saturation Problem: Moving between 98-99% on benchmarks lacks meaningful differentiation
  • Real-World Focus: Practical application more important than academic scores
  • Daily Workflow: Genuine utility in professional development environments
  • User Experience: How model feels to use matters more than test performance

The Grind of Practical Excellence:

Research-Practice Integration:

  • Behavioral Analysis: Deep study of how model performs in actual usage
  • User Need Discovery: Understanding what developers genuinely want
  • Training Integration: Feedback directly incorporated into model development
  • Continuous Refinement: Ongoing improvement based on real-world usage

Professional Development Transformation:

Trust Evolution:

  • First Trustworthy Model: Suitable for most important development work
  • Professional Confidence: Reliable enough for critical business applications
  • Quality Assurance: Maintains shipping standards throughout development
  • Autonomous Excellence: Independent problem-solving without getting stuck
Brian Fioca
This is beyond vibe coding. It's an incredibly powerful tool, and I'm really excited for people to try it.
Brian Fioca, OpenAI | Solutions Architect

The Four Pillars Implementation:

Collaborative Excellence:

  1. Autonomy: Independent task completion with minimal supervision
  2. Collaboration and Communication: Natural teamwork with clear, helpful, and contextually appropriate interaction
  3. Context Management: Project understanding and scope maintenance
  4. Testing: Built-in quality assurance and reliability focus

Future Development Paradigm:

Revolutionary Potential:

  • Incredible Power: Beyond experimental coding to serious development tool
  • Complex Task Reliability: Handles challenging scenarios with consistency
  • Meta-Learning: Self-improvement and adaptive prompt optimization
  • Professional Integration: Seamless workflow incorporation for daily development

The Excitement Factor:

Developer Anticipation:

  • Public Availability: Excitement for widespread developer access
  • Transformative Potential: Genuine change in how development work is accomplished
  • Tool Evolution: From interesting experiment to essential development resource
  • Professional Impact: Significant enhancement of developer capability and productivity
Greg Brockman
It's been really amazing to see the team doing the grind of going and seeing how this model behaves in practice, figuring out what people really want, and putting that back into model training.
Greg Brockman, OpenAI | President

Timestamp: [49:24-54:43]

💎 Key Insights from [49:24-54:43]

Essential Insights:

  1. Personality-Driven Development: GPT-5's success comes from intentionally designing AI personality traits (autonomy, collaboration, communication, context management, testing) that make it feel like a natural development partner
  2. Trust Through Reliability: The transition from experimental "vibe coding" to professional-grade development tool represents the first AI model developers trust with their most important work
  3. Meta-Learning Capability: GPT-5's ability to revise its own prompts and adapt to user preferences points to a new degree of self-correction and continuous improvement

Actionable Insights:

  • Workflow Integration: Developers can immediately integrate GPT-5 into professional development environments like Cursor with customizable verbosity and reasoning levels for different task requirements
  • Complex Task Automation: The 45-minute Docker refactoring success demonstrates GPT-5's ability to handle sophisticated, multi-step development tasks autonomously while maintaining production quality
  • Communication-Driven Development: GPT-5's upfront planning and real-time progress updates create a transparent development process that builds trust and enables better collaboration
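The test harness refactoring mentioned above moved a sequential system to parallel execution. A minimal sketch of that general pattern follows, assuming each test suite is independent of the others. The `run_suite` function and the suite names are hypothetical stand-ins; the actual Docker-based harness from the demo is not reproduced here.

```python
from concurrent.futures import ThreadPoolExecutor
import time

def run_suite(name: str) -> tuple[str, bool]:
    """Hypothetical stand-in for running one test suite (e.g. in its own container)."""
    time.sleep(0.05)  # simulate the time a suite takes
    return name, True

def run_sequential(suites: list[str]) -> dict[str, bool]:
    """Baseline: run suites one after another."""
    return dict(run_suite(s) for s in suites)

def run_parallel(suites: list[str], workers: int = 4) -> dict[str, bool]:
    """Run each suite in a worker thread and collect results by suite name."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return dict(pool.map(run_suite, suites))

suites = ["unit", "integration", "lint", "build"]
results = run_parallel(suites)
print(results)
```

Both functions return the same mapping; the parallel version finishes sooner only when the suites share no state, which is the property a refactoring like this has to establish first.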

Timestamp: [49:24-54:43]

📚 References from [49:24-54:43]

People Mentioned:

  • Greg Brockman - OpenAI President emphasizing real-world application over benchmark performance
  • Brian Fioca - Solutions architect on startups team demonstrating Cursor integration and bug fixing
  • Adi Ganesh - Post-training team researcher explaining personality trait development and training methodology

Technologies & Tools:

  • Cursor - Popular development environment used for GPT-5 integration demonstration
  • Docker - Containerization technology used in parallel test harness refactoring example
  • System Prompts - Customization method for steering GPT-5 behavior
  • Cursor Rules - Development environment-specific configuration for AI behavior
  • Meta-Prompting - GPT-5's ability to modify its own prompts for improved performance

Development Concepts:

  • Pair Programming - Collaborative development methodology that GPT-5 recreates
  • Zero-Shot Performance - Model capability on a task without task-specific examples or fine-tuning
  • Death Loops - Problematic AI behavior where models get stuck in repetitive cycles
  • Lint Analysis - Code quality checking and error identification
  • Build Verification - Ensuring code compiles and functions correctly
  • Test Harness - Framework for running automated tests

Personality Traits:

  • Autonomy - Independent problem-solving and task completion capability
  • Collaboration - Seamless teamwork and cooperative development approach
  • Communication - Clear, helpful, and contextually appropriate interaction
  • Context Management - Understanding and maintaining project scope and requirements
  • Testing - Quality assurance and reliability focus in development process

Performance Metrics:

  • Benchmark Saturation - Situation where models achieve 98-99% scores making differentiation difficult
  • Real-World Application - Practical utility in daily development workflows
  • State-of-the-Art Performance - Leading capability across complex coding tasks
  • Professional Trust - Developer confidence in AI for critical work

Development Scenarios:

  • Bug Fixing - Live demonstration of problem identification and resolution
  • Test Harness Refactoring - Complex 45-minute autonomous development task
  • Parallel Processing - Converting sequential systems to concurrent execution
  • Code Quality Assurance - Maintaining shipping standards throughout development

Customization Features:

  • Verbosity Levels - Adjustable communication detail and response length
  • Reasoning Levels - Customizable thinking depth for different task requirements
  • Steerability - Ability to guide and control AI behavior through various methods
  • Adaptive Behavior - Model adjustment based on user feedback and preferences
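For API users, the verbosity and reasoning levels above are set per request. The sketch below only assembles the request arguments and makes no network call. It assumes the OpenAI Responses API parameter names `reasoning` (with `effort`), `text` (with `verbosity`), and `instructions`; the `build_request` helper and the preset names are hypothetical, and the accepted values should be checked against the current API reference.

```python
# Hypothetical presets pairing a task type with reasoning effort and verbosity.
PRESETS = {
    "quick_fix": {"effort": "minimal", "verbosity": "low"},
    "refactor": {"effort": "high", "verbosity": "medium"},
}

def build_request(prompt: str, preset: str, system_prompt: str = "") -> dict:
    """Assemble keyword arguments for a Responses API call (no network access)."""
    p = PRESETS[preset]
    request = {
        "model": "gpt-5",
        "input": prompt,
        "reasoning": {"effort": p["effort"]},   # assumed parameter name
        "text": {"verbosity": p["verbosity"]},  # assumed parameter name
    }
    if system_prompt:
        request["instructions"] = system_prompt  # system-level steering
    return request

req = build_request(
    "Fix the failing lint check.",
    "quick_fix",
    system_prompt="Explain your plan before editing files.",
)
print(req)
# With the official SDK this would be passed as: client.responses.create(**req)
```

Keeping the presets in one table makes it easy to use low effort for small fixes and reserve high effort for long refactors.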

Timestamp: [49:24-54:43]

💼 How Does GPT-5 Transform "Couple of Days" of Work Into 5 Minutes of Beautiful Design?

Professional Dashboard Creation Revolution

The CFO Dashboard Challenge:

Complex Business Requirements:

  • Target Audience: CFO of a startup requiring financial visualization
  • Design Specifications: Beautiful, tastefully designed with clear hierarchy
  • Functionality: Interactive elements with easy focus on important metrics
  • Technical Requirements: Specific frameworks (Next.js, Tailwind CSS)
Greg Brockman
How long do you think this kind of task would take you?
Greg Brockman, OpenAI | President
Adi Ganesh
Yeah, easily at least a couple of days. I'm not a front-end expert. Just to understand the latest frameworks and piece everything together would—yeah, easily take me a few days.
Adi Ganesh, OpenAI | Post Training Researcher

The Time Transformation:

Traditional Development Reality:

  • Estimated Time: "Easily at least a couple of days"
  • Technical Barriers: Understanding latest frameworks and integration challenges
  • Expertise Requirements: Front-end specialization and design skills
  • Implementation Complexity: Piecing together multiple technologies

GPT-5 Achievement:

  • Actual Time: 5 minutes from prompt to working application
  • Complete Solution: From scratch Next.js project with full functionality
  • Professional Quality: Production-ready code with modular architecture
  • Self-Improvement: Model iterates on its own code through build-error cycles

Advanced Aesthetic Intelligence:

Design Philosophy:

  • Good Aesthetics by Default: Beautiful results from concise prompts
  • Intent Inference: Understanding user goals from minimal specifications
  • Steerability: Precise instruction following when detailed requirements provided
  • Best of Both Worlds: Flexibility for both quick prototypes and detailed specifications
Adi Ganesh
How we trained GPT-5 to be a great front-end coding model. We tried to follow the principle of giving it good aesthetics by default but also making it steerable. So if I give the model a concise prompt, it should be able to infer my intent and make something that looks great by default.
Adi Ganesh, OpenAI | Post Training Researcher
Adi Ganesh
On the other hand, if I'm specific about a layout or frameworks that I want the model to use, it should follow my instructions precisely. And this makes it the best of both worlds for developers.
Adi Ganesh, OpenAI | Post Training Researcher

Training Excellence:

  • Typography Mastery: Superior understanding of text design and hierarchy
  • Color Intelligence: Sophisticated color palette and scheme selection
  • Spacing Expertise: Professional layout and visual breathing room
  • Detail Understanding: Eclipses previous models in design comprehension

The Creative Self-Improvement Loop:

Autonomous Development Process:

  1. Code Generation: Initial implementation from requirements
  2. Build Execution: Running builds and capturing errors
  3. Error Analysis: Streaming build feedback back to model
  4. Iterative Improvement: Self-correction and code enhancement
  5. Quality Assurance: Ensuring production-ready final result
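The steps above can be sketched as a small control loop: build, capture the errors, hand them to a fixer, and repeat. This is an illustrative sketch only, not how GPT-5 or Cursor implement it. The `propose_fix` callback is a hypothetical stand-in for the model editing code, and the simulated build replaces a real toolchain.

```python
import subprocess
from typing import Callable

def run_build(command: list[str]) -> tuple[bool, str]:
    """Run a build command and return (succeeded, combined output)."""
    proc = subprocess.run(command, capture_output=True, text=True)
    return proc.returncode == 0, proc.stdout + proc.stderr

def build_fix_loop(
    build: Callable[[], tuple[bool, str]],
    propose_fix: Callable[[str], None],
    max_rounds: int = 5,
) -> bool:
    """Build, feed errors back to the fixer, and repeat until the build passes."""
    for _ in range(max_rounds):
        ok, output = build()
        if ok:
            return True
        propose_fix(output)  # hypothetical: the model edits code given the errors
    return False

# Simulated project: the build fails until two errors have been fixed.
state = {"errors": 2}

def fake_build() -> tuple[bool, str]:
    return state["errors"] == 0, f"{state['errors']} error(s)"

def fake_fix(log: str) -> None:
    state["errors"] -= 1

print(build_fix_loop(fake_build, fake_fix))  # prints: True
```

In a real Next.js project the build step might be `lambda: run_build(["npm", "run", "build"])`. The `max_rounds` cap keeps the loop from cycling forever on an error the fixer cannot resolve.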

Professional Results:

Complete Feature Set:

  • Financial Metrics: ARR, cash flow, revenue visualization
  • Interactive Charts: Hover tooltips with precise data values
  • Customer Analytics: Segmented customer data and growth tracking
  • Date Filtering: Dynamic date picker for temporal analysis
  • Modular Architecture: KPI cards, revenue charts, sample data components
Adi Ganesh
And this is, for me, a profound moment to see that the model could write code but also run builds, stream the errors back, and iterate on the code. So it's able to improve its own code in this sort of self-improvement loop.
Adi Ganesh, OpenAI | Post Training Researcher
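The dashboard features listed under Complete Feature Set rest on simple data operations. Below is a minimal sketch of two of them, ARR and date filtering, written in Python rather than the Next.js and TypeScript stack used in the demo. The sample records, field names, and function names are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date

@dataclass
class MonthlyRecord:
    month: date      # first day of the month
    mrr: float       # monthly recurring revenue
    cash_in: float
    cash_out: float

def arr(record: MonthlyRecord) -> float:
    """Annual recurring revenue as the usual run rate: MRR times 12."""
    return record.mrr * 12

def filter_by_date(records: list[MonthlyRecord], start: date, end: date) -> list[MonthlyRecord]:
    """Keep records whose month falls within [start, end], as a date picker would."""
    return [r for r in records if start <= r.month <= end]

# Hypothetical sample data, standing in for the demo's sample-data component.
SAMPLE = [
    MonthlyRecord(date(2025, 1, 1), 100_000, 120_000, 150_000),
    MonthlyRecord(date(2025, 2, 1), 110_000, 125_000, 140_000),
    MonthlyRecord(date(2025, 3, 1), 125_000, 140_000, 135_000),
]

selected = filter_by_date(SAMPLE, date(2025, 2, 1), date(2025, 3, 31))
arr_values = [arr(r) for r in selected]
net_cash = sum(r.cash_in - r.cash_out for r in selected)
print(arr_values, net_cash)
```

A KPI card would display values like `arr_values[-1]`, and the chart tooltips would read from the same filtered records.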

Timestamp: [54:57-1:02:24]

🎨 What Makes GPT-5 the "First Model with a Sense of Creativity" Whose Aesthetics Its Own Developers Defer To?

Revolutionary Creative Intelligence & Design Excellence

The Aesthetic Superiority Discovery:

Human vs. AI Design Judgment:

  • Testing Evolution: A/B testing different model versions for UI quality
  • Human Limitation: Researchers eventually could not tell which model version produced the better UI
  • Expert Consultation: Designers were pulled in to teach the team what was better
  • AI Advantage: Differences between versions became too subtle for non-designers to judge
Brian Fioca
During testing, we were looking at the A's and B's for different versions of the model to see if it was doing better at UI. And at some point, we stopped being able to tell and actually had to pull in designers to teach us what was better.
Brian Fioca, OpenAI | Solutions Architect

Personal Aesthetic Deference:

  • Developer Confession: "I feel like the model has better aesthetics than me"
  • Practical Impact: Developers defer to model's design judgment
  • Default Excellence: Model's defaults are consistently great
  • Creative Partnership: AI as aesthetic guide for uncertain design decisions

Training for Creative Excellence:

Design Mastery Development:

  • Typography Excellence: Advanced understanding of text design principles
  • Color Sophistication: Professional-level color theory and palette selection
  • Spacing Intelligence: Perfect visual hierarchy and layout principles
  • Detail Comprehension: Unprecedented attention to design nuances

Ambitious Yet Coherent:

  • Creative Ambition: Goes above and beyond basic requirements
  • Adherence Balance: Stays true to specified prompts while adding value
  • Coherent Enhancement: Improvements that make sense within context
  • Quality Focus: Not just code generation, but high-quality, mergeable code

The Complete Development Lifecycle:

Beyond Code Generation:

  • Proper Abstractions: Thoughtful code organization and structure
  • Documentation: Comprehensive README files and code explanations
  • Modular Design: Component-based architecture for maintainability
  • Communication: Clear explanation of development decisions and approaches

Professional Standards:

  • Mergeable Code: Production-ready quality from first generation
  • Industry Practices: Following established software development standards
  • Scalable Architecture: Code structured for future enhancement and maintenance
  • Quality Assurance: Built-in testing and validation processes

Creative Intelligence Breakthrough:

First True Creative AI:

  • Creativity Recognition: "This is the first model I've worked with that actually has a sense of creativity"
  • Profound Experience: Working with genuinely creative artificial intelligence
  • Future Potential: Unlocking human creativity through AI partnership
  • Revolutionary Capability: Moving beyond functional to genuinely creative output
Adi Ganesh
Working with GPT-5 has been really fun and profound for me because, for me, this is the first model I've worked with that actually has a sense of creativity.
Adi Ganesh, OpenAI | Post Training Researcher

Aesthetic Training Evolution:

Model Development Process:

  • Aesthetic Preference Evolution: Observable improvement in design choices during training
  • Designer Integration: Professional designers involved in training evaluation
  • Quality Differentiation: Ability to distinguish subtle design improvements
  • Creative Standards: Achieving professional design excellence through AI

Timestamp: [58:30-1:05:04]

🏰 How Does a "Beautiful Castle" Prompt Create a 3D World with Interactive Characters and Mini-Games?

Creative Gaming & 3D World Generation

The 3D Castle Game Vision:

Personal Creative Project:

  • Family Motivation: Creating game for younger cousin
  • 3D Requirements: Castle-based three-dimensional environment
  • Interactive Elements: People patrolling walls, movement, horses
  • Mini-Game Integration: Balloon-popping with sound effects

Extraordinary Creative Output:

Visual Excellence:

  • Floating Rock Architecture: Creative environmental design choices
  • 3D Castle Complexity: Detailed medieval fortress construction
  • Guard Animation: Autonomous characters walking patrol routes
  • Cannon Functionality: Interactive firing mechanisms with visual effects

Rich Interactive Features:

  • Character Dialogue: Named NPCs with personality and conversation
  • Captain Rowan: Military character with authentic responses
  • Merchant Interaction: Commercial character with appropriate dialogue
  • Wisdom Sharing: Characters provide contextual advice and philosophy

Advanced Game Mechanics:

Balloon-Popping Mini-Game:

  • Sound Integration: Audio feedback for successful balloon hits
  • Interactive Targeting: Click-based shooting mechanics
  • Dynamic Movement: Moving balloon targets with varying difficulty
  • Score Feedback: Immediate response to player actions
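The mechanics above reduce to moving targets plus a click hit test. The sketch below shows that logic in isolation; it is not the code GPT-5 generated in the demo, and the `Balloon` class and `handle_click` function are hypothetical names.

```python
from dataclasses import dataclass
import math

@dataclass
class Balloon:
    x: float
    y: float
    vx: float        # horizontal speed in units per second
    radius: float
    popped: bool = False

    def update(self, dt: float) -> None:
        """Move the balloon; faster balloons make harder targets."""
        self.x += self.vx * dt

    def contains(self, px: float, py: float) -> bool:
        """True if the point (px, py) lies inside the balloon."""
        return math.hypot(px - self.x, py - self.y) <= self.radius

def handle_click(balloons: list[Balloon], px: float, py: float) -> int:
    """Pop every unpopped balloon under the click and return the points scored."""
    score = 0
    for b in balloons:
        if not b.popped and b.contains(px, py):
            b.popped = True
            score += 1  # a real game would also play a pop sound here
    return score

balloons = [Balloon(0, 0, 10, 5), Balloon(100, 0, -20, 5)]
for b in balloons:
    b.update(1.0)  # after one second they sit at x=10 and x=80

first = handle_click(balloons, 12, 3)   # inside the first balloon
second = handle_click(balloons, 12, 3)  # same spot, already popped
print(first, second)  # prints: 1 0
```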

Historical Accuracy & Humor:

  • "Historically Accurate Balloons": Playful acknowledgment of anachronisms
  • Character Names: Thoughtful NPC naming and personality development
  • Cultural Dialogue: Appropriate medieval-style conversations
  • Authentic Atmosphere: Immersive period-appropriate environment

Technical Sophistication:

3D Engine Implementation:

  • Complex Geometry: Castle architecture with multiple levels and details
  • Animation Systems: Character movement and patrol behaviors
  • Physics Integration: Projectile mechanics for cannon firing
  • Audio System: Sound effects for interactive elements

Creative Interpretation:

  • Aesthetic Sense: Beautiful visual design from minimal prompt
  • Creative Liberty: Thoughtful additions beyond basic requirements
  • Environmental Design: Cohesive world-building and atmosphere
  • Interactive Innovation: Multiple layers of user engagement

Development Efficiency:

Time Investment Reality:

  • Traditional Approach: Would require extensive 3D development expertise
  • Game Engine Knowledge: Understanding of complex 3D graphics systems
  • Animation Programming: Character movement and interaction systems
  • Audio Integration: Sound system implementation and management
Adi Ganesh
So, it's just wild how from a concise prompt, the model has this great sense of aesthetics where it's made this floating rock, made a 3D castle, and if you zoom in, you can see tons of detail—these guards that are walking around, cannons firing.
Adi Ganesh, OpenAI | Post Training Researcher

Timestamp: [1:02:29-1:04:53]

⚡ How Does GPT-5 Turn Data Visualization Hell Into 5-Minute Magic?

The Future of Accelerated Development

The D3 Visualization Challenge:

Traditional Development Reality:

  • Complex Implementation: Interactive hover tooltips requiring extensive D3 programming
  • Time Investment: "Five hours" for experienced developer to implement
  • Technical Expertise: Deep knowledge of data visualization libraries required
  • Integration Complexity: Combining multiple technologies for cohesive result
Brian Fioca
It would take me five hours to do that in D3.
Brian Fioca, OpenAI | Solutions Architect

GPT-5 Achievement:

  • Instant Implementation: Hover tooltips created as part of complete dashboard
  • No Specialization Required: Works for developers without front-end expertise
  • Complete Integration: Seamlessly integrated with entire application
  • Professional Quality: Production-ready interactive elements

The Concise Prompt Power:

Minimal Input, Maximum Output:

  • Simple Request: Basic dashboard requirements with audience specification
  • Creative Interpretation: Model infers sophisticated requirements from brief description
  • Beautiful Results: Professional aesthetic without detailed design specifications
  • 5-Minute Delivery: Complete application from concept to running code
Adi Ganesh
Yeah, it's crazy that this prompt is so concise and it's able to just give me something that looks beautiful in just 5 minutes.
Adi Ganesh, OpenAI | Post Training Researcher

Future Development Paradigm:

Acceleration Implications:

  • Individual Amplification: Single developers achieving team-level output
  • Barrier Elimination: No excuse for ugly internal applications
  • Creative Unlocking: AI partnership enabling unprecedented creative expression
  • Workflow Revolution: Fundamental change in development speed and capability
Greg Brockman
It's definitely a good taste of what the future holds as well, right? When you really think about where these models can go and how much they can accelerate developers in kind of all aspects of what we all collectively do.
Greg Brockman, OpenAI | President

The Self-Improvement Revolution:

Autonomous Development Loop:

  • Build Integration: Model runs its own builds to test code
  • Error Streaming: Real-time feedback from build processes
  • Iterative Enhancement: Self-correction and improvement cycles
  • Quality Assurance: Ensuring production-ready results through automation

Professional Impact:

Industry Transformation:

  • Speed Multiplication: Tasks reduced from days to minutes
  • Quality Enhancement: Strong aesthetic defaults that the demo team preferred to their own design judgment
  • Accessibility: Professional development capability for non-experts
  • Creative Partnership: AI as collaborative creative intelligence
Greg Brockman
There will no longer be an excuse for ugly internal applications.
Greg Brockman, OpenAI | President

The Profound Moment:

Technical Breakthrough Recognition:

  • Self-Improvement Loop: Model improving its own code through build feedback
  • Future Preview: Glimpse of fully autonomous development capabilities
  • Acceleration Potential: All aspects of development significantly enhanced
  • Collective Impact: Amplifying what developers can accomplish together

Timestamp: [54:43-1:05:10]

💎 Key Insights from [54:43-1:05:10]

Essential Insights:

  1. Creative Intelligence Emergence: Adi Ganesh describes GPT-5 as the first model he has worked with that has a sense of creativity; during testing, UI differences between model versions became subtle enough that the team brought in designers to judge which was better
  2. Development Time Compression: Complex applications that traditionally require days of work can now be created in minutes, with professional quality and self-improving code that runs builds and fixes its own errors
  3. Aesthetic Superiority: The model's design capabilities have evolved to the point where experienced developers defer to its aesthetic judgment, creating beautiful results by default from minimal prompts

Actionable Insights:

  • Internal Application Revolution: Organizations can eliminate ugly internal tools by leveraging GPT-5's superior design capabilities for dashboards, interfaces, and business applications
  • Creative Project Acceleration: Individual developers can create sophisticated 3D games, interactive visualizations, and complex applications that would normally require specialized teams and extensive time investment
  • Self-Improving Development: The autonomous build-test-fix cycle enables developers to set complex tasks in motion and return to find completed, tested, production-ready solutions

Timestamp: [54:43-1:05:10]

📚 References from [54:43-1:05:10]

People Mentioned:

  • Adi Ganesh - Post-training team researcher demonstrating front-end coding and creative capabilities
  • Greg Brockman - OpenAI President providing commentary and testing interactive game elements
  • Professional Designers - Designers brought in to evaluate UI improvements once researchers could no longer tell model versions apart

Technologies & Frameworks:

  • Next.js - React framework used for dashboard application development
  • Tailwind CSS - Utility-first CSS framework for styling and design
  • D3.js - Data visualization library referenced for comparison of implementation complexity
  • TypeScript - Programming language used for type-safe development
  • npm - Package manager for JavaScript dependencies
  • Cursor - Development environment used for code generation demonstration

Development Concepts:

  • Create-Next-App - Command-line tool for scaffolding Next.js projects
  • Modular Architecture - Component-based code organization for maintainability
  • KPI Cards - Key Performance Indicator display components
  • Revenue Charts - Financial data visualization components
  • Build-Test-Fix Cycle - Autonomous development loop for error correction

Creative Elements:

  • 3D Castle Game - Complex three-dimensional interactive gaming environment
  • Character Dialogue - Interactive NPCs with personality and conversation capabilities
  • Balloon-Popping Mini-Game - Interactive gaming element with sound effects
  • Captain Rowan - Named NPC character with military personality
  • Merchant Character - Commercial NPC with appropriate dialogue and interactions

Design Principles:

  • Good Aesthetics by Default - Beautiful results from minimal prompts
  • Steerability - Ability to follow specific design instructions precisely
  • Typography Excellence - Advanced understanding of text design principles
  • Color Intelligence - Sophisticated color theory and palette selection
  • Spacing Expertise - Professional layout and visual hierarchy

Business Applications:

  • CFO Dashboard - Financial visualization and analytics interface
  • ARR Tracking - Annual Recurring Revenue monitoring and display
  • Cash Flow Visualization - Financial data representation and analysis
  • Customer Segmentation - Customer data analysis and categorization
  • Date Filtering - Temporal data analysis and visualization controls

Game Development:

  • 3D Environment Creation - Complex three-dimensional world building
  • Character Animation - Autonomous NPCs with patrol behaviors
  • Cannon Mechanics - Interactive firing systems with visual effects
  • Sound Integration - Audio feedback for user interactions
  • Physics Implementation - Projectile mechanics and collision detection

Quality Metrics:

  • A/B Testing - Comparative evaluation of design quality improvements
  • Designer Consultation - Professional evaluation when quality differences became subtle
  • Production-Ready Code - Mergeable, professional-quality output
  • Self-Improvement Loop - Autonomous error detection and correction capability

Timestamp: [54:43-1:05:10]

🔍 Can GPT-5 Actually Understand the "Why" Behind Your Technical Decisions?

Revolutionary Codebase Understanding Intelligence

The Cursor Team's First Test:

Deep Codebase Analysis Challenge:

  • Initial Request: "Tell us something non-obvious about our codebase"
  • Time Investment: Within a couple of minutes of analysis
  • Discovery Scope: Dug deep into a complex existing codebase
  • System Identification: Remote code execution system detection
Michael Truell
So when we got access to GPT-5, we just set about using it on our actual work. And so to start with, as a test, we asked it to tell us something non-obvious about our codebase.
Michael Truell, Cursor | Co-founder & CEO

Architectural Intelligence:

  • Non-Obvious Decisions: Identified subtle architecture choices
  • Security Understanding: Recognized hardening decisions and their purpose
  • Trade-off Comprehension: Understood complex engineering compromises
  • Design Rationale: Grasped the "why" behind technical decisions
Michael Truell
And within a couple of minutes, it buried into the codebase. It identified a particular system that we use for remote code execution. And it identified a non-obvious architecture decision we had made. And then it also understood why we made that architecture decision.
Michael Truell, Cursor | Co-founder & CEO

Human vs. AI Analysis Speed:

Traditional Development Reality:

  • Human Time Investment: "Weeks to think through" architecture decisions
  • Complex Problem Solving: Multiple engineers deliberating on design choices
  • Security Considerations: Extensive analysis of hardening approaches
  • Trade-off Evaluation: Careful consideration of engineering compromises
Michael Truell
And those were architecture decisions and trade-offs that took humans weeks to think through. So, it was kind of amazing to see its codebase understanding abilities.
Michael Truell, Cursor | Co-founder & CEO

GPT-5 Achievement:

  • Minutes vs. Weeks: Understood in a couple of minutes architectural trade-offs that took humans weeks to think through
  • Deep Comprehension: Not just code reading, but design philosophy understanding
  • Security Awareness: Recognition of security-focused architectural decisions
  • Holistic Understanding: Complete picture of system design and rationale

Beyond Code Generation:

Complete Software Development Intelligence:

  • Code Reading Excellence: Superior understanding of existing codebases
  • Architecture Recognition: Pattern identification in complex systems
  • Design Philosophy: Understanding the reasoning behind technical choices
  • Security Consciousness: Recognition of hardening and protection mechanisms

Real-World Application Value:

Professional Development Impact:

  • Legacy System Understanding: Rapid comprehension of inherited codebases
  • Onboarding Acceleration: New team members understanding systems quickly
  • Documentation Generation: Automatic explanation of complex architectural decisions
  • Knowledge Transfer: Preserving institutional knowledge about design choices

Timestamp: [1:05:38-1:06:13]

🤖 Why Does the Cursor Team Trust GPT-5 With Their Most Important Work?

Perfect Balance of Power and Practicality

Intelligence Without Compromise:

The Rare Combination:

  • Incredibly Smart: Superior reasoning and problem-solving capabilities
  • Ease of Use: Natural interaction without complexity barriers
  • No Trade-offs: Intelligence doesn't come at the cost of accessibility
  • Real Pair Programming: Authentic collaborative development experience
Michael Truell
It's incredibly smart. It's very smart. And even though it is smart, it does not compromise on its ease of use for real pair programming.
Michael Truell, Cursor | Co-founder & CEO

Interactive Development Excellence:

Communication and Transparency:

  • Upfront Planning: Explains what it's about to do before acting
  • Problem Decomposition: Breaks complex problems into understandable subproblems
  • Reasoning Traces: Leaves clear reasoning trails for human intervention
  • Interactive Speed: Fast enough for real-time collaborative development

Long-Session Collaboration:

Extended Development Workflows:

  • Multi-Query Sessions: Works effectively across long development sessions
  • Backtracking Capability: Can reverse decisions and change direction
  • Additional Changes: Handles evolving requirements and scope changes
  • Continuous Collaboration: Maintains context across extended interactions

Real-World Integration:

Daily Driver Capability:

  • Professional Use: Suitable for actual work, not just demos
  • Scoped Problems: Start with contained problems and expand usage
  • Synchronous Work: Real-time collaboration during development
  • Daily Development: Reliable enough for everyday programming tasks
Michael Truell
I would suggest using it for your real work. So, GPT-5 is a step forward towards a real pair programmer. And so I would start using it as a helper—as a daily driver model for you.
Michael Truell, Cursor | Co-founder & CEO

Advanced Problem-Solving:

Complex Task Management:

  • High-Level Planning: Creates strategic approaches to complex problems
  • Codebase Search: Systematic exploration of large codebases
  • File Analysis: Intelligent reading and understanding of existing code
  • Solution Implementation: Practical code changes that solve real problems
Michael Truell
And so if you haven't used AI to code much before, I would take some of your more scoped-down problems and try handing them off to the bot and working with it synchronously.
Michael Truell, Cursor | Co-founder & CEO

Timestamp: [1:06:25-1:08:54]

🐛 How Does GPT-5 Solve a "3-Week-Old" OpenAI SDK Bug Using Custom Tools It's Never Seen Before?

Robustness and Adaptability in Real-World Scenarios

The Complex Bug Challenge:

Real-World Problem:

  • OpenAI Python SDK: Production codebase with active development
  • PDF Upload Issue: Specific functionality broken for three weeks
  • Non-Trivial Problem: Complex enough to remain unsolved for extended period
  • Public Repository: Real-world open source development scenario
Michael Truell
So we can see that it has buried into the codebase and discovered that there's an issue with the MIME type being sent up for PDFs and the plumbing through the SDK. It has identified that, and it's started making some code changes.
Michael Truell, Cursor | Co-founder & CEO

Tool Adaptability Excellence:

Custom Tool Mastery:

  • Unseen Tools: Working with custom Cursor tools for the first time
  • Web Text Retrieval: Pulling down information from web sources
  • Codebase Search: Systematic exploration of large codebases
  • Tool Integration: Seamless integration of multiple custom utilities

Systematic Problem-Solving:

Professional Development Approach:

  1. High-Level Planning: Strategic approach to problem identification
  2. Codebase Exploration: Systematic search throughout the repository
  3. File Analysis: Reading and understanding relevant source files
  4. Issue Identification: Discovering MIME type problems in PDF handling
  5. Solution Implementation: Creating new methods and editing existing code
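To make the class of bug in step 4 concrete: an upload path has to attach the right MIME type to a file, and a PDF sent with the wrong one can be rejected or misread downstream. The sketch below uses only Python's standard `mimetypes` module. It is not the actual fix made to the OpenAI Python SDK, which the summary does not show, and both function names are hypothetical.

```python
import mimetypes

def guess_content_type(filename: str, default: str = "application/octet-stream") -> str:
    """Pick a Content-Type for an upload from the file name, with a generic fallback."""
    content_type, _encoding = mimetypes.guess_type(filename)
    return content_type or default

def build_upload_part(filename: str, data: bytes) -> tuple[str, bytes, str]:
    """Return a (filename, data, content_type) tuple, a shape many HTTP clients accept."""
    return filename, data, guess_content_type(filename)

part = build_upload_part("report.pdf", b"%PDF-1.7")
print(part)  # prints: ('report.pdf', b'%PDF-1.7', 'application/pdf')
```

Carrying the content type alongside the bytes through every layer is the "plumbing" concern: if any intermediate function drops the third element, the server falls back to guessing.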

Real Codebase Excellence:

Large-Scale Development:

  • Big Codebases: Effective operation in complex, real-world repositories
  • Daily Driver Capability: Suitable for everyday professional development
  • Long-Lived Applications: Effective with established, mature codebases
  • Production Quality: Changes suitable for merging into production code

Advanced Instruction Following:

Complexity Management:

  • Subtle Instructions: Picking up on nuanced requirements and constraints
  • Long Task Specifications: Handling complex, multi-part instructions
  • Backtracking Ability: Reversing course when code execution reveals errors
  • Error Recovery: Learning from feedback and adjusting approach
Greg Brockman
What can't GPT-5 do?
Greg Brockman, OpenAI | President
Michael Truell
Well, we're really excited about computer-use capabilities, about those getting better. It would be great if, for instance, with the dashboard Adi just showed, it could run the code, see the output, actually QA every little bit itself, and then react to it. So, looking forward to computer-use capabilities.
Michael Truell, Cursor | Co-founder & CEO

The Future Vision:

Extended Development Cycles:

  • Current Demos: 5-10 minutes to a couple of hours of development
  • Future Potential: Days, weeks, eventually months of autonomous development
  • Computer-Use Integration: Visual QA and interaction with running applications
  • DevOps Expansion: Beyond code writing to complete development lifecycle
Greg Brockman
We run them for 5 minutes, 10 minutes, a couple hours, but I think extending that life cycle to really be able to go for days and weeks and eventually even months, I think that is ultimately where we expect things to go.
Greg Brockman, OpenAI | President

Timestamp: [1:07:13-1:10:34]

🎯 Why Did Cursor's CEO Just Make GPT-5 Default for Every New User?

Industry Partnership and Professional Validation

The Cursor Partnership:

Strategic Integration:

  • Default Model: GPT-5 becomes the standard for new Cursor users
  • Universal Rollout: Available to all existing Cursor users
  • Free Trial: Several days of free access for evaluation
  • Professional Endorsement: CEO of leading development tool validates superiority
Michael Truell
Starting today, GPT-5 is default for new users in Cursor, and we're releasing it to all Cursor users. Free to try for the next few days so people get a sense of the model.
Michael Truell, Cursor | Co-founder & CEO

Technical Validation:

Professional Assessment:

  • Smartest Model: "The smartest coding model we've ever tried"
  • Real-World Testing: Evaluated in actual professional development workflows
  • Production Use: Suitable for real work, not just demonstrations
  • Industry Leadership: Recognition from leading development tool company
Michael Truell
It is the smartest coding model we've ever tried.
Michael Truell, Cursor | Co-founder & CEO

Immediate Availability:

User Access:

  • Same-Day Launch: Immediate availability for Cursor users
  • Free Evaluation: Risk-free trial period for adoption
  • Professional Integration: Seamless integration into existing workflows
  • Market Validation: Trusted by industry-leading development platform

The MIME Type Bug Resolution:

Live Problem Solving:

  • Issue Identification: PDF upload problem in OpenAI Python SDK
  • Root Cause: MIME type handling in SDK plumbing
  • Code Changes: New methods and existing code modifications
  • Production Quality: "Looks roughly correct" and ready for PR merge
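
As a rough illustration of the kind of MIME type handling involved, the sketch below guesses a content type from a filename before a file is attached to an upload. It is not the change GPT-5 produced in the demo and not the OpenAI Python SDK's actual code: the helper names guess_upload_mime_type and build_file_part are hypothetical, and only Python's standard mimetypes module is used.

```python
import mimetypes
from typing import Tuple

# Generic fallback for files whose type cannot be inferred from the name.
DEFAULT_MIME_TYPE = "application/octet-stream"


def guess_upload_mime_type(filename: str) -> str:
    """Return a MIME type inferred from the file extension, or a generic fallback."""
    mime_type, _encoding = mimetypes.guess_type(filename)
    return mime_type or DEFAULT_MIME_TYPE


def build_file_part(filename: str, data: bytes) -> Tuple[str, bytes, str]:
    """Hypothetical helper: bundle name, bytes, and content type for a multipart upload."""
    return (filename, data, guess_upload_mime_type(filename))


if __name__ == "__main__":
    # A PDF sent without an explicit content type is a plausible way
    # for an upload like the one in the demo to be rejected.
    print(build_file_part("report.pdf", b"%PDF-1.7"))
```

Passing the content type explicitly, rather than letting the transport layer fall back to a generic one, is the design choice this sketch is meant to show.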

Future Development Paradigm:

Extended Capabilities:

  • Computer-Use Integration: Visual application testing and QA
  • DevOps Expansion: Beyond coding to complete development lifecycle
  • Extended Sessions: From minutes/hours to days/weeks/months of development
  • Autonomous Quality Assurance: Self-testing and validation capabilities

Real-World Impact:

Professional Development Revolution:

  • Daily Driver Use: Reliable for everyday professional development
  • Large Codebase Operation: Effective with complex, real-world repositories
  • Collaborative Development: True pair programming experience
  • Production Deployment: Code quality suitable for immediate merging

Timestamp: [1:10:40-1:11:29]

💎 Key Insights from [1:05:17-1:11:29]

Essential Insights:

  1. Architectural Intelligence Revolution: GPT-5 demonstrates unprecedented codebase understanding by identifying complex architectural decisions and their security rationales in minutes rather than the weeks humans required for the original design
  2. Real Pair Programming Achievement: The model successfully balances extreme intelligence with practical usability, enabling true collaborative development without compromising on either sophisticated reasoning or ease of use
  3. Production-Ready Validation: Cursor's adoption of GPT-5 as the default model represents industry validation that AI coding has reached professional-grade reliability for real-world development workflows

Actionable Insights:

  • Immediate Professional Integration: Developers can start using GPT-5 as a daily driver for real work, beginning with scoped problems and expanding to complex, multi-session collaborative development
  • Legacy Code Understanding: Teams can leverage GPT-5's architectural intelligence to rapidly understand inherited codebases, accelerate onboarding, and preserve institutional knowledge about design decisions
  • Extended Development Cycles: The foundation is set for AI-assisted development to expand from current demo timeframes to days, weeks, and eventually months of autonomous development work

Timestamp: [1:05:17-1:11:29]

📚 References from [1:05:17-1:11:29]

People Mentioned:

  • Michael Truell - Co-founder and CEO of Cursor providing professional validation of GPT-5's coding capabilities
  • Greg Brockman - OpenAI President demonstrating real-world bug fixing and discussing partnership significance

Companies & Products:

  • Cursor - AI-powered development environment integrating GPT-5 as default model for new users
  • OpenAI Python SDK - Production codebase used for live bug fixing demonstration
  • GitHub Issues - Platform where the 3-week-old PDF upload bug was documented and tracked

Technologies & Tools:

  • Remote Code Execution System - Complex architecture component identified by GPT-5 in Cursor's codebase
  • MIME Type Handling - Technical issue discovered in PDF upload functionality
  • Custom Cursor Tools - Development utilities for web text retrieval and codebase search
  • PDF Upload Functionality - Specific feature with production bug requiring systematic fixing

Development Concepts:

  • Architecture Decisions - Complex design choices requiring weeks of human deliberation
  • Security Hardening - Protective measures identified by GPT-5 in system design
  • Codebase Understanding - AI capability to comprehend existing code structure and rationale
  • Pair Programming - Collaborative development methodology enhanced by AI assistance
  • Daily Driver Model - AI system reliable enough for everyday professional development use

Problem-Solving Methodology:

  • High-Level Planning - Strategic approach to complex problem identification and resolution
  • Codebase Search - Systematic exploration and analysis of large software repositories
  • File Analysis - Intelligent reading and comprehension of existing source code
  • Backtracking Capability - Ability to reverse decisions and change development direction
  • Error Recovery - Learning from feedback and adjusting problem-solving approach

Future Capabilities:

  • Computer-Use Integration - Visual application testing and quality assurance capabilities
  • DevOps Expansion - Beyond coding to complete development lifecycle management
  • Extended Development Sessions - Autonomous development spanning days, weeks, or months
  • Autonomous Quality Assurance - Self-testing and validation without human intervention

Professional Validation:

  • Production Code Quality - AI-generated changes suitable for immediate production merging
  • Real-World Testing - Evaluation in actual professional development environments
  • Industry Partnership - Strategic integration with leading development tools
  • Free Trial Access - Risk-free evaluation period for professional adoption

Technical Challenges:

  • Three-Week-Old Bug - Complex issue demonstrating non-trivial problem-solving capability
  • Large Codebase Operation - Effective performance in complex, real-world repositories
  • Custom Tool Adaptation - Working with previously unseen development utilities
  • Multi-Part Instructions - Handling complex, nuanced development requirements

Timestamp: [1:05:17-1:11:29]

🏢 Can 5 Million Businesses Really Transform Entire Industries with GPT-5?

Enterprise Transformation at Unprecedented Scale

The Business Reality:

Massive Adoption Statistics:

  • 5 Million Businesses: Currently using OpenAI technology
  • Production Focus: "Not just playing," "not just experimenting"
  • Real-World Implementation: Pushing new products into actual production
  • Step Function Change: GPT-5 expected to dramatically accelerate adoption
Olivier Godement
Since we launched ChatGPT and the API, 5 million businesses have been using our technology. I'm still mind-blown—five million businesses.
Olivier Godement, OpenAI | Platform Lead
Olivier Godement
And those businesses are not just playing. They're not just experimenting; they are pushing into production new products in the real world. And I believe GPT-5 is going to be a step function in that regard.
Olivier Godement, OpenAI | Platform Lead

The Subject-Matter Expert Vision:

Universal Expertise Access:

  • Pocket Expert: Subject-matter expert available to every employee
  • Cross-Domain Intelligence: Expert across legal, finance, and all application areas
  • Employee Empowerment: Enabling every worker to accomplish more
  • Industry Transformation: Key sectors can fundamentally transform themselves

Critical Industry Focus:

Target Sectors for Transformation:

  1. Healthcare: Revolutionary patient care and medical analysis
  2. Education: Enhanced learning and educational delivery
  3. Energy: Optimized operations and sustainability initiatives
  4. Finance: Advanced analysis and decision-making capabilities

OpenAI's Mission Alignment:

Business and Government Enablement:

  • Developer Priority: Strong focus on coding and development capabilities
  • Broader Mission: Equal emphasis on business and government transformation
  • Industry Evolution: Enabling fundamental transformation across key sectors
  • Scalable Impact: Technology designed for widespread enterprise adoption

Future Use Case Explosion:

Historical Pattern Recognition:

  • GPT-4 Precedent: Previous model generated unforeseen applications
  • Emerging Innovation: "Many, many use cases" expected in coming weeks/months
  • Unimaginable Applications: Use cases "all of us could not even imagine"
  • Collaborative Future: "Invent that future together" approach

Timestamp: [1:11:36-1:12:47]

💊 Why Did Amgen Choose GPT-5 to Fight the World's Deadliest Diseases?

Life Sciences Revolution & Scientific Intelligence

Olivier Godement
First, I want to talk about life sciences. Amgen is a company in the U.S. that designs new drugs—new medicines—to fight some of the toughest human diseases.
Olivier Godement, OpenAI | Platform Lead

Amgen's Pioneering Role:

Early GPT-5 Adoption:

  • First Testers: Among the earliest companies to evaluate GPT-5
  • Drug Design Focus: Developing new medicines for toughest human diseases
  • U.S. Pharmaceutical Leader: Major company designing breakthrough medications
  • Real-World Application: Practical implementation in critical healthcare development
Olivier Godement
Amgen was one of the first testers of GPT-5, and they used it in the context of drug design. And what Amgen scientists found is that GPT-5 is particularly good at deep reasoning with complex data.
Olivier Godement, OpenAI | Platform Lead

Deep Reasoning Excellence:

Scientific Data Processing:

  • Complex Data Analysis: Superior performance with sophisticated scientific information
  • Scientific Literature: Advanced analysis of research papers and publications
  • Clinical Data: Intelligent processing of patient and trial information
  • Pattern Recognition: Identifying insights in complex medical datasets

Healthcare Impact Potential:

Drug Development Acceleration:

  • Research Efficiency: Faster analysis of scientific literature and data
  • Discovery Support: Enhanced identification of drug development opportunities
  • Clinical Insight: Better understanding of patient data and treatment outcomes
  • Innovation Enablement: Accelerated development of life-saving medications

Scientific Intelligence Advancement:

Beyond Traditional AI Applications:

  • Domain Expertise: Sophisticated understanding of pharmaceutical science
  • Research Acceleration: Significant speedup in drug discovery processes
  • Quality Enhancement: More accurate analysis of complex scientific information
  • Innovation Support: Enabling breakthrough discoveries in medical research

Real-World Medical Impact:

Patient Care Implications:

  • Faster Drug Development: Accelerated timeline for new medication availability
  • Better Treatment Options: Enhanced understanding leading to improved therapies
  • Disease Understanding: Deeper insights into complex medical conditions
  • Global Health Impact: Potential to address toughest human diseases more effectively

Timestamp: [1:12:59-1:13:26]

📈 Why Is Spain's Biggest Bank Trusting GPT-5 With Critical Financial Decisions?

Finance Industry Transformation & Speed Revolution

Olivier Godement
Next, I want to talk about finance. BBVA is a multinational bank headquartered in Madrid, Spain.
Olivier Godement, OpenAI | Platform Lead

BBVA's Financial Analysis Breakthrough:

Multinational Banking Innovation:

  • Global Institution: Multinational bank headquartered in Madrid, Spain
  • Financial Analysis Focus: Advanced analysis capabilities for banking operations
  • Comprehensive Evaluation: Testing against all available AI models
  • Clear Performance Winner: GPT-5 superior in accuracy and speed
Olivier Godement
BBVA has been using GPT-5 for financial analysis, and the takeaway was pretty clear: GPT-5 beats every single other model out there in terms of accuracy and speed. What used to take three weeks for a financial analyst to do, GPT-5 can do in a couple of hours.
Olivier Godement, OpenAI | Platform Lead

The Time Transformation:

Productivity Revolution:

  • Traditional Timeline: 3 weeks for financial analyst to complete analysis
  • GPT-5 Performance: Same analysis completed in a couple of hours
  • Speed Multiplication: Roughly 250x in elapsed calendar time (about 504 hours vs. 2), or closer to 60x if counted in 40-hour work weeks
  • Accuracy Maintenance: Superior accuracy while achieving dramatic speed gains
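
The speedup figure depends on how "three weeks" is counted. The short calculation below makes the assumptions explicit: "a couple of hours" is taken as 2 hours, and a work week is taken as 5 days of 8 hours. Neither number comes from the presentation, so treat the results as rough bounds rather than measured values.

```python
def speedup(before_hours: float, after_hours: float) -> float:
    """Ratio of the old duration to the new duration."""
    return before_hours / after_hours


# Assumption: "a couple of hours" means 2 hours.
GPT5_HOURS = 2

# Three weeks counted as wall-clock time (24 hours a day, 7 days a week).
calendar_hours = 3 * 7 * 24
# Three weeks counted as analyst working time (assumed 5 days x 8 hours).
working_hours = 3 * 5 * 8

calendar_speedup = speedup(calendar_hours, GPT5_HOURS)
working_speedup = speedup(working_hours, GPT5_HOURS)

if __name__ == "__main__":
    print(f"calendar time: {calendar_speedup:.0f}x")
    print(f"working time:  {working_speedup:.0f}x")
```

Under these assumptions the calendar-time ratio is 252x and the working-time ratio is 60x, which is where the "approximately 250x" figure sits at the upper end.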

Competitive Model Performance:

Market Leadership:

  • Universal Comparison: "Beats every single other model out there"
  • Dual Excellence: Superior performance in both accuracy and speed
  • Clear Differentiation: Definitive advantage over competing AI solutions
  • Professional Validation: Banking industry confirmation of superiority

Financial Industry Impact:

Sector-Wide Implications:

  • Analysis Acceleration: Dramatic speedup in financial decision-making
  • Resource Optimization: Analysts can focus on higher-value activities
  • Competitive Advantage: Banks using GPT-5 gain significant operational edge
  • Market Responsiveness: Faster analysis enables quicker response to market conditions

Global Banking Transformation:

Operational Excellence:

  • Decision Speed: Faster financial analysis enables quicker strategic decisions
  • Risk Assessment: Enhanced ability to evaluate complex financial risks
  • Client Service: Improved speed and accuracy in client financial analysis
  • Innovation Enablement: More time for strategic innovation and development

Timestamp: [1:13:26-1:13:50]

🏛️ How Does ChatGPT Help 2 Million Federal Workers Serve Citizens Better?

Healthcare Intelligence & Medical Policy Revolution

Oscar Health's Clinical Intelligence:

Healthcare Insurance Innovation:

  • New York-Based Company: Major insurance provider implementing GPT-5
  • Clinical Reasoning Focus: Advanced medical decision-making capabilities
  • Industry Leadership: Leading implementation of AI in healthcare insurance
  • Practical Application: Real-world deployment in patient care scenarios
Olivier Godement
Oscar is an insurance company based in New York, and they've been using GPT-5, and what they found is that GPT-5 is the single best model for clinical reasoning. Think mapping complex medical policy to patient conditions.
Olivier Godement, OpenAI | Platform Lead

Clinical Reasoning Excellence:

Medical Intelligence Superiority:

  • Best-in-Class Performance: "Single best model for clinical reasoning"
  • Complex Policy Mapping: Sophisticated understanding of medical policies
  • Patient Condition Analysis: Advanced analysis of patient health conditions
  • Decision Support: Enhanced clinical decision-making capabilities

Healthcare Policy Integration:

Complex Medical Decision-Making:

  • Policy Mapping: Connecting complex medical policies to patient conditions
  • Condition Analysis: Understanding nuanced patient health scenarios
  • Treatment Authorization: Intelligent evaluation of treatment appropriateness
  • Care Coordination: Enhanced coordination between medical policies and patient needs

Healthcare Industry Impact:

Patient Care Enhancement:

  • Faster Decisions: Accelerated insurance and treatment decisions
  • Better Accuracy: More accurate matching of policies to patient needs
  • Improved Outcomes: Better alignment of coverage with medical necessity
  • Cost Optimization: More efficient allocation of healthcare resources

U.S. Federal Government Adoption:

Massive Government Implementation:

  • 2 Million Employees: U.S. federal workforce gaining GPT-5 access
  • ChatGPT Integration: Government employees using GPT-5 through ChatGPT
  • Service Delivery: "Better, faster services to the American people"
  • Public Sector Innovation: Government embracing AI for citizen service improvement
Olivier Godement
We are super excited by the announcement that we made yesterday that the two million U.S. federal employees will be able to use GPT-5 in ChatGPT, and I cannot wait to see how that enables delivery of better, faster services to the American people.
Olivier Godement, OpenAI | Platform Lead

Timestamp: [1:13:50-1:14:23]

💰 How Will GPT-5 Nano's Pricing Trigger an AI Innovation Explosion?

Pricing Strategy & Accessibility Revolution

The Three-Model Pricing Structure:

Complete Cost-Performance Spectrum:

GPT-5 (Premium model):
  • Input tokens: $1.25 per million tokens
  • Output tokens: $10 per million tokens
GPT-5 Mini (Balanced performance and affordability):
  • Input tokens: $0.25 per million tokens
  • Output tokens: $2 per million tokens
GPT-5 Nano (25x more affordable than GPT-5 for maximum accessibility):
  • Input tokens: $0.05 per million tokens
  • Output tokens: $0.40 per million tokens
Olivier Godement
GPT-5 is going to be available in the API starting today. Three models—GPT-5, GPT-5 Mini, GPT-5 Nano.
Olivier Godement, OpenAI | Platform Lead
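
To see what the three price points mean for a concrete workload, the sketch below estimates cost from token counts using the per-million-token prices listed above. The dictionary keys are labels chosen for this example, the function name estimate_cost is hypothetical, and the sample workload of 2 million input and 500,000 output tokens is made up for illustration.

```python
# Prices in USD per million tokens, as listed in this section: (input, output).
PRICES_PER_MILLION = {
    "gpt-5": (1.25, 10.00),
    "gpt-5-mini": (0.25, 2.00),
    "gpt-5-nano": (0.05, 0.40),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost for one workload on the given model."""
    input_price, output_price = PRICES_PER_MILLION[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Illustrative workload, not taken from the presentation.
    for name in PRICES_PER_MILLION:
        cost = estimate_cost(name, input_tokens=2_000_000, output_tokens=500_000)
        print(f"{name}: ${cost:.2f}")
```

Because Nano's input and output prices are each 1/25 of GPT-5's, the 25x ratio holds for any mix of input and output tokens. Mini works out to 5x cheaper than GPT-5 on the same basis.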

Immediate API Availability:

Today's Launch:

  • Instant Access: All three models available in API starting today
  • Enterprise Integration: Immediate availability for business implementation
  • Scalable Options: Cost-performance choices for different use cases
  • Production Ready: Full API access for real-world deployment

The Affordability Revolution:

GPT-5 Nano Impact:

  • 25x Cost Reduction: Dramatic affordability improvement over premium model
  • Mass Market Access: Enabling widespread adoption across all business sizes
  • Experimentation Enablement: Low-cost entry point for AI exploration
  • Scale Economics: Affordable high-volume processing capabilities
Olivier Godement
Mini and Nano are even faster and more affordable. Nano—don't sleep on it. It's 25 times more affordable than GPT-5. It's pretty cool. I cannot wait to see what you all build.
Olivier Godement, OpenAI | Platform Lead

Enterprise Adoption Acceleration:

Barrier Removal:

  • Cost Accessibility: Removes financial barriers to AI adoption
  • Scalable Implementation: Options for different business sizes and needs
  • Experimentation Support: Low-cost testing and development capabilities
  • Production Scaling: Affordable options for high-volume applications

Future Innovation Potential:

Developer and Business Enablement:

  • Creative Freedom: Affordable access enables more experimentation
  • Small Business Access: Level playing field for smaller organizations
  • Innovation Acceleration: Lower costs drive faster innovation cycles
  • Global Accessibility: Affordable pricing enables worldwide adoption
Olivier Godement
If history is a teacher—and we've seen it with GPT-4—we are going to see many, many use cases emerge over the coming weeks and months that all of us could not even imagine.
Olivier Godement, OpenAI | Platform Lead

The Build Invitation:

Community Innovation:

  • Open Innovation: "I cannot wait to see what you all build"
  • Collaborative Future: Partnership approach to AI development
  • Diverse Applications: Enabling innovation across all industries and use cases
  • Accessible Technology: Making advanced AI available to everyone

Timestamp: [1:14:48-1:15:21]

💎 Key Insights from [1:11:36-1:15:21]

Essential Insights:

  1. Production-Scale Enterprise Reality: With 5 million businesses already using OpenAI technology in production (not just experimenting), GPT-5 represents a step function improvement that will accelerate real-world business transformation across critical industries
  2. Dramatic Efficiency Gains: BBVA reports that financial analysis that used to take an analyst three weeks can now be done in a couple of hours (about 250x in calendar time, or roughly 60x in 40-hour work weeks) with better accuracy than other models, suggesting similar potential in other sectors
  3. Universal Accessibility Strategy: The 25x affordability difference between GPT-5 and Nano creates a complete spectrum of options, removing cost barriers and enabling widespread adoption from small startups to large enterprises and government agencies

Actionable Insights:

  • Immediate Enterprise Integration: Businesses can start implementing GPT-5 today across finance, healthcare, and life sciences with proven superior performance and immediate API availability
  • Government Service Enhancement: The 2 million U.S. federal employee adoption demonstrates AI's readiness for large-scale government implementation to deliver better, faster citizen services
  • Strategic Cost-Performance Optimization: Organizations can choose the optimal GPT-5 variant (standard, Mini, or Nano) based on their specific use case requirements and budget constraints

Timestamp: [1:11:36-1:15:21]

📚 References from [1:11:36-1:15:21]

People Mentioned:

  • Olivier Godement - OpenAI platform leader presenting enterprise applications and pricing strategy
  • Greg Brockman - OpenAI President introducing enterprise focus and subject-matter expert concept

Companies & Organizations:

  • Amgen - U.S. pharmaceutical company using GPT-5 for drug design and fighting human diseases
  • BBVA - Multinational bank headquartered in Madrid, Spain, implementing GPT-5 for financial analysis
  • Oscar Health - New York-based insurance company using GPT-5 for clinical reasoning
  • U.S. Federal Government - 2 million employees gaining access to GPT-5 through ChatGPT

Industry Sectors:

  • Life Sciences - Drug design and pharmaceutical development using complex data analysis
  • Finance - Advanced financial analysis with superior accuracy and speed
  • Healthcare - Clinical reasoning and medical policy mapping to patient conditions
  • Education - Targeted sector for AI-driven transformation
  • Energy - Industry sector identified for fundamental transformation

Technologies & Models:

  • GPT-5 - Premium model priced at $1.25/$10 per million input/output tokens
  • GPT-5 Mini - Balanced performance model priced at $0.25/$2 per million input/output tokens
  • GPT-5 Nano - Ultra-affordable model priced at $0.05/$0.40 per million input/output tokens, 25x cheaper than GPT-5
  • OpenAI API - Platform providing immediate access to all three model variants

Use Cases & Applications:

  • Drug Design - Pharmaceutical development and new medicine creation
  • Financial Analysis - Complex banking and financial data processing
  • Clinical Reasoning - Medical decision-making and policy-patient condition mapping
  • Scientific Literature Analysis - Research paper and publication processing
  • Clinical Data Processing - Patient data and medical trial information analysis

Government Implementation:

  • Federal Employee Access - 2 million U.S. government workers using GPT-5
  • ChatGPT Integration - Government implementation through existing ChatGPT platform
  • Citizen Service Enhancement - Better, faster services to American people
  • Public Sector Innovation - Large-scale government AI adoption

Performance Metrics:

  • Speed Improvement - 3 weeks to a couple of hours for financial analysis (about 250x in calendar time, roughly 60x in 40-hour work weeks)
  • Accuracy Excellence - Superior accuracy while maintaining dramatic speed gains
  • Model Comparison - "Beats every single other model out there" in finance
  • Clinical Performance - "Single best model for clinical reasoning" in healthcare

Business Impact:

  • 5 Million Businesses - Current scale of OpenAI technology adoption
  • Production Implementation - Real-world deployment beyond experimentation
  • Step Function Change - Expected dramatic acceleration with GPT-5
  • Subject-Matter Expertise - Universal expert access across all domains

Timestamp: [1:11:36-1:15:21]

🎯 What Drives OpenAI's Team to Work With "Passionate Pursuit" Beyond Profit?

The Scientific Foundation of AI Development

The Core Mission:

Deep Learning Understanding:

  • Miraculous Technology: Recognition of deep learning as extraordinary breakthrough
  • Fundamental Research: Core focus on understanding the technology itself
  • Consequence Analysis: Investigating what deep learning can achieve
  • Steering Capability: Learning how to direct AI for safety and utility
Jakub Pachocki
At OpenAI, at the core, we are about understanding this miraculous technology called deep learning and what its consequences are.
Jakub Pachocki, OpenAI | Chief Scientist

Scientific Research Philosophy:

Beyond Product Development:

  • Understanding Focus: Primary goal of comprehending deep learning capabilities
  • Safety Priority: Ensuring AI development serves beneficial purposes
  • Universal Benefit: Making technology safe and useful for everyone
  • Research-Driven: Scientific investigation preceding product release
Jakub Pachocki
Our research aims to understand what deep learning is capable of and how to steer it to make it safe and useful for all of us. This is a work of passion.
Jakub Pachocki, OpenAI | Chief Scientist

The Passion and Mission:

Work of Dedication:

  • Passionate Pursuit: Work driven by genuine enthusiasm and dedication
  • Mission-Oriented: Purpose beyond commercial success
  • Shared Goals: Team united by common vision and objectives
  • Meaningful Impact: Focus on transforming lives for the better

Team Recognition:

Collaborative Excellence:

  • Deep Appreciation: Heartfelt recognition of team contributions
  • Incredible Group: Acknowledgment of exceptional talent
  • Brilliant People: Recognition of intellectual excellence
  • Great Privilege: Honor of working with exceptional colleagues
Jakub Pachocki
It is a great privilege for me to work alongside this incredible group of brilliant people driven by this shared goal.
Jakub Pachocki, OpenAI | Chief Scientist

Timestamp: [1:15:26-1:16:32]

🔮 Why Does OpenAI Say GPT-5 Is Just "Early Glimpses" of What's Coming?

The Long-Term Research Vision Behind Current Achievements

The Development Timeline:

Years of Investigation:

  • Extended Research: Years of dedicated investigation and development
  • Dual Purpose: Not just creating great releases, but building fundamental understanding
  • Technology Comprehension: Deep investigation into underlying technology principles
  • Foundation Building: Establishing knowledge base for future advancement
Jakub Pachocki
What adds up to a model like GPT-5 are years of investigations aimed not only at producing a great release, but at building an understanding of this underlying technology itself.
Jakub Pachocki, OpenAI | Chief Scientist

Early Glimpses Philosophy:

Future Potential Recognition:

  • Current Limitations: Present model represents only early glimpses
  • Greater Potential: Ideas that will extend much further in the future
  • Technology Preview: Current capabilities hint at far greater possibilities
  • Innovation Pipeline: Continuous development of more advanced concepts
Jakub Pachocki
And so a lot of what you'll see in this model are really just early glimpses of new ideas that we believe will go much further.
Jakub Pachocki, OpenAI | Chief Scientist

Research vs. Product Balance:

Scientific Foundation:

  • Understanding Priority: Building comprehension alongside product development
  • Technology Mastery: Investigating the fundamental nature of AI capabilities
  • Long-term Vision: Focusing on sustained advancement rather than quick releases
  • Innovation Depth: Deep research enabling breakthrough capabilities

Future Development Path:

Continued Investigation:

  • Ongoing Learning: Recognition that much remains to be understood
  • Knowledge Expansion: Continuous discovery about AI capabilities and potential
  • Technology Evolution: Expectation of significant future advancement
  • Research Continuation: Commitment to ongoing scientific investigation

Timestamp: [1:16:32-1:17:08]

🌍 What Happens When AI Becomes Humanity's Greatest Discovery Tool?

The Transformative Vision for AI-Driven Discovery

Knowledge Discovery Vision:

AI as Explorer:

  • New Knowledge: AI capability to discover previously unknown information
  • World Understanding: Deep comprehension of natural and scientific phenomena
  • Discovery Acceleration: AI enabling faster scientific and intellectual progress
  • Knowledge Expansion: Pushing boundaries of human understanding
Jakub Pachocki
There is a lot we still have to understand, and we look toward a future where AI can uncover new knowledge about the world and meaningfully transform our lives for the better.
Jakub Pachocki, OpenAI | Chief Scientist

Meaningful Transformation:

Life Enhancement Focus:

  • Meaningful Change: Not just technological advancement, but genuine life improvement
  • Positive Impact: Transformation specifically "for the better"
  • Universal Benefit: Improvements that benefit all of humanity
  • Purpose-Driven Development: Technology advancement with clear beneficial intent

Future Research Horizons:

Ongoing Investigation:

  • Continued Learning: Recognition of vast unexplored potential
  • Unknown Territory: Much more to discover about AI capabilities
  • Future Focus: Looking ahead to greater possibilities
  • Discovery Mindset: Approach oriented toward uncovering new possibilities

The Vision Statement:

Transformative Potential:

  • Knowledge Revolution: AI as catalyst for new scientific and intellectual discoveries
  • World Transformation: Technology capable of fundamentally changing human experience
  • Beneficial Focus: Ensuring transformation serves positive human purposes
  • Future Optimism: Confident expectation of positive technological development
Jakub Pachocki
We hope you'll enjoy what we've built, and we'll get back to scaling. Thank you!
Jakub Pachocki, OpenAI | Chief Scientist

Timestamp: [1:17:01-1:17:26]

💎 Key Insights from [1:15:21-1:17:26]

Essential Insights:

  1. Scientific Foundation Priority: OpenAI describes its core mission as understanding deep learning, which it calls a "miraculous technology," and its consequences; the research behind GPT-5 is framed as aimed both at a great release and at understanding the underlying technology
  2. Long-Term Vision Perspective: GPT-5 represents "early glimpses of new ideas" from years of investigation, suggesting current capabilities are just the beginning of much greater future potential
  3. Knowledge Discovery Mission: The ultimate goal extends beyond current AI applications to enabling AI systems that can "uncover new knowledge about the world" and meaningfully transform human lives for the better

Actionable Insights:

  • Research-Driven Approach: Organizations should prioritize understanding AI capabilities deeply rather than focusing solely on immediate applications, building foundational knowledge for long-term success
  • Future Potential Recognition: Current GPT-5 capabilities should be viewed as early indicators of far greater possibilities, encouraging long-term strategic planning and investment in AI development
  • Beneficial Transformation Focus: AI development efforts should maintain clear focus on meaningful positive impact and universal benefit rather than purely technical advancement

Timestamp: [1:15:21-1:17:26]

📚 References from [1:15:21-1:17:26]

People Mentioned:

  • Jakub Pachocki - OpenAI Chief Scientist delivering closing remarks and vision for future AI development
  • OpenAI Team - Collective group of researchers and developers recognized for their passionate work and shared mission

Technologies & Concepts:

  • Deep Learning - Described as "miraculous technology" that forms the foundation of AI advancement
  • GPT-5 - Current model representing years of investigation and early glimpses of future capabilities

Research Philosophy:

  • Fundamental Understanding - Core focus on comprehending deep learning capabilities and consequences
  • Safety and Utility - Mission to make AI technology safe and useful for universal benefit
  • Technology Steering - Capability to direct AI development toward beneficial outcomes
  • Knowledge Discovery - Vision of AI uncovering new understanding about the world

Development Approach:

  • Years of Investigation - Long-term research approach focusing on deep understanding
  • Early Glimpses - Recognition that current capabilities represent initial manifestations of greater potential
  • Continued Research - Ongoing commitment to understanding AI technology and its possibilities

Mission Elements:

  • Work of Passion - Research driven by genuine enthusiasm and dedication to the field
  • Shared Goals - Team unity around common vision for AI development and impact
  • Meaningful Transformation - Focus on positive change that genuinely improves human lives
  • Universal Benefit - Commitment to ensuring AI advantages serve all of humanity

Future Vision:

  • New Knowledge Discovery - AI's potential to uncover previously unknown information about the world
  • Life Transformation - Technology's capacity to meaningfully change human experience for the better
  • Continued Understanding - Recognition that much more remains to be learned about AI capabilities
  • Positive Impact Focus - Emphasis on beneficial outcomes and meaningful improvement to human lives

Scientific Approach:

  • Miraculous Technology Recognition - Acknowledgment of deep learning as extraordinary breakthrough
  • Consequence Analysis - Investigation into what deep learning can achieve and its implications
  • Research Foundation - Building understanding alongside product development
  • Innovation Pipeline - Continuous development and refinement of AI capabilities

Timestamp: [1:15:21-1:17:26]