undefined - AGI progress, surprising breakthroughs, and the road ahead

AGI progress, surprising breakthroughs, and the road ahead

How close are we to automating scientific discovery? What do AI competition wins really tell us about progress toward AGI? OpenAI Chief Scientist Jakub Pachocki and researcher Szymon Sidor share inside stories—from gold medals at the International Math Olympiad to surprising leaps in reasoning—that reveal where AI is headed next.

August 15, 202540:23
Listen onYouTubeApple PodcastsSpotify

Table of Contents

0:00-6:30
6:33-16:44
16:51-26:46
26:52-33:50
33:50-40:19

🎯 How Does OpenAI Choose Its Next Big Research Bet?

Leadership Roles and Responsibilities at OpenAI

Understanding the key positions that drive AI research requires looking at both the strategic and hands-on aspects of building artificial general intelligence.

Jakub Pachocki - Chief Scientist Role:

  1. Research Roadmap Authority - Sets the technical path and long-term research direction for the entire company
  2. Strategic Decision Making - Determines which technological bets OpenAI will pursue
  3. Vision Implementation - Translates long-term AGI goals into actionable research programs
Jakub Pachocki
The primary thing I'm responsible for is setting the research roadmap for the company—deciding which technical paths we will bet on and the underlying long-term research we will pursue.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Szymon Sidor - Individual Contributor with Leadership:

  • Flexible Problem Solving - Takes on whatever challenges are most critical at any given time
  • Hands-On Research - Maintains direct involvement in technical work while providing strategic input
  • Adaptive Leadership - Balances individual contribution with mentoring and guidance

The structure demonstrates how OpenAI balances visionary leadership with practical execution, ensuring both strategic direction and tactical flexibility in their pursuit of AGI.

Timestamp: [0:35-1:19]Youtube Icon

🏫 How Do Two Classmates End Up Leading OpenAI Together?

The Educational Foundation Behind OpenAI's Leadership

The path from high school classmates to leading AI researchers reveals the importance of exceptional mentorship and rigorous technical training in shaping future innovators.

The Polish High School Experience:

  1. Exceptional Mentorship - Mr. Richard provided both technical excellence and emotional support
  2. Advanced Curriculum - Far beyond typical high school, including graph theory, matrices, and complex programming
  3. Competition Focus - Emphasis on programming competitions to drive excellence in computer science
Jakub Pachocki
We were both drawn there by our computer science teacher, Mr. Richard, who had a strong track record of developing computer scientists and programmers, with a deep focus on programming competitions and pursuing excellence in that field.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Key Educational Elements:

  • Deep Technical Dive - Students explored advanced mathematical concepts typically reserved for university
  • Mentor's Track Record - The teacher had previously developed multiple successful computer scientists
  • Emotional Bonds - The shared experience of coming to the U.S. strengthened their relationship beyond academics

Modern Implications:

  • AI as Educational Tool - ChatGPT can now provide some of the technical guidance previously requiring exceptional teachers
  • Irreplaceable Human Elements - Emotional support and personal connection remain uniquely human contributions
  • Enhanced Teaching Potential - AI tools can make good teachers even more capable
Szymon Sidor
I hope that with ChatGPT it’s easier for people to make this kind of deep dive, because without the right mentor and a lot of work, it’s hard to replicate that experience.
Szymon SidorOpenAIOpenAI | IC & Leadership

Timestamp: [1:20-4:14]Youtube Icon

🤖 How Do You Explain AGI to Your Younger Sibling?

From Abstract Concept to Measurable Reality

The definition of Artificial General Intelligence has evolved from a distant theoretical concept to something we can observe and measure in today's AI systems.

The Evolution of AGI Understanding:

  1. Past Perspective - AGI felt abstract and far away, with all capabilities seeming equally distant
  2. Current Reality - We can now distinguish between different types of intelligence and capabilities
  3. Measurable Milestones - Specific achievements like math olympiad performance provide concrete benchmarks
Jakub Pachocki
A few years ago, when we talked about AGI, the technology—deep learning—had incredible promise, but the concept still felt abstract and far away.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Distinct Capabilities Now Achieved:

  • Natural Conversation - AI can engage naturally across a wide range of topics
  • Mathematical Problem Solving - Complex math problems are now within AI's capabilities
  • Competition-Level Performance - Gold medal achievement at the International Math Olympiad (IMO)

Moving Beyond Point Measures:

  • Real-World Impact Focus - Shifting from specific test performance to actual world influence
  • Holistic Assessment - Understanding that AGI involves integrated capabilities, not just isolated skills
  • Practical Applications - Emphasis on how AI progress meaningfully changes outcomes
Jakub Pachocki
Increasingly, we see that such pointwise measures are inadequate, so we focus on actual impact in the world.
Jakub PachockiOpenAIOpenAI | Chief Scientist

The conversation reveals how AI researchers now think about AGI as a collection of distinct, measurable capabilities rather than a single, monolithic achievement.

Timestamp: [4:38-6:25]Youtube Icon

💎 Key Insights from [0:00-6:30]

Essential Insights:

  1. Leadership Structure Matters - Successful AI research requires both visionary roadmap setting and flexible execution, combining strategic planning with hands-on problem solving
  2. Educational Foundation Impact - Exceptional mentorship and advanced technical training in formative years can create lasting bonds and shape future AI leaders
  3. AGI Definition Evolution - The concept of AGI has transformed from abstract theory to measurable capabilities, with focus shifting from point achievements to real-world impact

Actionable Insights:

  • For Educators: AI tools like ChatGPT can enhance teaching capabilities but cannot replace the emotional support and personal connection that exceptional teachers provide
  • For Organizations: Balance strategic leadership with tactical flexibility, allowing key contributors to adapt to the most critical challenges
  • For AI Development: Move beyond isolated capability testing to assess integrated, real-world applications and meaningful impact

Timestamp: [0:00-6:30]Youtube Icon

📚 References from [0:00-6:30]

People Mentioned:

  • Mr. Richard - High school computer science teacher in Poland who mentored future OpenAI researchers with focus on programming competitions and excellence

Technologies & Tools:

  • ChatGPT - Referenced as educational tool that can create interactive graphics and explanations, making advanced concepts more accessible

Concepts & Frameworks:

  • International Math Olympiad (IMO) - Competition benchmark used to measure AI mathematical reasoning capabilities
  • National Math Olympiad - Additional mathematical competition milestone for measuring AI progress
  • Programming Competitions - Educational approach emphasizing excellence through competitive programming
  • Monty Hall Problem - Classic probability puzzle used as example of interactive educational content

Educational Concepts:

  • Graph Theory - Advanced mathematical concept taught at the Polish high school level
  • Matrix Mathematics - Complex mathematical framework included in advanced high school curriculum
  • Deep Learning - Core AI technology discussed as foundation for AGI development

Timestamp: [0:00-6:30]Youtube Icon

🔬 Can AI Actually Automate the Discovery of New Technology?

The Revolutionary Potential of Automated Scientific Discovery

The concept of machines independently generating fundamental technological breakthroughs challenges our basic assumptions about human ingenuity and the nature of innovation itself.

The Vision for Automated Discovery:

  1. Beyond Human Association - Moving past the traditional link between human creativity and technological progress
  2. Fundamental Change Potential - AI systems capable of ideas that fundamentally alter our understanding of the world
  3. Proximity to Reality - This capability may be closer than most people realize
Jakub Pachocki
It is hard to internalize that this is possible—to automate most of this process. A large computer could generate ideas that fundamentally change our understanding of the world, and that may not be far away.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Target Domains for Early Success:

  • Medicine - Already showing incredible results due to complex reasoning combined with domain knowledge
  • AI Research Itself - Automating the work of AI researchers could accelerate progress exponentially
  • AI Safety and Alignment - Critical for ensuring beneficial outcomes as capabilities advance

Strategic Approach at OpenAI:

  • General Intelligence Focus - Prioritizing broad capabilities over domain-specific optimization
  • Automated Researcher Goal - Building systems that can conduct research autonomously
  • Real-World Impact Emphasis - Moving beyond point achievements to meaningful technological advancement
Jakub Pachocki
At OpenAI, we aim to create very general intelligence and prioritize building an automated researcher.
Jakub PachockiOpenAIOpenAI | Chief Scientist

The implications suggest we may be approaching a threshold where the primary drivers of technological progress shift from human researchers to AI systems.

Timestamp: [6:33-9:27]Youtube Icon

📈 Why Do Headlines Say AI Progress is Slowing When Insiders Are Astounded?

The Disconnect Between Public Perception and Research Reality

Understanding the dramatic acceleration in AI capabilities requires an insider's perspective on the journey from complete failure to superhuman performance across multiple domains.

The 10-Year Journey from Failure to Success:

  1. 2014 Era - Basic natural language processing didn't work; sentiment analysis failed on simple negations
  2. Early Breakthroughs - Slowly solving basic tasks like part-of-speech tagging and simple classification
  3. GPT Evolution - From producing coherent paragraphs to surprising researchers with novel insights
  4. Current Capabilities - Competing in programming competitions and providing reliable research assistance

Personal AGI Moment - GPT-4:

  • Surprise Factor - The model began saying things that genuinely surprised experienced researchers
  • Capability Evolution - From "slightly better Google" to truly useful research companion
  • Competition Performance - Achieving results in programming competitions that took years of personal effort
Szymon Sidor
For me, GPT-4 was a personal AGI moment: it would sometimes say things that genuinely surprised me, and I found myself asking, can a model really surprise me?
Szymon SidorOpenAIOpenAI | IC & Leadership

The Economic Impact Perspective:

  • Historical Context - 10 years ago, AI economic impact was essentially 0.00001%
  • Current 3-5% - Represents massive growth when viewed in proper historical context
  • Projected Trajectory - Reasonable expectation of 10% in one year, 20% in two years

Timestamp: [9:33-13:23]Youtube Icon

📊 How Do You Know if AI Is Actually Smart or Just Good at Tests?

The Saturation Problem in AI Measurement

As AI systems reach human-level performance on standardized tests, traditional benchmarks become inadequate for measuring true progress and capability differences.

The Benchmark Saturation Challenge:

  1. Human-Level Performance - Models achieving top performance in difficult high-school competitions worldwide
  2. Constrained Measurement Limits - Traditional testing formats become insufficient for evaluation
  3. Specialized vs. General Ability - Models can be trained to excel at specific domains without representing overall intelligence

Evolution of AI Training Approaches:

  • Early Scaling Era - GPT-1 through GPT-4 benchmarks measured "rising tide" of general capability
  • Specialized Training - More data-efficient methods create models disproportionately good at specific tasks
  • Representation Issues - Math-focused models may excel on math benchmarks but lack proportional writing ability

The Real-World Utility Focus:

  • Beyond Test Performance - Shifting emphasis from benchmark scores to practical applications
  • Discovery Capability - Prioritizing models' ability to generate new insights over test-taking skills
  • Work vs. Test Performance - Recognition that good test-takers may not be effective work assistants
Jakub Pachocki
We need to focus on the real-world utility of these models—especially their ability to discover new insights.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Internet Comparison Analogy:

  • Economic Impact Invisibility - Like the early internet, AI's economic impact may be difficult to pinpoint on economic graphs
  • Measurement Challenges - Traditional metrics struggle to capture transformative technology adoption
  • Usage Complexity - Difficulty tracking who uses AI and how they apply it in practice

Timestamp: [13:29-16:44]Youtube Icon

💎 Key Insights from [6:33-16:44]

Essential Insights:

  1. Automated Discovery Revolution - AI systems may soon autonomously generate fundamental technological breakthroughs, shifting the primary source of innovation from human researchers to machines
  2. Progress Perception Gap - Public headlines suggesting AI slowdown contrast sharply with insider perspectives showing exponential capability growth from near-zero to significant economic impact
  3. Benchmark Evolution Necessity - Traditional testing methods become inadequate as AI reaches human-level performance, requiring new evaluation frameworks focused on real-world utility and discovery capability

Actionable Insights:

  • For Researchers: Focus on developing evaluation methods that measure practical utility and novel insight generation rather than standardized test performance
  • For Organizations: Prepare for AI systems that may soon automate research and discovery processes, potentially accelerating technological development across industries
  • For Policy Makers: Understand that current economic impact percentages represent massive growth from historical baselines and may accelerate rapidly in coming years

Timestamp: [6:33-16:44]Youtube Icon

📚 References from [6:33-16:44]

Technologies & Tools:

  • Mac Studio - Computer hardware mentioned for running open-source AI models continuously
  • GPT-OSS - Open-source AI model referenced for 24/7 operation experiments
  • Deep Research - AI capability for answering questions with minimal hallucination
  • BERT - Early transformer model used for natural language processing tasks
  • ChatGPT - AI assistant that evolved from basic utility to sophisticated research tool

AI Model Evolution:

  • GPT-1 - Early generative model in the scaling progression
  • GPT-2 - Breakthrough model that produced coherent paragraphs, available on GitHub
  • GPT-3 - Significant advancement in language model capabilities
  • GPT-4 - Model that achieved "personal AGI moment" for researchers with surprising responses

Concepts & Frameworks:

  • Sentiment Analysis - Natural language processing task for determining emotional tone
  • Part-of-Speech Tagging - Basic NLP task for grammatical classification
  • Programming Competitions - Competitive coding contests used as AI capability benchmarks
  • Benchmark Saturation - Phenomenon where AI models reach ceiling performance on standard tests
  • Economic Impact Measurement - Methods for quantifying AI's effect on economic productivity

Technical Concepts:

  • Automated Researcher - AI system capable of conducting independent research
  • Domain Knowledge Integration - Combining reasoning capabilities with specialized expertise
  • Data-Efficient Training - Methods for achieving specialized performance with less training data

Timestamp: [6:33-16:44]Youtube Icon

🏆 Why Are Math Contests Better Than Turing Tests for Measuring AI?

The True Test of Machine Intelligence Beyond Tool Use

Understanding why mathematical olympiads represent meaningful AI milestones requires recognizing the difference between knowledge application and creative reasoning under constraints.

What Makes Math Competitions Special:

  1. Constrained Environment - Limited knowledge requirements but intense reasoning demands
  2. Creative Thinking Focus - Problems require novel insights rather than formula application
  3. Proven Difficulty - Thousands of competitors worldwide validate the challenge level
  4. Time Pressure - Deep thinking required within 1-3 hour windows

The Reasoning Revolution:

  • Pure Mental Processing - No calculators, tools, or external frameworks allowed
  • Beyond Memorization - Success requires creative problem-solving, not knowledge recall
  • Historical Context - Two years ago, models couldn't multiply four-digit numbers
  • Current Achievement - Gold medal performance through reasoning alone
Jakub Pachocki
We’re excited about competitions like the International Math Olympiad and the Informatics Olympiad because they are constrained, require limited knowledge, and truly test your ability to think deeply about a problem for an hour or two—or three.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Limitations of Competition Metrics:

  • Researcher Bubble - These benchmarks matter deeply to AI researchers but may not resonate broadly
  • Domain Specificity - Math competitions don't reflect diverse human capabilities
  • Alternative Perspectives - Different professionals value different types of intelligence

The shift from computational failure to creative reasoning success represents a fundamental breakthrough in machine intelligence capabilities.

Timestamp: [16:51-19:45]Youtube Icon

📊 How Do You Measure AI Progress When Everyone Uses It Differently?

Breaking Out of the Research Bubble with Real-World Usage

The challenge of objective AI measurement becomes complex when researchers' preferred benchmarks don't align with how most people actually experience and value AI capabilities.

The Benchmark Bubble Problem:

  1. Personal Significance Bias - Competitions that shaped researchers' lives feel more important than they actually are
  2. Diverse User Values - A multilingual expert might care more about language capabilities than math skills
  3. Limited Perspective - What excites computer scientists may not matter to historians or other professionals

ChatGPT Usage as Reality Check:

  • Universal Application - People use ChatGPT across countless domains and use cases
  • Honest Feedback - Real usage patterns reveal true utility better than artificial benchmarks
  • Broad Coverage - Avoids the narrow focus that comes from researcher preferences
  • Practical Validation - Shows what actually works in the real world

Future Measurement Approaches:

  • Compute-Intensive Applications - Using vast computational resources to create broadly useful technology artifacts
  • Real-World Impact - Moving beyond user adoption to measure meaningful technological contributions
  • Extended Reasoning - Evaluating models' ability to think longer and deeper on complex problems

The Reasoning Capability Distinction:

  • Time Investment - Models that can reason longer may access capabilities beyond typical user interactions
  • Computational Resources - Future applications may use far more compute than individual users would purchase
  • Technology Artifacts - Focus on creating useful outputs rather than just measuring performance
Szymon Sidor
One imperfect but honest metric that helps us escape the bubble is ChatGPT usage, because everyone uses it for all sorts of use cases.
Szymon SidorOpenAIOpenAI | IC & Leadership

Timestamp: [19:45-21:42]Youtube Icon

🎯 What Happens When AI Stops Pretending to Know Everything?

The Breakthrough Moment of Self-Aware Limitation Recognition

One of the most significant advances in AI capability may be models' ability to accurately assess their own limitations and honestly report when they cannot solve a problem.

The IMO Problem 6 Phenomenon:

  1. Consistent Pattern - Both OpenAI and Google DeepMind models solved problems 1-5 perfectly
  2. Honest Assessment - Models correctly identified they couldn't make progress on problem 6
  3. Self-Awareness - Recognition of limitation rather than attempting to generate false solutions
Jakub Pachocki
The model correctly identified that it had not made progress on the problem.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Why This Matters:

  • Hallucination vs. Honesty - Distinguishing between fabricated answers and genuine uncertainty
  • Problem-Solving Intelligence - Moving beyond knowledge recall to genuine reasoning capability
  • Reliability Indicator - Models that know their limits are more trustworthy for critical applications

The Problem 6 Challenge:

  • Out-of-the-Box Thinking - Requires extremely creative approaches beyond typical mathematical domains
  • Boundary Recognition - Historical distinction between achieving gold medal and solving all problems
  • Validation of Difficulty - Consistent failure across multiple advanced AI systems confirms the challenge level

Implications for AI Development:

  • Fluid vs. Crystalline Intelligence - Separating knowledge possession from problem-solving capability
  • Metacognitive Awareness - Models developing understanding of their own cognitive processes
  • Trust Building - Honest limitation reporting enhances user confidence in AI outputs

This represents a crucial step toward AI systems that can be trusted to work autonomously on complex problems while maintaining intellectual honesty.

Timestamp: [21:49-23:37]Youtube Icon

🇯🇵 Why Was Second Place More Meaningful Than Any Gold Medal?

The Personal Drama of AI Racing Human Champions

The AtCoder competition in Japan became an unexpectedly personal story when OpenAI's model found itself competing directly against someone who had once mocked the idea that AI could master long-duration contests.

The AtCoder Competition Format:

  1. Marathon Style - Single problem solved over 10 hours of focused work
  2. Optimization Challenge - No single correct solution, just better and worse approaches
  3. Heuristic Problem-Solving - Extremely diverse tasks requiring adaptive thinking
  4. Global Prestige - Japan-organized but open to worldwide competitors

The Personal Story Behind the Competition:

  • Historical Friendship - Jakub's colleague Siho excelled at long-duration contests while Jakub focused on shorter formats
  • Past Predictions - Siho had mocked that shorter contests would be automated before longer ones
  • Live Drama - Watching the AI model race against Siho in real-time on Japanese livestream
  • Ultimate Irony - Siho himself prevented his own prediction from coming true by winning

Competition Results:

  • Second Place Finish - OpenAI's model achieved runner-up position
  • Human Champion - Siho took first place, narrowly defeating the AI
  • Exhausted Winner - Post-competition interview revealed Siho's fatigue and frustration

The Broader Implications:

  • Diverse AI Capabilities - Success across multiple competition formats (IOI, IMO, AtCoder)
  • Human-AI Dynamics - Personal relationships becoming intertwined with technological progress
  • Competitive Evolution - Long-duration contests proving as susceptible to AI advancement as shorter ones

The story illustrates how AI progress creates unexpectedly personal moments, turning abstract technological advancement into human drama with real emotional stakes.

Timestamp: [23:43-26:32]Youtube Icon

💎 Key Insights from [16:51-26:46]

Essential Insights:

  1. Pure Reasoning Breakthrough - AI models achieving gold medal performance through creative thinking alone, without tools or memorization, represents a fundamental shift from computational to cognitive capability
  2. Measurement Reality Check - Research benchmarks may not reflect real-world value; ChatGPT usage patterns provide more honest feedback about AI utility across diverse domains and use cases
  3. Self-Aware Limitation Recognition - Models that can accurately identify when they cannot solve problems demonstrate crucial metacognitive awareness, building trust and reliability for autonomous operation

Actionable Insights:

  • For AI Researchers: Balance technical benchmarks with real-world usage patterns to avoid research bubble bias and ensure meaningful progress measurement
  • For Organizations: Prioritize AI systems that demonstrate honest limitation reporting over those that attempt to answer everything, even incorrectly
  • For Competition Organizers: Long-duration, creative problem-solving contests may provide better AI capability assessment than traditional standardized tests

Timestamp: [16:51-26:46]Youtube Icon

📚 References from [16:51-26:46]

People Mentioned:

  • Anna Makanju - OpenAI colleague who speaks five languages, used as example of different expertise perspectives on AI capability measurement
  • Siho - Jakub's friend and competitor who excelled at long-duration programming contests and won first place at AtCoder competition against OpenAI's model

Companies & Products:

  • Google DeepMind - AI research company that also achieved similar results on IMO problems 1-5 but failed on problem 6
  • ChatGPT - Used as metric for real-world AI utility measurement across diverse use cases

Competitions & Events:

  • International Math Olympiad (IMO) - Prestigious mathematical competition used as AI reasoning benchmark
  • Informatics Olympiad (IOI) - Computer science competition parallel to math olympiad
  • AtCoder - Japanese programming competition platform hosting long-duration optimization contests
  • Humanity's Last Exam - Alternative AI capability test mentioned for broader assessment

Technologies & Tools:

  • o1 Model - OpenAI's reasoning model that introduced inner monologue capabilities
  • GPTs - Custom AI applications built by users for specialized tasks

Concepts & Frameworks:

  • Benchmark Saturation - Phenomenon where AI models exceed human performance on standardized tests
  • Research Bubble - Bias where researchers overvalue metrics important to their personal experience
  • Fluid vs. Crystalline Intelligence - Distinction between knowledge possession and problem-solving capability
  • Metacognitive Awareness - AI systems' ability to understand and report their own limitations
  • Heuristic Problem-Solving - Optimization approach without single correct solutions
  • Inner Monologue - AI reasoning process allowing extended thinking before responding

Timestamp: [16:51-26:46]Youtube Icon

😨 What Made OpenAI's Leadership Panic at 11 PM About AI Progress?

The Shocking Moment When Reasoning Breakthroughs Exceeded All Expectations

Behind the seemingly simple concept of "longer chain of thought" lies one of the most intense and frightening moments in AI development, when progress suddenly accelerated beyond what anyone was prepared for.

Jakub Pachocki
I want to point out that the team worked extremely hard on this. It may sound simple—‘just use a longer chain of thought’—but making it work was hard-earned.
Jakub PachockiOpenAIOpenAI | Chief Scientist

The Reality Behind the Breakthrough:

  1. Deceptive Simplicity - The reasoning breakthrough appears simple but required extraordinarily hard work to achieve
  2. Training Discovery - The moment when researchers realized they could train models to reason longer and get better results
  3. Organizational Crisis - Leadership questioning whether OpenAI was prepared for the pace of progress

The 11 PM Emergency Call:

  • Key Participants - Late-night discussion with Sam Altman and Mira Murati
  • Emotional Impact - The team was genuinely "freaked out" by the results they were seeing
  • Preparedness Questions - Serious concerns about whether the organization could handle incredibly fast-paced progress
Jakub Pachocki
I remember one evening around 11:00 p.m.; we were on the line with Sam and Mira, trying to process the results. We were, frankly, a little freaked out.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Understanding the Timeline:

  • Long Development - Years of work before the breakthrough became public
  • Sudden Realization - The moment when everything clicked was shocking and unexpected
  • World Perception - External surprise at a "fundamental new way" to extract more capability from existing infrastructure

The Compound Nature of Progress:

  • Scaling Persistence - Previous paradigms haven't vanished, they compound with new approaches
  • Multiple Directions - New scaling opportunities emerging alongside reasoning improvements
  • Infrastructure Leverage - Getting dramatically more capability from existing computational frameworks

Timestamp: [26:52-28:44]Youtube Icon

🚀 What Happens When AI Can Think for Days Instead of Seconds?

The Next Frontier: Long-Horizon Reasoning and Massive Compute Investment

The evolution from GPT-4's quick responses to models that can work persistently on focused problems for extended periods represents a fundamental shift in AI capability and application.

The Compute Investment Perspective:

  1. Current Scale - o1 Pro uses 10-20× more compute than GPT-4 for significantly better answers
  2. Future Potential - Problems worth solving justify incomparably larger computational investments
  3. High-Value Applications - Medical research and next-generation model development warrant massive resource allocation

Long-Horizon Problem Solving:

  • Model Persistence - Systems capable of working for extended periods on single focused problems
  • Planning Extension - Dramatically expanding the time horizon for AI reasoning and planning
  • Sustained Focus - Moving beyond quick responses to deep, prolonged investigation

Scaling Paradigm Evolution:

  • Compounding Effects - Previous scaling approaches don't disappear, they enhance new capabilities
  • New Directions - Multiple simultaneous advancement paths rather than single breakthrough dependence
  • Resource Justification - Problems that matter to many people justify enormous computational expense

Practical Applications:

  • Medical Research Progress - AI systems working continuously on complex healthcare challenges
  • Technology Development - Models contributing to next-generation AI system creation
  • Research Acceleration - Sustained investigation replacing human researcher time constraints
Jakub Pachocki
When you consider how much compute you’d be willing to spend on problems that truly matter—progress in medical research or developing the next generation of models—the amounts are incomparably larger.
Jakub PachockiOpenAIOpenAI | Chief Scientist

The Investment Logic:

  • Problem Value Assessment - Computing resources justified by problem importance and impact scale
  • Time vs. Compute Trade-off - Spending more computational power to solve problems faster or better
  • Resource Allocation Strategy - Matching computational investment to problem significance

Timestamp: [28:44-30:27]Youtube Icon

🏢 What Will AGI Actually Look Like in Your Daily Life?

From Automated Research Companies to Human-Like Digital Relationships

Rather than a single superintelligent entity, AGI may manifest as automated companies of researchers and engineers, fundamentally accelerating technological progress while creating new forms of human-AI relationships.

The Automated Research Company Vision:

  1. Collaborative Structure - Teams of very capable AI researchers and engineers working largely autonomously
  2. World Integration - Not black boxes, but systems that communicate, take inputs, and run experiments
  3. Artifact Creation - Developing new technology, codebases, designs, and other useful outputs
  4. Technical Acceleration - Radically speeding up the pace of technological progress
Jakub Pachocki
When I talk about automating research, I imagine a largely automated company of very capable researchers and engineers. It would interface with the world in many ways—not a black box. It would talk to people, take inputs, and run experiments.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Interface Evolution and Human Connection:

  • Human-Like Interaction - ChatGPT already feels human-like enough to form attachments
  • Increased Persistence - AI systems that remember and build ongoing relationships
  • Multi-Modal Expression - Communicating through various forms beyond just text
  • Stronger Emotional Bonds - Enhanced capability to create meaningful connections
Jakub Pachocki
ChatGPT can feel quite human-like, and we can form attachments to it. As it becomes more persistent and capable of expressing itself in different forms and media, those effects will grow stronger.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Current Trust Threshold Crossing:

  • Calendar and Email Access - Users becoming comfortable with AI accessing personal data
  • Economic Value Recognition - Clear benefits from allowing AI deeper data integration
  • Trust Evolution - Moving from fear to acceptance to excitement about AI capabilities

Security and Robustness Challenges:

  • Exploitation Vulnerabilities - Current models not robust enough against malicious attacks
  • Trade-off Tensions - Balancing functionality with security concerns
  • Iterative Improvement - Field-wide need to enhance AI system robustness

Societal Implications:

  • Technical Perspective - Need for careful development to ensure beneficial outcomes
  • Social Considerations - Managing the impact of human-AI relationships on society
  • Important Conversations - Addressing attachment formation and dependency issues

Timestamp: [30:27-33:26]Youtube Icon

💎 Key Insights from [26:52-33:50]

Essential Insights:

  1. Breakthrough Reality - Major AI advances require years of intensive work despite appearing simple in retrospect; the reasoning breakthrough genuinely shocked OpenAI leadership and forced serious organizational readiness questions
  2. Scaling Evolution - Future AI progress will compound multiple approaches rather than replace them; long-horizon reasoning with massive compute investment represents the next major frontier for tackling high-value problems
  3. AGI Manifestation - Artificial general intelligence will likely appear as automated research companies rather than individual superintelligent entities, accelerating technological progress while creating new forms of human-AI relationships

Actionable Insights:

  • For Organizations: Prepare for rapid AI capability acceleration by building organizational readiness for fast-paced technological change and development cycles
  • For Developers: Focus on creating robust, secure AI systems that can handle increased data access and persistent operation without exploitation vulnerabilities
  • For Society: Begin serious conversations about human-AI attachment formation and dependency as AI systems become more persistent and human-like in interaction

Timestamp: [26:52-33:50]Youtube Icon

📚 References from [26:52-33:50]

People Mentioned:

  • Sam Altman - OpenAI CEO who participated in late-night emergency call about shocking AI progress results
  • Mira Murati - OpenAI CTO who joined leadership discussion about organizational readiness for rapid AI advancement

Technologies & Tools:

  • ChatGPT - AI assistant referenced for calendar and Gmail integration, demonstrating trust threshold crossing
  • o1 Pro (GPT-5 Pro) - Advanced reasoning model using 10-20× more compute than GPT-4 for superior performance
  • GPT-3 - Referenced as baseline from five years ago to illustrate rapid progress timeline
  • GPT-4 - Comparison model for compute usage and capability benchmarking

Concepts & Frameworks:

  • Chain of Thought Reasoning - AI technique for extended thinking processes that required intensive development work
  • Long-Horizon Reasoning - Extended planning and problem-solving capability over extended time periods
  • Model Persistence - AI systems' ability to work continuously on focused problems for extended durations
  • Scaling Paradigm - Approach to AI development through increased computational resources and model size
  • Automated Research Company - Vision for AGI as collaborative teams of AI researchers and engineers
  • Human-AI Attachment - Psychological bonds formed between humans and increasingly human-like AI systems

Technical Concepts:

  • Compute Investment - Resource allocation strategy matching computational expense to problem importance
  • Interface Evolution - Development of more sophisticated human-AI interaction methods
  • Multi-Modal Expression - AI communication through various forms beyond text
  • Robustness Challenges - Security vulnerabilities in AI systems against malicious exploitation
  • Technical Acceleration - Rapid increase in technological development pace through AI automation

Timestamp: [26:52-33:50]Youtube Icon

💻 What Skills Will AI Never Be Able to Replace?

Why Programming Remains Essential Despite AI Automation

Counter to popular narratives about AI replacing programmers, learning to code develops critical thinking skills that remain valuable even as AI capabilities expand.

The Structured Intellect Advantage:

  1. Problem Decomposition - Breaking complicated problems into manageable pieces remains a premium skill
  2. Future-Proof Thinking - While the medium may change, structured problem-solving remains valuable
  3. Domain Flexibility - Programming is one effective way to develop analytical thinking, but not the only way
Jakub Pachocki
One skill that will remain at a premium is a well-structured intellect—the ability to break complicated problems into pieces. That might not involve programming in the future, but programming is a fine way to acquire that skill.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Real-World Application Benefits:

  • Prompt Engineering Skills - Understanding code logic helps in crafting better AI interactions
  • System Understanding - Knowing how systems work enhances ability to use them effectively
  • Bridge Building - People who understand both human communication and system logic have unique advantages
Andrew Mayne
I tell people: just because a system can do something doesn’t mean you no longer need to understand how it works. When I worked at OpenAI in prompt engineering, my coding background helped me break down language and make it do better things.
Andrew MayneOpenAIOpenAI Podcast Host

The Airplane Pilot Analogy:

  • Foundational Knowledge - Just as pilots benefit from understanding aerodynamics, AI users benefit from understanding logic
  • System Mastery - Deeper understanding enables more effective use of automated tools
  • Professional Advantage - Those who bridge technical and non-technical domains gain competitive edges
Andrew Mayne
Do I want an airplane pilot who doesn't understand aerodynamics? That doesn't make much sense to me.
Andrew MayneOpenAIOpenAI Podcast Host

Misinformation Warning:

  • Contrary Advice - Rejecting claims that programming skills are obsolete in the AI age
  • Skill Evolution - While specific programming tasks may be automated, underlying analytical thinking remains crucial
  • Long-term Value - Structured thinking skills will continue being at a premium regardless of technological changes

Timestamp: [33:50-34:58]Youtube Icon

🌟 What Happens When You Realize There Are No Real Constraints?

Breaking Through Perceived Limitations to Achieve Ambitious Goals

The journey from a Polish high school to leading AI research reveals how many barriers exist only in our minds, and how Silicon Valley's culture of ambitious problem-solving can inspire transformative change.

The Progressive Revelation Process:

  1. First Breakthrough - Realizing you can focus intensely on your passion at the cost of other subjects
  2. Geographic Expansion - Understanding that studying in the USA is actually possible, not just a dream
  3. Community Impact - Discovering environments where people attack big problems with genuine ambition

Silicon Valley's Inspiring Culture:

  • Big Problem Focus - Community willingness to tackle genuinely difficult challenges
  • Meaningful Change Belief - Conviction that individuals can create positive world impact
  • Ambitious Mindset - Cultural support for pursuing transformative rather than incremental goals

The Constraint Realization Pattern:

  • Perceived vs. Real Barriers - Many limitations are mental constructs rather than actual obstacles
  • Gradual Awareness - Breakthrough moments come through progressive realization of possibilities
  • Environmental Influence - Being around ambitious people expands perception of what's achievable

Personal Growth Through Challenge:

  • Passion-Driven Choices - Allocating time based on genuine interests rather than external expectations
  • International Mobility - Overcoming geographical and cultural barriers to pursue opportunities
  • Community Selection - Choosing environments that support and amplify ambitious goals

The Inspiration Factor:

  • Positive Change Focus - Emphasis on making meaningful contributions to the world
  • Community Values - Cherishing environments that support transformative work
  • Belief Systems - Surrounding yourself with people who believe change is possible
Jakub Pachocki
Seeing people attack big problems with real ambition—and believing you can make meaningful, positive change—has been incredibly inspiring. It's something I deeply cherish about this community.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Timestamp: [34:58-36:20]Youtube Icon

📚 What's the Iron Man Effect on Real-World Innovation?

From Iron Man Dreams to Deep Learning Reality

The unexpected connections between Paul Graham's philosophy, Marvel superhero movies, and the development of world-changing AI researchers reveal how inspiration comes from surprising sources.

The Accidental Influence Story:

  1. Perfect Timing - Jakub's father gave him "Hackers & Painters" at age 15 when he was uncertain about his future
  2. Unknown Connection - He didn't realize at the time that Paul Graham was a Silicon Valley legend
  3. Community Discovery - Later recognizing the book connected him to the same inspirational community he'd joined
Jakub Pachocki
I didn’t connect the dots at the time, but when I was about 15 and unsure what I wanted to do, my dad gave me the Polish edition of a book I didn’t know—Hackers & Painters. It turned out to be by Paul Graham.
Jakub PachockiOpenAIOpenAI | Chief Scientist

Iron Man's Unexpected Impact:

  • Cinematic Inspiration - Szymon's robotics interest sparked by Marvel's technological vision
  • Reality Check Disappointment - Discovering real robots were far behind movie depictions
  • Serendipitous Redirect - Meeting a deep learning friend during robotics disillusionment
  • Breakthrough Moment - AlphaGo's emergence transforming skepticism into excitement
Szymon Sidor
I have a slightly silly answer to that. I watched Iron Man. When I later started working on robotics, I was disappointed by how far real robots lagged; it hadn’t occurred to me that the movie was, well, a movie.
Szymon SidorOpenAIOpenAI | IC & Leadership

The AlphaGo Transformation:

  • Skepticism to Belief - Initial view of machine learning as hype until witnessing Go mastery
  • Deep Learning Conversion - Both researchers inspired by AlphaGo's demonstration of true AI capability
  • Physical Phenomenon Acceptance - Learning to study AI as natural science rather than pure computer science

Academic Background Reflections:

  • Mathematics Preference - Szymon wishing he'd focused more on mathematical foundations
  • Physics Value - Recognizing theoretical physics training as ideal preparation for AI research
  • Classical Training Challenges - Jakub's initial resistance to deep learning's empirical nature

The Inspiration Validation Principle:

  • No Wrong Influences - Even seemingly "stupid" inspirations like superhero movies can lead to meaningful work
  • Diverse Pathways - From magic shows to AI research, unconventional backgrounds bring unique perspectives
  • Dream Permission - Books and movies that encourage big thinking have genuine transformative power

Timestamp: [36:20-39:45]Youtube Icon

💎 Key Insights from [33:50-40:19]

Essential Insights:

  1. Programming Remains Essential - Despite AI automation, learning to code develops structured thinking and problem decomposition skills that remain valuable across domains and technological changes
  2. Perceived Constraints Are Often Illusory - Many barriers to ambitious goals exist primarily in our minds; progressive realization of possibilities combined with supportive communities can unlock transformative opportunities
  3. Inspiration Sources Are Unpredictable - Meaningful career directions can emerge from random book gifts, superhero movies, or unexpected scientific breakthroughs, validating the importance of staying open to diverse influences

Actionable Insights:

  • For Students: Learn programming not just for current utility but for developing analytical thinking skills that will remain valuable regardless of future technological automation
  • For Individuals: Question perceived limitations and actively seek environments filled with ambitious people who believe meaningful change is possible
  • For Educators: Encourage exposure to diverse inspirational sources, from technical books to popular media, as breakthrough moments often come from unexpected directions

Timestamp: [33:50-40:19]Youtube Icon

📚 References from [33:50-40:19]

People Mentioned:

  • Paul Graham - Entrepreneur and writer whose book "Hackers & Painters" influenced Jakub at age 15, though he didn't realize the connection to Silicon Valley culture at the time
  • Andy Weir - Author of "The Martian," referenced as example of how fiction can inspire real scientific careers

Books & Publications:

  • Hackers & Painters - Paul Graham's influential book that shaped Jakub's thinking about technology and ambition during his formative years
  • The Martian - Science fiction novel mentioned as inspiration for NASA scientists despite technical inaccuracies

Movies & Entertainment:

  • Iron Man - Marvel superhero movie that inspired Szymon's initial interest in robotics, despite later disappointment with real-world robotic capabilities
  • Thor - Jokingly referenced as alternative superhero inspiration that might have led to different career outcomes

Technologies & Breakthroughs:

  • AlphaGo - DeepMind's Go-playing AI that transformed both researchers' skepticism about machine learning into genuine excitement and belief
  • AlphaGo Zero - Self-taught version that eliminated human training data, representing a major milestone in AI self-improvement capabilities

Organizations & Institutions:

  • DeepMind - Google's AI research lab responsible for AlphaGo breakthrough that inspired the researchers
  • NASA - Referenced in context of scientists inspired by science fiction to pursue botanical research

Concepts & Frameworks:

  • Structured Intellect - Cognitive skill for breaking complex problems into manageable components, essential regardless of technological automation
  • Problem Decomposition - Analytical thinking approach that remains valuable across domains and technological changes
  • Prompt Engineering - AI interaction technique that benefits from programming logic understanding
  • Convex Optimization - Mathematical approach Jakub initially preferred before embracing deep learning's empirical methods
  • Physical Phenomenon Study - Approach to understanding AI systems through empirical observation rather than pure theoretical analysis

Academic Fields:

  • Deep Learning - Machine learning approach initially viewed skeptically by both researchers before AlphaGo demonstration
  • Theoretical Computer Science - Academic background both researchers consider valuable for AI research
  • Mathematics/Physics - Academic foundations Szymon wishes he had pursued more extensively for AI research preparation

Timestamp: [33:50-40:19]Youtube Icon