
OpenAI’s Deep Research Team on Why Reinforcement Learning is the Future for AI Agents
OpenAI’s Isa Fulford and Josh Tobin discuss how the company’s newest agent, Deep Research, represents a breakthrough in AI research capabilities by training models end-to-end rather than using hand-coded operational graphs. The product leads explain how high-quality training data and the o3 model’s reasoning abilities enable adaptable research strategies, and why OpenAI thinks Deep Research will capture a meaningful percentage of knowledge work. Key product decisions that build transparency and trust include citations and clarification flows. By compressing hours of work into minutes, Deep Research transforms what’s possible for many business and consumer use cases.
Hosted by: Sonya Huang and Lauren Reeder, Sequoia Capital
Mentioned in this episode:
- Yann LeCun’s Cake: An analogy Meta AI’s leader shared in his 2016 NIPS keynote