Player FM 앱으로 오프라인으로 전환하세요!
Will we have Superintelligence by 2028? With Anthropic’s Ben Mann
Manage episode 488306266 series 3444082
What happens when you give AI researchers unlimited compute and tell them to compete for the highest usage rates? Ben Mann, Co-Founder, from Anthropic sits down with Sarah Guo and Elad Gil to explain how Claude 4 went from "reward hacking" to efficiently completing tasks and how they're racing to solve AI safety before deploying computer-controlling agents. Ben talks about economic Turing tests, the future of general versus specialized AI models, Reinforcement Learning From AI Feedback (RLAIF), and Anthropic’s Model Context Protocol (MCP). Plus, Ben shares his thoughts on if we will have Superintelligence by 2028.
Sign up for new podcasts every week. Email feedback to [email protected]
Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @8enmann
Links:
Chapters:
00:00 Ben Mann Introduction
00:33 Releasing Claude 4
02:05 Claude 4 Highlights and Improvements
03:42 Advanced Use Cases and Capabilities
06:42 Specialization and Future of AI Models
09:35 Anthropic's Approach to Model Development
18:08 Human Feedback and AI Self-Improvement
19:15 Principles and Correctness in Model Training
20:58 Challenges in Measuring Correctness
21:42 Human Feedback and Preference Models
23:38 Empiricism and Real-World Applications
27:02 AI Safety and Ethical Considerations
28:13 AI Alignment and High-Risk Research
30:01 Responsible Scaling and Safety Policies
35:08 Future of AI and Emerging Behaviors
38:35 Model Context Protocol (MCP) and Industry Standards
41:00 Conclusion
136 에피소드
Manage episode 488306266 series 3444082
What happens when you give AI researchers unlimited compute and tell them to compete for the highest usage rates? Ben Mann, Co-Founder, from Anthropic sits down with Sarah Guo and Elad Gil to explain how Claude 4 went from "reward hacking" to efficiently completing tasks and how they're racing to solve AI safety before deploying computer-controlling agents. Ben talks about economic Turing tests, the future of general versus specialized AI models, Reinforcement Learning From AI Feedback (RLAIF), and Anthropic’s Model Context Protocol (MCP). Plus, Ben shares his thoughts on if we will have Superintelligence by 2028.
Sign up for new podcasts every week. Email feedback to [email protected]
Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @8enmann
Links:
Chapters:
00:00 Ben Mann Introduction
00:33 Releasing Claude 4
02:05 Claude 4 Highlights and Improvements
03:42 Advanced Use Cases and Capabilities
06:42 Specialization and Future of AI Models
09:35 Anthropic's Approach to Model Development
18:08 Human Feedback and AI Self-Improvement
19:15 Principles and Correctness in Model Training
20:58 Challenges in Measuring Correctness
21:42 Human Feedback and Preference Models
23:38 Empiricism and Real-World Applications
27:02 AI Safety and Ethical Considerations
28:13 AI Alignment and High-Risk Research
30:01 Responsible Scaling and Safety Policies
35:08 Future of AI and Emerging Behaviors
38:35 Model Context Protocol (MCP) and Industry Standards
41:00 Conclusion
136 에피소드
모든 에피소드
×플레이어 FM에 오신것을 환영합니다!
플레이어 FM은 웹에서 고품질 팟캐스트를 검색하여 지금 바로 즐길 수 있도록 합니다. 최고의 팟캐스트 앱이며 Android, iPhone 및 웹에서도 작동합니다. 장치 간 구독 동기화를 위해 가입하세요.