
Player FM 앱으로 오프라인으로 전환하세요!
The Startup Powering The Data Behind AGI
Manage episode 506743338 series 3011550
In this episode of Gradient Dissent, Lukas Biewald talks with the CEO & founder of Surge AI, the billion-dollar company quietly powering the next generation of frontier LLMs. They discuss Surge's origin story, why traditional data labeling is broken, and how their research-focused approach is reshaping how models are trained.
You’ll hear why inter-annotator agreement fails in high-complexity tasks like poetry and math, why synthetic data is often overrated, and how Surge builds rich RL environments to stress-test agentic reasoning. They also go deep on what kinds of data will be critical to future progress in AI—from scientific discovery to multimodal reasoning and personalized alignment.
It’s a rare, behind-the-scenes look into the world of high-quality data generation at scale—straight from the team most frontier labs trust to get it right.
Timestamps:
00:00 – Intro: Who is Edwin Chen?
03:40 – The problem with early data labeling systems
06:20 – Search ranking, clickbait, and product principles
10:05 – Why Surge focused on high-skill, high-quality labeling
13:50 – From Craigslist workers to a billion-dollar business
16:40 – Scaling without funding and avoiding Silicon Valley status games
21:15 – Why most human data platforms lack real tech
25:05 – Detecting cheaters, liars, and low-quality labelers
28:30 – Why inter-annotator agreement is a flawed metric
32:15 – What makes a great poem? Not checkboxes
36:40 – Measuring subjective quality rigorously
40:00 – What types of data are becoming more important
44:15 – Scientific collaboration and frontier research data
47:00 – Multimodal data, Argentinian coding, and hyper-specificity
50:10 – What's wrong with LMSYS and benchmark hacking
53:20 – Personalization and taste in model behavior
56:00 – Synthetic data vs. high-quality human data
Follow Weights & Biases:
https://twitter.com/weights_biases
https://www.linkedin.com/company/wandb
128 에피소드
Manage episode 506743338 series 3011550
In this episode of Gradient Dissent, Lukas Biewald talks with the CEO & founder of Surge AI, the billion-dollar company quietly powering the next generation of frontier LLMs. They discuss Surge's origin story, why traditional data labeling is broken, and how their research-focused approach is reshaping how models are trained.
You’ll hear why inter-annotator agreement fails in high-complexity tasks like poetry and math, why synthetic data is often overrated, and how Surge builds rich RL environments to stress-test agentic reasoning. They also go deep on what kinds of data will be critical to future progress in AI—from scientific discovery to multimodal reasoning and personalized alignment.
It’s a rare, behind-the-scenes look into the world of high-quality data generation at scale—straight from the team most frontier labs trust to get it right.
Timestamps:
00:00 – Intro: Who is Edwin Chen?
03:40 – The problem with early data labeling systems
06:20 – Search ranking, clickbait, and product principles
10:05 – Why Surge focused on high-skill, high-quality labeling
13:50 – From Craigslist workers to a billion-dollar business
16:40 – Scaling without funding and avoiding Silicon Valley status games
21:15 – Why most human data platforms lack real tech
25:05 – Detecting cheaters, liars, and low-quality labelers
28:30 – Why inter-annotator agreement is a flawed metric
32:15 – What makes a great poem? Not checkboxes
36:40 – Measuring subjective quality rigorously
40:00 – What types of data are becoming more important
44:15 – Scientific collaboration and frontier research data
47:00 – Multimodal data, Argentinian coding, and hyper-specificity
50:10 – What's wrong with LMSYS and benchmark hacking
53:20 – Personalization and taste in model behavior
56:00 – Synthetic data vs. high-quality human data
Follow Weights & Biases:
https://twitter.com/weights_biases
https://www.linkedin.com/company/wandb
128 에피소드
모든 에피소드
×플레이어 FM에 오신것을 환영합니다!
플레이어 FM은 웹에서 고품질 팟캐스트를 검색하여 지금 바로 즐길 수 있도록 합니다. 최고의 팟캐스트 앱이며 Android, iPhone 및 웹에서도 작동합니다. 장치 간 구독 동기화를 위해 가입하세요.