213 – Are Transformer Models Aligned By Default?
Our species has begun to scrute the inscrutable shoggoth! With Matt Freeman
LINKS
Anthropic’s latest AI Safety research paper, on interpretability
Anthropic is hiring
Episode 93 of The Mind Killer
Talkin’ Fallout
VibeCamp
0:00:17 – A Layman’s AI Refresher
0:21:06 – Aligned By Default
0:50:56 – Highlights from Anthropic’s Latest Interpretability Paper
1:26:47 – Guild of the Rose Update
1:29:40 – Going to VibeCamp
1:37:05 – Feedback
1:43:58 – LessWrong Posts
1:57:30 – Thank the Patron
Our Patreon, or if you prefer, our Substack
Hey look, we have a discord! What could possibly go wrong?
We now partner with The Guild of the Rose, check them out.
Rationality: From AI to Zombies, The Podcast
LessWrong Sequence Posts Discussed in this Episode:
If You Demand Magic, Magic Won’t Help
Next Sequence Posts: