AI Safety: Constitutional AI vs Human Feedback
With great power comes great responsibility. How do leading AI companies implement safety and ethics as language models scale? OpenAI uses its Model Spec combined with RLHF (Reinforcement Learning from Human Feedback), while Anthropic uses Constitutional AI. This solo episode on AI alignment covers the technical approaches to maximizing usefulness while minimizing harm.
REFERENCE
OpenAI Model Spec
https://cdn.openai.com/spec/model-spec-2024-05-08.html#overview
Anthropic Constitutional AI
https://www.anthropic.com/news/claudes-constitution
To stay in touch, sign up for our newsletter at https://www.superprompt.fm
29 episodes