Michael Dennis

TalkRL: The Reinforcement Learning Podcast

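Content provided by Robin Ranjit Singh Chauhan. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process described here: https://ko.player.fm/legal.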

4+ y ago 1:00:50

Michael Dennis is a PhD student at the Center for Human-Compatible AI at UC Berkeley, supervised by Professor Stuart Russell.

I'm interested in robustness in RL and multi-agent RL, specifically as it applies to making the interaction between AI systems and society at large to be more beneficial.

--Michael Dennis

Featured References

Emergent Complexity and Zero-shot Transfer via Unsupervised Environment Design [PAIRED]
Michael Dennis, Natasha Jaques, Eugene Vinitsky, Alexandre Bayen, Stuart Russell, Andrew Critch, Sergey Levine
Videos

Adversarial Policies: Attacking Deep Reinforcement Learning
Adam Gleave, Michael Dennis, Cody Wild, Neel Kant, Sergey Levine, Stuart Russell
Homepage and Videos

Accumulating Risk Capital Through Investing in Cooperation
Charlotte Roman, Michael Dennis, Andrew Critch, Stuart Russell

Quantifying Differences in Reward Functions [EPIC]
Adam Gleave, Michael Dennis, Shane Legg, Stuart Russell, Jan Leike

Additional References

Safe Opponent Exploitation, Sam Ganzfried and Tuomas Sandholm 2015
Social Influence as Intrinsic Motivation for Multi-Agent Deep Reinforcement Learning, Natasha Jaques et al 2019
Autocurricula and the Emergence of Innovation from Social Interaction: A Manifesto for Multi-Agent Intelligence Research, Leibo et al 2019
Leveraging Procedural Generation to Benchmark Reinforcement Learning, Karl Cobbe et al 2019
Paired Open-Ended Trailblazer (POET): Endlessly Generating Increasingly Complex and Diverse Learning Environments and Their Solutions, Wang et al 2019
Consequences of Misaligned AI, Zhuang et al 2020
Conservative Agency via Attainable Utility Preservation, Turner et al 2019


