Csaba Szepesvari

TalkRL: The Reinforcement Learning Podcast

Robin Ranjit Singh Chauhan에서 제공하는 콘텐츠입니다. 에피소드, 그래픽, 팟캐스트 설명을 포함한 모든 팟캐스트 콘텐츠는 Robin Ranjit Singh Chauhan 또는 해당 팟캐스트 플랫폼 파트너가 직접 업로드하고 제공합니다. 누군가가 귀하의 허락 없이 귀하의 저작물을 사용하고 있다고 생각되는 경우 여기에 설명된 절차를 따르실 수 있습니다 https://ko.player.fm/legal.

4y ago 48:42

MP3•에피소드 홈

Csaba Szepesvari is:

Head of the Foundations Team at DeepMind
Professor of Computer Science at the University of Alberta
Canada CIFAR AI Chair
Fellow at the Alberta Machine Intelligence Institute
Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning

References

Bandit based monte-carlo planning, Levente Kocsis, Csaba Szepesvári
Bandit Algorithms, Tor Lattimore, Csaba Szepesvári
Algorithms for Reinforcement Learning, Csaba Szepesvári
The Predictron: End-To-End Learning and Planning, David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
A Bayesian framework for reinforcement learning, Strens
Solving Rubik’s Cube with a Robot Hand ; Paper, OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang
The Nonstochastic Multiarmed Bandit Problem, Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire
Deep Learning with Bayesian Principles, Mohammad Emtiyaz Khan
Tackling climate change with Machine Learning David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

53 에피소드

#Reinforcement Learning #Machine Learning #Robin Ranjit Singh Chauhan #Artificial Intelligence #Tech

Csaba Szepesvari

TalkRL: The Reinforcement Learning Podcast

82 subscribers

published 4y ago

MP3•에피소드 홈

Csaba Szepesvari is:

Head of the Foundations Team at DeepMind
Professor of Computer Science at the University of Alberta
Canada CIFAR AI Chair
Fellow at the Alberta Machine Intelligence Institute
Co-Author of the book Bandit Algorithms along with Tor Lattimore, and author of the book Algorithms for Reinforcement Learning

References

Bandit based monte-carlo planning, Levente Kocsis, Csaba Szepesvári
Bandit Algorithms, Tor Lattimore, Csaba Szepesvári
Algorithms for Reinforcement Learning, Csaba Szepesvári
The Predictron: End-To-End Learning and Planning, David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris
A Bayesian framework for reinforcement learning, Strens
Solving Rubik’s Cube with a Robot Hand ; Paper, OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang
The Nonstochastic Multiarmed Bandit Problem, Peter Auer, Nicolò Cesa-Bianchi, Yoav Freund, and Robert E. Schapire
Deep Learning with Bayesian Principles, Mohammad Emtiyaz Khan
Tackling climate change with Machine Learning David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

53 에피소드

#Reinforcement Learning #Machine Learning #Robin Ranjit Singh Chauhan #Artificial Intelligence #Tech

Semua episode

플레이어 FM에 오신것을 환영합니다!

플레이어 FM은 웹에서 고품질 팟캐스트를 검색하여 지금 바로 즐길 수 있도록 합니다. 최고의 팟캐스트 앱이며 Android, iPhone 및 웹에서도 작동합니다. 장치 간 구독 동기화를 위해 가입하세요.

500개 이상의 주제를 청취

TalkRL: The Reinforcement Learning Podcast와 비슷한 콘텐츠

들어볼 가치가 있는 팟캐스트

TalkRL: The Reinforcement Learning Podcast « » Csaba Szepesvari

Csaba Szepesvari

들어볼 가치가 있는 팟캐스트

플레이어 FM에 오신것을 환영합니다!

TalkRL: The Reinforcement Learning Podcast와 비슷한 콘텐츠

빠른 참조 가이드

TalkRL: The Reinforcement Learning Podcast « »
Csaba Szepesvari