Nan Jiang

TalkRL: The Reinforcement Learning Podcast

Content provided by Robin Ranjit Singh Chauhan. All podcast content, including episodes, graphics, and podcast descriptions, is uploaded and provided directly by Robin Ranjit Singh Chauhan or their podcast platform partner. If you believe someone is using your copyrighted work without your permission, you can follow the process outlined here: https://fr.player.fm/legal.

4y ago 1:11:46


Nan Jiang is an Assistant Professor of Computer Science at the University of Illinois. He was a postdoc at Microsoft Research, and did his PhD at the University of Michigan under Professor Satinder Singh.

Featured References

Reinforcement Learning: Theory and Algorithms
Alekh Agarwal, Nan Jiang, Sham M. Kakade
Model-based RL in Contextual Decision Processes: PAC bounds and Exponential Improvements over Model-free Approaches
Wen Sun, Nan Jiang, Akshay Krishnamurthy, Alekh Agarwal, John Langford
Information-Theoretic Considerations in Batch Reinforcement Learning
Jinglin Chen, Nan Jiang

Additional References

Towards a Unified Theory of State Abstraction for MDPs, Lihong Li, Thomas J. Walsh, Michael L. Littman
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning, Nan Jiang, Lihong Li
Minimax Confidence Interval for Off-Policy Evaluation and Policy Optimization, Nan Jiang, Jiawei Huang
Empirical Study of Off-Policy Policy Evaluation for Reinforcement Learning, Cameron Voloshin, Hoang M. Le, Nan Jiang, Yisong Yue

Errata

[Robin] I misspoke when I said that in domain randomization we want the agent to "ignore" domain parameters. What I should have said is that we want the agent to perform well within some range of domain parameters; it should be robust with respect to domain parameters.

