            Parallel Q-Learning: a Scheme for Time-efficient Reinforcement Learning

            July 24, 2023

            Speakers

            Zechu Li

            Tao Chen

            Zhang-Wei Hong

            About the presentation

            Reinforcement learning algorithms require a long time to learn policies on complex tasks because they need a large amount of training data. With recent advances in GPU-based simulation, such as Isaac Gym, data collection has been sped up thousands of times on a commodity GPU. Most prior works have used on-policy methods such as PPO to train policies due to their simplicity and ease of scaling. Off-policy methods are usually more sample-efficient but more challenging to scale up, r…

            Organizer

            ICML 2023
