Sang-Hoon Lee, Seung-Bin Kim, Ji-Hyun Lee, Eunwoo Song, Min-Jae Hwang, Seong-Whan Lee · HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis · SlidesLive

Kategorie

CS

Přihlásit se Kontaktujte nás

Další

Živý přenos začne již brzy!

Živý přenos již skončil.

Prezentace ještě nebyla nahrána!

SlidesLive

title: HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis

0:00 / 0:00

Nahlásit chybu
Nastavení
Playlisty
Záložky
Titulky Off
Rychlost přehrávání
Kvalita

Nastavení
Debug informace
Server sl-yoda-v2-stream-001-alpha.b-cdn.net
Velikost titulků Střední

Záložky

Server
sl-yoda-v2-stream-001-alpha.b-cdn.net
sl-yoda-v2-stream-001-beta.b-cdn.net
1824830694.rsc.cdn77.org
1979322955.rsc.cdn77.org

Titulky
Off
English

Rychlost přehrávání

Kvalita

Velikost titulků
Velké
Střední
Malé

Mode
Video Slideshow
Audio Slideshow
Slideshow
Video

HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis

HierSpeech: Bridging the Gap between Text and Speech by Hierarchical Variational Inference using Self-supervised Representations for Speech Synthesis

28. listopadu 2022

Řečníci

Sang-Hoon Lee

Řečník · 0 sledujících

Seung-Bin Kim

Řečník · 0 sledujících

Ji-Hyun Lee

Řečník · 0 sledujících

O prezentaci

This paper presents HierSpeech, a high-quality end-to-end text-to-speech (TTS) system based on a hierarchical conditional variational autoencoder (VAE) utilizing self-supervised speech representations. Recently, single-stage TTS systems, which directly generate raw speech waveform from text, have been getting interest thanks to their ability in generating high-quality audio within a fully end-to-end training pipeline. However, there is still a room for improvement in the conventional TTS systems…

Organizátor

NeurIPS 2022

Účet · 962 sledujících

Baví vás formát? Nechte SlidesLive zachytit svou akci!

Profesionální natáčení a streamování po celém světě.

Sdílení

Doporučená videa

Prezentace na podobné téma, kategorii nebo přednášejícího

Efficient Sampling on Riemannian Manifolds via Langevin MCMC

05:15

Efficient Sampling on Riemannian Manifolds via Langevin MCMC

Zhlédnout později

Oblíbené

Xiang Cheng, …

NeurIPS 2022 2 years ago

Understanding the Evolution of Linear Regions in Deep Reinforcement Learning

04:46

Understanding the Evolution of Linear Regions in Deep Reinforcement Learning

Zhlédnout později

Oblíbené

Setareh Cohan, …

NeurIPS 2022 2 years ago

Reproducibility in Optimization: Theoretical Framework and Limits

05:19

Reproducibility in Optimization: Theoretical Framework and Limits

Zhlédnout později

Oblíbené

Kwangjun Ahn, …

NeurIPS 2022 2 years ago

Combinatorial Bandits with Linear Constraints: Beyond Knapsacks and Fairness

05:03

Combinatorial Bandits with Linear Constraints: Beyond Knapsacks and Fairness

Zhlédnout později

Oblíbené

Qingsong Liu, …

NeurIPS 2022 2 years ago

Capturing Failures of Large Language Models via Human Cognitive Biases

05:10

Capturing Failures of Large Language Models via Human Cognitive Biases

Zhlédnout později

Oblíbené

Erik Jones, …

NeurIPS 2022 2 years ago

User and Technical Perspectives of Controllable Code Generation

05:59

User and Technical Perspectives of Controllable Code Generation

Zhlédnout později

Oblíbené

Stephanie Houde, …

NeurIPS 2022 2 years ago