TanUkkii007 papers-i-read issues

23 stars, 3 forks

issues

More Kawaii than a Real-Person Live Streamer: Understanding How the Otaku Community Engages with and Perceives Virtual YouTubers

#676 TanUkkii007 opened 3 years ago
0
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency

#675 TanUkkii007 opened 3 years ago
0
Pretraining Techniques for Sequence-to-Sequence Voice Conversion

#674 TanUkkii007 opened 3 years ago
0
Neural Text Generation With Unlikelihood Training

#673 TanUkkii007 closed 2 years ago
0
DALL·E: Creating Images from Text

#672 TanUkkii007 opened 3 years ago
0
Taming Transformers for High-Resolution Image Synthesis

#671 TanUkkii007 opened 3 years ago
0
VIBE: Video Inference for Human Body Pose and Shape Estimation

#670 TanUkkii007 opened 3 years ago
0
Contact and Human Dynamics from Monocular Video

#669 TanUkkii007 opened 3 years ago
0
Challenges in Deploying Machine Learning: a Survey of Case Studies

#668 TanUkkii007 opened 3 years ago
0
Confident Learning: Estimating Uncertainty in Dataset Labels

#667 TanUkkii007 closed 3 years ago
0
Differentiable Divergences Between Time Series

#666 TanUkkii007 opened 3 years ago
0
Deep Reinforcement Learning For Sequence to Sequence Models

#665 TanUkkii007 opened 3 years ago
0
End-to-End Adversarial Text-to-Speech

#664 TanUkkii007 opened 3 years ago
0
Self-supervised Learning: Generative or Contrastive

#663 TanUkkii007 opened 3 years ago
0
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners

#662 TanUkkii007 opened 3 years ago
0
Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis

#661 TanUkkii007 opened 3 years ago
0
A Review of Speaker Diarization: Recent Advances with Deep Learning

#660 TanUkkii007 opened 3 years ago
0
Document-Level Neural TTS Using Curriculum Learning and Attention Masking

#659 TanUkkii007 opened 3 years ago
0
Neural TTS Voice Conversion

#658 TanUkkii007 opened 3 years ago
0
High Fidelity Speech Synthesis with Adversarial Networks

#657 TanUkkii007 opened 3 years ago
0
Incremental Text to Speech for Neural Sequence-to-Sequence Models Using Reinforcement Learning

#656 TanUkkii007 closed 3 years ago
0
Self-supervised Pitch Detection by Inverse Audio Synthesis

#655 TanUkkii007 opened 3 years ago
0
Reading Wikipedia to Answer Open-Domain Questions

#654 TanUkkii007 opened 3 years ago
0
Vector-Quantized Timbre Representation

#653 TanUkkii007 opened 3 years ago
0
A Spectral Energy Distance for Parallel Speech Synthesis

#652 TanUkkii007 opened 3 years ago
0
Controllable Neural Prosody Synthesis

#651 TanUkkii007 opened 3 years ago
0
Phonological Features for 0-shot Multilingual Speech Synthesis

#650 TanUkkii007 opened 3 years ago
0
WaveGrad: Estimating Gradients for Waveform Generation

#649 TanUkkii007 opened 3 years ago
0
Controllable neural text-to-speech synthesis using intuitive prosodic features

#648 TanUkkii007 opened 3 years ago
0
DiffWave: A Versatile Diffusion Model for Audio Synthesis

#647 TanUkkii007 closed 3 years ago
0
Improving Zero-Shot Voice Style Transfer via Disentangled Representation Learning

#646 TanUkkii007 opened 3 years ago
0
AdaSpeech: Adaptive Text to Speech for Custom Voice

#645 TanUkkii007 opened 3 years ago
0
Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech

#644 TanUkkii007 closed 3 years ago
1
Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech

#643 TanUkkii007 opened 3 years ago
0
GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music

#642 TanUkkii007 opened 3 years ago
0
The Blizzard Challenge 2020

#641 TanUkkii007 opened 3 years ago
0
Automatic multitrack mixing with a differentiable mixing console of neural audio effects

#640 TanUkkii007 opened 3 years ago
0
Learning Speaker Embedding from Text-to-Speech

#639 TanUkkii007 opened 3 years ago
0
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis

#638 TanUkkii007 opened 3 years ago
0
Perceptually Guided End-to-End Text-to-Speech

#637 TanUkkii007 opened 3 years ago
0
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis

#636 TanUkkii007 closed 3 years ago
0
Speaker Recognition Based on Deep Learning: An Overview

#635 TanUkkii007 opened 3 years ago
0
MLS: A Large-Scale Multilingual Dataset for Speech Research

#634 TanUkkii007 opened 3 years ago
0
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements

#633 TanUkkii007 opened 3 years ago
0
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch

#632 TanUkkii007 opened 3 years ago
0
NHSS: A Speech and Singing Parallel Database

#631 TanUkkii007 opened 3 years ago
0
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions

#630 TanUkkii007 opened 3 years ago
0
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning

#629 TanUkkii007 opened 3 years ago
0
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling

#628 TanUkkii007 closed 3 years ago
1
TalkNet: Fully-Convolutional Non-Autoregressive Speech Synthesis Model

#627 TanUkkii007 closed 3 years ago
1
