issues
search
TanUkkii007
/
papers-i-read
23
stars
3
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
More Kawaii than a Real-Person Live Streamer: Understanding How the Otaku Community Engages with and Perceives Virtual YouTubers
#676
TanUkkii007
opened
3 years ago
0
VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency
#675
TanUkkii007
opened
3 years ago
0
Pretraining Techniques for Sequence-to-Sequence Voice Conversion
#674
TanUkkii007
opened
3 years ago
0
Neural Text Generation With Unlikelihood Training
#673
TanUkkii007
closed
2 years ago
0
DALL·E: Creating Images from Text
#672
TanUkkii007
opened
3 years ago
0
Taming Transformers for High-Resolution Image Synthesis
#671
TanUkkii007
opened
3 years ago
0
VIBE: Video Inference for Human Body Pose and Shape Estimation
#670
TanUkkii007
opened
3 years ago
0
Contact and Human Dynamics from Monocular Video
#669
TanUkkii007
opened
3 years ago
0
Challenges in Deploying Machine Learning: a Survey of Case Studies
#668
TanUkkii007
opened
3 years ago
0
Confident Learning: Estimating Uncertainty in Dataset Labels
#667
TanUkkii007
closed
3 years ago
0
Differentiable Divergences Between Time Series
#666
TanUkkii007
opened
3 years ago
0
Deep Reinforcement Learning For Sequence to Sequence Models
#665
TanUkkii007
opened
3 years ago
0
End-to-End Adversarial Text-to-Speech
#664
TanUkkii007
opened
3 years ago
0
Self-supervised Learning: Generative or Contrastive
#663
TanUkkii007
opened
3 years ago
0
It's Not Just Size That Matters: Small Language Models Are Also Few-Shot Learners
#662
TanUkkii007
opened
3 years ago
0
Supervised and Unsupervised Approaches for Controlling Narrow Lexical Focus in Sequence-to-Sequence Speech Synthesis
#661
TanUkkii007
opened
3 years ago
0
A Review of Speaker Diarization: Recent Advances with Deep Learning
#660
TanUkkii007
opened
3 years ago
0
Document-Level Neural TTS Using Curriculum Learning and Attention Masking
#659
TanUkkii007
opened
3 years ago
0
Neural TTS Voice Conversion
#658
TanUkkii007
opened
3 years ago
0
High Fidelity Speech Synthesis with Adversarial Networks
#657
TanUkkii007
opened
3 years ago
0
Incremental Text to Speech for Neural Sequence-to-Sequence Models Using Reinforcement Learning
#656
TanUkkii007
closed
3 years ago
0
Self-supervised Pitch Detection by Inverse Audio Synthesis
#655
TanUkkii007
opened
3 years ago
0
Reading Wikipedia to Answer Open-Domain Questions
#654
TanUkkii007
opened
3 years ago
0
Vector-Quantized Timbre Representation
#653
TanUkkii007
opened
3 years ago
0
A Spectral Energy Distance for Parallel Speech Synthesis
#652
TanUkkii007
opened
3 years ago
0
Controllable Neural Prosody Synthesis
#651
TanUkkii007
opened
3 years ago
0
Phonological Features for 0-shot Multilingual Speech Synthesis
#650
TanUkkii007
opened
3 years ago
0
WaveGrad: Estimating Gradients for Waveform Generation
#649
TanUkkii007
opened
3 years ago
0
Controllable neural text-to-speech synthesis using intuitive prosodic features
#648
TanUkkii007
opened
3 years ago
0
DiffWave: A Versatile Diffusion Model for Audio Synthesis
#647
TanUkkii007
closed
3 years ago
0
Improving Zero-Shot Voice Style Transfer via Disentangled Representation Learning
#646
TanUkkii007
opened
3 years ago
0
AdaSpeech: Adaptive Text to Speech for Custom Voice
#645
TanUkkii007
opened
3 years ago
0
Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech
#644
TanUkkii007
closed
3 years ago
1
Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech
#643
TanUkkii007
opened
3 years ago
0
GiantMIDI-Piano: A large-scale MIDI dataset for classical piano music
#642
TanUkkii007
opened
3 years ago
0
The Blizzard Challenge 2020
#641
TanUkkii007
opened
3 years ago
0
Automatic multitrack mixing with a differentiable mixing console of neural audio effects
#640
TanUkkii007
opened
3 years ago
0
Learning Speaker Embedding from Text-to-Speech
#639
TanUkkii007
opened
3 years ago
0
TTS-by-TTS: TTS-driven Data Augmentation for Fast and High-Quality Speech Synthesis
#638
TanUkkii007
opened
3 years ago
0
Perceptually Guided End-to-End Text-to-Speech
#637
TanUkkii007
opened
3 years ago
0
Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis
#636
TanUkkii007
closed
3 years ago
0
Speaker Recognition Based on Deep Learning: An Overview
#635
TanUkkii007
opened
3 years ago
0
MLS: A Large-Scale Multilingual Dataset for Speech Research
#634
TanUkkii007
opened
3 years ago
0
Multi-Instrumentalist Net: Unsupervised Generation of Music from Body Movements
#633
TanUkkii007
opened
3 years ago
0
I'm Sorry for Your Loss: Spectrally-Based Audio Distances Are Bad at Pitch
#632
TanUkkii007
opened
3 years ago
0
NHSS: A Speech and Singing Parallel Database
#631
TanUkkii007
opened
3 years ago
0
A Comprehensive Survey on Deep Music Generation: Multi-level Representations, Algorithms, Evaluations, and Future Directions
#630
TanUkkii007
opened
3 years ago
0
An Overview of Voice Conversion and its Challenges: From Statistical Modeling to Deep Learning
#629
TanUkkii007
opened
3 years ago
0
Non-Attentive Tacotron: Robust and Controllable Neural TTS Synthesis Including Unsupervised Duration Modeling
#628
TanUkkii007
closed
3 years ago
1
TalkNet: Fully-Convolutional Non-Autoregressive Speech Synthesis Model
#627
TanUkkii007
closed
3 years ago
1
Previous
Next