personabb/survey_paper
issues
【2024/06】Reliability-Neurons: Investigating Neurons that Predict Model Uncertainty
#20
personabb
opened
3 days ago
1
【2024/06】Beyond Turn-Based Games: Real-Time Conversation with Duplex Model
#19
personabb
opened
3 days ago
2
【2024/06】ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning
#18
personabb
opened
4 days ago
1
【2024/06】Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation
#17
personabb
opened
4 days ago
3
【2024/06】Iterative Length-Regularized Direct Preference Optimization: A Case Study on Improving 7B Language Models to GPT-4 Level
#16
personabb
opened
4 days ago
2
【2023/07】VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
#15
personabb
closed
4 days ago
2
【2021/06】End-to-End Text-to-Speech Synthesis with Conditional Variational Autoencoders and Adversarial Learning
#14
personabb
closed
4 days ago
2
【2023/02】PERIOD VITS: VARIATIONAL INFERENCE WITH EXPLICIT PITCH MODELING FOR END-TO-END EMOTIONAL SPEECH SYNTHESIS
#13
personabb
opened
4 days ago
1
【2021/05】Grad-TTS: A Diffusion Probabilistic Model for Text-to-Speech
#12
personabb
opened
4 days ago
1
【2024/04】Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness
#11
personabb
closed
4 days ago
1
【2016/09】WAVENET: A GENERATIVE MODEL FOR RAW AUDIO
#10
personabb
closed
4 days ago
1
【2013】STATISTICAL PARAMETRIC SPEECH SYNTHESIS USING DEEP NEURAL NETWORKS
#9
personabb
opened
4 days ago
1
【2013/05】Speech Synthesis Based on Hidden Markov Models
#8
personabb
opened
4 days ago
2
【2022/05】Voice Activity Projection: Self-supervised Learning of Turn-taking Events
#7
personabb
closed
6 days ago
1
【2022/09】How Much Does Prosody Help Turn-taking? Investigations using Voice Activity Projection Models
#6
personabb
closed
6 days ago
6
【2024/06】Generating Speakers by Prompting Listener Impressions for Pre-trained Multi-Speaker Text-to-Speech Systems
#5
personabb
opened
6 days ago
1
【2024/06】The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
#4
personabb
opened
6 days ago
1
【2023/11】Manifold-Preserving Guidance for Diffusion Models
#3
personabb
closed
6 days ago
4
【2024/01】TURN-TAKING AND BACKCHANNEL PREDICTION WITH ACOUSTIC AND LARGE LANGUAGE MODEL FUSION
#2
personabb
opened
6 days ago
1
【2020/10】TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog
#1
personabb
opened
6 days ago
1