issues
search
facebookresearch
/
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Other
10.5k
stars
1.02k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Seamless Streaming using microphone input
#484
SherwinDesouza
opened
1 day ago
0
Does t2tt and t2ts support real-time streaming generation?
#483
suxuanning
opened
1 day ago
2
Norwagian Language Not supported this model
#482
Suryaravi0510
opened
2 days ago
0
Fix broken m4t_prepare_dataset guides
#481
kujyp
opened
2 days ago
2
I am going crazy waiting for the training and fine-tuning code for the SPEECH TO SPEECH task
#480
mc112611
opened
3 days ago
0
How use checkpoints that i got from fine-tuning??
#479
amirmfarzane
opened
5 days ago
0
آیا شما توانسته اید یادگیری مدل را اصلاح کنید ؟؟؟
#478
developeranalyser
opened
6 days ago
0
Cannot finetune TEXT_TO_SPEECH and SPEECH_TO_SPEECH
#477
yiyibooks
opened
1 week ago
1
T2TT not works
#476
zsq2010
opened
1 week ago
0
[help] libsndfile is not found | mac m1 proMax, sonoma14.4
#475
OpenSourceCommunityInterface
opened
1 week ago
1
$$ FineTune Not Work !
#474
developeranalyser
opened
1 week ago
0
tensor format input audio translation error
#473
DengHao97
opened
1 week ago
0
Bug in fine-tuning
#472
amirmfarzane
opened
2 weeks ago
11
Cast error details: Unable to cast Python instance of type <class 'pathlib.PosixPath'> to C++ type '?' (#define PYBIND11_DETAILED_ERROR_MESSAGES or compile in debug mode for details)
#471
liuhao0813
opened
2 weeks ago
0
RuntimeError: torchaudio_sox::save_audio_file() Expected a value of type 'str' for argument '_0' but instead found type 'PosixPath'.
#470
liuhao0813
opened
2 weeks ago
0
pip dependency conflict when installing
#469
ivanhe123
opened
2 weeks ago
0
S2S aligned metadata "extension" is a subset of prior metadata release?
#467
arlofaria-cartesia
opened
3 weeks ago
0
Obtain the speech embedding
#466
Sameep-c
opened
3 weeks ago
0
seamless-streaming inference error
#465
LesterGong
closed
1 week ago
0
Reproduciblity/seed
#464
Satyam52
opened
3 weeks ago
0
[ACL2024] DINO-PRETSSEL demo page
#463
mjhwang93
closed
4 weeks ago
0
Have anyone face this problem when finetune
#462
xufeiqiong
opened
4 weeks ago
1
How to find out speaker id for certain languages? Is there any reference?
#461
NBStarry
opened
1 month ago
0
Deployment of Seamless M4T Model - Exporting text.decoder to ONNX or Using torch.jit.trace
#460
HesamAlavian
opened
1 month ago
0
Fix potential issue with downloading Gigaspeech
#458
zrthxn
closed
1 month ago
0
I can't run bash script for generate dataset for fine-tuning m4t model!
#457
amirmfarzane
opened
1 month ago
0
Fix finetuning script
#456
mhlakhani
closed
1 month ago
2
Is there a person in this desert who can answer me?
#454
developeranalyser
opened
1 month ago
4
Documentation for denoising and segmentation
#453
am831
closed
1 month ago
0
text to text not showing even as an option m4t_predict: error: argument --task: invalid choice: 'T2TT' (choose from 'ASR', 'S2ST', 'S2TT')
#452
gloomiebloomie
closed
1 week ago
2
Any plans to release finetuning script for Seamless Streaming?
#451
mussakhojayeva
opened
1 month ago
0
Train speech language ID classification head
#450
am831
opened
1 month ago
0
Fixed finetuning trainer and enable freezing layers
#449
zrthxn
closed
1 month ago
0
M4T V2 --- Do you know how to teach the TEXT_TO_SPEECH ?? Does that mean no one has done it or you don't want to give me a guide? Thankful
#448
developeranalyser
opened
1 month ago
0
Language ID Model
#447
zrthxn
closed
1 month ago
0
Replace print statements with logger.info()
#446
am831
closed
1 month ago
0
Incorrect output for T2TT between English and Japanese on MPS backend.
#445
pettta
opened
1 month ago
1
Sequential Execution of translator.predict() in Multithreaded Environment
#444
lin-xiaosheng
opened
1 month ago
0
Enable downloading Gigaspeech
#443
zrthxn
closed
1 month ago
0
After downloading the seamless-m4t-large model file, not able to inference from local files.
#442
navjots121
opened
1 month ago
0
Denoise audio with Demucs and pipeline with Transcriber
#441
am831
closed
1 month ago
0
Why the extract_alignment function does not return times?
#440
fishfree
opened
2 months ago
0
Update alignment_extractor.py
#439
uralik
closed
2 months ago
0
Fine Tune Not work for text to speech ???????????????????????????????
#438
developeranalyser
closed
2 months ago
2
Odd results in ASR. Does it have a chat language model and text smoothing?
#437
cageyoko
opened
2 months ago
0
Please explain this part of the code to me
#436
developeranalyser
closed
2 months ago
0
FineTune on TPU?
#435
developeranalyser
closed
2 months ago
0
Train NEW
#434
developeranalyser
closed
2 months ago
0
The large version will crash out even on Colab A100. Maybe trying using the medium version.
#433
developeranalyser
closed
2 months ago
1
posible text to speech fineTune ???
#431
developeranalyser
closed
2 months ago
2
Next