facebookresearch seamless_communication issues

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Other

10.5k stars 1.02k forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Seamless Streaming using microphone input

#484 SherwinDesouza opened 1 day ago
0
Does t2tt and t2ts support real-time streaming generation?

#483 suxuanning opened 1 day ago
2
Norwagian Language Not supported this model

#482 Suryaravi0510 opened 2 days ago
0
Fix broken m4t_prepare_dataset guides

#481 kujyp opened 2 days ago
2
I am going crazy waiting for the training and fine-tuning code for the SPEECH TO SPEECH task

#480 mc112611 opened 3 days ago
0
How use checkpoints that i got from fine-tuning??

#479 amirmfarzane opened 5 days ago
0
آیا شما توانسته اید یادگیری مدل را اصلاح کنید ؟؟؟

#478 developeranalyser opened 6 days ago
0
Cannot finetune TEXT_TO_SPEECH and SPEECH_TO_SPEECH

#477 yiyibooks opened 1 week ago
1
T2TT not works

#476 zsq2010 opened 1 week ago
0
[help] libsndfile is not found | mac m1 proMax, sonoma14.4

#475 OpenSourceCommunityInterface opened 1 week ago
1
$$ FineTune Not Work !

#474 developeranalyser opened 1 week ago
0
tensor format input audio translation error

#473 DengHao97 opened 1 week ago
0
Bug in fine-tuning

#472 amirmfarzane opened 2 weeks ago
11
Cast error details: Unable to cast Python instance of type <class 'pathlib.PosixPath'> to C++ type '?' (#define PYBIND11_DETAILED_ERROR_MESSAGES or compile in debug mode for details)

#471 liuhao0813 opened 2 weeks ago
0
RuntimeError: torchaudio_sox::save_audio_file() Expected a value of type 'str' for argument '_0' but instead found type 'PosixPath'.

#470 liuhao0813 opened 2 weeks ago
0
pip dependency conflict when installing

#469 ivanhe123 opened 2 weeks ago
0
S2S aligned metadata "extension" is a subset of prior metadata release?

#467 arlofaria-cartesia opened 3 weeks ago
0
Obtain the speech embedding

#466 Sameep-c opened 3 weeks ago
0
seamless-streaming inference error

#465 LesterGong closed 1 week ago
0
Reproduciblity/seed

#464 Satyam52 opened 3 weeks ago
0
[ACL2024] DINO-PRETSSEL demo page

#463 mjhwang93 closed 4 weeks ago
0
Have anyone face this problem when finetune

#462 xufeiqiong opened 4 weeks ago
1
How to find out speaker id for certain languages? Is there any reference?

#461 NBStarry opened 1 month ago
0
Deployment of Seamless M4T Model - Exporting text.decoder to ONNX or Using torch.jit.trace

#460 HesamAlavian opened 1 month ago
0
Fix potential issue with downloading Gigaspeech

#458 zrthxn closed 1 month ago
0
I can't run bash script for generate dataset for fine-tuning m4t model!

#457 amirmfarzane opened 1 month ago
0
Fix finetuning script

#456 mhlakhani closed 1 month ago
2
Is there a person in this desert who can answer me?

#454 developeranalyser opened 1 month ago
4
Documentation for denoising and segmentation

#453 am831 closed 1 month ago
0
text to text not showing even as an option m4t_predict: error: argument --task: invalid choice: 'T2TT' (choose from 'ASR', 'S2ST', 'S2TT')

#452 gloomiebloomie closed 1 week ago
2
Any plans to release finetuning script for Seamless Streaming?

#451 mussakhojayeva opened 1 month ago
0
Train speech language ID classification head

#450 am831 opened 1 month ago
0
Fixed finetuning trainer and enable freezing layers

#449 zrthxn closed 1 month ago
0
M4T V2 --- Do you know how to teach the TEXT_TO_SPEECH ?? Does that mean no one has done it or you don't want to give me a guide? Thankful

#448 developeranalyser opened 1 month ago
0
Language ID Model

#447 zrthxn closed 1 month ago
0
Replace print statements with logger.info()

#446 am831 closed 1 month ago
0
Incorrect output for T2TT between English and Japanese on MPS backend.

#445 pettta opened 1 month ago
1
Sequential Execution of translator.predict() in Multithreaded Environment

#444 lin-xiaosheng opened 1 month ago
0
Enable downloading Gigaspeech

#443 zrthxn closed 1 month ago
0
After downloading the seamless-m4t-large model file, not able to inference from local files.

#442 navjots121 opened 1 month ago
0
Denoise audio with Demucs and pipeline with Transcriber

#441 am831 closed 1 month ago
0
Why the extract_alignment function does not return times?

#440 fishfree opened 2 months ago
0
Update alignment_extractor.py

#439 uralik closed 2 months ago
0
Fine Tune Not work for text to speech ???????????????????????????????

#438 developeranalyser closed 2 months ago
2
Odd results in ASR. Does it have a chat language model and text smoothing?

#437 cageyoko opened 2 months ago
0
Please explain this part of the code to me

#436 developeranalyser closed 2 months ago
0
FineTune on TPU?

#435 developeranalyser closed 2 months ago
0
Train NEW

#434 developeranalyser closed 2 months ago
0
The large version will crash out even on Colab A100. Maybe trying using the medium version.

#433 developeranalyser closed 2 months ago
1
posible text to speech fineTune ???

#431 developeranalyser closed 2 months ago
2