facebookresearch seamless_communication issues

facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation

Other

10.8k stars 1.05k forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Fix potential issue with downloading Gigaspeech

#458 zrthxn closed 4 months ago
0
I can't run bash script for generate dataset for fine-tuning m4t model!

#457 amirmfarzane opened 4 months ago
0
Fix finetuning script

#456 mhlakhani closed 4 months ago
2
Is there a person in this desert who can answer me?

#454 developeranalyser opened 4 months ago
4
Documentation for denoising and segmentation

#453 am831 closed 4 months ago
0
text to text not showing even as an option m4t_predict: error: argument --task: invalid choice: 'T2TT' (choose from 'ASR', 'S2ST', 'S2TT')

#452 gloomiebloomie closed 3 months ago
3
Any plans to release finetuning script for Seamless Streaming?

#451 mussakhojayeva opened 4 months ago
0
Train speech language ID classification head

#450 am831 opened 4 months ago
0
Fixed finetuning trainer and enable freezing layers

#449 zrthxn closed 4 months ago
0
M4T V2 --- Do you know how to teach the TEXT_TO_SPEECH ?? Does that mean no one has done it or you don't want to give me a guide? Thankful

#448 developeranalyser opened 4 months ago
1
Language ID Model

#447 zrthxn closed 4 months ago
0
Replace print statements with logger.info()

#446 am831 closed 4 months ago
0
Incorrect output for T2TT between English and Japanese on MPS backend.

#445 pettta opened 4 months ago
1
Sequential Execution of translator.predict() in Multithreaded Environment

#444 lin-xiaosheng opened 4 months ago
0
Enable downloading Gigaspeech

#443 zrthxn closed 4 months ago
0
After downloading the seamless-m4t-large model file, not able to inference from local files.

#442 navjots121 opened 4 months ago
0
Denoise audio with Demucs and pipeline with Transcriber

#441 am831 closed 4 months ago
0
Why the extract_alignment function does not return times?

#440 fishfree opened 5 months ago
0
Update alignment_extractor.py

#439 uralik closed 5 months ago
0
Fine Tune Not work for text to speech ???????????????????????????????

#438 developeranalyser closed 5 months ago
2
Odd results in ASR. Does it have a chat language model and text smoothing?

#437 cageyoko opened 5 months ago
0
Please explain this part of the code to me

#436 developeranalyser closed 5 months ago
0
FineTune on TPU?

#435 developeranalyser closed 5 months ago
0
Train NEW

#434 developeranalyser closed 5 months ago
0
The large version will crash out even on Colab A100. Maybe trying using the medium version.

#433 developeranalyser closed 5 months ago
1
posible text to speech fineTune ???

#431 developeranalyser closed 5 months ago
2
Why?

#430 developeranalyser closed 5 months ago
0
Tensor device mismatch in Transcriber

#429 zrthxn opened 5 months ago
0
How to segment hate speech downloaded from the Mutox dataset tsv file

#428 dlion168 closed 5 months ago
5
[WIP] test m4t finetuning on gigaspeech

#427 mavlyutovr opened 5 months ago
0
NotImplementedError: T2U finetuning implemented only for UnitYT2UModel why??!!

#426 developeranalyser opened 5 months ago
4
.pth file

#425 developeranalyser closed 5 months ago
0
FineTune Errror TEXT_TO_SPEECH Same lang

#424 developeranalyser closed 5 months ago
3
use orginal .pth or FineTuned .pth

#423 developeranalyser closed 5 months ago
0
Error with GPU 40GB !!!!!!!!!

#421 developeranalyser closed 5 months ago
6
Error with GPU 40G !!!!!!!!!

#420 developeranalyser closed 5 months ago
1
Could the demo run in windows 11?

#419 EricKong1985 opened 5 months ago
0
Fixed finetuning trainer and enable freezing layers

#418 zrthxn closed 4 months ago
0
How to use speech to text predit in batch by batch not one audio

#417 amirmfarzane opened 5 months ago
0
not Fixed in #400

#416 developeranalyser closed 5 months ago
0
Request for Enterprise Use

#415 ABHIMUCH opened 5 months ago
0
!!!!!!!!!Bug RuntimeError: expected scalar type Half but found Float $$$$$$$$$$

#414 developeranalyser closed 5 months ago
9
Indicate the SONAR device in the mutox example and explain the dataset columns

#412 avidale closed 5 months ago
0
Where can I set the max input length

#410 Longleaves opened 5 months ago
0
Why always Downloading the tokenizer of seamlessM4T_v2_large

#409 Longleaves opened 5 months ago
7
فروشگاه‌پرشین‌فیلتر

#408 Persianfilters closed 5 months ago
0
Segment audio with Silero VAD and pipeline with Transcriber

#406 am831 closed 4 months ago
0
Translated from Chinese to English, nitrogen converts hydrogen gas, oxygen converts oxidation, so terrible

#405 zouhuigang opened 6 months ago
0
artifact model seamless expression

#404 Webkamsky opened 6 months ago
0
unknown result in t2tt

#403 asulada opened 6 months ago
3

Previous Next