issues
search
facebookresearch
/
seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Other
10.8k
stars
1.05k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix potential issue with downloading Gigaspeech
#458
zrthxn
closed
4 months ago
0
I can't run bash script for generate dataset for fine-tuning m4t model!
#457
amirmfarzane
opened
4 months ago
0
Fix finetuning script
#456
mhlakhani
closed
4 months ago
2
Is there a person in this desert who can answer me?
#454
developeranalyser
opened
4 months ago
4
Documentation for denoising and segmentation
#453
am831
closed
4 months ago
0
text to text not showing even as an option m4t_predict: error: argument --task: invalid choice: 'T2TT' (choose from 'ASR', 'S2ST', 'S2TT')
#452
gloomiebloomie
closed
3 months ago
3
Any plans to release finetuning script for Seamless Streaming?
#451
mussakhojayeva
opened
4 months ago
0
Train speech language ID classification head
#450
am831
opened
4 months ago
0
Fixed finetuning trainer and enable freezing layers
#449
zrthxn
closed
4 months ago
0
M4T V2 --- Do you know how to teach the TEXT_TO_SPEECH ?? Does that mean no one has done it or you don't want to give me a guide? Thankful
#448
developeranalyser
opened
4 months ago
1
Language ID Model
#447
zrthxn
closed
4 months ago
0
Replace print statements with logger.info()
#446
am831
closed
4 months ago
0
Incorrect output for T2TT between English and Japanese on MPS backend.
#445
pettta
opened
4 months ago
1
Sequential Execution of translator.predict() in Multithreaded Environment
#444
lin-xiaosheng
opened
4 months ago
0
Enable downloading Gigaspeech
#443
zrthxn
closed
4 months ago
0
After downloading the seamless-m4t-large model file, not able to inference from local files.
#442
navjots121
opened
4 months ago
0
Denoise audio with Demucs and pipeline with Transcriber
#441
am831
closed
4 months ago
0
Why the extract_alignment function does not return times?
#440
fishfree
opened
5 months ago
0
Update alignment_extractor.py
#439
uralik
closed
5 months ago
0
Fine Tune Not work for text to speech ???????????????????????????????
#438
developeranalyser
closed
5 months ago
2
Odd results in ASR. Does it have a chat language model and text smoothing?
#437
cageyoko
opened
5 months ago
0
Please explain this part of the code to me
#436
developeranalyser
closed
5 months ago
0
FineTune on TPU?
#435
developeranalyser
closed
5 months ago
0
Train NEW
#434
developeranalyser
closed
5 months ago
0
The large version will crash out even on Colab A100. Maybe trying using the medium version.
#433
developeranalyser
closed
5 months ago
1
posible text to speech fineTune ???
#431
developeranalyser
closed
5 months ago
2
Why?
#430
developeranalyser
closed
5 months ago
0
Tensor device mismatch in Transcriber
#429
zrthxn
opened
5 months ago
0
How to segment hate speech downloaded from the Mutox dataset tsv file
#428
dlion168
closed
5 months ago
5
[WIP] test m4t finetuning on gigaspeech
#427
mavlyutovr
opened
5 months ago
0
NotImplementedError: T2U finetuning implemented only for UnitYT2UModel why??!!
#426
developeranalyser
opened
5 months ago
4
.pth file
#425
developeranalyser
closed
5 months ago
0
FineTune Errror TEXT_TO_SPEECH Same lang
#424
developeranalyser
closed
5 months ago
3
use orginal .pth or FineTuned .pth
#423
developeranalyser
closed
5 months ago
0
Error with GPU 40GB !!!!!!!!!
#421
developeranalyser
closed
5 months ago
6
Error with GPU 40G !!!!!!!!!
#420
developeranalyser
closed
5 months ago
1
Could the demo run in windows 11?
#419
EricKong1985
opened
5 months ago
0
Fixed finetuning trainer and enable freezing layers
#418
zrthxn
closed
4 months ago
0
How to use speech to text predit in batch by batch not one audio
#417
amirmfarzane
opened
5 months ago
0
not Fixed in #400
#416
developeranalyser
closed
5 months ago
0
Request for Enterprise Use
#415
ABHIMUCH
opened
5 months ago
0
!!!!!!!!!Bug RuntimeError: expected scalar type Half but found Float $$$$$$$$$$
#414
developeranalyser
closed
5 months ago
9
Indicate the SONAR device in the mutox example and explain the dataset columns
#412
avidale
closed
5 months ago
0
Where can I set the max input length
#410
Longleaves
opened
5 months ago
0
Why always Downloading the tokenizer of seamlessM4T_v2_large
#409
Longleaves
opened
5 months ago
7
فروشگاهپرشینفیلتر
#408
Persianfilters
closed
5 months ago
0
Segment audio with Silero VAD and pipeline with Transcriber
#406
am831
closed
4 months ago
0
Translated from Chinese to English, nitrogen converts hydrogen gas, oxygen converts oxidation, so terrible
#405
zouhuigang
opened
6 months ago
0
artifact model seamless expression
#404
Webkamsky
opened
6 months ago
0
unknown result in t2tt
#403
asulada
opened
6 months ago
3
Previous
Next