facebookresearch / seamless_communication

Foundational Models for State-of-the-Art Speech and Text Translation
Other
10.73k stars 1.04k forks source link

about tsv file #492

Open sunyclj opened 1 month ago

sunyclj commented 1 month ago

command: streaming_evaluate --task s2st --data-file ../audio-tsv/test.tsv --audio-root-dir ../test_data/audio/ --output ../ --tgt-lang eng --no-scoring 2024-07-11 05:44:48,463 INFO -- seamless_communication.streaming.agents.unity_pipeline: Loading the UnitY model: seamless_streaming_unity on device=cuda:0, dtype=torch.float16 2024-07-11 05:44:56,398 INFO -- seamless_communication.streaming.agents.unity_pipeline: Loading the Monotonic Decoder model: seamless_streaming_monotonic_decoder on device=cuda:0, dtype=torch.float16 2024-07-11 05:45:02,766 INFO -- seamless_communication.streaming.agents.online_vocoder: Loading the Vocoder model: vocoder_v2 on device=cuda:0, dtype=torch.float16 /opt/conda/envs/smlss_server/lib/python3.8/site-packages/torch/nn/utils/weight_norm.py:28: UserWarning: torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm. warnings.warn("torch.nn.utils.weight_norm is deprecated in favor of torch.nn.utils.parametrizations.weight_norm.") 2024-07-11 05:45:05,959 INFO -- simuleval.utils.agent: System will run on device: cuda:0. dtype: fp16 2024-07-11 05:45:05,959 INFO -- simuleval.dataloader: Evaluating from dataloader fairseq2_s2tt. 2024-07-11 05:45:05,963 WARNING -- seamless_communication.streaming.dataloaders.s2tt: Stripping silence in the beginning and the end of audio with SileroVAD. Using cache found in /home/jovyan/.cache/torch/hub/snakers4_silero-vad_master Traceback (most recent call last): File "/opt/conda/envs/smlss_server/bin/streaming_evaluate", line 8, in sys.exit(main()) File "/opt/conda/envs/smlss_server/lib/python3.8/site-packages/seamless_communication/cli/streaming/evaluate.py", line 100, in main evaluate(agent_class, {base_config, model_configs, **eval_configs}, parser) File "/opt/conda/envs/smlss_server/lib/python3.8/site-packages/simuleval/cli.py", line 80, in evaluate evaluator = build_evaluator(args) File "/opt/conda/envs/smlss_server/lib/python3.8/site-packages/simuleval/evaluator/init.py", line 12, in build_evaluator return SentenceLevelEvaluator.from_args(args) File "/opt/conda/envs/smlss_server/lib/python3.8/site-packages/simuleval/evaluator/evaluator.py", line 298, in from_args return cls(dataloader, quality_scorers, latency_scorers, args) File "/opt/conda/envs/smlss_server/lib/python3.8/site-packages/simuleval/evaluator/evaluator.py", line 141, in init self.end_index = len(self.dataloader) ValueError: len() should return >= 0

test.tsv contain "GZX-haitaowang-20240613162529979.wav 0 5118" ../test_data/audio/ contain GZX-haitaowang-20240613162529979.wav ,only one please, what reason?

sunyclj commented 1 month ago

image 已解决