deezer / spleeter

Deezer source separation library including pretrained models.
https://research.deezer.com/projects/spleeter.html
MIT License

[Bug] Each training period: ConcatOp: Dimension of inputs should match #515

Open seoulan opened 3 years ago

seoulan commented 3 years ago

Description

I'm new to deep learning. I prepared a training set for 2stems; its sample rate is 8 kHz and the files are in WAV format. During each training period, the console shows the error ConcatOp: Dimension of inputs should match: shape[0]=[,,2] vs. shape[1]=[,,1]. Details follow:
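For readers unfamiliar with this message: a concatenation requires all inputs to have the same size on every axis except the one being joined. The sketch below restates that rule in plain Python using the shapes from the log. `concat_mismatches` is a hypothetical helper for illustration, not part of spleeter or TensorFlow, and reading the last axis as the channel count is an assumption.

```python
from typing import List, Sequence


def concat_mismatches(shapes: Sequence[Sequence[int]], axis: int) -> List[int]:
    """Return the axes (other than `axis`) on which the shapes disagree."""
    first = shapes[0]
    bad = set()
    for shape in shapes[1:]:
        if len(shape) != len(first):
            raise ValueError("all inputs must have the same rank")
        for i, (a, b) in enumerate(zip(first, shape)):
            # Only the axis being joined is allowed to differ.
            if i != axis and a != b:
                bad.add(i)
    return sorted(bad)


# Shapes copied from the error message; the last axis is assumed
# to be the number of audio channels (2 = stereo, 1 = mono).
stereo = (858, 1024, 2)
mono = (858, 1024, 1)

print(concat_mismatches([stereo, mono], axis=0))    # [2]
print(concat_mismatches([stereo, stereo], axis=0))  # []
```

Under that reading, the mismatch on the last axis would mean one input carries two channels and the other only one.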

Step to reproduce

  1. spleeter train --verbose -p <path of json> <path of audio training set>

Output

gs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\vocals.wav' from 11.260000000000002 to 31.26
INFO:spleeter:Loading audio b'K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\vocals.wav' from 11.260000000000002 to 31.26
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\vocals.wav') > could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:00.500000', t='0:00:20.000000') <152313499f09 could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\vocals.wav') <175cf1ae1684>> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\vocals.wav') <175cf1ae1684>> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:11.260000', t='0:00:20.000000') <f4673ba3c59a could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:00.500000', t='0:00:20.000000') <94e14c0fda78 could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\vocals.wav') > could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:11.260000', t='0:00:20.000000') <1b2581ce8eaa could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
INFO:spleeter:Audio data loaded successfully
INFO:spleeter:Audio data loaded successfully
INFO:spleeter:Loading audio b'K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\other.wav' from 0.5 to 20.5
INFO:spleeter:Audio data loaded successfully
INFO:spleeter:Audio data loaded successfully
INFO:spleeter:Loading audio b'K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\other.wav' from 0.5 to 20.5
INFO:spleeter:Loading audio b'K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\other.wav' from 11.260000000000002 to 31.26
INFO:spleeter:Loading audio b'K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\other.wav' from 11.260000000000002 to 31.26
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\other.wav') <7b99e0a559c3>> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:00.500000', t='0:00:20.000000') <103881ea98f4 could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\other.wav') <937f23ce070b>> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice2\other.wav') <937f23ce070b>> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:11.260000', t='0:00:20.000000') <18ef7ee44f04 could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:00.500000', t='0:00:20.000000') <18fdbe3dd88a could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
INFO:spleeter:Audio data loaded successfully
WARNING:tensorflow:Entity <bound method Node.stream of input(filename='K:\Anaconda3\Lib\site-packages\spleeter\configs\atmb\train\118.80T_1001_0015_Cessna_CockpitVoice1\other.wav') <7b99e0a559c3>> could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
WARNING:tensorflow:Entity <bound method Node.stream of output(ar=44100, filename='pipe:', format='f32le', ss='0:00:11.260000', t='0:00:20.000000') <180187c01b07 could not be transformed and will be executed as-is. Please report this to the AutoGraph team. When filing the bug, set the verbosity to 10 (on Linux, export AUTOGRAPH_VERBOSITY=10) and attach the full output. Cause: Mangled names are not yet supported by AutoGraph
INFO:spleeter:Audio data loaded successfully
INFO:spleeter:Audio data loaded successfully
INFO:spleeter:Audio data loaded successfully
Traceback (most recent call last):
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 1365, in _do_call
    return fn(*args)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 1350, in _run_fn
    target_list, run_metadata)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 1443, in _call_tf_sessionrun
    run_metadata)
tensorflow.python.framework.errors_impl.InvalidArgumentError: {{function_node __inference_Dataset_map_DatasetBuilder.random_time_crop_1250}} ConcatOp : Dimensions of inputs should match: shape[0] = [858,1024,2] vs. shape[1] = [858,1024,1]
     [[{{node concat}}]]
     [[IteratorGetNext]]

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "K:\Anaconda3\Scripts\spleeter-script.py", line 10, in
    sys.exit(entrypoint())
  File "K:\Anaconda3\lib\site-packages\spleeter\__main__.py", line 54, in entrypoint
    main(sys.argv)
  File "K:\Anaconda3\lib\site-packages\spleeter\__main__.py", line 46, in main
    entrypoint(arguments, params)
  File "K:\Anaconda3\lib\site-packages\spleeter\commands\train.py", line 98, in entrypoint
    evaluation_spec)
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\training.py", line 473, in train_and_evaluate
    return executor.run()
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\training.py", line 613, in run
    return self.run_local()
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\training.py", line 714, in run_local
    saving_listeners=saving_listeners)
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 370, in train
    loss = self._train_model(input_fn, hooks, saving_listeners)
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1161, in _train_model
    return self._train_model_default(input_fn, hooks, saving_listeners)
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1195, in _train_model_default
    saving_listeners)
  File "K:\Anaconda3\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1494, in _train_with_estimator_spec
    _, loss = mon_sess.run([estimator_spec.train_op, estimator_spec.loss])
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 754, in run
    run_metadata=run_metadata)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1259, in run
    run_metadata=run_metadata)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1360, in run
    raise six.reraise(*original_exc_info)
  File "K:\Anaconda3\lib\site-packages\six.py", line 693, in reraise
    raise value
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1345, in run
    return self._sess.run(*args, **kwargs)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1418, in run
    run_metadata=run_metadata)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1176, in run
    return self._sess.run(*args, **kwargs)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 956, in run
    run_metadata_ptr)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 1180, in _run
    feed_dict_tensor, options, run_metadata)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 1359, in _do_run
    run_metadata)
  File "K:\Anaconda3\lib\site-packages\tensorflow_core\python\client\session.py", line 1384, in _do_call
    raise type(e)(node_def, op, message)
tensorflow.python.framework.errors_impl.InvalidArgumentError: ConcatOp : Dimensions of inputs should match: shape[0] = [858,1024,2] vs. shape[1] = [858,1024,1]
     [[node concat (defined at \lib\site-packages\tensorflow_core\python\framework\ops.py:1748) ]]
     [[IteratorGetNext]]

Original stack trace for 'concat':
  File "\Scripts\spleeter-script.py", line 10, in
    sys.exit(entrypoint())
  File "\lib\site-packages\spleeter\__main__.py", line 54, in entrypoint
    main(sys.argv)
  File "\lib\site-packages\spleeter\__main__.py", line 46, in main
    entrypoint(arguments, params)
  File "\lib\site-packages\spleeter\commands\train.py", line 98, in entrypoint
    evaluation_spec)
  File "\lib\site-packages\tensorflow_estimator\python\estimator\training.py", line 473, in train_and_evaluate
    return executor.run()
  File "\lib\site-packages\tensorflow_estimator\python\estimator\training.py", line 613, in run
    return self.run_local()
  File "\lib\site-packages\tensorflow_estimator\python\estimator\training.py", line 714, in run_local
    saving_listeners=saving_listeners)
  File "\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 370, in train
    loss = self._train_model(input_fn, hooks, saving_listeners)
  File "\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1161, in _train_model
    return self._train_model_default(input_fn, hooks, saving_listeners)
  File "\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1195, in _train_model_default
    saving_listeners)
  File "\lib\site-packages\tensorflow_estimator\python\estimator\estimator.py", line 1490, in _train_with_estimator_spec
    log_step_count_steps=log_step_count_steps) as mon_sess:
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 584, in MonitoredTrainingSession
    stop_grace_period_secs=stop_grace_period_secs)
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1014, in __init__
    stop_grace_period_secs=stop_grace_period_secs)
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 725, in __init__
    self._sess = _RecoverableSession(self._coordinated_creator)
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1207, in __init__
    _WrappedSession.__init__(self, self._create_session())
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 1212, in _create_session
    return self._sess_creator.create_session()
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 878, in create_session
    self.tf_sess = self._session_creator.create_session()
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 638, in create_session
    self._scaffold.finalize()
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 205, in finalize
    default_ready_op)
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 287, in get_or_default
    op = default_constructor()
  File "\lib\site-packages\tensorflow_core\python\training\monitored_session.py", line 201, in default_ready_op
    ], 0)
  File "\lib\site-packages\tensorflow_core\python\util\dispatch.py", line 180, in wrapper
    return target(*args, **kwargs)
  File "\lib\site-packages\tensorflow_core\python\ops\array_ops.py", line 1420, in concat
    return gen_array_ops.concat_v2(values=values, axis=axis, name=name)
  File "\lib\site-packages\tensorflow_core\python\ops\gen_array_ops.py", line 1256, in concat_v2
    "ConcatV2", values=values, axis=axis, name=name)
  File "\lib\site-packages\tensorflow_core\python\framework\op_def_library.py", line 794, in _apply_op_helper
    op_def=op_def)
  File "\lib\site-packages\tensorflow_core\python\util\deprecation.py", line 507, in new_func
    return func(*args, **kwargs)
  File "\lib\site-packages\tensorflow_core\python\framework\ops.py", line 3357, in create_op
    attrs, op_def, compute_device)
  File "\lib\site-packages\tensorflow_core\python\framework\ops.py", line 3426, in _create_op_internal
    op_def=op_def)
  File "\lib\site-packages\tensorflow_core\python\framework\ops.py", line 1748, in __init__
    self._traceback = tf_stack.extract_stack()

Environment

| OS | Windows 7 |
| Installation type | Conda |
| Hardware spec | CPU |
| spleeter | 1.5.3 |
| tensorflow | 1.15.0 |

Additional context

BILLXZY1215 commented 8 months ago

Same issue here. Any workaround?

BILLXZY1215 commented 8 months ago

Same issue here. Any workaround?

Found the reason: in my case it was the data preprocessing. All audio files should be kept in stereo rather than mono.
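A minimal sketch of how one might check for this before training, assuming the stems are uncompressed PCM WAV files that Python's standard `wave` module can read. The helper names `channel_count` and `mono_to_stereo` are hypothetical and not part of spleeter; the conversion simply duplicates the single mono channel into left and right.

```python
import wave


def channel_count(path: str) -> int:
    """Return the number of channels in a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        return wav.getnchannels()


def mono_to_stereo(src: str, dst: str) -> None:
    """Write a stereo copy of a mono PCM WAV file by duplicating each sample."""
    with wave.open(src, "rb") as wav:
        if wav.getnchannels() != 1:
            raise ValueError(f"{src} is not mono")
        width = wav.getsampwidth()
        rate = wav.getframerate()
        frames = wav.readframes(wav.getnframes())

    # Interleave left/right: each mono sample (`width` bytes) is written twice.
    stereo = bytearray()
    for i in range(0, len(frames), width):
        sample = frames[i:i + width]
        stereo += sample + sample

    with wave.open(dst, "wb") as out:
        out.setnchannels(2)
        out.setsampwidth(width)
        out.setframerate(rate)
        out.writeframes(bytes(stereo))
```

One could, for instance, call `channel_count` on every `vocals.wav` and `other.wav` in the training set and convert any file that reports a single channel before starting `spleeter train`.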