Describe the bug
An error is raised during automatic hyperparameter tuning with ray.tune:
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Your run script
2024-02-05 01:09:58,835 WARNING utils.py:575 -- Detecting docker specified CPUs. In previous versions of Ray, CPU detection in containers was incorrect. Please ensure that Ray has enough CPUs allocated. As a temporary workaround to revert to the prior behavior, set RAY_USE_MULTIPROCESSING_CPU_COUNT=1 as an env var before starting Ray. Set the env var: RAY_DISABLE_DOCKER_CPU_WARNING=1 to mute this warning.
2024-02-05 01:09:59,011 INFO worker.py:1724 -- Started a local Ray instance.
2024-02-05 01:09:59,567 INFO tune.py:592 -- [output] This will use the new output engine with verbosity 2. To disable the new output and use the legacy output engine, set the environment variable RAY_AIR_NEW_OUTPUT=0. For more information, please see https://github.com/ray-project/ray/issues/36949
╭─────────────────────────────────────────────────────────────────────────╮
│ Configuration for experiment     objective_function_2024-02-05_01-09-59 │
├─────────────────────────────────────────────────────────────────────────┤
│ Search algorithm                 BasicVariantGenerator                  │
│ Scheduler                        AsyncHyperBandScheduler                │
│ Number of trials                 5                                      │
╰─────────────────────────────────────────────────────────────────────────╯
View detailed results here: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59
To visualize your results with TensorBoard, run: tensorboard --logdir /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59
Trial objective_function_3a46e_00000 started with configuration:
╭───────────────────────────────────────────────╮
│ Trial objective_function_3a46e_00000 config   │
├───────────────────────────────────────────────┤
│ attn_dropout_prob       0.2                   │
│ epochs                  3                     │
│ hidden_dropout_prob     0.5                   │
│ learning_rate           1e-05                 │
│ train_batch_size        16                    │
╰───────────────────────────────────────────────╯
2024-02-05 01:10:03,575 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00000
Traceback (most recent call last):
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
result = ray.get(future)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
return fn(*args, kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
return func(*args, *kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5180, ip=172.17.0.10, actor_id=59ef74a8e98fa23d2915878e01000000, repr=objective_function)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
raise skipped from exception_cause(skipped)
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
self._ret = self._target(self._args, self._kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00000 errored after 0 iterations at 2024-02-05 01:10:03. Total running time: 3s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00000_0_attn_dropout_prob=0.2000,epochs=3,hidden_dropout_prob=0.5000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00001 started with configuration:
╭───────────────────────────────────────────────╮
│ Trial objective_function_3a46e_00001 config   │
├───────────────────────────────────────────────┤
│ attn_dropout_prob       0.3                   │
│ epochs                  8                     │
│ hidden_dropout_prob     0.6                   │
│ learning_rate           1e-05                 │
│ train_batch_size        32                    │
╰───────────────────────────────────────────────╯
2024-02-05 01:10:07,907 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00001
Traceback (most recent call last):
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
result = ray.get(future)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
return fn(*args, kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
return func(*args, *kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5288, ip=172.17.0.10, actor_id=93914fdfe83c79965f2215bd01000000, repr=objective_function)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
raise skipped from exception_cause(skipped)
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
self._ret = self._target(self._args, self._kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00001 errored after 0 iterations at 2024-02-05 01:10:07. Total running time: 8s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00001_1_attn_dropout_prob=0.3000,epochs=8,hidden_dropout_prob=0.6000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00002 started with configuration:
╭───────────────────────────────────────────────╮
│ Trial objective_function_3a46e_00002 config   │
├───────────────────────────────────────────────┤
│ attn_dropout_prob       0.3                   │
│ epochs                  3                     │
│ hidden_dropout_prob     0.7                   │
│ learning_rate           1e-05                 │
│ train_batch_size        128                   │
╰───────────────────────────────────────────────╯
2024-02-05 01:10:11,884 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00002
Traceback (most recent call last):
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
result = ray.get(future)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
return fn(*args, kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
return func(*args, *kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5407, ip=172.17.0.10, actor_id=5ddebdbf0d35ad2b970d4e7a01000000, repr=objective_function)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
raise skipped from exception_cause(skipped)
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
self._ret = self._target(self._args, self._kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00002 errored after 0 iterations at 2024-02-05 01:10:11. Total running time: 12s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00002_2_attn_dropout_prob=0.3000,epochs=3,hidden_dropout_prob=0.7000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00003 started with configuration:
╭───────────────────────────────────────────────╮
│ Trial objective_function_3a46e_00003 config   │
├───────────────────────────────────────────────┤
│ attn_dropout_prob       0.4                   │
│ epochs                  6                     │
│ hidden_dropout_prob     0.8                   │
│ learning_rate           0.001                 │
│ train_batch_size        32                    │
╰───────────────────────────────────────────────╯
2024-02-05 01:10:15,897 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00003
Traceback (most recent call last):
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
result = ray.get(future)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
return fn(*args, kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
return func(*args, *kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5511, ip=172.17.0.10, actor_id=87da750c3700a373057fafee01000000, repr=objective_function)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
raise skipped from exception_cause(skipped)
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
self._ret = self._target(self._args, self._kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00003 errored after 0 iterations at 2024-02-05 01:10:15. Total running time: 16s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00003_3_attn_dropout_prob=0.4000,epochs=6,hidden_dropout_prob=0.8000,learning_rate=0.0010,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00004 started with configuration:
╭───────────────────────────────────────────────╮
│ Trial objective_function_3a46e_00004 config   │
├───────────────────────────────────────────────┤
│ attn_dropout_prob       0.3                   │
│ epochs                  5                     │
│ hidden_dropout_prob     0.8                   │
│ learning_rate           1e-05                 │
│ train_batch_size        16                    │
╰───────────────────────────────────────────────╯
2024-02-05 01:10:19,944 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00004
Traceback (most recent call last):
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
result = ray.get(future)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
return fn(*args, kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
return func(*args, *kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5615, ip=172.17.0.10, actor_id=6e5617af8c0396db6af5aa0101000000, repr=objective_function)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
raise skipped from exception_cause(skipped)
File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
self._ret = self._target(self._args, self._kwargs)
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00004 errored after 0 iterations at 2024-02-05 01:10:19. Total running time: 20s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00004_4_attn_dropout_prob=0.3000,epochs=5,hidden_dropout_prob=0.8000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Traceback (most recent call last):
File "run_hyper.py", line 127, in
ray_tune(args)
File "run_hyper.py", line 94, in ray_tune
result = tune.run(
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/tune.py", line 1036, in run
raise TuneError("Trials did not complete", incomplete_trials)
ray.tune.error.TuneError: ('Trials did not complete', [objective_function_3a46e_00000, objective_function_3a46e_00001, objective_function_3a46e_00002, objective_function_3a46e_00003, objective_function_3a46e_00004])
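Reading the traceback, `objective_function` builds `Config` from `config_dict` and `config_file_list` only, so the model name presumably has to be present in one of those two sources for each trial. The sketch below is a simplified stand-in for that kind of lookup, not RecBole's actual implementation; `resolve_model`, `trial_params`, and `yaml_settings` are hypothetical names used only for illustration.

```python
from typing import Any, Dict, Optional


def resolve_model(
    model: Optional[str],
    file_config: Dict[str, Any],
    dict_config: Dict[str, Any],
) -> str:
    """Return the model name from the first source that provides one."""
    if model is not None:
        return model
    for source in (dict_config, file_config):
        if "model" in source:
            return source["model"]
    # Same message as in the report, raised when no source names a model.
    raise KeyError(
        "model need to be specified in at least one of the these ways: "
        "[model variable, config file, config dict, command line] "
    )


# Settings as a trial might see them: sampled hyperparameters plus YAML
# contents, neither of which carries a "model" key in this report.
trial_params = {"learning_rate": 1e-05, "epochs": 3, "train_batch_size": 16}
yaml_settings = {"n_layers": 2, "n_heads": 4, "hidden_size": 256}

try:
    resolve_model(None, yaml_settings, trial_params)
    failed = False
except KeyError:
    failed = True

# Supplying the name in either dictionary satisfies the check.
resolved = resolve_model(None, {**yaml_settings, "model": "BERTRec"}, trial_params)
```

If this reading is right, adding `model` (and likewise `dataset`) to the YAML file or to the dictionary passed to each trial may be enough to get past the check.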
How to reproduce
Steps to reproduce this bug:
1. YAML config file
# dataset config
field_separator: "\t" # field separator of the dataset
seq_separator: " " # separator inside token_seq or float_seq fields
USER_ID_FIELD: user_id # user id field
ITEM_ID_FIELD: item_id # item id field
RATING_FIELD: rating # rating field (binary: purchased or not)
TIME_FIELD: time # time field
NEGPREFIX: neg # negative-sampling prefix
LABEL_FIELD: type # label field
ITEM_LIST_LENGTH_FIELD: item_length # sequence length field
LIST_SUFFIX: _list # suffix of sequence fields
MAX_ITEM_LIST_LENGTH: 50 # maximum sequence length
POSITION_FIELD: [V29, V11, V25] # generated sequence position ids
# Which columns to read from which file: here user_id, item_id, type, timestamp, flag are read from .inter, and so on for the rest
load_col:
  inter: [user_id, item_id, time, type]
  user: [user_id, V29, V11, V25]
selected_features: [V29, V11, V25]

# training settings
epochs: 6 # maximum number of training epochs
train_batch_size: 32 # training batch size
learner: adam # PyTorch built-in optimizer to use
learning_rate: 0.001 # learning rate
training_neg_sample_args: ~ # number of negative samples
eval_step: 1 # number of evaluations after each training round
stopping_step: 10 # early stopping: stop if the chosen metric shows no real change within this many steps

# BERT parameters
n_layers: 2 # (int) The number of transformer layers in transformer encoder.###
n_heads: 4 # (int) The number of attention heads for multi-head attention layer.##
hidden_size: 256 # (int) The number of features in the hidden state.###
inner_size: 256 # (int) The inner hidden size in feed-forward layer.
hidden_dropout_prob: 0.5 # (float) The probability of an element to be zeroed.
attn_dropout_prob: 0.5 # (float) The probability of an attention score to be zeroed.
hidden_act: 'gelu' # (str) The activation function in feed-forward layer.
layer_norm_eps: 1e-12 # (float) A value added to the denominator for numerical stability.
initializer_range: 0.02 # (float) The standard deviation for normal initialization.
mask_ratio: 0.2 # (float) The probability for a item replaced by MASK token.
loss_type: 'CE' # (str) The type of loss function.
transform: mask_itemseq # (str) The transform operation for batch data process.
ft_ratio: 0.5 # (float) The probability of generating fine-tuning samples

# evaluation settings
eval_setting: TO_LS,full # sort the data by time, split with leave-one-out, and use full ranking
eval_args:
  split: {'LS': 'valid_and_test'} # split ratio
  mode: full
  order: TO
metrics: ["Recall", "MRR","NDCG","Hit","Precision"] # evaluation metrics
topk: [1,5,10] # top-k used by the metrics; with 10 the metrics are ["Recall@10", "MRR@10", "NDCG@10", "Hit@10", "Precision@10"]
valid_metric: MRR@10 # metric used as the early-stopping criterion
eval_batch_size: 256 # evaluation batch size
2. .test file
learning_rate choice [0.001,0.0001,0.00001]
epochs choice [3,4,5,6,7,8]
train_batch_size choice [16,32,64,128]
hidden_dropout_prob choice [0.2,0.3,0.4,0.5,0.6,0.7,0.8]
attn_dropout_prob choice [0.2,0.3,0.4,0.5,0.6,0.7,0.8]
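To make the search space in the .test file concrete, here is a minimal sketch that reads lines of the form `<name> choice [values]` into a dictionary. `parse_params` is a hypothetical helper written for this illustration and is not how `run_hyper.py` itself loads the file; it only handles the `choice` type used above.

```python
import ast
from typing import Any, Dict, List


def parse_params(text: str) -> Dict[str, List[Any]]:
    """Parse lines of the form '<name> choice [v1,v2,...]' into a dict."""
    space: Dict[str, List[Any]] = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Split into parameter name, sampling type, and the bracketed list.
        name, kind, values = line.split(maxsplit=2)
        if kind != "choice":
            raise ValueError(f"unsupported sampling type: {kind}")
        space[name] = list(ast.literal_eval(values))
    return space


params_text = """\
learning_rate choice [0.001,0.0001,0.00001]
epochs choice [3,4,5,6,7,8]
train_batch_size choice [16,32,64,128]
hidden_dropout_prob choice [0.2,0.3,0.4,0.5,0.6,0.7,0.8]
attn_dropout_prob choice [0.2,0.3,0.4,0.5,0.6,0.7,0.8]
"""

search_space = parse_params(params_text)
```

Note that none of the five parameters is `model`, which matches the trial configurations printed in the log: each trial receives only these sampled values.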
Your code
python run_hyper.py --model=BERTRec --dataset=use --config_files=bert_test.yaml --params_file=bert_test.test --tool=Ray
Trial status: 5 PENDING
Current time: 2024-02-05 01:09:59. Total running time: 0s
Logical resource usage: 0/12 CPUs, 0/1 GPUs (0.0/1.0 accelerator_type:G)
╭──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ Trial name                      status   learning_rate  epochs  train_batch_size  hidden_dropout_prob  attn_dropout_prob │
├──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────┤
│ objective_function_3a46e_00000  PENDING  1e-05          3       16                0.5                  0.2               │
│ objective_function_3a46e_00001  PENDING  1e-05          8       32                0.6                  0.3               │
│ objective_function_3a46e_00002  PENDING  1e-05          3       128               0.7                  0.3               │
│ objective_function_3a46e_00003  PENDING  0.001          6       32                0.8                  0.4               │
│ objective_function_3a46e_00004  PENDING  1e-05          5       16                0.8                  0.3               │
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
Trial objective_function_3a46e_00000 started with configuration: โญโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฎ โ Trial objective_function_3a46e_00000 config โ โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค โ attn_dropout_prob 0.2 โ โ epochs 3 โ โ hidden_dropout_prob 0.5 โ โ learning_rate 1e-05 โ โ train_batch_size 16 โ โฐโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฏ 2024-02-05 01:10:03,575 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00000 Traceback (most recent call last): File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future result = ray.get(future) File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper return fn(*args, kwargs) File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper return func(*args, *kwargs) File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get raise value.as_instanceof_cause() ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5180, ip=172.17.0.10, actor_id=59ef74a8e98fa23d2915878e01000000, repr=objective_function) File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train raise skipped from exception_cause(skipped) File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run self._ret = self._target(self._args, self._kwargs) File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00000 errored after 0 iterations at 2024-02-05 01:10:03. Total running time: 3s Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00000_0_attn_dropout_prob=0.2000,epochs=3,hidden_dropout_prob=0.5000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00001 started with configuration: โญโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฎ โ Trial objective_function_3a46e_00001 config โ โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค โ attn_dropout_prob 0.3 โ โ epochs 8 โ โ hidden_dropout_prob 0.6 โ โ learning_rate 1e-05 โ โ train_batch_size 32 โ โฐโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโฏ 2024-02-05 01:10:07,907 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00001 Traceback (most recent call last): File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future result = ray.get(future) File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper return fn(*args, kwargs) File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper return func(*args, *kwargs) File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get raise value.as_instanceof_cause() ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5288, ip=172.17.0.10, actor_id=93914fdfe83c79965f2215bd01000000, repr=objective_function) File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train raise skipped from exception_cause(skipped) File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run self._ret = self._target(self._args, self._kwargs) File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in
training_func=lambda: self._trainable_func(self.config),
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
output = fn()
File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
return trainable(config, **fn_kwargs)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
config = Config(config_dict=config_dict, config_file_list=config_file_list)
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in init
self.model, self.model_class, self.dataset = self._get_model_and_dataset(
File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00001 errored after 0 iterations at 2024-02-05 01:10:07. Total running time: 8s Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00001_1_attn_dropout_prob=0.3000,epochs=8,hidden_dropout_prob=0.6000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00002 started with configuration:
Trial objective_function_3a46e_00002 config
  attn_dropout_prob      0.3
  epochs                 3
  hidden_dropout_prob    0.7
  learning_rate          1e-05
  train_batch_size       128
2024-02-05 01:10:11,884 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00002
Traceback (most recent call last):
  File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
    result = ray.get(future)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
    return fn(*args, **kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
    return func(*args, **kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
    raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5407, ip=172.17.0.10, actor_id=5ddebdbf0d35ad2b970d4e7a01000000, repr=objective_function)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
    raise skipped from exception_cause(skipped)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
    self._ret = self._target(*self._args, **self._kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in <lambda>
    training_func=lambda: self._trainable_func(self.config),
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
    output = fn()
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
    return trainable(config, **fn_kwargs)
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
    config = Config(config_dict=config_dict, config_file_list=config_file_list)
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in __init__
    self.model, self.model_class, self.dataset = self._get_model_and_dataset(
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
    raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00002 errored after 0 iterations at 2024-02-05 01:10:11. Total running time: 12s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00002_2_attn_dropout_prob=0.3000,epochs=3,hidden_dropout_prob=0.7000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00003 started with configuration:
Trial objective_function_3a46e_00003 config
  attn_dropout_prob      0.4
  epochs                 6
  hidden_dropout_prob    0.8
  learning_rate          0.001
  train_batch_size       32
2024-02-05 01:10:15,897 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00003
Traceback (most recent call last):
  File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
    result = ray.get(future)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
    return fn(*args, **kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
    return func(*args, **kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
    raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5511, ip=172.17.0.10, actor_id=87da750c3700a373057fafee01000000, repr=objective_function)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
    raise skipped from exception_cause(skipped)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
    self._ret = self._target(*self._args, **self._kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in <lambda>
    training_func=lambda: self._trainable_func(self.config),
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
    output = fn()
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
    return trainable(config, **fn_kwargs)
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
    config = Config(config_dict=config_dict, config_file_list=config_file_list)
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in __init__
    self.model, self.model_class, self.dataset = self._get_model_and_dataset(
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
    raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00003 errored after 0 iterations at 2024-02-05 01:10:15. Total running time: 16s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00003_3_attn_dropout_prob=0.4000,epochs=6,hidden_dropout_prob=0.8000,learning_rate=0.0010,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial objective_function_3a46e_00004 started with configuration:
Trial objective_function_3a46e_00004 config
  attn_dropout_prob      0.3
  epochs                 5
  hidden_dropout_prob    0.8
  learning_rate          1e-05
  train_batch_size       16
2024-02-05 01:10:19,944 ERROR tune_controller.py:1374 -- Trial task failed for trial objective_function_3a46e_00004
Traceback (most recent call last):
  File "/root/miniconda3/lib/python3.8/site-packages/ray/air/execution/_internal/event_manager.py", line 110, in resolve_future
    result = ray.get(future)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/auto_init_hook.py", line 22, in auto_init_wrapper
    return fn(*args, **kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/client_mode_hook.py", line 103, in wrapper
    return func(*args, **kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/_private/worker.py", line 2624, in get
    raise value.as_instanceof_cause()
ray.exceptions.RayTaskError(KeyError): ray::ImplicitFunc.train() (pid=5615, ip=172.17.0.10, actor_id=6e5617af8c0396db6af5aa0101000000, repr=objective_function)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/trainable.py", line 342, in train
    raise skipped from exception_cause(skipped)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/air/_internal/util.py", line 88, in run
    self._ret = self._target(*self._args, **self._kwargs)
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 115, in <lambda>
    training_func=lambda: self._trainable_func(self.config),
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/function_trainable.py", line 332, in _trainable_func
    output = fn()
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/trainable/util.py", line 138, in inner
    return trainable(config, **fn_kwargs)
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/quick_start/quick_start.py", line 205, in objective_function
    config = Config(config_dict=config_dict, config_file_list=config_file_list)
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 88, in __init__
    self.model, self.model_class, self.dataset = self._get_model_and_dataset(
  File "/root/autodl-tmp/zzzzzz/RecBole-1.2.0/recbole/config/configurator.py", line 207, in _get_model_and_dataset
    raise KeyError(
KeyError: 'model need to be specified in at least one of the these ways: [model variable, config file, config dict, command line] '
Trial objective_function_3a46e_00004 errored after 0 iterations at 2024-02-05 01:10:19. Total running time: 20s
Error file: /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00004_4_attn_dropout_prob=0.3000,epochs=5,hidden_dropout_prob=0.8000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Trial status: 5 ERROR
Current time: 2024-02-05 01:10:19. Total running time: 20s
Logical resource usage: 0/12 CPUs, 1.0/1 GPUs (0.0/1.0 accelerator_type:G)
Trial name                       status   learning_rate   epochs   train_batch_size   hidden_dropout_prob   attn_dropout_prob
objective_function_3a46e_00000   ERROR    1e-05           3        16                 0.5                   0.2
objective_function_3a46e_00001   ERROR    1e-05           8        32                 0.6                   0.3
objective_function_3a46e_00002   ERROR    1e-05           3        128                0.7                   0.3
objective_function_3a46e_00003   ERROR    0.001           6        32                 0.8                   0.4
objective_function_3a46e_00004   ERROR    1e-05           5        16                 0.8                   0.3
Number of errored trials: 5
Trial name                       # failures   error file
objective_function_3a46e_00000   1            /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00000_0_attn_dropout_prob=0.2000,epochs=3,hidden_dropout_prob=0.5000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
objective_function_3a46e_00001   1            /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00001_1_attn_dropout_prob=0.3000,epochs=8,hidden_dropout_prob=0.6000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
objective_function_3a46e_00002   1            /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00002_2_attn_dropout_prob=0.3000,epochs=3,hidden_dropout_prob=0.7000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
objective_function_3a46e_00003   1            /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00003_3_attn_dropout_prob=0.4000,epochs=6,hidden_dropout_prob=0.8000,learning_rate=0.0010,train_batch_siz_2024-02-05_01-09-59/error.txt
objective_function_3a46e_00004   1            /root/autodl-tmp/zzzzzz/RecBole-1.2.0/ray_log/objective_function_2024-02-05_01-09-59/objective_function_3a46e_00004_4_attn_dropout_prob=0.3000,epochs=5,hidden_dropout_prob=0.8000,learning_rate=0.0000,train_batch_siz_2024-02-05_01-09-59/error.txt
Traceback (most recent call last):
  File "run_hyper.py", line 127, in <module>
    ray_tune(args)
  File "run_hyper.py", line 94, in ray_tune
    result = tune.run(
  File "/root/miniconda3/lib/python3.8/site-packages/ray/tune/tune.py", line 1036, in run
    raise TuneError("Trials did not complete", incomplete_trials)
ray.tune.error.TuneError: ('Trials did not complete', [objective_function_3a46e_00000, objective_function_3a46e_00001, objective_function_3a46e_00002, objective_function_3a46e_00003, objective_function_3a46e_00004])
Expected behavior
How can I resolve this error so that Ray runs successfully?
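Every trial in the log fails at the same call, `Config(config_dict=config_dict, config_file_list=config_file_list)`, and each trial's configuration lists only the five sampled hyperparameters. One possible reading, offered as an assumption rather than a confirmed diagnosis, is that neither the per-trial dict nor the config files supply a `model` key. The sketch below illustrates that reading with plain dictionaries. `with_fixed_keys` and `resolve_model` are hypothetical helpers, not part of RecBole or Ray, and `"PlaceholderModel"` and `"placeholder_dataset"` are stand-in values to be replaced with the real model and dataset names.

```python
# Minimal sketch; helper names and placeholder values are hypothetical.

def with_fixed_keys(trial_config, model, dataset):
    """Merge Ray's sampled hyperparameters with keys that every trial needs."""
    merged = dict(trial_config)          # copy so the sampled dict is untouched
    merged.setdefault("model", model)    # keep a value if one is already set
    merged.setdefault("dataset", dataset)
    return merged


def resolve_model(config_dict, file_config):
    """Stand-in for the failing lookup: per-trial dict first, then config files.

    This only mimics the reported error; it is not RecBole's implementation.
    """
    model = config_dict.get("model", file_config.get("model"))
    if model is None:
        raise KeyError("model need to be specified ...")
    return model


# Values copied from trial objective_function_3a46e_00002 in the log.
trial_config = {
    "learning_rate": 1e-05,
    "epochs": 3,
    "train_batch_size": 128,
    "hidden_dropout_prob": 0.7,
    "attn_dropout_prob": 0.3,
}

merged = with_fixed_keys(trial_config, "PlaceholderModel", "placeholder_dataset")
print(resolve_model(merged, {}))  # the lookup succeeds once "model" is present
```

If this reading is right, the same effect could presumably be had by adding `model` and `dataset` entries to one of the config files passed to `run_hyper.py`, or by adding them to the dict given to `tune.run` as fixed (non-sampled) values.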
Screenshots
Add screenshots to help explain your problem. (Optional)
Links
Add a link to code that reproduces the bug, such as Colab or another online Jupyter platform. (Optional)
Environment (please complete the following information):