HuangChiEn commented 2 years ago

def Clf_trainer(x_train, y_train, save_path, r_seed):
    print('[INFO] : data prepare \n')
    (tra_X, tra_y), (val_X, val_y) = prepare_train_data(x_train, y_train, resampling=False, val_ratio=0.2, r_seed=r_seed)

    print('[INFO] : auto fit data \n')
    clf = AutoSklearn2Classifier(time_left_for_this_task=1800, ensemble_size=20, memory_limit=2048)
    clf.fit(tra_X, tra_y)

    print(clf.show_models())

if __name__ == '__main__' :
    from sklearn.metrics import classification_report
    from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score
    from sklearn.model_selection import train_test_split
    from sklearn.utils import shuffle as sk_shuffle
    from autosklearn.experimental.askl2 import AutoSklearn2Classifier

    (x_train, y_train), (x_test, y_test) = Get_Numpy_Datasets_of_Training_and_Testing(cfg)
    Clf_trainer(x_train, y_train, cfg['training']['save_path'], cfg['training']['random_state'])

🔥 For running the above code snippet, I have encountered the following Error :

[ERROR] [2022-05-05 02:47:20,752:Client-AutoML(1):ada19815-cc1d-11ec-8fbd-0242ac110004] Dummy prediction failed with run state StatusType.MEMOUT and additional output: {'error': 'Memout (used more than 2048 MB).', 'configuration_origin': 'DUMMY'}.

Traceback (most recent call last): File "auto_ml.py", line 185, in Clf_trainer(x_train, y_train, cfg['training']['save_path'], cfg['training']['random_state']) File "auto_ml.py", line 115, in Clf_trainer clf.fit(tra_X, tray) File "/opt/conda/lib/python3.8/site-packages/autosklearn/experimental/askl2.py", line 460, in fit return super().fit( File "/opt/conda/lib/python3.8/site-packages/autosklearn/estimators.py", line 1045, in fit super().fit( File "/opt/conda/lib/python3.8/site-packages/autosklearn/estimators.py", line 375, in fit self.automl.fit(load_models=self.load_models, kwargs) File "/opt/conda/lib/python3.8/site-packages/autosklearn/automl.py", line 2056, in fit return super().fit( File "/opt/conda/lib/python3.8/site-packages/autosklearn/automl.py", line 808, in fit self.num_run += self._do_dummy_prediction(datamanager, num_run=1) File "/opt/conda/lib/python3.8/site-packages/autosklearn/automl.py", line 476, in _do_dummy_prediction raise ValueError( ValueError: Dummy prediction failed with run state StatusType.MEMOUT and additional output: {'error': 'Memout (used more than 2048 MB).', 'configuration_origin': 'DUMMY'}.

🏴󠁮󠁡󠁯󠁳󠁿 OS related information :

In the code snippet, I also declare environment var

os.environ['OPENBLAS_NUM_THREADS'] = '4'

ulimit -a

core file size (blocks, -c) unlimited data seg size (kbytes, -d) unlimited scheduling priority (-e) 0 file size (blocks, -f) unlimited pending signals (-i) 7256749 max locked memory (kbytes, -l) 65536 max memory size (kbytes, -m) unlimited open files (-n) 1048576 pipe size (512 bytes, -p) 8 POSIX message queues (bytes, -q) 819200 real-time priority (-r) 0 stack size (kbytes, -s) 8192 cpu time (seconds, -t) unlimited max user processes (-u) unlimited virtual memory (kbytes, -v) unlimited file locks (-x) unlimited

memory info:

Although the default setting seems not the suggested setup, every suggestion of setup for fit function will be appreciate!!

eddiebergman commented 2 years ago

So I will assume your dataset is not so large as to cause problems. We can reduce the size of numpy datasets but pandas datasets are still not support. If it exceeds your total memory_limit=2048 then yes, the dummy will fail.

The short term solution is up the memory_limit until it works. Not ideal but if you need results sooner rather than later than this is what I can suggest.

However i've seen two issues related to this now so I'm starting to think this may be something internal I did which is unfortunate as I have no clue what that may be.

Some bullet points:

What is the size of your dataset?
Can you report import psutil; print(psutil.Process().memory_info()) in the process, just before you call fit? The dependency is part of the autosklearn stack so you shouldn't have to install anything.
The ulimits are only relevant in the main process but when we train models, we use that memory_limit=2048 to set new ones for a spawn subprocess, i.e. the 1.73T available memory you report won't do much.
This subprocess inherits all of your imports. If you have many big libraries, this means the process that trains the DUMMY configuration can have a large memory footprint, before even considering the data or the the model itself.

I need to look into this and see if there is anything in auto-sklearn that specifically causes the process size to explode as I've seen it in tests too. I'll get back to you if I see anything.

eddiebergman commented 2 years ago

This issue also appeared in #1453 for @belzheng, I'll report back what I find here

eddiebergman commented 2 years ago

Hi @HuangChiEn and @belzheng,

I did some testing locally and a clean install of auto-sklearn only consumes about 900mb of memory for me by the time _do_dummy_prediction is called and you receive that error. Therefore I do not think this issue is on our side and we recommend reading our FAQ section to figure out what's going.

https://automl.github.io/auto-sklearn/master/faq.html#resource-management

The recommended solution is still to increase the memory limit if you have a lot of packages in your setup or a lot of data.

For the future, we have some limited dataset reduction in place but this only applies to the training set in fit and only applies to numpy only data. For pandas, we will look to AutoPytorch as they recently had some solution there.

I handy debugging tool is to do import psutil; print(psutil.Process().memory_info().vms to see your memory consumption at any point you like. This will give you memory consumption in bytes but you can convert quickly by doing x / (2**20).

I will close this issue as there's not much we can do but point to documentation. If you've tried these different approaches and have code that show it still does not work, please feel free to re-open.

Best, Eddie

automl / auto-sklearn

Get ValueError: Dummy prediction failed with run state StatusType.MEMOUT in fit function #1460

[ERROR] [2022-05-05 02:47:20,752:Client-AutoML(1):ada19815-cc1d-11ec-8fbd-0242ac110004] Dummy prediction failed with run state StatusType.MEMOUT and additional output: {'error': 'Memout (used more than 2048 MB).', 'configuration_origin': 'DUMMY'}.

In the code snippet, I also declare environment var

ulimit -a

memory info: