blei-lab / hdp

Hierarchical Dirichlet processes. Topic models where the data determine the number of topics. This implements Gibbs sampling.
GNU General Public License v2.0
150 stars 47 forks source link

The executive function takes up too much resources and the process is killed. #10

Open simon28li opened 4 months ago

simon28li commented 4 months ago

Hello, I have been testing the function of hdp recently, but in the process of testing, I will encounter the problem that the process is killed due to the excessive occupation of environmental resources. Is there something wrong with my usage or is the test failure caused by my own environmental restrictions?

ls data.txt

3 2 6
0:1 1:2 2:3
0:2 1:1 2:3
0:3 1:1 2:2
0:1 1:3 2:2
0:2 1:2 2:2
0:1 1:2 2:3

hdp --algorithm train --data data.txt  --directory train_dir

Program starts with following parameters:
algorithm:          = train
data_path:          = data.txt
directory:          = train_dir
max_iter            = 1000
save_lag            = 100
init_topics         = 0
random_seed         = 1716260426
gamma_a             = 1.00
gamma_b             = 1.00
alpha_a             = 1.00
alpha_b             = 1.00
eta                 = 0.50
#restricted_scans   = 5
split-merge         = no
sampling hyperparam = no

reading data from data.txt
Killed
cat /etc/os-release
NAME="CentOS Linux"
VERSION="8 (Core)"
ID="centos"
ID_LIKE="rhel fedora"
VERSION_ID="8"
PLATFORM_ID="platform:el8"
PRETTY_NAME="CentOS Linux 8 (Core)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:centos:centos:8"
HOME_URL="https://www.centos.org/"
BUG_REPORT_URL="https://bugs.centos.org/"

CENTOS_MANTISBT_PROJECT="CentOS-8"
CENTOS_MANTISBT_PROJECT_VERSION="8"
REDHAT_SUPPORT_PRODUCT="centos"
REDHAT_SUPPORT_PRODUCT_VERSION="8"

uname -a
Linux simon28li 4.18.0-383.el8.aarch64 #1 SMP Wed Apr 20 15:39:57 UTC 2022 aarch64 aarch64 aarch64 GNU/Linux