issues
search
NVIDIA-Merlin
/
HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Apache License 2.0
905
stars
196
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[Requirement] Custom allocator support for gpu_cache
#454
mfbalin
opened
3 days ago
0
[BUG]The wdl_8gpu.py script execution has halted and training cannot proceed.
#453
redzhang1990
opened
5 days ago
1
[BUG] Slot calculation error in static_hash_table.cu
#452
Jiaao-Bai
opened
2 weeks ago
1
[Question] How to add new models to HPS configuration when using Model Control Mode EXPLICIT?
#451
dmac
closed
2 weeks ago
5
[BUG] I/O error on Linux kernel with 64KiB base page size
#450
flx42
opened
2 weeks ago
0
[Question] Help converting ONNX to TensorRT with graphsurgeon and HPS plugin
#449
dmac
closed
1 month ago
4
Update hierarchical_parameter_server_demo.ipynb
#448
jq
opened
1 month ago
1
Remove some internal files
#447
EmmaQiaoCh
closed
2 months ago
1
Fix hps docs typo and hps profiler example argument
#446
shyeonn
opened
2 months ago
1
[BUG] Enabling regularization causes CUDNN_STATUS_MAPPING_ERROR for deepfm example
#445
klmentzer
opened
3 months ago
1
[Question] Is there any related architecture design or documentation for embedding collection
#444
Jiaao-Bai
closed
3 months ago
2
[Question] Can i read parquet data from HDFS?
#443
wangxingda
closed
3 months ago
6
[BUG]build failed on gtest!
#442
SeekPoint
closed
3 months ago
5
[BUG] cudaErrorIllegalAddress: an illegal memory access was encounteredThread
#441
kangna-qi
closed
3 months ago
4
[BUG] Seg Fault When Deploying TF+HPS Model with merlin-tensorflow
#440
tuanavu
opened
4 months ago
9
[BUG] Run sok tests error
#439
kangna-qi
closed
5 months ago
1
[Question] How to dump incremental model to kafka in Release 23.12?
#438
lausannel
opened
6 months ago
2
[Question] Is there pipeline mechanism to help the lookup requests always be handled on device cache in HugeCTR?
#437
Lifann
opened
6 months ago
1
support lock-free hashmap backend
#436
ZhuYuJin
opened
6 months ago
0
[BUG]preprocess.sh 1 criteo failed with 'Schema' object has no attribute 'write'
#435
SeekPoint
opened
6 months ago
1
build docker failed with 401 Unauthorized (Set Up the Development Environment With Merlin Containers)
#434
SeekPoint
opened
6 months ago
4
[BUG] CUDNN_STATUS_MAPPING_ERROR with cudnnSetStream
#433
rgandikota
closed
6 months ago
21
sok-experiment static_map empty_key_sentinel and reclaimed_key_sentinel is not right for int64 [BUG]
#432
amazingyyc
closed
7 months ago
4
Trouble installing hugectr_backend for Triton Server
#431
sezhiyanhari
closed
7 months ago
1
fix: typo in kafka broker
#430
lausannel
opened
7 months ago
1
[BUG] Encountered ETC error of din model when training with multiple keyset.
#429
dusir
closed
6 months ago
3
[Question] nv_gpu_cache compiling problem
#428
RobertLou
closed
8 months ago
1
[Question] How can I pre-calculate the GPU memory required for embedding cache size?
#427
tuanavu
opened
8 months ago
2
Support for configuration issues
#426
EmmaQiaoCh
opened
8 months ago
1
[Question] Difference between Embedding Training Cache and GPU Embedding Cache
#424
hsezhiyan
opened
8 months ago
9
Update doc dependencies
#423
EmmaQiaoCh
closed
9 months ago
1
[Question] How to serve TF2 SOK model in Triton Inference and convert it to ONNX?
#422
tuanavu
closed
8 months ago
1
[Question] COnfiguration issues with mlcommon benchmarking
#421
raghavendrachari08
opened
9 months ago
1
[Question] Is there a slack channel or discord server for questions and discussion ?
#420
lilida
opened
9 months ago
4
[Question]Running the DCN on a single GPU leads to the illegal memory access
#419
dusir
opened
9 months ago
1
[Question] tensorflow 1.15 sok example
#418
MichoChan
opened
9 months ago
2
[Question] An illegal memory access was encountered on H800 & Hugectr dcn test
#417
dusir
closed
8 months ago
4
[BUG] cooperative_groups/scan.h not in cuda11.X
#416
MichoChan
opened
9 months ago
5
[Question] How can I export keras model with SOK?
#415
longern
opened
10 months ago
3
[Question] Does HugeCtr support H800 GPU?
#414
sparkling9809
closed
9 months ago
6
[Question]Does HugeCtr support read data for trainning from Kafka ?
#413
sparkling9809
closed
10 months ago
3
HashMapBackend occupies 10x memory usage than binary data.
#412
ZhuYuJin
closed
8 months ago
7
[Question] Does HugeCTR support all P-series GPUs? and does it support tfserving as inference?
#411
Shu-HowTing
closed
8 months ago
2
[Requirement] FS Support for Azure Blob Storage
#410
shivamsbatra
closed
8 months ago
1
[BUG] Can’t compile sok
#409
kangna-qi
closed
11 months ago
2
[Question] Confused about the additional element of the output of InteractionLayer
#408
heroes999
closed
11 months ago
6
[Question] Is there any way for hps to load an embedding table into multiple GPUs?
#407
sparkling9809
closed
10 months ago
4
[Question]link for day_1.gz is invalid
#406
zmxdream
closed
11 months ago
0
Update session_inference_test.cpp
#405
lxh
closed
8 months ago
0
[Question] Multi-node training encounters Runtime error: unhandled system error ncclGroupEnd()
#404
heroes999
closed
11 months ago
9
Next