issues
search
NVIDIA-Merlin
/
HugeCTR
HugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
Apache License 2.0
950
stars
200
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[Question] sok performance
#463
Orca-bit
opened
1 month ago
1
[BUG] sok amp mode error
#462
Orca-bit
opened
1 month ago
1
[BUG] encounter error when running sok dlrm benchmark
#461
Orca-bit
opened
1 month ago
2
[BUG] compile sok error
#460
Orca-bit
closed
1 month ago
2
[Question] What is the difference between HugeCTR/embedding and HugeCTR/src/embeddings?
#459
Orca-bit
closed
1 month ago
1
Sync from gitlab
#458
EmmaQiaoCh
closed
2 months ago
2
Bump actions/download-artifact from 3 to 4.1.7 in /.github/workflows
#457
dependabot[bot]
opened
2 months ago
1
[Question]Is there a version of tensorflow 1.15 with a merlin-tensorflow image of sok installed?
#456
recklessnolove
opened
3 months ago
1
low frequency filter
#455
ccccjunkang
closed
2 months ago
2
[Requirement] Custom allocator support for gpu_cache
#454
mfbalin
opened
4 months ago
1
[BUG]The wdl_8gpu.py script execution has halted and training cannot proceed.
#453
redzhang1990
opened
5 months ago
1
[BUG] Slot calculation error in static_hash_table.cu
#452
Jiaao-Bai
opened
5 months ago
2
[Question] How to add new models to HPS configuration when using Model Control Mode EXPLICIT?
#451
dmac
closed
5 months ago
5
[BUG] I/O error on Linux kernel with 64KiB base page size
#450
flx42
opened
5 months ago
0
[Question] Help converting ONNX to TensorRT with graphsurgeon and HPS plugin
#449
dmac
closed
5 months ago
4
Update hierarchical_parameter_server_demo.ipynb
#448
jq
opened
6 months ago
1
Remove some internal files
#447
EmmaQiaoCh
closed
7 months ago
1
Fix hps docs typo and hps profiler example argument
#446
shyeonn
opened
7 months ago
1
[BUG] Enabling regularization causes CUDNN_STATUS_MAPPING_ERROR for deepfm example
#445
klmentzer
opened
8 months ago
4
[Question] Is there any related architecture design or documentation for embedding collection
#444
Jiaao-Bai
closed
8 months ago
2
[Question] Can i read parquet data from HDFS?
#443
wangxingda
closed
8 months ago
6
[BUG]build failed on gtest!
#442
SeekPoint
closed
8 months ago
5
[BUG] cudaErrorIllegalAddress: an illegal memory access was encounteredThread
#441
kangna-qi
closed
8 months ago
4
[BUG] Seg Fault When Deploying TF+HPS Model with merlin-tensorflow
#440
tuanavu
opened
9 months ago
9
[BUG] Run sok tests error
#439
kangna-qi
closed
10 months ago
1
[Question] How to dump incremental model to kafka in Release 23.12?
#438
lausannel
opened
11 months ago
2
[Question] Is there pipeline mechanism to help the lookup requests always be handled on device cache in HugeCTR?
#437
Lifann
opened
11 months ago
1
support lock-free hashmap backend
#436
ZhuYuJin
opened
11 months ago
0
[BUG]preprocess.sh 1 criteo failed with 'Schema' object has no attribute 'write'
#435
SeekPoint
opened
11 months ago
1
build docker failed with 401 Unauthorized (Set Up the Development Environment With Merlin Containers)
#434
SeekPoint
opened
11 months ago
4
[BUG] CUDNN_STATUS_MAPPING_ERROR with cudnnSetStream
#433
rgandikota
closed
11 months ago
21
sok-experiment static_map empty_key_sentinel and reclaimed_key_sentinel is not right for int64 [BUG]
#432
amazingyyc
closed
11 months ago
4
Trouble installing hugectr_backend for Triton Server
#431
sezhiyanhari
closed
1 year ago
1
fix: typo in kafka broker
#430
lausannel
opened
1 year ago
1
[BUG] Encountered ETC error of din model when training with multiple keyset.
#429
dusir
closed
11 months ago
3
[Question] nv_gpu_cache compiling problem
#428
RobertLou
closed
1 year ago
1
[Question] How can I pre-calculate the GPU memory required for embedding cache size?
#427
tuanavu
opened
1 year ago
2
Support for configuration issues
#426
EmmaQiaoCh
opened
1 year ago
1
[Question] Difference between Embedding Training Cache and GPU Embedding Cache
#424
hsezhiyan
opened
1 year ago
9
Update doc dependencies
#423
EmmaQiaoCh
closed
1 year ago
1
[Question] How to serve TF2 SOK model in Triton Inference and convert it to ONNX?
#422
tuanavu
closed
1 year ago
1
[Question] COnfiguration issues with mlcommon benchmarking
#421
raghavendrachari08
opened
1 year ago
2
[Question] Is there a slack channel or discord server for questions and discussion ?
#420
lilida
opened
1 year ago
4
[Question]Running the DCN on a single GPU leads to the illegal memory access
#419
dusir
opened
1 year ago
1
[Question] tensorflow 1.15 sok example
#418
MichoChan
opened
1 year ago
2
[Question] An illegal memory access was encountered on H800 & Hugectr dcn test
#417
dusir
closed
1 year ago
4
[BUG] cooperative_groups/scan.h not in cuda11.X
#416
MichoChan
opened
1 year ago
5
[Question] How can I export keras model with SOK?
#415
longern
opened
1 year ago
3
[Question] Does HugeCtr support H800 GPU?
#414
sparkling9809
closed
1 year ago
6
[Question]Does HugeCtr support read data for trainning from Kafka ?
#413
sparkling9809
closed
1 year ago
3
Next