DeepRec is a high-performance recommendation deep learning framework based on TensorFlow. It is hosted as an incubation project in the LF AI & Data Foundation.
Performance on A100 is slower than nvidia-tensorflow #105
Closed
thinking03 closed 2 years ago
env: 8x A100 GPUs + Horovod
framework: DeepRec 1.15.5 vs nvidia-tensorflow 1.15.4
model: transformer + MMoE
result: DeepRec 1.8 s/step, nvidia-tensorflow 0.55 s/step
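
For scale, the two step times reported here work out to roughly a 3.3x slowdown. A minimal sketch of that arithmetic, using only the two numbers from this report (the variable names are illustrative, not from either framework):

```python
# Step times as reported in this issue (seconds per training step).
deeprec_s_per_step = 1.8
nvtf_s_per_step = 0.55

# How many times longer a DeepRec step takes than an nvidia-tensorflow step.
slowdown = deeprec_s_per_step / nvtf_s_per_step

# Equivalent throughput in steps per second for each framework.
deeprec_steps_per_s = 1.0 / deeprec_s_per_step
nvtf_steps_per_s = 1.0 / nvtf_s_per_step

print(f"slowdown: {slowdown:.2f}x")
print(f"DeepRec: {deeprec_steps_per_s:.2f} steps/s, nvidia-tensorflow: {nvtf_steps_per_s:.2f} steps/s")
```

This only restates the reported figures; it says nothing about where the extra time is spent.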