-
**Issue**
Your network structure is utilized only at the beginning of the pipeline in a pre-processing step.
This is classical ML where you manually construct features for your users.
This is kno…
-
I carefully read the code, I found the word embedding is only used to calculate the adjacency matrix, each epoch will calculate all the results,every time the input is always a diag without any word e…
-
in the forward pass, in the table wise sharding, when pooling is executed? is it after alltoall communication? and executed on trainer local? where can I see the exact code in torchrec code base?
i…
-
- Llama : https://llama.meta.com/docs/how-to-guides/fine-tuning
- Quantization : 실수형 변수(float)를 정수형 변수(int)로 변환
- 효과
- 모델 사이즈 축소
- 모델 연산량 감소
- 효율적인 하드웨어 사용
- Parameter Efficient Fi…
-
Hi ! Thanks for your great work for inspiring us! I have some doubts about the pretrained weight of cue aggregator. Are the parameters of forward order block and backward order block randomly initial…
-
Dear all:
I'm working with grafana recently to show some figures and tables. While there're issues i found. When I try to open grafana url with PyQt,the fatal problem come out. My python code and t…
-
Hello,
I'm working on a project that use graph neural networks, using embedding of bounding boxes images of persons and their similarity.
The model i use is OSNet-AIN, because according to the res…
-
I am going to use TensorRT to accelerate my inference step.
For many issues, like the input data is a dict, it cannot be converted to ONNX.
-
error on x64 config (x86 and arm) (tdlib 1.8.0 builded in vsix and installed)
#include "flatbuffers/flatbuffers.h"
at \Unigram\Libraries\libtextclassifier\lang_id\common\flatbuffers\embedding-networ…
-
While running the Nvidia code for DLRMv2 on a 4090 GPU with batch size 1400, we are seeing the below accuracy which is lower than expected. Can someone help us if we are missing something? We have tri…