Open Bill-WangJiLong opened 1 month ago
The primary computational expense of Visualized BGE is concentrated in the Stage 1 training. During this stage, the Base model undergoes a total of (116K+48K) training steps. This process takes approximately 15 days when utilizing 24 A800 GPUs.
May I ask how much GPU resources and time you spent training Visualized BGE