FlagOpen / FlagEmbedding

Retrieval and Retrieval-augmented LLMs
MIT License
6.79k stars 485 forks source link

The cost of training Visualized BGE #1003

Open Bill-WangJiLong opened 1 month ago

Bill-WangJiLong commented 1 month ago

May I ask how much GPU resources and time you spent training Visualized BGE

JUNJIE99 commented 1 month ago

The primary computational expense of Visualized BGE is concentrated in the Stage 1 training. During this stage, the Base model undergoes a total of (116K+48K) training steps. This process takes approximately 15 days when utilizing 24 A800 GPUs.