Questions about Figure 3 in the original paper

In the figure, Rank = 1024 and Rank = 512 is very close to the baseline, even better than the baseline. In response, I have the following 2 questions.

Is Rank = 1024 and Rank = 512 steadily better than baseline, or is there some randomness? If it is steadily better than baseline, how can we explain this phenomenon?
Have you ever done an experiment with a very small case of Rank (e.g. n/8、n/16), and how much does this affect the results of the experiment specifically? Looking forward to your early reply. Your support has been invaluable to me.

jiaweizzhao / GaLore

Questions about Figure 3 in the original paper #42