Open Edenzzzz opened 7 months ago
Thanks for the integration. I just tried again using the latest datasets version and it worked smoothly from my end. Is it possible due to other issue?
Thanks for replying. I think they've fixed the data and I can load it now.
Also, any ideas from integration with FSDP on projecting sharded gradients that are flattened and not necessarily reshapable into a matrix? We can all-gather grads before the SVD step, but doing that in every step will be prohibitively expensive. We can't project flat vectors. Thanks!
Hi, Thanks for the good work. I'm trying to intergrate this into Colossal-AI(https://github.com/hpcaitech/ColossalAI), compatible with Tensor Parallel and ZeRO. However, I had trouble loading the dataset; seems they updated the dataset to remove the json schema. Could you share your dataset version and how you're able to load it? Thanks!