Thanks for open-sourcing your wonderful project! I noticed that the full training code and the 56K pretraining dataset haven’t been released yet, and I’m very interested in them. Could you please let me know if there are plans to release the code and dataset soon? They would be incredibly helpful for further research.
Thanks for open-sourcing your wonderful project! I noticed that the full training code and the 56K pretraining dataset haven’t been released yet, and I’m very interested in them. Could you please let me know if there are plans to release the code and dataset soon? They would be incredibly helpful for further research.