Closed viai957 closed 5 months ago
Currently this only supports the C4 mini and C4 datasets. I would love to see FineWeb dataset support.
Just use `load_from_disk` and pass the path to your local copy of FineWeb. It works fine for me.
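For anyone landing here later, this is roughly what that looks like. It is a minimal sketch, assuming the Hugging Face `datasets` library and that your FineWeb copy was written with `Dataset.save_to_disk()`, since `load_from_disk` only reads that format. The helper name `load_local_fineweb` and the path are hypothetical, not part of this repo.

```python
from pathlib import Path


def load_local_fineweb(path):
    """Load a dataset directory previously written with save_to_disk().

    `load_local_fineweb` is a hypothetical helper name used for illustration.
    """
    path = Path(path)
    if not path.is_dir():
        # Fail early with a clear message instead of a deeper library error.
        raise FileNotFoundError(f"No saved dataset directory at {path}")
    # Imported lazily so the path check above runs before `datasets` is needed.
    from datasets import load_from_disk
    return load_from_disk(str(path))


# Usage (hypothetical path; point it at wherever your FineWeb copy lives):
#   ds = load_local_fineweb("/path/to/fineweb")
#   print(ds)
```

How the returned dataset gets wired into the training script depends on how the script currently loads C4, so that part is left out here.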