Hello, I added a step-by-step guide on how to use the `datasets` library from Hugging Face and turn its datasets into mlx streams.
Although mlx does not depend on Hugging Face, this is such a common workflow in the industry that I thought it might be helpful to share. I'm not sure whether this should live in mlx-data or the main mlx repo.
I also have a notebook version here: https://github.com/mwrites/apple-mlx-tutorials/blob/main/hf_datasets_mlx_streams.ipynb
Side Note:
I noticed that these two minimal examples always print `<class 'numpy.ndarray'>`, hence the need to do `mx.array(buff[0]['x'])` again later when passing the inputs to a model.
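
For reference, here is a rough sketch of that conversion step. The `buff` list below is a hand-made stand-in for illustration (not an actual mlx-data buffer), `as_model_inputs` is a hypothetical helper name, and the fallback to `np.asarray` is only there so the snippet still runs on machines without MLX installed.

```python
import numpy as np

try:
    import mlx.core as mx  # present when MLX is installed
    to_array = mx.array
except ImportError:
    # Assumption: fall back to NumPy so the sketch runs without MLX
    to_array = np.asarray

# Stand-in for a buffer of samples: each sample is a dict of NumPy arrays
buff = [
    {"x": np.arange(4, dtype=np.float32)},
    {"x": np.ones(4, dtype=np.float32)},
]


def as_model_inputs(sample, keys=("x",)):
    """Convert the selected NumPy fields of one sample with to_array."""
    return {k: to_array(sample[k]) for k in keys}


inputs = as_model_inputs(buff[0])
print(type(buff[0]["x"]))  # the sample itself still holds a NumPy array
print(type(inputs["x"]))   # converted copy, ready to pass to a model
```

Doing the conversion in one small helper at the model boundary keeps the data pipeline itself in NumPy and avoids scattering `mx.array(...)` calls through the training loop.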