techascent / tech.ml.dataset

A Clojure high performance data processing system
Eclipse Public License 1.0
680 stars 35 forks source link

support jsonl format for read #404

Open behrica opened 6 months ago

behrica commented 6 months ago

Like this file: https://github.com/haifengl/smile/blob/v2.6.0/shell/src/universal/data/json/books1.json

It it technically not valid JSON. I think it's usually called "jsonl"

harold commented 6 months ago

related recent discussion: https://clojurians.zulipchat.com/#narrow/stream/236259-tech.2Eml.2Edataset.2Edev/topic/writing.20.26.20reading.20JSONL.20with.20charred