microsoft / graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system
https://microsoft.github.io/graphrag/
MIT License

Documentation request - add links to sample CSV / step to convert text to input CSV #133

Open erjadi opened 5 months ago

erjadi commented 5 months ago

Apologies if this is just my lack of understanding, but going through the getting started tutorial, there seems to be a step missing?

We download a book from Project Gutenberg in text format, and then we start the indexer. However, the indexer expects CSV files in the input folder, and we only have the book's .txt file.

I checked the dulce.csv file that is also in the repo so I could transform my input into something acceptable, but I think either links to sample CSV files or a documented step for converting text to the input CSV would help people who are starting out.
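For illustration, a text-to-CSV conversion step of the kind requested here might look like the sketch below. It is only a rough example under stated assumptions: the function name `text_to_csv`, the column names `id`, `title`, and `text`, and the fixed chunk size are all hypothetical, and none of them is confirmed as the schema the indexer expects. Compare against dulce.csv before relying on it.

```python
import csv
from pathlib import Path


def text_to_csv(txt_path, csv_path, chunk_chars=2000):
    """Split a plain-text file into fixed-size chunks, one CSV row per chunk.

    The columns (id, title, text) are assumed for illustration only;
    they are not taken from the graphrag documentation.
    """
    text = Path(txt_path).read_text(encoding="utf-8")
    title = Path(txt_path).stem  # file name without extension

    # Slice the text into consecutive chunks of at most chunk_chars characters.
    chunks = [text[i:i + chunk_chars] for i in range(0, len(text), chunk_chars)]

    # newline="" lets the csv module handle line endings inside quoted fields.
    with open(csv_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["id", "title", "text"])
        writer.writeheader()
        for idx, chunk in enumerate(chunks):
            writer.writerow({"id": idx, "title": title, "text": chunk})

    return len(chunks)
```

Calling `text_to_csv("book.txt", "input/book.csv")` would write one row per 2000-character chunk, assuming the `input` folder already exists; the file names here are placeholders.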

chiragshah285 commented 3 months ago

Totally agree here, in the same boat

natoverse commented 2 months ago

For some time there was a bug in the config such that the Gutenberg txt wasn't working. This has been fixed. So I think this can probably be closed, but I'll give it a few days to see if folks chime in with a similar issue.