Closed xinzheng99 closed 3 months ago
This came up when I constructed some made-up data myself:
dataset = pd.DataFrame([
    {"author": "Eva", "message": "I'm fine, thanks!", "date(yyyyMMddHHmmss)": 20240821204422},
    {"author": "Eva", "message": "I'm fine, thanks!", "date(yyyyMMddHHmmss)": 20240821204422},
    {"author": "Eva", "message": "I'm fine, thanks!", "date(yyyyMMddHHmmss)": 20240821204422},
    {"author": "Eva", "message": "I'm fine, thanks!", "date(yyyyMMddHHmmss)": 20240821204422},
])
Traceback (most recent call last):
File "/Users/maxz/graphrag-source/examples/entity_extraction/with_nltk/run.py", line 93, in
Marking as duplicate of https://github.com/microsoft/graphrag/issues/353
Do you need to file an issue?
Describe the issue
While working through the examples under the examples directory, I found that the data they need is not in the repository. Path: examples/entity_extraction/with_graph_intelligence/run.py Needs: `sample_data_dir = os.path.join(
    os.path.dirname(os.path.abspath(__file__)), "../../_sample_data/"
)
shared_dataset = asyncio.run(
    load_input(
        PipelineCSVInputConfig(
            file_pattern=r".*\.csv$",
            base_dir=sample_data_dir,
            source_column="author",
            text_column="message",
            timestamp_column="date(yyyyMMddHHmmss)",
            timestamp_format="%Y%m%d%H%M%S",
            title_column="message",
        ),
    )
)`
"../../_sample_data/" does not exist, and I
Steps to reproduce
Just the file doesn't exist.
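For anyone who wants to confirm the missing directory locally, here is a minimal sketch that checks a sample-data directory and, if it holds no CSV files, writes a one-row placeholder. It uses only the standard library and does not call graphrag. The column names come from the `PipelineCSVInputConfig` quoted above; the helper name `ensure_sample_data` and the file name `sample.csv` are assumptions made for illustration, and whether the example then runs end to end is untested.

```python
import csv
import os

# Column names taken from the PipelineCSVInputConfig quoted in this issue.
COLUMNS = ["author", "message", "date(yyyyMMddHHmmss)"]


def ensure_sample_data(sample_data_dir, filename="sample.csv"):
    """Create a tiny placeholder CSV if the directory has no .csv files.

    Hypothetical helper, not part of graphrag. Returns the path of the
    file it wrote, or None if CSV files were already present.
    """
    os.makedirs(sample_data_dir, exist_ok=True)
    if any(name.endswith(".csv") for name in os.listdir(sample_data_dir)):
        return None

    path = os.path.join(sample_data_dir, filename)
    rows = [
        {
            "author": "Eva",
            "message": "I'm fine, thanks!",
            # Matches the timestamp_format "%Y%m%d%H%M%S" from the config.
            "date(yyyyMMddHHmmss)": "20240821204422",
        },
    ]
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.DictWriter(handle, fieldnames=COLUMNS)
        writer.writeheader()
        writer.writerows(rows)
    return path
```

Calling `ensure_sample_data(sample_data_dir)` before `load_input(...)` would at least give the `.*\.csv$` pattern something to match; the placeholder row is made-up data, not the project's original sample set.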
GraphRAG Config Used
Logs and screenshots
No response
Additional Information