Open sdx0112 opened 4 months ago
I looked into the cluster_graph.py code. Here is the code causing the issue from line 102:
output_df[[level_to, to]] = pd.DataFrame(
output_df[to].tolist(), index=output_df.index
)
The left side has two columns while the right side has only one column.
I also printed out output_df. It seems to have only one row, and only the first column, 'entity_graph', has a value.
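For anyone who wants to see the mismatch in isolation, here is a minimal sketch in plain pandas. It is not the graphrag code: the column names 'clustered_graph' and 'level' and the helper split_pairs are hypothetical stand-ins for to and level_to, and the None cell is an assumption about what an empty clustering result looks like.

```python
import pandas as pd


def split_pairs(df, source, first, second):
    """Split a column of 2-item pairs into two columns, checking the shape first.

    Hypothetical helper for illustration only; not part of graphrag.
    """
    expanded = pd.DataFrame(df[source].tolist(), index=df.index)
    if expanded.shape[1] != 2:
        raise ValueError(
            f"expected 2 columns from {source!r}, got {expanded.shape[1]}"
        )
    out = df.copy()
    out[[first, second]] = expanded
    return out


# Healthy case: every cell holds a (level, graph) pair, so the split yields two columns.
good = pd.DataFrame({"clustered_graph": [(0, "g0"), (1, "g1")]})
result = split_pairs(good, "clustered_graph", "level", "clustered_graph")

# Failing case (assumed shape): one row where only 'entity_graph' has a value.
bad = pd.DataFrame({"entity_graph": ["<graphml/>"], "clustered_graph": [None]})

# The raw assignment: the right side expands to a single column, the left side names two.
raw_error = None
try:
    bad[["level", "clustered_graph"]] = pd.DataFrame(
        bad["clustered_graph"].tolist(), index=bad.index
    )
except ValueError as exc:
    raw_error = str(exc)

# The guarded helper fails too, but reports how many columns it actually got.
guarded_error = None
try:
    split_pairs(bad, "clustered_graph", "level", "clustered_graph")
except ValueError as exc:
    guarded_error = str(exc)

print(raw_error)
print(guarded_error)
```

If this matches what is happening, the assignment on line 102 is only where the problem surfaces; the thing to look into is why the clustering step produced no pairs in the first place.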
How can I resolve this issue? Thanks.
I'm also getting the same issue when I run this code locally, but not when I run it on Google Colab.
Hey! I've been diving into this. It seems to be related to the way the models are called through the cache functions. That is part of the library itself, so I'm trying to figure out a better way to handle it locally. Right now, clearing the cache in the indexing dir you initialized seems to help. I'm still working on making this more stable and a non-issue. Thanks for your patience!
I am getting the same issue! I tried clearing the cache in the indexing dir, but it did not work. Then I tried to execute 'python -m graphrag.index --init --root ./indexing/', which raised another error: ValueError: Project already initialized at indexing. How can I resolve it?
The command python -m graphrag.index --init --root ./indexing will create .env and settings.yaml in the folder ./indexing.
If you already have those two files, it will raise the error: ValueError: Project already initialized at indexing.
Use python -m graphrag.index --root ./indexing to run the indexing.
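If you script this, a small guard avoids the "already initialized" error. This is only a sketch under assumptions: the function name indexing_command is made up, and treating the presence of settings.yaml or .env as "initialized" is my guess based on the two files mentioned above, not the check graphrag itself performs.

```python
from pathlib import Path


def indexing_command(root):
    """Build the graphrag.index command, adding --init only for a fresh folder.

    Assumption: the folder counts as initialized if settings.yaml or .env exists.
    """
    root_path = Path(root)
    initialized = (root_path / "settings.yaml").exists() or (root_path / ".env").exists()
    command = ["python", "-m", "graphrag.index"]
    if not initialized:
        command.append("--init")
    command += ["--root", str(root)]
    return command


if __name__ == "__main__":
    # Print the command instead of running it, so nothing is executed by surprise.
    print(" ".join(indexing_command("./indexing")))
```

The returned list can be passed to subprocess.run as-is if you do want to execute it.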
Hi, I have pulled the latest version and encountered the following error:
09:21:37,332 datashaper.workflow.workflow ERROR Error executing verb "cluster_graph" in create_base_entity_graph: Columns must be same length as key
Traceback (most recent call last):
  File "D:\miniforge3\envs\graphrag\Lib\site-packages\datashaper\workflow\workflow.py", line 410, in _execute_verb
    result = node.verb.func(**verb_args)
             ^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "D:\Projects\GPT\GraphRAG-Local\GraphRAG-Local-UI\graphrag\index\verbs\graph\clustering\cluster_graph.py", line 102, in cluster_graph
    output_df[[level_to, to]] = pd.DataFrame(