Simplified regex for getting entities, removed expectation of the following characters "(", ")", "{", "}"
These characters are no longer a part of the node.text this is why the regex wasn't returning anything. With these changes the GraphRAG works. Why these characters are no longer a part of the node.text I don't know, if these characters should be a part of the text of the node this will not resolve the issue and make it worse.
Description
Simplified regex for getting entities, removed expectation of the following characters "(", ")", "{", "}"
These characters are no longer a part of the node.text this is why the regex wasn't returning anything. With these changes the GraphRAG works. Why these characters are no longer a part of the node.text I don't know, if these characters should be a part of the text of the node this will not resolve the issue and make it worse.
Fixes https://github.com/run-llama/llama_index/issues/15173
New Package?
Did I fill in the
tool.llamahub
section in thepyproject.toml
and provide a detailed README.md for my new integration or package?Version Bump?
Did I bump the version in the
pyproject.toml
file of the package I am updating? (Except for thellama-index-core
package)Type of Change
Please delete options that are not relevant.
How Has This Been Tested?
Your pull-request will likely not be merged unless it is covered by some form of impactful unit testing.
Suggested Checklist:
make format; make lint
to appease the lint gods