Open wladimirleite opened 7 months ago
I processed an UFDR with 56GB.
It resulted in a case with 160 GB in index
folder and 456 GB in neo4j
folder.
I disabled extractMessages
in ParserConfig.xml
as workaround. Also disabled enableGraphGeneration
. Now the case 6.8 GB in index
folder.
I think that makes sense messages to be linked with the group, not the members of the group, since it is common huge groups in Telegram (with thousands messages and thousands members). Maybe there should be a metadata in the group containing all members.
Thanks @aberenguel for your feedback. This change will be definitely implemented and included in 4.2.0 version.
As discussed in this https://github.com/sepinf-inc/IPED/pull/1999#issuecomment-1839261218, Telegram groups with a lot of members can take too long to be processed and generate a very large case, if each member is included in the multivalued "Communication:To" metadata.