First off, thank you for making this class. I'm trying to use Freebase's dump with Java, and there's very little documentation / info available.
I have a couple of questions regarding using this class:
1) Can I just pass in the path to the gzipped file to this class, or do I need to split the file into parts, etc, as the notes.txt file linked by you describes? I'm using a high memory ec2 instance (100+MB) for this, so memory is not a problem.
2) It seems that you just output the fixed string, but don't write it back to the file? Will running this script actually fix the file? Since you don't seem to be writing your changes anywhere.
Again, thanks, and I'm very interested in this project. I'd be happy to contribute more to this repo.
Hi there,
First off, thank you for making this class. I'm trying to use Freebase's dump with Java, and there's very little documentation / info available.
I have a couple of questions regarding using this class:
1) Can I just pass in the path to the gzipped file to this class, or do I need to split the file into parts, etc, as the notes.txt file linked by you describes? I'm using a high memory ec2 instance (100+MB) for this, so memory is not a problem.
2) It seems that you just output the fixed string, but don't write it back to the file? Will running this script actually fix the file? Since you don't seem to be writing your changes anywhere.
Again, thanks, and I'm very interested in this project. I'd be happy to contribute more to this repo.