-
Hello,
I am trying to use the warctools library to open a WAT file, which is a special type of WARC file that contains metadata. When I try to read in a file I get this:
('Reading file: ', 'sampleW…
-
_This issue is being filed by a script, but if you reply, I will see it._
[PackageEvaluator.jl](https://github.com/IainNZ/PackageEvaluator.jl) is a script that runs nightly. It attempts to load all J…
-
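鱼片的小露宝 Hi, I am learning Hadoop and would like to know where I can get some raw data samples (like the NCDC weather data logs mentioned in Hadoop: The Definitive Guide) to practice data analysis. Thanks.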
-
-
- [x] manage hadoop
- [x] compile example without ant
- [x] run example without ccRunExample
- [x] find hadoop output
-
When using uncompressed ARC files with Nanite, which uses warc-hadoop-recordreaders, an exception is thrown.
ARCReaderFactory in 3.1.0/3.1.1 has two methods to open an ARC, only one of which tests fo…
-
Hey Julien,
Would you be open to a patch that makes Behemoth work with LucidWorksEnterprise? It's a standalone module (you can see it on my fork under the LWE branch). It only requires Solr dependencies…
-
Hi, would you consider adding an Amazon Web Services icon to your set?
http://www.commoncrawl.org/wp-content/uploads/2012/01/AWS_LOGO_CMYK.png
And, in a totally different category, Drupal?
http://drupal…
-
Hi All!
When trying to build the JAR using Ant, I receive the following errors:
compile-core-classes:
[javac] Compiling 167 source files to /root/commoncrawl/commoncrawl-commoncrawl-24052ae/build/classes
…