Closed gaurav closed 10 years ago
Done in @e1efa66 -- you can see the result at https://raw.githubusercontent.com/gaurav/commons-extraction/master/commonswiki/20140101/commonswiki-20140101-file-information.tql
This is probably as far as we can get just with WikiPage: anything more sophisticated will require making sense of the file metadata dump. I also need to use the code on line https://github.com/jimkont/extraction-framework/blob/server_test_extraction/core/src/main/scala/org/dbpedia/extraction/mappings/TemplateMapping.scala#L116 to add related classes, so that we end up emitting RDF from:
But that will rely on #10
This is done in @f512a9d6cd, including relatedClasses.
The MIME types could just be hard-coded into the FileTypeExtractor for now.