uvacw / inca


importer class incorrectly assumes that all documents are of one single doctype #319

Open damian0604 opened 6 years ago

damian0604 commented 6 years ago

It seems that the class `Importer(BaseImportExport)`, as defined in `core/import_export_classes`, assumes that the batch to be imported is of a single doctype (`doctype` is a mandatory argument of the `.run()` method).

This makes the importer incompatible with the exporter: if I use the exporter to export a batch of JSON documents that happen to have multiple doctypes, I cannot import them back using the importer.

It would be nice if this could be fixed, so that the JSON importers/exporters in the `importers_exporters/` folder can be used to transfer documents between ES instances and for backup purposes.
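To illustrate the mismatch: a mixed export contains documents with different `doctype` fields, while `.run()` takes one doctype for the whole batch. Until this is fixed, a possible workaround (a sketch, assuming the export is a JSON array of document dicts that each carry a `doctype` key; `group_by_doctype` is a hypothetical helper, not part of INCA) is to group the exported documents and call the importer once per doctype:

```python
import json
from itertools import groupby


def group_by_doctype(path):
    """Group exported documents by their 'doctype' field, so that each
    group can be fed to the current single-doctype importer separately.

    Assumes `path` points to a JSON array of document dicts.
    """
    with open(path) as f:
        docs = json.load(f)
    key = lambda d: d['doctype']
    # groupby only merges adjacent items, so sort by doctype first
    return {dt: list(grp) for dt, grp in groupby(sorted(docs, key=key), key=key)}
```

Each resulting group could then be written to its own file and imported with the existing `.run(path, doctype=dt)` call, at the cost of one pass per doctype.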

This needs to be resolved to solve https://github.com/uvacw/inca/issues/291

damian0604 commented 6 years ago

I just realized that of course we HAVE already fixed this once, namely in the LexisNexis importer. There we do it as follows:

```python
def run(self, path, *args, **kwargs):
    """Uses the documents from the load method in batches."""
    # This method is overwritten because, in contrast to
    # other importers, we do not have a single doctype:
    # each document can have a different one.
    for doc in self.load(path, *args, **kwargs):
        self._ingest(iterable=doc, doctype=doc['doctype'])
    self.processed += 1
```

Anyhow, we need to make sure this works for the JSON importer (and in principle also the CSV one) as well.
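A generalized version of that override might look like the sketch below. This is not INCA code: `MultiDoctypeImporter`, its `load`, and its `_ingest` are stand-ins for `BaseImportExport` and its methods of the same names, and the optional `doctype` fallback argument is a hypothetical addition. The idea is simply to read the doctype per document instead of per batch:

```python
class MultiDoctypeImporter:
    """Minimal sketch of an importer that reads the doctype per document.

    The real class would inherit from BaseImportExport; `load` and
    `_ingest` here are simplified stand-ins for the INCA methods.
    """

    def __init__(self):
        self.processed = 0
        self.ingested = []  # stand-in for the Elasticsearch index

    def load(self, docs):
        # In INCA, load() reads documents from a path;
        # here it simply yields the given dicts.
        yield from docs

    def _ingest(self, iterable, doctype):
        self.ingested.append((doctype, iterable))

    def run(self, docs, doctype=None, *args, **kwargs):
        """Ingest documents one by one, taking the doctype from each
        document and falling back to the `doctype` argument for
        documents that lack a 'doctype' field."""
        for doc in self.load(docs, *args, **kwargs):
            self._ingest(iterable=doc, doctype=doc.get('doctype', doctype))
            self.processed += 1
```

With this shape, a batch exported from several doctypes round-trips without the caller having to know a single doctype up front, while single-doctype batches still work via the fallback argument.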