Open aminmkhan opened 5 years ago
It is not supported in the current version as we didn't have time to implement that. Not sure to support loading all sub-files only in one layer or scanning the whole folder recursively. In fact, #302 is a rough idea.
HDFSInputFormat
supports reading all files in the specified directory (#302). DoesFileInputFormat
withNFSFileSplitter
also support loading data from a folder?This can be useful for TF-IDF example, so that all input files from a folder are loaded. This would be similar behavior as for the TF-IDF example in Spark.