Closed romulogoncalves closed 6 years ago
It's not an issue, it's a report that i can move to a debug level. It is about gz
compression: https://github.com/geotrellis/geotrellis-pointcloud/blob/master/src/main/scala/geotrellis/pointcloud/spark/io/hadoop/formats/FileStreamRecordReader.scala#L41
When reading the point with the new stream approach from #11 we see the following warning being reported:
It seems it is not reading as a compressed file that will lead to corrupted data. In #11 we tested by reading the metadata to trigger the creation of the RDD. The metadata was correct so we assumed things were working because the RDD had to be populate before we could retrieve the metadata. However, we did not check the data content to see if it is corrupted or not.
It is strange that the codec is not found because if we misspell it geotrellis-pointcloud reports an error.
We read the file like this: