Open kylebarron opened 6 years ago
I can see that when reading a Parquet file, the top-level class ParquetFile() in parquet.py instantiates ParquetReader() from _parquet.pyx. The top level function to read an entire file, ParquetFile.read() calls ParquetReader.read_all(). The gist of that function looks to be:
ParquetFile()
parquet.py
ParquetReader()
_parquet.pyx
ParquetFile.read()
ParquetReader.read_all()
cdef: shared_ptr[CTable] ctable vector[int] c_column_indices check_status(self.reader.get().ReadTable(&ctable)) return pyarrow_wrap_table(ctable)
ctable is defined in lib.pxd.
ctable
lib.pxd
Library locations for working parquet-cpp/examples/parquet-arrow
parquet-cpp/examples/parquet-arrow
I can see that when reading a Parquet file, the top-level class
ParquetFile()
inparquet.py
instantiatesParquetReader()
from_parquet.pyx
. The top level function to read an entire file,ParquetFile.read()
callsParquetReader.read_all()
. The gist of that function looks to be:ctable
is defined inlib.pxd
.