kylebarron / stata-parquet-old

Read and write Parquet files from Stata
GNU General Public License v3.0
4 stars 0 forks source link

leads #1

Open kylebarron opened 6 years ago

kylebarron commented 6 years ago

I can see that when reading a Parquet file, the top-level class ParquetFile() in parquet.py instantiates ParquetReader() from _parquet.pyx. The top level function to read an entire file, ParquetFile.read() calls ParquetReader.read_all(). The gist of that function looks to be:

cdef:
    shared_ptr[CTable] ctable
    vector[int] c_column_indices

    check_status(self.reader.get().ReadTable(&ctable))
return pyarrow_wrap_table(ctable)

ctable is defined in lib.pxd.

kylebarron commented 6 years ago

Library locations for working parquet-cpp/examples/parquet-arrow image