Closed padi-pm-dungnt closed 2 years ago
Hi there, Thanks for the report. I wanted to give a short message I've taken notice. I think it is indeed an issue and I'm investigating. Thankful for any hint. I think it has to do with RLE reading or a wrong offset. No ETA, I'm maintaining this in my spare time.
Thanks for your reply.I think the reason is due to wrong offset as you said.Maybe something wrong happen in some of these functions:
@padi-pm-dungnt I've identified and fixed the issue and I'm preparing a new release. It was a PHP syntax misinterpretation with parentheses.
I'd like to strip down the test file to contain just one column (project_id) and include a test case in this project. Please confirm you are authorized to give permission and allow me to use respective information from your provided parquet file.
@Katalystical Thanks for your update & looking forward for new release.
Hi, thanks for your great library.It works well with small parquet file, but when i tried to read data from Parquet file with ~500k row of data, array values from readColumn ->getData() become incorrect.
Here is my parquet file: https://dev-sc2-pn.s3.ap-northeast-1.amazonaws.com/sc2_area_master+(3).parquet
My parquet file has only 92 rows with project_id = '123456789012345678', but when i get data from colum getData(), it return more than 300k row with this project_id.
Here is my sample code.Do you have any idea about this issue?