Closed kylebarron closed 5 years ago
I think there is some merge support in arrow, but the low-level API is so poorly documented that it might take a while to figure out.
parquet merge [m:1 or 1:1] using file.parquet
I don't think I could implement 1:m efficiently (and m:m is evil). I suppose this could also work:
parquet use file.parquet, merge([m:1 or 1:1 or 1:m] using file.parquet)
parquet query [SQL-like query? e.g. select * from file.parquet left join file.parquet etc.]
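To make the m:1 case concrete, here is a rough sketch in plain Python of a hash-based m:1 merge. It is only an illustration, not how this plugin or arrow does it: `merge_m1`, `master`, `using`, and the column names are all made up for the example. The point is that with a unique key on the using side, the output has exactly one row per master row, so the result size is known up front. A 1:m merge would instead have to expand master rows, which is the part that seems harder to do efficiently.

```python
def merge_m1(master, using, key):
    """Left-join `master` onto `using`; `key` must be unique in `using`."""
    # Build a lookup table from the using data, checking uniqueness as we go.
    lookup = {}
    for row in using:
        k = row[key]
        if k in lookup:
            raise ValueError(f"key {k!r} is not unique in the using data")
        lookup[k] = row

    # One output row per master row; unmatched rows keep only master columns.
    merged = []
    for row in master:
        match = lookup.get(row[key], {})
        merged.append({**match, **row})  # master values win on name clashes
    return merged


# Hypothetical data: several master rows can share one using row (m:1).
master = [
    {"id": 1, "wage": 10},
    {"id": 1, "wage": 12},
    {"id": 2, "wage": 9},
    {"id": 3, "wage": 7},
]
using = [{"id": 1, "state": "NY"}, {"id": 2, "state": "CA"}]

result = merge_m1(master, using, "id")
```

A 1:1 merge is the same thing with an extra uniqueness check on the master side.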
I don't want to re-invent the wheel with merges, though. If I can use some other library to do the merge and use this only to read the result into Stata once it's done, that'd be great.
Yes I agree.
Dan pointed me to this: https://github.com/kylebarron/ftools/commit/c8072fa2350f811f5626657350a60c8379353189
This can probably be closed
This is really Dan's feature request. Just something to put on a possible long term roadmap.