apache / parquet-java

Apache Parquet Java
https://parquet.apache.org/
Apache License 2.0

Proposal to rename parquet-mr to parquet-java #2906

asfimport closed this issue 1 month ago

asfimport commented 1 month ago

When I first encountered "parquet-mr" I did not understand what the "mr" meant (I think it is MapReduce?), but I believe what the https://github.com/apache/parquet-mr repo actually contains is a Java implementation maintained by the Apache Parquet PMC.

@pitrou mentions the confusion about the parquet-mr naming as well here: https://github.com/apache/parquet-site/pull/53#discussion_r1598284018

Also, can we explain what "mr" stands for? It's a mystery for most people.

Julien Le Dem suggested renaming the parquet-mr repository to parquet-java to reduce confusion about its content: https://github.com/apache/parquet-site/pull/59#pullrequestreview-2056038304

This looks great. Thank you for taking the initiative. Hadoop is not required indeed. Perhaps at some point we should rename parquet-mr to parquet-java?

Thus I suggest we rename the repo to parquet-java. This requires an infra ticket, which I can file if people agree.

Note that all existing GitHub links would continue to function by redirecting to the new name (we recently renamed https://github.com/apache/arrow-datafusion to https://github.com/apache/datafusion, for example, and all the old links still work).

Reporter: Andrew Lamb / @alamb

Note: This issue was originally created as PARQUET-2475. Please see the migration documentation for further details.

asfimport commented 1 month ago

Andrew Lamb / @alamb: Related mailing list discussion: https://lists.apache.org/thread/m911chzz5pz8o8fb1y0x49h96g8w29d7

asfimport commented 1 month ago

Andrew Lamb / @alamb: Infra request: https://issues.apache.org/jira/browse/INFRA-25802

asfimport commented 1 month ago

Andrew Lamb / @alamb: Repo has been renamed: https://github.com/apache/parquet-java