Rdatatable / data.table

R's data.table package extends data.frame:
http://r-datatable.com
Mozilla Public License 2.0
3.59k stars 978 forks source link

What is "within scope" for the data.table package? #5722

Closed TysonStanley closed 9 months ago

TysonStanley commented 11 months ago

As we are working towards a new governance document for data.table (#5676), it seems important to consider what features are “within scope” of the data.table package. For instance, it is clear that essentially anything tied to data wrangling, cleaning, reformatting, structuring, and analysis are all within the scope of the package. But currently plotting (e.g., something like ggplot2) is not. To help communicate and guide feature requests and contributions, having this laid out clearly is important. Notably, the README states “data.table provides a high-performance version of base R's data.frame with syntax and feature enhancements for ease of use, convenience and programming speed.” Obviously, there are additional features (e.g., fread()) in addition to the high-performance version of data.frames.   To start, here are the features that are already within scope (as described in the README).

Topics we currently believe are out of scope:

Please add any others that make sense to include or should be discussed. Note that these topics may be relevant for the “Seal of Approval” (#5723).

jangorecki commented 11 months ago

I think we will not cover up front all options, and best may be just how we dealt with it till now.

If there is a request/idea, and we are convinced it is out of scope, we just close the issue with such comment. We can of course re-open it later on, based on more comments or upvotes.

TysonStanley commented 11 months ago

Yeah, I didn't think it would be possible to be comprehensive and have the foresight for all future developments. I think some of this is more to help new people understand the overall plans for where development will go. Do you think it's useful at all to delineate this information? Even with something like a 1 (or 5) year plan as we are putting together the governance doc?

Also, curious if the decision point is mostly on the spot with the active developers and a quick "tally" of agreement on the issue? I think that makes a lot of sense (especially with the opportunity to reactivate with enough comments/upvotes.

jangorecki commented 11 months ago

I think some of this is more to help new people understand the overall plans for where development will go.

That would be nice but without having funding for development team these kinds of roadmap will be "where we would like development will go" and not really where "development will go". We cannot really make much commitments without having business model or some other kinds of funding. I can say what where two big points on the roadmap in the recent past: on disk file mapped data.table and long vector support. I cannot easily imagine having those anytime soon (and later) when Matt is not actively developing anymore. PoC yes, but having it ready to merge to master will be completely different story. As long as volunteers are contributing I don't think it make sense to hold any commitments.