GAA-UAM / scikit-fda

Functional Data Analysis Python package
https://fda.readthedocs.io
BSD 3-Clause "New" or "Revised" License

Data structure for discretized sparse data #498

Open vnmabus opened 1 year ago

vnmabus commented 1 year ago

Is your feature request related to a problem? Please describe.

We need an FData subclass capable of storing discretized functions where:

This structure is necessary for efficient storage and computation with this kind of data.

Describe the solution you'd like

The proposal for the implementation is to have three arrays:

This approach has a compact representation and also allows for vectorization to be applied. The indexes can be used to apply the reduceat method of NumPy ufuncs.
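
As a rough illustration of how start indexes and `reduceat` can work together, here is a minimal sketch. The array names (`start_indices`, `points`, `values`) and the flat, concatenated layout are assumptions made for this example only; they are not necessarily the structure proposed for scikit-fda.

```python
import numpy as np

# Hypothetical layout (an assumption for illustration): the samples of all
# functions are concatenated into flat arrays, and start_indices[i] marks
# where the samples of function i begin.
start_indices = np.array([0, 3, 5])
points = np.array([0.0, 0.5, 1.0, 0.2, 0.8, 0.1, 0.4, 0.6, 0.9])
values = np.array([1.0, 2.0, 3.0, 4.0, 6.0, 1.0, 1.0, 2.0, 4.0])

# Number of samples per function, from consecutive start offsets.
counts = np.diff(np.append(start_indices, len(values)))

# Per-function reductions in one vectorized call each, with no Python loop.
sums = np.add.reduceat(values, start_indices)
means = sums / counts
maxima = np.maximum.reduceat(values, start_indices)

print(counts)  # [3 2 4]
print(means)   # [2. 5. 2.]
print(maxima)  # [3. 6. 4.]
```

One caveat with this kind of layout: `reduceat` does not return the ufunc identity for an empty segment (it returns the element at the start index instead), so functions with no observations would need separate handling.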

Describe alternatives you've considered

A higher-level API should also be exposed and used when possible.

vnmabus commented 1 year ago

After an internal discussion, and noticing that the packages Tidyfun, fundata and fdaPy already include classes for representing irregular data, we think that this class should definitely be implemented, but its name should reflect that it is a class for representing irregular data. Sparsity is a property of the actual dataset rather than of the structure.

Thus, @opintosant, you can start implementing this class, focusing on the evaluation and plot methods first.