dynamicslab / pysindy

A package for the sparse identification of nonlinear dynamical systems from data
https://pysindy.readthedocs.io/en/latest/
Other
1.46k stars 324 forks source link
dynamical-systems machine-learning model-discovery nonlinear-dynamics sparse-regression system-identification

PySINDy

|BuildCI| |RTD| |PyPI| |Codecov| |JOSS1| |JOSS2| |DOI|

PySINDy is a sparse regression package with several implementations for the Sparse Identification of Nonlinear Dynamical systems (SINDy) method introduced in Brunton et al. (2016a), including the unified optimization approach of Champion et al. (2019), SINDy with control from Brunton et al. (2016b), Trapping SINDy from Kaptanoglu et al. (2021), SINDy-PI from Kaheman et al. (2020), PDE-FIND from Rudy et al. (2017), and so on. A comprehensive literature review is given in de Silva et al. (2020) and Kaptanoglu, de Silva et al. (2021).

.. contents:: Table of contents

System identification

System identification refers to the process of leveraging measurement data to infer governing equations, in the form of dynamical systems, describing the data. Once discovered, these equations can make predictions about future states, can inform control inputs, or can enable the theoretical study using analytical techniques. Dynamical systems are a flexible, well-studied class of mathematical objects for modeling systems evolving in time. SINDy is a model discovery method which uses sparse regression to infer nonlinear dynamical systems from measurement data. The resulting models are inherently interpretable and generalizable.

How it works ^^^^^^^^^^^^ Suppose, for some physical system of interest, we have measurements of state variables x(t) (a vector of length n) at different points in time. Examples of state variables include the position, velocity, or acceleration of objects; lift, drag, or angle of attack of aerodynamic objects; and concentrations of different chemical species. If we suspect that the system could be well-modeled by a dynamical system of the form

.. code-block:: text

x'(t) = f(x(t)),

then we can use SINDy to learn f(x) from the data (x'(t) denotes the time derivative of x(t)). Note that both f(x) and x(t) are typically vectors. The fundamental assumption SINDy employs is that each component of f(x), f_i(x) can be represented as a sparse linear combination of basis functions theta_j(x)

.. code-block:: text

f_i(x) = theta_1(x) * xi_{1,i} + theta_2(x) * xi_{2,i} + ... + theta_k * xi{k,i}

Concatenating all the objects into matrices (denoted with capitalized names) helps to simplify things. To this end we place all measurements of the state variables into a data matrix X (with a row per time measurement and a column per variable), the derivatives of the state variables into a matrix X', all basis functions evaluated at all points in time into a matrix Theta(X) (each basis function gets a column), and all coefficients into a third matrix Xi (one column per state variable). The approximation problem to be solved can then be compactly written as

.. code-block:: text

X' = Theta(X) * Xi.

Each row of this matrix equation corresponds to one coordinate function of f(x). SINDy employs sparse regression techniques to find a solution Xi with sparse column vectors. For a more in-depth look at the mathematical foundations of SINDy, please see our introduction to SINDy <https://pysindy.readthedocs.io/en/latest/examples/2_introduction_to_sindy/example.html>__.

Relation to PySINDy ^^^^^^^^^^^^^^^^^^^ The PySINDy package revolves around the SINDy class which consists of three primary components; one for each term in the above matrix approximation problem.

Once a SINDy object has been created it must be fit to measurement data, similar to a scikit-learn model. It can then be used to predict derivatives given new measurements, evolve novel initial conditions forward in time, and more. PySINDy has been written to be as compatible with scikit-learn objects and methods as possible.

Example ^^^^^^^ Suppose we have measurements of the position of a particle obeying the following dynamical system at different points in time

.. code-block:: text

x' = -2x y' = y

Note that this system of differential equations decouples into two differential equations whose solutions are simply x(t) = x_0 * exp(-2 * t) and y(t) = y_0 * exp(t), where x_0 = x(0) and y_0 = y(0) are the initial conditions.

Using the initial conditions x_0 = 3 and y_0 = 0.5, we construct the data matrix X.

.. code-block:: python

import numpy as np import pysindy as ps

t = np.linspace(0, 1, 100) x = 3 np.exp(-2 t) y = 0.5 * np.exp(t) X = np.stack((x, y), axis=-1) # First column is x, second is y

To instantiate a SINDy object with the default differentiation method, feature library, and optimizer and then fit it to the data, we invoke

.. code-block:: python

model = ps.SINDy(feature_names=["x", "y"]) model.fit(X, t=t)

We use the feature_names argument so that the model prints out the correct labels for x and y. We can inspect the governing equations discovered by the model and check whether they seem reasonable with the print function.

.. code-block:: python

model.print()

which prints the following

.. code-block:: text

x' = -2.000 x y' = 1.000 y

PySINDy provides numerous other features not shown here. We recommend the feature overview <https://pysindy.readthedocs.io/en/latest/examples/1_feature_overview/example.html>__ section of the documentation for a more exhaustive summary of additional features.

Installation

Installing with pip ^^^^^^^^^^^^^^^^^^^

If you are using Linux or macOS you can install PySINDy with pip:

.. code-block:: bash

pip install pysindy

Installing from source ^^^^^^^^^^^^^^^^^^^^^^ First clone this repository:

.. code-block:: bash

git clone https://github.com/dynamicslab/pysindy.git

Then, to install the package, run

.. code-block:: bash

pip install .

If you do not have root access, you should add the --user option to the above lines.

Caveats ^^^^^^^

To run the unit tests, or example notebooks, you should install the dev-dependencies with:

.. code-block:: bash

pip install pysindy[dev]

or if you are installing from a local copy

.. code-block:: bash

pip install .[dev]

To build a local copy of the documentation, you should install the docs-dependencies with:

.. code-block:: bash

pip install pysindy[docs]

If you are looking to use convex optimization provided by cvxpy <https://github.com/cvxpy/cvxpy>__, then you have to install

.. code-block:: bash

pip install pysindy[cvxpy]

to utilize Mixed-Integer Optimized Sparse Regression (MIOSR) via GurobiPy <https://pypi.org/project/gurobipy/>__, you require

.. code-block:: bash

pip install pysindy[miosr]

Documentation

The documentation site for PySINDy can be found here <https://pysindy.readthedocs.io/en/latest/>_. There are numerous examples <https://pysindy.readthedocs.io/en/latest/examples/index.html> of PySINDy in action to help you get started. Examples are also available as Jupyter notebooks <https://github.com/dynamicslab/pysindy/tree/master/examples>. A video overview of PySINDy can be found on Youtube <https://www.youtube.com/watch?v=DvbbXX8Bd90>. We have also created a video playlist <https://www.youtube.com/playlist?list=PLN90bHJU-JLoOfEk0KyBs2qLTV7OkMZ25>__ with practical PySINDy tips.

PySINDy implements a lot of advanced functionality that may be overwhelming for new users or folks who are unfamiliar with these methods. Below (see here if image does not render https://github.com/dynamicslab/pysindy/blob/master/docs/JOSS2/Fig3.png), we provide a helpful flowchart for figuring out which methods to use, given the characteristics of your dataset:

.. image:: https://github.com/dynamicslab/pysindy/blob/master/docs/JOSS2/Fig3.png

This flow chart summarizes how PySINDy users can start with a dataset and systematically choose the proper candidate library and sparse regression optimizer that are tailored for a specific scientific task. The GeneralizedLibrary class allows for tensoring, concatenating, and otherwise combining many different candidate libraries.

Community guidelines

Contributing examples ^^^^^^^^^^^^^^^^^^^^^ We love seeing examples of PySINDy being used to solve interesting problems! If you would like to contribute an example, reach out to us by creating an issue.

At a minimum, we need to be able to run the example notebooks in the normal mode as well as in a test mode that uses smaller data in order to run faster and simply verify that cells execute without error. In order to do that, your example should obey the following directory tree

.. code-block::

.// \ |-example.py # save your notebook as a python script |-example_data.py # has functions to create/load data |-mock_data.py # has functions with same name as in example_data.py which create/load smaller datasets |-example.ipynb # run python examples/publish_notebook/ to generate this. Needs packages in requirements-dev.txt |-utils.py (Any other names example.py needs to import. Any additional local modules imported by example.py need to be submodules of utils.py, e.g. utils.plotting)

You can optimize your notebook for testing by checking __name__. When our tests run example.py they set the __name__ global to "testing". For instance, your notebook should determine whether to import from mock_data or example_data using this method (another example: you could also use this method to set max_iter). It's a bit arbitrary, but try to make your examples run in under ten seconds using the mock data. You can use our test to verify your example in testing mode:

.. code-block::

pytest -k test_external --external-notebook="path/to/"

Contributing code ^^^^^^^^^^^^^^^^^ We welcome contributions to PySINDy. To contribute a new feature please submit a pull request. To get started we recommend installing an editable dev version from a local clone via

.. code-block:: bash

pip install -e .[dev]

This will allow you to run unit tests and automatically format your code. To be accepted your code should conform to PEP8 and pass all unit tests. Code can be tested by invoking

.. code-block:: bash

pytest

We recommend using pre-commit to format your code. The easiest approach is to install pre-commit via

.. code-block:: bash

pre-commit install

After which pre-commit will automatically check all future commits. Once you have staged changes to commit

.. code-block:: bash

git add path/to/changed/file.py

Pre-commit will then automatically run all checks against your committed code. If you want to trigger this manually, you can run the following to automatically reformat your staged code

.. code-block:: bash

pre-commit

Note that you will then need to re-stage any changes pre-commit made to your code.

Building documentation requires the docs dependencies, which can be installed with either

.. code-block:: bash

pip install pysindy[docs]

or with

.. code-block:: bash

pip install .[docs]

for a local clone of the repository. Once installed, run

.. code-block:: bash

python -m sphinx -TEb html -d _build/doctrees -D language=en . ./build

Or check the build step in the most recent CI run or RTD build.

There are a number of SINDy variants and advanced functionality that would be great to implement in future releases:

  1. Bayesian SINDy, for instance that from Hirsh, Seth M., David A. Barajas-Solano, and J. Nathan Kutz. "Sparsifying Priors for Bayesian Uncertainty Quantification in Model Discovery." arXiv preprint arXiv:2107.02107 (2021).

  2. Tensor SINDy, using the methods in Gelß, Patrick, et al. "Multidimensional approximation of nonlinear dynamical systems." Journal of Computational and Nonlinear Dynamics 14.6 (2019).

  3. Stochastic SINDy, using the methods in Brückner, David B., Pierre Ronceray, and Chase P. Broedersz. "Inferring the dynamics of underdamped stochastic systems." Physical review letters 125.5 (2020): 058103.

  4. Integration of PySINDy with a Python model-predictive control (MPC) code.

  5. The PySINDy weak formulation is based on the work in Reinbold, Patrick AK, Daniel R. Gurevich, and Roman O. Grigoriev. "Using noisy or incomplete data to discover models of spatiotemporal dynamics." Physical Review E 101.1 (2020): 010203. It might be useful to additionally implement the weak formulation from Messenger, Daniel A., and David M. Bortz. "Weak SINDy for partial differential equations." Journal of Computational Physics (2021): 110525. The weak formulation in PySINDy is also fairly slow and computationally intensive, so finding ways to speed up the code would be great.

  6. The blended conditional gradients (BCG) algorithm for solving the constrained LASSO problem, Carderera, Alejandro, et al. "CINDy: Conditional gradient-based Identification of Non-linear Dynamics--Noise-robust recovery." arXiv preprint arXiv:2101.02630 (2021).

Reporting issues or bugs ^^^^^^^^^^^^^^^^^^^^^^^^ If you find a bug in the code or want to request a new feature, please open an issue.

Getting help ^^^^^^^^^^^^ For help using PySINDy please consult the documentation <https://pysindy.readthedocs.io/en/latest/> and/or our examples <https://github.com/dynamicslab/pysindy/tree/master/examples>, or create an issue.

Citing PySINDy

PySINDy has been published in the Journal of Open Source Software (JOSS). The paper can be found here <https://joss.theoj.org/papers/10.21105/joss.02104>__.

If you use PySINDy in your work, please cite it using the following two references:

Brian M. de Silva, Kathleen Champion, Markus Quade, Jean-Christophe Loiseau, J. Nathan Kutz, and Steven L. Brunton., (2020). PySINDy: A Python package for the sparse identification of nonlinear dynamical systems from data. Journal of Open Source Software, 5(49), 2104, https://doi.org/10.21105/joss.02104

Kaptanoglu et al., (2022). PySINDy: A comprehensive Python package for robust sparse system identification. Journal of Open Source Software, 7(69), 3994, https://doi.org/10.21105/joss.03994

Bibtex:

.. code-block:: text

@article{desilva2020,
doi = {10.21105/joss.02104},
url = {https://doi.org/10.21105/joss.02104},
year = {2020},
publisher = {The Open Journal},
volume = {5},
number = {49},
pages = {2104},
author = {Brian de Silva and Kathleen Champion and Markus Quade and Jean-Christophe Loiseau and J. Kutz and Steven Brunton},
title = {PySINDy: A Python package for the sparse identification of nonlinear dynamical systems from data},
journal = {Journal of Open Source Software}
}

Bibtex:

.. code-block:: text

@article{Kaptanoglu2022,
doi = {10.21105/joss.03994},
url = {https://doi.org/10.21105/joss.03994},
year = {2022},
publisher = {The Open Journal},
volume = {7},
number = {69},
pages = {3994},
author = {Alan A. Kaptanoglu and Brian M. de Silva and Urban Fasel and Kadierdan Kaheman and Andy J. Goldschmidt and Jared Callaham and Charles B. Delahunt and Zachary G. Nicolaou and Kathleen Champion and Jean-Christophe Loiseau and J. Nathan Kutz and Steven L. Brunton},
title = {PySINDy: A comprehensive Python package for robust sparse system identification},
journal = {Journal of Open Source Software}
}

References

Related packages

Contributors

This repository is a fork from original work <https://github.com/Ohjeah/sparsereg> by Markus Quade <https://github.com/Ohjeah>.

Thanks to the members of the community who have contributed to PySINDy!

+-------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------+ | billtubbs <https://github.com/kopytjuk> | Bug fix #68 <https://github.com/dynamicslab/pysindy/issues/68> | +-------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------+ | kopytjuk <https://github.com/kopytjuk> | Concatenation feature for libraries #72 <https://github.com/dynamicslab/pysindy/pull/72> | +-------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------+ | andgoldschmidt <https://github.com/andgoldschmidt> | derivative <https://derivative.readthedocs.io/en/latest/> package for numerical differentiation #85 <https://github.com/dynamicslab/pysindy/pull/85>_ | +-------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------+

.. |BuildCI| image:: https://github.com/dynamicslab/pysindy/actions/workflows/main.yml/badge.svg :target: https://github.com/dynamicslab/pysindy/actions/workflows/main.yml?query=branch%3Amaster

.. |RTD| image:: https://readthedocs.org/projects/pysindy/badge/?version=latest :target: https://pysindy.readthedocs.io/en/latest/?badge=latest :alt: Documentation Status

.. |PyPI| image:: https://badge.fury.io/py/pysindy.svg :target: https://badge.fury.io/py/pysindy

.. |Codecov| image:: https://codecov.io/gh/dynamicslab/pysindy/branch/master/graph/badge.svg :target: https://codecov.io/gh/dynamicslab/pysindy

.. |JOSS1| image:: https://joss.theoj.org/papers/82d080bbe10ac3ab4bc03fa75f07d644/status.svg :target: https://joss.theoj.org/papers/82d080bbe10ac3ab4bc03fa75f07d644

.. |JOSS2| image:: https://joss.theoj.org/papers/10.21105/joss.03994/status.svg :target: https://doi.org/10.21105/joss.03994

.. |DOI| image:: https://zenodo.org/badge/186055899.svg :target: https://zenodo.org/badge/latestdoi/186055899