pyOpenSci / software-submission

Submit your package for review by pyOpenSci here! If you have questions please post them here: https://pyopensci.discourse.group/
93 stars 36 forks source link

Solar Data Tools pre-submission inquiry #204

Closed pluflou closed 1 month ago

pluflou commented 3 months ago

Submitting Author: Name (@pluflou)
Package Name: Solar Data Tools One-Line Description of Package: Library of tools for analyzing photovoltaic power time-series data. Repository Link (if existing): https://github.com/slacgismo/solar-data-tools EiC: @cmarmo


Code of Conduct & Commitment to Maintain Package

Description

Community Partnerships

We partner with communities to support peer review with an additional layer of checks that satisfy community requirements. If your package fits into an existing community please check below:

Scope

Domain Specific


The IO module provides functions to pull data from various online sources. The main DataHandler class provides automated data extraction and cleaning of unlabeled time-series data (including cleaning up erroneous timestamps from measurement devices), provides a data quality score, and has methods to run several analyses on the data such as time shift analysis and degradation loss factor analysis. The scientific methods used in this package have been peer reviewed in other publications/conferences. See here and here.

This package is for anyone dealing with photovoltaic data, especially data with no meteorological information (unlabeled). This includes photovoltaic professionals (in private solar industry or utility companies for example), researchers and students in the solar power domain, community solar owners, and anyone with a rooftop system. The scientific goal of the package is to facilitate analysis of photovoltaic data for any system, even those that are difficult to model, and the package uses signal decomposition to achieve that.

There are two other packages that are similar in that they offer data analysis tools for solar applications: PVAnalytics and RdTools. They are both model driven, and require the user to define their own analysis. PVAnalytics focuses on preprocessing and QA, while RdTools focuses on loss factor analysis. Solar Data Tools provides both data quality and loss factor analysis, runs automatically with little to no setup, and is model-free and does not require any weather or other information. Solar Data Tools is most suited for when users want a pre-defined pipeline to get information on complex systems/sites that can't be modeled easily and that no meteorological data. A recent tutorial that was part of a virtual tutorial series on open-source tools and open-access solar data held by DOE’s Solar Technology Office in March 2024 goes over the differences in these packages and when each tool is appropriate to use. You can find the recording here and the slide deck here (see slide 16 for a summary).

P.S. Have feedback/comments about our review process? Leave a comment here

cmarmo commented 3 months ago

Dear @pluflou, thank you for your detailed submission to pyOpenSci! Solar Data Tools is definitely in scope for us: you provided relevant and very interesting references. Would you mind opening a new submission issue referencing this presubmission enquiry? Thank you.

pluflou commented 3 months ago

Dear @pluflou, thank you for your detailed submission to pyOpenSci! Solar Data Tools is definitely in scope for us: you provided relevant and very interesting references. Would you mind opening a new submission issue referencing this presubmission enquiry? Thank you.

Thank you, will do! We're just going to release a new version with some bug fixes before submitting. Should be within the month.