open-life-science / ols-2

Creative Commons Attribution Share Alike 4.0 International
11 stars 8 forks source link

sktime - A Unified Framework for ML with Time Series #12

Open mloning opened 3 years ago

mloning commented 3 years ago

Project Lead: @mloning Mentor: @martinagvilas Project repository: https://github.com/alan-turing-institute/sktime

Welcome to OLS-2! This issue will be used to track your project and progress during the program. Please use this checklist over the next few weeks as you start Open Life Science program :tada:.


Week 1 (31 August - 4 September 2020): Meet your mentor!

Before Week 2 (7 - 11 September 2020): Cohort Call (Welcome to Open Life Science!)

Before Week 3 (14 - 18 September 2020): Meet your mentor!

Before Week 4 (21 - 25 September 2020): Cohort Call (Tooling and roadmapping for Open projects)

Week 5 and later

This issue is here to help you keep track of work as you start Open Life Science program. Please refer to the OLS-2 Syllabus for more detailed weekly notes and assignments past week 4.

mloning commented 3 years ago

Vision statement

We want to enable understandable and accessible machine learning with time series by providing instructive documentation and by building a friendly, collaborative and inclusive community. The aim is to unify the time series analysis field by providing a common framework for multiple learning tasks, bringing together contributors from academia and the wider data science community into a joint framework and embedding best practices into the time series analysis field

mloning commented 3 years ago

Open canvas

https://docs.google.com/presentation/d/1Lxd9lU_7gKWdIHicLDlN1rn6ciagAqLi5AD0aR7Xg8g/edit?usp=sharing

koudyk commented 3 years ago

Hi @mloning, this looks like an exciting project! It would be very useful to have a package like scikit-learn for time series. After reading your vision statement, I feel like it could a few clarifications:

mloning commented 3 years ago

Thanks @koudyk, yes we need to state the problems of the current ecosystem first. Some key problems in our view are:

What do you think?

During OLS, I wanted to focus on three things (subject to change):

CooperSmout commented 3 years ago

Hi @mloning, sounds cool. By 'time series' do you mean 'time series neuroimaging data', or something else? Might want to clarify this for the reader, otherwise people will bring in their own biases (my background is cog neuro so this makes me instantly think of neuro data).

We want to enable understandable and accessible machine learning with time series by providing instructive documentation and by building a friendly, collaborative and inclusive community.

I also think your sentence structure might be better if you flipped it around, stating what you're building first and then what is the outcome, e.g.,

--> We want to build ... <community/documentation etc> ... to enable... <understanding/accessibility etc>

Just a thought :)

baileythegreen commented 3 years ago

@mloning Do you know of the PythonistaCafé? It's an online community of Python programmers with some very active members who do different things, including package developers. It is not free, but might be a good investment for your project.

It's probably going to be more helpful from the package development side, but I had a quick look, and there do seem to be some discussions about time series analysis, though I can't say for certain that those would be directly-relevant for you.

mloning commented 3 years ago

Hi @baileythegreen, thanks for the suggestion! I didn't know about it. Are you part of it? I'm not sure what the exact benefit would be over public discussions on stackoverflow and so on?

baileythegreen commented 3 years ago

@mloning I am a member (or I wouldn't be able to search it). Stackoverflow can be a useful resource (and has the advantage of being an open one), and I certainly use it a lot (though not only for Python stuff). The PythonistaCafé community has a membership system to help ensure exchanges are respectful and constructive, and to foster actual relationships between members. It may also mean that members are more active because they do pay something for it. There are a handful of people (really knowledgeable ones) that I know are posting and responding to stuff there like every day.

It isn't overly-expensive – $49 per quarter, so you could give it a try to see if it's a good fit.

koudyk commented 3 years ago

Thanks @koudyk, yes we need to state the problems of the current ecosystem first. Some key problems in our view are:

* many specialised smaller packages for specific modelling approaches or learning tasks which often do not integrate with more foundational libraries like scikit-learn and are often incompatible with each other

* methodology experts who develop algorithms are often not well connected to domain experts who work with time series data

What do you think?

During OLS, I wanted to focus on three things (subject to change):

* user guide (high level documentation of sktime and intro to ML with time series)

* governance document outlining community structure and decision making processes (already written and I received feedback from my mentor)

* working on ideas how to engage and integrate other package developers to work towards a more unified ecosystem (specifying common APIs, building up best practices, common ways of doing things)

It's more clear to me now! Sounds like you're going about unifying the package in a good way. Again, I'm not sure whether our vision statement is supposed to be the vision statement for the larger project, or for the portion that we're working on in OLS. So I'm not sure whether you should include your specific OLS goals in the vision statement. @malvikasharan can you clarify?

malvikasharan commented 3 years ago

@all-contributors please add @mloning for idea, content and review on other issues.

allcontributors[bot] commented 3 years ago

@malvikasharan

I've put up a pull request to add @mloning! :tada:

malvikasharan commented 3 years ago

@koudyk sorry for not seeing this earlier.

In principle, your vision statement should be for your overall project. However, it's ok to have a short-term vision if you only want to focus on what you are doing at OLS.

mloning commented 3 years ago

I've opened a draft PR for the roadmap and software management plan https://github.com/alan-turing-institute/sktime/pull/467