Open mloning opened 4 years ago
We want to enable understandable and accessible machine learning with time series by providing instructive documentation and by building a friendly, collaborative and inclusive community. The aim is to unify the time series analysis field by providing a common framework for multiple learning tasks, bringing together contributors from academia and the wider data science community into a joint framework and embedding best practices into the time series analysis field
Hi @mloning, this looks like an exciting project! It would be very useful to have a package like scikit-learn for time series. After reading your vision statement, I feel like it could a few clarifications:
Thanks @koudyk, yes we need to state the problems of the current ecosystem first. Some key problems in our view are:
What do you think?
During OLS, I wanted to focus on three things (subject to change):
Hi @mloning, sounds cool. By 'time series' do you mean 'time series neuroimaging data', or something else? Might want to clarify this for the reader, otherwise people will bring in their own biases (my background is cog neuro so this makes me instantly think of neuro data).
We want to enable understandable and accessible machine learning with time series by providing instructive documentation and by building a friendly, collaborative and inclusive community.
I also think your sentence structure might be better if you flipped it around, stating what you're building first and then what is the outcome, e.g.,
--> We want to build ... <community/documentation etc> ... to enable... <understanding/accessibility etc>
Just a thought :)
@mloning Do you know of the PythonistaCafé? It's an online community of Python programmers with some very active members who do different things, including package developers. It is not free, but might be a good investment for your project.
It's probably going to be more helpful from the package development side, but I had a quick look, and there do seem to be some discussions about time series analysis, though I can't say for certain that those would be directly-relevant for you.
Hi @baileythegreen, thanks for the suggestion! I didn't know about it. Are you part of it? I'm not sure what the exact benefit would be over public discussions on stackoverflow and so on?
@mloning I am a member (or I wouldn't be able to search it). Stackoverflow can be a useful resource (and has the advantage of being an open one), and I certainly use it a lot (though not only for Python stuff). The PythonistaCafé community has a membership system to help ensure exchanges are respectful and constructive, and to foster actual relationships between members. It may also mean that members are more active because they do pay something for it. There are a handful of people (really knowledgeable ones) that I know are posting and responding to stuff there like every day.
It isn't overly-expensive – $49 per quarter, so you could give it a try to see if it's a good fit.
Thanks @koudyk, yes we need to state the problems of the current ecosystem first. Some key problems in our view are:
* many specialised smaller packages for specific modelling approaches or learning tasks which often do not integrate with more foundational libraries like scikit-learn and are often incompatible with each other * methodology experts who develop algorithms are often not well connected to domain experts who work with time series data
What do you think?
During OLS, I wanted to focus on three things (subject to change):
* user guide (high level documentation of sktime and intro to ML with time series) * governance document outlining community structure and decision making processes (already written and I received feedback from my mentor) * working on ideas how to engage and integrate other package developers to work towards a more unified ecosystem (specifying common APIs, building up best practices, common ways of doing things)
It's more clear to me now! Sounds like you're going about unifying the package in a good way. Again, I'm not sure whether our vision statement is supposed to be the vision statement for the larger project, or for the portion that we're working on in OLS. So I'm not sure whether you should include your specific OLS goals in the vision statement. @malvikasharan can you clarify?
@all-contributors please add @mloning for idea, content and review on other issues.
@malvikasharan
I've put up a pull request to add @mloning! :tada:
@koudyk sorry for not seeing this earlier.
In principle, your vision statement should be for your overall project. However, it's ok to have a short-term vision if you only want to focus on what you are doing at OLS.
I've opened a draft PR for the roadmap and software management plan https://github.com/alan-turing-institute/sktime/pull/467
Project Lead: @mloning Mentor: @martinagvilas Project repository: https://github.com/alan-turing-institute/sktime
Welcome to OLS-2! This issue will be used to track your project and progress during the program. Please use this checklist over the next few weeks as you start Open Life Science program :tada:.
Week 1 (31 August - 4 September 2020): Meet your mentor!
Before Week 2 (7 - 11 September 2020): Cohort Call (Welcome to Open Life Science!)
Before Week 3 (14 - 18 September 2020): Meet your mentor!
Before Week 4 (21 - 25 September 2020): Cohort Call (Tooling and roadmapping for Open projects)
Week 5 and later
README.md
file, or landing page, for your projectLICENSE.md
CODE_OF_CONDUCT.md
This issue is here to help you keep track of work as you start Open Life Science program. Please refer to the OLS-2 Syllabus for more detailed weekly notes and assignments past week 4.