metaNanoPype: a reproducible Nanopore python pipeline for metabarcoding

antonioggsousa commented 3 years ago

Project Lead:

António Sousa

Mentor:

Hans-Rudolf Hotz

Welcome to OLS-3! This issue will be used to track your project and progress during the program. Please use this checklist over the next few weeks as you start the Open Life Science program :tada:.

Week 1 (week starting 8 February 2021): Meet your mentor!

[x] Create an account on GitHub
[x] Check if you have access to the HackMD notes set up for your meetings with your mentor
[x] Prepare to meet your mentor(s) by completing a short homework provided in the HackMD notes
[x] Complete your own copy of the open leadership self-assessment and share it with your mentor. If you're a group, each teammate should complete this assessment individually. This is here to help you set your own personal goals during the program. No need to share your results, but be ready to share your thoughts with your mentor.
[x] Make sure you know when and how you'll be meeting with your mentor.

Before Week 2 (week starting 15 February 2021): Cohort Call (Welcome to Open Life Science!)

[x] Create an issue on the OLS-3 GitHub repository for your OLS work and share the link with your mentor.
[x] Draft a brief vision statement using your goals

This lesson from the Open Leadership Training Series (OLTS) might be helpful
[x] Leave a comment on this issue with your draft vision statement & be ready to share this on the call
[x] Check the Syllabus for notes and connection info for all the cohort calls.

Before Week 3 (week starting 22 February 2021): Meet your mentor!

[ ] Look up two other projects and comment on their issues with feedback on their vision statement
[x] Complete this compare and contrast assignment about current and desired community interactions and value exchanges
[x] Complete your Open Canvas (instructions, canvas)
[ ] Share a link to your Open Canvas in your GitHub issue
[ ] Start your Roadmap
[ ] Comment on your issue with your draft Roadmap
[x] Suggest a cohort name at the bottom of the shared notes and vote on your favorite with a +1

Before Week 4 (week starting 1 March 2021): Cohort Call (Tooling and roadmapping for Open projects)

[ ] Look up two other projects and comment on their issues with feedback on their open canvas.

Week 5 and later

[x] Create a GitHub repository for your project
[ ] Add the link to your repository in your issue
[ ] Use your canvas to start writing a README.md file, or landing page, for your project
[ ] Link to your README in a comment on this issue
[ ] Add an open license to your repository as a file called LICENSE.md
[ ] Add a Code of Conduct to your repository as a file called CODE_OF_CONDUCT.md
[ ] Invite new contributors into your work!

This issue is here to help you keep track of work as you start the Open Life Science program. Please refer to the OLS-3 Syllabus for more detailed weekly notes and assignments past week 4.

antonioggsousa commented 3 years ago

Vision statement

We are trying to implement a flexible and reproducible pipeline for Nanopore metabarcoding data analysis. We aim to achieve this by applying different tools under the same framework and producing a report that records all the information needed to reproduce the analysis. We think this project can grow openly by being deployed through GitHub, where the community can contribute by opening new issues to request features or suggest ideas. More advanced users could also contribute code through their own pull requests. We hope that this engagement with the community will make the project more useful, up to date and interesting.

katoss commented 3 years ago

Hey @antonioggsousa, I just read your vision statement and it sounds like you already have a good overview of the process and how to publish the pipeline and make it open for contributors. As someone who is not working in bioinformatics, I am wondering what your motivations are for building this pipeline, and what kinds of target groups you would like to address as contributors. That could be something to add to the vision statement. Apart from that, I find it clear and understandable :)

malvikasharan commented 3 years ago

@all-contributors please add @antonioggsousa for content

allcontributors[bot] commented 3 years ago

@malvikasharan

I've put up a pull request to add @antonioggsousa! :tada:

antonioggsousa commented 3 years ago

Hi @katoss, thank you for your feedback. I really appreciate it. As someone who works in bioinformatics and has come across this type of data a few times, I had a hard time finding a way to process it, because there was no user-friendly framework or pipeline integrating the several steps needed to take the data from quality checking through to annotation and diversity analyses. This also poses a problem for reproducibility. So I would say that the motivation behind this project is to make my life easier (and perhaps the lives of others too). The target users are anyone with some experience of command-line tools who needs to process Nanopore metabarcoding data.

Following your good suggestion, I have updated my vision statement below:

Vision statement

We are trying to implement a flexible and reproducible pipeline for Nanopore metabarcoding data analysis in order to make the whole process more user-friendly for the community of researchers analyzing this type of data. We aim to achieve this by applying different tools under the same framework and producing a report that records all the information needed to reproduce the analysis. We think this project can grow openly by being deployed through GitHub, where the community can contribute by opening new issues to request features or suggest ideas. More advanced users could also contribute code through their own pull requests. We hope that this engagement with the community will make the project more useful, up to date and interesting.

katoss commented 3 years ago

Hi @antonioggsousa, thank you for the explanations! I like the new version of the vision statement. The phrase on motivations makes it a lot easier to put into context for lay-persons :)

cemonks commented 3 years ago

Hi @antonioggsousa! I agree with @katoss's feedback and think this looks really clear (I'm also from way outside of this field but I understand what you're doing!). My only suggestion would be to remember the words of Yoda: 'Do or do not, there is no try'. Your project seems ambitious and exciting, and you could maybe reduce the length and increase the motivational aspect of your vision statement by removing the qualifying statements ("We are trying to", "We aim to achieve this by", "We think this project will", "We hope that", etc)?

