tira-io / ir-experiment-platform

28 stars 3 forks source link

documentation issues #1

Closed MangoIV closed 1 year ago

MangoIV commented 1 year ago

When trying to use this tutorial, I encountered several issues:

mam10eks commented 1 year ago

Dear @MangoIV,

Thank you for bringing up the issue. The dependency used here is ir_datasets. Documentation in research software is always a bit hard because things change fast.

We used this tutorial now for the very first time with bachelor and master students, and I think the feedback, questions, and problems that we saw will help us to improve this tutorial and its documentation further. We tried to encourage questions and discussions, and I think this worked quite well for the first iteration of the IR lab.

For future iterations of the IR lab, I think I learned that we should already provide the complete ir_datasets integration, so that only specialized needs have to adopt the ir_datasets registrations and most use cases are already covered by the template that we give. In that scenario, the goal for the first milestone (dataset creation) is to implement a preprocessing script/program that outputs a file, which should be much simpler.

For your questions:

What do you think about the idea that I described above? Should still allow all possibilities, but should substantially simplify everything, or?

Best regards,

Maik

MangoIV commented 1 year ago

Hi, thank you for your comprehensive answer. This is a great help.

I don't think anything needs to be simplified. I still feel like it would be good to leave a comment of intended use of certain things, ie I couldn't immediately figure out that for local file we want that packagesmth class and then searched for something that sounded right encountering new issues (there's a RelPath class which takes a "streamer" which is documented nowhere, so what class is expected there?)

So as a conclusion perhaps if the software is used by some third party like in this case, it would be good to leave a docstring or a type signature (or annotation in Python?) which typically don't tend to get stale.

This is perhaps also true for the ir_datasets dep.

Please mind that there are several people who encountered this issue, so I'm really not the only one.

Thanks again for the comprehensive and fast answer.

mam10eks commented 1 year ago

I now improved the documentation, and we also have created a dedicated example for the dataset used in the IR lab: https://github.com/mam10eks/ir-lab-sose-2023/tree/main/milestone-01

I think in future iterations of the IR lab, we will directly give a dedicated template as the one linked above, because this than allows to put more focus on the interesting parts and not technical details.

Thanks again for bringing up the issue,

Best regards,

Maik