Researchstudio-DSc / kodicare_framework

IR framework for Kodicare project
0 stars 0 forks source link

task2-1_v2 apply BoW and TF-IDF on cord19 dtc #24

Open Alaa-Ebshihy opened 1 year ago

Alaa-Ebshihy commented 1 year ago

Use the document representation implementation to generated document representations for the dynamic test collections of the CORD 19 data set

put on mind also the calculation of the KD using these representations

Alaa-Ebshihy commented 1 year ago

First I need to apply on document collection and explore the effect using BoW This can be done by manually creating a correlation matrix between words using the vocab

Alaa-Ebshihy commented 1 year ago

After an experiment the correlation matrix is the same color

From noticing the vocab there are plenty of words which need to be cut out, there are two options to exploer:

Alaa-Ebshihy commented 1 year ago

Done on Friday 10.03.2023

Next to do

Alaa-Ebshihy commented 1 year ago

Done on Friday 17.03.2023

Alaa-Ebshihy commented 1 year ago

TODO as of 20.03.2023

Alaa-Ebshihy commented 1 year ago

Update as of 21.03.2023

TODO

Alaa-Ebshihy commented 1 year ago

Update as of 22.03.2023

TODO

Alaa-Ebshihy commented 1 year ago

Reopen: some implementation need to be cleaned from the representation folder