nih-cfde / dashboard

Files and documentation related to the CFDE project dashboards

Counting in review plot seems off by a lot #112

Closed ACharbonneau closed 3 years ago

ACharbonneau commented 3 years ago

I understand that we still have the double counting issue, and that's fine. But this seems really weird. This data set only has 5 subjects.

image

karlcz commented 3 years ago

I think this is multi-counting: the pre-processed level1_stats reflect combinations of single terms from all the different vocab "dimensions" at once. There are actually three core_fact combinations for 3 subjects, each combining multiple terms across multiple vocabs, and these expand into many single-term-per-vocab combinations, so the same subjects show up in the count many times over.
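
To illustrate the kind of expansion described above, here is a minimal sketch in Python. It is not the dashboard's actual code: the vocab names (`anatomy`, `assay_type`), the terms, and the row layout are all invented for illustration, and only the idea of expanding multi-term combinations into single-term-per-vocab combinations comes from the explanation above.

```python
from itertools import product

# Hypothetical core_fact-style rows (names and terms are made up).
# Each row combines several terms per vocab and covers one subject.
core_facts = [
    {"num_subjects": 1, "anatomy": ["A1", "A2"], "assay_type": ["T1", "T2", "T3"]},
    {"num_subjects": 1, "anatomy": ["A1", "A3"], "assay_type": ["T1", "T2"]},
    {"num_subjects": 1, "anatomy": ["A2"], "assay_type": ["T2", "T3"]},
]


def expand(fact):
    """Yield one (single-term-per-vocab combination, subject count) pair
    for every way of picking one term from each vocab in the row."""
    vocabs = [key for key in fact if key != "num_subjects"]
    for combo in product(*(fact[vocab] for vocab in vocabs)):
        yield dict(zip(vocabs, combo)), fact["num_subjects"]


# Subjects counted once per original combination.
true_total = sum(fact["num_subjects"] for fact in core_facts)

# Subjects counted once per expanded single-term combination.
expanded = [row for fact in core_facts for row in expand(fact)]
naive_total = sum(count for _, count in expanded)

print(true_total, len(expanded), naive_total)
```

In this toy data, 3 subjects become 12 expanded rows (6 + 4 + 2), so summing over the expanded rows reports 12 rather than 3. The inflation grows multiplicatively with the number of terms per vocab and the number of vocabs.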

ACharbonneau commented 3 years ago

KK. I'll stand down then :)