hypothesis / product-backlog

Where new feature ideas and current bugs for the Hypothesis product live
117 stars 7 forks source link

LMS admin usage report may be missing some courses #1546

Open mkdir-washington-edu opened 4 days ago

mkdir-washington-edu commented 4 days ago

University College London ORG ID: us.lms.org.CROs5T7nS72IqISlCi0rEQ

Generating a usage report for the ORG for dates 2023-07-01 through today, and then counting unique users in a spreadsheet, gives 460 users. See "MD work 3" tab in this Sheet.

But I noticed that, when looking at all of the application instances for this org, I got different sets of groups when looking on the grouping table vs the group_info table. See MD work 1 tab.

There are 35 total authority_provided_ids tied to the AIs associated with the ORG. 30 of them are on the group_info table. 27 of them are on the grouping table. Combining and de-duping the authority_provided_ids, and using those to get H groups with annotations made from 2023-07-01 through today, gives 1,487 users.

mkdir-washington-edu commented 4 days ago

This is a Moodle course. I wonder if they've been using the "course reset" option, which re-uses the same LMS course year after year and puts new students in it.

That itself might be the cause of these numbers.


In fact, some of the differences here do seem to be due to a student making new annotations in an old course. Assuming we don't count that old course, I think the current ORG reports make sense; it's probably a good thing they ignore the scenario where a lone student made new annotations in an old course with a high number of students in it.

See: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1719779730810179?thread_ts=1719509850.784029&cid=C2BLQDKHA

marcospri commented 1 day ago

Further digging:

I think it should be safe to ignore "group_info" for dates after 2023.

Those don't have any recent annotations.

There doesn't seem to be any annotations made after march, so it shouldn't count for the 2023-07 report

marcospri commented 1 day ago

The main difference in unique user counts is the source of the users, H vs LMS.

In H we can see 'acct:002d464a2f9cb0058f0bf7c9eb334f@lms.hypothes.is' as a returned user for the count.

This user belong to 510a9b39317d8309977b0678a74a3a218e30beef in H but we don't have any record of him in LMS.

Not in the "user" table and of course not in grouping_membership or events.

Should this student be counted? I'd argue it shouldn't.

510a9b39317d8309977b0678a74a3a218e30beef was created in the summer of 2021. Someone made an annotation there and we brought all the students the ever belonged to that course even before we did any user recording in LMS.

It would be tricky to present the information about this user in the LMS context, we won't display him in the dashboards, we can't track his launches... I reckon this type of user is the one more likely to cause questions about our numbers and ticker type to answer.