ec2u / data

EC2U Knowledge Hub
https://data.ec2u.eu
Apache License 2.0
2 stars 0 forks source link

Differences in MMT and Hub event data #31

Closed hmaskat17 closed 1 year ago

hmaskat17 commented 2 years ago

Steps to reproduce

  1. Go to MMT and search for "Science Battle" (screenshot below)
  2. Go to KH interface and search for "Science Battle" > https://data.ec2u.eu/events/?%7B%22~label%22:%22science%20battle%22%7D

What did you expect to happen? There should be the same number of events for MMT and KH

What did actually happen? MMT has several more events that KH

Would you share any observation or additional context about the bug? Example > Science Battle:

MMT imported seven Science Battle events:

All of the above have a unique ID field in MMT even though they are the same events of a group of 3 individual events. So all three events imported July 16th are found in KH - but these are duplicates of the June 2nd and April 20th events in MMT.

The question is why do the events imported July 16th have new unique ID's compared to the ones imported June 2nd?

I checked and found the events from July 16th have updated source URLs.

Another example with the same problem is Workshop: Vorbereitung auf Jena (Deutsch) > https://data.ec2u.eu/events/725a6dd33c549950919d5c1721d0f7cd

Screenshots

From MMT app: image

@ec2u/mmt

hmaskat17 commented 1 year ago

Just to recap, it seems new ID's were created for some Jena events and is why MMT imported them as new events and the app is showing duplicates.

Any ideas on how this can be solved so that in the future these events with new ID's do not show up in MMT. Can this be easily fixed in Hub side or should we try fixing this in MMT side?

@ec2u/mmt

knoan commented 1 year ago

We're reviewing the matter.

We update the id generation rules around mid May and that could account for some inconsistencies/leftovers.

However, events published on multiple calendars would be expected to collapse, which apparently is not the case.

Back to you as soon as we have a fix.

knoan commented 1 year ago

I reviewed the matter on KH side; the only relevant change was an upgrade to the id generation algorithm on 2022-05-19:

The new logic prevents ids from changing on event rescheduling and collapses events published on multiple calendars with a unique external URL.

That said, I don't think we have active issues on KH:

The same reasoning applies as well to the additional example.

My suggestion is that we manually remove MMT leftovers and then keep the situation monitored for similar future issues.

On our side, we'll keep you posted should we upgrade id generation logic on other sources.