CredentialEngine / CredentialRegistry

Repository for development of the Credential Registry
Apache License 2.0

Call to resources/check_existence returns duplicate CTIDs #674

Closed: mparsons-ce closed this issue 5 months ago

mparsons-ce commented 9 months ago

@excelsior We have a collection of course CTIDs being uploaded. Part of the upload step is to check whether the CTIDs exist in the registry using .../resources/check_existence. A list of 6573 CTIDs was sent, but 6575 were returned! I checked the returned list and found duplicates for the following CTIDs:

I am not sure how this can happen. Is it possible that somehow there are actual duplicate resources for these CTIDs?
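
A quick way to pinpoint which CTIDs come back more than once is to count occurrences in the returned list. Below is a minimal Python sketch, assuming the response has already been parsed into a plain list of CTID strings. The function name `find_duplicate_ctids` and the sample values are hypothetical and not part of the registry API.

```python
from collections import Counter


def find_duplicate_ctids(returned_ctids):
    """Return a dict mapping each CTID that appears more than once to its count."""
    counts = Counter(returned_ctids)
    return {ctid: n for ctid, n in counts.items() if n > 1}


# Hypothetical sample: three distinct CTIDs sent, four entries returned.
returned = ["ce-aaa", "ce-bbb", "ce-bbb", "ce-ccc"]
duplicates = find_duplicate_ctids(returned)
print(duplicates)
```

Comparing `len(returned)` with `len(set(returned))` gives the same signal more cheaply when only the presence of duplicates matters, not which CTIDs are affected.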

excelsior commented 9 months ago

@mparsons-ce Yes, I found 8 duplicates in total.

               resource                 |                                     envelopes
----------------------------------------+---------------------------------------------------------------------------------
ce-0310ef55-3763-4bab-b542-c9482b81b95f | ce-0310ef55-3763-4bab-b542-c9482b81b95f, ce-0310ef55-3763-4bab-b542-c9482b81b95f
ce-17d99fc5-ca8a-45b3-b4ef-b3d78efb625f | ce-17d99fc5-ca8a-45b3-b4ef-b3d78efb625f, ce-17d99fc5-ca8a-45b3-b4ef-b3d78efb625f
ce-1ee35d87-9194-4c44-89b4-e7a169f3e1d5 | ce-1ee35d87-9194-4c44-89b4-e7a169f3e1d5, ce-1ee35d87-9194-4c44-89b4-e7a169f3e1d5
ce-660e8df6-3699-4a3d-a3df-ef0d1f983ceb | ce-660e8df6-3699-4a3d-a3df-ef0d1f983ceb, ce-660e8df6-3699-4a3d-a3df-ef0d1f983ceb
ce-6fca72e1-f283-4a7f-9f0b-c6796f04a652 | ce-6fca72e1-f283-4a7f-9f0b-c6796f04a652, ce-6fca72e1-f283-4a7f-9f0b-c6796f04a652
ce-a2e9c853-d068-4fcb-b5bc-0233bd2dbce5 | ce-a2e9c853-d068-4fcb-b5bc-0233bd2dbce5, ce-a2e9c853-d068-4fcb-b5bc-0233bd2dbce5
ce-a9c3c797-cb67-4148-af8e-8aa3eee05d5e | ce-a9c3c797-cb67-4148-af8e-8aa3eee05d5e, ce-a9c3c797-cb67-4148-af8e-8aa3eee05d5e
ce-ed12a61b-f82a-4962-bf36-aeb0a12bbea8 | ce-ed12a61b-f82a-4962-bf36-aeb0a12bbea8, ce-ed12a61b-f82a-4962-bf36-aeb0a12bbea8

mparsons-ce commented 8 months ago

As previously noted, the CTIDs were not duplicated across different resources; rather, the same resource was published a few times. https://docs.google.com/spreadsheets/d/1pqZUBfwP9ZIYcrvFUbGXwbx-z5vlNCowBnTcUyd_7_Y/edit#gid=0

excelsior commented 7 months ago

@mparsons-ce The only true duplicates appeared due to a race condition. I deleted those and am working on preventing such cases.
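
For readers unfamiliar with how such a race produces duplicates: if two publish requests for the same CTID each check for an existing row before either has inserted, both checks pass and both insert. One common safeguard is a database-level unique constraint combined with an upsert, so the second write updates the row rather than duplicating it. The sketch below illustrates the idea with Python's `sqlite3` (the upsert syntax needs SQLite 3.24 or newer). The `envelopes` table, its columns, and the `publish` function are hypothetical; this is not the registry's actual schema or the fix applied here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical schema: the UNIQUE constraint makes the database itself
# reject a second row for the same CTID, regardless of request timing.
conn.execute(
    "CREATE TABLE envelopes (ctid TEXT NOT NULL UNIQUE, payload TEXT NOT NULL)"
)


def publish(conn, ctid, payload):
    """Insert the envelope, or overwrite the payload if the CTID already exists."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO envelopes (ctid, payload) VALUES (?, ?) "
            "ON CONFLICT(ctid) DO UPDATE SET payload = excluded.payload",
            (ctid, payload),
        )


# Publishing the same CTID twice leaves a single row with the latest payload.
publish(conn, "ce-example", "v1")
publish(conn, "ce-example", "v2")
rows = conn.execute("SELECT ctid, payload FROM envelopes").fetchall()
print(rows)
```

The point of pushing the check into the constraint is that an application-level "look up, then insert" sequence is not atomic, whereas the constraint is enforced by the database at write time.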

mparsons-ce commented 5 months ago

This issue has not recurred, and Alex has implemented a deduplication process.
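
The deduplication process itself is not described in this thread. Purely as an illustration, one simple approach is to keep a single envelope per CTID and prefer the most recently updated one. In the Python sketch below, the record shape (`ctid`, `updated_at`) and the function `deduplicate` are assumptions made for the example.

```python
def deduplicate(envelopes):
    """Keep one envelope per CTID, preferring the latest `updated_at`."""
    latest = {}
    for env in envelopes:
        current = latest.get(env["ctid"])
        # ISO 8601 date strings compare correctly as plain strings.
        if current is None or env["updated_at"] > current["updated_at"]:
            latest[env["ctid"]] = env
    return list(latest.values())


# Hypothetical records: "ce-a" was published twice.
envelopes = [
    {"ctid": "ce-a", "updated_at": "2023-01-01"},
    {"ctid": "ce-b", "updated_at": "2023-01-02"},
    {"ctid": "ce-a", "updated_at": "2023-01-03"},
]
unique = deduplicate(envelopes)
print(unique)
```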