zdk123 closed 2 years ago
@zdk123 nice that you can make good use of the package! Thanks for spotting this!
I think we should just raise a more informative exception if duplicate inchikeys are passed.
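A minimal sketch of what such a check could look like, assuming the inchikeys are available as an iterable of strings (for example the index labels of the score DataFrame). The function name `check_unique_inchikeys` and the message wording are hypothetical and not part of ms2deepscore:

```python
from collections import Counter


def check_unique_inchikeys(inchikeys):
    """Raise ValueError naming every inchikey that occurs more than once.

    Hypothetical helper for illustration only. `inchikeys` is any iterable
    of strings, e.g. the index labels of the Tanimoto score DataFrame.
    """
    counts = Counter(inchikeys)
    # Collect every key that appears more than once, sorted for a stable message.
    duplicates = sorted(key for key, n in counts.items() if n > 1)
    if duplicates:
        raise ValueError(
            f"Duplicate inchikeys found in the reference scores: {duplicates}. "
            "Each inchikey must appear only once."
        )
```

Calling this before any training data is generated would surface the offending keys directly instead of failing later with an unrelated-looking error.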
If you want, @zdk123, you could try to implement this yourself? See https://github.com/matchms/ms2deepscore/blob/main/CONTRIBUTING.md for more information on how to contribute.
sure, I can take a stab at that!
Cool!
@zdk123 thank you for your awesome contribution! Let me know if you are interested in becoming a collaborator :)
@svenvanderburg sure, I would be happy to!
Hi @zdk123, thanks for making this issue and welcome on board 😃. I just invited you to be a collaborator on this project and gave you write access (e.g. to start pull requests). Let me know if you need any help.
Thanks!
Thanks for this cool package!
I just wanted to report a bug I tripped over while attempting to retrain the Siamese network. In hindsight it's obvious that the user shouldn't pass duplicate inchikeys in the Tanimoto score DataFrame, but right now this triggers a rather uninformative error:
And even after de-duplicating the full keys, if there are still duplicated inchikey14s, the error that gets triggered is:
I can provide an example if needed - you could also consider deduplicating (with a warning?) during the data_generator construction.
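To illustrate the deduplicate-with-a-warning idea, here is a rough sketch, assuming an inchikey14 is simply the first 14 characters of the full inchikey and that keeping the first occurrence is acceptable. The function name `first_per_inchikey14` is made up for this example and is not an ms2deepscore function:

```python
import warnings


def first_per_inchikey14(inchikeys):
    """Return the positions of the first entry seen for each inchikey14.

    Hypothetical helper for illustration only. Later entries sharing the
    same first 14 characters are dropped and reported in a UserWarning.
    """
    seen = set()
    keep = []
    dropped = []
    for position, key in enumerate(inchikeys):
        key14 = key[:14]  # assumption: inchikey14 = first 14 characters
        if key14 in seen:
            dropped.append(key)
        else:
            seen.add(key14)
            keep.append(position)
    if dropped:
        warnings.warn(
            f"Dropped {len(dropped)} entries with a duplicated inchikey14: {dropped}",
            UserWarning,
        )
    return keep
```

For a square score matrix stored in a pandas DataFrame, the returned positions could then be applied to both axes, e.g. `scores.iloc[keep, keep]`. Whether silently keeping the first entry is the right policy is a design choice; raising an error instead would be the stricter alternative.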