Checklist for sustainable code base
[ ] I added tests for any code changed/added
[ ] I added documentation for any code changed/added
[ ] I made sure naming of any new functions is self-explanatory and consistent
Description
Improve performance of 'map_df' with vectorized logic for exact matches
What changed?
refactor: improve performance of 'map_df' with vectorized logic for exact matches
Related JIRA issue: https://jira.gene.com/jira/browse/GDR-1888
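For reviewers, here is a minimal sketch of the idea behind the refactor. It is not the actual `map_df` code. The function names, the `keys` argument, and the `_ref_pos` helper column are hypothetical and chosen only for illustration. It assumes exact matching means equality on a set of key columns, and that the first matching reference row wins.

```python
import pandas as pd


def map_exact_loop(df: pd.DataFrame, ref: pd.DataFrame, keys: list) -> list:
    """Baseline (hypothetical): scan all of `ref` once per row of `df`."""
    # Each ref row becomes (position, key1, key2, ...).
    ref_rows = [(pos, *vals) for pos, vals in
                enumerate(ref[keys].itertuples(index=False, name=None))]
    out = []
    for row in df[keys].itertuples(index=False, name=None):
        # First ref row whose key values all equal this row's key values.
        out.append(next((r[0] for r in ref_rows if r[1:] == row), None))
    return out


def map_exact_vectorized(df: pd.DataFrame, ref: pd.DataFrame, keys: list) -> list:
    """Vectorized (hypothetical): resolve all exact matches with one left join."""
    # Keep the first ref row per key combination so the join is many-to-one
    # and the output has exactly one entry per row of `df`.
    lookup = (
        ref[keys]
        .assign(_ref_pos=range(len(ref)))
        .drop_duplicates(subset=keys, keep="first")
    )
    merged = df[keys].merge(lookup, on=keys, how="left", validate="many_to_one")
    # Unmatched rows come back as NaN; report them as None.
    return [None if pd.isna(v) else int(v) for v in merged["_ref_pos"]]


ref = pd.DataFrame({"drug": ["A", "B", "A", "C"], "conc": [1, 1, 2, 1]})
df = pd.DataFrame({"drug": ["A", "C", "D", "A"], "conc": [2, 1, 1, 1]})

print(map_exact_vectorized(df, ref, ["drug", "conc"]))  # [2, 3, None, 0]
```

The loop does work proportional to `len(df) * len(ref)`, while the join hashes the key columns once. That difference is the kind of gap that can turn minutes into seconds on large inputs.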
Why was it changed?
`map_df` was too slow. The problem was spotted on the data from Jeremiah, where a single run took about 10 minutes. With the improved version, the same run takes a few seconds.
Logistic checklist
Screenshots (optional)