cancerDHC / umls-rrf-scala

A very basic library for parsing files in the UMLS RRF format
MIT License

Write out in SSSOM TSV #8

gaurav closed 4 years ago

gaurav commented 4 years ago

This PR modifies the output of CodeMapper to produce mappings in the SSSOM TSV format.
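
As a rough illustration of what SSSOM TSV output can look like, here is a minimal sketch. The `Mapping` case class and the `SssomTsv` object are hypothetical names, not the code in this PR, and the five columns are an assumed subset of the SSSOM column names; the columns CodeMapper actually writes are not listed here.

```scala
// Hypothetical mapping record; fields mirror a few SSSOM column names.
// This is an illustration, not CodeMapper's actual data model.
final case class Mapping(
  subjectId: String,
  subjectLabel: String,
  predicateId: String,
  objectId: String,
  objectLabel: String
)

object SssomTsv {
  // Assumed column subset, in output order.
  val columns: Seq[String] =
    Seq("subject_id", "subject_label", "predicate_id", "object_id", "object_label")

  // Tabs and line breaks inside a value would break the row layout,
  // so collapse each run of them into a single space.
  private def clean(value: String): String =
    value.replaceAll("[\\t\\r\\n]+", " ")

  // One tab-separated row per mapping.
  def row(m: Mapping): String =
    Seq(m.subjectId, m.subjectLabel, m.predicateId, m.objectId, m.objectLabel)
      .map(clean)
      .mkString("\t")

  // Header row followed by one row per mapping.
  def render(mappings: Seq[Mapping]): String =
    (columns.mkString("\t") +: mappings.map(row)).mkString("\n")
}
```

Calling `SssomTsv.render(mappings)` returns the whole table as a single string, which can then be written to a file or to standard output.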

It also overhauls the CodeMapper controller code and the HalfMap loading code. In particular, the HalfMap loading code now uses a ScalaCache cache so that each HalfMap is loaded only once rather than reloaded on every use. A run over the 7432 SNOMED IDs included in DICOM currently takes around an hour; this should get faster once I do more profiling and, if it becomes necessary, replace the current system with a stream-based interface.
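
The idea behind caching HalfMaps can be sketched with the standard library alone. This is not the ScalaCache-based code in this PR: it swaps in a plain `TrieMap` to show the load-once behaviour, and `HalfMap`, `HalfMapCache`, and the `load` function are hypothetical stand-ins.

```scala
import scala.collection.concurrent.TrieMap

// Hypothetical stand-in for a loaded HalfMap: codes from one source,
// each mapped to a list of concept identifiers.
final case class HalfMap(source: String, entries: Map[String, Seq[String]])

// Assumption: `load` is the expensive step (for example, scanning RRF files).
class HalfMapCache(load: String => HalfMap) {
  private val cache = TrieMap.empty[String, HalfMap]

  // Returns the cached HalfMap for `source`, loading it on first request.
  // Under concurrent access `load` may run more than once for the same key,
  // but only one result is stored and returned to all callers.
  def get(source: String): HalfMap =
    cache.getOrElseUpdate(source, load(source))
}
```

Repeated calls to `get` with the same source return the stored HalfMap without invoking `load` again. Unlike a ScalaCache-backed cache, this sketch has no expiry or size limit.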

Should be merged after PR #9.