As of release 6.8.0, the zipped version of the underthesea codebase measures 73.9MB, which is excessively substantial. Aiming to optimize, the goal is to reduce the size to approximately 10MB.
Proposed Strategies:
[ ] Sub-repository Allocation for Datasets
Migrate each dataset to individual sub-repositories to decentralize the storage and manage the codebase efficiently.
[ ] Eliminate Storage of Binary Models
Avoid the incorporation of binary models within the codebase. For reference, binary models are currently stored here:models/ws_crf_vlsp2013_20230727
[ ] Code Refactoring
Undertake a comprehensive refactoring of the code to improve its structure, readability, and maintainability, which can also contribute to reducing the overall size of the codebase.
As of release 6.8.0, the zipped version of the underthesea codebase measures
73.9MB
, which is excessively substantial. Aiming to optimize, the goal is to reduce the size to approximately10MB
.Proposed Strategies:
[ ] Sub-repository Allocation for
Datasets
Migrate each dataset to individual sub-repositories to decentralize the storage and manage the codebase efficiently.[ ] Eliminate Storage of Binary Models Avoid the incorporation of binary models within the codebase. For reference, binary models are currently stored here:models/ws_crf_vlsp2013_20230727
[ ] Code Refactoring Undertake a comprehensive refactoring of the code to improve its structure, readability, and maintainability, which can also contribute to reducing the overall size of the codebase.