os-climate / aicoe-osc-demo

This repository is the central location for the demos the ET data science team is developing within the OS-Climate project. This demo shows how to use the tools provided by Open Data Hub (ODH) running on the Operate First cluster to perform ETL, create training and inference pipelines.
Apache License 2.0
10 stars 24 forks source link

[EPIC] Sparsification task for relevance text classification #215

Open Shreyanand opened 1 year ago

Shreyanand commented 1 year ago

The current relevance and kpi extraction models are huge (0.3gb and 1.3gb respectively) and they take around a total of 15mins to infer for a pdf. We want to find smaller version of the models that get similar performance but using a smaller and faster model.

Overall, we want to investigate model pruning and test the effects of tools such as NeuralMagic to measure the performance impact of different levels of pruning. This EPIC is the first step for this overall goal.