Open 29swastik opened 2 months ago
Not sure if you figured this out already, but let me drop a comment in case someone else has the same problem.
It's very important to think about what your model is working on - which is essentially a vector representation of your text (job title) that we then transform into a prediction using a prediction model (head). I had to look up SDET, and it's almost guaranteed that a non-specific model will probably have a poor representation of that title there. Therefore, you would need a lot of examples to classify it correctly, and even then another very specific term is as likely to occupy that space as not, so all your finetuning goes to waste. You can find the datasets this was trained on here https://huggingface.co/sentence-transformers/all-mpnet-base-v2 (and you can see the vast majority is things that do not relate to your domain at all).
Things you can do to improve this relatively painlessly (as in, there are multiple tutorials on these):
Whether more labels actually helps you is an open question.
A more involved approach would be to restate the objective (i.e. change the model), including by adding more information than the job title.
Hopefully this helped a little
Hi, my usecase is to classify Job Title into Functional Areas. I finetuned
all-mpnet-base-v2
with the help of setfit by providing some 10+ examples for each class (Functional Areas).I got
82%
accuracy on running the evaluation on my test set. I observed some of the simple & straightforward job titles are classified into wrong label with0.6
score.For example:
Here SDET should have labelled as
QA / SDET
but it is classified toBig Data / DWH / ETL
with0.62
score. Few shot examples used for both classes doesn't have anything in common which could confuse the model except one example whose title isData Quality Engineer
and it is underBig Data / DWH / ETL
.Few shot examples (added only for 2 here)
TrainingArgs
Here is the complete set of functional areas.
My guess is accuracy is low because of short text (which is just job title). Please suggest few things which I can try out to improve the accuracy of the model.