huggingface / setfit

Efficient few-shot learning with Sentence Transformers
https://hf.co/docs/setfit
Apache License 2.0
2.24k stars 223 forks source link

ABSA for Non-English Language #491

Closed firqaaa closed 9 months ago

firqaaa commented 9 months ago

Hi, we are aware that, up to now, most ABSA models have been developed for English, and they typically require the installation of the spaCy model for that language. However, it is observed that spaCy does not cover all languages. So, is it feasible to construct an ABSA model for languages with limited resources that are not supported by spaCy?

kgourgou commented 9 months ago

@firqaaa are you particularly interested in such models being fine-tuned with the setfit training paradigm? Your question seems to be a bit more general.

firqaaa commented 9 months ago

@kgourgou Yes, currently i want to build setfit for indonesian language but there's no spacy model available that spacy provided in their pipeline. so, is there any ways to build absa model without include the spacy model??

tomaarsen commented 9 months ago

Hello!

I'm afraid that the spaCy model is indeed required for SetFit ABSA. It is used throughout the codebase there, so we can't currently modify SetFit ABSA to use some other initial candidate selector.

That said, I see that you have been successful!

I believe this is the first non-English SetFit ABSA model. Well done!

firqaaa commented 9 months ago

Hi @tomaarsen

Thank you for the appreciation. Yes, you're correct; that's my work on SetFit ABSA for Bahasa. Although it was successful, there were still some improvements that I need to make, especially regarding the enhancement of the NER task in my own spaCy models for Bahasa and the accuracy of the model's polarity. Since it has been resolved, I will close this issue. Thank you!