AI4Bharat / Indic-BERT-v1

Indic-BERT-v1: BERT-based Multilingual Model for 11 Indic Languages and Indian-English. For latest Indic-BERT v2, check: https://github.com/AI4Bharat/IndicBERT
https://indicnlp.ai4bharat.org
MIT License
276 stars 41 forks source link

Documentation for IndicGLUE #4

Closed ameet-1997 closed 4 years ago

ameet-1997 commented 4 years ago

Hi,

This repository has some great resources!

I was wondering where the documentation for the fine-tuning datasets are. Sorry if I overlooked something obvious. I want to evaluate my models (not IndicBERT) on your dataset, so wanted to see how that is possible.

Thanks!

anoopkunchukuttan commented 4 years ago

Hi Ameet,

You can find more details here: https://indicnlp.ai4bharat.org/

Specifically about the datasets here: https://indicnlp.ai4bharat.org/indic-glue/

And paper about this work here: https://indicnlp.ai4bharat.org/papers/arxiv2020_indicnlp_corpus.pdf

ameet-1997 commented 4 years ago

Thanks for the reply! I've gone through all these resources.

However, I was referring to documentation on the code for fine-tuning external models. Is this notebook (link) the only available resource for that? Or is there a place I can look for the documentation?

divkakwani commented 4 years ago

Hey Ameet. Unfortunately, we don't have a documentation yet. We are working on adding both user and developer documentation in our repository.

For now, depending on your use case, you can check if the following helps:

It is also possible to run the evaluation on non-huggingface models. I will shortly put a documentation on it and update here.

ameet-1997 commented 4 years ago

Thanks so much! Leaving this open so that you can close it after you have updated the documentation, but feel free to close it!

divkakwani commented 4 years ago

Hey, I have added some usage notes here https://github.com/AI4Bharat/indic-bert/blob/master/docs/advanced-usage.md. Does it answer your query?

Also, feel free to add here anything else that you'd want to see in the documentation.