ByT5-Sanskrit Analyzers

This repository contains inference scripts for the ByT5-Sanskrit analyzers developed for our EMNLP submission. The analyzers can be tried out in an interactive application at dharmamitra.org. The finetuning data for these models is taken from the Digital Corpus of Sanskrit (DCS).

Models

The pretrained base model is available here: Huggingface link
The finetuned multitask model: Huggingface link
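
Both models are ByT5 models, which read raw UTF-8 bytes instead of subword tokens, so a Sanskrit character with diacritics takes up several input positions. The following is a minimal sketch of that encoding, not code from this repository. It assumes the standard ByT5 convention that IDs 0, 1 and 2 are reserved for padding, end-of-sequence and unknown, so every byte value is shifted by 3. The helper names `encode_bytes` and `decode_bytes` are hypothetical.

```python
# Illustrative sketch of ByT5-style byte-level encoding (not the repository's code).
SPECIAL_OFFSET = 3  # assumption: IDs 0, 1, 2 are reserved for pad, eos, unk
EOS_ID = 1          # assumption: end-of-sequence ID, as in standard ByT5


def encode_bytes(text: str) -> list[int]:
    """Shift each UTF-8 byte by the special-token offset and append EOS."""
    return [b + SPECIAL_OFFSET for b in text.encode("utf-8")] + [EOS_ID]


def decode_bytes(ids: list[int]) -> str:
    """Invert encode_bytes, dropping the reserved special IDs."""
    raw = bytes(i - SPECIAL_OFFSET for i in ids if i >= SPECIAL_OFFSET)
    return raw.decode("utf-8", errors="ignore")


# "dharmaḥ" in IAST; the final character (U+1E25) occupies three bytes in UTF-8.
sample = "dharma\u1e25"
ids = encode_bytes(sample)
print(len(sample), "characters ->", len(ids), "input IDs")
print(decode_bytes(ids))
```

In practice the tokenizer shipped with each model handles this step; the sketch only shows why input length in IDs can exceed the number of characters in the text.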

Repository Structure

Our project is organized into three main directories. Currently, only the applications/ directory is available; we will add the training and data directories in the future.

Getting Started

For instructions on how to run the individual downstream applications, see the README.md files in the subfolders under applications/.

Citation

The preprint is available on arXiv. If you like our work and use it in your research, feel free to cite the paper:

@inproceedings{nehrdichetal2024,
title={One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit {NLP} Tasks},
author={Nehrdich, Sebastian and Hellwig, Oliver and Keutzer, Kurt},
booktitle={Findings of the 2024 Conference on Empirical Methods in Natural Language Processing},
year={2024},
}