clarin-eric / parla-clarin

Schema for modelling parliamentary debates
https://clarin-eric.github.io/parla-clarin/
21 stars 6 forks source link

Parlaclarin Python module #13

Open ninpnin opened 3 years ago

ninpnin commented 3 years ago

In the WESTAC project, we use Parla-Clarin to store our corpus, and Python as our go-to programming language. As we did not find specific Python modules for the task, we have mostly been directly working with the XML.

Now we have repackaged the reusable parts of our scripts into a Python module. It's on PyPi. If there is overlapping interest, we are interested in collaboration. Also, if there are similar efforts that we are unaware of, please let us know.

TomazErjavec commented 3 years ago

Nice, thanks for getting in touch. As your code in in PyPi, maybe the simplest would be to make a note of it in the README? Alternativelly, you could make a directory in the project with the code and send a pull request.

As for similar efforts, we just recently finished work on the ParlaMint project and are now on holidays :). But we will continue with the project starting October. One of the extension corpora is also Swedish with Umea, but maybe we/you could collaborate there. We will also revisit the Parla-CLARIN recommendation in the scope of ParlaMint II.