MontrealCorpusTools / PolyglotDB

Language data store and linguistic query API
MIT License

Import CSJ XML format #53

Open mmcauliffe opened 8 years ago

mmcauliffe commented 8 years ago

msonderegger commented 8 years ago

You're probably thinking this already, but you'll likely need to import just a subset of the XML format; it's quite complex and includes things we can't currently represent (like moras and prosodic units).

orianakc commented 8 years ago

It might be easier to import the TextGrids? They contain a lot of the same information as the XML, including ToBI annotations.

msonderegger commented 8 years ago

Good idea, that sounds simpler.