OpenPecha / toolkit-v2

OpenPecha toolkit version 2
MIT License
0 stars 0 forks source link

OPT20027: Google Docs Parser #63

Closed tenzin3 closed 1 week ago

tenzin3 commented 1 month ago

Description

Parser for root and commentary text into OPF format.

Important Note

Store annotation of meaning segment and its (root/commentary) mapping only. Commentary pecha should store the detail of root pecha such as Pecha id , ....

Input

Google docx file

Expected Output

Parser able to give an OPF STAM format.

Implementation Steps

tenzin3 commented 2 weeks ago

Important Notes when coding the serializer:

tenzin3 commented 1 week ago

Root pecha uploaded to I0152F99B Commentary pecha IA12F61B0

a serialized json for dolma 21 root and commentary uloaded to here root_commentary.json

tenzin3 commented 1 week ago

base_text_titles should be in "english." root and commentary file should be separate. tibetan should be in target.(both root and commentary)