Closed danielinux7 closed 3 years ago
Description
I need to extract the text from OpenOffice's SDF files and LibreOffice's PO files to build a parallel ab-ru (Abkhaz-Russian) corpus, and do the same for Firefox, Telegram and GNOME.
Difficulties
Find the files and put them in the right format.
Solution
Use Linux command-line tools to edit the PO and SDF files, and use Excel sheets. Translate Toolkit commands used:
oo2po
po2csv
poswap
Copy all the files in a folder and all its subfolders (depth 10) into the current directory:
find . -maxdepth 10 -type f -exec cp {} . \;
Commits (each line lists the commit links followed by the corpus they cover):
https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/c238f29b7a61a3659b63bc9f91010b0a77cf5642 https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/dedf4876cfe1ff7b505d5378e6b66186364ecb51 LibreOffice en-ab ru-ab
https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/c82820f143df700a99952a0047886c3550b843b5 https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/cbf6163bc1cff54925b2d2facabc5a3c977c6902 OpenOffice en-ab ru-ab
https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/476992b8d431ea7f0548c7b0d1968f94661fd6eb Firefox ru-ab
https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/b6b77b4cc016b0d10fed67cc18a855a33912c16a Gnome ru-ab
https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/e9ac42ea6e0dc530d4bcd02339a0263afcb0d619 https://github.com/danielinux7/Multilingual-Parallel-Corpus/commit/fa5475fc10e984774acadba1b7790e009682dde7 Telegram ab-ru (all) ab-en (part)
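One caveat with the flattening command: `cp {} .` puts every file in a single directory, so two files that share a basename in different subfolders (localization trees often repeat names across language directories) will overwrite each other. Below is a minimal sketch of a variant that keeps both copies by encoding the relative path into the file name. The function name `flatten_tree` and the `__` separator are hypothetical choices for illustration, not part of the original workflow, and the destination is assumed to lie outside the source tree.

```shell
# flatten_tree SRC DEST  (hypothetical helper, not from the original issue)
# Copy every regular file under SRC (at most 10 levels deep, as in the
# original command) into DEST. Each copy is named after its path relative
# to SRC with "/" replaced by "__", so same-named files do not collide.
# Assumption: DEST is outside SRC, otherwise find would revisit the copies.
flatten_tree() {
    src="${1%/}"            # drop one trailing slash, if any
    dest="$2"
    mkdir -p "$dest"
    find "$src" -maxdepth 10 -type f -exec sh -c '
        src="$1"; dest="$2"; shift 2
        for path in "$@"; do
            rel="${path#"$src"/}"                        # path relative to SRC
            flat=$(printf "%s" "$rel" | sed "s|/|__|g")  # "a/b.po" -> "a__b.po"
            cp -- "$path" "$dest/$flat"
        done
    ' sh "$src" "$dest" {} +
}
```

For example, `flatten_tree ./libreoffice ./flat` would copy `./libreoffice/ab/sw/ui.po` to `./flat/ab__sw__ui.po` (the directory names here are made up). The trade-off is longer file names in exchange for never losing a file silently.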
References: