Open johnwdubois opened 3 years ago
@ZoeF99 can you prepare each of the bullet points as strings for localization?
Translation of bullet points in various languages will be required later.
Here is Lauren's mockup of the Pre-Import screen made with Figma
Check to make sure localizations for the import screen are present in the next build of Rezonator.
Translations needed for all descriptions.
Background The import screen shows information to guide the user in selecting which import type is appropriate.
Providing a description for each data type will give the user useful information for choosing the import.
Options to display Show the import options in the following sequence.
Choose the data type that best matches your file: Song & Verse Prose One Word Per Line (OWPL) CoNLL-U Transcription Elan (tab-delimited) Interlinear Glossed Text (IGT)
Screenshot
BULLET POINTS Depending on the import type that is selected by the user, display the following bullet points in a window labeled "Description", displayed in the middle column of the import screen (as shown in the screenshot above).
Song & Verse • short lines of text • line breaks are meaningful • no word wrap • split words on whitespace • file type: plain text (.txt) • example: song, poem, etc.
Prose • long paragraphs of text • one hard return (newline) at end of paragraph • words wrap to fit on page or screen • split words on whitespace • file type: plain text (.txt) • example: news, blog, Wikipedia, novel
One Word Per Line (OWPL) • columns and rows • text column reads vertically • each row represents 1 word (token) • each column shows a word feature • file type: spreadsheet (.csv) • example: Santa Barbara Corpus .csv
CoNLL-U • columns and rows • text column reads vertically • each row represents 1 word (token) • each column shows a word feature • hashtag lines mark unit features • file type: spreadsheet (.csv) • example: Universal Dependencies corpus
Transcription • one unit per line • text reads normally • tab for speaker labels • tab for timestamps (optional) • split words on whitespace • file type: tab-delimited text (.txt, .csv) • example: Santa Barbara Corpus .txt
Elan (tab-delimited export) • one unit per block • blocks have 1+ lines • text reads normally • tab for speaker labels • tab for timestamps • split words on whitespace • file type: tab-delimited text (.txt) • example:
Interlinear Glossed Text (IGT) • one unit per block • blocks have 2+ lines • blocks separated by a blank line • split morphs on whitespace & hyphen • file type: plain text (.txt) • example: Nuuchanulth
Additional updates