RoboTutorLLC / RoboTutor_2019

Main code for RoboTutor. Uploaded 11/20/2018 to XPRIZE from RoboTutorLLC/RoboTutor.
Other
7 stars 4 forks source link

READ narrate more stories #54

Open JackMostow opened 7 years ago

JackMostow commented 7 years ago

From @JackMostow on February 2, 2017 23:43

Narrate the other 37 current stories + new ones, to engage kids

Copied from original issue: synaptek/RoboTutor#268

JackMostow commented 6 years ago

Judith email, Mon 5/28/2018 2:09 AM:

  1. Please plan to record more stories over the summer with Leonora especially the very simple ones. I'll give you the list of the ones we need.

  2. Please plan to record more songs with Filipo... he is currently curating more songs from schools and teachers in TZ, and we still have some that didn't make code drop 1.

Judith email, Tue 5/29/2018 7:05 PM:

Matthew will continue on stories. He will give us the stories that need to be recorded, and regenerate them during the summer and when he returns in the fall

JackMostow commented 6 years ago

Matthew - Stories will take time to narrate, so please point me to them ASAP.

  1. How are these three folders related? Which one(s) contain the most updated story versions to narrate?

a. RoboTutor Swahili ASP Stories contains 104 storydata.json files and 104 mcq.json files.

b. CONTENT > Development > New Swahili Content > result has 181 story.json files and no mcq.json files. Are all of the new ASP stories already in XPRIZE - Home > CONTENT > Development > New Swahili Content?

c. RoboTutor New Swahili ASP Stories has 193 storydata.json files and no mcq.json files. Why 12 more?

  1. You put a nice pipeline of Python scripts in XPRIZE - Home > CONTENT > Development for ASP stories.

    a. Are the latest scripts there or in GitHub?

    b. It looks like they should also work for stories we author in-house too. True? Any caveats?

    c. CONTENT > Development > New Swahili Content > result has identically named scripts. Should I ignore them?

    d. (Where) are the input format requirements documented other than in the python source code, e.g.:

    first line represents title and author as follows:

    {TITLE} {AUTHOR} {LEVEL}

    followed by \n\n\n.

    first page (page 2) starts then. If any image exists, it will be:

    {image_file_name_inside_folder_page_n.jpg}

    level_ok_set = ["level_1", "level_2", "level_3", "level_4", "level_5"]
    illustrations entitled (story_title+"_page_" + str(page_number) + "_image_0001.jpg").lower()

e. What if any file formats besides .jpg can illustrations be, e.g. .svg?

  1. DQGen-Swahili questions:

a. Is CONTENT > Development > DQGen-Swahili the latest version, or is it in GitHub, as the presence of .git suggests? If so, where?

b. Can you point to (if necessary create) documentation for how to run DQGen-Swahili? README.md doesn't say.

THANKS! - Jack

JackMostow commented 6 years ago

@judithodili -

  1. For now I'll upload the 5 story folders of narrations we just recorded, with names like "RT11-POSEIDON storydata 4 170305.535 07202018", to stories to record. Where do you want them in future? We scheduled recording sessions for TWThF July 24-27 but none the following week due to the Summer School.

  2. As per the instructions I sent Leonora with cc to you, the corrected story texts are in Google docs, unlike the original texts, which are .txt files. You may need to convert the Google docs to regular .txt files to run the script to generate storydata.json files. I renamed the buggy story .txt files to try to prevent confusion.

  3. storydata 5 (both .txt and .json) is a duplicate of 4 ("Anaitwa Bi Simu") so we didn't record it.

  4. We also recorded several new prompts in RT11-POSEIDON MORE PROMPTS 160726.491 07202018. Eventually I should reactivate my script to generate English-named copies of Swahili prompt narrations.

Thanks. - Jack

JackMostow commented 6 years ago

@judithodili and @amyogan - Leonora came up with something even better than "Soma kwa sauti" (Read aloud), namely "Somea RoboTutor" (Read to RoboTutor), to encourage kids to think of RoboTutor as an audience. We used the same tactic in some other new prompts too.

amyogan commented 6 years ago

Nice! I like that.

JackMostow commented 6 years ago

Once again: The narration files for each story are in a folder named e.g. RT11-POSEIDON Storydata 17 172248.170 07202018 in stories to record.

JackMostow commented 6 years ago

Leonora just narrated 9 more stories, already uploaded. What a pro!

JackMostow commented 6 years ago

@judithodili, @eyarzebinski, and @amyogan - Can you include all of the following in the next QA?

@msalim and @amogh112 - SWAHILI STORIES ON WRITING contains her narrations of mini-stories about capitalization and punctuation for kids to hear, read, and practice writing.

@msalim - stories to record now contains Leonora's narrations of all the Swahili stories through Story 79 (recorded 9/13/2018). Please include them along with their comprehension questions.

WORD PROBLEMS IN SWAHILI contains her narrations of Swahili word problems taken from EGMA, adapted by Judith, or generated (manually) by Hui, plus Hui's scaffolds for the word problem tutor.