Bootstrap a corpus file from Watson Assistant training data

In https://github.com/IBM/watson-tts-python#extract_skill_textpy we extract WA data to a two-column file for listening to the prompts. The second column could be extracted to a separate file (sans header), and that new file is suitable as a language model corpus file for STT.

Either this tool or https://github.com/IBM/watson-stt-wer-python should be able to create (or append) to a corpus file from WA training data. I prefer append because we frequently want to duplicate utterances in the LM corpus file.

IBM / watson-tts-python

Bootstrap a corpus file from Watson Assistant training data #22