Open mxnthng opened 2 years ago
There can be various ways to compute a distribution over textual data. One simple way (also used in this repo) is to use the frequency of words (alphabetic, numeric, symbolic, and alphanumeric words) to build a distribution, sample a category based on this distribution and uniformly randomly select a word from that category.
can I make a custom distribution and image, ocr, tb file my own? I mean I'd like to generate table data with Japanese, not English but I have no idea how to do this?