Open trecius opened 8 years ago
For each time frame (one line in your training corpus), if it only contains 5 features, you could build embedding model likes. That means each time frame has its unique id. ID1 0.923 0.223 0.573 0.235 0.111 ID2 0.920 0.228 0.353 0.213 0.098 ID3 0.901 0.677 0.235 0.551 0.121 ... ID2 0.920 0.228 0.353 0.213 0.098
I just updated RNNSharp to support embedding model in raw text format, so you could use above format for training directly. Please replace WORDEMBEDDING_FILENAME with WORDEMBEDDING_RAW_FILENAME in configuration file.
For #2, yes. It looks good. For example, it may looks like ID1 Wave ID2 Label2 ID2 Wave ... IDn LabelX
For each time frame, it has a corresponding label as result.
Hello:
I'm getting closer. I've since extracted all my time frames that I want to train the dataset into a single file: rawModel.txt. It has the format:
Since you are going to use continuous values as features, the template.txt should only keep one line: U01:%x[0,0]. All of other lines are used for discrete features only.
In training corpus, RNNSharp uses a blank line to split two entities, but embedding model (rawModel.txt in your example) needn't to use blank lines, since embedding model is just a key-value pair, RNNSharp access embedding model by keyword, and get dense features from embedding model for encoding or decoding.
RNNSharp already supports embedding model in raw text format, you could sync the latest code from depot and use it. In your case, the configuration file looks like:
TFEATURE_FILENAME: tfeature
TFEATURE_CONTEXT: 0
WORDEMBEDDING_RAW_FILENAME: rawModel.txt
WORDEMBEDDING_CONTEXT: -1, 0, 1
WORDEMBEDDING_COLUMN: 0
I hope these information can help you. For exception you mentioned, could you please show more detailed information about it ?
Hello,
I have a spatio-temporal dataset that I have compiled. It's in a TSV format, and I'd like your RNNSharp to consume the input for classification as well as recognition. My features are continuous values in the range [0, 1]. My TSV file looks like the following:
ID1 0.923 0.223 0.573 0.235 0.111 ID1 0.920 0.228 0.353 0.213 0.098 ID1 0.901 0.677 0.235 0.551 0.121 ... ID1 0.853 0.383 0.301 0.618 0.132