Open hvgazula opened 1 year ago
Yes, let's flesh this out and implement a prototype for causal
so we can implement it for other types as well.
Consider adding special tokens to causal models at the beginning, to stay true to what the model was trained on. Needs investigating.
Let's talk about this again
waiting to resolve issue with whisper embedding generation replication
The input generation, inference, and embeddings/logits extraction functions (as appropriate)
tfsemb_main.py
should be moved into separate scripts forcausal
,mlm
, andseq2seq
models.