Hello,
I am interested in training the model on a language other than English (Korean). Is this feasible? I noticed that, according to the paper, the word embedding layer is frozen. Does this affect the ability to train with different languages?

Additionally, I am curious about the 8-frame sampling process. Are 8 frames uniformly sampled regardless of the video length (short, mid, long)?
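
To make the sampling question concrete, here is a minimal sketch of what I mean by "uniform" sampling. It is only my assumption, not code from the repository: the function name `uniform_frame_indices` is hypothetical, and picking the midpoint of each segment is just one possible convention.

```python
def uniform_frame_indices(total_frames: int, num_samples: int = 8) -> list[int]:
    """Pick num_samples frame indices spread evenly over a video.

    Assumption: the video is split into num_samples equal segments and the
    frame nearest the midpoint of each segment is taken, whatever the length.
    """
    if total_frames <= 0 or num_samples <= 0:
        raise ValueError("total_frames and num_samples must be positive")
    segment = total_frames / num_samples
    # Clamp to the last valid index; very short videos repeat frames.
    return [min(total_frames - 1, int((i + 0.5) * segment)) for i in range(num_samples)]


print(uniform_frame_indices(80))    # short clip
print(uniform_frame_indices(8000))  # long video, still only 8 frames
```

If sampling works this way, the gap between sampled frames grows with the video length, which is why I am asking whether long videos are handled differently.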
Thank you for your assistance!