Open Vincent131499 opened 5 months ago
Great job! Qwen is an open-source model widely used by the community. Does this project support training it?
Good idea. It would be nice if someone could make a PR to support Qwen 1.5 or, later, Qwen 2.
It should be quite easy, as the architecture seems very similar to the ones we already support in the current code.