Closed eaidova closed 3 months ago
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
@echarlaix could you please take a look?
Can we have a test for each model architecture that is updated in this PR?
Can we have a test for each model architecture that is updated in this PR?
this is update for models that are already in testing, I added only baichuan based on different code version in tests, mpt and internlm are remain without changes
What does this PR do?
optimize mpt and internlm models with scaled dot product attention fixed export baichuan-13b model
Before submitting