Closed Aisuko closed 2 weeks ago
Description
This PR
fixes #175 fixes #173 fixes #171 fixes #163
Notes for Reviewers The test env is 8 CPUs, 16 GB memory. And the inference speed is reasonable
In this PR, we replaced GPT-2minimal to Phi3-minimal:
Although add instruction and tokenization. However, these functions should be re-write in more stable way.
Signed commits
Description
This PR
fixes #175 fixes #173 fixes #171 fixes #163
Notes for Reviewers The test env is 8 CPUs, 16 GB memory. And the inference speed is reasonable
In this PR, we replaced GPT-2minimal to Phi3-minimal:
Although add instruction and tokenization. However, these functions should be re-write in more stable way.
Signed commits