-
Related GitLab milestone:https://git.twcc.ai/twcc/manual/-/milestones/35#tab-issues
## To-dos
請將 https://github.com/twcc/man-vip/issues/351 中文對齊調整的部分 ,再對齊至 EN 版內容,謝謝!
> - 可透過 中文對齊的 PR > 點選 `Fil…
-
Related GitLab milestone:https://git.twcc.ai/twcc/manual/-/milestones/35#tab-issues
## To-dos
對齊 10/14 (含) 之後 ~ 10/28 上架 HackMD 的上架內容,請檢/視以下文件 HackMD 最新一筆更新紀錄,並將變更部分,更新至 Dcsrs
> 若 HackMD 在 10/1…
-
Currently, it seems that the implementation of in-flight batch is closed-source. Is there any way to print some logs for better performance analysis?
For example, we would like to know the batch si…
-
This claims iterative sequences can be used but I cannot find any examples of how to use it. I was hoping to use it to improve the latency of my mt5 decoder model with key value caching that runs usin…
-
Below is the error I am getting while using generate api with below params. First time it is able to generate with **prefix_pos** but next call I am getting below error.
"use_beam_search": false,…
-
**Description**
The README.md for the client libraries repository doesn't give complete instructions for Windows build. It instructs you how to make a container for doing the Windows build of the cli…
-
**Is your feature request related to a problem? Please describe.**
So I'm trying to use tritonserver in my project. But it uses a lot of RAM for a single model.
* Is this expected behaviour?
* Ar…
-
# MindSpore OCR 服务化部署功能设计说明书
## 一、修订记录
| ***\*日期\**** | ***\*修订版本\**** | ***\*修改章节\**** | ***\*修改描述\**** | ***\*作者\**** |
| -------------- | ------------------ | ------------------ | ---…
-
-
Hi, Do we have standard RTFX & Latency numbers for streaming & non streaming pruned transducer stateless X ? I am configuring triton perf benchmarking. Let me know if any specific steps to be follow…