-
Dear Fluid TOC members and maintainers,
I am proposing to have Shiming Wu(GithubID: wushiming540) as a new committer.
Wu Shiming, senior engineer of Huawei's NAIE platform, is responsible for l…
-
### Site URL
https://www.alibabacloud.com
### Description of the above provided site
### Cloud Service Links
1. **Alibaba Cloud**
- URL:(https://www.alibabacloud.com)
- Description: …
-
I noticed hopper cluster setting may have a chance to optimize the performance of batch_decode by merging `VariableLengthMergeStates` with `BatchDecodeWithPagedKVCacheKernel`. Is there any plan to us…
-
I've been watching slskd metrics after upgrading to .NET 7, and one thing that is really jumping out at me is the correlation between garbage collection frequency, CPU utilization, and the number of d…
-
When training the llm model according to the example, the following error occurred
Qwen1.5-0.5B-Chat and chatglm3-6b had same error.
Please help me check where the problem is.
Thanks !!!
…
-
### Feature request
## Description
This feature proposal aims to update Hugging Face's support for tensor parallelism (TP) to accommodate the increasing size and complexity of models such as [LLaM…
-
Dear Fluid TOC members and maintainers,
I am proposing to have Xiaozheng Zhang (GithubID: zhang-x-z) as a new Fluid committer.
Xiaozheng Zhang, now is a master student in Nanjing University, foc…
-
### Describe This Problem
Considering the query targeting at a partition table whose hash partition key is called `partition_col`:
`select * from partition_table where partition_col in ("a", "b", …
-
### Bug description
When Optuna is run in parallel mode (`n_jobs=-1`), with `NeptuneCallback`, I get:
`[neptune] [error ] Error occurred during asynchronous operation processing: X-coordinates (ste…
-
## Description
We need to implement an algorithm for generating computation graphs. This algorithm should efficiently create and manage computation graphs, which are essential for various computati…