-
### What happened + What you expected to happen
I started seeing an issue on our Ray cluster where I'd see nodes with no activity still displaying high memory usage. When I looked into the k8s pod,…
-
I set the environment variables as follow in train_dist.sh in gpt_hf folder:
```
export NUM_NODES=1
export NUM_GPUS_PER_NODE=8
export MASTER_ADDR=localhost
export MASTER_PORT=2222
export NODE_RA…
-
### System Info
Running Llama-3-8B-Instruct with TGI on a high-performance compute cluster with apptainer & SLURM with the following command: `srun --pty --gres=gpu:NVIDIA_A40:2 --mem 32G apptainer…
-
We have a problem that seems to relate to #566 and #189. I cannot access via haproxy with multiple servers.
Initially we exposed a single elastic search server the world a reverse nginx proxy with ba…
-
### System Info
- `transformers` version: 4.38.0.dev0
- Platform: Linux-5.4.0-169-generic-x86_64-with-glibc2.31
- Python version: 3.10.13
- Huggingface_hub version: 0.20.3
- Safetensors version…
-
This is ClickHouse roadmap 2021.
Descriptions and links to be filled.
It will be published in documentation in December.
# Main tasks
### ✔️ Provide alternative for ZooKeeper
Implementation…
-
roachtest.schemachange/mixed-versions [failed](https://teamcity.cockroachdb.com/buildConfiguration/Cockroach_Nightlies_RoachtestNightlyGceBazel/14960054?buildTab=log) with [artifacts](https://teamcity…
-
The buffer size of RedisOutput/InputStream is fixed as 8192, 8KB.
It is no problem if there is not much connection.
In our case, we maintain many connections concurrently and handle under 1KB.
So 8KB…
itugs updated
8 months ago
-
backup info
```
chi-clickhouse-cluster-dtstack-0-0-0:/$ clickhouse-backup create
2024/04/26 17:40:04.008743 info clickhouse connection prepared: tcp://localhost:9000 run ping logger=clickhouse
20…
-
Hi~!In a large-size cluster(scylla version 5.2.9, total 60 nodes in 3 datacenters),A seastar_memory - oversized allocation warning occurred during cluster expansion.The specific error is as follows:
…