-
Apache Celeborn (Incubating) uses [JIRA](https://issues.apache.org/jira/projects/CELEBORN/issues) for issue management. Please open new issues in [JIRA](https://issues.apache.org/jira/projects/CELEBOR…
-
### Failing test case
TestLogIngestionFleetManaged/Monitoring_logs_are_shipped
### Error message
```
[...]
>>> (windows-amd64-2022-fleet) Test output (sudo) (stdout): Error Trace: C:/Users/w…
-
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
### Search before asking
- [X] I have searched in the [issues](http…
-
Hi all, I saw in the README that there is a spark30 branch supporting Spark 3.0.x. There also seems to be a spark31 branch, but I am wondering whether there are any plans to support Spark 3.2, or could it work out of t…
-
This issue is used for tracking Ray-based Shuffle For Mars progress:
- [x] Ray Future-based Shuffle MEP:https://github.com/mars-project/meps/pull/2
- [x] Shuffle Meta optimization #3055
- [x] Ray F…
-
## 🐛 Bug
Hi, we are using Lightning with litdata on our local machine and AWS S3. However, training hangs randomly during the very first iterations when using DDP with a remote cloud directory.
…
-
Hi!
I get an error when running `mvn clean package -Pserver -DskipTests`:
```
[INFO] BUILD FAILURE
[INFO] ------------------------------------------------------------------------
[INFO] Total time…
-
### Backend
VL (Velox)
### Bug description
[Expected behavior] and [actual behavior].
Vanilla Spark runs the TPC-DS tt test mode (4 executors, 99 TPC-DS queries at the same time) successfully, with no errors reported;
##…
-
### Preconditions
- [X] Requirements fulfilled
- [X] The bug is not a known issue
- [X] The bug has not been solved in the past or the solution that was provided in the past does not work on my s…
-
### Bug description
Our TPU v3-8 deadlocks when using multiple (8) TPU cores on large datasets, specifically datasets larger than 2^15; one size larger and we get a deadlock.
The deadlock occurs some…