openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Apache License 2.0
2.38k stars, 355 forks
issues
[XLA:SPMD] Fix sharding propagation for kGetTupleElement not generating correct sharding shape with subgroup manual.
#14304
copybara-service[bot]
opened
1 hour ago
0
Adds a dedicated constructor to CostGraph for testing.
#14303
copybara-service[bot]
opened
3 hours ago
0
[xla:ffi] NFC: Clean up XLA FFI type aliases
#14302
copybara-service[bot]
opened
3 hours ago
0
Add a global mesh name constant
#14301
copybara-service[bot]
opened
6 hours ago
0
[XLA] Clarify and test the order preservation requirement of a tuple simplification.
#14300
copybara-service[bot]
closed
4 hours ago
0
Cannot use `XlaOp` from outer scope
#14299
joelberkeley
opened
7 hours ago
0
Refactor C++ header replacements
#14298
copybara-service[bot]
opened
7 hours ago
0
[xla:ffi] Add benchmarks for internal XLA FFI implementation
#14297
copybara-service[bot]
opened
8 hours ago
0
Reverts 4d7ec1e49fb89185ecf0c46493f9fb6fcc6519cd
#14296
copybara-service[bot]
closed
7 hours ago
0
[ROCm] fixed gcc build
#14295
i-chaochen
opened
9 hours ago
1
[xla] Add a test for HLO deduplication + execution threads
#14294
copybara-service[bot]
closed
7 hours ago
0
Changes needed for future hardware compatibility.
#14293
dimvar
opened
10 hours ago
0
[XLA:GPU] Set minimum concat fragment size to 64
#14292
copybara-service[bot]
closed
36 minutes ago
0
[xla] Add more details to HLO verifier error message
#14291
copybara-service[bot]
closed
9 hours ago
0
Roll back cl/647677220 "[XLA:GPU] Move `triton_gpu.sparse_dot` to LLVM pattern..."
#14290
copybara-service[bot]
closed
9 hours ago
0
[xla:ffi] NFC: Update documentation
#14289
copybara-service[bot]
closed
8 hours ago
0
[xla:cpu] Add FFI custom call thunk runtime support to PJRT CPU client.
#14288
copybara-service[bot]
opened
13 hours ago
0
[XLA:CPU] Port concatenate instruction to Thunks
#14287
copybara-service[bot]
opened
13 hours ago
1
[IFRT] Include input_specs in RemapPlan::DebugString()
#14286
copybara-service[bot]
closed
9 hours ago
0
[XLA:FFI] Catch exceptions in user FFI calls.
#14285
copybara-service[bot]
opened
14 hours ago
0
Clean up tile size computation for row reductions.
#14284
copybara-service[bot]
closed
14 hours ago
0
Break circular dependency between gpu_stream.cc and gpu_event.cc.
#14283
copybara-service[bot]
closed
14 hours ago
0
Unify indexing map computation for reduction emitters.
#14282
copybara-service[bot]
closed
16 hours ago
0
Move client creation to a separate file (NFC).
#14281
copybara-service[bot]
opened
18 hours ago
0
Standardize patch format to work with internal tooling.
#14280
copybara-service[bot]
opened
18 hours ago
0
Clean up some duplication and dead code in reduction emitter.
#14279
copybara-service[bot]
closed
17 hours ago
0
Reverts 8e7c93698dd15a3c6c347c301de1875dff612515
#14278
copybara-service[bot]
closed
17 hours ago
0
Remove unused GetBinaryDir function.
#14277
copybara-service[bot]
closed
18 hours ago
0
[XLA] Prohibit defining `Literal::AbslHashValue` as it cannot be made consistent.
#14276
copybara-service[bot]
closed
18 hours ago
0
[XLA:GPU] Move `triton_gpu.sparse_dot` to LLVM pattern from Triton patch (`convert-triton-gpu-to-llvm` pass) to OpenXLA (`sparse-convert-layout-op` pass).
#14275
copybara-service[bot]
closed
15 hours ago
0
[XLA:GPU] Disable cublasLT tracing for cuda version < 12030
#14274
shawnwang18
closed
3 hours ago
1
Reverts bf95473bdb484f742de0df899329f15b3e2249ed
#14273
copybara-service[bot]
closed
20 hours ago
0
Explicitly disallow duplicated devices during array construction
#14272
copybara-service[bot]
closed
10 hours ago
0
Automated Code Change
#14271
copybara-service[bot]
opened
22 hours ago
0
Don't refactor dependencies into a bzl file.
#14270
copybara-service[bot]
closed
20 hours ago
0
PR #14166: Add kPower case to algsimp IsNonNegative
#14269
copybara-service[bot]
opened
23 hours ago
0
[PJRT:GPU] Implement copying buffers to pinned host memory space
#14268
jaro-sevcik
closed
12 hours ago
1
Make GPU PJRT client error out early when not compiled with GPU support
#14267
copybara-service[bot]
opened
1 day ago
0
[XLA:GPU] Add temporal TE custom call supports
#14266
shawnwang18
opened
1 day ago
4
[IFRT] Modify ifrt-verify-donation to reject instances when an arg is both donated and not donated.
#14265
copybara-service[bot]
closed
13 hours ago
0
[XLA:GPU] Add debug info for command buffer trace cache
#14264
shawnwang18
opened
1 day ago
1
[xla:ffi] Add an API to update CallFrame in place
#14263
copybara-service[bot]
closed
1 day ago
0
[xla:ffi] NFC: Implement Update as Clone and UpdateInPlace
#14262
copybara-service[bot]
closed
1 day ago
0
[xla:ffi] NFC: Use absl::InlinedVector to store dimensions
#14261
copybara-service[bot]
closed
1 day ago
0
[xla:ffi] Add an API to update CallFrame with new run time values (buffer pointers)
#14260
copybara-service[bot]
closed
1 day ago
0
Downgrade TSL's zlib to fix breakage
#14259
copybara-service[bot]
closed
1 day ago
0
[XLA:CollectivePipeliner] Sort formatting_ops (wrt original order) so that the mapper can traverse them in a way that operands are already mapped.
#14258
copybara-service[bot]
closed
23 hours ago
0
[tsl::mutex] Implement `assert_held()` and `assert_held_shared()` methods.
#14257
copybara-service[bot]
closed
1 day ago
0
Removed redundant *Internal methods from CpuCallback
#14256
copybara-service[bot]
closed
20 hours ago
0
Fix build issue in xla/hlo/utils/hlo_sharding_util_test.cc
#14255
apivovarov
opened
1 day ago
3