issues
search
openxla
/
xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
Apache License 2.0
2.39k
stars
358
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Test for reduction indexing maps being bijections.
#14388
copybara-service[bot]
opened
26 minutes ago
0
Reverts 7e3f86da7241a8db61fd192b47ec0ee076fc3c0b
#14387
copybara-service[bot]
closed
32 minutes ago
0
[PJRT] Use AnyInvocable for WorkerThread.
#14386
copybara-service[bot]
opened
9 hours ago
0
Adding translation from HLO --> StableHLO
#14385
copybara-service[bot]
opened
9 hours ago
0
Integrate LLVM at llvm/llvm-project@efefee28a41e
#14384
copybara-service[bot]
closed
3 hours ago
0
Extend CustomCallOp backend_config to take a DictionaryAttr
#14383
copybara-service[bot]
opened
10 hours ago
0
Refactor build transforms and header replacements
#14382
copybara-service[bot]
opened
11 hours ago
0
Add ulp error field for FP8 floats.
#14381
copybara-service[bot]
opened
11 hours ago
0
Remove obsolete workflows from XLA and TensorFlow
#14380
copybara-service[bot]
closed
11 hours ago
0
[xla:cpu] Add benchmark for compiling a chain if f32[12] buffers
#14379
copybara-service[bot]
opened
12 hours ago
0
Add an XLA:CPU fusion benchmark.
#14378
copybara-service[bot]
closed
13 hours ago
0
[xla:gpu] Increase the threshold for strength reducing small dots to 10M elements
#14377
copybara-service[bot]
opened
14 hours ago
0
[XLA] Give NumMappedDims() internal linkage and use a simple constant instead.
#14376
copybara-service[bot]
closed
14 hours ago
0
Inline buildozer rule for `stream_executor_impl`
#14375
copybara-service[bot]
closed
13 hours ago
0
[XLA] Add tests to make sure that erf/erfc return results in-range
#14374
copybara-service[bot]
opened
15 hours ago
0
Refactor more build target transforms
#14373
copybara-service[bot]
closed
14 hours ago
0
Allow duplicate handler registration when traits and bundle addresses are equal
#14372
copybara-service[bot]
opened
15 hours ago
0
[XLA] Simplify logic in hlo broadcast splitter.
#14371
copybara-service[bot]
opened
16 hours ago
0
Reverts 153798d9ccad24d87a3aec76373503979a479f0c
#14370
copybara-service[bot]
closed
14 hours ago
0
[XLA] Remove the special input bounder
#14369
copybara-service[bot]
opened
16 hours ago
0
[XLA:CPU] Fix the detection of FP16 extension to AVX512
#14368
akhilgoe
opened
16 hours ago
0
gen_gpu_hlo_compile_tests: Don't run in internal coverage infrastructure.
#14367
copybara-service[bot]
opened
17 hours ago
0
Reverts 153798d9ccad24d87a3aec76373503979a479f0c
#14366
copybara-service[bot]
opened
18 hours ago
0
Integrate Triton up to [a7e3476e](https://github.com/openai/triton/commits/a7e3476e86757780c2528d1100eb7aabe58d8c19)
#14365
copybara-service[bot]
opened
20 hours ago
0
[ROCm] Add script to run multi gpu tests
#14364
hsharsha
opened
20 hours ago
0
[XLA:CPU] Align thunks runtime to current runtime behavior for `rng` op.
#14363
copybara-service[bot]
opened
21 hours ago
1
[XLA:GPU] Change `ConstraintExpression` to use `llvm::SmallVector` under the hood.
#14362
copybara-service[bot]
closed
20 hours ago
0
Remove a duplicate bounds check.
#14361
copybara-service[bot]
closed
19 hours ago
0
Fix file path in target patterns.
#14360
copybara-service[bot]
closed
23 hours ago
0
Add Blackwell-related methods to CudaComputeCapability class
#14359
sergey-kozub
opened
1 day ago
3
Fix build failure for //xla/service/gpu/kernels:cutlass_gemm_custom_kernel_benchmarks in OSS
#14358
sergey-kozub
closed
1 day ago
0
Fix build failure for //xla/service/gpu:gpu_latency_hiding_scheduler_test in OSS
#14357
sergey-kozub
closed
23 hours ago
0
Fix //xla/service/gpu:execution_stream_assignment_test in OSS
#14356
sergey-kozub
closed
1 day ago
0
Reduce stack usage of HLO->MLIR conversion functions.
#14355
copybara-service[bot]
closed
16 hours ago
0
Fix //xla/service/gpu:autotuner_util_test in OSS
#14354
sergey-kozub
closed
21 hours ago
0
Fix //xla/service/gpu:triton_support_test in OSS
#14353
sergey-kozub
closed
20 hours ago
0
Build fails
#14352
AleksKnezevic
opened
1 day ago
0
Remove invalid target pattern from TensorFlow builds
#14351
copybara-service[bot]
opened
1 day ago
0
[XLA:SPMD] Remove stale shard group instruction after sharding-aware CSE in sharding proapgation.
#14350
copybara-service[bot]
opened
1 day ago
0
Add mising `llvm/Support` dependency on `run_hlo_module_test` target
#14349
copybara-service[bot]
opened
1 day ago
0
Refactor more build target transformations
#14348
copybara-service[bot]
closed
1 day ago
0
Use the cpu tag filters for the CPU build and cuda tag filters for the CUDA build
#14347
copybara-service[bot]
closed
1 day ago
0
Integrate StableHLO at openxla/stablehlo@2a6ae6e1
#14346
copybara-service[bot]
closed
12 hours ago
0
Remove early return when verifying async custom-call instructions.
#14345
copybara-service[bot]
opened
1 day ago
0
[NVIDIA GPU] Annotate syntactic sugar op name in nsys profile
#14344
terryysun
opened
1 day ago
4
[xla:sdy] Open source xla passes for Shardy.
#14343
copybara-service[bot]
opened
1 day ago
0
Fix //xla/service/gpu:ir_emitter_triton_mem_utils_test in OSS
#14342
sergey-kozub
opened
1 day ago
2
Refactor build target transformations
#14341
copybara-service[bot]
closed
1 day ago
0
[xla:ffi] Use lazy decoding for Buffer<dtype,rank>
#14340
copybara-service[bot]
opened
1 day ago
2
Change `test_filters` and targets for TensorFlow builds
#14339
copybara-service[bot]
closed
1 day ago
0
Next