pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration
https://pytorch.org
Other
83.65k stars 22.57k forks source link

DISABLED test_comprehensive_nn_functional_interpolate_nearest-exact_cpu_float64 (__main__.TestInductorOpInfoCPU) #135988

Closed pytorch-bot[bot] closed 1 month ago

pytorch-bot[bot] commented 1 month ago

Platforms: mac, macos

This test was disabled because it is failing in CI. See recent examples and the most recent trunk workflow logs.

Over the past 3 hours, it has been determined flaky in 6 workflow(s) with 18 failures and 6 successes.

Debugging instructions (after clicking on the recent samples link): DO NOT ASSUME THINGS ARE OKAY IF THE CI IS GREEN. We now shield flaky tests from developers so CI will thus be green but it will be harder to parse the logs. To find relevant log snippets:

  1. Click on the workflow logs linked above
  2. Click on the Test step of the job so that it is expanded. Otherwise, the grepping will not work.
  3. Grep for test_comprehensive_nn_functional_interpolate_nearest-exact_cpu_float64
  4. There should be several instances run (as flaky tests are rerun in CI) from which you can study the logs.
Sample error message ``` Traceback (most recent call last): File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_device_type.py", line 1140, in test_wrapper return test(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_device_type.py", line 1409, in only_fn return fn(self, *args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_utils.py", line 2133, in wrapper fn(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_device_type.py", line 1210, in dep_fn return fn(slf, *args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_device_type.py", line 1210, in dep_fn return fn(slf, *args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_utils.py", line 1526, in wrapper fn(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_utils.py", line 1466, in wrapper fn(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/unittest/mock.py", line 1336, in patched return func(*newargs, **newkeywargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/contextlib.py", line 79, in inner return func(*args, **kwds) File "/Users/ec2-user/runner/_work/pytorch/pytorch/test/inductor/test_torchinductor_opinfo.py", line 557, in inner raise e File "/Users/ec2-user/runner/_work/pytorch/pytorch/test/inductor/test_torchinductor_opinfo.py", line 549, in inner fn(self, device, dtype, op) File "/Users/ec2-user/runner/_work/pytorch/pytorch/test/inductor/test_torchinductor_opinfo.py", line 793, in test_comprehensive raise e File "/Users/ec2-user/runner/_work/pytorch/pytorch/test/inductor/test_torchinductor_opinfo.py", line 775, in test_comprehensive self.check_model( File "/Users/ec2-user/runner/_work/pytorch/pytorch/test/inductor/test_torchinductor.py", line 430, in check_model actual = run(*example_inputs, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/eval_frame.py", line 465, in _fn return fn(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 1278, in __call__ return self._torchdynamo_orig_callable( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 526, in __call__ return _compile( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 924, in _compile guarded_code = compile_inner(code, one_graph, hooks, transform) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 666, in compile_inner return _compile_inner(code, one_graph, hooks, transform) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_utils_internal.py", line 87, in wrapper_function return function(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 699, in _compile_inner out_code = transform_code_object(code, transform) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/bytecode_transformation.py", line 1322, in transform_code_object transformations(instructions, code_options) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 219, in _fn return fn(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/convert_frame.py", line 634, in transform tracer.run() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/symbolic_convert.py", line 2796, in run super().run() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/symbolic_convert.py", line 983, in run while self.step(): File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/symbolic_convert.py", line 895, in step self.dispatch_table[inst.opcode](self, inst) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/symbolic_convert.py", line 2987, in RETURN_VALUE self._return(inst) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/symbolic_convert.py", line 2972, in _return self.output.compile_subgraph( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/output_graph.py", line 1117, in compile_subgraph self.compile_and_call_fx_graph(tx, list(reversed(stack_values)), root) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/output_graph.py", line 1369, in compile_and_call_fx_graph compiled_fn = self.call_user_compiler(gm) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/output_graph.py", line 1416, in call_user_compiler return self._call_user_compiler(gm) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/output_graph.py", line 1465, in _call_user_compiler raise BackendCompilerFailed(self.compiler_fn, e).with_traceback( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/output_graph.py", line 1446, in _call_user_compiler compiled_fn = compiler_fn(gm, self.example_inputs()) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/repro/after_dynamo.py", line 129, in __call__ compiled_gm = compiler_fn(gm, example_inputs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/repro/after_dynamo.py", line 129, in __call__ compiled_gm = compiler_fn(gm, example_inputs) File "/Users/ec2-user/runner/_work/pytorch/pytorch/test/inductor/test_torchinductor.py", line 422, in compile_fx_wrapper return compile_fx(model_, example_inputs_) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 1525, in compile_fx return aot_autograd( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/backends/common.py", line 72, in __call__ cg = aot_module_simplified(gm, example_inputs, **self.kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_functorch/aot_autograd.py", line 1071, in aot_module_simplified compiled_fn = dispatch_and_compile() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_functorch/aot_autograd.py", line 1056, in dispatch_and_compile compiled_fn, _ = create_aot_dispatcher_function( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_functorch/aot_autograd.py", line 522, in create_aot_dispatcher_function return _create_aot_dispatcher_function( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_functorch/aot_autograd.py", line 759, in _create_aot_dispatcher_function compiled_fn, fw_metadata = compiler_fn( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_functorch/_aot_autograd/jit_compile_runtime_wrappers.py", line 586, in aot_dispatch_autograd compiled_fw_func = aot_config.fw_compiler(fw_module, adjusted_flat_args) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 1354, in fw_compiler_base return _fw_compiler_base(model, example_inputs, is_inference) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 1425, in _fw_compiler_base return inner_compile( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 476, in compile_fx_inner return wrap_compiler_debug(_compile_fx_inner, compiler_name="inductor")( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_dynamo/repro/after_aot.py", line 85, in debug_wrapper inner_compiled_fn = compiler_fn(gm, example_inputs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 662, in _compile_fx_inner compiled_graph = FxGraphCache.load( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/codecache.py", line 1334, in load compiled_graph = compile_fx_fn( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 571, in codegen_and_compile compiled_graph = fx_codegen_and_compile(gm, example_inputs, **fx_kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/compile_fx.py", line 879, in fx_codegen_and_compile compiled_fn = graph.compile_to_fn() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/graph.py", line 1953, in compile_to_fn return self.compile_to_module().call File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/graph.py", line 1879, in compile_to_module return self._compile_to_module() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/graph.py", line 1907, in _compile_to_module mod = PyCodeCache.load_by_key_path( File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/codecache.py", line 2856, in load_by_key_path mod = _reload_python_module(key, path) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/runtime/compile_tasks.py", line 45, in _reload_python_module exec(code, mod.__dict__, mod.__dict__) File "/var/folders/bm/fnn3xd1d39lcpbxrgwys1c140000gn/T/tmpik0_7fvn/bh/cbhyrf3znoonjp5x5nvpskwmazivmtetwkitjo7plczt6ungiir4.py", line 75, in async_compile.wait(globals()) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/async_compile.py", line 286, in wait scope[key] = result.result() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/codecache.py", line 3333, in result return self.result_fn() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/codecache.py", line 2357, in future result = get_result() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/codecache.py", line 2154, in load_fn future.result() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/concurrent/futures/_base.py", line 446, in result return self.__get_result() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/concurrent/futures/_base.py", line 391, in __get_result raise self._exception File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/concurrent/futures/thread.py", line 58, in run result = self.fn(*self.args, **self.kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/codecache.py", line 2195, in _worker_compile_cpp cpp_builder.build() File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/cpp_builder.py", line 1517, in build status = run_compile_cmd(build_cmd, cwd=_build_tmp_dir) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/cpp_builder.py", line 352, in run_compile_cmd return _run_compile_cmd(cmd_line, cwd) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/_inductor/cpp_builder.py", line 346, in _run_compile_cmd raise exc.CppCompileError(cmd, output) from e torch._dynamo.exc.BackendCompilerFailed: backend='compile_fx_wrapper' raised: CppCompileError: C++ compile error Command: clang++ /var/folders/bm/fnn3xd1d39lcpbxrgwys1c140000gn/T/tmpik0_7fvn/zc/czc7oerpwdscqklhen6spv27ofk3qvsp2wh4og7yzkrjope5al5d.cpp -D TORCH_INDUCTOR_CPP_WRAPPER -D C10_USING_CUSTOM_GENERATED_MACROS -D CPU_CAPABILITY_NEON -D AT_BUILD_ARM_VEC256_WITH_SLEEF -shared -fPIC -undefined dynamic_lookup -O3 -DNDEBUG -ffast-math -fno-finite-math-only -fno-unsafe-math-optimizations -ffp-contract=off -Wall -std=c++17 -Wno-unused-variable -Wno-unknown-pragmas -Werror=ignored-optimization-argument -Xclang -fopenmp -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/include/python3.9 -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/include/python3.9 -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include/TH -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include/THC -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/include -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include/torch/csrc/api/include -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include/TH -I/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/include/THC -D_GLIBCXX_USE_CXX11_ABI=0 -lc10 -lomp -L/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib -L/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/lib -L/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/lib -L/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib -L/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/lib -o /var/folders/bm/fnn3xd1d39lcpbxrgwys1c140000gn/T/tmpik0_7fvn/zc/czc7oerpwdscqklhen6spv27ofk3qvsp2wh4og7yzkrjope5al5d.so Output: Set TORCH_LOGS="+dynamo" and TORCHDYNAMO_VERBOSE=1 for more information You can suppress this exception and fall back to eager by setting: import torch._dynamo torch._dynamo.config.suppress_errors = True The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_utils.py", line 2979, in wrapper method(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_device_type.py", line 448, in instantiated_test result = test(self, **param_kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_utils.py", line 1526, in wrapper fn(*args, **kwargs) File "/Users/ec2-user/runner/_work/_temp/conda_environment_10842529251/lib/python3.9/site-packages/torch/testing/_internal/common_device_type.py", line 1152, in test_wrapper raise e_tracked from e Exception: Caused by sample input at index 9: SampleInput(input=Tensor[size=(2, 3, 4, 4), device="cpu", dtype=torch.float64], args=(), kwargs={'size': 'None', 'scale_factor': '0.6', 'mode': "'nearest-exact'", 'align_corners': 'None', 'recompute_scale_factor': 'False'}, broadcasts_input=False, name='') To execute this test, run the following from the base repo dir: PYTORCH_OPINFO_SAMPLE_INPUT_INDEX=9 python test/inductor/test_torchinductor_opinfo.py TestInductorOpInfoCPU.test_comprehensive_nn_functional_interpolate_nearest-exact_cpu_float64 This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 ```

Test file path: inductor/test_torchinductor_opinfo.py

cc @clee2000 @malfet @albanD @ezyang @chauhang @penguinwu @voznesenskym @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @yf225 @chenyang78 @kadeng @muchulee8 @ColinPeppler @amjames @desertfire

pytorch-bot[bot] commented 1 month ago
Hello there! From the DISABLED prefix in this issue title, it looks like you are attempting to disable a test in PyTorch CI. The information I have parsed is below: * Test name: `test_comprehensive_nn_functional_interpolate_nearest-exact_cpu_float64 (__main__.TestInductorOpInfoCPU)` * Platforms for which to skip the test: mac, macos * Disabled by `pytorch-bot[bot]` Within ~15 minutes, `test_comprehensive_nn_functional_interpolate_nearest-exact_cpu_float64 (__main__.TestInductorOpInfoCPU)` will be disabled in PyTorch CI for these platforms: mac, macos. Please verify that your test name looks correct, e.g., `test_cuda_assert_async (__main__.TestCuda)`. To modify the platforms list, please include a line in the issue body, like below. The default action will disable the test for all platforms if no platforms list is specified. ``` Platforms: case-insensitive, list, of, platforms ``` We currently support the following platforms: asan, dynamo, inductor, linux, mac, macos, rocm, slow, win, windows. ### How to re-enable a test To re-enable the test globally, close the issue. To re-enable a test for only a subset of platforms, remove the platforms from the list in the issue body. This may take some time to propagate. To re-enable a test only for a PR, put `Fixes #135988` in the PR body and rerun the test jobs. Note that if a test is flaky, it maybe be difficult to tell if the test is still flaky on the PR.
pytorch-bot[bot] commented 1 month ago

Another case of trunk flakiness has been found here. The list of platforms [mac, macos] appears to contain all the recently affected platforms [mac, macos]. Either the change didn't propogate fast enough or disable bot might be broken.

pytorch-bot[bot] commented 1 month ago

Another case of trunk flakiness has been found here. The list of platforms [mac, macos] appears to contain all the recently affected platforms [mac, macos]. Either the change didn't propogate fast enough or disable bot might be broken.

pytorch-bot[bot] commented 1 month ago

Resolving the issue because the test is not flaky anymore after 150 reruns without any failures and the issue hasn't been updated in 14 days. Please reopen the issue to re-disable the test if you think this is a false positive