NVIDIA / TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
https://developer.nvidia.com/tensorrt
Apache License 2.0

trt10.5 pytorch-quantization has compile bug #4197

Open lix19937 opened 1 month ago

lix19937 commented 1 month ago

Description

pytorch-quantization in TensorRT 10.5 has a compile bug: it redefines two ATen dispatch macros.

https://github.com/NVIDIA/TensorRT/blob/release/10.5/tools/pytorch-quantization/src/tensor_quant_gpu.cu#L28-L37 defines two macros, AT_DISPATCH_CASE_FLOATING_TYPES and AT_DISPATCH_FLOATING_TYPES:

#define AT_DISPATCH_CASE_FLOATING_TYPES(...)   \
  AT_DISPATCH_CASE(at::ScalarType::Double, __VA_ARGS__)  \
  AT_DISPATCH_CASE(at::ScalarType::Float, __VA_ARGS__)  \
  AT_DISPATCH_CASE(at::ScalarType::Half, __VA_ARGS__)   \
  AT_DISPATCH_CASE(at::ScalarType::BFloat16, __VA_ARGS__)

#define AT_DISPATCH_FLOATING_TYPES(TYPE, NAME, ...) \
  AT_DISPATCH_SWITCH(                                        \
      TYPE, NAME, AT_DISPATCH_CASE_FLOATING_TYPES(__VA_ARGS__))

However, at https://github.com/NVIDIA/TensorRT/blob/release/10.5/tools/pytorch-quantization/src/tensor_quant_gpu.cu#L18, #include <ATen/ATen.h> pulls in #include <ATen/Dispatch.h>, which already defines both macros.
I checked torch 1.13 and torch 2.4.1; the same conflict occurs with both.

So the two macros are defined twice. @moraxu The fix is to add

#undef AT_DISPATCH_CASE_FLOATING_TYPES
#undef AT_DISPATCH_FLOATING_TYPES

before the #define lines in tensor_quant_gpu.cu. (Note that #undef takes only the macro name, with no parameter list.)

Environment

TensorRT Version: 10.5

NVIDIA GPU: RTX 2000

NVIDIA Driver Version:

CUDA Version: 11.8

CUDNN Version: 9.1

Operating System:

Python Version (if applicable): 3.8

PyTorch Version (if applicable): 1.13 or 2.4.1

yuanyao-nv commented 1 month ago

pytorch-quantization development has been discontinued in favor of Model Optimizer since TRT 10.2. Please try that if possible. cc @nzmora-nvidia for opinion on the raised issue.