pytorch / data

A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.
BSD 3-Clause "New" or "Revised" License

PyTorch 2.2: import torchdata fails on ubuntu-20.04 github runners #1225

Closed agunapal closed 7 months ago

agunapal commented 7 months ago

🐛 Describe the bug

After upgrading to PyTorch 2.2, our CI runs are failing on GitHub runners (ubuntu-20.04) because the import of torchdata fails. The issue doesn't happen locally or on macOS runners.

Looks like some dependency has changed in the new version of torchdata?

Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/torchdata/__init__.py", line 7, in <module>
    from torchdata import _extension  # noqa: F401
  File "/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/torchdata/_extension.py", line 34, in <module>
    _init_extension()
  File "/opt/hostedtoolcache/Python/3.8.18/x64/lib/python3.8/site-packages/torchdata/_extension.py", line 31, in _init_extension
    from torchdata import _torchdata as _torchdata
ImportError: libssl.so.3: cannot open shared object file: No such file or directory
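
The ImportError above comes from the dynamic loader failing to find `libssl.so.3`. As a rough sketch (not part of the original report), one way to check for the library on a runner without importing torchdata is to try loading it with `ctypes`. The helper name `can_load` is hypothetical.

```python
import ctypes


def can_load(soname: str) -> bool:
    """Return True if the dynamic loader can open the given shared library."""
    try:
        ctypes.CDLL(soname)
        return True
    except OSError:
        # Raised when the library (or one of its own dependencies) is missing.
        return False


# libssl.so.3 is the library named in the ImportError above.
print("libssl.so.3 loadable:", can_load("libssl.so.3"))
```

If this prints `False` on the runner, the failure is in the environment's shared libraries and not in the torchdata Python code itself.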

Here is a failing run:

https://github.com/pytorch/serve/actions/runs/7804933569/job/21287979228

Versions

Collecting environment information...
PyTorch version: 2.2.0+cpu
Is debug build: False
CUDA used to build PyTorch: None
ROCM used to build PyTorch: N/A

OS: Ubuntu 20.04.6 LTS (x86_64)
GCC version: (Ubuntu 9.4.0-1ubuntu1~20.04.2) 9.4.0
Clang version: 11.0.0-2~ubuntu20.04.1
CMake version: version 3.28.2
Libc version: glibc-2.31

Python version: 3.8.18 (default, Aug 28 2023, 08:26:46) [GCC 9.4.0] (64-bit runtime)
Python platform: Linux-5.15.0-1054-azure-x86_64-with-glibc2.2.5
Is CUDA available: False
CUDA runtime version: No CUDA
CUDA_MODULE_LOADING set to: N/A
GPU models and configuration: No CUDA
Nvidia driver version: No CUDA
cuDNN version: No CUDA
HIP runtime version: N/A
MIOpen runtime version: N/A
Is XNNPACK available: True

CPU:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
Address sizes: 48 bits physical, 48 bits virtual
CPU(s): 4
On-line CPU(s) list: 0-3
Thread(s) per core: 2
Core(s) per socket: 2
Socket(s): 1
NUMA node(s): 1
Vendor ID: AuthenticAMD
CPU family: 25
Model: 1
Model name: AMD EPYC 7763 64-Core Processor
Stepping: 1
CPU MHz: 2709.745
BogoMIPS: 4890.86
Virtualization: AMD-V
Hypervisor vendor: Microsoft
Virtualization type: full
L1d cache: 64 KiB
L1i cache: 64 KiB
L2 cache: 1 MiB
L3 cache: 32 MiB
NUMA node0 CPU(s): 0-3
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Not affected
Vulnerability Mds: Not affected
Vulnerability Meltdown: Not affected
Vulnerability Mmio stale data: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Mitigation; safe RET, no microcode
Vulnerability Spec store bypass: Vulnerable
Vulnerability Spectre v1: Mitigation; usercopy/swapgs barriers and __user pointer sanitization
Vulnerability Spectre v2: Mitigation; Retpolines, STIBP disabled, RSB filling, PBRSB-eIBRS Not affected
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt pdpe1gb rdtscp lm constant_tsc rep_good nopl tsc_reliable nonstop_tsc cpuid extd_apicid aperfmperf pni pclmulqdq ssse3 fma cx16 pcid sse4_1 sse4_2 movbe popcnt aes xsave avx f16c rdrand hypervisor lahf_lm cmp_legacy svm cr8_legacy abm sse4a misalignsse 3dnowprefetch osvw topoext invpcid_single vmmcall fsgsbase bmi1 avx2 smep bmi2 erms invpcid rdseed adx smap clflushopt clwb sha_ni xsaveopt xsavec xgetbv1 xsaves clzero xsaveerptr rdpru arat npt nrip_save tsc_scale vmcb_clean flushbyasid decodeassists pausefilter pfthreshold v_vmsave_vmload umip vaes vpclmulqdq rdpid fsrm

Versions of relevant libraries:
[pip3] intel-extension-for-pytorch==2.1.0
[pip3] mypy==1.3.0
[pip3] mypy-extensions==1.0.0
[pip3] numpy==1.24.3
[pip3] onnx==1.14.1
[pip3] onnxruntime==1.15.0
[pip3] torch==2.2.0+cpu
[pip3] torchaudio==2.2.0+cpu
[pip3] torchdata==0.7.1+cpu
[pip3] torchpippy==0.1.1
[pip3] torchtext==0.17.0+cpu
[pip3] torchvision==0.17.0+cpu
[conda] No relevant packages

cc @seemethere @malfet @osalpekar @atalman

atalman commented 7 months ago

Looks like a problem when installing torchdata from our custom index, download.pytorch.org.

From pip, installs: torchdata-0.7.1-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl

ldd _torchdata.so 
        linux-vdso.so.1 (0x00007ffe263b0000)
        libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007f5c5b7ef000)
        libz.so.1 => /lib/x86_64-linux-gnu/libz.so.1 (0x00007f5c5b7d3000)
        libdl.so.2 => /lib/x86_64-linux-gnu/libdl.so.2 (0x00007f5c5b7cd000)
        libstdc++.so.6 => /lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007f5c5b5eb000)
        libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007f5c5b49c000)
        libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007f5c5b481000)
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007f5c5b28d000)
        /lib64/ld-linux-x86-64.so.2 (0x00007f5c5c33c000)

From download.pytorch.org, installs: torchdata-0.7.1+cpu-cp38-cp38-linux_x86_64.whl

 ldd _torchdata.so 
        linux-vdso.so.1 (0x00007ffd173f4000)
        libpthread.so.0 => /lib/x86_64-linux-gnu/libpthread.so.0 (0x00007fd71db67000)
        libz.so.1 => /lib/x86_64-linux-gnu/libz.so.1 (0x00007fd71db4b000)
        libcurl.so.4 => not found
        libssl.so.3 => not found
        libcrypto.so.3 => not found
        libstdc++.so.6 => /lib/x86_64-linux-gnu/libstdc++.so.6 (0x00007fd71d967000)
        libm.so.6 => /lib/x86_64-linux-gnu/libm.so.6 (0x00007fd71d818000)
        libgcc_s.so.1 => /lib/x86_64-linux-gnu/libgcc_s.so.1 (0x00007fd71d7fd000)
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fd71d60b000)
        /lib64/ld-linux-x86-64.so.2 (0x00007fd71e2c8000)
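
The difference between the two wheels is easiest to see in the `not found` lines of the `ldd` output. A minimal sketch of extracting them automatically, for example as a CI check, might look like the following. The function name `missing_libraries` and the `SAMPLE` text are illustrative assumptions; the sample lines are copied from the output above.

```python
from typing import List


def missing_libraries(ldd_output: str) -> List[str]:
    """Return the sonames that ldd reported as 'not found'."""
    missing = []
    for line in ldd_output.splitlines():
        # Dependency lines look like "libfoo.so.1 => /path (0x...)"
        # or "libfoo.so.1 => not found".
        name, sep, target = line.strip().partition(" => ")
        if sep and target.strip() == "not found":
            missing.append(name)
    return missing


# Excerpt of the ldd output for the download.pytorch.org wheel shown above.
SAMPLE = """\
        libz.so.1 => /lib/x86_64-linux-gnu/libz.so.1 (0x00007fd71db4b000)
        libcurl.so.4 => not found
        libssl.so.3 => not found
        libcrypto.so.3 => not found
        libc.so.6 => /lib/x86_64-linux-gnu/libc.so.6 (0x00007fd71d60b000)
"""

print(missing_libraries(SAMPLE))
# → ['libcurl.so.4', 'libssl.so.3', 'libcrypto.so.3']
```

In practice the text would come from running `ldd` on the installed `_torchdata.so`, for instance via `subprocess.run`. An empty result means every listed dependency resolved on that machine.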

malfet commented 7 months ago

Moved to torchdata repo

agunapal commented 7 months ago

Thanks for fixing, @atalman. The issue is resolved now.