Normalized expression values different from scanpy

tjbencomo commented 5 months ago

Describe the bug Hi - thanks for making this software! Very much needed, especially as single cell datasets get very large with spatial transcriptomics platforms. I've noticed some differences between scanpy and rapids_singlecell that are significantly affecting my results so I went back and started to compare how output from the two packages differed.

I've noticed that normalize_total seems to produce different results between rapids_singlecell and scanpy. I realize that not all functions are expected to produce the exact same results, but figured that normalization should be equivalent since it's such a deterministic and basic step. I've tried my best to create a good head to head comparison below where parameters are the same for both function calls.

Thanks in advance for the help!

Steps/Code to reproduce bug I downloaded the example dataset used in the Demo Workflow and Decoupler notebook using this URL from the data_downloader notebook that accompanies the demo notebook.

I then ran the following code:

import anndata as ad
import cupy as cp
import numpy as np
import rapids_singlecell as rsc
import scanpy as sc

import rmm
from rmm.allocators.cupy import rmm_cupy_allocator

rmm.reinitialize(
    managed_memory=True,  # Allows oversubscription
    pool_allocator=False,  # default is False
    devices=0,  # GPU device IDs to register. By default registers only GPU 0.
)
cp.cuda.set_allocator(rmm_cupy_allocator)
import gc

# Example dataset from Demo Workflow and Decoupler notebook
## cpudata: anndata object processed with scanpy
cpudata = sc.read_h5ad("/hpc/temp/setty_m/tbencomo/adata.raw_compressed.h5ad")
## gpudata: anndata object processed with rapids_singlecell 
gpudata = cpudata.copy()

rsc.get.anndata_to_GPU(gpudata)
rsc.pp.normalize_total(gpudata, target_sum=1e4)
sc.pp.normalize_total(cpudata, target_sum=1e4)
rsc.get.anndata_to_CPU(gpudata)

print(f"Number values not matching after normalization: {np.sum(cpudata.X != gpudata.X)}")

Expected behavior I expect the expression matrices to be equivalent after normalization by the two libraries. np.sum(cpudata.X != gpudata.X) should produce a value of 0.

Environment details (please complete the following information):

Environment location: HPC Slurm Cluster running Jupyter Lab
Linux Distro/Architecture: Ubuntu 18.04.6 LTS amd64
GPU Model/Driver: GeForce RTX 2080 Ti and Driver Version: 535.104.05
CUDA: 12.2

Method of Rapids install: pip

aiobotocore               2.5.4
aiohttp                   3.9.5
aioitertools              0.11.0
aiosignal                 1.3.1
anndata                   0.10.7
anyio                     4.3.0
argon2-cffi               23.1.0
argon2-cffi-bindings      21.2.0
array_api_compat          1.7
arrow                     1.3.0
asciitree                 0.3.3
asttokens                 2.4.1
async-lru                 2.0.4
attrs                     23.2.0
autopep8                  2.1.1
Babel                     2.15.0
beautifulsoup4            4.12.3
black                     24.4.2
bleach                    6.1.0
bokeh                     3.4.1
botocore                  1.31.17
cachetools                5.3.3
certifi                   2024.2.2
cffi                      1.16.0
charset-normalizer        3.3.2
click                     8.1.7
click-plugins             1.1.1
cligj                     0.7.2
cloudpickle               3.0.0
colorama                  0.4.6
colorcet                  3.1.0
comm                      0.2.2
contourpy                 1.2.1
cucim-cu12                24.4.0
cuda-python               12.5.0
cudf-cu12                 24.4.1
cugraph-cu12              24.4.0
cuml-cu12                 24.4.0
cuproj-cu12               24.4.0
cupy-cuda12x              13.1.0
cuspatial-cu12            24.4.0
cuvs-cu12                 24.4.0
cuxfilter-cu12            24.4.1
cycler                    0.12.1
dask                      2024.5.1
dask-cuda                 24.4.0
dask-cudf-cu12            24.4.1
dask-expr                 1.1.1
dask-image                2024.5.3
datashader                0.16.1
debugpy                   1.8.1
decorator                 5.1.1
decoupler                 1.6.0
defusedxml                0.7.1
distributed               2024.5.1
docrep                    0.3.2
executing                 2.0.1
fasteners                 0.19
fastjsonschema            2.19.1
fastrlock                 0.8.2
fiona                     1.9.6
fonttools                 4.52.1
fqdn                      1.5.1
frozenlist                1.4.1
fsspec                    2023.6.0
geopandas                 0.14.4
gitdb                     4.0.11
GitPython                 3.1.43
h11                       0.14.0
h5py                      3.11.0
holoviews                 1.18.3
httpcore                  1.0.5
httpx                     0.27.0
idna                      3.7
igraph                    0.11.5
imageio                   2.34.1
importlib_metadata        7.1.0
inflect                   7.2.1
ipykernel                 6.29.4
ipython                   8.24.0
isoduration               20.11.0
isort                     5.13.2
jedi                      0.19.1
Jinja2                    3.1.4
jmespath                  1.0.1
joblib                    1.4.2
json5                     0.9.25
jsonpointer               2.4
jsonschema                4.22.0
jsonschema-specifications 2023.12.1
jupyter_client            8.6.2
jupyter_core              5.7.2
jupyter-events            0.10.0
jupyter-lsp               2.2.5
jupyter_server            2.14.0
jupyter-server-mathjax    0.2.6
jupyter_server_proxy      4.1.2
jupyter_server_terminals  0.5.3
jupyterlab                4.2.1
jupyterlab_code_formatter 2.2.1
jupyterlab-execute-time   3.1.2
jupyterlab-link-share     0.3.0
jupyterlab_pygments       0.3.0
jupyterlab_server         2.27.2
jupyterlab_widgets        3.0.10
jupyterlmod               4.0.3
kiwisolver                1.4.5
lazy_loader               0.4
legacy-api-wrap           1.4
leidenalg                 0.10.2
linkify-it-py             2.0.3
llvmlite                  0.42.0
locket                    1.0.0
Markdown                  3.6
markdown-it-py            3.0.0
MarkupSafe                2.1.5
matplotlib                3.8.0
matplotlib-inline         0.1.7
matplotlib-scalebar       0.8.1
mdit-py-plugins           0.4.1
mdurl                     0.1.2
mistune                   3.0.2
more-itertools            10.2.0
msgpack                   1.0.8
multidict                 6.0.5
multipledispatch          1.0.0
multiscale_spatial_image  0.11.2
mypy-extensions           1.0.0
natsort                   8.4.0
nbclient                  0.10.0
nbconvert                 7.16.4
nbdime                    4.0.1
nbformat                  5.10.4
nest-asyncio              1.6.0
networkx                  3.3
nodejs                    0.1.1
notebook_shim             0.2.4
numba                     0.59.1
numcodecs                 0.12.1
numpy                     1.26.4
nvtx                      0.2.10
ome-zarr                  0.9.0
omnipath                  1.0.8
optional-django           0.1.0
overrides                 7.7.0
packaging                 24.0
pandas                    2.2.1
pandocfilters             1.5.1
panel                     1.4.3
param                     2.1.0
parso                     0.8.4
partd                     1.4.2
pathspec                  0.12.1
patsy                     0.5.6
pexpect                   4.9.0
pillow                    10.3.0
PIMS                      0.6.1
pip                       24.0
platformdirs              4.2.2
prometheus_client         0.20.0
prompt-toolkit            3.0.43
protobuf                  4.25.3
psutil                    5.9.8
ptyprocess                0.7.0
pure-eval                 0.2.2
pyarrow                   14.0.2
pycodestyle               2.11.1
pycparser                 2.22
pyct                      0.5.0
pygeos                    0.14
Pygments                  2.18.0
pylibcugraph-cu12         24.4.0
pylibraft-cu12            24.4.0
pynndescent               0.5.12
pynvjitlink-cu12          0.2.3
pynvml                    11.4.1
pyparsing                 3.1.2
pyproj                    3.6.1
python-dateutil           2.9.0.post0
python-json-logger        2.0.7
pytz                      2024.1
pyviz_comms               3.0.2
PyYAML                    6.0.1
pyzmq                     26.0.3
raft-dask-cu12            24.4.0
rapids-dask-dependency    24.4.1
rapids_singlecell         0.10.4
referencing               0.35.1
requests                  2.32.2
rfc3339-validator         0.1.4
rfc3986-validator         0.1.1
rich                      13.7.1
rmm-cu12                  24.4.0
rpds-py                   0.18.1
s3fs                      2023.6.0
scanpy                    1.10.1
scikit-image              0.22.0
scikit-learn              1.5.0
scikit-misc               0.3.1
scipy                     1.13.1
seaborn                   0.13.2
Send2Trash                1.8.3
session_info              1.0.0
setuptools                70.0.0
shapely                   2.0.4
simpervisor               1.0.0
six                       1.16.0
slicerator                1.1.0
smmap                     5.0.1
sniffio                   1.3.1
sortedcontainers          2.4.0
soupsieve                 2.5
spatial_image             0.3.0
spatialdata               0.0.15
squidpy                   1.4.1
stack-data                0.6.3
statsmodels               0.14.2
stdlib-list               0.10.0
tblib                     3.0.0
terminado                 0.18.1
texttable                 1.7.0
threadpoolctl             3.5.0
tifffile                  2024.5.22
tinycss2                  1.3.0
tomli                     2.0.1
toolz                     0.12.1
tornado                   6.4
tqdm                      4.66.4
traitlets                 5.14.3
treelite                  4.1.2
typeguard                 4.2.1
types-python-dateutil     2.9.0.20240316
typing_extensions         4.12.0
tzdata                    2024.1
uc-micro-py               1.0.3
ucx-py-cu12               0.37.0
umap-learn                0.5.6
uri-template              1.3.0
urllib3                   1.26.18
validators                0.28.2
wcwidth                   0.2.13
webcolors                 1.13
webencodings              0.5.1
websocket-client          1.8.0
wheel                     0.43.0
wrapt                     1.16.0
xarray                    2023.12.0
xarray-dataclasses        1.7.0
xarray-datatree           0.0.14
xarray-schema             0.0.3
xarray-spatial            0.4.0
xyzservices               2024.4.0
yapf                      0.40.2
yarl                      1.9.4
zarr                      2.18.1
zict                      3.0.0
zipp                      3.18.2

Additional context A similar problem happens with my own datasets

Intron7 commented 5 months ago

Hi @tjbencomo,

Thank you for your detailed report and for using our software.

This is not a bug but an inherent property of floating-point values. Unlike integers, floats are approximations of the real value, meaning that with every computational step, some form of rounding occurs. When you add parallel processing into the mix, this rounding happens asynchronously, which can also cause slight changes in the results.

Therefore, achieving exact matches between CPU and GPU computations with 32-bit floating-point (FP32) precision is generally not possible. In testing, we typically verify that values are roughly the same, usually within a tolerance of 1e-5.

If you require more precision, you might consider switching to 64-bit floats (FP64). However, be aware that this comes with a significant increase in computational cost and memory usage.

I hope this clears up your issue.

Best regards, Severin

tjbencomo commented 4 months ago

Thanks for the explanation!

scverse / rapids_singlecell

Normalized expression values different from scanpy #208