Just adding that when I found this issue, I had all `dnatracing`, `plotting` and `summary_stats` turned off.
The pandas version at the very least would be useful to know, as it's `pd.concat()` that does the concatenation.
pip show pandas
pandas v2.2.3
Small update on where I got with this earlier using Laura's smaller test set. Concatenating the per-image CSVs directly with the snippet below (different to the failure of `imagestats.csv` I saw earlier) causes no issue in the same environment 🙃.
import pandas as pd
import numpy as np
from collections import defaultdict
base = "/Users/Maxgamill/Desktop/Uni/PhD/topo_test/TopoStats/concat/"
img1 = "20230526_puc19_tube1_24hr_mg.0_00003"
img2 = "20230526_puc19_tube1_24hr_mg.0_00002"
results = defaultdict()
for img in [img1, img2]:
    df = pd.read_csv(base + img + ".csv")
    results[img] = df
total_df = pd.concat(results.values())
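If the warning really does come from empty or all-NA data frames, a variation on the snippet above could drop those frames before calling `pd.concat()`. This is only a rough sketch, not what TopoStats does: the helper name `concat_non_empty`, the dictionary keys and the `grain` / `area` columns are all made up for illustration. It also only handles whole frames, so a column that is all NA in just one frame is not addressed here.

```python
import pandas as pd


def concat_non_empty(frames):
    """Concatenate frames, skipping any that are empty or contain only NA values.

    Hypothetical helper for illustration; not part of TopoStats or pandas.
    """
    kept = [df for df in frames if not df.empty and not df.isna().all().all()]
    if not kept:
        # Nothing usable, so return an empty frame rather than calling pd.concat([]),
        # which raises ValueError.
        return pd.DataFrame()
    return pd.concat(kept)


# Made-up stand-ins for the per-image results.
frames = {
    "img1": pd.DataFrame({"grain": [0, 1], "area": [1.5, 2.5]}),
    "img2": pd.DataFrame(),  # empty frame, e.g. nothing was found
    "img3": pd.DataFrame({"grain": [None], "area": [None]}),  # all-NA frame
}

total_df = concat_non_empty(frames.values())
print(total_df.shape)
```

Only `img1` survives the filter here, so the result has the two rows from that frame.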
- I've made branch `maxgamill-sheffield/969-concat-issue` in which I've attempted to make a few fixes:
  - The `folder_<stats>.csv` was being overwritten by the dis and mol stats, so that has been modified to produce all folder stats.
  - The error / failed outputs of the better tracing pipeline now add the columns that would have been added had it succeeded.
  - Thought it might have been because of columns that are present in one but not the other due to failure, but alas nope.
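For anyone wanting to check the mismatched-columns idea on their own outputs, something like the following could list which columns each frame lacks relative to the rest. It is a sketch under assumptions: `column_differences`, the keys `ok` / `failed` and the `image` / `contour_length` columns are hypothetical and not taken from the TopoStats code.

```python
import pandas as pd


def column_differences(frames):
    """Map each frame name to the columns it lacks relative to the union of all columns.

    Hypothetical diagnostic helper for illustration only.
    """
    all_columns = set()
    for df in frames.values():
        all_columns.update(df.columns)
    return {
        name: sorted(all_columns - set(df.columns))
        for name, df in frames.items()
    }


# Made-up example: one successful trace and one failed trace with a missing column.
frames = {
    "ok": pd.DataFrame({"image": ["a"], "contour_length": [10.0]}),
    "failed": pd.DataFrame({"image": ["b"]}),
}

missing = column_differences(frames)
print(missing)
```

An empty list for every key would mean the frames all share the same columns, which would point away from this being the cause.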
topostats process --core 1
topostats --version
Both @MaxGamill-Sheffield and I keep running into the same concatenation warning when running `topostats process`. The warning occurs right at the end of processing and results in no `all_statistics.csv` being output. It looks as though the issue arises from pandas deprecating how empty or all-NA data frames are handled during concatenation, and the suggested resolution is to exclude these prior to concatenation.

Copy of the output:
