Closed citynorman closed 3 years ago
pyarrow can't save duplicate columns
To find out which column:
df.columns[df.columns.duplicated()] # or from collections import Counter Counter(df.columns).most_common()[:5]
To fix it:
df.rename(columns={'name':'name2'})
pyarrow can't save duplicate columns