NREL / polyID

https://polyid.nrel.gov
BSD 3-Clause "New" or "Revised" License
13 stars 4 forks source link

Fix bug when dataframe index is not unique #1

Closed jlaw9 closed 1 year ago

jlaw9 commented 1 year ago

If the index is not unique, then using join to combine the two dataframes creates all possible combinations of matching indexes. pd.concat does the expected here, which is a 1:1 mapping.

Since the index is based on the monomers hash, it's not unique across monomers with different distributions. In Mark's case, what was supposed to be 170 rows turns into 2270 rows.