Open ljn917 opened 4 years ago
Problem is line 68158 of chemical identifier.tsv
. PubChem gave a mismatched formula, weight, & smiles. (Don't expect accuracy from the govt). The smiles should be [Li+].[BH4-]
.
It gets better. PubChem has 5 separate "compound" entries, all claiming to be lithium borohydride:
Wikipedia cites No.3, as does the ChemSpider entry with the same CAS number.
(There are also duplicated sodium aluminum hydride entries, one showing net charge and the other formal charge.)
@CalebBell I recommend deleting the CID# 20722760 row altogether, and adding a row for CID# 4148881 with the CAS# 16949-15-8.
Separately, given the number of errors & duplicates in PubChem, a chemical identifiers duplicate.tsv
database should be created to alias the various duplicate CID's.
Hi,
It looks like the data for CAS# 16949-15-8 is not correct. As this shows, CAS# 16949-15-8 is LiBH4, but I got the following output. It looks like the hydrogens are dropped incorrectly.
Thanks