Closed: pengzhangzhi closed this issue 1 year ago
I found that biopandas (PandasPdb) also does not support the pdb1.gz format...
Hi @pengzhangzhi, thanks for raising this issue. It's a good spot, and the format should be supported. I'd suggest opening another issue with biopandas, as support will need to be added there first.
Re: why this format exists: it is used to distinguish biological assemblies. The number in the extension (the 1 in pdb1) is the assembly number.
aha, thanks :)
Hi,
I found an interesting naming convention in PDB that causes a bug in graphein. When I download files from PDB, e.g., 6mhu.pdb1.gz, the name ends with pdb1.gz, and graphein cannot read it because it only accepts pdb.gz. https://github.com/a-r-j/graphein/blob/77a4d9ab90dd525876766e6d5b88f0bb7ac10274/graphein/protein/graphs.py#L103 However, the two are the same file format. I'm just curious why PDB uses this naming scheme. I also suggest supporting this format, and I am happy to submit a PR for that. Below is the code to download the PDB files.
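Separately from the download code, here is a minimal sketch of a possible workaround until the extension is supported upstream. It copies a pdbN or pdbN.gz file to a plain .pdb file, decompressing it if needed. The helper name normalise_assembly_file, the regular expression, and the output naming scheme are all assumptions made for illustration. None of them are part of graphein or biopandas.

```python
import gzip
import re
import shutil
from pathlib import Path

# Matches names such as "6mhu.pdb1" or "6mhu.pdb1.gz". The digits after
# "pdb" are taken to be the biological assembly number.
ASSEMBLY_PATTERN = re.compile(
    r"^(?P<stem>.+)\.pdb(?P<assembly>\d+)(?P<gz>\.gz)?$"
)


def normalise_assembly_file(path, out_dir=None):
    """Copy an assembly file to a plain .pdb file and return the new path.

    Hypothetical helper: the output name "<stem>_assembly<N>.pdb" is an
    arbitrary choice made for this sketch.
    """
    path = Path(path)
    match = ASSEMBLY_PATTERN.match(path.name)
    if match is None:
        raise ValueError(f"{path.name} is not a .pdbN or .pdbN.gz file")

    out_dir = Path(out_dir) if out_dir is not None else path.parent
    out_path = out_dir / f"{match['stem']}_assembly{match['assembly']}.pdb"

    # Decompress on the fly when the source is gzipped; otherwise copy as-is.
    opener = gzip.open if match["gz"] else open
    with opener(path, "rb") as src, open(out_path, "wb") as dst:
        shutil.copyfileobj(src, dst)
    return out_path
```

For example, normalise_assembly_file("6mhu.pdb1.gz") would write 6mhu_assembly1.pdb next to the input. A reader that only checks for a .pdb extension should then be able to open it. Keeping the assembly number in the file name avoids overwriting the asymmetric-unit file 6mhu.pdb if both are stored in the same directory.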