Closed shawngraham closed 3 years ago
Same problem here
Replicated.
A temporary workaround is to wrap from_pub = get_metadata(e) in a try/except so that entries which fail to parse are skipped. It seems to be a problem with the parsing of specific links.
    for e in html.find('#gs_res_ccl_mid .gs_r'):
        try:
            from_pub = get_metadata(e)
        except Exception as err:
            print(err)
            continue
        yield from_pub, to_pub
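For anyone who wants to try the skip-on-error pattern outside of etudier, here is a minimal self-contained sketch. Everything in it is an assumption for illustration: `get_metadata_stub`, `iter_parsed`, and the sample dictionaries are hypothetical stand-ins, not etudier's real code or data. It also shows that printing a `KeyError` for a missing key gives just the quoted key name, which is at least consistent with the bare 'data-cid' message reported in this thread (I have not confirmed that this is the actual exception type raised).

```python
def get_metadata_stub(entry):
    # Hypothetical stand-in for get_metadata: raises KeyError
    # when the entry has no 'data-cid' key.
    return {"id": entry["data-cid"], "title": entry.get("title", "")}


def iter_parsed(entries, skipped):
    # Yield parsed entries; record and skip the ones that fail to parse.
    for entry in entries:
        try:
            pub = get_metadata_stub(entry)
        except KeyError as err:
            skipped.append(str(err))  # str(KeyError('data-cid')) is "'data-cid'"
            continue
        yield pub


# Made-up sample data: the second entry lacks 'data-cid'.
entries = [
    {"data-cid": "abc", "title": "First"},
    {"title": "No id"},
    {"data-cid": "def", "title": "Third"},
]
skipped = []
parsed = list(iter_parsed(entries, skipped))
print(parsed)
print(skipped)
```

Catching `KeyError` specifically, rather than a bare `Exception`, keeps unrelated bugs visible. Note that this only skips bad entries; it does not fix whatever causes them to be missing the attribute.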
@wolfiex I tried the try/except suggestion without success. I only get a ten-node .gexf output file, and the script stops with 'data-cid'...
Ah, that's a shame. I'm not sure what to suggest, as it fixed the problem in my case - sorry.
Could someone help me understand where etudier saves its output files?
I think the data-cid problem was fixed in v0.0.8. Sorry for not addressing this issue sooner!
Not a useful issue title, I know, but...
caught exception 'data-cid'
...all of which probably points to me doing something weird, but what?