edsu / etudier

Extract a citation network from Google Scholar
161 stars 27 forks source link

ah weirdness #9

Closed shawngraham closed 3 years ago

shawngraham commented 5 years ago

Not a useful issue title, I know, but...

...all of which probably points to me doing something weird, but what?

okcgw commented 5 years ago

Same problem here

wolfiex commented 4 years ago

replicated.

wolfiex commented 4 years ago

Temporary solution by wrapping from_pub = get_metadata(e) with a try and except. Seems to be a problem with the parsing of specific links.

    for e in html.find('#gs_res_ccl_mid .gs_r'):

        try:
            from_pub = get_metadata(e)
        except Exception as err:
            print(err)
            continue

        yield from_pub, to_pub
hls1138 commented 4 years ago

@wolfiex I tried the "try and except" suggestion without success. I only get a ten nodes .gexf output file and the script stops with 'data-cid'...

wolfiex commented 4 years ago

@wolfiex I tried the "try and except" suggestion without success. I only get a ten nodes .gexf output file and the script stops with 'data-cid'...

Ah that's a shame, I'm not sure what to suggest as it had fixed it in my instance - sorry.

alonasorochynska commented 4 years ago

Please help me to understand where etudier is saving output files?

edsu commented 3 years ago

I think the data-cid problem was fixed in v0.0.8. Sorry for not addressing this issue sooner!