allenai / scholarphi

An interactive PDF reader.
Apache License 2.0
420 stars 54 forks source link

[bug] 404 not found when download model #331

Open TITC opened 3 years ago

TITC commented 3 years ago

when I execute the command detect-definition, this part code try to download model https://scholarphi.s3-us-west-1.amazonaws.com/joint_symnick_abbrexp_termdef.zip but failure.

2021-10-18 21:10:10,067 [INFO]: Launching command detect-definitions
2021-10-18 21:10:10,512 [DEBUG]: Loading Spacy models (this may take some time).
2021-10-18 21:10:55,311 [DEBUG]: Created cache directory for models at ./cache/DocDef2+AI2020+W00_model
2021-10-18 21:11:12,084 [DEBUG]: Downloading model from https://scholarphi.s3-us-west-1.amazonaws.com/joint_symnick_abbrexp_termdef.zip. Warning: this will take a long time.
2021-10-18 21:12:17,923 [ERROR]: Unexpected exception processing papers: ['0801.4750']
Traceback (most recent call last):
  File "/yuhang/draft/Lab/scholarphi/data-processing/scripts/run_pipeline.py", line 93, in run_commands_for_arxiv_ids
    run_command(command)
  File "/yuhang/draft/Lab/scholarphi/data-processing/scripts/commands.py", line 95, in run_command
    for result in cmd.process(item):
  File "/yuhang/draft/Lab/scholarphi/data-processing/entities/definitions/commands/detect_definitions.py", line 609, in process
    model = DefinitionDetectionModel(prediction_type)
  File "/yuhang/draft/Lab/scholarphi/data-processing/entities/definitions/nlp.py", line 93, in __init__
    urllib.request.urlretrieve(
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 247, in urlretrieve
    with contextlib.closing(urlopen(url, data)) as fp:
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 222, in urlopen
    return opener.open(url, data, timeout)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 531, in open
    response = meth(req, response)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 640, in http_response
    response = self.parent.error(
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 569, in error
    return self._call_chain(*args)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 502, in _call_chain
    result = func(*args)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 649, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 404: Not Found
Traceback (most recent call last):
  File "/opt/conda/envs/scholarphi/lib/python3.8/runpy.py", line 194, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/opt/conda/envs/scholarphi/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/root/.vscode-server/extensions/ms-python.python-2021.10.1336267007/pythonFiles/lib/python/debugpy/__main__.py", line 45, in <module>
    cli.main()
  File "/root/.vscode-server/extensions/ms-python.python-2021.10.1336267007/pythonFiles/lib/python/debugpy/../debugpy/server/cli.py", line 444, in main
    run()
  File "/root/.vscode-server/extensions/ms-python.python-2021.10.1336267007/pythonFiles/lib/python/debugpy/../debugpy/server/cli.py", line 285, in run_file
    runpy.run_path(target_as_str, run_name=compat.force_str("__main__"))
  File "/opt/conda/envs/scholarphi/lib/python3.8/runpy.py", line 265, in run_path
    return _run_module_code(code, init_globals, run_name,
  File "/opt/conda/envs/scholarphi/lib/python3.8/runpy.py", line 97, in _run_module_code
    _run_code(code, mod_globals, init_globals,
  File "/opt/conda/envs/scholarphi/lib/python3.8/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/yuhang/draft/Lab/scholarphi/data-processing/scripts/run_pipeline.py", line 506, in <module>
    digest_for_papers = run_commands_for_arxiv_ids(
  File "/yuhang/draft/Lab/scholarphi/data-processing/scripts/run_pipeline.py", line 100, in run_commands_for_arxiv_ids
    raise exc
  File "/yuhang/draft/Lab/scholarphi/data-processing/scripts/run_pipeline.py", line 93, in run_commands_for_arxiv_ids
    run_command(command)
  File "/yuhang/draft/Lab/scholarphi/data-processing/scripts/commands.py", line 95, in run_command
    for result in cmd.process(item):
  File "/yuhang/draft/Lab/scholarphi/data-processing/entities/definitions/commands/detect_definitions.py", line 609, in process
    model = DefinitionDetectionModel(prediction_type)
  File "/yuhang/draft/Lab/scholarphi/data-processing/entities/definitions/nlp.py", line 93, in __init__
    urllib.request.urlretrieve(
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 247, in urlretrieve
    with contextlib.closing(urlopen(url, data)) as fp:
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 222, in urlopen
    return opener.open(url, data, timeout)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 531, in open
    response = meth(req, response)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 640, in http_response
    response = self.parent.error(
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 569, in error
    return self._call_chain(*args)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 502, in _call_chain
    result = func(*args)
  File "/opt/conda/envs/scholarphi/lib/python3.8/urllib/request.py", line 649, in http_error_default
    raise HTTPError(req.full_url, code, msg, hdrs, fp)
urllib.error.HTTPError: HTTP Error 404: Not Found
2021-10-18 21:12:31,976 [INFO]: Internal process exited

I have tried to access it through a web browser and got the same result. image

How to solve it?

TITC commented 3 years ago

I found related info at here, it seems not everyone can access this URL? @ca16 @andrewhead


here is a history version and not successful by GitHub checks. @nuwandavek image