Open joshuacox opened 5 months ago
I am facing the same issue as well
Hi @joshuacox, can you run python run_first.py and verify whether the same error pops up? I can run it without errors on a fresh clone.
@SubramanyamChalla24 I am running docker compose up -d, and once python run_first.py starts executing it gives me the same error. Can you help?
@amritajain13, did you try running python run_first.py?
python run_first.py
Traceback (most recent call last):
File "/unreal/gpu/Resume-Matcher/run_first.py", line 5, in <module>
from scripts import JobDescriptionProcessor, ResumeProcessor
File "/unreal/gpu/Resume-Matcher/scripts/__init__.py", line 2, in <module>
from .JobDescriptionProcessor import JobDescriptionProcessor
File "/unreal/gpu/Resume-Matcher/scripts/JobDescriptionProcessor.py", line 5, in <module>
from .parsers import ParseJobDesc, ParseResume
File "/unreal/gpu/Resume-Matcher/scripts/parsers/__init__.py", line 1, in <module>
from .ParseJobDescToJson import ParseJobDesc
File "/unreal/gpu/Resume-Matcher/scripts/parsers/ParseJobDescToJson.py", line 5, in <module>
from scripts.Extractor import DataExtractor
File "/unreal/gpu/Resume-Matcher/scripts/Extractor.py", line 6, in <module>
from .utils import TextCleaner
File "/unreal/gpu/Resume-Matcher/scripts/utils/__init__.py", line 3, in <module>
from .Utils import TextCleaner
File "/unreal/gpu/Resume-Matcher/scripts/utils/Utils.py", line 7, in <module>
nlp = spacy.load("en_core_web_md")
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/nix/store/8msv6rh44z033csqkg2r3fa2j21m92px-python3-3.11.9-env/lib/python3.11/site-packages/spacy/__init__.py", line 51, in load
return util.load_model(
^^^^^^^^^^^^^^^^
File "/nix/store/8msv6rh44z033csqkg2r3fa2j21m92px-python3-3.11.9-env/lib/python3.11/site-packages/spacy/util.py", line 472, in load_model
raise IOError(Errors.E050.format(name=name))
OSError: [E050] Can't find model 'en_core_web_md'. It doesn't seem to be a Python package or a valid path to a data directory.
I am in NixOS, which is one of the reasons I am trying to run it inside of docker to try it out.
and there are no files named 'en_cor*' in my fresh clone.
find . -iname 'en_core*'
@joshuacox can you install the spaCy English model with the command python -m spacy download en_core_web_sm and retry?
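Note that the traceback above names en_core_web_md, while this command downloads en_core_web_sm. A quick way to see which of the two is actually importable in a given environment (host or container) is sketched below. It uses only the standard library; the function name installed_spacy_models is hypothetical and not part of Resume-Matcher or spaCy.

```python
import importlib.util


def installed_spacy_models(candidates=("en_core_web_md", "en_core_web_sm")):
    """Return the candidate model packages that are importable here.

    spaCy models installed via `python -m spacy download` are regular
    Python packages, so find_spec() can detect them without loading them.
    """
    return [name for name in candidates
            if importlib.util.find_spec(name) is not None]


if __name__ == "__main__":
    found = installed_spacy_models()
    print("installed models:", found if found else "none")
```

Running this inside the container as well as on the host would show whether the two environments differ.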
bash➜ python
Python 3.11.9 (main, Apr 2 2024, 08:25:04) [GCC 13.2.0] on linux
Type "help", "copyright", "credits" or "license" for more information.
>>> import spacy
>>> nlp = spacy.load("en_core_web_sm")
>>> import en_core_web_sm
>>> doc = nlp("This is a sentence.")
>>> print([(w.text, w.pos_) for w in doc])
[('This', 'PRON'), ('is', 'AUX'), ('a', 'DET'), ('sentence', 'NOUN'), ('.', 'PUNCT')]
but that is not inside the container, which apparently has not downloaded the model?
It's included in the requirements.txt for the Docker container. It seems to be an issue; can you share more details about it? Or, if you can join Discord, we can discuss it there.
Had the same issue when switching to docker compose. Manually adding the Processed folder as a mounted volume fixed this for me, kind of:
- ./Data/Processed:/data/Resume-Matcher/Data/Processed
@dieideeistgut yeah, this docker compose file is completely broken without that volume. @SubramanyamChalla24, does it work for you?
There are some proposed changes, but unfortunately this is still broken. I left an ls -alh Data/Resumes; sleep 5 statement in the Dockerfile to show that the Resumes folder is indeed in there and populated with resumes.
@joshuacox @shredinjohn @amritajain13 @dieideeistgut
The issue is that the repo is missing 3 directories.
Once they are created, everything works flawlessly. My pull request is currently awaiting approval.
I've also fixed the relative pathing (which might mitigate possible Docker issues) and added a traceback to the error we were all getting, which should give us a hint about what's going on.
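The pull request itself is not shown in this thread, so the following is only a sketch of one common way to make such paths independent of the working directory: anchor them to the script's own location. The helper name project_path and its anchor parameter are hypothetical.

```python
from pathlib import Path


def project_path(*parts, anchor=None):
    """Join parts onto the directory that contains `anchor`.

    By default the anchor is this file, so the result does not depend on
    the directory the script was launched from (e.g. a Docker WORKDIR).
    """
    base = Path(anchor if anchor is not None else __file__).resolve().parent
    return base.joinpath(*parts)


if __name__ == "__main__":
    # Directory name taken from the FileNotFoundError in this thread.
    print(project_path("Data", "Processed", "Resumes"))
```

With this approach, python run_first.py and python /some/dir/run_first.py would resolve the same data folders.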
It's not hard to fix. Add the missing directories pointed out by P4jMepR. Inside Processed:
Then run python run_first.py
and then start the UI
@nanafy I have specifically mentioned that this is inside of docker compose
@joshuacox clone my pull and try to compose it again. It should work just fine.
@P4jMepR I get the same results using your PR
0.163 /usr/local/lib/python3.11/site-packages/pypdf/_crypt_providers/_cryptography.py:32: CryptographyDeprecationWarning: ARC4 has been moved to cryptography.hazmat.decrepit.ciphers.algorithms.ARC4 and will be removed from this module in 48.0.0.
0.163 from cryptography.hazmat.primitives.ciphers.algorithms import AES, ARC4
1.916 2024-07-31 11:24:11,846 (run_first.py:44) - INFO: Started to read from Data/Resumes
1.916 2024-07-31 11:24:11,846 (run_first.py:54) - ERROR: There are no resumes present in the specified folder.
1.916 2024-07-31 11:24:11,846 (run_first.py:55) - ERROR: Exiting from the program.
1.916 2024-07-31 11:24:11,846 (run_first.py:56) - ERROR: Please add resumes in the Data/Resumes folder and try again.
1.916 2024-07-31 11:24:11,847 (run_first.py:57) - ERROR: Traceback (most recent call last):
1.916 File "/data/Resume-Matcher/run_first.py", line 48, in <module>
1.916 remove_old_files(PROCESSED_RESUMES_PATH)
1.916 File "/data/Resume-Matcher/run_first.py", line 32, in remove_old_files
1.916 for filename in os.listdir(files_path):
1.916 ^^^^^^^^^^^^^^^^^^^^^^
1.916 FileNotFoundError: [Errno 2] No such file or directory: '/data/Resume-Matcher/Data/Processed/Resumes'
1.916
------
failed to solve: process "/bin/sh -c python run_first.py" did not complete successfully: exit code: 1
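The log shows os.listdir raising FileNotFoundError inside remove_old_files because Data/Processed/Resumes does not exist in the image. Only the for loop of that function appears in the traceback, so the version below is a hypothetical defensive variant, not the project's actual code: it creates the directory first and then clears any regular files in it.

```python
import os


def remove_old_files(files_path):
    """Delete regular files in files_path, creating the directory if absent.

    Returns the number of files removed. Subdirectories are left alone.
    """
    # On a fresh clone the directory may be missing; create it so that
    # os.listdir() below cannot raise FileNotFoundError.
    os.makedirs(files_path, exist_ok=True)
    removed = 0
    for filename in os.listdir(files_path):
        file_path = os.path.join(files_path, filename)
        if os.path.isfile(file_path):
            os.remove(file_path)
            removed += 1
    return removed
```

Creating the directory in code means the build no longer depends on empty folders being tracked in the repository or mounted as volumes.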
Same here, still getting the error with the latest repo.
There are no resumes present in the specified folder. Yet the defaults are there.
I did try adding my resume in there (josh_cox.pdf), and creating the Processed directory in there too (as mentioned in another issue), with no change.
To Reproduce
Steps to reproduce the behavior:
docker compose up
Expected behavior
app to serve