masnaashraf / Resume-parser

An NLP project for parsing resumes using the spaCy library and a Hugging Face model.

Overview

Resume Parser

The Resume Matcher is a Python-based project designed to match job descriptions with candidate resumes using natural language processing techniques. This tool aids in identifying the most suitable candidates for specific job roles by calculating the similarity between job descriptions and candidate resumes.

The project is divided into four main steps:

Data Collection:

Text Preprocessing and Tokenization:

- Preprocess and tokenize both resumes and job descriptions.
- Ensure consistent text formatting and language handling.
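As a rough illustration of this step, the sketch below normalizes and tokenizes a piece of text using only the standard library. The function name `preprocess` and the exact cleaning rules are assumptions made for this example and are not taken from the project code, which may rely on spaCy instead. The regular expression also assumes English-language text, so letters outside `a-z` are dropped.

```python
import re
import unicodedata


def preprocess(text: str) -> list[str]:
    """Hypothetical helper: normalize text and split it into lowercase tokens."""
    # Normalize Unicode forms so ligatures and full-width characters
    # map to their plain equivalents.
    text = unicodedata.normalize("NFKC", text)
    text = text.lower()
    # Replace every run of characters other than a-z, 0-9, '+' and '#'
    # with a single space. Keeping '+' and '#' preserves tokens such as
    # "c++" and "c#" (an assumption suited to English tech resumes).
    text = re.sub(r"[^a-z0-9+#]+", " ", text)
    return text.split()


tokens = preprocess("Senior Python Developer, C++ & NLP!")
print(tokens)
```

Applying the same function to resumes and job descriptions keeps both sides in a consistent format before embeddings are computed.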

Word Embedding Extraction:

- Generate word embeddings for both resumes and job descriptions.
- Utilize advanced models like DistilBERT for embeddings.
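One common way to turn DistilBERT token embeddings into a single document vector is masked mean pooling. The sketch below shows that approach under several assumptions: the checkpoint name `distilbert-base-uncased`, the 512-token truncation limit, and the helper names `mean_pool` and `embed` are all chosen for illustration, and the project itself may pool or truncate differently. The `embed` function needs the `torch` and `transformers` packages and downloads the model on first use.

```python
def mean_pool(token_vectors, attention_mask):
    """Average the token vectors whose attention-mask entry is non-zero."""
    kept = [vec for vec, keep in zip(token_vectors, attention_mask) if keep]
    if not kept:
        raise ValueError("no unmasked tokens to pool")
    dim = len(kept[0])
    return [sum(vec[i] for vec in kept) / len(kept) for i in range(dim)]


def embed(text, model_name="distilbert-base-uncased"):
    """Hypothetical helper: return one embedding vector for a text."""
    # Imported lazily so mean_pool can be used without these packages.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    # Truncate long documents to the model's maximum input length (assumed 512).
    inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
    with torch.no_grad():
        hidden = model(**inputs).last_hidden_state  # shape: (1, seq_len, hidden)

    return mean_pool(hidden[0].tolist(), inputs["attention_mask"][0].tolist())
```

In practice the tokenizer and model would be loaded once and reused across documents rather than reloaded on every call; they are loaded inside `embed` here only to keep the sketch short.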

Resume Matching:

- Calculate cosine similarity between job descriptions and resumes.
- Rank CVs based on similarity scores and list the top candidates.
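The matching step can be sketched as follows. Cosine similarity is the dot product of two vectors divided by the product of their lengths, so a score near 1 means the resume and job description embeddings point in nearly the same direction. The names `cosine_similarity` and `rank_resumes`, the `top_k` parameter, and the toy two-dimensional vectors are assumptions for this example only.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # Treat an all-zero vector as having no similarity to anything.
        return 0.0
    return dot / (norm_a * norm_b)


def rank_resumes(job_vec, resume_vecs, top_k=5):
    """Return up to top_k (name, score) pairs, highest similarity first."""
    scored = [
        (name, cosine_similarity(job_vec, vec))
        for name, vec in resume_vecs.items()
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


# Toy vectors standing in for real embeddings.
job = [1.0, 0.0]
resumes = {"cv_a": [1.0, 0.0], "cv_b": [0.0, 1.0], "cv_c": [1.0, 1.0]}
for name, score in rank_resumes(job, resumes, top_k=2):
    print(name, round(score, 3))
```

Because cosine similarity ignores vector length, a long resume is not favored over a short one simply for containing more text.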

Project Structure

The project directory structure is organized as follows:

Results

The matching results are stored in the output/ directory in JSON format. You can explore the top candidates for each job description based on similarity scores.
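To explore the results programmatically, a minimal sketch like the one below loads every JSON file from the results directory. The helper name `load_results` is hypothetical, and because the layout of the JSON files is not described here, the code makes no assumption about their fields and simply returns the parsed content keyed by file name.

```python
import json
from pathlib import Path


def load_results(output_dir="output"):
    """Hypothetical helper: parse every *.json file in the results directory."""
    results = {}
    for path in sorted(Path(output_dir).glob("*.json")):
        with path.open(encoding="utf-8") as handle:
            results[path.name] = json.load(handle)
    return results


if __name__ == "__main__":
    for file_name, content in load_results().items():
        print(file_name, type(content).__name__)
```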