AnasAito / SkillNER

A (smart) rule based NLP module to extract job skills from text
https://skillner.vercel.app/
MIT License
131 stars 47 forks source link

Is it possible to extract all the skill names into a list? as well as if it's a hard, soft skill or certification? #70

Open shuchangliang opened 1 year ago

shuchangliang commented 1 year ago

Is it possible to extract all the skill names into a list? as well as if it's a hard, soft skill or certification?

annotations = skill_extractor.annotate(job_description) skill_extractor.describe(annotations)

these 2 lines of codes will annotate the texts visually, and I tried to use the followings to extract all the skills into a list: doc_node_values = list(set([entry['doc_node_value'] for entry in annotations['results']['ngram_scored']])) print(doc_node_values)

is there a way to also extract the skill label? (e.g., soft, hard, certification etc.)?

thank you

steinathan commented 6 months ago

Asking this too

i saw that theres a

SKILL_TO_COLOR = {
    'Hard Skill': '#818CF8',
    'Soft Skill': '#F472B6',
    'Certification': "#552448"
}

but dont know how to unwrap the skill_id eg. KS1282M68WL9T4YH3SLF to output the skill name and other info