Skip to content

rojack96/cip

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CIP Curriculm Italian Parser

Curriculum Italian Parser

A simple curriculum parser for italian CV

from cip import Cip

r = Cip("it")
    
result = r.read_file_pdf('/path/to/pdf/file')
print(result)

result

{
    "nome": "Rojack",
    "data_di_nascita": "01/01/1970 (53)",
    "residenza": [
        "Silicon Valley"
    ],
    "email": "[email protected]",
    "telefono": "(+39) 321 123 12 32",
    "universita": true,
    "linguaggi_di_programmazione": [
        "Angular",
        "Python"
    ],
    "soft_skill": null,
    "ruolo": [
        "Backend",
        "Junior"
    ],
    "altro": null
}

If use Cip() return the name of key in english (ex. nome -> name)

{
    "name": "Rojack",
    "date_of_birth": "01/01/1970 (53)",
    "location": [
        "Silicon Valley"
    ],
    "email": "[email protected]",
    "phone": "(+39) 321 123 12 32",
    ...
    ...
}

The keys linguaggi_di_programmazione, soft_skill, ruolo and altro return an array of word matched by relative keys in config.json file.

About

Curriculm Italian Parser

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages