File size: 4,768 Bytes
aed5dfc 135096f 69d3ad3 aed5dfc 673c126 69d3ad3 aed5dfc 673c126 aed5dfc c0bf361 9bb90ac c0bf361 e697a24 aed5dfc e697a24 64a9914 ac17402 bde2b28 2485d30 8448181 2485d30 96a133c 2485d30 aed5dfc e697a24 aed5dfc 8448181 2485d30 8852942 2485d30 8852942 69d3ad3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 |
---
widget:
- text: |2-
John Doe
123 Main Street, Cityville, CA 12345
[email protected]
(555) 123-4567
linkedin.com/in/johndoe
Professional Summary
Experienced and results-driven Data Scientist with a strong background in statistical analysis, machine learning, and data visualization. Proven track record of delivering actionable insights and driving data-driven decision-making processes. Adept at leveraging advanced analytics to solve complex business problems.
Education
Master of Science in Data Science
ABC University, Cityville, CA
May 2021
Bachelor of Science in Computer Science
XYZ University, Townsville, CA
Graduation Date: May 2018
Professional Experience
Data Scientist | Tech Innovators Inc., Cityville, CA | June 2021 - Present
Lead data analysis projects, extracting valuable insights to inform business strategies.
Develop and deploy machine learning models to optimize key processes, resulting in a 15% increase in efficiency.
Collaborate with cross-functional teams to design and implement data-driven solutions.
Utilize Python, R, and SQL for data extraction, transformation, and analysis.
Create compelling data visualizations to communicate findings to non-technical stakeholders.
Data Analyst | Data Solutions Co., Townsville, CA | January 2019 - May 2021
Conducted exploratory data analysis to identify trends, patterns, and anomalies.
Implemented data cleaning and preprocessing techniques to ensure data quality.
Produced comprehensive reports and dashboards, aiding in executive decision-making.
Collaborated with business units to define and refine analytical requirements.
Skills
Programming Languages: Python, R
Data Analysis Tools: Pandas, NumPy
Machine Learning: Scikit-Learn, TensorFlow, Keras
Database Management: SQL
Data Visualization: Matplotlib, Seaborn
Statistical Analysis: Hypothesis testing, Regression analysis
Communication: Strong written and verbal communication skills
Certifications
Certified Data Scientist (CDS)
Machine Learning Specialist Certification
tags:
- spacy
- token-classification
- cv
- resume parsing
- resume extraction
- named entity recognition
- resume
language:
- en
model-index:
- name: en_cv_info_extr
results:
- task:
name: NER
type: token-classification
metrics:
- name: NER Precision
type: precision
value: 0.8333333333
- name: NER Recall
type: recall
value: 0.8067729084
- name: NER F Score
type: f_score
value: 0.8198380567
library_name: spacy
pipeline_tag: token-classification
---
# Information extraction from Resumes/CVs written in English
### Model Description
This model is designed for information extraction from resumes/CVs written in English. It employs a transformer-based architecture with spaCy for named entity recognition (NER) tasks. The model aims to parse various sections of resumes, including personal details, education history, professional experience, skills, and certifications, enabling users to extract structured information for further processing or analysis.
### Model Details
| Feature | Description |
| --- | --- |
| `Language` | English |
| `Task` | Named Entity Recognition (NER) |
| `Objective` | Information extraction from resumes/CVs |
| `Spacy Components` | Transformer, Named Entity Recognition (NER) |
| `Author` | [Youssef Chafiqui](https://huggingface.co/ychafiqui) |
### NER Entities
The model recognizes various entities corresponding to different sections of a resume. Below are the entities used by the model:
| Label | Description |
| --- | --- |
| 'FNAME' | First name |
| 'LNAME' | Last name |
| 'ADDRESS' | Address |
| 'CERTIFICATION' | Certification |
| 'EDUCATION' | Education section |
| 'EMAIL' | Email address |
| 'EXPERIENCE' | Experience section |
| 'HOBBY' | Hobby |
| 'HSKILL' | Hard skill |
| 'LANGUAGE' | Language |
| 'PHONE' | Phone number |
| 'PROFILE' | Profile |
| 'PROJECT' | Project section |
| 'SSKILL' | Soft skill |
### Evaluation Metrics
| Type | Score |
| --- | --- |
| `F1 score` | 81.98 |
| `Precision` | 83.33 |
| `Recall` | 80.68 |
## Usage
### Presequities
Install spaCy library
```bash
pip install spacy
```
Install Transformers library
```bash
pip install transformers
```
Download the model
```bash
pip install https://huggingface.co/ychafiqui/en_cv_info_extr/resolve/main/en_cv_info_extr-any-py3-none-any.whl
```
### Load the model
```python
import spacy
nlp = spacy.load("en_cv_info_extr")
```
### Inference using the model
```python
doc = nlp('put your resume here')
for ent in doc.ents:
print(ent.text, "-", ent.label_)
``` |