Problem Statement

The goal of this study is to evaluate the potential biases in a proprietary CV parser, which automatically extracts skills from anonymized, raw resumes.

Note: We did not have access to the proprietary parser; accordingly our analysis was carried out only on the parsing process’s input (the raw CV text) and its output (the skills extracted by the proprietary parser).

Specifically, we aim to answer:

  1. Candidate Representation:
  1. Gendered Skill Association:
  1. Cultural and Geographical Bias:
  1. Hard vs. Soft Skills Distribution:
  1. Demographic Underrepresentation: