Mining Career Paths from Large Resume Databases-Reference-Cited by-同舟云学术

Mining Career Paths from Large Resume Databases

Published:2020-06-30 Issue:3 Volume:14 Page:1-38
ISSN:1556-4681
Container-title:ACM Transactions on Knowledge Discovery from Data
language:en
Short-container-title:ACM Trans. Knowl. Discov. Data

Author:

Lappas Theodoros¹^ORCID

Affiliation:

1. Stevens Institute of Technology, Hoboken, New Jersey

Abstract

The emergence of online professional platforms, such as LinkedIn and Indeed, has led to unprecedented volumes of rich resume data that have revolutionized the study of careers. One of the most prevalent problems in this space is the extraction of prototype career paths from a workforce. Previous research has consistently relied on a two-step approach to tackle this problem. The first step computes the pairwise distances between all the career sequences in the database. The second step uses the distance matrix to create clusters, with each cluster representing a different prototype path. As we demonstrate in this work, this approach faces two significant challenges when applied on large resume databases. First, the overwhelming diversity of job titles in the modern workforce prevents the accurate evaluation of distance between career sequences. Second, the clustering step of the standard approach leads to highly heterogeneous clusters, due to its inability to handle categorical sequences and sensitivity to outliers. This leads to non-representative centroids and spurious prototype paths that do not accurately represent the actual groups in the workforce. Our work addresses these two challenges and has practical implications for the numerous researchers and practitioners working on the analysis of career data across domains.

Publisher

Association for Computing Machinery (ACM)

Subject

General Computer Science

Link

https://dl.acm.org/doi/pdf/10.1145/3379984

Reference114 articles.

1. Measuring Resemblance in Sequence Data: An Optimal Matching Analysis of Musicians' Careers

2. Sequence Analysis and Optimal Matching Methods in Sociology

3. Industry or Academia, Basic or Applied? Career Choices and Earnings Trajectories of Scientists

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Analysis of CEO career patterns using machine learning: taking US university graduates as an example;Data Technologies and Applications;2024-08-02

2. The Impact of a Skill-Driven Model on Scrum Teams in Software Projects: A Catalyst for Digital Transformation;Systems;2024-04-26

3. CareerMiner: Automatic extraction of professional network from large Chinese resume data;Franklin Open;2024-03

4. Measuring employer attractiveness in diverse talent markets;Decision Support Systems;2024-02

5. Towards a Better Characterization of Career Paths: Sequential Job Embedding and Mixture Markov Models;SSRN Electronic Journal;2022