When Automatic Filtering Comes to the Rescue: Pre-Computing Company Competitor Pairs in Owler

Authors:

Guo Jinsong¹, Jami Aditya², Kröll Markus², Schweizer Lukas², Paramonov Sergey², Aichinger Eric², Sferrazza Stefano², Scaccia Mattia², Reissfelder Stéphane², Cicek Eda³, Grasso Giovanni⁴, Gottlob Georg¹

Affiliations:

1. University of Oxford & Ratiolytics, Oxford, United Kingdom

2. Meltwater, San Francisco, CA, USA

3. TU Wien, Vienna, Austria

4. University of Calabria, Rende, Italy

Abstract

Competitor data is highly valuable for many business applications. Meltwater gives users access to Owler, a large Company Information System (CIS) that contains competitor pairs and other useful information about companies. Meltwater has been seeking a practical way to discover more competitor pairs in Owler. The first attempt, a fully manual workflow called MW_Manual, consisted of two manual steps: a filtering step that excludes obvious non-competitor company pairs, and an inspection step that examines each company pair remaining after filtering. MW_Manual was cost-prohibitive because the output of the filtering step still contained too many non-competitor pairs, and inspecting those pairs added overhead to the overall workload. To reduce the manual workload, especially the human effort required in the inspection step, Meltwater transformed MW_Manual into a semi-automatic workflow called MW_CPFilter by replacing the manual filtering with an automatic and more precise process based on a system called CPFilter. This paper presents CPFilter. CPFilter automatically pre-computes likely competitor pairs from existing competitor pairs in Owler. It combines (i) the generation of new candidate competitor pairs by inference from existing competitors and other company-specific knowledge, with (ii) the validation of each candidate pair by checking whether empirical evidence of a competitor relationship between the two companies can be found.
CPFilter has three key advantages over the manual filtering process and previous work: (i) it achieves a high workload reduction rate of 0.81, (ii) it is domain-independent, so it can be applied to different sectors in Owler, and (iii) its results are explainable, so humans can easily understand them.
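The generate-then-validate idea described above can be illustrated with a minimal Python sketch. It is not CPFilter's actual implementation: the inference rule (two companies that share a known competitor and a sector become a candidate pair), the function names `generate_candidates` and `validate`, the evidence lookup, and all company names are assumptions made only for illustration.

```python
from itertools import combinations


def generate_candidates(known_pairs, sector_of):
    """Propose candidate pairs (assumed rule, not the paper's):
    two companies that share a known competitor and the same sector."""
    known = {frozenset(p) for p in known_pairs}
    competitors = {}
    for a, b in known_pairs:
        competitors.setdefault(a, set()).add(b)
        competitors.setdefault(b, set()).add(a)

    candidates = set()
    for rivals in competitors.values():
        for a, c in combinations(sorted(rivals), 2):
            pair = frozenset((a, c))
            same_sector = (
                sector_of.get(a) is not None
                and sector_of.get(a) == sector_of.get(c)
            )
            if pair not in known and same_sector:
                candidates.add(pair)
    return candidates


def validate(candidates, evidence):
    """Keep candidates that have at least one piece of evidence.
    Returning the evidence alongside each pair keeps the result explainable."""
    return {pair: evidence[pair] for pair in candidates if evidence.get(pair)}


# Hypothetical toy data; none of it comes from Owler.
known_pairs = [("Acme", "Bolt"), ("Bolt", "Crux"), ("Bolt", "Dune")]
sector_of = {"Acme": "software", "Bolt": "software",
             "Crux": "software", "Dune": "retail"}
evidence = {frozenset(("Acme", "Crux")): ["hypothetical article naming both as rivals"]}

candidates = generate_candidates(known_pairs, sector_of)
validated = validate(candidates, evidence)
```

In this toy run only the pair Acme/Crux survives both steps: Dune shares a competitor with the others but sits in a different sector, so it is never proposed. Only the validated pairs would then be passed on to manual inspection.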

Publisher

Association for Computing Machinery (ACM)

