SCL-SKG:Software Knowledge Triplet Extraction with Span-level Contrastive Learning-Reference-Cited by-同舟云学术

SCL-SKG:Software Knowledge Triplet Extraction with Span-level Contrastive Learning

Published:2022-10-24 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Tang Mingjing¹,Zhang Shu¹,Zheng Ming²,Ma Zifei³,Gao Wei¹

Affiliation:

1. Yunnan Normal University

2. Anhui Normal University

3. Yunnan Agriculture University

Abstract

Abstract The text of software knowledge community contains abundant knowledge of software engineering field. The software knowledge triplet can be extracted automatically and efficiently to form the software knowledge graph, which is helpful for software knowledge-centric intelligent applications, such as intelligent question answering, automatic document generation and software expert recommendation. Most existing methods are confronted with problems of task dependence and entity overlap. In this paper, we propose a software knowledge triplet extraction method based on span-level contrastive learning. From the level of sentence sequence modelling, we model the sentence sequence with span as a unit, and generate abundant positive and negative samples of entity span through the span representation layer to avoid the problem that the token-level method cannot select overlapping entities. From the level of feature learning, we propose supervised entity contrastive learning and relation contrastive learning, which obtain enhanced feature representation of entity span and entity pair through positive and negative sample enhancement and contrastive loss function construction. Experiments are conducted on the dataset which is constructed based on texts of the StackOverflow, and show that our approach achieves a better performance than baseline models.

Publisher

Research Square Platform LLC

Reference42 articles.

1. Survey of Software Data Mining for Open Source Ecosystem[J];Yin G;J Softw,2018

2. Tabassum J, Maddela M, Xu W, Ritter A (2020) Code and Named Entity Recognition in StackOverflow[C]. Proc. 58th Annual Meeting of the Association for Computational Linguistics (ACL), Online, : 4913–4926

3. Ye DH, Xing ZC, Foo CY et al (2016) Software-Specific Named Entity Recognition in Software Engineering Social Content[C]. Proc. 23th International Conference on Software Analysis, Evolution, and Reengineering (SNER), Osaka, Japan, : 90–101

4. Reddy MVPR, Prasad PVRD, Chikkamath M et al (2019) NERSE: named entity recognition in software engineering as a service[C]. Proc. Australian Symposium on Service Research and Innovation, : 65–80

5. MEIM: a multi-source software knowledge entity extraction integration model[J];Lv WQ;Computers Mater Continua,2021