Deep code search efficiency based on clustering-Reference-Cited by-同舟云学术

Deep code search efficiency based on clustering

Published:2024-03-13 Issue:13 Volume:36 Page:
ISSN:1532-0626
Container-title:Concurrency and Computation: Practice and Experience
language:en
Short-container-title:Concurrency and Computation

Author:

Liu Kun¹^ORCID,Liu Jianxun¹^ORCID,Hu Haize¹

Affiliation:

1. School of Computer Science and Engineering Hunan University of Science and Technology Xiangtan China

Abstract

AbstractThe deep‐learning based code search model mainly takes accuracy as the only target for judging the performance of the model, ignoring the efficiency of code search. This article proposes a clustering‐based code search model (C‐DCS). C‐DCS uses the K‐Means to divide the code vector base into K clusters and obtains the center vectors of K clusters. While searching, C‐DCS first matches the query vector with the K center vectors to get the best matching center vector. After matching the center vector, C‐DCS matches the query vector with code vectors in the cluster corresponding to the best matching center vector one by one and then gets the best matching code snippet vector. To verify the efficiency of C‐DCS in the code search task, experimental analysis was built on a large dataset. The experimental results showed that C‐DCS saves 92.2% of the search time compared to the baseline model while remaining the accuracy. In the experimental evaluation section, we optimized the K‐Means algorithm to improve the code search efficiency of C‐DCS further, reducing the search time to 93.8% of the baseline model. Hence, C‐DCS reduces the code search time greatly with not affecting the accuracy, improving the efficiency of software development.

Publisher

Wiley

Link

https://onlinelibrary.wiley.com/doi/pdf/10.1002/cpe.8027

Reference32 articles.

1. Big Code Search: A Bibliography

2. SunZ LiL LiuY et al.On the importance of building high‐quality training datasets for neural code search. Proceedings of the 44th International Conference on Software Engineering 1609–1620.2022.

3. XieY LinJ DongH et al.A survey of deep code search. arXiv Preprint arXiv:2305.05959.2023.

4. Opportunities and Challenges in Code Search Tools

5. SunW FangC ChenY et al.Code search based on context‐aware code translation. Proceedings of the 44th International Conference on Software Engineering 388–400.2022.