Author:
Wu Zengyuan,Zhao Jiali,Li Ying,Wang Zelin,He Bin,Chen Liang
Abstract
AbstractThe number of patents increases quickly, while more and more low-quality patents are emerging. It’s important to identify high-quality patents from massive data quickly and accurately for organizational R&D decision-making and patent layout. However, due to low percentage of high-quality patents, it is challenging to identify them efficiently. In order to solve above problem, we reconstruct the existing index system for identifying high-quality patents by adding 4 features from technological strength of patentees. Furthermore, we propose an improved model by integrating resampling technique and ensemble learning algorithm. First, generative adversarial networks (GAN) are used to expand minority samples. Second, Extreme Gradient Boosting algorithm (XGBoost) with Bayesian optimization (BO) is used to identify high-quality patents. For clarity, this model is called a GAN-BO-XGBoost model. To test the effectiveness of above model, we use patent data in field of lithography technology. Tenfold cross-validation is carried out to evaluate the performance between our proposed model and other models. The results show that GAN-BO-XGBoost model performs better and it’s more stable than other models.
Funder
Zhejiang Provincial Philosophy and Social Sciences Planning Project
Zhejiang Provincial Soft Science Research Program
the Key Program of Zhejiang Province
Special Project for the Alliance of High-level Universities in the Changjiang Delta
Publisher
Springer Science and Business Media LLC
Reference39 articles.
1. Huang, Z., Li, J. & Yue, H. Study on comprehensive evaluation based on AHP-MADM model for patent value of balanced vehicle. Axioms 11(9), 481 (2022).
2. Wu, J. L. et al. A patent quality analysis and classification system using self-organizing maps with support vector machine. Appl. Soft Comput. 41(6), 305–316 (2016).
3. Trappey, A. J. C. et al. Patent value analysis using deep learning models—The case of IoT technology mining for the manufacturing industry. IEEE Trans. Eng. Manag. 68(5), 1334–1346 (2019).
4. Barandela, R. et al. Strategies for learning in class imbalance problems. Pattern Recogn. 36(3), 849–851 (2003).
5. Thomas, J. R. The responsibility of the rule maker: Comparative approaches to patent administration reform. Berkeley Tech. LJ 17, 728–761 (2002).