A Hybrid Approach for Soil Total Nitrogen Anomaly Detection Integrating Machine Learning and Spatial Statistics

Author:

Zheng Wengang12,Lan Renping12,Zhangzhong Lili3,Yang Linnan4,Gao Lutao4,Yu Jingxin3ORCID

Affiliation:

1. School of Agricultural Engineering, Jiangsu University, Zhenjiang 212013, China

2. National Engineering Research Center for Information Technology in Agriculture, Beijing 100097, China

3. National Engineering Research Center for Intelligent Equipment in Agriculture, Beijing 100097, China

4. School of Big Data, Yunnan Agricultural University, Kunming 650201, China

Abstract

Soil total nitrogen is one of the most important basic indicators for fertiliser decision making, but tens of millions of soil total nitrogen sampling data have been accumulated, forming a huge database. In this large database, there is a large amount of anomalous data, which can interfere with data analysis, affect the construction of spatial interpolation and prediction models, and then affect the accuracy of nutrient management decisions. The traditional method of identifying soil total nitrogen anomalies based on boxplots suffers from the problems of not being able to identify local anomalies, which can easily lead to misclassification of soil total nitrogen data anomalies, and the detection efficiency is not high. We propose a method to identify soil total nitrogen outliers by combining the Isolation Forest algorithm and local spatial autocorrelation analysis, which can simultaneously detect global and local outliers from large amounts of data and combine organic matter as an auxiliary indicator in the spatial analysis to help judge local outliers. Finally, the results of global and local anomalies were combined to provide a comprehensive assessment of the soil nitrogen data, avoiding the misjudgement or omission of judgement that can occur when using a single method. Using 25,930 soil test data from Yunnan Province in 2009 as an example, we compared and analysed the typical boxplot method and the unsupervised OneClassSVM method and evaluated the performance of each method in terms of correct detection rate, false positive rate and false negative rate. The results show that the proposed method has a correct detection rate (TR) of 99.97%, a false positive rate (FPR) of 8.06% and a false negative rate (FNR) of 0.01% on the data, which shows high validity and accuracy; it is also comparable to the independent isolated forests (FNR = 4.76%), boxplot (FNR = 3.90%) and OneClassSVM (FNR = 4.77%), and the false negative rate is reduced by 4.75%, 3.89% and 4.76%, respectively.

Funder

National Key R&D Program of China

Yunnan Provincial Major Science and Technology Special Project

Beijing Academy of Agriculture and Forestry Sciences Major Scientific and Technological Achievement Cultivation Project

Publisher

MDPI AG

Subject

Agronomy and Crop Science

Reference45 articles.

1. Liu, H., Zhu, Q., Xia, X., Li, M., and Huang, D. (2022). Multi-Feature Optimization Study of Soil Total Nitrogen Content Detection Based on Thermal Cracking and Artificial Olfactory System. Agriculture, 12.

2. Development of a Predictive Tool for Rapid Assessment of Soil Total Nitrogen in Wheat-Corn Double Cropping System with Hyperspectral Data;Song;Environ. Pollut. Bioavailab.,2019

3. Rapid Detection of Total Nitrogen Content in Soil Based on Hyperspectral Technology;Ma;Inf. Process. Agric.,2022

4. Total Nitrogen Analysis of Soil and Plant Tissues;Nelson;J. Assoc. Off. Anal. Chem.,1980

5. Optimizing Nitrogen Fertilizer Use for More Grain and Less Pollution;Ren;J. Clean. Prod.,2022

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3