This paper studies machine learning models for identifying the origin of oranges from near-infrared (NIR) spectroscopy. Based on the characteristics of NIR spectral data, a complete general framework for origin identification is proposed, comprising data preprocessing, feature selection, model building, and cross-validation. Multiple preprocessing algorithms and multiple machine learning algorithms are compared within this framework. Applied to the NIR spectra of oranges, the framework improves the accuracy of origin identification, with a best accuracy of 92.8%.
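The four stages of the framework (preprocessing, feature selection, model building, cross-validation) can be illustrated with a minimal sketch. Everything specific in it is an assumption made for illustration and is not taken from the paper: the spectra are synthetic, standard normal variate (SNV) stands in for the preprocessing step, a between/within-class variance ratio stands in for feature selection, and a nearest-centroid classifier stands in for the model. All function names are hypothetical.

```python
import numpy as np


def snv(X):
    """Standard normal variate: centre and scale each spectrum (row) on its own."""
    mean = X.mean(axis=1, keepdims=True)
    std = X.std(axis=1, keepdims=True)
    return (X - mean) / std


def select_features(X, y, k):
    """Indices of the k wavelengths with the largest between/within-class variance ratio."""
    overall = X.mean(axis=0)
    between = np.zeros(X.shape[1])
    within = np.zeros(X.shape[1])
    for c in np.unique(y):
        Xc = X[y == c]
        between += len(Xc) * (Xc.mean(axis=0) - overall) ** 2
        within += ((Xc - Xc.mean(axis=0)) ** 2).sum(axis=0)
    score = between / (within + 1e-12)
    return np.argsort(score)[-k:]


def fit_centroids(X, y):
    """Model building (assumed model): one mean spectrum per origin class."""
    classes = np.unique(y)
    centroids = np.stack([X[y == c].mean(axis=0) for c in classes])
    return classes, centroids


def predict(X, classes, centroids):
    """Assign each spectrum to the class with the nearest centroid."""
    dist = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return classes[dist.argmin(axis=1)]


def cross_validate(X, y, n_folds=5, k=20, seed=0):
    """Mean accuracy over n_folds; feature selection is refit on each training fold."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    X = snv(X)  # row-wise transform, so it uses no information from other samples
    accuracies = []
    for i in range(n_folds):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(n_folds) if j != i])
        cols = select_features(X[train], y[train], k)
        classes, centroids = fit_centroids(X[train][:, cols], y[train])
        pred = predict(X[test][:, cols], classes, centroids)
        accuracies.append((pred == y[test]).mean())
    return float(np.mean(accuracies))


def make_synthetic_spectra(n_per_class=30, n_wavelengths=200, n_classes=3, seed=0):
    """Synthetic stand-in data: one Gaussian band per class plus scatter and noise."""
    rng = np.random.default_rng(seed)
    grid = np.linspace(0.0, 1.0, n_wavelengths)
    X, y = [], []
    for c in range(n_classes):
        band = np.exp(-((grid - (0.25 + 0.25 * c)) ** 2) / 0.005)
        for _ in range(n_per_class):
            scale = rng.normal(1.0, 0.1)  # multiplicative scatter-like effect
            noise = rng.normal(0.0, 0.05, n_wavelengths)
            X.append(scale * (1.0 + band) + noise)
            y.append(c)
    return np.array(X), np.array(y)


if __name__ == "__main__":
    X, y = make_synthetic_spectra()
    print(f"mean cross-validated accuracy: {cross_validate(X, y):.3f}")
```

Selecting features inside each fold, rather than once on the full data set, keeps the held-out spectra from influencing which wavelengths are chosen. Swapping in other preprocessing or classification routines at the marked stages is how alternative combinations could be compared under the same cross-validation loop.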