Abstract
Economic and social activities in today's world are becoming increasingly digital. The Internet and the assistance of artificial intelligence technologies have led to massive amounts of data from these activities. These data come from different sources and in various forms, both structured and unstructured. Some data have large sample sizes, while there are also high-dimensional, enormous data in which the dimensionality of the explanatory variables surpasses the sample size. These enormous data sets are valuable, and they could drive a variety of economic activities. Big data production, machine learning, and statistics are deeply interrelated. This article discusses the concepts and methods of big data and machine learning from the characteristics of big data and the nature of machine learning. In particular, the four characteristics of big data are deeply analyzed and discussed. In addition, the analysis discusses the relationship between machine learning and big data. The article concludes with a summary and outlook of the whole article.
Publisher
Darcy & Roy Press Co. Ltd.
Reference10 articles.
1. Bok B, Caratelli D, Giannone D, et al. Macroeconomic nowcasting and forecasting with big data. Staff Reports, 2014, 11(16): 51-59.
2. Breiman L. Statistical Modeling: The Two Cultures (with comments and a rejoinder by the author). Statistical Science, 2001, 16(3):199-215.
3. Biau G. Analysis of a Random Forests Model. Journal of Machine Learning Research, 2010, 13(2):1063-1095.
4. Groenvik H. A self-normalizing approach to the specification test of mixed frequency models. Michigan Technological University. 2016.
5. Khan A, Gul M A, Uddin M I, et al. Corrigendum to "Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics". Scientific Programming, 2021 12(6):1-10.