Affiliation:
1. Public Joint Stock Company “Aeroflot – Russian Airlines”;
Ulyanovsk Civil Aviation Institute named after Air Chief Marshal B.P. Bugaev
2. Ulyanovsk Civil Aviation Institute named after Air Chief Marshal B.P. Bugaev
Abstract
Natural language processing (NLP) technologies, in one of their applications, provide effective research of patterns and trends in large sets of textual data. Textual safety data presented in the form of accident investigation reports is a promising object for extracting new useful information that can be used both in flight safety management and in the framework of simulator training. This paper discusses the application of NLP technologies for the study of the body of flight safety reports of PJSC Aeroflot – Russian Airlines. The aim of the work is to develop a method for identifying relevant topics of simulator training for pilots. The paper presents an analysis of existing foreign works in the field of intellectual analysis of textual information in civil aviation. It has been revealed that NLP technologies are actively used abroad to study flight safety reports. The paper presents a scheme of a method for identifying relevant topics of pilot simulator training based on clustering of flight safety reports. The procedures of text preprocessing and the construction of its vector space are described. The scientific novelty of the approach is that, unlike previous works, it is proposed to use a full vector representation of flight safety reports, which is built by combining matrices of thematic and semantic vectors. The proposed method has been tested. The analyzed corpus of texts amounted to 1080 reports. As a result of the clustering algorithm, 36 clusters were identified, which were then visualized using the algorithms t-distributed stochastic embedding of neighbors (t-SNE). The practical significance of the research results lies in the fact that the approach based on clustering of reports will allow for a more in-depth analysis of flight safety reports, which can simplify and speed up the work of both safety management specialists and flight simulator instructors.
Publisher
Moscow State Institute of Civil Aviation
Reference19 articles.
1. Groff, L. (2018). Applying natural language processing tools to occurrence reports. ICAO, 20 p. Available at: https://www.icao.int/safety/iStars/Documents/IUG%20Meeting%201/Presentations/Applying%20Natural%20Language%20Processing%20Tools%20to%20Occurrence%20Reports%20-%20Loren%20Groff.pdf (accessed: 20.01.2024).
2. Junjie, L., Huijuan, Y., Yinlan, D. (2020). Application of text analysis technology in aviation safety information analysis. Journal of Physics: Conference Series, vol. 1624. no. 3, pp. 032033. DOI: 10.1088/1742-6596/1624/3/032033 (accessed: 20.01.2024).
3. Pimm, C., Raynal, C., Tulechki, N., Hermann, E., Caudy, G. (2012). Natural Language Processing (NLP) tools for the analysis of incident and accident reports. In: International Conference on Human-Computer Interaction in Aerospace (HCI-Aero). Belgium, Brussels, pp. 1–7. Available at: https://core.ac.uk/download/pdf/50536379.pdf (accessed: 20.01.2024).
4. Rose, R.L., Puranik, T.G., Mavris, D.N. (2020). Natural language processing based method for clustering and analysis of aviation safety narratives. Aerospace, vol. 7, no. 10, ID: 143. DOI: 10.3390/aerospace7100143 (accessed: 20.01.2024).
5. Miyamoto, A., Bendarkar, M.V., Mavris, D.N. (2022). Natural language processing of aviation safety reports to identify inefficient operational patterns. Aerospace, vol. 9, no. 8. ID: 450. DOI: 10.3390/aerospace9080450 (accessed: 20.01.2024).