Author:
Tipu Abdul Jabbar Saeed,Conbhuí Padraig Ó,Howley Enda
Abstract
AbstractHPC or super-computing clusters are designed for executing computationally intensive operations that typically involve large scale I/O operations. This most commonly involves using a standard MPI library implemented in C/C++. The MPI-I/O performance in HPC clusters tends to vary significantly over a range of configuration parameters that are generally not taken into account by the algorithm. It is commonly left to individual practitioners to optimise I/O on a case by case basis at code level. This can often lead to a range of unforeseen outcomes. The ExSeisDat utility is built on top of the native MPI-I/O library comprising of Parallel I/O and Workflow Libraries to process seismic data encapsulated in SEG-Y file format. The SEG-Y File data structure is complex in nature, due to the alternative arrangement of trace header and trace data. Its size scales to petabytes and the chances of I/O performance degradation are further increased by ExSeisDat. This research paper presents a novel study of the changing I/O performance in terms of bandwidth, with the use of parallel plots against various MPI-I/O, Lustre (Parallel) File System and SEG-Y File parameters. Another novel aspect of this research is the predictive modelling of MPI-I/O behaviour over SEG-Y File benchmarks using Artificial Neural Networks (ANNs). The accuracy ranges from 62.5% to 96.5% over the set of trained ANN models. The computed Mean Square Error (MSE), Mean Absolute Error (MAE) and Mean Absolute Percentage Error (MAPE) values further support the generalisation of the prediction models. This paper demonstrates that by using our ANNs prediction technique, the configurations can be tuned beforehand to avoid poor I/O performance.
Funder
Science Foundation Ireland
National University Ireland, Galway
Publisher
Springer Science and Business Media LLC
Subject
Computer Networks and Communications,Software
Reference34 articles.
1. Bödvarsdóttir, I., Elklit, A.: Psychological reactions in icelandic earthquake survivors. Scand. J. Psychol. 45(1), 3–13 (2004)
2. Yilmaz, Ö.: Seismic data analysis: Processing, inversion, and interpretation of seismic data. Soc. Explor. Geophys. https://doi.org/10.1190/1.9781560801580 (2001)
3. Hagelund, R., Levin, S.A.: Seg-y\_r2. 0: Seg-y Revision 2.0 Data Exchange Format. Society of Exploration Geophysicists, Houston (2017)
4. Fisher, M.A., Conbhuí, P.Ó., Brion, C.Ó., Acquaviva, J.-T., Delaney, S., O’brien, G.S., Dagg, S., Coomer, J., Short, R.: Exseisdat: a set of parallel i/o and workflow libraries for petroleum seismology. Oil & Gas Science and Technology–Revue d’IFP Energies nouvelles 73:74 (2018)
5. Gropp, W., Lusk, E., Doss, N., Skjellum, A.: A high-performance, portable implementation of the mpi message passing interface standard. Parall. Comput. 22(6), 789–828 (1996)
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献