Affiliation:
1. Corning Incorporated, New York, USA
Abstract
This chapter is the companion chapter to “Part 1: State-of-the-Art Time-Series Data Formats Performance Evaluation.” In this chapter, algorithms for converting data from one format to other formats are presented. To implement the algorithms, existing open-source Python libraries are used extensively, and where needed, new Python routines for converting data formats are developed. It is envisaged that the algorithms and Python libraries and routines that are freely provided in this chapter will be useful for data engineers, data scientists, and for industrial IoT, cyber-physical systems (CPS), multimedia, and big data practitioners who are on the quest to use different types of data formats that are compatible with memory-constrained factory floor IoT devices. It will also be useful for Delta Lake and big data engineers, who are on the quest for delivering robust bronze, silver, and gold data lakes in the cloud.
Reference73 articles.
1. Ahmed, Ferzund, Rehman, Usman Ali, Sarwar, & Mehmood. (2017). Modern Data Formats for Big Bioinformatics Data Analytics. Int’l Journal of Advanced Computer Sc. & Applications, 8(4).
2. Apache Arrow. (n.d.). Feather File Format. Apache Arrow White Paper. Available: https://arrow.apache.org/docs/python/feather.html
3. Delta lake
4. Baheti, R., & Gill, H. (2011). Cyber-Physical Systems. The Impact of Control Technology, 12(1), 161–166. Available: http://ieeecss.org/sites/ieeecss/files/2019-07/IoCT-Part3-02CyberphysicalSystems.pdf
5. Baltrusaitis, T., Ahuja, C., & Morency, L. (2017). Multimodal Machine Learning: A Survey and Taxonomy. arXiv:1705.09406v2 [cs.LG]