1. [n.d.]. Apache Airflow. https://airflow.apache.org/ [n.d.]. Apache Airflow. https://airflow.apache.org/
2. [n.d.]. Chronicling America historic American newspapers. https://lccn.loc.gov/2007618519 [n.d.]. Chronicling America historic American newspapers. https://lccn.loc.gov/2007618519
3. [n. d.]. Elasticsearch: The Official Distributed Search & Analytics Engine. https://www.elastic.co//elasticsearch [n. d.]. Elasticsearch: The Official Distributed Search & Analytics Engine. https://www.elastic.co//elasticsearch
4. [n. d.]. Improving the quality of the output. https://tesseract-ocr.github.io/tessdoc/ImproveQuality.html [n. d.]. Improving the quality of the output. https://tesseract-ocr.github.io/tessdoc/ImproveQuality.html
5. [n. d.]. Kibana: Explore Visualize Discover Data. https://www.elastic.co/kibana [n. d.]. Kibana: Explore Visualize Discover Data. https://www.elastic.co/kibana