1. Datamaran technical report. https://arxiv.org/abs/1708.08905.
2. Flex: lexical analyzer generator. https://en.wikipedia.org/wiki/Flex_(lexical_analyser_generator).
3. Recordbreaker: Automatic structure for your text-formatted data. http://cloudera.github.io/RecordBreaker/.
4. Stack exchange data dump. https://archive.org/details/stackexchange. Accessed: 2017-07-13.
5. Mining reference tables for automatic text segmentation