A Systematic Literature Review of Lexical Analyzer Implementation Techniques in Compiler Design-Reference-Cited by-同舟云学术

A Systematic Literature Review of Lexical Analyzer Implementation Techniques in Compiler Design

Published:2020-12-31 Issue: Volume: Page:285-301
ISSN:2581-7000
Container-title:International Journal of Applied Engineering and Management Letters
language:en
Short-container-title:IJAEML

Author:

Pai T Vaikunta¹,Jayanthila Devi A.²,Aithal P. S.³

Affiliation:

1. Associate Professor, College of Computer Science & Information Science, Srinivas University, Mangalore-575001, India

2. Professor, College of Computer Science & Information Science, Srinivas University, Mangalore – 575001, India.

3. Professor, College of Management & Commerce, Srinivas University, Mangalore – 575001, India.

Abstract

The term “lexical” in lexical analysis process of the compilation is derived from the word “lexeme”, which is the basic conceptual unit of the linguistic morphological study. In computer science, lexical analysis, also referred to as lexing, scanning or tokenization, is the process of transforming the string of characters in source program to a stream of tokens, where the token is a string with a designated and identified meaning. It is the first phase of a two-step compilation processing model known as the analysis stage of compilation process used by compiler to understand the input source program. The objective is to convert character streams into words and recognize its token type. The generated stream of tokens is then used by the parser to determine the syntax of the source program. A program in compilation phase that performs a lexical analysis process is termed as lexical analyzer, lexer, scanner or tokenizer. Lexical analyzer is used in various computer science applications, such as word processing,information retrieval systems, pattern recognition systems and language-processing systems. However, the scope of our review study is related to language processing. Various tools are used for automatic generation of tokens and are more suitable for sequential execution of the process. Recent advances in multi-core architecture systems have led to the need to re-engineer the compilation process to integrate the multi-core architecture. By parallelization in the recognition of tokens in multiple cores, multi cores can be used optimally, thus reducing compilation time. To attain parallelism in tokenizationon multi-core machines, the lexical analyzer phase of compilation needs to be restructured to accommodate the multi-core architecture and by exploiting the language constructs which can run parallel and the concept of processor affinity. This paper provides a systematic analysis of literature to discuss emerging approaches and issues related to lexical analyzer implementation and the adoption of improved methodologies. This has been achieved by reviewing 30 published articles on the implementation of lexical analyzers. The results of this review indicate various techniques, latest developments, and current approaches for implementing auto generated scanners and hand-crafted scanners. Based on the findings, we draw on the efficacy of lexical analyzer implementation techniques from the results discussed in the selected review studies and the paper provides future research challenges and needs to explore the previously under-researched areas for scanner implementation processes.

Publisher

Srinivas University

Subject

General Medicine

Reference39 articles.

1. Aho, A. V., Lam, M. S., & Sethi, R. (2009). Compilers Principles, Techniques and Tools, 2nd ed, PEARSON Education.

2. Lesk, M. E., & Schmidt, E. (1975). Lex: A lexical analyzer generator. Computing Science Technical Report No. 39, Bell Laboratories, Murray Hills, New Jersey.

3. Mickunas, M. D., & Schell, R. M. (1978, December). Parallel compilation in a multiprocessor environment. In Proceedings of the 1978 annual conference (pp. 241-246).

4. Kitchenham, B. (2004). Procedures for performing systematic reviews. Keele, UK, Keele University, 33(1), 1-26.

5. Webster, J., & Watson, R. T. (2002). Analyzing the past to prepare for the future: Writing a literature review. MIS quarterly, 26(2), 8-23.

Cited by 4 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Leveraging IR based sequence and graph features for source-binary code alignment;2024 4th International Conference on Neural Networks, Information and Communication (NNICE);2024-01-19

2. Design of efficient Programming Language with Lexer using ’$’-prefixed identifier;ICST Transactions on Scalable Information Systems;2023-09-20

3. Improved Parallel Scanner for the Concurrent Execution of Lexical Analysis Tasks on Multi-Core Systems;International Journal of Applied Engineering and Management Letters;2022-03-23

4. Improved Parallel Scanner for the Concurrent Execution of Lexical Analysis Tasks on Multi-Core Systems;International Journal of Applied Engineering and Management Letters;2022-03-23