CourseNana | CAU COMPILER PROJECT 2024: Syntax analyzer

COMPILER PROJECT 2024 CourseNana.COM

The goal of the term-project is to implement a bottom-up syntax analyzer (a.k.a., parser) as we’ve learned. More specifically, you will implement the syntax analyzer for a simplified C programming language with the following context free grammar G; CourseNana.COM

CFG G: CourseNana.COM

01: CODE → VDECL CODE | FDECL CODE | ε CourseNana.COM
02: VDECL → vtype id semi | vtype ASSIGN semi CourseNana.COM
03: ASSIGN → id assign RHS CourseNana.COM
04: RHS → EXPR | literal | character | boolstr CourseNana.COM
05: EXPR → EXPR addsub EXPR | EXPR multdiv EXPR CourseNana.COM
06: EXPR → lparen EXPR rparen | id | num CourseNana.COM
07: FDECL → vtype id lparen ARG rparen lbrace BLOCK RETURN rbrace CourseNana.COM
08: ARG → vtype id MOREARGS | ε CourseNana.COM
09: MOREARGS → comma vtype id MOREARGS | ε CourseNana.COM
10: BLOCK → STMT BLOCK | ε CourseNana.COM
11: STMT → VDECL | ASSIGN semi CourseNana.COM
12: STMT → if lparen COND rparen lbrace BLOCK rbrace ELSE CourseNana.COM
13: STMT → while lparen COND rparen lbrace BLOCK rbrace CourseNana.COM
14: COND → COND comp COND | boolstr CourseNana.COM
15: ELSE → else lbrace BLOCK rbrace | ε CourseNana.COM
16: RETURN → return RHS semi CourseNana.COM

✓ Terminals (21) CourseNana.COM

vtype for the types of variables and functions CourseNana.COM
num for signed integers CourseNana.COM
character for a single character CourseNana.COM
boolstr for Boolean strings CourseNana.COM
literal for literal strings CourseNana.COM
id for the identifiers of variables and functions CourseNana.COM
if, else, while, and return for if, else, while, and return statements respectively CourseNana.COM

class for class declarations CourseNana.COM
addsub for + and - arithmetic operators CourseNana.COM
multdiv for * and / arithmetic operators CourseNana.COM
assign for assignment operators CourseNana.COM
comp for comparison operators CourseNana.COM
semi and comma for semicolons and commas respectively CourseNana.COM
lparen, rparen, lbrace, and rbrace for (, ), {, and } respectively CourseNana.COM

✓ Non-terminals (13)
CODE, VDECL, ASSIGN, RHS, EXPR, FDECL, ARG, MOREARGS, BLOCK, STMT, COND, ELSE, RETURN CourseNana.COM
✓ Start symbol: CODE CourseNana.COM

Descriptions CourseNana.COM

✓ The given CFG G is non-left recursive, but ambiguous. CourseNana.COM
✓ Codes include zero or more declarations of functions and variables (CFG line 1) CourseNana.COM
✓ Variables are declared with or without initialization (CFG line 2 ~ 3) CourseNana.COM
✓ The right hand side of assignment operations can be classified into four types; 1) arithmetic CourseNana.COM

operations (expressions), 2) literal strings, 3) a single character, and 4) Boolean strings (CFG CourseNana.COM

4) CourseNana.COM
✓ Arithmetic operations are the combinations of +, -, *, / operators (CFG line 5 ~ 6) CourseNana.COM
✓ Functions can have zero or more input arguments (CFG line 7 ~ 9) CourseNana.COM
✓ Function blocks include zero or more statements (CFG line 10) CourseNana.COM
✓ There are four types of statements: 1) variable declarations, 2) assignment operations, 3) if- CourseNana.COM

else statements, and 4) while statements (CFG line 11 ~ 13) CourseNana.COM
✓ if and while statements include a conditional operation which consists of Boolean strings CourseNana.COM

and condition operators (CFG line 12 ~ 14) CourseNana.COM

✓ if statements can be used with or without an else statement (CFG line 12 & 15) CourseNana.COM
✓ return statements return 1) the computation result of arithmetic operations, 2) literal strings, CourseNana.COM

3) a single character, or 4) Boolean strings (CFG line 16) CourseNana.COM
✓ This is not a CFG for C. This is for simplified C. So, you don’t need to consider grammars CourseNana.COM

and structures not mentioned in this specification. CourseNana.COM

Based on this CFG, you should implement a bottom-up parser as follows:
✓ Discard an ambiguity in the CFG
✓ Construct a SLR parsing table for the non-ambiguous CFG through the following website: CourseNana.COM

http://jsmachines.sourceforge.net/machines/slr.html CourseNana.COM

✓ Implement a SLR parsing program for the simplified Java programming language by using the constructed table. CourseNana.COM

For the implementation, please use C, C++, or Python (If you want to use . Your syntax analyzer must run on Linux or Unix-like OS without any error.
Your syntax analyzer should work as follows: CourseNana.COM

✓ The execution flow of your syntax analyzer: syntax_analyzer <input file> CourseNana.COM

✓ Input: A sequence of tokens (terminals) written in the input file
e.g., vtype id semi vtype id lparen rparen lbrace if lparen boolstr comp boolstr rparen lbrace rbrace CourseNana.COM

✓ Output
◼ (If a parsing decision output is “accept”) please construct a parse tree (not abstract CourseNana.COM

syntax tree) for the input sequence CourseNana.COM

◆ You can design the data structure to represent the tree as you want.
◼ (If an output is “reject”) please make an error report which explains why and where the CourseNana.COM

error occurred (e.g., line number) CourseNana.COM

Term-project schedule and submission CourseNana.COM

✓ Deadline: 6/9, 23:59 (through an e-class system)
◼ For a delayed submission, you will lose 0.1 * your original project score per each CourseNana.COM

delayed day
✓ Submission file: team_<your_team_number>.zip or .tar.gz CourseNana.COM

◼ The compressed file should contain
◆ The source code of your syntax analyzer with detailed comments
◆ The executable binary file of your syntax analyzer (if you implemented using CourseNana.COM

a complied language) CourseNana.COM

◆ Documentation (the most important thing!) CourseNana.COM

⚫ It must include 1) your non-ambiguous CFG G and 2) your SLR parsing table CourseNana.COM
⚫ It must also include any change in the CFG G and all about how your syntax CourseNana.COM

analyzer works for validating token sequences (for example, overall procedures, implementation details like algorithms and data structures, working examples, and so on) CourseNana.COM

◆ Test input files and outputs which you used in this project
⚫ The test input files are not given. You should make the test files, by yourself, CourseNana.COM

CAU COMPILER PROJECT 2024: Syntax analyzer

Get in Touch with Our Experts