1. Homepage
  2. Programming
  3. [2022] COMP2041/9044 Software Construction - Assignment 2: Slippy

[2022] COMP2041/9044 Software Construction - Assignment 2: Slippy

Engage in a Conversation
COMP2041COMP9044Software ConstructionUNSWSlippyPython

Assignment 2: Slippy


CourseNana.COM

Aims

This assignment aims to give you: CourseNana.COM

  • practice in Python programming generally.
  • a clear and concrete understanding of sed's core semantics.

Introduction

Your task in this assignment is to implement Slippy. CourseNana.COM

Slippy stands for [S]ed [L]anguage [I]nterpreter in [P]ure [PY]thon.
A subset of the important Unix/Linux tool Sed. CourseNana.COM

You will do this in Python. CourseNana.COM

Sed is a very complex program that has many commands.
You will implement only a few of the most important commands.
You will also be given a number of simplifying assumptions, which make your task easier. CourseNana.COM

Slippy is a POSIX-compatible subset of sed with extended regular expressions (EREs).
On CSE systems you would run sed -E CourseNana.COM

You must implement Slippy in Python only.
See the Permitted Languages section below for more information. CourseNana.COM

Reference implementation

Many aspects of this assignment are not fully specified in this document;
instead, you must match the behaviour of the reference implementation: 2041 slippy CourseNana.COM

Provision of a reference implementation is a common method to provide or define an operational specification,
and it's something you will likely need to do after you leave UNSW. CourseNana.COM

Discovering and matching the reference implementation's behaviour is deliberately part of the assignment,
and will take some thought. CourseNana.COM

If you discover what you believe to be a bug in the reference implementation, report it in the class forum.
Andrew and Dylan may fix the bug, or indicate that you do not need to match the reference implementation's behaviour in this case. CourseNana.COM

Slippy Commands


Subset 0

In subset 0 slippy will always be given a single Slippy command as a command-line argument. CourseNana.COM

The Slippy command will be one of 'q', 'p', 'd', or 's' (see below). CourseNana.COM

The only other command-line argument possible in subset 0 is the -n option. CourseNana.COM

Input files will not be specified in subset 0. CourseNana.COM

For subset 0 slippy need only read from standard input. CourseNana.COM

Subset 0: q - quit command

The Slippy q command causes slippy.py to exit, for example: CourseNana.COM

seq 1 5 | 2041 slippy '3q'
1
2
3
seq 9 20 | 2041 slippy '3q'
9
10
11
seq 10 15 | 2041 slippy '/.1/q'
10
11
seq 500 600 | 2041 slippy '/^.+5$/q'
500
501
502
503
504
505
seq 100 1000 | 2041 slippy '/1{3}/q'
100
101
102
103
104
105
106
107
108
109
110
111

slippy commands are applied to input lines as they are read. CourseNana.COM

The q command means slippy may not read all input. CourseNana.COM

For example, the command prints an "infinite" number of lines containing (by default) "yes". CourseNana.COM

yes | 2041 slippy '3q'
y
y
y

This means slippy can not read all input first, e.g. into a list, before applying commands. CourseNana.COM

Subset 0: p - print command

The Slippy p commands prints the input line, for example: CourseNana.COM

seq 1 5 | 2041 slippy '2p'
1
2
2
3
4
5
seq 7 11 | 2041 slippy '4p'
7
8
9
10
10
11
seq 65 85 | 2041 slippy '/^7/p'
65
66
67
68
69
70
70
71
71
72
72
73
73
74
74
75
75
76
76
77
77
78
78
79
79
80
81
82
83
84
85
seq 1 5 | 2041 slippy 'p'
1
1
2
2
3
3
4
4
5
5

Subset 0: d - delete command

The Slippy d command deletes the input line, for example: CourseNana.COM

seq 1 5 | 2041 slippy '4d'
1
2
3
5
seq 1 100 | 2041 slippy '/.{2}/d'
1
2
3
4
5
6
7
8
9
seq 11 20 | 2041 slippy '/[2468]/d'
11
13
15
17
19

Subset 0: s - substitute command

The Slippy s command replaces the specified regex on the input line. CourseNana.COM

seq 1 5 | 2041 slippy 's/[15]/zzz/'
zzz
2
3
4
zzz
seq 10 20 | 2041 slippy 's/[15]/zzz/'
zzz0
zzz1
zzz2
zzz3
zzz4
zzz5
zzz6
zzz7
zzz8
zzz9
20
seq 100 111 | 2041 slippy 's/11/zzz/'
100
101
102
103
104
105
106
107
108
109
zzz0
zzz1

The substitute command can be followed optionally by the modifier character g, for example: CourseNana.COM

echo Hello Andrew | 2041 slippy 's/e//'
Hllo Andrew
echo Hello Andrew | 2041 slippy 's/e//g'
Hllo Andrw

g is the only permitted modifier character. CourseNana.COM

Like the other commands, the substitute command can be given addresses to be applied to: CourseNana.COM

seq 11 19 | 2041 slippy '5s/1/2/'
11
12
13
14
25
16
17
18
19
seq 51 60 | 2041 slippy '5s/5/9/g'
51
52
53
54
99
56
57
58
59
60
seq 100 111 | 2041 slippy '/1.1/s/1/-/g'
100
-0-
102
103
104
105
106
107
108
109
110
---

Subset 0: -n command line option

The Slippy -n command line option stops input lines being printed by default. CourseNana.COM

seq 1 5 | 2041 slippy -n '3p'
3
seq 2 3 20 | 2041 slippy -n '/^1/p'
11
14
17

-n command line option is the only useful in conjunction with the p command,
but can still be used with the other commands. CourseNana.COM

Subset 0: Addresses

All Slippy commands in subset0 can optionally be preceded by an address specifying the line(s) they apply to. CourseNana.COM

In subset 0, this address can either be a line number or a regex. CourseNana.COM

The line number must be a positive integer. CourseNana.COM

The regex must be delimited with slash / characters. CourseNana.COM

Subset 0: Regexes

In subset 0, you can assume backslashes \ do not appear in address or substitution regexes. CourseNana.COM

In subset 0, you can assume semicolons ; do not appear in address or substitution regexes. CourseNana.COM

In subset 0, you can assume commas , do not appear in address or substitution regexes. CourseNana.COM

In subset 0, regexes are delimited with slash / characters, so you can assume slashes do not appear in regexes. CourseNana.COM

In subset 0 and all other subsets, you can assume the regex is correct. You do not have to check for errors in the regex. CourseNana.COM

In subset 0 and all other subsets, you can assume the regex is a POSIX-compatible extended regular expression. CourseNana.COM

In subset 0 and all other subsets, you can assume the regex is compatible with Python.
In other words, the regex can be used directly as a Python regular expression, for example passed to re.search, and will have the same meaning. CourseNana.COM


Subset 1

Subset 1 is more difficult. You will need to spend some time understanding the semantics (meaning) of these operations, by running the reference implementation and researching the equivalent sed operations. CourseNana.COM

Note the assessment scheme recognises this difficulty. CourseNana.COM

Subset 1: s - substitute command

In subset 1, any non-whitespace character may be used to delimit a substitute command, for example: CourseNana.COM

seq 1 5 | 2041 slippy 'sX[15]XzzzX'
zzz
2
3
4
zzz
seq 1 5 | 2041 slippy 's?[15]?zzz?'
zzz
2
3
4
zzz
seq 1 5 | 2041 slippy 's_[15]_zzz_'
zzz
2
3
4
zzz
seq 1 5 | 2041 slippy 'sX[15]Xz/z/zX'
z/z/z
2
3
4
z/z/z

Subset 1: Multiple Commands

In subset 1, multiple Slippy commands can be supplied separated by semicolons ; or newlines. For example: CourseNana.COM

seq 1 5 | 2041 slippy '4q;/2/d'
1
3
4
seq 1 5 | 2041 slippy '/2/d;4q'
1
3
4
seq 1 20 | 2041 slippy '/2$/,/8$/d;4,6p'
1
9
10
11
19
20
seq 1 5 | 2041 slippy '4q
/2/d'
1
3
4
seq 1 5 | 2041 slippy '/2/d
4q'
1
3
4

Semicolons can not appear elsewhere in subset 1 commands. CourseNana.COM

Subset 1: -f command line option

The Slippy -f reads Slippy commands from the specified file, for example: CourseNana.COM

echo 4q   >  commands.slippy
echo /2/d >> commands.slippy
seq 1 5 | 2041 slippy -f commands.slippy
1
3
4
echo /2/d >  commands.slippy
echo 4q   >> commands.slippy
seq 1 5 | 2041 slippy -f commands.slippy
1
3
4

commands can be supplied separated by semicolons ; or newlines. CourseNana.COM

Subset 1: Input Files

In subset 1, input files can be specified on the command line: CourseNana.COM

seq 1 2 > two.txt
seq 1 5 > five.txt
2041 slippy '4q;/2/d' two.txt five.txt
1
1
2
seq 1 2 > two.txt
seq 1 5 > five.txt
2041 slippy '4q;/2/d' five.txt two.txt
1
3
4
echo 4q   >  commands.slippy
echo /2/d >> commands.slippy
seq 1 2 > two.txt
seq 1 5 > five.txt
2041 slippy -f commands.slippy two.txt five.txt
1
1
2

Subset 1: Comments & White Space

In subset 1, whitespace can appear before and/or after commands and addresses. CourseNana.COM

In subset 1, '#' can be used as a comment character, for example: CourseNana.COM

seq 24 43 | 2041 slippy ' 3, 17  d  # comment'
24
25
41
42
43

On both the command line and in a command file, a newline ends a comment CourseNana.COM

seq 24 43 | 2041 slippy '/2/d # delete  ;  4  q # quit'
30
31
33
34
35
36
37
38
39
40
41
43

Subset 1: Addresses

In subset 1, $ can be used as an address.
It matches the last line, for example: CourseNana.COM

seq 1 5 | 2041 slippy '$d'
1
2
3
4
seq 1 10000 | 2041 slippy -n '$p'
10000

Slippy can read one line of input ahead to handle $ addresses. CourseNana.COM

In subset 1, Slippy commands can optionally be preceded by a comma-separated pair of addresses specifying the start and finish of the range of lines the command applies to, for example: CourseNana.COM

seq 10 21 | 2041 slippy '3,5d'
10
11
15
16
17
18
19
20
21
seq 10 21 | 2041 slippy '3,/2/d'
10
11
21
seq 10 21 | 2041 slippy '/2/,4d'
10
11
14
15
16
17
18
19
seq 10 21 | 2041 slippy '/1$/,/^2/d'
10
seq 10 30 | 2041 slippy '/4/,/6/s/[12]/9/'
10
11
12
13
94
95
96
17
18
19
20
21
22
23
94
95
96
27
28
29
30

Comma-separated pairs of addresses can not be used with the q command. CourseNana.COM

Subset 1: Regexes

All the rules from Subset 0 about regex still apply, except: CourseNana.COM

In subset 1, substitute regexes are not always delimited with slash / characters,
So you can not assume slashes do not appear in regexes.
You can assume that whatever the delimiter is, it will not appear in the substitute regex.
Only substitute regexes can be delimited with other characters, address regex are always delimited by slashes. CourseNana.COM


Subset 2

Subset 2 is even more difficult. You will need to spend considerable time understanding the semantics of these operations, by running the reference implementation, and/or researching the equivalent sed operations. CourseNana.COM

Note the assessment scheme recognises this difficulty. CourseNana.COM

Subset 2: s - substitute command

In subset 2, the character used to delimit the substitute command may appear in the regex or replacement string. CourseNana.COM

In subset 2, backslash may appear in the regex or replacement string. CourseNana.COM

In subset 2, you can not assume the regex is correct. You need to check for errors in the regex. CourseNana.COM

Subset 2: -i command line option

The Slippy -i command line option replaces file contents with the output of the Slippy commands. You should use a temporary file. CourseNana.COM

seq 1 5 > five.txt
cat five.txt
1
2
3
4
5
2041 slippy -i /[24]/d five.txt
cat five.txt
1
3
5

Subset 2: Multiple Commands

In subset 2, semicolons ; and commas , can appear inside Slippy commands. CourseNana.COM

echo 'Punctuation characters include . , ; :' | 2041 slippy 's/;/semicolon/g;/;/q'
Punctuation characters include . , semicolon :

Subset 2: : - label command

The Slippy : command indicates where b and t commands should continue execution. CourseNana.COM

There can not be an address before a label command. CourseNana.COM

Subset 2: b - branch command

The Slippy b command branches to the specified label, if the label is omitted, it branches to the end of the script. CourseNana.COM

Subset 2: t - conditional branch command

The Slippy t command behaves the same as the b command except it branches only if there has been a successful substitute command since the last input line was read and since the last t command. CourseNana.COM

echo 1000001 | 2041 slippy ': start; s/00/0/; t start'
101
echo 0123456789 | 2041 slippy -n 'p; : begin;s/[^ ](.)/ \1/; t skip; q; : skip; p; b begin'
0123456789
 123456789
  23456789
   3456789
    456789
     56789
      6789
       789
        89
         9

Subset 2: a - append command

The Slippy a command appends the specified text. CourseNana.COM

seq 5 9 | 2041 slippy '3a hello'
5
6
7
hello
8
9

Subset 2: i - insert command

The Slippy i command inserts the specified text. CourseNana.COM

seq 5 9 | 2041 slippy '3i hello'
5
6
hello
7
8
9

Subset 2: c - change command

seq 5 9 | 2041 slippy '3c hello'
5
6
hello
8
9

The Slippy c command replaces the selected lines with the specified text. CourseNana.COM

Subset 2 Assmptions: Regexes

In subset 2, backslash \ may appear in regexes. CourseNana.COM

In subset 2, the character used to delimit the regex may appear in the regex itself. CourseNana.COM


Other Sed Features

You do not have to implement in Slippy sed features and commands other than those described above. CourseNana.COM

For example, sed on CSE systems provides extra commands including {} D h H g G l n p T w W x y which are not part of Slippy. CourseNana.COM

For example, sed on CSE systems adds extra syntax to addresses including features involving the characters: ! + ~ 0 \. These are not part of Slippy. CourseNana.COM

For example, sed on CSE systems has a number of command-line options other than -i-n and -f. These are not part of Slippy CourseNana.COM

The reference implementation implements many of these extra sed features and commands. CourseNana.COM

The marking will not test your code on these extra features and commands. CourseNana.COM

You do not have to check for these extra features and commands. CourseNana.COM

You will not be penalized if you choose to implement any of these extra features and commands. CourseNana.COM

Assumptions/Clarifications - All Subsets

Like all good programmers, you should make as few assumptions as possible. CourseNana.COM

You can assume that only the arguments described above are supplied to slippy commands. You do not have to handle other arguments. CourseNana.COM

You must apply the Slippy commands to input lines as you read the input lines. You can not read all input lines first (e.g. into a list). There may be an unlimited number of input lines. CourseNana.COM

You are permitted to read one line ahead to handle $ addresses. CourseNana.COM

You are permitted to read one line ahead even if the commands do not use a $ address. CourseNana.COM

You should match the output streams used by the reference implementations. It writes error messages to stderr: so should you. CourseNana.COM

You should match the exit status used by the reference implementation. It exits with status 1 after an error: so should you. CourseNana.COM

You can assume arguments will be in the position and order shown in the usage message from the reference implementation. Other orders and positions will not be tested. Here is the usage message: CourseNana.COM

./slippy --help
usage: slippy [-i] [-n] [-f <script-file> | <sed-command>] [<files>...]

You can assume, Slippy regular expressions are valid Python regular expressions and are compatible with Python. In other words, they can be used as Python regular expressions and will have the same effect. CourseNana.COM

You can assume command line arguments, STDIN and all files contain only ASCII bytes. CourseNana.COM

You can assume all input lines in STDIN and in all files are terminated by a '\n' byte. CourseNana.COM

Slippy error messages include the program name. It is recommended you use sys.argv[0] however it is also acceptable to hard-code the program name. The automarking and style marking will accept both. CourseNana.COM

Get in Touch with Our Experts

WeChat WeChat
Whatsapp WhatsApp
COMP2041代写,COMP9044代写,Software Construction代写,UNSW代写,Slippy代写,Python代写,COMP2041代编,COMP9044代编,Software Construction代编,UNSW代编,Slippy代编,Python代编,COMP2041代考,COMP9044代考,Software Construction代考,UNSW代考,Slippy代考,Python代考,COMP2041help,COMP9044help,Software Constructionhelp,UNSWhelp,Slippyhelp,Pythonhelp,COMP2041作业代写,COMP9044作业代写,Software Construction作业代写,UNSW作业代写,Slippy作业代写,Python作业代写,COMP2041编程代写,COMP9044编程代写,Software Construction编程代写,UNSW编程代写,Slippy编程代写,Python编程代写,COMP2041programming help,COMP9044programming help,Software Constructionprogramming help,UNSWprogramming help,Slippyprogramming help,Pythonprogramming help,COMP2041assignment help,COMP9044assignment help,Software Constructionassignment help,UNSWassignment help,Slippyassignment help,Pythonassignment help,COMP2041solution,COMP9044solution,Software Constructionsolution,UNSWsolution,Slippysolution,Pythonsolution,