INFO20003 Semester 2, 2022
Due: 6:00pm Friday 16th September
Assignment 2: SQL
Weighting: 10% of your total assessment
Melbourne Touch Rugby Database
Description
The Melbourne Touch Rugby competition is a 6-player-a-side competition for men’s, women’s and mixed teams (a mixed team has a minimum of 3 women and maximum of 3 men on the field at any time). Players must be registered with a club to play in the competition and players can only be registered for one club at a time.
Currently there are 8 rugby clubs participating in the competition, which began in 2020. Each club has three teams – a men’s team, a women’s team and a mixed team. The men’s competition is known as the Dewar Shield; the women’s competition is known as the Williams Plate; and the mixed teams competition is known as the Bingham Trophy.
A season is the set of games played in a competition in a calendar year. Each season consists of rounds in which every team plays a game.
Even though each game is played by 12 players (6 from each team), a team is not a fixed group of 6 players. The team officials choose 6 players from their club, or possibly even from another club, to play for that team in a game. As such, the composition of a team may vary from game to game.
If for any reason a team is unable to organise enough players to play a game, that team will forfeit the game and their opponents will score a “walkover”. A walkover awards 28 points to the team who scores a walkover and no score is recorded for the team who forfeited. If a game is cancelled (e.g. due to extreme heat, unsuitable playing pitch), no score is recorded against either team.
The Data Model
The physical ER model of Melbourne Touch Rugby database.
Implementation Notes
Each season consists of multiple rounds, the ‘round’ that each game is played in is stored as an attribute of Game.
A forfeit game (walkover) results in the winning team having a score of 28, and the forfeit team having a score of ‘NULL’. Cancelled games result in both teams having ‘NULL’ scores.
Assignment 2 Setup
A dataset is provided which you may use when developing your solutions. To set up the dataset, download the file rugby_2022.sql from the Assignment link on Canvas and run it in Workbench. This script creates the database tables and populates them with data. Note that this dataset is provided for you to experiment with, it is not the same dataset as what your queries will be tested against (the schema will stay the same, but the data itself may be different).
The script is designed to run against your account on the Engineering IT server (info20003db.eng.unimelb.edu.au). If you want to install the schema on your own MySQL Server installation, uncomment the lines at the beginning of the script.
Note: Do NOT disable only_full_group_by mode when completing this assignment. This mode is the default, and is turned on in all default installs of MySQL workbench. You can check whether it is turned on by running the following in workbench:
SELECT @@sql_mode;
The command should return a string containing “ONLY_FULL_GROUP_BY” or “ANSI”. When marking, our test server WILL have this mode turned on, and if your query fails due to this, you will lose marks. Run the below command to add the only_full_group_by mode:
SET sql_mode=(SELECT CONCAT(@@sql_mode,',ONLY_FULL_GROUP_BY'))
The SQL tasks
In this section are listed 10 questions for you to answer. Write one (single) SQL statement per question. Subqueries and nesting are allowed. You may be penalized for overly complicated SQL statements (e.g. multiple times longer than required, used vague/poorly named variables, formatting makes it difficult to read, etc). Unless specified in the question, you do NOT need to sort the results of your query. DO NOT USE VIEWS (or ‘WITH’ statements/common table expressions) to answer questions.
- Find the names of all players that are currently in the ‘Melbourne Tigers’ club. Your query should return data in the form (firstName, lastName) (1 mark)
- Find the team which has given the most walkovers (forfeited the most number of games). Assume there are no ties. Your query should return one row of the form (teamName). (1 mark)
- Find the player that has played in the most clubs over time. You may assume there are no ties. Don’t count the same club multiple times if a player has played for them multiple times (1 mark)
- Find players that have played more games in 2020 than in 2021. Return as (firstName, lastName, numGames2020, numGames2021) (2 marks)
- Find the team(s) that ended with the most points scored in the Williams Plate in the 2021 season. If multiple teams tied for total points, you must display all teams that tied for first. Return as (teamName, sumOfPoints) (2 marks)
- Find the names of all women players who have played in the Bingham Trophy, but never the Williams Plate. Return as (firstName, lastName). (2 marks)
- For all players that have ever played a game in the league, list the first and most recent club they ever joined. If these two clubs happen to be the same, return the
- Find players that have played for every club which has the word ‘Melbourne’ in its name, and have never joined any of those clubs more than once (i.e. they’ve never left and rejoined any club with the word ‘Melbourne’ in its name). Return as (firstName, lastName) (3 marks)
- A player is said to have ‘the magic touch’ when they win (or draw) every game they play in during a competition season. Find all players who had ‘the magic touch’ in some season, but one or more of the teams they played for during that season still lost a game during that season (when they weren’t playing!).
- Find the club that has the largest difference between the win-ratio (ratio of wins/losses; where a draw is neither a win nor loss) of their best and worst teams in 2021 (across all competitions). Assume there are no ties. Teams that did not play a game in 2021 should not be considered. Return as (clubName) (3 marks)
SQL Response Formatting Requirements
To help us mark your assignment queries as quickly/accurately as possible, please ensure that:
- Your query returns the projected attributes in the same order as given in the question, and does not include additional columns. E.g., if the question asks ‘return as (userId, name)’, please write “SELECT userId, name ...” instead of “SELECT name, userId...” (you can name the columns using `AS` however you’d like, only the order matters).
- Please do NOT use “databaseName.tableName” format. E.g., please write “SELECT userId FROM users...” instead of “SELECT userId FROM coltonc.users ...”.
- Ensure that you are using single quotes( ' ) for strings (e.g. ...WHERE name = ‘bob’...)and double quotes ( " ) only for table names (e.g. SELECT name FROM “some table name with spaces”...). Do NOT use double quotes for strings “...WHERE name = “bob”...”.
Submission Instructions
Your submission will be in the form of an SQL script. There is a template file on the LMS, into which you will paste your solutions and fill in your student details (more information below).
Filling in the template file:
The template file on the LMS has spaces for you to fill in your student details and your answers to the questions. There is also an example prefilled script available on the LMS as well. Below are screenshots from those two documents explaining the steps you need to take to submit your solutions:
Step | Example |
1. At the top of the template, you’ll need to replace “XXXXXXXX” with your student number and name | Template
|
Example Filled in
| |
2. For each question 1-10, place your SQL solution in between the “BEGIN QX” and “END QX” markers. Ensure each query is terminated with a semicolon “;”
| Template
|
Example Filled in
| |
3. Test that your script is valid SQL by running it from MySQL Workbench. Run the entire script by copy-pasting this entire file into a new workbench tab, placing your cursor at the start of the file (without selecting anything), and pressing the lightning bolt to run the entire file. All queries should run successfully one after another. If not, check to make sure you added semicolons ‘;’ after each query.
| All queries ran sequentially and were successful.
|
Late submission
Unless you have an approved extension (see below), you will be penalised -10% of the total number of marks in the assignment per day that your submission is late. For instance, if you received a 78% raw score, but submitted 2 days late, you'd receive a 58% score for the assignment.
Reminder: INFO20003 Hurdle Requirements
To pass INFO20003, you must pass two hurdles:
• Hurdle 1: Obtain at least 50% (15/30) for the three assignments (each worth 10%)
• Hurdle 2: Obtain at least 50% (35/70) for the combination of the quizzes and final exam
Therefore, it is our recommendation that you attempt every assignment and question in the exam.
GOOD LUCK!