CS 2550 Principles of Database Systems Assignment #4: Query Processing & Optimization

CS 2550 – Principles of Database Systems (Spring 2024)

Assignment #4: Query Processing & Optimization

Goal

The goal of this assignment is to better understand the query evaluation process.

Description

The following is a fragment of a movie studio database schema.

          Employee (EmployeeID, First, Last, Gender, DOB, Address, Phone, SNO, Salary)
          PK(EmployeeID)
          FK(SNO) --> Studio(StudioNum)
          Studio (StudioNum, StudioName, Manager, Budget)
          PK(StudioNum)
          FK(Manager) --> Employee(EmployeeID)
          Movie (movieID, producedBy, title, Budget, filmedAt)
          PK(movieID)
          FK(producedBy) --> Studio(StudioNum)

Consider four possible organizations of the tables Employee (E), Studio (S), and Movie (M):

  1. Tables E, S and M are heap files & no access methods exist on any of them

  2. Tables E and S are sorted files and M is a heap file & no access methods exist on any of them

  3. Tables E, S and M are heap files & access methods (hashing) exist only on M

  4. Table E is a sorted file and tables S and M are heap files & access methods exist on E and M, one of which is a hash and the other an index

Further assume the following statistics:

  • Available Cache Memory (CM) is 22 pages.

  • Attributes belonging to the same table all have the same average size.

  • When using hash join, the smaller record will be the partitioning part.

  • The distribution of values for the attribute filmedAt in the table Movie is: 50% 'PGH', 30% 'PHI', and 20% 'NYC'.

  • Movie: r = 500, bfr = 5, B_Movie = 100.

  • Studio: r = 250, bfr = 5, B_Studio = 50.

  • Employee: r = 1250, bfr = 5, B_Employee = 250.

  • Note that r is the total number of tuples in a table, bfr is the blocking factor, and B is the total number of blocks of a table.
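The block counts above follow from B = ⌈r / bfr⌉. The short Python sketch below recomputes them and also estimates how many Movie tuples satisfy filmedAt = 'PGH' under the stated 50% distribution. The names `TABLES`, `blocks`, and `PGH_PERCENT` are illustrative only, and the block estimate for the selected tuples assumes they keep the original blocking factor (no projection is applied).

```python
import math

# Statistics copied from the assignment: r = tuples, bfr = blocking factor.
TABLES = {
    "Movie": {"r": 500, "bfr": 5},
    "Studio": {"r": 250, "bfr": 5},
    "Employee": {"r": 1250, "bfr": 5},
}

def blocks(r, bfr):
    """Number of blocks needed to store r tuples at bfr tuples per block."""
    return math.ceil(r / bfr)

block_counts = {name: blocks(s["r"], s["bfr"]) for name, s in TABLES.items()}

# 50% of Movie tuples have filmedAt = 'PGH' (from the stated distribution).
PGH_PERCENT = 50
pgh_tuples = TABLES["Movie"]["r"] * PGH_PERCENT // 100

# Assumption: selected tuples keep the Movie blocking factor (no projection).
pgh_blocks = blocks(pgh_tuples, TABLES["Movie"]["bfr"])

print(block_counts)
print(pgh_tuples, pgh_blocks)
```

Estimates like `pgh_tuples` are useful when sizing intermediate results in the cost calculations for Q2.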

Questions [100 points]

Q1 [20 points] Produce the heuristic query evaluation tree of the following SQL statement, assuming no access structures are available, and describe in detail how the query would be evaluated:

    SELECT S.StudioName, COUNT(M.movieID), SUM(M.Budget)
    FROM   EMPLOYEE AS E, Studio AS S, Movie AS M
    WHERE  M.producedBy = S.StudioNum AND E.SNO = S.StudioNum
    GROUP BY S.StudioName
    HAVING COUNT(E.EmployeeID) < 15;

State any additional assumptions. (Hint: some operators can be combined into a single algorithm.)
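As one illustration of what "combined into a single algorithm" can mean, the sketch below folds grouping, aggregation, and the HAVING filter into a single hash-based pass over already-joined rows. It is not the expected answer to Q1: the row layout, the function name `group_aggregate`, and the toy data are all assumptions made for the example, and the join that produces the rows is left out.

```python
from collections import defaultdict

def group_aggregate(rows, max_employees=15):
    """One pass over joined rows: GROUP BY, COUNT/SUM, then the HAVING filter.

    rows: iterable of (studio_name, movie_id, movie_budget, employee_id),
    a hypothetical layout for the join result.
    """
    # Per group: [movie count, budget sum, employee count]
    groups = defaultdict(lambda: [0, 0, 0])
    for studio_name, movie_id, budget, employee_id in rows:
        acc = groups[studio_name]
        if movie_id is not None:       # COUNT(col) skips NULLs
            acc[0] += 1
            acc[1] += budget
        if employee_id is not None:
            acc[2] += 1
    # HAVING COUNT(E.EmployeeID) < max_employees, applied once per group
    return {
        name: (movie_count, budget_sum)
        for name, (movie_count, budget_sum, emp_count) in groups.items()
        if emp_count < max_employees
    }

# Toy joined rows (made up for illustration)
rows = [
    ("Alpha", 1, 100, 10),
    ("Alpha", 2, 200, 10),
    ("Beta", 3, 50, 20),
]
result = group_aggregate(rows)
```

A sort-based variant would sort the rows on the grouping attribute first and emit each group as its run ends; which one is cheaper depends on whether the groups fit in the available memory.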

Q2 [80 points] Consider the query evaluation tree as discussed in class of the following SQL statement.

    SELECT M.movieID, M.producedBy, E.First, E.Last, E.Gender
    FROM   Employee AS E, Studio AS S, Movie AS M
    WHERE  M.producedBy = S.StudioNum AND S.Manager = E.EmployeeID
          AND M.filmedAt = 'PGH';

Based on your pittID, you have two database organizations assigned to you from the above organizations in order to answer this question. For those two organizations, identify sub trees that represent groups of operations which can be executed by a single algorithm. For each such sub tree, there are potentially multiple methods that can be combined to implement it as a single algorithm. Please provide the following:

  • The description of one such implementation (steps & data structures) out of all possible ones for each of the above cases.

  • The I/O cost of your implementation at each level.

  • The total I/O cost.

You are asked to answer the question for two database organizations, which are determined by your pittID as follows:

ddh32, tiw81, yiy161, dhb51, yul251, sml153, nia135, dtk28: You are required to do the first and second organizations only.

mas937, yux85, anp407, shk148, nat134, aba166, mod53, jeb386: You are required to do the first and third organizations only.

aym50, bsp22, yuw328, shy158, btl26, shp184, jah292, tbt8: You are required to do the first and fourth organizations only.
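For the per-level I/O cost that Q2 asks for, a small calculator can help check arithmetic. The sketch below uses the common textbook cost model for a block nested-loop join on heap files with no access methods, assuming one buffer page is reserved for the inner table and one for output. Both the function name `block_nested_loop_cost` and this particular cost model are assumptions; the formulas presented in class may differ, and the cost of writing the join result is not included.

```python
import math

def block_nested_loop_cost(b_outer, b_inner, buffer_pages):
    """Block reads for a block nested-loop join (result writes excluded).

    Assumes buffer_pages - 2 pages hold a chunk of the outer table,
    with one page for the inner table and one page for output.
    """
    chunk = buffer_pages - 2
    if chunk < 1:
        raise ValueError("need at least 3 buffer pages")
    # The inner table is scanned once per chunk of the outer table.
    passes = math.ceil(b_outer / chunk)
    return b_outer + passes * b_inner

# Studio (50 blocks) as outer, Movie (100 blocks) as inner, 22 buffer pages
studio_outer = block_nested_loop_cost(50, 100, 22)
# Movie as outer, Studio as inner
movie_outer = block_nested_loop_cost(100, 50, 22)
print(studio_outer, movie_outer)
```

Under this model the two orderings happen to cost the same for the unfiltered tables; applying the selection on filmedAt before the join shrinks the Movie input and changes the comparison.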

What & How to submit

  • You are required to submit exactly one PDF file named after your pitt user name (e.g., pitt01.pdf). In addition to providing the answer, include your name and pitt ID at the top of the PDF file.

  • After preparing your solution, submit the PDF that contains it through the class Gradescope, either by navigating to the Gradescope Homepage and selecting the course CS 2550 from the Course Dashboard or by clicking the Gradescope Navigation option on our course Canvas page.

  • You must submit your assignment before the due date (8:00 PM, Apr. 22, 2024). There are no late submissions.

  • Note that you are required to use a drawing tool (such as MS-Word, MS Powerpoint, MS Visio, idraw, draw.io, etc.) to generate your diagrams and the text that explains them. Handwritten or hand-drawn diagrams and/or text will not be graded and will receive a zero. This includes scans, photos, and submissions drawn electronically with a smart pen.