FIT5195 Business intelligence and data warehousing 2022 Summer | Monash莫纳什大学 | final exam代考

Office Use Only
Summer Semester 2022
Examination Period
Faculty of Information Technology
EXAM CODES: FIT5195
TITLE OF PAPER: Business Intelligence and Data Warehousing – SAMPLE 3
EXAM DURATION: 2 hours 10 minutes or 130 minutes
THIS PAPER IS FOR STUDENTS STUDYING AT: (tick where applicable)
 Caulfield  Clayton  Parkville  Peninsula
 Monash Extension  Off Campus Learning  Malaysia  Sth Africa
 Other (specify)
During an exam, you must not have in your possession any item/material that has not been authorised for your
exam. This includes books, notes, paper, electronic device/s, mobile phone, smart watch/device, calculator, pencil
case, or writing on any part of your body. Any authorised items are listed below. Items/materials on your desk,
chair, in your clothing or otherwise on your person will be deemed to be in your possession.
No examination materials are to be removed from the room. This includes retaining, copying, memorising or
noting down content of exam material for personal use or to share with any other person by any means following
your exam.
Failure to comply with the above instructions, or attempting to cheat or cheating in an exam is a discipline
offence under Part 7 of the Monash University (Council) Regulations, or a breach of instructions under Part 3 of the
Monash University (Academic Board) Regulations.
AUTHORISED MATERIALS
OPEN BOOK  YES  NO
CALCULATORS  YES  NO
SPECIFICALLY PERMITTED ITEMS  YES  NO
if yes, items permitted are:
Candidates must complete this section if required to write answers within this paper
STUDENT ID: _ _ _ _ _ _ _ _ DESK NUMBER: _ _ _ _ __
Page 1 of 35
Page 2 of 35
Question 1:
Consider the following Student Enrolment star schemas: Star Schema Version-1 does not
have a dimension hierarchy, whereas Star Schema Version-2 has a dimension hierarchy: from
country to state, and to campus.
Star Schema Version-1
Star Schema Version-2
Page 3 of 35
Questions:
a. In contrasting both star schemas, is there any mistake in any of the two star schemas
(Note that Star Schema Version-1 does not have a hierarchy, and Star Schema
Version-2 does have)?

  • If yes, state which star schema, and explain your reason.
  • If no, also explain your reason.
    b. Compare both star schemas.
  • If there are mistakes in any (or both) star schemas, you need to draw the
    correct schema(s) first before comparing between each other.
  • If there are no mistakes in both star schemas, you can immediately compare
    the two star schemas.
    Also, when you compare the two star schemas, you need to use some sample data (in
    the fact and in certain dimensions) to support your arguments
    Write your answers here:
    Page 4 of 35
    Continue your answers here:
    Page 5 of 35
    Continue your answers here:
    Page 6 of 35
    Branch_ID
    Month_ID
    Book_ID
    Number_Books_Sold
    BOOKSALES_FACT
    Month_ID
    Month_DIM
    Branch_ID
    Branch_Address
    Branch_DIM
    Question 2
    This question is taken from the Bookshop Case Study on Temporal Data Warehousing. The
    following shows a star schema shows a fact table (number of books sold) and three
    dimensions (e.g. Month, Branch, and Book). The Book dimension is temporal dimension,
    which contains a temporal attribute, called Price, which is book price.
    Book_DIM
    Book_ID
    Book_Title
    Author
    Book_Price_DIM
    Book_ID
    Start_Date
    End_Date
    Price
    Remarks
    The tables for this star schema have been created and populated from the operational
    database. The sample data is as follows:
    Month_DIM Table
    Month_ID
    201503
    201502
    201501
    201412
    etc
    Branch_DIM Table
    Branch_ID Branch_Address
    City Melbourne Central Shopping Centre, Melbourne
    Chadstone 285 Dandenong Road, Chadstone
    Camberwell 199 Burke Road, Camberwell
    etc
    Page 7 of 35
    Book_DIM Table
    Book_ID Book_Title Author
    C1 CSIRO Diet CSIRO Team
    H6 Harry Potter 6 Rowling
    DV Da Vinci Code Dan Brown
    … … …
    Book_Price_DIM Table
    Book_ID Start_Date End_Date Price Remarks
    C1 201401 201407 $45.95 Full Price
    C1 201408 201410 $36.75 20% Discount
    C1 201411 201501 $23.00 Half Price
    C1 201502 201512 $45.95 Full Price
    H6 201401 201403 $21.95 Launching
    H6 201404 201501 $30.95 Full Price
    H6 201502 201512 $10.00 End of Product Sale
    DV 201401 201512 $27.95 Full Price
    … … … …
    BookSales_Fact Table
    Month_ID Branch_ID Book ID Number_Books_Sold
    201503 City C1 5
    201503 City H6 15
    201503 City DV 23
    201503 City …
    201503 Chadstone C1 15
    201503 Chadstone H6 3
    201503 Chadstone DV 2
    201503 Chadstone …
    201503 Camberwell C1 1
    201503 Camberwell H6 1
    201503 Camberwell DV 2
    201503 Camberwell …
    201503 … …
    … … …
    201412 City C1 15
    201412 City H6 6
    201412 City DV 6
    201412 City …
    201412 Chadstone C1 10
    201412 Chadstone H6 8
    201412 Chadstone DV 1
    201412 Chadstone …
    201412 Camberwell C1 18
    201412 Camberwell H6 3
    201412 Camberwell DV 2
    201412 Camberwell …
    201412 … …
    … … …
    Page 8 of 35
    Question:
    Write the SQL command to produce the following report (10 marks):
    Month_ID Branch_ID Book_ID Book_Title Author Price Number_Books
    _Sold
    201503 City C1 CSIRO Diet CSIRO Team $45.95 5
    201503 City H6 Harry Potter 6 Rowling $10.00 15
    201503 City DV Da Vinci Code Dan Brown $27.95 23
    201503 City …
    201503 Chadstone C1 CSIRO Diet CSIRO Team $45.95 15
    201503 Chadstone H6 Harry Potter 6 Rowling $10.00 3
    201503 Chadstone DV Da Vinci Code Dan Brown $27.95 2
    201503 Chadstone …
    201503 Camberwell C1 CSIRO Diet CSIRO Team $45.95 1
    201503 Camberwell H6 Harry Potter 6 Rowling $10.00 1
    201503 Camberwell DV Da Vinci Code Dan Brown $27.95 2
    201503 Camberwell …
    201503 … …
    … … …
    … … …
    201412 City C1 CSIRO Diet CSIRO Team $23.00 15
    201412 City H6 Harry Potter 6 Rowling $30.95 6
    201412 City DV Da Vinci Code Dan Brown $27.95 6
    201412 City …
    201412 Chadstone C1 CSIRO Diet CSIRO Team $23.00 10
    201412 Chadstone H6 Harry Potter 6 Rowling $30.95 8
    201412 Chadstone DV Da Vinci Code Dan Brown $27.95 1
    201412 Chadstone …
    201412 Camberwell C1 CSIRO Diet CSIRO Team $23.00 18
    201412 Camberwell H6 Harry Potter 6 Rowling $30.95 3
    201412 Camberwell DV Da Vinci Code Dan Brown $27.95 2
    201412 Camberwell …
    201412 … …
    … … …
    The structures of the above tables are as follows:
    SQL> desc
    Name
    Month_DIM;
    Null? Type
    MONTH_ID
    SQL> desc Branch_DIM;
    VARCHAR2(6)
    Name Null? Type
    BRANCH_ID
    BRANCH_ADDRESS
    VARCHAR2(15)
    VARCHAR2(50)
    Page 9 of 35
    SQL> desc Book_DIM;
    Name Null? Type
    BOOK_ID
    BOOK_TITLE
    AUTHOR
    VARCHAR2(5)
    VARCHAR2(20)
    VARCHAR2(20)
    SQL> desc Book_Price_DIM;
    Name Null? Type
    BOOK_ID
    START_DATE
    END_DATE
    PRICE
    REMARKS
    VARCHAR2(5)
    VARCHAR2(6)
    VARCHAR2(6)
    NUMBER(6,2)
    VARCHAR2(20)
    SQL> desc BookSales_Fact;
    Name Null? Type
    MONTH_ID
    BRANCH_ID
    BOOK_ID
    NUMBER_BOOKS_SOLD
    VARCHAR2(6)
    VARCHAR2(15)
    VARCHAR2(5)
    NUMBER
    Page 10 of 35
    Write your answer here:
    Page 11 of 35
    Question 3
    This question is taken from the Product-Sales-Supplier Case Study.
    The director of a company is interested in analyzing the statistics of its product sales history.
    The analysis is needed for identifying which products are popular, which suppliers supply
    those products, when is the best time to purchase more stock, etc. You are required to design
    a small Data Warehouse to keep track of the statistics.
    The director is particularly interested in analyzing the total sales (Quantity * Price) by
    product, customer locations (suburbs and postcodes), sales time periods (monthly and
    yearly), and supplier.
    The operational database currently has the following tables:
    Your snowflake schema will have a Bridge Table connecting Product Dimension and
    Supplier Dimension. A snowflake schema with a Bridge Table as shown below:
    Page 12 of 35
    The above snowflake schema is missing two attributes: WeightFactor attribute, and
    ListAGG attribute.
    Questions:
    a. Draw a new snowflake schema (call it Snowflake Schema version 2) for the above
    case study, but this new snowflake schema must use a WeightFactor attribute
    (without ListAGG attribute). You also need to show sample records in the Product
    Dimension, the Bridge Table, and the Supplier Dimension. The sample data must
    show the correct values for the Weight attribute. Make sure that in your snowflake
    schema, the attributes are clearly shown.
    b. Draw another snowflake schema (call it Star Schema version 3), which also has a
    Bridge Table and a WeightFactor attribute. But version-3 snowflake schema has the
    ListAGG attribute. You also need to show sample records in the Product
    Dimension, the Bridge Table, and the Supplier Dimension. The sample data must
    show the correct values for the Weight and ListAGG attributes.
    c. Write the SQL query to create the ProductDim table for the Star Schema version 3.
    TimeID
    Year
    Month
    1
    m
    m
    1
    m
    1 1
    CustLoc
    Suburb m
    Postcode
    m
    1
    SupplierID
    SupplierName
    SUPPLIERDIM
    ProductNo
    SupplierID
    PRODSUP-BRIDGE
    ProductNo
    ProductType
    PRODUCTDIM
    TIMEDIM
    CUSTLOCDIM
    TimeID
    ProductNo
    CustLoc
    TotalSales
    PRODUCTSALES
    FACT
    Page 13 of 35
    Write your answer here:
    Page 14 of 35
    Continue your answer here
    :
    Page 15 of 35
    Continue your answer here:
    Page 16 of 35
    Question 4
    This question is based on the Robcor case study. The following is the E/R diagram of the
    operational database in the Robcor case study:
    A star schema for the above operational database is shown as follows:
    Robcor Star Schema-1
    Page 17 of 35
    Questions:
    a. Is it possible to determine which level Robcor Star Schema-1 is? If it is possible, state the
    level and also give the reason. If it is not possible to state the level, then give the reason.
    b. Let’s have a look at the following star schema (Robcor Star Schema-2). Between the two
    star schemas (Robcor Star Schema-1 and Robcor Star Schema-2), which one has a higher
    level of aggregation? State the name of the star schema, whether it is Robcor Star Schema-1
    or Robcor Star Schema-2, and explain the reason.
    Robcor Star Schema-2
    Page 18 of 35
    Write your answers here:
    Page 19 of 35
    Question 5
    Given the following star schema:
    The tables (e.g. Fact and three dimensions) have been created and have also been populated
    with an adequate number of records. The table names and attributes are shown in the star
    schema above.
    Write the SQL for the following OLAP queries:
    a. Display the top 10 average prices by suburb of property
    b. Display the average price of properties by property type description and suburb. It is
    not required to show the subtotals or group totals or grand total
    Page 20 of 35
    Write your answer here:
    Page 21 of 35
    Question 6
    This question is about Top n% and Top k (such as Top 10% and Top 3) in OLAP. The tables
    are based on the ROBCOR data warehouse case study, which consists of one fact and three
    dimension tables: charter_fact, time, pilot, and model.
    SQL> desc charter_fact;
    Name Null? Type
    TIME_ID
    MOD_CODE
    EMP_NUM
    TOT_CHAR_HOURS
    TOT_FUEL
    REVENUE
    VARCHAR2(6)
    CHAR(10)
    NUMBER(10)
    NUMBER
    NUMBER
    NUMBER
    SQL> desc time;
    Name Null? Type
    TIME_ID
    TIME_YEAR
    TIME_MONTH
    CHAR(6)
    CHAR(4)
    CHAR(2)
    SQL> desc pilot;
    Name Null? Type
    EMP_NUM
    PIL_LICENSE
    PIL_RATINGS
    PIL_MED_TYPE
    PIL_MED_DATE
    PIL_PT135_DATE
    NUMBER(10)
    CHAR(25)
    CHAR(25)
    CHAR(1)
    DATE
    DATE
    SQL> desc model;
    Name Null? Type
    MOD_CODE
    MOD_MANUFACTURER
    MOD_NAME
    MOD_SEATS
    MOD_CHG_MILE
    MOD_CRUISE
    MOD_FUEL
    CHAR(10)
    CHAR(15)
    CHAR(20)
    FLOAT(126)
    NUMBER(19,4)
    FLOAT(126)
    FLOAT(126)
    Page 22 of 35
    Questions:
    a. Write the SQL command to display the time periods which had the revenue in the top 10%
    of the months.
    The result should be like this:
    TIME_ID TOTAL PERCENT_RANK
    199503 51144.16 1
    199408 49775.51 .975609756
    199510 48538.01 .951219512
    199409 47647.75 .926829268
    199703 45872.32 .902439024
    b. Write the SQL command to display the mod_code and mod_name of the two airplanes
    that have the largest total fuel used.
    The result should look like this:
    MOD_CODE MOD_NAME TOTAL MYRANK
    PA31-350 Navajo Chieftain 83790.5 1
    C-90A KingAir 61708.4 2
    Page 23 of 35
    Write your answer here:
    THE END