Part 1 - Data Requirements
You are required to choose your own data for this homework.
Data Sourcing
Your first step in the project is to get familiar with the data. You need to understand how it is structured and, most importantly, find the data dictionary associated with it. If no data dictionary exists, you will have to build one. The data dictionary should contain, for each field, its name, description, data type, and any constraints associated with it.
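If you have to build the data dictionary yourself, one possible approach is to keep it next to your scripts and export it to a file that opens in Excel or Google Sheets. The sketch below is only an illustration: the `FieldSpec` class, the `to_csv` helper, and the three taxi-trip fields are hypothetical and are not part of the assignment.

```python
import csv
import io
from dataclasses import asdict, dataclass, fields


@dataclass
class FieldSpec:
    """One row of the data dictionary."""
    name: str
    description: str
    datatype: str
    constraints: str


# Hypothetical fields for an illustrative taxi-trip dataset.
DATA_DICTIONARY = [
    FieldSpec("trip_id", "Unique identifier of the trip", "integer", "not null, unique"),
    FieldSpec("pickup_ts", "Pickup timestamp (UTC)", "timestamp", "not null"),
    FieldSpec("fare_amount", "Fare charged in USD", "decimal(10,2)", ">= 0"),
]


def to_csv(entries):
    """Serialize the dictionary to CSV text, one line per field."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=[f.name for f in fields(FieldSpec)],
        lineterminator="\n",
    )
    writer.writeheader()
    for entry in entries:
        writer.writerow(asdict(entry))
    return buffer.getvalue()
```

Writing the result of `to_csv(DATA_DICTIONARY)` to a `.csv` file gives you something you can upload to a spreadsheet and link to as a deliverable.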
You will need to source the data using one or more of the following methods:
Deliverables
Links to all data sources
Explanation of the data (where it comes from)
Link to the data dictionary (Excel, Google Sheets)
GitHub/Azure DevOps/Jira account created
Scripts that gather the data
Git repository created
Your scripts should be stored in a Git repository that is accessible to all members of your team and the professor.
Storage
Your next step is to choose the appropriate data store for your data. Remember that in the previous step you had to source the data using a script or a specific tool. The data stores of choice are the following: a database, S3 storage, or MongoDB. Make sure the data is properly stored in one place and not scattered. If need be, you should also record the date on which the data was stored. It is recommended that you watch the async videos.
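As a rough illustration of keeping the data in one place and recording when it was stored, the sketch below loads raw records into a single table and stamps each row with a load time. It uses SQLite from the Python standard library as a stand-in for whichever database you choose; the `raw_records` table and the `store_raw` function are hypothetical names.

```python
import json
import sqlite3
from datetime import datetime, timezone


def store_raw(conn, records, source, loaded_at=None):
    """Insert raw records into one table, stamping each row with its load time."""
    loaded_at = loaded_at or datetime.now(timezone.utc).isoformat()
    conn.execute(
        """CREATE TABLE IF NOT EXISTS raw_records (
               source    TEXT NOT NULL,
               payload   TEXT NOT NULL,
               loaded_at TEXT NOT NULL
           )"""
    )
    conn.executemany(
        "INSERT INTO raw_records (source, payload, loaded_at) VALUES (?, ?, ?)",
        [(source, json.dumps(record, sort_keys=True), loaded_at) for record in records],
    )
    conn.commit()
    return loaded_at
```

The same idea carries over to the other stores: in S3 the load date could go into the object key, and in MongoDB it could be a field on each document.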
Deliverables
Modeling
Once the data is stored, you will need to start modeling the data warehouse. Remember that a data warehouse has two main components: a fact table and a dimension table. The fact table must have a surrogate key, as must each dimension table. Modeling can be done using any tool.
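To make the surrogate-key requirement concrete, here is a minimal sketch of one fact table and one dimension table. It uses SQLite so that it runs anywhere; the table and column names (`dim_customer`, `fact_sales`, and so on) are invented for illustration and should be replaced by your own model.

```python
import sqlite3

SCHEMA = """
CREATE TABLE dim_customer (
    customer_key  INTEGER PRIMARY KEY AUTOINCREMENT,  -- surrogate key
    customer_id   TEXT NOT NULL UNIQUE,               -- natural key from the source
    customer_name TEXT
);
CREATE TABLE fact_sales (
    sales_key    INTEGER PRIMARY KEY AUTOINCREMENT,   -- surrogate key
    customer_key INTEGER NOT NULL REFERENCES dim_customer (customer_key),
    amount       REAL NOT NULL
);
"""


def build_demo():
    """Create the two tables in memory and load one customer with two sales."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    cur = conn.execute(
        "INSERT INTO dim_customer (customer_id, customer_name) VALUES (?, ?)",
        ("C-100", "Ada"),
    )
    customer_key = cur.lastrowid  # generated surrogate key, not the source id
    conn.executemany(
        "INSERT INTO fact_sales (customer_key, amount) VALUES (?, ?)",
        [(customer_key, 25.0), (customer_key, 17.5)],
    )
    conn.commit()
    return conn
```

The fact rows point at the dimension through `customer_key` rather than through the source system's `customer_id`, which is the point of a surrogate key: the warehouse keeps working even if the source identifiers change.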
Deliverables
Part 2 - Homework Steps
You are required to use the same data as in homework 1. You are free to change your data if you want to. However, you will then have to redo all of homework 1 with the new data, and that work will not be graded. You are free to use any cloud provider. You are required to check the feedback from the professor.
Transformation
Once you have stored the data, the next step is to transform it. Data should be transformed according to specific business rules. While transforming the data, you should consider the following.
This is only a partial list of what you can do; other transformations are possible. Remember also to update your data dictionary.
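A transformation step driven by business rules might look like the sketch below. The rules shown (trimming and upper-casing text, converting dates to ISO format, rejecting negative amounts, and dropping repeated ids) are assumptions chosen only for illustration, as are the `transform` function and the order fields; your own rules depend on your data.

```python
from datetime import datetime


def transform(rows):
    """Apply illustrative rules: normalize text, parse dates, drop bad or duplicate rows."""
    seen = set()
    clean = []
    for row in rows:
        order_id = row["order_id"].strip()
        amount = float(row["amount"])
        if amount < 0 or order_id in seen:
            continue  # hypothetical rule: reject negative amounts and repeated ids
        seen.add(order_id)
        clean.append({
            "order_id": order_id,
            "country": row["country"].strip().upper(),
            # Source dates are assumed to be MM/DD/YYYY; store them as ISO 8601.
            "order_date": datetime.strptime(row["order_date"], "%m/%d/%Y").date().isoformat(),
            "amount": round(amount, 2),
        })
    return clean
```

Each rule that changes a field's format or allowed values (for example the date format) is also a change to record in the data dictionary.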
You have the following options:
Deliverables
Modeling
Once you have done the transformation, you will need to update the model of the data warehouse. Remember that a data warehouse has two main components: a fact table and a dimension table. The fact table must have a surrogate key, as must each dimension table. Modeling can be done using any tool. Your data warehouse should be in Redshift.
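In Redshift, a common way to generate surrogate keys is an `IDENTITY(seed, step)` column. The helper below is a hypothetical sketch that only builds the `CREATE TABLE` text for you to review and run yourself; the function name and the example tables are assumptions. Note that Redshift treats primary and foreign key constraints as informational and does not enforce them, so your load scripts still have to guarantee uniqueness.

```python
def redshift_table_ddl(table, surrogate_key, columns, foreign_keys=()):
    """Build a CREATE TABLE statement with an IDENTITY surrogate key.

    columns: list of (name, type) pairs.
    foreign_keys: list of (column, referenced_table, referenced_column) triples.
    """
    lines = [f"    {surrogate_key} BIGINT IDENTITY(1,1)"]
    lines += [f"    {name} {sql_type}" for name, sql_type in columns]
    lines.append(f"    PRIMARY KEY ({surrogate_key})")
    lines += [
        f"    FOREIGN KEY ({col}) REFERENCES {ref_table} ({ref_col})"
        for col, ref_table, ref_col in foreign_keys
    ]
    return f"CREATE TABLE {table} (\n" + ",\n".join(lines) + "\n);"


# Hypothetical dimension and fact tables.
DIM_DDL = redshift_table_ddl(
    "dim_customer",
    "customer_key",
    [("customer_id", "VARCHAR(32) NOT NULL"), ("customer_name", "VARCHAR(256)")],
)
FACT_DDL = redshift_table_ddl(
    "fact_sales",
    "sales_key",
    [("customer_key", "BIGINT NOT NULL"), ("amount", "DECIMAL(10,2)")],
    [("customer_key", "dim_customer", "customer_key")],
)
```

Printing `DIM_DDL` and `FACT_DDL` gives statements you can paste into the Redshift query editor, creating the dimension table before the fact table that references it.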
Deliverables
Serving Data
You will be using an online visualization tool to show the data that you have transformed. You should apply all the visualization practices you have seen in all sessions. The following must be part of the visual:
As part of serving the data, you may also create an API that generates a CSV file containing a summary of the data. This is optional.
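For the optional API, a small WSGI application from the Python standard library is one way to return a CSV summary. Everything in this sketch is an assumption made for illustration: the `/summary.csv` path, the in-memory `ROWS`, and the choice of a per-country count and total as the summary. In practice the rows would come from your warehouse.

```python
import csv
import io
from collections import defaultdict

# Hypothetical transformed rows; in practice these would be queried from the warehouse.
ROWS = [
    {"country": "US", "amount": 10.5},
    {"country": "US", "amount": 4.5},
    {"country": "FR", "amount": 7.0},
]


def summary_csv(rows):
    """Return CSV text with the row count and total amount per country."""
    totals = defaultdict(lambda: [0, 0.0])
    for row in rows:
        totals[row["country"]][0] += 1
        totals[row["country"]][1] += row["amount"]
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["country", "row_count", "total_amount"])
    for country in sorted(totals):
        count, total = totals[country]
        writer.writerow([country, count, f"{total:.2f}"])
    return buffer.getvalue()


def app(environ, start_response):
    """WSGI app: /summary.csv returns the summary; any other path is a 404."""
    if environ.get("PATH_INFO") == "/summary.csv":
        body = summary_csv(ROWS).encode("utf-8")
        start_response("200 OK", [
            ("Content-Type", "text/csv; charset=utf-8"),
            ("Content-Length", str(len(body))),
        ])
        return [body]
    start_response("404 Not Found", [("Content-Type", "text/plain")])
    return [b"not found"]
```

To try it locally, pass `app` to `wsgiref.simple_server.make_server("", 8000, app)`, call `serve_forever()` on the result, and open `http://localhost:8000/summary.csv`.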
Deliverables