Task Description:
The application should provide the following functionality:
• A means to load the initial data set and transfer it to a suitable file format (where applicable)
• A means to load and save the dataset to file in the selected format
• A process for cleaning and preparing the initial data set, managing inconsistencies, errors, missing values and any changes required by the client
• A graphical user interface (or interfaces) for interacting with the data that enables the user to:
o Load and clean the initial data set
o Load and save the prepared data set
o Use the prepared data set to generate output and visualisations
o Manipulate the range of values used to generate output and visualisations
It should be assumed that this application will be able to handle other sets of data generated from the same source, i.e. data with the same column and row headings but containing different values and anomalies. However, the application is not required to be generic (that is, to work with multiple unknown data sets). Given this, best practice regarding code reuse, encapsulation and a well-defined programming interface should be applied where applicable.
The client initially wants the application to perform the following actions on the data:
• Outputs should not include any data from vendors that have a ‘PROGRAM STATUS’ of INACTIVE. (basic *)
• The ‘PE DESCRIPTION’ column contains information on whether the vendor is low, medium or high risk. Extract this into a new column and retain all other information within the original column. E.g. for ‘FOOD MKT RETAIL (1-1,999 SF) LOW RISK’, extract the risk level (‘LOW RISK’, greyed in the original brief) and retain the rest. (advanced *)
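The two cleaning actions above could be sketched with pandas as follows. This is a minimal sketch, not a prescribed solution: the function name `prepare`, the new column name ‘RISK LEVEL’ and the pattern (which also accepts MODERATE, in case the data uses that label instead of MEDIUM) are assumptions; only ‘PROGRAM STATUS’ and ‘PE DESCRIPTION’ come from the brief.

```python
import pandas as pd

# Assumption: the risk label always sits at the end of the description,
# e.g. "... LOW RISK". MODERATE is accepted as a possible spelling of medium.
RISK_PATTERN = r"\s*\b(LOW|MODERATE|MEDIUM|HIGH) RISK\s*$"


def prepare(frame):
    """Drop INACTIVE vendors and split the risk level out of 'PE DESCRIPTION'."""
    # Compare the status in a case- and whitespace-tolerant way.
    status = frame["PROGRAM STATUS"].astype(str).str.strip().str.upper()
    result = frame.loc[status != "INACTIVE"].copy()

    description = result["PE DESCRIPTION"]
    # Hypothetical new column: holds e.g. "LOW"; rows without a label get NaN.
    result["RISK LEVEL"] = description.str.extract(RISK_PATTERN, expand=False)
    # Retain everything else from the original description.
    result["PE DESCRIPTION"] = (
        description.str.replace(RISK_PATTERN, "", regex=True).str.strip()
    )
    return result
```

For the example in the brief, this would leave ‘FOOD MKT RETAIL (1-1,999 SF)’ in ‘PE DESCRIPTION’ and place ‘LOW’ in the new column.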
The client's initial information need is to generate the following and output the results using an appropriate representation:
• Produce the mean, mode and median for the inspection score per year:
o Across the whole data set (basic *)
o For each ‘Facility City’ (basic *)
• Produce a suitable graph that displays the number of establishments that have committed each type of violation. You may need to consider how you group this data to make the visualisation/s feasible. (basic *)
• Determine if there is any significant correlation between the number of violations committed per vendor and their most recent inspection score. You will need to select an appropriate visualisation to demonstrate this. (advanced*)
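The information needs above could be computed along the following lines before any plotting. This is a sketch under stated assumptions: the column names ‘ACTIVITY DATE’, ‘SCORE’, ‘FACILITY ID’ and ‘VIOLATION CODE’ are hypothetical defaults (the brief does not name them), the violations table is assumed to carry a vendor identifier on every row, and all function names are illustrative.

```python
import pandas as pd


def yearly_score_stats(frame, date_col="ACTIVITY DATE", score_col="SCORE", by=None):
    """Mean, median and mode of the score per year, optionally per group."""
    data = frame.copy()
    data["YEAR"] = pd.to_datetime(data[date_col]).dt.year
    keys = ["YEAR"] if by is None else [by, "YEAR"]

    def first_mode(scores):
        # A group can have several modes; report the smallest one.
        modes = scores.mode()
        return modes.iloc[0] if not modes.empty else float("nan")

    return data.groupby(keys)[score_col].agg(
        mean="mean", median="median", mode=first_mode
    )


def establishments_per_violation(violations, facility_col="FACILITY ID",
                                 code_col="VIOLATION CODE"):
    """Number of distinct establishments that committed each violation type."""
    counts = violations.groupby(code_col)[facility_col].nunique()
    return counts.sort_values(ascending=False)


def violations_vs_latest_score(violations, inspections,
                               facility_col="FACILITY ID",
                               date_col="ACTIVITY DATE", score_col="SCORE"):
    """Pearson correlation between violation count and most recent score."""
    counts = violations.groupby(facility_col).size().rename("violations")
    dated = inspections.assign(_when=pd.to_datetime(inspections[date_col]))
    # After sorting by date, the last score per vendor is the most recent one.
    latest = (
        dated.sort_values("_when")
        .groupby(facility_col)[score_col]
        .last()
        .rename("latest_score")
    )
    # Vendors with no recorded violations count as zero.
    paired = latest.to_frame().join(counts).fillna({"violations": 0})
    return paired["violations"].corr(paired["latest_score"])
```

Calling `yearly_score_stats(df)` covers the whole data set, and `yearly_score_stats(df, by="FACILITY CITY")` (adjusting the name to match the actual header) gives the per-city figures. The Series returned by `establishments_per_violation` can be passed to Matplotlib or Seaborn as a bar chart, and a scatter plot of the paired values would suit the correlation. Note that `corr` returns only the coefficient; judging significance needs a p-value, which `scipy.stats.pearsonr` provides if SciPy is available (it is not listed in the brief).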
Non-functional requirements
• The GUI provides appropriate feedback to confirm or deny a user’s actions
• The application manages internal and user-generated errors
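One way to meet both requirements is to have every file operation return a message the GUI can display, rather than letting exceptions escape. The sketch below is illustrative only: `load_with_feedback` is a hypothetical helper, and how the message is shown (status label, dialog box) depends on the GUI toolkit chosen.

```python
import pandas as pd


def load_with_feedback(path):
    """Return (frame, message); frame is None when loading failed."""
    try:
        frame = pd.read_csv(path)
    except FileNotFoundError:
        return None, f"File not found: {path}"
    except pd.errors.EmptyDataError:
        return None, f"File is empty: {path}"
    except pd.errors.ParserError as error:
        return None, f"Could not parse {path}: {error}"
    # Success message confirms the action to the user.
    return frame, f"Loaded {len(frame)} rows from {path}"
```

The GUI callback can then test whether the returned frame is `None` to decide between a confirmation and an error display.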
Technical requirements
• The application is built using Python 3.7.*
• The application uses advanced APIs such as NumPy, pandas, Seaborn, Matplotlib
• The application runs within the Anaconda environment using Jupyter Notebook.
The data needs to be transformed into a MongoDB form or JSON.
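A minimal sketch of that transformation, assuming the prepared data is held in a pandas DataFrame: each row becomes one dictionary, which is the shape MongoDB stores as a document and which serialises directly to JSON. The helper names `to_documents`, `save_json` and `load_json` are hypothetical.

```python
import json

import pandas as pd


def to_documents(frame):
    """Convert each row into a dict (one MongoDB-style document per row)."""
    records = frame.to_dict(orient="records")
    # Replace missing values with None so they are written as JSON null;
    # Python's json module would otherwise emit NaN, which is not valid JSON.
    return [
        {key: (None if pd.isna(value) else value) for key, value in row.items()}
        for row in records
    ]


def save_json(frame, path):
    """Write the data set to a JSON file as a list of documents."""
    with open(path, "w", encoding="utf-8") as handle:
        # default=str is a fallback for values json cannot encode (e.g. dates).
        json.dump(to_documents(frame), handle, indent=2, default=str)


def load_json(path):
    """Read a file written by save_json back into a DataFrame."""
    with open(path, encoding="utf-8") as handle:
        return pd.DataFrame(json.load(handle))
```

If a MongoDB server is used instead of a file, the list returned by `to_documents` has the form that a driver such as PyMongo accepts in `insert_many`; that choice of driver is an assumption, not something the brief specifies.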
© Copyright 2026 My Uni Papers – Student Hustle Made Hassle Free. All rights reserved.