The Data
For Assignment 3 we will use a selection of data from the Ontario census, reported at the dissemination area (DA) level. The data are stored as a shapefile whose columns describe a variety of socio-economic variables for Ontario census DAs from the 2016 census.
The variables are described in more detail in the metadata alongside the shapefile.
Your Assignment
Your assignment is to perform a multivariate cluster analysis on the Ontario census data to generate a regionalization of Ontario’s DAs for the purpose of socio-economic profiling. You can frame your assignment in any way you choose (e.g., business profiling, health, marketing, etc.).
Tasks
1. Choose a single economic area (EA) to focus on and subset the data accordingly.
2. Select a subset of 6 variables from the dataset to focus on for your analysis.
• At least one should be derived by combining two or more variables in some meaningful way.
3. Report initial descriptive statistics for each variable.
4. Show two (2) plots that are meaningful in the context of your problem.
5. Show two (2) choropleth maps that are meaningful in the context of your problem.
6. Scale your variables appropriately so that you remove the effect of different units/scales.
7. Multivariate clustering
• Using your chosen variables, run a multivariate cluster analysis (k-means) for k = 2, 3, ..., 10.
• Record the cluster statistics, including the Between/Total SS (sum of squares) ratio, for each value of k.
• Calculate the Davies-Bouldin index for each value of k.
• Choose the “best” clustering level.
8. Report and comment on the summary statistics for the clusters at your chosen value of k:
• cluster sizes
• cluster centres
• within-cluster SS
9. Create meaningful names for each cluster in your final clustering based on the above summaries.
10. Create a bivariate scatterplot showing two of your 6 variables (you choose) with each point colored by cluster name.
11. Create a final cluster map that shows the distribution of your clusters across southern Ontario (within your EA).
• use the sf plotting functions
• use appropriate colors, titles, legend, etc.
12. Comment/interpret your findings in terms of how you framed your original problem.
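The scaling and clustering steps (tasks 6 and 7) can be sketched in code. The example below is only an illustration under stated assumptions: it is written in plain Python for portability, although the reference to the sf plotting functions suggests the assignment itself is meant to be done in R; all function names are hypothetical; the data values are made up; scaling uses z-scores with the sample standard deviation; and the Davies-Bouldin index is computed with Euclidean distances, taking each cluster's scatter as the mean distance of its members to the cluster centre.

```python
import math
import random


def zscore_columns(rows):
    """Scale every column to mean 0 and sample standard deviation 1."""
    n = len(rows)
    cols = list(zip(*rows))
    means = [sum(c) / n for c in cols]
    sds = [math.sqrt(sum((x - m) ** 2 for x in c) / (n - 1))
           for c, m in zip(cols, means)]
    return [[(x - m) / s for x, m, s in zip(r, means, sds)] for r in rows]


def centroid(points):
    """Component-wise mean of a non-empty list of points."""
    return [sum(c) / len(points) for c in zip(*points)]


def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(rows, k, seed=0, iters=100):
    """Plain Lloyd's algorithm; returns one cluster label per row."""
    rng = random.Random(seed)
    centres = [list(r) for r in rng.sample(rows, k)]  # k random rows as starts
    labels = None
    for _ in range(iters):
        new = [min(range(k), key=lambda j: sq_dist(r, centres[j])) for r in rows]
        if new == labels:  # assignments stopped changing
            break
        labels = new
        for j in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centre
                centres[j] = centroid(members)
    return labels


def group(rows, labels):
    """Collect the rows belonging to each cluster label."""
    clusters = {}
    for r, lab in zip(rows, labels):
        clusters.setdefault(lab, []).append(r)
    return clusters


def between_total_ratio(rows, labels):
    """Between SS / Total SS, where Between SS = Total SS - Within SS."""
    grand = centroid(rows)
    total_ss = sum(sq_dist(r, grand) for r in rows)
    within_ss = 0.0
    for members in group(rows, labels).values():
        c = centroid(members)
        within_ss += sum(sq_dist(r, c) for r in members)
    return (total_ss - within_ss) / total_ss


def davies_bouldin(rows, labels):
    """Mean over clusters of the worst (scatter_i + scatter_j) / centre distance."""
    clusters = group(rows, labels)
    cents = {k: centroid(m) for k, m in clusters.items()}
    scatter = {k: sum(math.dist(r, cents[k]) for r in m) / len(m)
               for k, m in clusters.items()}
    worst = [max((scatter[i] + scatter[j]) / math.dist(cents[i], cents[j])
                 for j in clusters if j != i)
             for i in clusters]
    return sum(worst) / len(worst)


if __name__ == "__main__":
    # Made-up values standing in for two census variables.
    data = [[0.0, 1.0], [1.0, 2.0], [0.5, 1.5],
            [10.0, 20.0], [11.0, 21.0], [10.5, 22.0]]
    scaled = zscore_columns(data)
    for k in range(2, 4):  # the assignment asks for k = 2, ..., 10
        labels = kmeans(scaled, k)
        print(k, between_total_ratio(scaled, labels), davies_bouldin(scaled, labels))
```

When comparing values of k, a higher Between/Total SS ratio means the clusters explain more of the total variation, but the ratio can only rise as k grows, so it is usually read together with the Davies-Bouldin index, where lower values indicate more compact and better separated clusters. Because k-means depends on its random starting centres, results can differ between seeds; running several starts per k is a common precaution.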