Exploratory Data Analysis Report Writing - IT Assignment Help

Download Solution Order New Solution
Assignment Task:

Task:

Exam questions
Part 1: Exploratory data analysis
NB: for all questions below, display the R code in the Rmarkdown file. An answer without R code will be counted incorrect.
1. Read the file with chemical measurement values ??in R.
2. Change the column names of the dataset to more informative column names (note: do not use spaces in the column names). See the table above (in the introduction) for the contents of the different columns.
3. Calculate the average concentrations for tartaric acid, acetic acid and citric acid. Do this for: a) all bottles of wine (red and white); b) all bottles of red wine.
4. Sulfur dioxide in wine can be 'free' or 'bound'. The data gives the total amount of sulfur dioxide and the amount of free sulfur dioxide. Add an extra column to the dataset containing the amount of bound sulfur dioxide.
5. Determine the number of missing values ??for each column in the dataset.
6. The dataset contains data for bottles of white wine and for bottles of red wine. Determine the percentage of bottles with red wine.
7. Determine the number of bottles with: a) a low alcohol percentage (<10%); b) a high alcohol percentage (> 12%).
8. Which type of wine (red or white) has the highest number of bottles with a pH above 3.3? State your conclusion in the text of the Rmarkdown file.

9. Create a scatter plot in which you plot the total amount of sulfur dioxide against the amount of free sulfur dioxide. In this scatter plot, indicate in color which data points belong to red wine and which data points belong to white wine. Also give the chart clear axis labels, a legend and a title.

10. Create a bar graph for the mean values ??of sodium chloride and potassium sulfate. Calculate these mean values ??separately for the red wine and for the white wine. Indicate in color which averages in the bar chart belong to red wine and which averages belong to white wine. Also give the chart clear axis labels and a title.

Part 2: is there a difference in residual sugar concentration between red and white wine?
In this section you will see if there is a difference in the residual sugar concentration between red and white wine. You will perform statistical tests in R and make a graph.

Note: For all questions below, display the R code in the Rmarkdown file (unless only a conclusion is requested). An answer without R code is counted incorrect.

1. Perform an unpaired t-test to answer the research question. Assume normally distributed data (Shapiro-Wilk test p-value> 0.05) and an equal variance (Levene's test p-value> 0.05).
2. Indicate your conclusion in the text of the Rmarkdown file.
3. Plot the data that supports your conclusion. Give the chart clear axis labels and a title.

Part 3: is there a difference in quality between red and white wine?
In this section you will see if there is a difference in the quality scores between red and white wine. You will perform statistical tests in R and create a table.

Note: For all questions below, display the R code in the Rmarkdown file (unless only a conclusion is requested). An answer without R code is counted incorrect.

1. Read the file with the quality scores in R and add the quality scores to the dataset with the chemical measurements (use a 'left join' for this last step).
2. Make a table in which you give the number of bottles per quality score. In the table, differentiate between red and white wine (ie make a separate column for each type of wine).
3. Take a wilcox test to answer the research question. Assume unpaired data.
4. Indicate your conclusion in the text of the Rmarkdown file.

Part 4: knit your Rmarkdown file into an HTML file
In this section you will convert your Rmarkdown file to an HTML file. Make sure that the code, output of the code and graphs are visible in the HTML file. Also make sure that warnings and other R messages are not displayed.

 

This IT Assignment has been solved by our IT  Experts at My Uni Paper. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.

Be it a used or new solution, the quality of the work submitted by our assignment experts remains unhampered. You may continue to expect the same or even better quality with the used and new assignment solution files respectively. There’s one thing to be noticed that you could choose one between the two and acquire an HD either way. You could choose a new assignment solution file to get yourself an exclusive, plagiarism (with free Turnitin file), expert quality assignment or order an old solution file that was considered worthy of the highest distinction.

Get It Done! Today

Country
Applicable Time Zone is AEST [Sydney, NSW] (GMT+11)
+

Every Assignment. Every Solution. Instantly. Deadline Ahead? Grab Your Sample Now.