Subject Code: BUS5WB
Report Writing Assessment Answer
Assignment Task: BUS5WB
What you are required to do
1. HDInsight to aggregate reviews
Develop an aggregate of these reviews using your knowledge of Hadoop and MapReduce in Microsoft HDInsight.
a) Follow the same approach as the Big Data analytics workshop (using the word count method in HDInsight) to determine the contributory words for each level of rating and sentiment category.
b) Present the workflow of using HDInsight (you may use screen captures) along with a summary of findings for each level of rating and sentiment category. MapReduce documentation for HDInsight is available here.
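The word count idea in task 1 can be illustrated locally before running anything on a cluster. The following is a minimal sketch in plain Python, not the workshop's HDInsight job: it imitates the map and reduce phases and keys each count by rating and sentiment category. The sample records, the tokenising rule and the function names (`mapper`, `reducer`, `word_counts`) are assumptions made for illustration and are not taken from the assignment dataset.

```python
import re
from collections import defaultdict

# Hypothetical sample records: (rating, sentiment, review text).
REVIEWS = [
    (5, "positive", "Great location and great staff"),
    (1, "negative", "Dirty room and rude staff"),
    (5, "positive", "Clean room, great breakfast"),
]


def mapper(rating, sentiment, review):
    """Map phase: emit ((rating, sentiment, word), 1) for every word."""
    for word in re.findall(r"[a-z']+", review.lower()):
        yield (rating, sentiment, word), 1


def reducer(pairs):
    """Reduce phase: sum the emitted counts for each key."""
    totals = defaultdict(int)
    for key, count in pairs:
        totals[key] += count
    return dict(totals)


def word_counts(reviews):
    """Run the map phase over all records, then reduce the emitted pairs."""
    pairs = (pair for record in reviews for pair in mapper(*record))
    return reducer(pairs)


counts = word_counts(REVIEWS)

# List the most frequent words for one rating and sentiment category.
top_positive = sorted(
    ((word, n) for (rating, sentiment, word), n in counts.items()
     if rating == 5 and sentiment == "positive"),
    key=lambda item: item[1],
    reverse=True,
)
print(top_positive[:3])
```

Keying on rating and sentiment together means a single pass yields the contributory words for every category; a real job would typically also drop stop words such as "and" before ranking.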
2. Azure Machine Learning for sentiment analysis
Use Azure ML Studio to analyse customer reviews based on sentiment score. Use the ‘review’ field for text clustering. In the Filter-based feature selection module, use the ‘sentiment’ field in order to cluster reviews based on sentiment score. Download the cluster outputs into a CSV file to interpret the results and derive insights. You will need to calibrate the algorithm's parameters by trying different values of Number of Centroids and Distance Metric until the clusters are meaningful. Do not select the sentiment, rating or posted columns when training the clustering model; use only the preprocessed hashing features.
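The effect of the two parameters named above, Number of Centroids and Distance Metric, can be explored with a small local sketch. The code below is plain Python and does not use or reproduce the Azure ML Studio modules: `hash_features` is a stand-in for the feature hashing step (it uses CRC32, which is an assumption and not the hash the Studio module applies), and `kmeans` is a basic K-means with a deterministic start. All function names, the bucket count and the sample points are hypothetical.

```python
import math
import re
import zlib


def hash_features(text, n_buckets=16):
    """Map each word to one of n_buckets columns and count occurrences."""
    vec = [0.0] * n_buckets
    for word in re.findall(r"[a-z']+", text.lower()):
        vec[zlib.crc32(word.encode("utf-8")) % n_buckets] += 1.0
    return vec


def euclidean(a, b):
    """Straight-line distance between two vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine_distance(a, b):
    """1 - cosine similarity; zero vectors are treated as maximally distant."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm if norm else 1.0


def kmeans(points, k, distance=euclidean, iterations=20):
    """Basic K-means; the first k points are the initial centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: distance(p, centroids[c]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical reviews, clustered on hashed features only.
reviews = ["great staff", "dirty room", "great staff", "dirty room"]
features = [hash_features(r) for r in reviews]
labels_euclid, _ = kmeans(features, k=2, distance=euclidean)
labels_cosine, _ = kmeans(features, k=2, distance=cosine_distance)
print(labels_euclid, labels_cosine)
```

Only the hashed vectors are passed to `kmeans`, which mirrors the instruction to leave sentiment, rating and posted out of training. Rerunning with a different `k` or `distance` shows how cluster membership shifts, which is the calibration the task asks for.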
3. Findings
Summarise your findings from 1) and 2), on user rating, hotel rating and sentiment towards accommodation options in Vietnam. Consider the challenges you faced in conducting Big Data analytics on a real-life text dataset.
Deliverables
1. A report on the three activities.
- The report should be compiled in Microsoft Word only, font size 11.
- The report should not exceed 10 pages. Diagrams, tables and any other visualisations/screen captures should be in the main body of the report.
- Make realistic assumptions on missing information and state these in the report.
2. A compressed folder of any other files that would be useful to assess your work.
HDInsight to aggregate reviews 5 marks
- A minimal attempt at using HDInsight.
- A basic attempt at using HDInsight.
- A good attempt at using HDInsight.
- A complete attempt at deriving insights using HDInsight.
Azure Machine Learning for sentiment analysis 10 marks
- A minimal attempt at clustering and analysis.
- A basic attempt at clustering and cluster analysis.
- A good attempt at text clustering and cluster analysis.
- A complete attempt at clustering and cluster analysis.
Findings 10 marks
- Basic summary of findings.
- A fair effort that captures some of the key findings.
- A good effort, accounting for most potential findings.
- A comprehensive effort, accounting for all findings and further analysis.