POLS0010: Data Analysis - Switzerland’s Gun Control Referendum - Estimating Constituency-Level - Report Writing Assessment Answer

Download Solution Order New Solution
Internal Code: 1AGDFC Code: POLS0010

Data Analysis Report Writing Assessment Answer

Assignment Task: POLS0010 Both questions require you to write a brief report. It is up to you how you structure the reports, but it is advisable to keep introductory material to a minimum, given the word limit. Your reports should discuss your methods, your results and the conclusions that you draw from them. QUESTION A: Switzerland’s Gun Control Referendum - POLS0010 In Switzerland in 2011, a legally binding referendum was held that would have banned people from keeping guns at home, as well as introducing stricter background checks for those wishing to purchase them. The referendum failed, with 56% of voters opposing it. For this question, suppose that the referendum is going to be repeated next year, and the pro-gun-control campaign asks for your advice. Specifically, the campaign group want to run an advertising campaign targeted at groups who are most likely to support the new referendum, to persuade them to turn out and vote. Your job is to tell them which types of people are most supportive of gun control. To help measure the likely effectiveness of their advertising, they also want to know how much each characteristic matters in explaining support. You’ll use a survey of voters taken after the first referendum that asked about support for gun control. You need to: POLS0010
  1. Choose a logit model that predicts support for gun control, carefully justifying your selection of variables for the model. You must use a minimum of three independent variables. ii) Present the model’s findings in ways that clearly explain how much the variables matter in explaining support for gun control. POLS0010
You should present your approach and your findings in the form of a brief report. It should conclude by explaining which types of people you think the campaign should target. The dataset is called “s” and is contained in the file “swiss.Rda”. It contains the following variables for each individual in the survey: POLS0010 Variable name Variable description VoteYes dependent variable: =1 if respondent voted for gun control, 0 otherwise female =1 if female, 0 otherwise age in years LeftRight individual's own assessment of how right-wing they are on a scale from  0-9, where higher values mean more right-wing [treat this as a continuous variable] trust respondent's trust in government. =1 if trusts government, 0 otherwise university =1 if respondent has a university degree, 0 otherwise urban =1 if respondent lives in an urban area, 0 otherwise suburb =1 if respondent lives in a suburban area, 0 otherwise. POLS0010 QUESTION B: Estimating Constituency-Level Results from the EU Referendum - POLS0010 In the 2016 UK referendum on leaving the EU, the results of the vote were not released for individual electoral constituencies. However, many scholars would like to know why people voted to leave the EU, and how support for leaving differed across constituencies. One previous study has already estimated constituency-level support for ‘leave’ in an authoritative way. Your tasks in this question are (i) to produce estimates of the percentage of voters that voted ‘leave’ in every constituency using multilevel modeling and post-stratification that are as close as possible to this existing set of estimates, as measured by the Mean Absolute Error (MAE), and (ii) to use your results to explain why people voted to leave. POLS0010 You need to:
  1. Estimate an appropriate logistic multilevel model explaining voting for leave, using the predictors in the dataset.1
  2. Present the multilevel model results and interpret how the variables affect voting to leave the EU (Note: you do not need to discuss statistical significance). iii) Produce post-stratified estimates of the percentage of people who voted ‘leave’ in all 631 constituencies in England, Scotland and Wales iv) Compare your results to the existing estimates using the Mean Absolute Error.  POLS0010
You should present and explain your approach and results in a brief report, explaining why your estimates do or not perform well compared to the existing estimates. Note: if you cannot get very close to the existing results, do not worry. Your grade depends on the quality of your analysis, presentation and interpretation, not how close your results are to the existing estimates. POLS0010 The survey data is called “e” and is in the file “eusurvey.Rda”. It comes from the 2017 British Election Study and it contains the following variables: Variable name Variable description cname constituency name ccode constituency code leave dependent variable: =1 if respondent voted to leave EU, 0 if respondent  voted to remain in the EU votecon =1 if respondent voted Conservative in the 2015 election, 0 otherwise voteukip =1 if respondent voted UKIP in the 2015 election, 0 otherwise [note:  UKIP is the United Kingdom Independence Party, which campaigns in favour of the UK leaving the EU] female =1 if female, 0 otherwise age in years highed =1 if respondent is educated to degree level or higher, 0 otherwise lowed =1 if respondent has no educational qualifications, 0 otherwise c_con15 percent vote for Conservative party in the constituency, 2015 election c_ukip15 percent vote for UKIP in the constituency, 2015 election c_unemployed constituency unemployment rate, percent c_whitebritish percent of constituency population who are white British c_deprived percent of constituency population living in poverty  1 As in the practical exercise, use the option “nAGQ=0” to avoid estimation errors. POLS0010 Post-stratification data for the 631 constituencies is called “post” and is contained in the file “eupoststrat.Rda”. Each row contains one particular demographic group in one constituency. In addition to the variables in “e”, it also contains these variables: Variable name Variable description c_count Number of people in the demographic group c_total Number of people in the constituency percent of constituency represented by the demographic group Finally, the comparison data containing the existing estimates by constituency is called “est” and is in the file “existing_estimates.Rda”. In addition to the constituency name and code, it contains the existing estimate of the leave vote share for each constituency (called estimate). POLS0010 QUESTION C: Describing and Classifying Tweets - POLS0010 Many companies monitor social media posts in order to gauge how customers feel about their company and their competitors. For this question, imagine that you have been hired as a consultant by one of the major American airline companies to analyse tweets about airlines. They want to find out how people talk about airlines on Twitter, and then build a predictive tool that can classify tweets in future into ‘negative’ or ‘positive’ sentiment toward airlines, to help them respond better to their customers in real time. They have provided you with a dataset of 11,541 tweets about airlines that have been labelled as ‘negative’ or ‘positive’ by their staff. The dataset also identifies which airline each tweet is talking about. Your task is to prepare a brief report that describes the tweets, and recommends a classification method for future tweets. You need to: POLS0010
  1. Use appropriate tools to describe the tweets. In particular, what words are associated with negative or positive sentiment? How does word usage differ across different airlines?
  2. Use your analysis from (1) to build a short dictionary of negative and positive words describing airlines, then use it to classify tweets as ‘negative’ if they contain more negative than positive language, and ‘positive’ otherwise [code for creating your own dictionary is provided below]
  3. Use an appropriate supervised machine-learning method to classify the tweets into ‘negative’ and ‘positive’
  4. Compare the performance of your classifiers from (2) and (3), and use this analysis to decide which one would be the better classifier for the company to use for future tweets
Here is some advice for part (2): POLS0010
  • Your dictionary should contain a minimum of 5 words and a maximum of 15 words in each category
  • You are not expected to exhaustively compare the performance of different dictionaries. Instead, simply choose one dictionary based on your analysis from (1), and explain how you chose the words.
The dataset for this question is called “tweets” and is contained in the file “tweets.Rda”. It contains the following variables: POLS0010 Variable name Variable description text The text of each tweet sentiment Labeled sentiment of each tweet: 1=negative, 0=positive airline The airline company featured in the tweet: United, JetBlue, American Airlines, US Airways, Virgin America or Southwest. You should first create a corpus of tweets using the following code: speechCorpus <- corpus(tweets$text, docvars = tweets) Code for creating a dictionary: You can create a dictionary called “mydict” in R that contains two categories (‘negative’ and ‘positive’) using the following code: POLS0010 neg.words <- c() pos.words <- c() mydict <- dictionary(list(negative = neg.words, positive = pos.words)) You need to insert your chosen lists of negative and positive words in ‘neg.words’ and ‘pos.words’. This dictionary can then be used with the quanteda package in exactly the same way as any of the existing built-in dictionaries.
This POLS0010: Data Analysis Assessment has been solved by our IT experts at My Uni Paper. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.

Get It Done! Today

Country
Applicable Time Zone is AEST [Sydney, NSW] (GMT+11)
+

Every Assignment. Every Solution. Instantly. Deadline Ahead? Grab Your Sample Now.