Predicting The Survival Of Titanic Passengers - Titanic Data Set Assignment Help

Download Solution Order New Solution

Titanic Data Set Assignment Help

Assignment Task: Given the data set in Table 1, which is extracted from the Titanic dataset. The attributes are defined as follows:
  • PassengerId: The index of the passengers;
  • Survived: 1 represents survived and 0 represents not;
  • PassengerClass: The class of the passenger on ship;
  • Sex: Indicate a passenger’s sex;
  • Age: A passenger’s age at time of ship departure;
  • SiblingSpouse: The number of Siblings/Spouses that a passenger has on the ship;
  • ParentChild: The number of parents or children that are present on the ship;
The goal is to use the given data to train decision tree models to predict whether certain passengers on the Titanic will survive or not. You are required to:
  1. Cleaning the data and pre-process the data. The values of the ‘Age’ attribute are required to be categorised into three groups: Child (<=12); Teenage (>12 and <20); Adult (>=20). The values of the ‘SiblingSouse’ and ‘ParentChild’ attributes are required to be categorised into two groups respectively: zero (=0); and non-zero (>0).
  2. State the attributes and the class label for building the decision tree model. Select the first 40 records from the data set as a training set to learn a decision tree. The rest of data are treated as the test data set.
  3. Apply the basic Hunt’s Algorithm to train a decision tree model with appropriate explanation. Test the model using the test data set.
  4. An important issue in training decision tree model is to determine which attribute should be selected to split the tree. You are required to rebuild the decision tree model by applying the concept of Gini index and the greedy strategy when splitting the tree. Test the new model using the test data set.
  5. Discuss the results and the decision tree models in the case context and in broad context.
This Titanic Data Set Assignment has been solved by our Data Set experts at My Uni Paper. Our Assignment Writing Experts are efficient to provide a fresh solution to this question. We are serving more than 10000+ Students in Australia, UK & US by helping them to score HD in their academics. Our Experts are well trained to follow all marking rubrics & referencing style.

Get It Done! Today

Country
Applicable Time Zone is AEST [Sydney, NSW] (GMT+11)
+

Every Assignment. Every Solution. Instantly. Deadline Ahead? Grab Your Sample Now.