Question 1: Consider the following data set: price low low low low med med med med high high high maintenance low med lo
Question 1: Consider the following data set:

price  maintenance  capacity  airbag  profitable
low    low          2         no      yes
low    med          4         yes     no
low    low          4         no      yes
low    high         4         no      no
med    med          4         no      no
med    med          4         yes     yes
med    high         2         yes     no
med    high         5         no      yes
high   med          4         yes     yes
high   high         2         yes     no
high   high         5         yes     yes

(a) We are trying to predict 'profitable'. Please illustrate the steps to select the root of a decision tree if we use multi-way splits and the Gini index as the impurity measure. (8 points)

(b) For the same data set, suppose we decide to construct a decision tree using binary splits and the entropy impurity measure. Which of the following feature and split-point combinations would be the best to use as the root node, assuming that we treat each of the input features as unordered? We only consider the following four choices: (8 points)

(1) price - {low, med} | {high}
(2) maintenance - {high} | {med, low}
(3) maintenance - {high, med} | {low}
(4) capacity - {2} | {4, 5}
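Both parts come down to the same computation: split the rows by a feature, measure the impurity of 'profitable' in each group, and take the size-weighted average; the split with the lowest weighted impurity is the preferred root. The Python sketch below is one possible way to check the arithmetic, not an official solution. It assumes the flattened data lines up row by row (the i-th price and i-th maintenance value belong with the i-th capacity/airbag/profitable triple), and the names ROWS, FEATURES, CANDIDATES, multiway_split, and binary_split are hypothetical helpers introduced only for this illustration.

```python
from collections import Counter
from math import log2

# Rows as (price, maintenance, capacity, airbag, profitable).
# ASSUMPTION: the i-th price/maintenance value pairs with the i-th
# capacity/airbag/profitable triple from the question text.
ROWS = [
    ("low", "low", 2, "no", "yes"),
    ("low", "med", 4, "yes", "no"),
    ("low", "low", 4, "no", "yes"),
    ("low", "high", 4, "no", "no"),
    ("med", "med", 4, "no", "no"),
    ("med", "med", 4, "yes", "yes"),
    ("med", "high", 2, "yes", "no"),
    ("med", "high", 5, "no", "yes"),
    ("high", "med", 4, "yes", "yes"),
    ("high", "high", 2, "yes", "no"),
    ("high", "high", 5, "yes", "yes"),
]
FEATURES = {"price": 0, "maintenance": 1, "capacity": 2, "airbag": 3}
LABEL = 4  # column index of 'profitable'


def gini(labels):
    """Gini index: 1 - sum of squared class proportions."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def entropy(labels):
    """Entropy in bits: -sum p * log2(p) over the classes present."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def weighted_impurity(groups, impurity):
    """Average the impurity of each child, weighted by child size."""
    total = sum(len(g) for g in groups)
    return sum(len(g) / total * impurity(g) for g in groups if g)


def multiway_split(feature):
    """One child per distinct value of the feature (part a)."""
    col = FEATURES[feature]
    groups = {}
    for row in ROWS:
        groups.setdefault(row[col], []).append(row[LABEL])
    return list(groups.values())


def binary_split(feature, left_values):
    """Two children: values in left_values versus all others (part b)."""
    col = FEATURES[feature]
    left = [r[LABEL] for r in ROWS if r[col] in left_values]
    right = [r[LABEL] for r in ROWS if r[col] not in left_values]
    return [left, right]


# Part (a): weighted Gini of the multi-way split on each feature.
gini_scores = {f: weighted_impurity(multiway_split(f), gini) for f in FEATURES}
best_gini_feature = min(gini_scores, key=gini_scores.get)

# Part (b): weighted entropy of the four candidate binary splits.
CANDIDATES = {
    "(1) price {low, med} | {high}": ("price", {"low", "med"}),
    "(2) maintenance {high} | {med, low}": ("maintenance", {"high"}),
    "(3) maintenance {high, med} | {low}": ("maintenance", {"high", "med"}),
    "(4) capacity {2} | {4, 5}": ("capacity", {2}),
}
entropy_scores = {
    name: weighted_impurity(binary_split(feat, vals), entropy)
    for name, (feat, vals) in CANDIDATES.items()
}
best_entropy_split = min(entropy_scores, key=entropy_scores.get)

for name, score in gini_scores.items():
    print(f"Gini    {name}: {score:.4f}")
for name, score in entropy_scores.items():
    print(f"Entropy {name}: {score:.4f}")
print("Lowest weighted Gini:", best_gini_feature)
print("Lowest weighted entropy:", best_entropy_split)
```

Under the row alignment assumed above, the multi-way Gini comparison comes out very close between capacity (about 0.394) and maintenance (0.400), with capacity slightly lower, while among the four binary candidates option (3) has the lowest weighted entropy (about 0.811) because the {low} branch is pure. If the original table pairs the rows differently, these numbers change and should be recomputed.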