Question 1 (10 points)
Data: āAmes Iowa Housing Prices Datasetā (https://www.kaggle.com/datasets/emurphy/ames-iowahousing-prices-dataset)
a) Use multiple linear regression to fit the āSalePriceā column using data in the file ātrain1.csvā. Select up to 10 variables for your model. The choice is not key at this stage but a reasonable choice of a subset it needed with a brief selection exploration. Report the RMSE upon using the ātest1.csvā dataset.
b) Fit a decision tree to the set of independent variables chosen above using ātrain1.csvā. Report the RMSE from using ātest1.csvā. ( library(Metrics); rmseDt = rmse(actual=testpricestarget, predicted=predictedprices )
c) Fit a Random Forest to the set of independent variables chosen above using ātrain1.csvā. Report the RMSE from using ātest1.csvā.
d) Fit an XGBoost model to the set of independent variables chosen above using ātrain1.csvā. Report the RMSE from using ātest1.csvā.
e) Which parameters of XGBoost can you tune which change the RMSE from your experience?
f) Comment on the quality of the fits produced in each case? What can you conclude from this predictive task about the nature of the sales price prediction?
Question 2 (10 points)
Data: āPokemon for Data Mining and Machine Learningā
(https://www.kaggle.com/datasets/alopez247/pokemon)
a) Randomly select 70% of the rows as training data and the remaining rows as testing data.
b) Fit to the training data a decision tree, random forest and XGBoost model where the dependent variable is the āType_1ā column.
c) Report on the accuracy for predicting āType_1ā on the testing rows for each model.
d) Produce a confusion matrix for each model and then comment on the model performances. Discuss in terms of the confusion matrix and accuracy.
[SOLVED] STA4102 - Assignment-2
39.99 $
You will receive the following solution file(s) instantly after successful payment:
Question 1 (10 points)
Data: āAmes Iowa Housing Prices Datasetā (https://www.kaggle.com/datasets/emurphy/ames-iowahousing-prices-dataset)
a) Use multiple linear regression to fit the āSalePriceā column using data in the file ātrain1.csvā. Select up to 10 variables for your model. The choice is not key at this stage but a reasonable choice of a subset it needed with a brief selection exploration. Report the RMSE upon using the ātest1.csvā dataset.
b) Fit a decision tree to the set of independent variables chosen above using ātrain1.csvā. Report the RMSE from using ātest1.csvā. ( library(Metrics); rmseDt = rmse(actual=testpricestarget, predicted=predictedprices )
c) Fit a Random Forest to the set of independent variables chosen above using ātrain1.csvā. Report the RMSE from using ātest1.csvā.
d) Fit an XGBoost model to the set of independent variables chosen above using ātrain1.csvā. Report the RMSE from using ātest1.csvā.
e) Which parameters of XGBoost can you tune which change the RMSE from your experience?
f) Comment on the quality of the fits produced in each case? What can you conclude from this predictive task about the nature of the sales price prediction?
Question 2 (10 points)
Data: āPokemon for Data Mining and Machine Learningā
(https://www.kaggle.com/datasets/alopez247/pokemon)
a) Randomly select 70% of the rows as training data and the remaining rows as testing data.
b) Fit to the training data a decision tree, random forest and XGBoost model where the dependent variable is the āType_1ā column.
c) Report on the accuracy for predicting āType_1ā on the testing rows for each model.
d) Produce a confusion matrix for each model and then comment on the model performances. Discuss in terms of the confusion matrix and accuracy.
Related products
Assignment 3 – A3.py
35.00 $ Add to cartSOLVED:CMIS 242 – Intermediate Programming Programming Project 1
30.00 $ Add to cartProject 1 – Sound Blaster! (SOLVED)
25.00 $ Add to cartRelated in this category
More in this category
STA4102 Assignment #2
25.00 $ Add to cartAssignment 1 STA4102
49.99 $ Add to cartSTA4102 Assignment 4
30.00 $ Add to cartSTA4102 Homework #1
35.00 $ Add to cartSTA4102 Assignment 3
25.00 $ Add to cartSTA4102 – Final-Exam
34.99 $ Add to cart