Name: CS3120 Hw2 Logistic Regression for binary classification Solved
SKU: 57301
Price: 35.00 USD
Availability: InStock

Description

Rate this product

(NumPy, Pandas and data visualization packages are allowed.)

(SKLearn regression models are allowed!)

Reference code: 2_Logostic_ExSKLearn_Demo.py in blackboard

e.g. banknote or diabetes dataset

e.g. The following code helps import pima diabetes dataset

col_names = [‘pregnant’, ‘glucose’, ‘bp’, ‘skin’, ‘insulin’, ‘bmi’, ‘pedigree’, ‘age’, ‘label’]

# load dataset

pima = pd.read_csv(“pima-indians-diabetes-database.csv”, header=None, names=col_names)

Select 5 (if not possible then select 4) features from the chosen dataset. (1pt)

List all features you selected in your report.

For example, the following code will select two features

feature_cols = [‘pregnant’, ‘age’]

X = pima[feature_cols]

Use “train _test_split” from “sklearn.cross_validationtrain” to split test and training data by 40% testing + 60% training. (1pt)

the confusion matrix (1pt)

precision score, recall score, F score (3pts)

Copy your console output (these scores) to your report.

Plot out the ROC curve and print out the ROC_AUC score (sklearn.metrics.roc_curve() and sklearn.metrics.roc_auc_score() can be used.) (3pts)

——————————————————————————————————————–

Submit your report and your code in two different files.

Please include the required figure/plot in your report.

e.g.

File1: Assignment2_FirstnameLastname.doc/.pdf (this is the report)

File2: Assignment2_ FirstnameLastname.py (this is the code. only “.py” files accepted.

Assignment2_ FirstnameLastname.zip if you have multiple “.py” files.)

CS3120 Hw2 Logistic Regression for binary classification Solved