Logistic-Regression Homework 2-Banking Insurance Product 2 Solved

35.00 $

Category: Tags: , , , , ,
Click Category Button to View Your Next Assignment | Homework

You'll get a download link with a: zip solution files instantly, after Payment

Securely Powered by: Secure Checkout

Description

Rate this product

 

Banking Insurance Product – Phase 2: IP – F1.H2

Purpose

By responding to this Request for Proposal (RFP), the Proposer agrees that s/he has read and understood all documents within this RFP package.

Submission Details

Responders to this RFP should supply:

  • A business report up to 4 pages (not including cover page, table of contents, or any needed

    appendix), including any supporting plots and tables.

  • The commented code used to produce the results.

    The report should address all points described in the “Objective” section below. The report should be returned in the following way:

• Electronic (Submit via Moodle)

Background

The Commercial Banking Corporation (hereafter the “Bank”), acting by and through its department of Customer Services and New Products is seeking proposals for banking services. The Bank ultimately wants to predict which customers will buy a variable rate annuity product.

A variable annuity is a contract between you and an insurance company / bank, under which the insurer agrees to make periodic payments to you, beginning either immediately or at some future date. You purchase a variable annuity contract by making either a single purchase payment or a series of purchase payments.

A variable annuity offers a range of investment options. The value of your investment as a variable annuity owner will vary depending on the performance of the investment options you choose. The investment options for a variable annuity are typically mutual funds that invest in stocks, bonds, money market instruments, or some combination of the three. If you are interested in more information, see: http://www.sec.gov/investor/pubs/varannty.htm

The project will be broken down into 3 phases:

  • Phase 1 – Variable Understanding and Assumptions
  • Phase 2 – Variable Selection and Modeling Building
  • Phase 3 – Model Assessment and Prediction

    Objective – Phase 2

    The scope of services in this phase includes the following:

• For this phase use only the binned training data set.

• Based on your first report, the Bank has strategically binned each of the continuous variables in the data set to help facilitate any further analysis.

o Foranyvariablewithmissingvalues,changethedatatoincludeamissingcategory instead of a missing value for the categorical variable.

§ (HINT: Now all variables should be categorized (treated as categorical variables so no more continuous variable assumptions) and without missing values. Banks do this for more advanced modeling purposes that we will talk about in the spring.)

o Checkeachvariableforseparationconcerns.Documentinthereportandadjustany variables with complete or quasi-separation concerns.

• Build a main effects only binary logistic regression model to predict the purchase of the insurance product.

o Usebackwardselectiontodothevariableselection–theBankcurrentlyuses𝛼=0.002 and p-values to perform backward, but is open to another technique and/or significance level if documented in your report.

o Reportthefinalvariablesfromthismodelrankedbyp-value.
§ (HINT: Even if you choose to not use p-values to select your variables, you

should still rank all final variables by their p-value in this report.)

  • Interpret one variable’s odds ratio from your final model as an example.

    o Reportonanyinterestingfindingsfromyouroddsratiosfromyourmodel.
    § (HINT: This is open-ended and has no correct answer. However, you should get

    use to keeping an eye out for what you might deem important or interesting

    when exploring data to report in an executive summary.)

  • Investigate possible interactions using forward selection including only the main effects from

    your previous final model.
    o Reportthefinalinteractionvariablesfromthismodelrankedbyp-value.

  • Report your final logistic regression model’s variables by significance.
    o (HINT:Thesestepsareheretohelpyoubuildyourmodel,butnottotellyouwhich

    order to write your report. Consider the most important information when done with these questions and write your report accordingly.)

Data Provided

The following two sets of data are provided for the proposal:
• The training data set insurance_t_bin contains 8,495 observations and 47 variables.

o Allofthesecustomershavebeenofferedtheproductinthedatasetunderthevariable INS, which takes a value of 1 if they bought and 0 if they did not buy.

o Thereare46variablesdescribingthecustomer’sattributesbeforetheywereoffered the new insurance product.

o TheBankhasstrategicallybinnedeachofthecontinuousvariablesinthedatasetto help facilitate any further analysis.

§ (HINT: The original insurance_t and the new insurance_t_bin can be 1:1 row matched in case you wanted to know where the bins were split on.)

  • The validation data set insurance_v_bin contains 2,124 observations and 47 variables.
  • The table below describes the Roles and Description of the variables found in both data sets.

Name Model Role Description

ACCTAGE DDA DDABAL DEPAMT CASHBK CHECKS DIRDEP NSF NSFAMT PHONE TELLER SAV SAVBAL ATM ATMAMT POS POSAMT CD CDBAL IRA IRABAL LOC LOCBAL INV INVBAL ILS ILSBAL MM MMBAL MMCRED MTG MTGBAL CC CCBAL CCPURC SDB INCOME HMOWN LORES HMVAL AGE

Input Age of oldest account

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Input

Indicator for checking account

Checking account balance

Total amount deposited

Number of cash back requests

Number of checks written

Indicator for direct deposit

Number of insufficient fund issues

Amount of NSF

Number of telephone banking interactions

Number of teller visit interactions

Indicator for savings account

Savings account balance

Indicator for ATM interaction

Total ATM withdrawal amount

Number of point of sale interactions

Total amount for point of sale interactions

Indicator for certificate of deposit account

CD balance

Indicator for retirement account

IRA balance

Indicator for line of credit

LOC balance

Indicator for investment account

INV balance

Indicator for installment loan

ILS balance

Indicator for money market account

MM balance

Number of money market credits

Indicator for mortgage

MTG balance

Indicator for credit card

CC balance

Number of credit card purchases

Indicator for safety deposit box

Income

Indicator for home ownership

Length of residence in years

Value of home

Input Age

CRSCORE MOVED INAREA INS BRANCH RES

Input

Input

Input

Target

Input

Credit score

Recent address change

Indicator for local address

Indicator for purchase of insurance product

Branch of bank

Input Area classification

  • HW-2-nrnm7k.zip