ELE888 Lab 1: Bayesian Decision Theory

  1. Using a single discriminant function g(x2), design a two-class minimum-error-rate classifier (dichotomizer) from the given data to classify IRIS samples as either Iris Setosa or Iris Versicolour according to the feature sepal width.
  2. Using the shell program lab1.m, write a program that takes an individual sample value as input and returns the posterior probabilities and the value of g(x2). (A sketch of such a program follows this list.)
  3. Identify the class labels for the following feature values using your program, and indicate their respective posterior probabilities and discriminant function values: x2 = [3.3, 4.4, 5.0, 5.7, 6.3].
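
The following is a minimal sketch of such a program, not the provided lab1.m shell itself. It assumes that each class-conditional density p(x2|ωi) is modelled as a univariate Gaussian whose mean and variance are estimated from the training samples, that the priors are equal (P(ω1) = P(ω2) = 0.5), and that the single discriminant is g(x2) = P(ω1|x2) − P(ω2|x2). The function name classify_feature and the input vectors train1, train2 are illustrative.

    % classify_feature.m (illustrative name) -- sketch of a one-feature dichotomizer.
    % x              : feature value of the sample to classify
    % train1, train2 : training samples of the chosen feature for classes w1 and w2
    function [post, g] = classify_feature(x, train1, train2)
        % Estimate univariate Gaussian class-conditional densities from the training data
        mu1 = mean(train1);  var1 = var(train1);
        mu2 = mean(train2);  var2 = var(train2);
        p_x_w1 = exp(-(x - mu1)^2 / (2*var1)) / sqrt(2*pi*var1);
        p_x_w2 = exp(-(x - mu2)^2 / (2*var2)) / sqrt(2*pi*var2);

        % Equal priors, as assumed throughout this report
        Pw1 = 0.5;  Pw2 = 0.5;

        % Posterior probabilities via Bayes' rule
        evidence = p_x_w1*Pw1 + p_x_w2*Pw2;
        post = [p_x_w1*Pw1, p_x_w2*Pw2] / evidence;

        % Single discriminant function: g > 0 -> decide w1, g < 0 -> decide w2
        g = post(1) - post(2);
    end

Deciding w1 whenever g(x2) > 0 is exactly the minimum-error-rate rule, since it selects the class with the larger posterior probability.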

Since the sepal width of w1 (Iris Setosa) ranges from 3.0 to 4.4 while that of w2 (Iris Versicolour) ranges from 2.0 to 3.5, and the lowest of the given test values is 3.3, the posterior probability of w2 is much lower than that of w1 for every test value. Thus, the classifier always returns Iris Setosa (w1) when sepal width is used.

Table 1: Classification of the test samples (label, posterior probabilities, and discriminant function value) using sepal width (x2)

x2     Class   Posterior probabilities [P(w1|x), P(w2|x)]   Discriminant function g(x2)
3.3    w1      [0.8281, 0.1719]                             0.6563
4.4    w1      [1.0000, 0.0000]                             0.9999
5.0    w1      [1.0000, 0.0000]                             1.0000
5.7    w1      [1.0000, 0.0000]                             1.0000
6.3    w1      [1.0000, 0.0000]                             1.0000
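
Assuming the classify_feature sketch above and hypothetical sepal-width training vectors sw_train1 (Iris Setosa) and sw_train2 (Iris Versicolour), the rows of Table 1 could be reproduced with a driver loop such as:

    % Hypothetical driver: evaluate the test values from Table 1
    x_test = [3.3 4.4 5.0 5.7 6.3];
    for k = 1:length(x_test)
        [post, g] = classify_feature(x_test(k), sw_train1, sw_train2);
        fprintf('x2 = %.1f  P = [%.4f, %.4f]  g(x2) = %.4f\n', ...
                x_test(k), post(1), post(2), g);
    end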

 

  1. Arrive at an optimal threshold (Th1) that separates classes w1 and w2 (theoretically or experimentally), and justify your result. (A threshold-search sketch follows this list.)
  2. Suggest how Th1 would be affected if a higher penalty is associated with classifying class w2 as class w1; show this with an experiment.
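
For the first item, one experimental way to arrive at Th1, assuming the classify_feature sketch above, is to sweep the sepal-width axis and pick the point where g(x2) changes sign, i.e. where the two posteriors (and, with equal priors, the two class-conditional densities) are equal. The search range and step are illustrative:

    % Hypothetical threshold search: find where g(x2) changes sign
    xs = linspace(2.0, 4.5, 5000);            % sweep over plausible sepal widths
    gs = zeros(size(xs));
    for k = 1:length(xs)
        [~, gs(k)] = classify_feature(xs(k), sw_train1, sw_train2);
    end
    idx = find(diff(sign(gs)) ~= 0, 1);       % first sign change of g(x2)
    Th1 = xs(idx);                            % optimal threshold under zero-one loss
    fprintf('Optimal threshold Th1 = %.3f\n', Th1);

Theoretically, the same point can be obtained by solving p(x2|ω1)P(ω1) = p(x2|ω2)P(ω2) for x2, which with equal priors and Gaussian class-conditional densities reduces to a quadratic equation in x2.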

 

The effect of a higher penalty for classifying class w2 as class w1 is captured by the Bayes risk formulation, which uses a loss function to describe the penalties for misclassification. We assume that the penalty for misclassifying a class-2 sample as class 1 is greater than the penalty for misclassifying a class-1 sample as class 2.

The minimum-risk (Bayes risk) decision rule is: decide ω1 if

    \frac{p(x \mid \omega_1)}{p(x \mid \omega_2)} > \frac{\lambda_{12} - \lambda_{22}}{\lambda_{21} - \lambda_{11}} \cdot \frac{P(\omega_2)}{P(\omega_1)}

where λ21 is the penalty associated with misclassifying class 1 as class 2, and λ12 is the penalty associated with misclassifying class 2 as class 1. λ11 and λ22 are assumed to be 0 because there is no penalty for correct classification.

In this case, we assume that λ12 > λ21. Since both priors P(ω1) and P(ω2) are 0.5, the factor P(ω2)/P(ω1) equals 1 and can be neglected. Under the zero-one loss, the optimal threshold lies where the class-conditional densities of the two classes are equal, i.e. where the likelihood ratio equals 1. With the unequal penalties, the threshold is instead defined by

    \frac{p(x \mid \omega_1)}{p(x \mid \omega_2)} = \frac{\lambda_{12}}{\lambda_{21}}

And since λ12 > λ21, the right-hand side is greater than 1; equivalently,

    \frac{p(x \mid \omega_2)}{p(x \mid \omega_1)} < 1

Finally, we can conclude that at the new threshold

    p(x \mid \omega_2) < p(x \mid \omega_1)

So the threshold moves toward the side where p(x|ω1) is larger, i.e. toward larger sepal-width values: the region assigned to w1 shrinks, and the higher penalty for labelling class w2 samples as w1 makes the classifier more reluctant to decide w1 (Iris Setosa).
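
A small experiment consistent with this reasoning is to recompute the decision boundary from the likelihood ratio with λ12 > λ21 and compare it with Th1. The loss values below are illustrative, and the densities are the same Gaussian estimates assumed earlier; under these assumptions the new boundary should come out on the w1 side of Th1 (at a larger sepal width), i.e. the region assigned to w1 shrinks.

    % Hypothetical experiment: threshold shift under unequal losses.
    % Decide w1 where p(x|w1)/p(x|w2) > lambda12/lambda21 (equal priors, zero diagonal losses).
    lambda12 = 2;  lambda21 = 1;                  % higher penalty for labelling a w2 sample as w1
    mu1 = mean(sw_train1);  var1 = var(sw_train1);
    mu2 = mean(sw_train2);  var2 = var(sw_train2);

    xs = linspace(2.0, 4.5, 5000);
    p1 = exp(-(xs - mu1).^2 / (2*var1)) / sqrt(2*pi*var1);
    p2 = exp(-(xs - mu2).^2 / (2*var2)) / sqrt(2*pi*var2);

    % Assuming the likelihood ratio is monotone over this range,
    % the first point where it exceeds lambda12/lambda21 is the new boundary.
    idx = find(p1 ./ p2 > lambda12/lambda21, 1);
    Th1_risk = xs(idx);
    fprintf('Threshold with lambda12 > lambda21: %.3f\n', Th1_risk);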

 

 

  1. Adjust your program to accept Sepal Length as the discriminating feature g(x1). Suggest which of the two features (x1, x2) might be a better choice for separating the two classes w1 and w2. Justify.

Bayes decision theory involves computing the posterior probabilities, and the decision about which class an unknown sample belongs to is based on which posterior probability is highest. However, we also need to consider the error rate: the conditional probability of error equals the sum of the posterior probabilities of the classes we did not choose. For a two-class dichotomizer, if we choose ω1, the error probability is P(ω2|x); for example, for x1 = 5.0 in Table 2 below, choosing ω1 leaves P(error|x) = P(ω2|x) = 0.2205.

Table 2: Classification of the test samples (label, posterior probabilities, and discriminant function value) using sepal length (x1)

x1     Class   Posterior probabilities [P(w1|x), P(w2|x)]   Discriminant function g(x1)
3.3    w1      [0.7204, 0.2796]                             0.4408
4.4    w1      [0.9288, 0.0712]                             0.8576
5.0    w1      [0.7795, 0.2205]                             0.5589
5.7    w2      [0.0984, 0.9016]                             -0.8032
6.3    w2      [0.0010, 0.9990]                             -0.9979

 

Sepal length is the better choice for separating the two classes. The sepal-length values span a wider range than the sepal-width values, and, more importantly, the two class means are farther apart relative to the within-class spread along sepal length, so the class-conditional densities overlap less when the samples are classified by length. Since the minimum-error-rate classifier's error comes from this overlap, sepal length yields the better accuracy.
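
One way to back this claim up quantitatively, assuming the classify_feature sketch above, is to compare the resubstitution (training) error obtained when each feature is used on its own. The vectors sl_train1, sl_train2 (sepal length) and sw_train1, sw_train2 (sepal width) are hypothetical names for the per-class training columns:

    % Hypothetical comparison of the two features by resubstitution error
    feats  = {'sepal length (x1)', 'sepal width (x2)'};
    train1 = {sl_train1, sw_train1};              % class w1 samples per feature
    train2 = {sl_train2, sw_train2};              % class w2 samples per feature

    for f = 1:2
        errs = 0;  total = 0;
        for x = train1{f}(:).'                    % w1 samples are misclassified when g < 0
            [~, g] = classify_feature(x, train1{f}, train2{f});
            errs = errs + (g < 0);  total = total + 1;
        end
        for x = train2{f}(:).'                    % w2 samples are misclassified when g > 0
            [~, g] = classify_feature(x, train1{f}, train2{f});
            errs = errs + (g > 0);  total = total + 1;
        end
        fprintf('%s: resubstitution error = %.1f%%\n', feats{f}, 100*errs/total);
    end

The feature with the lower error is the better single-feature choice for separating w1 and w2.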
