intTypePromotion=1
zunia.vn Tuyển sinh 2024 dành cho Gen-Z zunia.vn zunia.vn
ADSENSE

Lecture Applied data science: Classification

Chia sẻ: _ _ | Ngày: | Loại File: PDF | Số trang:18

6
lượt xem
3
download
 
  Download Vui lòng tải xuống để xem tài liệu đầy đủ

Lecture "Applied data science: Classification" includes content: Classification - logistic regression review ; classification evaluation metrics; the expected value framework;... We invite you to consult!

Chủ đề:
Lưu

Nội dung Text: Lecture Applied data science: Classification

  1. Classification
  2. Overview 1. Introduction 8. Validation 2. Application 9. Regularisation 3. EDA 10. Clustering 4. Learning Process 11. Evaluation 5. Bias-Variance Tradeoff 12. Deployment 6. Regression (review) 13. Ethics 7. Classification
  3. Lecture outline - Classification - Logistic regression review - Classification evaluation metrics - The expected value framework
  4. Classification problems Response is categorical, e.g. credit card default (Yes/No), favourite movie types (Action/Drama/Animation) Exemplary techniques - logistic regression, classification tree, K-NN, etc.
  5. Logistic regression formulation Logistic regression coefficients are estimated by maximising the likelihood function
  6. Logistic regression example responding Yes No student_Yes 127 2817 student_No 206 6850 Total 333 9667
  7. Training set responding Yes No student_Yes 84 1959 student_No 150 4808 Total 234 6767 Test set responding Yes No student_Yes 43 858 student_No 56 2042 Total 99 2900
  8. Logistic regression results
  9. Logistic regression results interpretation
  10. Prediction from multiple classifiers
  11. The ROC curve
  12. The ROC curve Each point corresponds to a confusion matrix Point A is more ‘conservative’ than B, which is more ‘conservative’ than C Points that are closer to the upper left are preferred. Point (0,1) represents the perfect classifier Points along the diagonal represent random guessing - no classifiers should be in the lower right
  13. The ROC curves from different classifiers
  14. p n Predicted Yes 46 12 Predicted No 53 2888
  15. The expected value analytical framework The targeted marketing example. Assume that we sell the product for $200, production related cost is $100 and shipping and handling cost is $1. What would be the minimum probability of responding we should target.
  16. Expected value of a classifier
  17. Expected value of a classifier From the above example, let’s use 0.35 as the threshold and assume the matrix of cost/benefit information is as below. What would be total expected value of the logistic regression classifier per customer? Actual Yes Actual No Predicted Yes $99 $-1 Predicted No $0 $0
  18. The profit curves Actual Yes Actual No Actual Yes Actual No Predicted Yes $99 $-1 Predicted Yes $99 $-10 Predicted No $0 $0 Predicted No $0 $0
ADSENSE

CÓ THỂ BẠN MUỐN DOWNLOAD

 

Đồng bộ tài khoản
4=>1