Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. ·...

39
Boosting, Bagging, etc. Based mostly on notes by Greg Ridgeway David Madigan

Transcript of Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. ·...

Page 1: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

Boosting, Bagging, etc.

Based mostly on notes by Greg Ridgeway

David Madigan

Page 2: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

ε ε

Page 3: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

ε

Page 4: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

ε

Page 5: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 6: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 7: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 8: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 9: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 10: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 11: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 12: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

Random Forests“

Page 13: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 14: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 15: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

> spam7.rf <- randomForest(yesno ~ ., data=spam7, importance=TRUE)> print(spam7.rf)

Call: randomForest(formula = yesno ~ ., data = spam7, importance = TRUE) Type of random forest: classification Number of trees: 500No. of variables tried at each split: 2

OOB estimate of error rate: 11.89%Confusion matrix: n y class.errorn 2642 146 0.05236729y 401 1412 0.22118036>

in R: library(randomForest)

Page 16: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

> tuneRF(x=spam7[,-7], y=spam7$yesno, trace=FALSE)-0.0582878 0.05 -0.01821494 0.05 mtry OOBError1 1 0.12627692 2 0.11932194 4 0.1214953>

> importance(spam7.rf) n y MeanDecreaseAccuracy MeanDecreaseGinicrl.tot 0.6355890 0.8020131 0.5177274 241.5807dollar 0.6494899 0.7958824 0.5210265 437.7034bang 0.6810718 0.8493533 0.5349533 575.5270money 0.5864884 0.7986897 0.5048771 211.2677n000 0.6500864 0.4718939 0.5107114 123.9895make 0.3840864 0.5809521 0.4117850 41.0645

Page 17: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 18: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 19: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 20: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 21: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 22: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

Andreas Buja’s home page:

Page 23: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 24: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 25: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 26: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 27: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 28: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 29: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 30: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 31: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 32: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 33: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 34: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 35: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 36: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 37: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 38: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning
Page 39: Boosting, Bagging, etc. - Columbia Universitymadigan/G6101/notes/boostbag.pdf · 2007. 12. 9. · Bootstrap aggregation ("bagging") is a device for reducing the variance of learning

http://www.cs.princeton.edu/~schapire/boost.html