{"@context":"http://iiif.io/api/presentation/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/manifest.json","@type":"sc:Manifest","label":"Multi-Class ROC Random Forest for Imbalanced Classification","metadata":[{"label":"dc.description.sponsorship","value":"This work is sponsored by the Stony Brook University Graduate School in compliance with the requirements for completion of degree."},{"label":"dc.format","value":"Monograph"},{"label":"dc.format.medium","value":"Electronic Resource"},{"label":"dc.identifier.uri","value":"http://hdl.handle.net/11401/77355"},{"label":"dc.language.iso","value":"en_US"},{"label":"dc.publisher","value":"The Graduate School, Stony Brook University: Stony Brook, NY."},{"label":"dcterms.abstract","value":"The imbalanced class problem in classification is highly relevant in many realistic scenarios such as the detection of a rare condition. One solution is to design specific algorithms incorporating the unbalanced classes in the training process of a classifier. In this dissertation, we propose a novel multi-class classification tree based on the area under the ROC curve (AUC) to resolve the imbalanced classification problem. This tree classifier aims to maximize the sum of AUC for all one versus all classifiers at the node attribute selection stage while balancing the performance of sensitivity and specificity of all one versus all classification at the node threshold selection stage. The ROC tree is extended to ROC random forest with suitable modifications. Furthermore, the volume under surface (VUS), the extension of AUC for multi-class classification, is discussed in this dissertation as well and used to measure the performance of classifiers. The simulation results show that this multi-class ROC tree/forest method is superior to the classic CART/random forest on severely imbalanced multi-class classification problems, while the ROC random forest performs equally well as the SMOTE random forest on imbalanced binary classification problems. The application on Boston housing data shows that the ROC random forest can also be used for model ensemble and it performs better than all the base models and other ensemble methods in this application."},{"label":"dcterms.available","value":"2017-09-20T16:52:33Z"},{"label":"dcterms.contributor","value":"Zhu, Wei"},{"label":"dcterms.creator","value":"Yan, Jiaju"},{"label":"dcterms.dateAccepted","value":"2017-09-20T16:52:33Z"},{"label":"dcterms.dateSubmitted","value":"2017-09-20T16:52:33Z"},{"label":"dcterms.description","value":"Department of Applied Mathematics and Statistics"},{"label":"dcterms.extent","value":"100 pg."},{"label":"dcterms.format","value":"Application/PDF"},{"label":"dcterms.identifier","value":"http://hdl.handle.net/11401/77355"},{"label":"dcterms.issued","value":"2017-05-01"},{"label":"dcterms.language","value":"en_US"},{"label":"dcterms.provenance","value":"Made available in DSpace on 2017-09-20T16:52:33Z (GMT). No. of bitstreams: 1\nYan_grad.sunysb_0771E_13287.pdf: 1218593 bytes, checksum: 33979f0ce95cd72b3427f233cb15d5c6 (MD5)\n Previous issue date: 1"},{"label":"dcterms.publisher","value":"The Graduate School, Stony Brook University: Stony Brook, NY."},{"label":"dcterms.subject","value":"Statistics"},{"label":"dcterms.title","value":"Multi-Class ROC Random Forest for Imbalanced Classification"},{"label":"dcterms.type","value":"Dissertation"},{"label":"dc.type","value":"Dissertation"}],"description":"This manifest was generated dynamically","viewingDirection":"left-to-right","sequences":[{"@type":"sc:Sequence","canvases":[{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json","@type":"sc:Canvas","label":"Page 1","height":1650,"width":1275,"images":[{"@type":"oa:Annotation","motivation":"sc:painting","resource":{"@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/13%2F92%2F35%2F139235709352834860899467369424808589036/full/full/0/default.jpg","@type":"dctypes:Image","format":"image/jpeg","height":1650,"width":1275,"service":{"@context":"http://iiif.io/api/image/2/context.json","@id":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/13%2F92%2F35%2F139235709352834860899467369424808589036","profile":"http://iiif.io/api/image/2/level2.json"}},"on":"https://repo.library.stonybrook.edu/cantaloupe/iiif/2/canvas/page-1.json"}]}]}]}