Full metadata
Title
Learning from asymmetric models and matched pairs
Description
With the increase in computing power and availability of data, there has never been a greater need to understand data and make decisions from it. Traditional statistical techniques may not be adequate to handle the size of today's data or the complexities of the information hidden within the data. Thus knowledge discovery by machine learning techniques is necessary if we want to better understand information from data. In this dissertation, we explore the topics of asymmetric loss and asymmetric data in machine learning and propose new algorithms as solutions to some of the problems in these topics. We also studied variable selection of matched data sets and proposed a solution when there is non-linearity in the matched data. The research is divided into three parts. The first part addresses the problem of asymmetric loss. A proposed asymmetric support vector machine (aSVM) is used to predict specific classes with high accuracy. aSVM was shown to produce higher precision than a regular SVM. The second part addresses asymmetric data sets where variables are only predictive for a subset of the predictor classes. Asymmetric Random Forest (ARF) was proposed to detect these kinds of variables. The third part explores variable selection for matched data sets. Matched Random Forest (MRF) was proposed to find variables that are able to distinguish case and control without the restrictions that exists in linear models. MRF detects variables that are able to distinguish case and control even in the presence of interaction and qualitative variables.
Date Created
2013
Contributors
- Koh, Derek (Author)
- Runger, George C. (Thesis advisor)
- Wu, Tong (Committee member)
- Pan, Rong (Committee member)
- Cesta, John (Committee member)
- Arizona State University (Publisher)
Topical Subject
Resource Type
Extent
xv, 137 p. : ill. (some col.)
Language
eng
Copyright Statement
In Copyright
Primary Member of
Peer-reviewed
No
Open Access
No
Handle
https://hdl.handle.net/2286/R.I.16445
Statement of Responsibility
by Derek Koh
Description Source
Retrieved on Oct. 25, 2013
Level of coding
full
Note
thesis
Partial requirement for: Ph.D., Arizona State University, 2013
bibliography
Includes bibliographical references (p. 130-137)
Field of study: Industrial engineering
System Created
- 2013-03-25 02:14:55
System Modified
- 2021-08-30 01:43:08
- 3 years 2 months ago
Additional Formats