Full metadata

Title

Characterizing the Performance of Machine Learning Algorithms: A Study and Novel Techniques

Description

Classification in machine learning is quite crucial to solve many problems that the world is presented with today. Therefore, it is key to understand one’s problem and develop an efficient model to achieve a solution. One technique to achieve greater model selection and thus further ease in problem solving is estimation of the Bayes Error Rate. This paper provides the development and analysis of two methods used to estimate the Bayes Error Rate on a given set of data to evaluate performance. The first method takes a “global” approach, looking at the data as a whole, and the second is more “local”—partitioning the data at the outset and then building up to a Bayes Error Estimation of the whole. It is found that one of the methods provides an accurate estimation of the true Bayes Error Rate when the dataset is at high dimension, while the other method provides accurate estimation at large sample size. This second conclusion, in particular, can have significant ramifications on “big data” problems, as one would be able to clarify the distribution with an accurate estimation of the Bayes Error Rate by using this method.

Date Created

2021-12

Contributors

Lattus, Robert (Author)
Dasarathy, Gautam (Thesis director)
Berisha, Visar (Committee member)
Turaga, Pavan (Committee member)
Barrett, The Honors College (Contributor)
Electrical Engineering Program (Contributor)

Topical Subject

Resource Type

Text

Extent

19 pages

Copyright Statement

In Copyright

Reuse Permissions

Attribution-NonCommercial-ShareAlike

Primary Member of

Barrett, The Honors College Thesis/Creative Project Collection

Peer-reviewed

No

Open Access

No

Series

Academic Year 2021-2022

Handle

https://hdl.handle.net/2286/R.2.N.161220

System Created

2021-11-12 10:58:42

System Modified

2022-01-28 05:14:47
2 years 9 months ago

Additional Formats