Full metadata
Title
Industrial applications of data mining: engineering effort forecasting based on mining and analysis of patterns in historical project execution data
Description
Data mining is increasing in importance in solving a variety of industry problems. Our initiative involves the estimation of resource requirements by skill set for future projects by mining and analyzing actual resource consumption data from past projects in the semiconductor industry. To achieve this goal we face difficulties like data with relevant consumption information but stored in different format and insufficient data about project attributes to interpret consumption data. Our first goal is to clean the historical data and organize it into meaningful structures for analysis. Once the preprocessing on data is completed, different data mining techniques like clustering is applied to find projects which involve resources of similar skillsets and which involve similar complexities and size. This results in "resource utilization templates" for groups of related projects from a resource consumption perspective. Then project characteristics are identified which generate this diversity in headcounts and skillsets. These characteristics are not currently contained in the data base and are elicited from the managers of historical projects. This represents an opportunity to improve the usefulness of the data collection system for the future. The ultimate goal is to match the product technical features with the resource requirement for projects in the past as a model to forecast resource requirements by skill set for future projects. The forecasting model is developed using linear regression with cross validation of the training data as the past project execution are relatively few in number. Acceptable levels of forecast accuracy are achieved relative to human experts' results and the tool is applied to forecast some future projects' resource demand.
Date Created
2013
Contributors
- Bhattacharya, Indrani (Author)
- Sen, Arunabha (Thesis advisor)
- Kempf, Karl G. (Thesis advisor)
- Liu, Huan (Committee member)
- Arizona State University (Publisher)
Topical Subject
Resource Type
Extent
vii, 72 p. : col. ill
Language
eng
Copyright Statement
In Copyright
Primary Member of
Peer-reviewed
No
Open Access
No
Handle
https://hdl.handle.net/2286/R.I.16452
Statement of Responsibility
by Indrani Bhattacharya
Level of coding
full
Note
thesis
Partial requirement for: M.S., Arizona State University, 2013
bibliography
Includes bibliographical references (p. 71-72)
Field of study: Computer science
System Created
- 2013-03-25 02:23:46
System Modified
- 2021-08-30 01:43:08
- 3 years 2 months ago
Additional Formats