Use of Machine Learning and Data Science on Infrastructure Transportation and Construction Projects

168518-Thumbnail Image.png
Description
By the evolution of technologies and computing power, it is possible to capture and save large amounts of data and then find patterns in large and complex datasets using data science and machine learning. This dissertation introduces machine-learning models and

By the evolution of technologies and computing power, it is possible to capture and save large amounts of data and then find patterns in large and complex datasets using data science and machine learning. This dissertation introduces machine-learning models and econometric models to use in infrastructure transportation projects. Among transportation infrastructure projects, the airline industry and highways are selected to implement the models.The first topic of this dissertation focuses on using machine-learning models in highway projects. The International Roughness Index (IRI) for asphalt concrete pavement is predicted based on the 12,637 observations in the Long-Term Pavement Performance (LTPP) dataset for 1,390 roads and highways in the 50 states of the United States and the District of Columbia from 1989 to 2018. The results show that XGBoost provides a better model fit in terms of mean absolute error and coefficient of determination than other studied models. Also, the most important factors in predicting the IRI are identified. The second topic of this dissertation aims to develop machine-learning models to predict customer dissatisfaction in the airline industry. The relationship between measures of service failure (flight delay and mishandled baggage) and customer dissatisfaction is predicted by using longitudinal data from 2003 to 2019 from the U.S. airline industry. Data was obtained from the Air Travel Consumer Report (ATCR) published by the U.S. Department of Transportation. Flight delay is more important in low-cost airlines, while mishandled baggage is more important in legacy airlines. Also, the effect of the train-test split ratio on each machine-learning model is examined by running each model using four train-test splits. Results indicate that the train-test split ratio could influence the selection of the best model. The third topic in this dissertation uses econometric analysis to investigate the relationship between customer dissatisfaction and two measures of service failure in the U.S. airline industry. Results are: 1) Mishandled baggage has more impact than flight delay on customer complaints. 2) The effect of an airline’s service failures on customer complaints is contingent on the category of the airline. 3) The effect of flight delay on customer complaints is lower for low-cost airlines compared to legacy airlines.
Date Created
2021
Agent

Assessing Experiential Learning in Construction Education by Modeling Student Performance

157858-Thumbnail Image.png
Description
The typical engineering curriculum has become less effective in training construction professionals because of the evolving construction industry needs. The latest National Science Foundation and the National Academies report indicate that industry-valued skills are changing. The Associated General Contractors of

The typical engineering curriculum has become less effective in training construction professionals because of the evolving construction industry needs. The latest National Science Foundation and the National Academies report indicate that industry-valued skills are changing. The Associated General Contractors of America recently stated that contractors expect growth in all sectors; however, companies are worried about the supply of skilled professionals. Workforce development has been of a growing interest in the construction industry, and this study approaches it by conducting an exploratory analysis applied to students that have completed a mandatory internship as part of their construction program at Arizona State University, in the School of Sustainable Engineering and the Built Environment. Data is collected from surveys, including grades by a direct evaluator from the company reflecting each student’s performance based on recent Student Learning Objectives. Preliminary correlations are computed between scores received on the 15 metrics in the survey and the final industry suggested grade. Based on the factors identified as highest predictors: ingenuity and creativity, punctuality and attendance, and initiative; a prognostic model of student performance in the construction industry is generated. With regard to graduate employability, student performance in the industry and human predispositions are also tested in order to evaluate their contribution to the generated model. The study finally identifies threats to validity and opportunities presented in a dynamic learning environment presented by internships. Results indicate that measuring student performance during internships in the construction industry creates challenges for the evaluator from the host company. Scoring definitions are introduced to standardize the evaluators’ grading based on observations of student behavior. 12 questions covering more Student Learning Objectives identified by the industry are added to the survey, potentially improving the reliability of the predictive model.
Date Created
2019
Agent