Missing Data in Conditional Inference Trees

Description

Decision trees are a machine learning technique that searches the predictor space for the variable and observed value whose split of the data into two nodes yields the best prediction. Conditional Inference Trees (CTREEs) are a non-parametric class of decision trees that use statistical theory to select splitting variables. Missing data are problematic in decision trees because an observation with a missing value on the chosen splitting variable cannot be placed into a node; for the same reason, missing data can also distort the variable selection process. Simple missing data approaches (e.g., deletion, majority rule, and surrogate splits) have been implemented in decision tree algorithms, but more sophisticated missing data techniques have not been thoroughly examined. In addition to these approaches, this dissertation proposed a modified multiple imputation approach to handling missing data in CTREEs. A simulation compared this approach with the simple approaches as well as with single imputation and multiple imputation with prediction averaging. Results revealed that the simple approaches (i.e., majority rule, treating missing values as their own category, and listwise deletion) were effective in handling missing data in CTREEs. The modified multiple imputation approach did not outperform the simple approaches in most conditions, but it appeared best suited to small sample sizes and extreme missingness.
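The abstract does not spell out the modified multiple imputation procedure, so the sketch below only illustrates the generic comparison condition it mentions: impute the predictors several times, fit a tree to each completed dataset, and average the predicted probabilities. A CART-style scikit-learn tree stands in for a CTREE, and the column-wise random-draw imputation, variable names, and settings are illustrative assumptions, not the dissertation's method.

```python
# Minimal sketch (assumed, not the dissertation's procedure): multiple
# imputation with prediction averaging for a tree classifier. A CART-style
# scikit-learn tree stands in for a CTREE, and the column-wise random-draw
# imputation is a placeholder for a real imputation model.
import numpy as np
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)

# Simulate predictors and a binary outcome, then punch ~20% holes in X.
n = 500
X = rng.normal(size=(n, 3))
y = (X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=n) > 0).astype(int)
X[rng.random(X.shape) < 0.2] = np.nan

def impute_once(X, rng):
    """Fill each missing value with a random draw from the observed values
    of its own column (a naive stand-in for a proper imputation model)."""
    Xc = X.copy()
    for j in range(Xc.shape[1]):
        miss = np.isnan(Xc[:, j])
        Xc[miss, j] = rng.choice(Xc[~miss, j], size=miss.sum(), replace=True)
    return Xc

m = 20                              # number of imputations
probs = np.zeros(n)
for _ in range(m):
    Xc = impute_once(X, rng)
    tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(Xc, y)
    probs += tree.predict_proba(Xc)[:, 1]
probs /= m                          # pooled (averaged) predicted probabilities

print("pooled training accuracy:", np.mean((probs > 0.5) == y))
```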
Date Created
2023

Evaluating the Performance of the LI3P in Latent Profile Analysis Models

Description

Latent profile analysis (LPA), a type of finite mixture model, has grown in popularity due to its ability to detect latent classes or unobserved subgroups within a sample. Though numerous methods exist to determine the correct number of classes, past research has repeatedly demonstrated that no one method is consistently the best, as each tends to struggle under specific conditions. Recently, the likelihood incremental percentage per parameter (LI3P), a method using a new approach, was proposed and tested, yielding promising initial results. To evaluate this new method more thoroughly, this study simulated 50,000 datasets, manipulating factors such as sample size, class distance, number of items, and number of classes. After the performance of the LI3P was evaluated on the simulated data, the method was applied to LPA models fit to an empirical dataset to illustrate its use. Results indicate the LI3P performs in line with standard class enumeration techniques and primarily reflects class separation and the number of classes.
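The abstract does not define the LI3P, so the quantity computed below, the percentage change in log-likelihood per additional parameter when a class is added, is only one plausible reading of the name, shown next to a standard BIC comparison. The Gaussian mixture with diagonal covariances is used as an LPA analog; all settings and variable names are illustrative assumptions.

```python
# Sketch of class enumeration for an LPA-like model (assumed setup).
# The "LI3P-like" quantity is an assumed reading of the method's name,
# NOT its published definition.
import numpy as np
from sklearn.mixture import GaussianMixture

rng = np.random.default_rng(5)
n, d = 600, 4
labels = rng.binomial(1, 0.5, size=n)                    # two latent classes
X = rng.normal(loc=labels[:, None] * 1.0, scale=1.0, size=(n, d))

prev_ll, prev_params = None, None
for k in range(1, 5):
    gm = GaussianMixture(n_components=k, covariance_type="diag",
                         n_init=5, random_state=0).fit(X)
    ll = gm.score(X) * n                                  # total log-likelihood
    n_params = (k - 1) + k * d + k * d                    # weights + means + variances
    line = f"k={k}  logLik={ll:.1f}  BIC={gm.bic(X):.1f}"
    if prev_ll is not None:
        # Assumed reading: percentage gain in log-likelihood per added parameter.
        li3p_like = 100 * (ll - prev_ll) / abs(prev_ll) / (n_params - prev_params)
        line += f"  LI3P-like={li3p_like:.3f}"
    print(line)
    prev_ll, prev_params = ll, n_params
```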
Date Created
2022

Evaluating When Subscores Can Have Value in Psychological and Health Applications

Description

Scale scores play a significant role in research and practice in a wide range of areas such as education, psychology, and health sciences. Although the methods of scale scoring have advanced considerably over the last 100 years, researchers and practitioners have generally been slow to implement these advances. Many topics fall under this umbrella, but the current study focuses on two. The first is the relationship between subscores and total scores. Many scales in psychological and health research are designed to yield subscores, yet it is common to see total scores reported instead. Simplifying scores in this way, however, may have important implications for researchers and scale users in terms of interpretation and use. The second topic is subscore augmentation: if there are subscores, how much value is there in using a subscore augmentation method? Most people using psychological assessments are unfamiliar with score augmentation techniques and the potential benefits they may have over the traditional sum score approach. The current study borrows methods from education to explore the magnitude of improvement of augmented scores over observed scores. Data were simulated using the Graded Response Model. Factors controlled in the simulation were the number of subscales, number of items per subscale, level of correlation between subscales, and sample size. Four estimates of the true subscore were considered (raw, subscore-adjusted, total score-adjusted, and joint score-adjusted). Results from the simulation suggest that the score adjusted with total score information may perform poorly when the inter-subscore correlation is 0.3. Joint scores performed well most of the time, and the subscore-adjusted and joint-adjusted scores always outperformed raw scores. Finally, general advice for applied users is provided.
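The abstract borrows augmentation methods from education without giving formulas, so the sketch below shows only a Kelley-type regressed ("subscore-adjusted") estimate and a simple total-score regression estimate, evaluated against known true subscores in a toy simulation. The reliabilities, regression weights, and variable names are assumptions for illustration and do not reproduce the study's estimators or its Graded Response Model data generation.

```python
# Toy illustration (assumed formulas, not the study's estimators): a Kelley-type
# "subscore-adjusted" estimate and a simple "total score-adjusted" regression
# estimate, compared with the raw observed subscore against known true scores.
import numpy as np

rng = np.random.default_rng(1)
n = 2000

# Two correlated true subscores; observed scores add measurement error.
true_sub = rng.normal(size=n)
true_other = 0.6 * true_sub + np.sqrt(1 - 0.6 ** 2) * rng.normal(size=n)
obs_sub = true_sub + rng.normal(scale=0.7, size=n)
obs_total = (true_sub + true_other) + rng.normal(scale=0.7, size=n)

# (1) Subscore-adjusted (Kelley): shrink the observed subscore toward its mean
#     by its reliability (known here because this is a simulation).
rel_sub = np.var(true_sub) / np.var(obs_sub)
kelley = obs_sub.mean() + rel_sub * (obs_sub - obs_sub.mean())

# (2) Total score-adjusted: least-squares prediction of the true subscore from
#     the observed total (the weight is computable here only because the true
#     scores are known; applied work would estimate it from reliabilities).
b = np.cov(true_sub, obs_total)[0, 1] / np.var(obs_total)
total_adj = true_sub.mean() + b * (obs_total - obs_total.mean())

for name, est in [("raw", obs_sub), ("subscore-adjusted", kelley),
                  ("total score-adjusted", total_adj)]:
    rmse = np.sqrt(np.mean((est - true_sub) ** 2))
    print(f"{name:>21s} RMSE: {rmse:.3f}")
```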
Date Created
2022

Introduction, Estimation, and Potential Solutions to Collider Bias

Description

Collider effects pose a major problem in psychological research. Colliders are third variables that bias the relationship between an independent and dependent variable when (1) the composition of a research sample is restricted by the scores on a collider variable or (2) researchers adjust for a collider variable in their statistical analyses. Both cases interfere with the accuracy and generalizability of statistical results. Despite their importance, collider effects remain relatively unknown in the social sciences. This research introduces both the conceptual and the mathematical foundation for collider effects and demonstrates how to calculate a collider effect and test it for statistical significance. Simulation studies examined the efficiency and accuracy of the collider estimation methods and tested the viability of Thorndike’s Case III equation as a potential solution to correcting for collider bias in cases of biased sample selection.
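A small simulation makes the sample-selection form of collider bias concrete: two variables generated to be independent become correlated once the sample is restricted on a variable they both cause. Everything below (variable names, effect sizes, the 75th-percentile selection rule) is an illustrative assumption, not the study's design.

```python
# Illustrative simulation (assumed setup): x and y are independent by
# construction, but restricting the sample on a collider c that both cause
# induces a spurious (negative) x-y correlation.
import numpy as np

rng = np.random.default_rng(42)
n = 100_000
x = rng.normal(size=n)
y = rng.normal(size=n)                 # unrelated to x in the full population
c = x + y + rng.normal(size=n)         # collider: caused by both x and y

print("full-sample r(x, y):      ", round(np.corrcoef(x, y)[0, 1], 3))

selected = c > np.quantile(c, 0.75)    # sample restricted on the collider
print("collider-selected r(x, y):", round(np.corrcoef(x[selected], y[selected])[0, 1], 3))
```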
Date Created
2021

Evaluating Person-Oriented Methods for Mediation

Description

Statistical inference from mediation analysis applies to populations; however, researchers and clinicians may be interested in making inferences about individual clients or small, localized groups of people. Person-oriented approaches focus on the differences between people, or latent groups of people, asking how individuals differ across variables, and can help researchers avoid ecological fallacies when making inferences about individuals. Traditional variable-oriented mediation assumes the population undergoes a homogeneous reaction to the mediating process. However, mediation is also described as an intra-individual process in which each person passes from a predictor, through a mediator, to an outcome (Collins, Graham, & Flaherty, 1998). Configural frequency mediation (CFM) is a person-oriented analysis of contingency tables that has not been well studied or implemented since its introduction in the literature (von Eye, Mair, & Mun, 2010; von Eye, Mun, & Mair, 2009). The purpose of this study is to describe CFM and investigate its statistical properties while comparing it to traditional and causal inference mediation methods. The results show that the joint significance mediation test yields better Type I error rates but limits the person-oriented interpretation of CFM. Although the estimators for logistic regression and causal mediation differ, both perform well in terms of Type I error and power, although the causal estimator had higher bias than expected, which is discussed in the limitations section.
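For readers unfamiliar with the joint significance test mentioned in the results, the sketch below shows its basic logic on simulated data: declare mediation only if the a-path (x to m) and the b-path (m to y, controlling for x) are each significant. The use of statsmodels, the logistic b-path model, and all variable names and effect sizes are illustrative assumptions, not the study's models.

```python
# Hypothetical sketch of the joint significance test: mediation is declared
# only if the a-path (x -> m) and the b-path (m -> y, controlling for x) are
# each significant. Variable names, effect sizes, and the statsmodels models
# are illustrative assumptions.
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(7)
n = 300
x = rng.normal(size=n)
m = 0.4 * x + rng.normal(size=n)                     # a-path
y = (0.5 * m + rng.normal(size=n) > 0).astype(int)   # binary outcome via b-path

a_fit = sm.OLS(m, sm.add_constant(x)).fit()                                 # a-path
b_fit = sm.Logit(y, sm.add_constant(np.column_stack([m, x]))).fit(disp=0)  # b-path
p_a, p_b = a_fit.pvalues[1], b_fit.pvalues[1]

mediated = (p_a < 0.05) and (p_b < 0.05)
print(f"p_a = {p_a:.4f}, p_b = {p_b:.4f}, joint significance -> mediation: {mediated}")
```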
Date Created
2019

Middle Eastern and North African (MENA) American youth reports of their parenting experiences: associations with mental and physical health

Description

Scant research examines the associations between parenting behaviors and the psychological health of Middle Eastern and North African (MENA) American youth. Developmental research consistently demonstrates that an authoritarian parenting style (often characterized by rejecting and controlling behaviors, and a common style among MENA parents) is maladaptive for offspring health; however, no study has empirically tested the associations of these behaviors from mothers and fathers with the health of MENA American youth. Using survey data from 314 MENA American young adults (mean age = 20 years, range 18 – 25 years, 56% female), the current study tested the associations of commonly studied parenting behaviors (acceptance, rejection, harsh parenting, and control) with the mental health (stress, depression, and anxiety) and physical health (general health perceptions, pain, and somatization) of MENA American youth. Confirmatory factor analysis tested new items, informed by preliminary focus groups, alongside original items from the Child Report Parenting Behavior Inventory (CRPBI) to create culturally informed parenting factors. Results indicated that higher youth-reported maternal acceptance was associated with fewer mental health symptoms, higher maternal harsh parenting with more mental health symptoms, and higher maternal rejection with worse physical health; father rejection was associated with more mental health symptoms and worse physical health. Further, the associations between parenting and physical health were moderated by youth Arabic orientation, such that those with higher Arabic orientation showed the best physical health at higher levels of acceptance, and the worst physical health at higher levels of rejection, harsh parenting, and control. Associations between parenting and health did not differ by youth gender. The current findings suggest cross-cultural similarities in the beneficial functions of parental acceptance, and the detrimental functions of parental rejection and harsh parenting, among MENA American youth. The associations between parenting and health were amplified, for better or for worse, among more Arabic-oriented youth, suggesting these youth may be more greatly impacted by perceptions of their parents' behaviors. Findings have implications for family interventions working with MENA populations.
Date Created
2019

Evaluation of five effect size measures of measurement non-invariance for continuous outcomes

Description

To make meaningful comparisons on a construct of interest across groups or over time, measurement invariance needs to exist for at least a subset of the observed variables that define the construct. Often, chi-square difference tests are used to test for measurement invariance. However, these statistics are affected by sample size, such that larger sample sizes are associated with a greater prevalence of significant tests. Thus, using other measures of non-invariance to aid the decision process would be beneficial. For this dissertation project, I proposed four new effect size measures of measurement non-invariance and conducted a Monte Carlo simulation study to evaluate their properties and behavior, along with those of an existing effect size measure of non-invariance. The effect size measures were evaluated in terms of bias, variability, and consistency, and the factors that affected their values were analyzed. All studied effect sizes were consistent, but three were biased under certain conditions. Further work is needed to establish benchmarks for the unbiased effect sizes.
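For context on the chi-square difference test the abstract contrasts with effect size measures, the sketch below computes a likelihood ratio (delta chi-square) test from two hypothetical fit results: a model with loadings constrained equal across groups versus one with loadings free. The fit statistics are made-up numbers for illustration; the study's four new effect size measures are not reproduced here.

```python
# Worked example (made-up fit statistics) of a chi-square difference test for
# measurement invariance: constrained (invariant) model vs. free model.
from scipy.stats import chi2

chisq_constrained, df_constrained = 312.4, 110   # loadings constrained equal
chisq_free, df_free = 289.7, 100                 # loadings free across groups

delta_chisq = chisq_constrained - chisq_free
delta_df = df_constrained - df_free
p = chi2.sf(delta_chisq, delta_df)
print(f"delta chi-square({delta_df}) = {delta_chisq:.1f}, p = {p:.4f}")
```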
Date Created
2019

Psychometric and Machine Learning Approaches to Diagnostic Classification

Description

The goal of diagnostic assessment is to discriminate between groups. In many cases, a binary decision is made conditional on a cut score from a continuous scale. Psychometric methods can improve assessment by modeling a latent variable using item response theory (IRT), and IRT scores can subsequently be used to determine a cut score using receiver operating characteristic (ROC) curves. Psychometric methods provide reliable and interpretable scores, but the prediction of the diagnosis is not the primary product of the measurement process. In contrast, machine learning methods, such as regularization or binary recursive partitioning, can build a model from the assessment items to predict the probability of diagnosis. Machine learning predicts the diagnosis directly, but does not provide an inferential framework to explain why item responses are related to the diagnosis. It remains unclear whether psychometric and machine learning methods have comparable accuracy or if one method is preferable in some situations. In this study, Monte Carlo simulation methods were used to compare psychometric and machine learning methods on diagnostic classification accuracy. Results suggest that classification accuracy of psychometric models depends on the diagnostic-test correlation and prevalence of diagnosis. Also, machine learning methods that reduce prediction error have inflated specificity and very low sensitivity compared to the data-generating model, especially when prevalence is low. Finally, machine learning methods that use ROC curves to determine probability thresholds have comparable classification accuracy to the psychometric models as sample size, number of items, and number of item categories increase. Therefore, results suggest that machine learning models could provide a viable alternative for classification in diagnostic assessments. Strengths and limitations for each of the methods are discussed, and future directions are considered.
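The psychometric pipeline described here ends with an ROC-based cut score; the sketch below illustrates that last step on simulated scores by choosing the threshold that maximizes Youden's J. The simulated normal scores stand in for IRT score estimates, and the prevalence, separation, and use of scikit-learn's roc_curve are illustrative assumptions.

```python
# Illustrative sketch (assumed setup) of choosing a diagnostic cut score from
# an ROC curve by maximizing Youden's J. Simulated scores stand in for IRT
# score estimates.
import numpy as np
from sklearn.metrics import roc_curve

rng = np.random.default_rng(3)
n = 1000
diagnosis = rng.binomial(1, 0.3, size=n)               # 30% prevalence
scores = rng.normal(loc=diagnosis * 1.0, scale=1.0)    # higher scores if diagnosed

fpr, tpr, thresholds = roc_curve(diagnosis, scores)
cut = thresholds[np.argmax(tpr - fpr)]                 # maximize Youden's J

predicted = (scores >= cut).astype(int)
sensitivity = predicted[diagnosis == 1].mean()
specificity = 1 - predicted[diagnosis == 0].mean()
print(f"cut = {cut:.2f}, sensitivity = {sensitivity:.2f}, specificity = {specificity:.2f}")
```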
Date Created
2018