COVID-19 Literature Search via Natural Language Processing Tools
The COVID-19 pandemic began in March of 2020 and drastically affected the global human population. Millions of people died due to a SARS-CoV-2 infection while many who survived developed devastating sequelae of the disease. In addition, the closure of schools and businesses led to international economic struggle in the year 2020 as global economies declined. Since the beginning of the pandemic, over 200,000 scientific articles have been published and compiled into a database that grows daily— a rare occurrence within the scientific community. This thesis uses natural language processing tools via Python and VOSviewer software to perform a bibliometric analysis on 205,712 papers published between January of 2020 and February of 2021 pertaining to COVID-19. We first investigate how to analyze these publications most effectively in terms of title versus abstract keyword searches, we further obtain the focus of the current scientific literature via co-occurrence analysis and clustering, and we at last discuss the time evolution of these topics over the course of 14 months.
- Author (aut): Lovell, Madison Ray
- Thesis director: Zheng, Wenwei
- Committee member: Melkozernov, Alexander
- Contributor (ctb): College of Integrative Sciences and Arts
- Contributor (ctb): Barrett, The Honors College