Date of Award
4-1-2023
Degree Name
Doctor of Philosophy
Department
Statistics
First Advisor
Kevin H. Lee, Ph.D.
Second Advisor
Joshua Naranjo, Ph.D.
Third Advisor
Hyun Bin Kang, Ph.D.
Fourth Advisor
Jinseok Kim, Ph.D.
Keywords
Amazon product review data, clustering networks, exponential random graph models, stochastic block models, topic modelling (LDA), variational em algorithms mm algorithms
Abstract
As the online market grows rapidly, many companies and researchers are interested in analyzing product review dataset which includes ratings and text review data. In the first project, we mainly focus on analyzing the text review data. In the current literature, it is common to use only text analysis tools to analyze review dataset. But in our work, we propose a method that utilizes both a text analysis method such as topic modeling and a statistical network model to build network among individuals and find interesting communities. We introduce a promising framework that incorporates topic modeling technique to define the edges among the individuals and form a network and uses stochastic Blockmodels (SBM) to find the communities.
The second project involves clustering networks leveraging covariate information. Exponential random graph models (ERGMs) offer a variety of statistical network data and applications. Finding nodes that have similar patterns is an imperative research question. Our concept of utilizing covariate information in networks is a response to the non-homogeneous situation that exists in many network data sets. To find approximate maximum likelihood estimates of network and covariates parameters, and mixing proportions, we present a variational expectation maximization algorithm. The power of our model is demonstrated in simulation studies.
Access Setting
Dissertation-Open Access
Recommended Citation
Atandoh, Paul, "Statistical Clustering of Networks with Additional Information" (2023). Dissertations. 3932.
https://scholarworks.wmich.edu/dissertations/3932