Your browser doesn't support javascript.
Generalized k -means in GLMs with applications to the outbreak of COVID-19 in the United States.
Zhang, Tonglin; Lin, Ge.
  • Zhang T; Department of Statistics, Purdue University, 250 North University Street, West Lafayette, IN 47907-2066, USA.
  • Lin G; Department of Environmental and Occupational Health, University of Nevada Las Vegas, Las Vegas, NV 89154, USA.
Comput Stat Data Anal ; 159: 107217, 2021 Jul.
Article in English | MEDLINE | ID: covidwho-1126791
ABSTRACT
Generalized k -means can be combined with any similarity or dissimilarity measure for clustering. Using the well known likelihood ratio or F -statistic as the dissimilarity measure, a generalized k -means method is proposed to group generalized linear models (GLMs) for exponential family distributions. Given the number of clusters k , the proposed method is established by the uniform most powerful unbiased (UMPU) test statistic for the comparison between GLMs. If k is unknown, then the proposed method can be combined with generalized liformation criterion (GIC) to automatically select the best k for clustering. Both AIC and BIC are investigated as special cases of GIC. Theoretical and simulation results show that the number of clusters can be correctly identified by BIC but not AIC. The proposed method is applied to the state-level daily COVID-19 data in the United States, and it identifies 6 clusters. A further study shows that the models between clusters are significantly different from each other, which confirms the result with 6 clusters.
Keywords

Full text: Available Collection: International databases Database: MEDLINE Language: English Journal: Comput Stat Data Anal Year: 2021 Document Type: Article Affiliation country: J.csda.2021.107217

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Language: English Journal: Comput Stat Data Anal Year: 2021 Document Type: Article Affiliation country: J.csda.2021.107217