"A Testing Based Extraction Algorithm for Identifying Significant Commu" by James D. Wilson, Simi Wang et al.

Mathematics

Title

A Testing Based Extraction Algorithm for Identifying Significant Communities in Networks

Authors

James D. Wilson, University of San FranciscoFollow
Simi Wang
Peter J. Mucha
Shankar Bhamidi
Andrew B. Nobel

Document Type

Article

Publication Date

2014

Abstract

A common and important problem arising in the study of networks is how to divide the vertices of a given network into one or more groups, called communities, in such a way that vertices of the same community are more interconnected than vertices belonging to different ones. We propose and investigate a testing based community detection procedure called Extraction of Statistically Significant Communities (ESSC). The ESSC procedure is based on p-values for the strength of connection between a single vertex and a set of vertices under a reference distribution derived from a conditional configuration network model. The procedure automatically selects both the number of communities in the network and their size. Moreover, ESSC can handle overlapping communities and, unlike the majority of existing methods, identifies “background” vertices that do not belong to a well-defined community. The method has only one parameter, which controls the stringency of the hypothesis tests. We investigate the performance and potential use of ESSC and compare it with a number of existing methods, through a validation study using four real network data sets. In addition, we carry out a simulation study to assess the effectiveness of ESSC in networks with various types of community structure, including networks with overlapping communities and those with background vertices. These results suggest that ESSC is an effective exploratory tool for the discovery of relevant community structure in complex network systems.

Comments

Originally published in Annals of Applied Statistics.

Original published version available at: http://dx.doi.org/10.1214/14-AOAS760

DOI

10.1214/14-AOAS760

Recommended Citation

Wilson, J. D., Wang, S., Mucha, P.J., Bhamidi, S., & Nobel, A.B. (2014). A testing based extraction algorithm for identifying significant communities in networks. Annals of Applied Statistics, Vol. 8, No. 3, 1853-1891. http://dx.doi.org/10.1214/14-AOAS760

Download

Find in your library

Included in

Mathematics Commons

COinS

Mathematics

Title

Authors

Document Type

Publication Date

Abstract

Comments

DOI

Recommended Citation

Included in

Search

Browse

Author Corner

Links

Library Links

Mathematics

Title

Authors

Document Type

Publication Date

Abstract

Comments

DOI

Recommended Citation

Included in

Share

Search

Browse

Author Corner

Links

Library Links