Efficient Structural Clustering on Probabilistic Graphs

Yu Xuan Qiu, Rong Hua Li, Jianxin Li, Shaojie Qiao, Guoren Wang, Jeffrey Xu Yu, Rui Mao

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

Structural clustering is a fundamental graph mining operator which is not only able to find densely-connected clusters, but it can also identify hub vertices and outliers in the graph. Previous structural clustering algorithms are tailored to deterministic graphs. Many real-world graphs, however, are not deterministic, but are probabilistic in nature because the existence of the edge is often inferred using a variety of statistical approaches. In this paper, we formulate the problem of structural clustering on probabilistic graphs, with the aim of finding reliable clusters in a given probabilistic graph. Unlike the traditional structural clustering problem, our problem relies mainly on a novel concept called reliable structural similarity which measures the probability of the similarity between two vertices in the probabilistic graph. We develop a dynamic programming algorithm with several powerful pruning strategies to efficiently compute the reliable structural similarities. With the reliable structural similarities, we adapt an existing solution framework to calculate the structural clustering on probabilistic graphs. Comprehensive experiments on five real-life datasets demonstrate the effectiveness and efficiency of the proposed approaches.

Original languageEnglish
Article number8476242
Pages (from-to)1954-1968
Number of pages15
JournalIEEE Transactions on Knowledge and Data Engineering
Volume31
Issue number10
DOIs
Publication statusPublished - 28 Sep 2019

Fingerprint

Dynamic programming
Clustering algorithms
Experiments

Cite this

Qiu, Y. X., Li, R. H., Li, J., Qiao, S., Wang, G., Yu, J. X., & Mao, R. (2019). Efficient Structural Clustering on Probabilistic Graphs. IEEE Transactions on Knowledge and Data Engineering, 31(10), 1954-1968. [8476242]. https://doi.org/10.1109/TKDE.2018.2872553
Qiu, Yu Xuan ; Li, Rong Hua ; Li, Jianxin ; Qiao, Shaojie ; Wang, Guoren ; Yu, Jeffrey Xu ; Mao, Rui. / Efficient Structural Clustering on Probabilistic Graphs. In: IEEE Transactions on Knowledge and Data Engineering. 2019 ; Vol. 31, No. 10. pp. 1954-1968.
@article{8acd6b1c9b9a444695435e6e4fb7f8ef,
title = "Efficient Structural Clustering on Probabilistic Graphs",
abstract = "Structural clustering is a fundamental graph mining operator which is not only able to find densely-connected clusters, but it can also identify hub vertices and outliers in the graph. Previous structural clustering algorithms are tailored to deterministic graphs. Many real-world graphs, however, are not deterministic, but are probabilistic in nature because the existence of the edge is often inferred using a variety of statistical approaches. In this paper, we formulate the problem of structural clustering on probabilistic graphs, with the aim of finding reliable clusters in a given probabilistic graph. Unlike the traditional structural clustering problem, our problem relies mainly on a novel concept called reliable structural similarity which measures the probability of the similarity between two vertices in the probabilistic graph. We develop a dynamic programming algorithm with several powerful pruning strategies to efficiently compute the reliable structural similarities. With the reliable structural similarities, we adapt an existing solution framework to calculate the structural clustering on probabilistic graphs. Comprehensive experiments on five real-life datasets demonstrate the effectiveness and efficiency of the proposed approaches.",
keywords = "Probabilistic graph, reliable structural similarity, structural clustering",
author = "Qiu, {Yu Xuan} and Li, {Rong Hua} and Jianxin Li and Shaojie Qiao and Guoren Wang and Yu, {Jeffrey Xu} and Rui Mao",
year = "2019",
month = "9",
day = "28",
doi = "10.1109/TKDE.2018.2872553",
language = "English",
volume = "31",
pages = "1954--1968",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "IEEE, Institute of Electrical and Electronics Engineers",
number = "10",

}

Qiu, YX, Li, RH, Li, J, Qiao, S, Wang, G, Yu, JX & Mao, R 2019, 'Efficient Structural Clustering on Probabilistic Graphs' IEEE Transactions on Knowledge and Data Engineering, vol. 31, no. 10, 8476242, pp. 1954-1968. https://doi.org/10.1109/TKDE.2018.2872553

Efficient Structural Clustering on Probabilistic Graphs. / Qiu, Yu Xuan; Li, Rong Hua; Li, Jianxin; Qiao, Shaojie; Wang, Guoren; Yu, Jeffrey Xu; Mao, Rui.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 31, No. 10, 8476242, 28.09.2019, p. 1954-1968.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Efficient Structural Clustering on Probabilistic Graphs

AU - Qiu, Yu Xuan

AU - Li, Rong Hua

AU - Li, Jianxin

AU - Qiao, Shaojie

AU - Wang, Guoren

AU - Yu, Jeffrey Xu

AU - Mao, Rui

PY - 2019/9/28

Y1 - 2019/9/28

N2 - Structural clustering is a fundamental graph mining operator which is not only able to find densely-connected clusters, but it can also identify hub vertices and outliers in the graph. Previous structural clustering algorithms are tailored to deterministic graphs. Many real-world graphs, however, are not deterministic, but are probabilistic in nature because the existence of the edge is often inferred using a variety of statistical approaches. In this paper, we formulate the problem of structural clustering on probabilistic graphs, with the aim of finding reliable clusters in a given probabilistic graph. Unlike the traditional structural clustering problem, our problem relies mainly on a novel concept called reliable structural similarity which measures the probability of the similarity between two vertices in the probabilistic graph. We develop a dynamic programming algorithm with several powerful pruning strategies to efficiently compute the reliable structural similarities. With the reliable structural similarities, we adapt an existing solution framework to calculate the structural clustering on probabilistic graphs. Comprehensive experiments on five real-life datasets demonstrate the effectiveness and efficiency of the proposed approaches.

AB - Structural clustering is a fundamental graph mining operator which is not only able to find densely-connected clusters, but it can also identify hub vertices and outliers in the graph. Previous structural clustering algorithms are tailored to deterministic graphs. Many real-world graphs, however, are not deterministic, but are probabilistic in nature because the existence of the edge is often inferred using a variety of statistical approaches. In this paper, we formulate the problem of structural clustering on probabilistic graphs, with the aim of finding reliable clusters in a given probabilistic graph. Unlike the traditional structural clustering problem, our problem relies mainly on a novel concept called reliable structural similarity which measures the probability of the similarity between two vertices in the probabilistic graph. We develop a dynamic programming algorithm with several powerful pruning strategies to efficiently compute the reliable structural similarities. With the reliable structural similarities, we adapt an existing solution framework to calculate the structural clustering on probabilistic graphs. Comprehensive experiments on five real-life datasets demonstrate the effectiveness and efficiency of the proposed approaches.

KW - Probabilistic graph

KW - reliable structural similarity

KW - structural clustering

UR - http://www.scopus.com/inward/record.url?scp=85054383229&partnerID=8YFLogxK

U2 - 10.1109/TKDE.2018.2872553

DO - 10.1109/TKDE.2018.2872553

M3 - Article

VL - 31

SP - 1954

EP - 1968

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 10

M1 - 8476242

ER -