Truss decomposition on shared-memory parallel systems

Shaden Smith; Xing Liu; Nesreen K. Ahmed; Ancy Sarah Tom; Fabrizio Petrini; George Karypis

doi:10.1109/HPEC.2017.8091049

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

43 Scopus citations

Abstract

The scale of data used in graph analytics grows at an unprecedented rate. More than ever, domain experts require efficient and parallel algorithms for tasks in graph analytics. One such task is the truss decomposition, which is a hierarchical decomposition of the edges of a graph and is closely related to the task of triangle enumeration. As evidenced by the recent GraphChallenge, existing algorithms and implementations for truss decomposition are insufficient for the scale of modern datasets. In this work, we propose a parallel algorithm for computing the truss decomposition of massive graphs on a shared-memory system. Our algorithm breaks a computation-efficient serial algorithm into several bulk-synchronous parallel steps which do not rely on atomics or other fine-grained synchronization. We evaluate our algorithm across a variety of synthetic and real-world datasets on a 56-core Intel Xeon system. Our serial implementation achieves over 1400× speedup over the provided GraphChallenge serial benchmark implementation and is up to 28× faster than the state-of-the-art shared-memory parallel algorithm.
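To make the term concrete, the following is a minimal serial sketch of truss decomposition by edge peeling. It is not the parallel algorithm proposed in the paper, and it is not the GraphChallenge benchmark code; it only illustrates the link to triangle enumeration under the common definition that a k-truss is a subgraph in which every edge lies in at least k - 2 triangles. The function name `truss_decomposition` and the integer vertex ids are assumptions made for this illustration.

```python
def truss_decomposition(edges):
    """Return {(u, v): trussness} for an undirected simple graph.

    Illustrative serial peeling only (hypothetical helper, not the
    paper's method). Vertices are assumed to be comparable, e.g. ints.
    """
    # Build adjacency sets, ignoring self-loops and duplicate edges.
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    def key(u, v):
        # Canonical edge id: smaller endpoint first.
        return (u, v) if u <= v else (v, u)

    # Support of an edge = number of triangles that contain it.
    support = {
        key(u, v): len(adj[u] & adj[v])
        for u in adj
        for v in adj[u]
        if u < v
    }

    trussness = {}
    k = 2
    while support:
        # Edges with support <= k - 2 cannot belong to the (k+1)-truss.
        queue = [e for e, s in support.items() if s <= k - 2]
        if not queue:
            k += 1
            continue
        while queue:
            e = queue.pop()
            if e not in support:  # already peeled via another path
                continue
            u, v = e
            # Removing (u, v) destroys one triangle per common neighbor.
            for w in adj[u] & adj[v]:
                for f in (key(u, w), key(v, w)):
                    support[f] -= 1
                    if support[f] <= k - 2:
                        queue.append(f)
            adj[u].discard(v)
            adj[v].discard(u)
            del support[e]
            trussness[e] = k
    return trussness
```

For example, on a 4-clique with one pendant edge attached, the six clique edges each lie in two triangles and end up in the 4-truss, while the pendant edge lies in no triangle and gets trussness 2.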

Original language	English (US)
Title of host publication	2017 IEEE High Performance Extreme Computing Conference, HPEC 2017
Publisher	Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)	9781538634721
DOIs	https://doi.org/10.1109/HPEC.2017.8091049
State	Published - Oct 30 2017
Event	2017 IEEE High Performance Extreme Computing Conference, HPEC 2017 - Waltham, United States; Duration: Sep 12 2017 → Sep 14 2017

Cite this

Smith, S., Liu, X., Ahmed, N. K., Tom, A. S., Petrini, F., & Karypis, G. (2017). Truss decomposition on shared-memory parallel systems. In 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017, Article 8091049 (2017 IEEE High Performance Extreme Computing Conference, HPEC 2017). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/HPEC.2017.8091049

Truss decomposition on shared-memory parallel systems. / Smith, Shaden; Liu, Xing; Ahmed, Nesreen K. et al.
2017 IEEE High Performance Extreme Computing Conference, HPEC 2017. Institute of Electrical and Electronics Engineers Inc., 2017. 8091049 (2017 IEEE High Performance Extreme Computing Conference, HPEC 2017).

Smith, S, Liu, X, Ahmed, NK, Tom, AS, Petrini, F & Karypis, G 2017, Truss decomposition on shared-memory parallel systems. in 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017., 8091049, 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017, Institute of Electrical and Electronics Engineers Inc., 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017, Waltham, United States, 9/12/17. https://doi.org/10.1109/HPEC.2017.8091049

@inproceedings{1fe7fbb0ac5843f8b5ee9650a86d5b59,
  title = "Truss decomposition on shared-memory parallel systems",
  abstract = "The scale of data used in graph analytics grows at an unprecedented rate. More than ever, domain experts require efficient and parallel algorithms for tasks in graph analytics. One such task is the truss decomposition, which is a hierarchical decomposition of the edges of a graph and is closely related to the task of triangle enumeration. As evidenced by the recent GraphChallenge, existing algorithms and implementations for truss decomposition are insufficient for the scale of modern datasets. In this work, we propose a parallel algorithm for computing the truss decomposition of massive graphs on a shared-memory system. Our algorithm breaks a computation-efficient serial algorithm into several bulk-synchronous parallel steps which do not rely on atomics or other fine-grained synchronization. We evaluate our algorithm across a variety of synthetic and real-world datasets on a 56-core Intel Xeon system. Our serial implementation achieves over 1400 × speedup over the provided GraphChallenge serial benchmark implementation and is up to 28 × faster than the state-of-the-art shared-memory parallel algorithm.",
  author = "Shaden Smith and Xing Liu and Ahmed, {Nesreen K.} and Tom, {Ancy Sarah} and Fabrizio Petrini and George Karypis",
  year = "2017",
  month = oct,
  day = "30",
  doi = "10.1109/HPEC.2017.8091049",
  language = "English (US)",
  series = "2017 IEEE High Performance Extreme Computing Conference, HPEC 2017",
  publisher = "Institute of Electrical and Electronics Engineers Inc.",
  booktitle = "2017 IEEE High Performance Extreme Computing Conference, HPEC 2017",
  note = "2017 IEEE High Performance Extreme Computing Conference, HPEC 2017 ; Conference date: 12-09-2017 Through 14-09-2017",
}

TY - GEN
T1 - Truss decomposition on shared-memory parallel systems
AU - Smith, Shaden
AU - Liu, Xing
AU - Ahmed, Nesreen K.
AU - Tom, Ancy Sarah
AU - Petrini, Fabrizio
AU - Karypis, George
PY - 2017/10/30
Y1 - 2017/10/30
N2 - The scale of data used in graph analytics grows at an unprecedented rate. More than ever, domain experts require efficient and parallel algorithms for tasks in graph analytics. One such task is the truss decomposition, which is a hierarchical decomposition of the edges of a graph and is closely related to the task of triangle enumeration. As evidenced by the recent GraphChallenge, existing algorithms and implementations for truss decomposition are insufficient for the scale of modern datasets. In this work, we propose a parallel algorithm for computing the truss decomposition of massive graphs on a shared-memory system. Our algorithm breaks a computation-efficient serial algorithm into several bulk-synchronous parallel steps which do not rely on atomics or other fine-grained synchronization. We evaluate our algorithm across a variety of synthetic and real-world datasets on a 56-core Intel Xeon system. Our serial implementation achieves over 1400 × speedup over the provided GraphChallenge serial benchmark implementation and is up to 28 × faster than the state-of-the-art shared-memory parallel algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85041205217&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85041205217&partnerID=8YFLogxK
U2 - 10.1109/HPEC.2017.8091049
DO - 10.1109/HPEC.2017.8091049
M3 - Conference contribution
AN - SCOPUS:85041205217
T3 - 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017
BT - 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 IEEE High Performance Extreme Computing Conference, HPEC 2017
Y2 - 12 September 2017 through 14 September 2017
ER -
