Backpropagation Computation for Training Graph Attention Networks

Joe Gould, Keshab K. Parhi

Research output: Contribution to journal › Article › peer-review

Abstract

Graph Neural Networks (GNNs) are a form of deep learning that has found use in a variety of problems, including the modeling of drug interactions, time-series analysis, and traffic prediction. They represent the problem using non-Euclidean graphs, allowing a high degree of versatility, and learn complex relationships by iteratively aggregating contextual information from increasingly distant neighbors. Inspired by the power of attention in transformers, Graph Attention Networks (GATs) incorporate an attention mechanism on top of graph aggregation and are considered the state of the art due to their superior performance. To learn the best parameters for a given graph problem, GATs use traditional backpropagation to compute weight updates. To the best of our knowledge, these updates are calculated in software, and closed-form equations describing their calculation for GATs are not well known. This paper derives closed-form equations for backpropagation in GATs using matrix notation. These equations can form the basis for the design of hardware accelerators for training GATs.
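As a rough illustration of the quantities involved (not taken from the paper), the following minimal NumPy sketch implements the forward pass of a single-head GAT layer. Variable names such as W, a_src, and a_dst are illustrative assumptions rather than the paper's notation; the paper's contribution is the closed-form matrix expressions for the gradients of a training loss with respect to such parameters.

    import numpy as np

    def leaky_relu(x, slope=0.2):
        return np.where(x > 0, x, slope * x)

    def gat_layer_forward(X, A, W, a_src, a_dst):
        # X: (N, F) node features; A: (N, N) adjacency with self-loops
        # W: (F, F') projection weights; a_src, a_dst: (F',) attention vectors
        H = X @ W                                                      # shared linear projection
        e = leaky_relu((H @ a_src)[:, None] + (H @ a_dst)[None, :])   # raw attention logits e_ij
        e = np.where(A > 0, e, -np.inf)                                # mask non-neighbors
        alpha = np.exp(e - e.max(axis=1, keepdims=True))
        alpha /= alpha.sum(axis=1, keepdims=True)                      # softmax over each node's neighborhood
        return alpha @ H                                               # attention-weighted aggregation

    # Example: 4 nodes, 3 input features, 2 output features (hypothetical data)
    rng = np.random.default_rng(0)
    X = rng.normal(size=(4, 3))
    A = np.eye(4) + (rng.random((4, 4)) > 0.5)   # random graph with self-loops
    out = gat_layer_forward(X, A, rng.normal(size=(3, 2)),
                            rng.normal(size=2), rng.normal(size=2))

Backpropagation through such a layer requires gradients through the neighborhood softmax, the LeakyReLU, and the shared projection W; expressing these gradients directly in closed matrix form, as the paper does, is what makes a dedicated training accelerator tractable to design.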

Original language: English (US)
Pages (from-to): 1-14
Number of pages: 14
Journal: Journal of Signal Processing Systems
Volume: 96
Issue number: 1
DOIs
State: Published - Jan 2024
Externally published: Yes

Bibliographical note

Publisher Copyright:
© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023.

Keywords

  • Backpropagation
  • Gradient computation
  • Graph attention networks
  • Neural network training
