Saliency prediction with external knowledge

Yifeng Zhang; Ming Jiang; Qi Zhao

doi:10.1109/WACV48630.2021.00053

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

6 Scopus citations

Abstract

The last decades have seen great progress in saliency prediction, with the success of deep neural networks that are able to encode high-level semantics. Yet, while humans have an innate capability to leverage their knowledge to decide where to look (e.g., people pay more attention to familiar faces such as celebrities), saliency prediction models have only been trained with large eye-tracking datasets. This work proposes to bridge this gap by explicitly incorporating external knowledge for saliency models as humans do. We develop networks that learn to highlight regions by incorporating prior knowledge of semantic relationships, be it general or domain-specific, depending on the task of interest. At the core of the method is a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge. A Spatial Graph Attention Network is then developed to update saliency features based on the learned graph. Experiments show that the proposed model learns to predict saliency from the external knowledge and outperforms the state-of-the-art on four saliency benchmarks.

Original language	English (US)
Title of host publication	Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	484-493
Number of pages	10
ISBN (Electronic)	9780738142661
DOIs	https://doi.org/10.1109/WACV48630.2021.00053
State	Published - Jan 2021
Event	2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 - Virtual, Online, United States
Duration	Jan 5 2021 → Jan 9 2021

Publication series

Name	Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021

Conference

Conference	2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021
Country/Territory	United States
City	Virtual, Online
Period	1/5/21 → 1/9/21

Bibliographical note

Publisher Copyright:
© 2021 IEEE.

Cite this

Zhang, Y., Jiang, M., & Zhao, Q. (2021). Saliency prediction with external knowledge. In Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021 (pp. 484-493). (Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/WACV48630.2021.00053

Saliency prediction with external knowledge. / Zhang, Yifeng; Jiang, Ming; Zhao, Qi.
Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021. Institute of Electrical and Electronics Engineers Inc., 2021. p. 484-493 (Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021).

Zhang, Y, Jiang, M & Zhao, Q 2021, Saliency prediction with external knowledge. in Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021. Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Institute of Electrical and Electronics Engineers Inc., pp. 484-493, 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021, Virtual, Online, United States, 1/5/21. https://doi.org/10.1109/WACV48630.2021.00053

@inproceedings{df5d2d86c40b41cdb1958c862a0f6850,

title = "Saliency prediction with external knowledge",

abstract = "The last decades have seen great progress in saliency prediction, with the success of deep neural networks that are able to encode high-level semantics. Yet, while humans have the innate capability in leveraging their knowledge to decide where to look (e.g. people pay more attention to familiar faces such as celebrities), saliency prediction models have only been trained with large eye-tracking datasets. This work proposes to bridge this gap by explicitly incorporating external knowledge for saliency models as humans do. We develop networks that learn to highlight regions by incorporating prior knowledge of semantic relationships, be it general or domain-specific, depending on the task of interest. At the core of the method is a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge. A Spatial Graph Attention Network is then developed to update saliency features based on the learned graph. Experiments show that the proposed model learns to predict saliency from the external knowledge and outperforms the state-of-the-art on four saliency benchmarks.",

author = "Yifeng Zhang and Ming Jiang and Qi Zhao",

note = "Publisher Copyright: {\textcopyright} 2021 IEEE; 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021; Conference date: 05-01-2021 through 09-01-2021",

year = "2021",

month = jan,

doi = "10.1109/WACV48630.2021.00053",

language = "English (US)",

series = "Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "484--493",

booktitle = "Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021",

}

TY - GEN

T1 - Saliency prediction with external knowledge

AU - Zhang, Yifeng

AU - Jiang, Ming

AU - Zhao, Qi

PY - 2021/1

Y1 - 2021/1

N2 - The last decades have seen great progress in saliency prediction, with the success of deep neural networks that are able to encode high-level semantics. Yet, while humans have the innate capability in leveraging their knowledge to decide where to look (e.g. people pay more attention to familiar faces such as celebrities), saliency prediction models have only been trained with large eye-tracking datasets. This work proposes to bridge this gap by explicitly incorporating external knowledge for saliency models as humans do. We develop networks that learn to highlight regions by incorporating prior knowledge of semantic relationships, be it general or domain-specific, depending on the task of interest. At the core of the method is a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge. A Spatial Graph Attention Network is then developed to update saliency features based on the learned graph. Experiments show that the proposed model learns to predict saliency from the external knowledge and outperforms the state-of-the-art on four saliency benchmarks.

AB - The last decades have seen great progress in saliency prediction, with the success of deep neural networks that are able to encode high-level semantics. Yet, while humans have the innate capability in leveraging their knowledge to decide where to look (e.g. people pay more attention to familiar faces such as celebrities), saliency prediction models have only been trained with large eye-tracking datasets. This work proposes to bridge this gap by explicitly incorporating external knowledge for saliency models as humans do. We develop networks that learn to highlight regions by incorporating prior knowledge of semantic relationships, be it general or domain-specific, depending on the task of interest. At the core of the method is a new Graph Semantic Saliency Network (GraSSNet) that constructs a graph that encodes semantic relationships learned from external knowledge. A Spatial Graph Attention Network is then developed to update saliency features based on the learned graph. Experiments show that the proposed model learns to predict saliency from the external knowledge and outperforms the state-of-the-art on four saliency benchmarks.

UR - http://www.scopus.com/inward/record.url?scp=85115853518&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85115853518&partnerID=8YFLogxK

U2 - 10.1109/WACV48630.2021.00053

DO - 10.1109/WACV48630.2021.00053

M3 - Conference contribution

AN - SCOPUS:85115853518

T3 - Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021

SP - 484

EP - 493

BT - Proceedings - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021

Y2 - 5 January 2021 through 9 January 2021

ER -
