On the Fundamental Limits of Matrix Completion: Leveraging Hierarchical Similarity Graphs

Junhyung Ahn; Adel Elmahdy; Soheil Mohajer; Changho Suh

doi:10.1109/TIT.2023.3345902

On the Fundamental Limits of Matrix Completion: Leveraging Hierarchical Similarity Graphs

Junhyung Ahn, Adel Elmahdy, Soheil Mohajer, Changho Suh

Research output: Contribution to journal › Article › peer-review

1 Scopus citations

Abstract

We study a matrix completion problem which leverages a hierarchical structure of social similarity graphs as side information in the context of recommender systems. We assume that users are categorized into clusters, each of which comprises sub-clusters (or what we call 'groups'). We consider a hierarchical stochastic block model that well respects practically-relevant social graphs and follows a low-rank rating matrix model. Under this setting, we characterize the information-theoretic limit on the number of observed matrix entries (i.e., optimal sample complexity) as a function of the quality of graph side information (to be detailed) by proving sharp upper and lower bounds on the sample complexity. One important consequence of this result is that leveraging the hierarchical structure of similarity graphs yields a substantial gain in sample complexity relative to the one that simply identifies different groups without resorting to the relational structure across them. Another implication of the result is when the graph information is rich, the optimal sample complexity is proportional to the number of clusters, while it nearly stays constant as the number of groups in a cluster increases. We empirically demonstrate through extensive experiments that the proposed algorithm achieves the optimal sample complexity.

Original language	English (US)
Pages (from-to)	2039-2075
Number of pages	37
Journal	IEEE Transactions on Information Theory
Volume	70
Issue number	3
DOIs	https://doi.org/10.1109/TIT.2023.3345902
State	Published - Mar 1 2024
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 1963-2012 IEEE.

Keywords

Recommender systems
graph side information
matrix completion problem

Access

10.1109/TIT.2023.3345902

OpenUrl availability

Full text

Cite this

@article{3408cd46001a440ba590db068d911ffc,

title = "On the Fundamental Limits of Matrix Completion: Leveraging Hierarchical Similarity Graphs",

abstract = "We study a matrix completion problem which leverages a hierarchical structure of social similarity graphs as side information in the context of recommender systems. We assume that users are categorized into clusters, each of which comprises sub-clusters (or what we call 'groups'). We consider a hierarchical stochastic block model that well respects practically-relevant social graphs and follows a low-rank rating matrix model. Under this setting, we characterize the information-theoretic limit on the number of observed matrix entries (i.e., optimal sample complexity) as a function of the quality of graph side information (to be detailed) by proving sharp upper and lower bounds on the sample complexity. One important consequence of this result is that leveraging the hierarchical structure of similarity graphs yields a substantial gain in sample complexity relative to the one that simply identifies different groups without resorting to the relational structure across them. Another implication of the result is when the graph information is rich, the optimal sample complexity is proportional to the number of clusters, while it nearly stays constant as the number of groups in a cluster increases. We empirically demonstrate through extensive experiments that the proposed algorithm achieves the optimal sample complexity.",

keywords = "Recommender systems, graph side information, matrix completion problem",

author = "Junhyung Ahn and Adel Elmahdy and Soheil Mohajer and Changho Suh",

note = "Publisher Copyright: {\textcopyright} 1963-2012 IEEE.",

year = "2024",

month = mar,

day = "1",

doi = "10.1109/TIT.2023.3345902",

language = "English (US)",

volume = "70",

pages = "2039--2075",

journal = "IEEE Transactions on Information Theory",

issn = "0018-9448",

publisher = "IEEE",

number = "3",

}

TY - JOUR

T1 - On the Fundamental Limits of Matrix Completion

T2 - Leveraging Hierarchical Similarity Graphs

AU - Ahn, Junhyung

AU - Elmahdy, Adel

AU - Mohajer, Soheil

AU - Suh, Changho

PY - 2024/3/1

Y1 - 2024/3/1

N2 - We study a matrix completion problem which leverages a hierarchical structure of social similarity graphs as side information in the context of recommender systems. We assume that users are categorized into clusters, each of which comprises sub-clusters (or what we call 'groups'). We consider a hierarchical stochastic block model that well respects practically-relevant social graphs and follows a low-rank rating matrix model. Under this setting, we characterize the information-theoretic limit on the number of observed matrix entries (i.e., optimal sample complexity) as a function of the quality of graph side information (to be detailed) by proving sharp upper and lower bounds on the sample complexity. One important consequence of this result is that leveraging the hierarchical structure of similarity graphs yields a substantial gain in sample complexity relative to the one that simply identifies different groups without resorting to the relational structure across them. Another implication of the result is when the graph information is rich, the optimal sample complexity is proportional to the number of clusters, while it nearly stays constant as the number of groups in a cluster increases. We empirically demonstrate through extensive experiments that the proposed algorithm achieves the optimal sample complexity.

AB - We study a matrix completion problem which leverages a hierarchical structure of social similarity graphs as side information in the context of recommender systems. We assume that users are categorized into clusters, each of which comprises sub-clusters (or what we call 'groups'). We consider a hierarchical stochastic block model that well respects practically-relevant social graphs and follows a low-rank rating matrix model. Under this setting, we characterize the information-theoretic limit on the number of observed matrix entries (i.e., optimal sample complexity) as a function of the quality of graph side information (to be detailed) by proving sharp upper and lower bounds on the sample complexity. One important consequence of this result is that leveraging the hierarchical structure of similarity graphs yields a substantial gain in sample complexity relative to the one that simply identifies different groups without resorting to the relational structure across them. Another implication of the result is when the graph information is rich, the optimal sample complexity is proportional to the number of clusters, while it nearly stays constant as the number of groups in a cluster increases. We empirically demonstrate through extensive experiments that the proposed algorithm achieves the optimal sample complexity.

KW - Recommender systems

KW - graph side information

KW - matrix completion problem

UR - http://www.scopus.com/inward/record.url?scp=85181555409&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85181555409&partnerID=8YFLogxK

U2 - 10.1109/TIT.2023.3345902

DO - 10.1109/TIT.2023.3345902

M3 - Article

AN - SCOPUS:85181555409

SN - 0018-9448

VL - 70

SP - 2039

EP - 2075

JO - IEEE Transactions on Information Theory

JF - IEEE Transactions on Information Theory

IS - 3

ER -

On the Fundamental Limits of Matrix Completion: Leveraging Hierarchical Similarity Graphs

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this