Raven: Belady-Guided, Predictive (Deep) Learning for In-Memory and Content Caching

Xinyue Hu; Eman Ramadan; Wei Ye; Feng Tian; Zhi Li Zhang

doi:10.1145/3555050.3569134

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

2 Scopus citations

Abstract

The performance of caching algorithms not only determines the quality of experience for users but also affects the operating and capital expenditures of cloud service providers. Today's production systems rely on heuristics such as LRU (least recently used) and its variants, which work well for certain types of workloads but cannot effectively cope with diverse and time-varying workload characteristics. While learning-based caching algorithms have been proposed to deal with these challenges, they still impose assumptions about workload characteristics and often suffer from poor generalizability. In this paper, we propose Raven, a general learning-based caching framework that leverages insights from the offline optimal Belady algorithm for both in-memory and content caching. Raven learns the distributions of objects' next-request arrival times without any prior assumptions by employing Mixture Density Network (MDN)-based universal distribution estimation. It uses the estimated distributions to compute, for each object in the cache, the probability that its next request arrives farther in the future than that of any other cached object, and evicts the object with the largest such probability, regulated by object size where appropriate. Raven thus (probabilistically) approximates Belady by explicitly accounting for the stochastic, time-varying, and non-stationary nature of object arrival processes. Evaluation results on production workloads demonstrate that, compared with the best existing caching algorithms, Raven improves the object hit ratio and byte hit ratio by up to 7.3% and 7.1%, respectively, and reduces the average access latency by up to 17.9% and the traffic to the origin servers by up to 18.8%.
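
The eviction rule described in the abstract can be illustrated with a small sketch. It is not the authors' implementation. It assumes each cached object already has a predicted mixture distribution (as an MDN might output) and that each component is a Gaussian over the log of the time until the next request. The farthest-arrival probability is estimated by Monte Carlo sampling, and size regulation is left out. All names (Component, farthest_probabilities, choose_victim) are hypothetical.

```python
import random
from typing import Dict, List, Tuple

# Assumption: one mixture component is (weight, mean, std) of a Gaussian
# over log(time until the object's next request). Hypothetical layout.
Component = Tuple[float, float, float]


def sample_next_arrival(mixture: List[Component], rng: random.Random) -> float:
    """Draw one log next-arrival time from a Gaussian mixture."""
    weights = [w for w, _, _ in mixture]
    # Pick a component in proportion to its weight, then sample from it.
    _, mean, std = rng.choices(mixture, weights=weights, k=1)[0]
    return rng.gauss(mean, std)


def farthest_probabilities(
    cache: Dict[str, List[Component]], n_samples: int = 2000, seed: int = 0
) -> Dict[str, float]:
    """Estimate, per object, the probability that its next request
    arrives later than that of every other cached object."""
    rng = random.Random(seed)
    wins = {key: 0 for key in cache}
    for _ in range(n_samples):
        draws = {key: sample_next_arrival(mix, rng) for key, mix in cache.items()}
        # The object with the latest sampled arrival "wins" this trial.
        wins[max(draws, key=draws.get)] += 1
    return {key: count / n_samples for key, count in wins.items()}


def choose_victim(
    cache: Dict[str, List[Component]], n_samples: int = 2000, seed: int = 0
) -> str:
    """Return the key with the largest estimated farthest-arrival probability."""
    probs = farthest_probabilities(cache, n_samples, seed)
    return max(probs, key=probs.get)


if __name__ == "__main__":
    # Toy predicted mixtures for three cached objects (made-up numbers).
    cache = {
        "hot": [(1.0, 0.0, 0.1)],
        "warm": [(0.5, 1.0, 0.2), (0.5, 2.0, 0.2)],
        "cold": [(1.0, 5.0, 0.1)],
    }
    print(farthest_probabilities(cache))
    print("evict:", choose_victim(cache))
```

Sampling is used here only because it is easy to check. A closed-form or numerical integration over the mixture densities would serve the same purpose.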

Original language	English (US)
Title of host publication	CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies
Publisher	Association for Computing Machinery, Inc
Pages	72-90
Number of pages	19
ISBN (Electronic)	9781450395083
DOIs	https://doi.org/10.1145/3555050.3569134
State	Published - Nov 30 2022
Event	18th ACM Conference on Emerging Networking Experiment and Technologies, CoNEXT 2022 - Rome, Italy
Duration	Dec 6 2022 → Dec 9 2022

Publication series

Name	CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies

Conference

Conference	18th ACM Conference on Emerging Networking Experiment and Technologies, CoNEXT 2022
Country/Territory	Italy
City	Rome
Period	12/6/22 → 12/9/22

Bibliographical note

Publisher Copyright:
© 2022 ACM.

Cite this

Hu, X., Ramadan, E., Ye, W., Tian, F., & Zhang, Z. L. (2022). Raven: Belady-Guided, Predictive (Deep) Learning for In-Memory and Content Caching. In CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies (pp. 72-90). (CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies). Association for Computing Machinery, Inc. https://doi.org/10.1145/3555050.3569134

@inproceedings{921693e6b42b46018578df72b274934e,

title = "Raven: Belady-Guided, Predictive (Deep) Learning for In-Memory and Content Caching",

author = "Xinyue Hu and Eman Ramadan and Wei Ye and Feng Tian and Zhang, {Zhi Li}",

note = "Publisher Copyright: {\textcopyright} 2022 ACM.; 18th ACM Conference on Emerging Networking Experiment and Technologies, CoNEXT 2022 ; Conference date: 06-12-2022 Through 09-12-2022",

year = "2022",

month = nov,

day = "30",

doi = "10.1145/3555050.3569134",

language = "English (US)",

series = "CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies",

publisher = "Association for Computing Machinery, Inc",

pages = "72--90",

booktitle = "CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies",

}

TY - GEN

T1 - Raven: Belady-Guided, Predictive (Deep) Learning for In-Memory and Content Caching

T2 - 18th ACM Conference on Emerging Networking Experiment and Technologies, CoNEXT 2022

AU - Hu, Xinyue

AU - Ramadan, Eman

AU - Ye, Wei

AU - Tian, Feng

AU - Zhang, Zhi Li

PY - 2022/11/30

Y1 - 2022/11/30

UR - http://www.scopus.com/inward/record.url?scp=85144825823&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85144825823&partnerID=8YFLogxK

U2 - 10.1145/3555050.3569134

DO - 10.1145/3555050.3569134

M3 - Conference contribution

AN - SCOPUS:85144825823

T3 - CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies

SP - 72

EP - 90

BT - CoNEXT 2022 - Proceedings of the 18th International Conference on emerging Networking EXperiments and Technologies

PB - Association for Computing Machinery, Inc

Y2 - 6 December 2022 through 9 December 2022

ER -
