Distributed Rate Scaling in Large-Scale Service Systems

Daan Rutten; Martin Zubeldia; Debankur Mukherjee

doi:10.1145/3626570.3626579

Distributed Rate Scaling in Large-Scale Service Systems

Daan Rutten, Martin Zubeldia, Debankur Mukherjee

Industrial and Systems Engineering

Research output: Contribution to journal › Article › peer-review

Abstract

We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.

Original language	English (US)
Pages (from-to)	21-23
Number of pages	3
Journal	Performance Evaluation Review
Volume	51
Issue number	2
DOIs	https://doi.org/10.1145/3626570.3626579
State	Published - Oct 2 2023

Bibliographical note

Publisher Copyright:
© 2023 Copyright is held by the owner/author(s).

Keywords

distributed optimization
load balancing
rate-scaling

Access

10.1145/3626570.3626579

OpenUrl availability

Full text

Cite this

@article{2f5ff26a84b64b0eb0364d22e7a59075,

title = "Distributed Rate Scaling in Large-Scale Service Systems",

abstract = "We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.",

keywords = "distributed optimization, load balancing, rate-scaling",

author = "Daan Rutten and Martin Zubeldia and Debankur Mukherjee",

note = "Publisher Copyright: {\textcopyright} 2023 Copyright is held by the owner/author(s).",

year = "2023",

month = oct,

day = "2",

doi = "10.1145/3626570.3626579",

language = "English (US)",

volume = "51",

pages = "21--23",

journal = "Performance Evaluation Review",

issn = "0163-5999",

publisher = "Association for Computing Machinery (ACM)",

number = "2",

}

TY - JOUR

T1 - Distributed Rate Scaling in Large-Scale Service Systems

AU - Rutten, Daan

AU - Zubeldia, Martin

AU - Mukherjee, Debankur

PY - 2023/10/2

Y1 - 2023/10/2

N2 - We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.

AB - We consider a large-scale parallel-server system, where each server dynamically chooses its processing speed in a completely distributed fashion. The goal is to minimize the global cost that is the sum of the average cost of maintaining the respective processing speeds of all servers and a certain non-decreasing function of the sojourn time of tasks. The key challenges arise from the facts that the arrival rate of tasks is unknown and that there is no centralized control or communication among the servers. Using insights from stochastic approximation, we develop a novel rate-scaling algorithm and prove that the cost of the processing rates under our algorithm converges to the globally optimum value as the system size becomes large. En route, we also analyze the performance of a fully heterogeneous parallel-server system (i.e, where each server has a different processing speed), which might be of independent interest.

KW - distributed optimization

KW - load balancing

KW - rate-scaling

UR - http://www.scopus.com/inward/record.url?scp=85173612488&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85173612488&partnerID=8YFLogxK

U2 - 10.1145/3626570.3626579

DO - 10.1145/3626570.3626579

M3 - Article

AN - SCOPUS:85173612488

SN - 0163-5999

VL - 51

SP - 21

EP - 23

JO - Performance Evaluation Review

JF - Performance Evaluation Review

IS - 2

ER -

Distributed Rate Scaling in Large-Scale Service Systems

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this