Abstract
We propose a simple Gaussian mixture model for data generation that is consistent with Feldman's long-tail theory (2020). We show that in this model a linear classifier cannot reduce the generalization error below a certain level, whereas a nonlinear classifier with memorization capacity can. This confirms that for long-tailed distributions, rare training examples must be considered for optimal generalization to new data. Finally, we show that the performance gap between linear and nonlinear models narrows as the tail of the subpopulation frequency distribution becomes shorter, as confirmed by experiments on synthetic and real data.
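The abstract's claim can be illustrated with a small toy simulation. The sketch below is not the paper's exact construction: the cluster layout, Zipf-like frequencies, the 1-nearest-neighbour "memorizer", and the threshold "linear" rule are all illustrative assumptions. Subpopulations are Gaussian blobs along a line with alternating labels, so no single linear boundary fits every blob, while a memorizing classifier handles rare blobs it has seen in training.

```python
import random

random.seed(0)

# Illustrative assumption: K subpopulations with Zipf-like frequencies
# p_k ∝ 1/(k+1); each subpopulation is a 2D Gaussian blob.
K = 8
weights = [1.0 / (k + 1) for k in range(K)]
total = sum(weights)
probs = [w / total for w in weights]
centers = [(3.0 * k, 0.0) for k in range(K)]  # blobs along a line
labels = [k % 2 for k in range(K)]            # alternating class labels

def sample(n):
    """Draw n labelled points from the long-tailed Gaussian mixture."""
    data = []
    for _ in range(n):
        r, acc, k = random.random(), 0.0, K - 1
        for i, p in enumerate(probs):
            acc += p
            if r < acc:
                k = i
                break
        cx, cy = centers[k]
        point = (cx + random.gauss(0, 0.3), cy + random.gauss(0, 0.3))
        data.append((point, labels[k]))
    return data

train, test = sample(500), sample(500)

def nn_predict(x):
    # memorizing classifier: label of the nearest training point
    return min(train, key=lambda t: (t[0][0] - x[0]) ** 2
                                    + (t[0][1] - x[1]) ** 2)[1]

def linear_predict(x):
    # a single-threshold linear rule; with alternating labels it must
    # misclassify some subpopulations entirely
    return 1 if x[0] > 1.5 else 0

acc_nn = sum(nn_predict(x) == y for x, y in test) / len(test)
acc_lin = sum(linear_predict(x) == y for x, y in test) / len(test)
```

In this toy setup the memorizing 1-NN classifier approaches perfect test accuracy, while the linear rule is capped well below it because it writes off entire rare subpopulations, mirroring the paper's qualitative claim.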
Original language | English (US) |
---|---|
Title of host publication | ECAI 2023 - 26th European Conference on Artificial Intelligence, including 12th Conference on Prestigious Applications of Intelligent Systems, PAIS 2023 - Proceedings |
Editors | Kobi Gal, Ann Nowe, Grzegorz J. Nalepa, Roy Fairstein, Roxana Radulescu |
Publisher | IOS Press BV |
Pages | 109-116 |
Number of pages | 8 |
ISBN (Electronic) | 9781643684369 |
DOIs | |
State | Published - Sep 28 2023 |
Externally published | Yes |
Event | 26th European Conference on Artificial Intelligence, ECAI 2023 - Krakow, Poland Duration: Sep 30 2023 → Oct 4 2023 |
Publication series
Name | Frontiers in Artificial Intelligence and Applications |
---|---|
Volume | 372 |
ISSN (Print) | 0922-6389 |
ISSN (Electronic) | 1879-8314 |
Conference
Conference | 26th European Conference on Artificial Intelligence, ECAI 2023 |
---|---|
Country/Territory | Poland |
City | Krakow |
Period | 9/30/23 → 10/4/23 |
Bibliographical note
Publisher Copyright: © 2023 The Authors.