Model-agnostic Methods for Text Classification with Inherent Noise

Kshitij Tayal, Rahul Ghosh, Vipin Kumar

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution


Abstract

Text classification is a fundamental problem, and deep neural networks (DNNs) have recently shown promising results on many natural language tasks. However, their human-level performance relies on high-quality annotations, which are time-consuming and expensive to collect. As we move towards large, inexpensive datasets, the inherent label noise degrades the generalization of DNNs. While most machine learning literature focuses on building complex networks to handle noise, in this work we evaluate model-agnostic methods for handling inherent noise in large-scale text classification that can be easily incorporated into existing machine learning workflows with minimal disruption. Specifically, we conduct a point-by-point comparative study of several noise-robust methods on three datasets, encompassing three popular classification models. To our knowledge, this is the first such comprehensive study in text classification covering popular models and model-agnostic loss methods. We describe our findings and demonstrate the application of our approach, which outperformed baselines by up to 10% in classification accuracy while requiring no network modifications. Code for this paper is hosted at www.kshitijtayal.com/code/model-agnostic-methods.
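The abstract's central idea is that a noise-robust loss can replace standard cross-entropy without any change to the network. As an illustration (not necessarily one of the specific methods evaluated in the paper), the sketch below implements the Generalized Cross Entropy loss of Zhang and Sabuncu (2018), a well-known model-agnostic loss of this kind; the function name and example values are chosen here for demonstration only:

```python
import numpy as np

def gce_loss(probs, labels, q=0.7):
    """Generalized Cross Entropy loss: L_q = (1 - p_y^q) / q.
    Interpolates between cross-entropy (as q -> 0) and the
    noise-robust mean absolute error (q = 1), so it can be
    swapped in for CE with no changes to the model itself."""
    # Probability the model assigned to each sample's true class.
    p_y = probs[np.arange(len(labels)), labels]
    return np.mean((1.0 - p_y ** q) / q)

# Toy example: two samples, three classes, softmax outputs.
probs = np.array([[0.7, 0.2, 0.1],
                  [0.1, 0.8, 0.1]])
labels = np.array([0, 1])
loss = gce_loss(probs, labels)
```

Because the loss only consumes the model's output probabilities and the labels, the same function applies unchanged to any of the classifiers compared in the study.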

Original language: English (US)
Title of host publication: COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Industry Track
Editors: Ann Clifton, Courtney Napoles
Publisher: Association for Computational Linguistics (ACL)
Pages: 202-213
Number of pages: 12
ISBN (Electronic): 9781952148293
State: Published - 2020
Event: 28th International Conference on Computational Linguistics, COLING 2020 - Virtual, Online, Spain
Duration: Dec 12 2020 → …

Publication series

Name: COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Industry Track

Conference

Conference: 28th International Conference on Computational Linguistics, COLING 2020
Country/Territory: Spain
City: Virtual, Online
Period: 12/12/20 → …

Bibliographical note

Funding Information:
This research was supported by the National Science Foundation under grants 1838159 and 1739191. Access to computing facilities was provided by the University of Minnesota Supercomputing Institute.

Publisher Copyright:
© COLING 2020 - 28th International Conference on Computational Linguistics, Proceedings of the Industry Track.
