Enabling rapid classification of social media communications during crises

Muhammad Imran; Prasenjit Mitra; Jaideep Srivastava

doi:10.4018/978-1-7998-2460-2.ch064

Research output: Chapter in Book/Report/Conference proceeding › Chapter

Abstract

The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, supervised machine learning, among other data processing techniques, can help classify online information in real time. However, a scarcity of labeled data leads to poor classifier performance. Often, labeled data from past events is available. Can past labeled data be reused to train classifiers? We study the usefulness of labeled data from past events. We observe the performance of our classifiers trained using different combinations of training sets obtained from past disasters. Moreover, we propose two approaches (target labeling and active learning) to boost the classification performance of a learning scheme. We perform extensive experimentation on real crisis datasets and show the utility of past-labeled data for training machine learning classifiers to process sudden-onset crisis-related data in real time.

Original language	English (US)
Title of host publication	Cognitive Analytics
Subtitle of host publication	Concepts, Methodologies, Tools, and Applications
Publisher	IGI Global
Pages	1272-1289
Number of pages	18
ISBN (Electronic)	9781799824619
ISBN (Print)	9781799824602
DOIs	https://doi.org/10.4018/978-1-7998-2460-2.ch064
State	Published - Mar 6 2020
Externally published	Yes

Bibliographical note

Publisher Copyright:
© 2020, IGI Global.

Cite this

@inbook{aee45964b5cb4dc0abe95e75c881864a,
title = "Enabling rapid classification of social media communications during crises",
abstract = "The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, among other data processing techniques, supervised machine learning can help classify online information in real-time. However, scarcity of labeled data causes poor performance in machine training. Often labeled data from past event is available. Can past labeled data be reused to train classifiers? We study the usefulness of labeled data of past events. We observe the performance of our classifiers trained using different combinations of training sets obtained from past disasters. Moreover, we propose two approaches (target labeling and active learning) to boost classification performance of a learning scheme. We perform extensive experimentation on real crisis datasets and show the utility of past-labeled data to train machine learning classifiers to process sudden-onset crisis-related data in real-time.",
author = "Muhammad Imran and Prasenjit Mitra and Jaideep Srivastava",
note = "Publisher Copyright: {\textcopyright} 2020, IGI Global.",
year = "2020",
month = mar,
day = "6",
doi = "10.4018/978-1-7998-2460-2.ch064",
language = "English (US)",
isbn = "9781799824602",
pages = "1272--1289",
booktitle = "Cognitive Analytics",
publisher = "IGI Global",
}

TY - CHAP
T1 - Enabling rapid classification of social media communications during crises
AU - Imran, Muhammad
AU - Mitra, Prasenjit
AU - Srivastava, Jaideep
PY - 2020/3/6
Y1 - 2020/3/6
N2 - The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, among other data processing techniques, supervised machine learning can help classify online information in real-time. However, scarcity of labeled data causes poor performance in machine training. Often labeled data from past event is available. Can past labeled data be reused to train classifiers? We study the usefulness of labeled data of past events. We observe the performance of our classifiers trained using different combinations of training sets obtained from past disasters. Moreover, we propose two approaches (target labeling and active learning) to boost classification performance of a learning scheme. We perform extensive experimentation on real crisis datasets and show the utility of past-labeled data to train machine learning classifiers to process sudden-onset crisis-related data in real-time.
AB - The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, among other data processing techniques, supervised machine learning can help classify online information in real-time. However, scarcity of labeled data causes poor performance in machine training. Often labeled data from past event is available. Can past labeled data be reused to train classifiers? We study the usefulness of labeled data of past events. We observe the performance of our classifiers trained using different combinations of training sets obtained from past disasters. Moreover, we propose two approaches (target labeling and active learning) to boost classification performance of a learning scheme. We perform extensive experimentation on real crisis datasets and show the utility of past-labeled data to train machine learning classifiers to process sudden-onset crisis-related data in real-time.
UR - http://www.scopus.com/inward/record.url?scp=85138381085&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85138381085&partnerID=8YFLogxK
U2 - 10.4018/978-1-7998-2460-2.ch064
DO - 10.4018/978-1-7998-2460-2.ch064
M3 - Chapter
AN - SCOPUS:85138381085
SN - 9781799824602
SP - 1272
EP - 1289
BT - Cognitive Analytics
PB - IGI Global
ER -
