Enabling rapid classification of social media communications during crises

Muhammad Imran, Prasenjit Mitra, Jaideep Srivastava

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

The use of social media platforms such as Twitter by affected people during crises is considered a vital source of information for crisis response. However, rapid crisis response requires real-time analysis of online information. When a disaster happens, among other data processing techniques, supervised machine learning can help classify online information in real-time. However, scarcity of labeled data causes poor performance in machine training. Often labeled data from past event is available. Can past labeled data be reused to train classifiers? We study the usefulness of labeled data of past events. We observe the performance of our classifiers trained using different combinations of training sets obtained from past disasters. Moreover, we propose two approaches (target labeling and active learning) to boost classification performance of a learning scheme. We perform extensive experimentation on real crisis datasets and show the utility of past-labeled data to train machine learning classifiers to process sudden-onset crisis-related data in real-time.

Original languageEnglish (US)
Title of host publicationCognitive Analytics
Subtitle of host publicationConcepts, Methodologies, Tools, and Applications
PublisherIGI Global
Pages1272-1289
Number of pages18
ISBN (Electronic)9781799824619
ISBN (Print)9781799824602
DOIs
StatePublished - Mar 6 2020
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2020, IGI Global.

Fingerprint

Dive into the research topics of 'Enabling rapid classification of social media communications during crises'. Together they form a unique fingerprint.

Cite this