Improving data integrity in linux software RAID with protection information (T10-PI)

Baoquan Zhang, Raghunath Raja Chandrasekar, Alireza Haghdoost, Lance Evans, David Du

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

The T10 DIF (Data Integrity Field) and DIX (Data Integrity Extension) specifications provide mechanisms to guarantee end-to-end data integrity and protection in the face of silent data corruption (SDC) in modern storage systems. However, the Multiple Devices (MD) software RAID driver in Linux does not fully leverage these capabilities to provide such end-to-end guarantees with widely used RAID modes such as 5 and 6, thereby causing an 'integrity gap' in the Linux I/O stack. This paper describes the design and performance characteristics of a DIX-aware MD module that plugs this integrity gap with minimal overhead to client applications. A PI (Protection Information) operator is added to MD to handle PI-related operations, and dedicated PI buffers are allocated and managed in the stripe structures of the MD RAID-5/6 personality to generate, store, and verify the PI. This allows seamless exchange of PI among end applications running in user mode, file systems, the Linux block layer, and PI-capable HBAs and drives. Our evaluations show that the DIX-aware MD module can detect SDC with a tolerable performance penalty.
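To make the "generate, store, and verify" steps concrete, the following is a minimal user-space sketch of a T10 PI tuple for one sector. It is not the paper's MD code. It assumes Type 1 protection, where each sector carries an 8-byte big-endian tuple: a 16-bit guard tag (CRC-16 with polynomial 0x8BB7 over the sector data), a 16-bit application tag, and a 32-bit reference tag holding the low 32 bits of the LBA. The function names `crc16_t10dif`, `generate_pi`, and `verify_pi` are hypothetical and chosen only for illustration.

```python
import struct

T10_DIF_POLY = 0x8BB7  # generator polynomial of the T10 DIF guard CRC


def crc16_t10dif(data: bytes) -> int:
    """Bitwise CRC-16/T10-DIF: initial value 0, no reflection, no final XOR."""
    crc = 0
    for byte in data:
        crc ^= byte << 8
        for _ in range(8):
            if crc & 0x8000:
                crc = ((crc << 1) ^ T10_DIF_POLY) & 0xFFFF
            else:
                crc = (crc << 1) & 0xFFFF
    return crc


def generate_pi(sector: bytes, lba: int, app_tag: int = 0) -> bytes:
    """Build the 8-byte PI tuple (guard, application tag, reference tag)."""
    guard = crc16_t10dif(sector)
    ref_tag = lba & 0xFFFFFFFF  # Type 1 assumption: low 32 bits of the LBA
    return struct.pack(">HHI", guard, app_tag, ref_tag)


def verify_pi(sector: bytes, lba: int, pi: bytes) -> bool:
    """Recompute the guard and check it and the reference tag against the PI."""
    guard, _app_tag, ref_tag = struct.unpack(">HHI", pi)
    return guard == crc16_t10dif(sector) and ref_tag == (lba & 0xFFFFFFFF)


if __name__ == "__main__":
    sector = bytes(range(256)) * 2           # one 512-byte sector
    pi = generate_pi(sector, lba=42)
    print(verify_pi(sector, 42, pi))         # intact sector, matching LBA

    corrupted = bytearray(sector)
    corrupted[100] ^= 0x01                   # simulate a silent single-bit flip
    print(verify_pi(bytes(corrupted), 42, pi))
```

In this sketch a flipped bit changes the recomputed guard tag, and a sector returned from the wrong address fails the reference tag check, which is the kind of silent corruption the abstract refers to. A kernel implementation would operate on block-layer integrity buffers rather than Python byte strings.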

Original language: English (US)
Title of host publication: Proceedings - 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2018
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 609-615
Number of pages: 7
ISBN (Electronic): 9781538658154
State: Published - Jul 13 2018
Event: 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2018 - Washington, United States
Duration: May 1 2018 to May 4 2018

Other

Other: 18th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGRID 2018
Country/Territory: United States
City: Washington
Period: 5/1/18 to 5/4/18

Bibliographical note

Funding Information:
This work is partially supported by a grant from CRAY and NSF awards 1305237, 1421913, 1439622, and 1525617. The authors also express their gratitude to Martin K. Petersen from Oracle, Chaitra Basappa from Broadcom, and others who offered significant help during the experimental setup.

Publisher Copyright:
© 2018 IEEE.

Keywords

  • Linux kernel
  • Software RAID
  • T10 PI
