Bug #11989
CephFS kernel client data corruption (Closed)
Description
Hi. I get random data corruption with the CephFS kernel client. I stream from a non-Ceph machine using "cat <file> | nc -l -p 444 -q" into CephFS using "nc <ip> 444 | pv -r -t -a > /mnt/cephfs/testfile". The file is 30 GB in size and contains ASCII CSV data. When I do this, there is a high chance of file corruption. The corruption manifests as a block of 0x00 bytes with a random size between ~500 and 4000 bytes.
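To locate and characterize a corrupt span, I compare byte-by-byte against a known-good copy kept on a non-Ceph filesystem. A rough sketch (the path /srv/testfile.good is a placeholder for wherever the good copy lives):

# show how many bytes differ and where; cmp -l prints "offset val1 val2"
cmp -l /srv/testfile.good /ceph-kernel/kernel/testfile1 | awk 'NR==1 {first=$1} {last=$1} END {printf "%d differing bytes, span %d-%d\n", NR, first, last}'
# count corrupt bytes that are NOT 0x00 on the CephFS side ($3 is the octal byte value from file 2)
cmp -l /srv/testfile.good /ceph-kernel/kernel/testfile1 | awk '$3+0 != 0' | wc -l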
After testing for 80+ hours, here are the things I know for sure:
- I was able to reproduce this with the giant (0.87.2-1~bpo70+1) and hammer (0.94.1-1~bpo70+1, 0.94.2-1~bpo70+1) Ceph versions.
- Tried kernel versions 4.0.4, 3.16.0-4-amd64 and 3.14.43-031443.
- It also happens if the node writing to CephFS is NOT running any OSDs, but it happens more often when a server that runs OSDs mounts CephFS locally, so it may also be traffic related.
- If I check the file directly after writing, the md5sum matches, which makes me think it was handed over to CephFS correctly. If I then issue "echo 3 > /proc/sys/vm/drop_caches" locally to clear the page cache and run md5sum again, it no longer matches the previous checksum (see the sketch after this list).
- With a 30 GB file, there are typically 1 to 4 corrupt blocks in the file.
- I was also not able to reproduce this with the ceph-fuse client.
- I was not able to reproduce it when copying the file from a local tmpfs location; it seems somehow related to the network traffic.
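To hit the bug repeatedly I run the transfer and the cache-drop check in a loop; a rough sketch with the IP, port and paths from my setup below (the sender side runs "cat <file> | nc -l -p 445 -q" once per iteration):

#!/bin/sh
# repro loop: write via nc, checksum, drop the page cache, checksum again
for i in 1 2 3 4 5; do
    nc 172.16.0.62 445 > /ceph-kernel/kernel/testfile$i
    before=$(md5sum /ceph-kernel/kernel/testfile$i | cut -d' ' -f1)
    sync
    echo 3 > /proc/sys/vm/drop_caches    # force re-reads to come from the OSDs
    after=$(md5sum /ceph-kernel/kernel/testfile$i | cut -d' ' -f1)
    [ "$before" = "$after" ] || echo "run $i: corrupt ($before -> $after)"
done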
mount:
172.16.0.101:6789:/ on /ceph-kernel type ceph (rw,relatime,name=admin,secret=<hidden>,nodcache,nofsc,acl)
root@idbi5:/ceph-kernel/kernel# nc 172.16.0.62 445 | pv -r -a -t > testfile1
0:01:11 [ 432MB/s] [ 432MB/s]
root@idbi5:/ceph-kernel/kernel# md5sum testfile1
801a0ec20f59aa4e4da51a8337cb722f testfile1 #this is the correct checksum
root@idbi5:/ceph-kernel/kernel# echo 3 > /proc/sys/vm/drop_caches
root@idbi5:/ceph-kernel/kernel# md5sum testfile1
8558af60da1f3c062fea18f0ce81ef24 testfile1 #this checksum is wrong
- I have already reinstalled all servers with Debian 7.8 (wheezy).
- Reinstalled Ceph multiple times with the stock configuration and CRUSH map.
- I also removed every server from the cluster to exclude hardware-related problems like bad RAM.
- I have a 5-node cluster with 20 OSDs on XFS: 10 OSDs on HDDs and 10 on SSDs. I tested this in HDD-only and SSD-only CephFS pools.
- I issued a deep scrub on all PGs; no errors showed up (see the loop sketched after this list).
- Hardware used: Intel(R) Xeon(R) CPU E5-2650 v2 @ 2.60GHz, 128 GB ECC RAM, 10 GBit Ethernet.
Is any more information required? Thank you.
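The deep scrub was issued with a loop over all PG IDs; a sketch, assuming PG IDs like "1.2f" in the first column of "ceph pg dump pgs_brief":

# deep-scrub every PG in the cluster
ceph pg dump pgs_brief 2>/dev/null | awk '$1 ~ /^[0-9]+\.[0-9a-f]+$/ {print $1}' | while read pg; do
    ceph pg deep-scrub "$pg"
done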