Bug #8464
closed
Added by Zack Cerza almost 10 years ago.
Updated almost 10 years ago.
Description
http://pulpito.ceph.com/teuthology-2014-05-27_23:09:10-krbd-firefly-testing-basic-plana/277770/
ubuntu 16681 0.0 0.0 0 0 ? Zl May28 0:28 [ffsb] <defunct>
ubuntu@plana90:~$ sudo ls /proc/16681/fd
ubuntu@plana90:~$ sudo cat /proc/16681/stat
16681 (ffsb) Z 1 15015 15013 0 -1 4244492 455 176 0 0 160 2678 0 0 20 0 31 0 466989 0 0 18446744073709551615 0 0 0 0 0 0 0 0 0 18446744073709551615 0 0 17 2 0 0 506 0 0 0 0 0 0 0 0 0 15
ubuntu@plana90:~$ sudo ls /proc/15015
ls: cannot access /proc/15015: No such file or directory
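PID 15015 has no /proc/ entry. (Going by the stat line above, 15015 is the process group ID; the parent PID field is 1, so the process has already been reparented to init.) I don't know why the ffsb.sh process is not terminating. Maybe it has to do with the crash (dmesg output attached).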
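For reference, here is a minimal sketch of reading the leading fields of that stat line by position, assuming the field order documented in proc(5) (pid, comm, state, ppid, pgrp, session, ..., with num_threads as field 20). The names `StatHead` and `parse_stat` are hypothetical helpers, not part of any Ceph or teuthology tooling.

```python
from typing import NamedTuple


class StatHead(NamedTuple):
    pid: int
    comm: str
    state: str
    ppid: int
    pgrp: int
    session: int
    num_threads: int


def parse_stat(line: str) -> StatHead:
    """Parse selected fields of a /proc/<pid>/stat line (layout assumed per proc(5))."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split around the first '(' and the last ')'.
    lparen = line.index("(")
    rparen = line.rindex(")")
    pid = int(line[:lparen])
    comm = line[lparen + 1:rparen]
    rest = line[rparen + 1:].split()
    # rest[0] is field 3 (state), so field N lives at rest[N - 3].
    return StatHead(
        pid=pid,
        comm=comm,
        state=rest[0],
        ppid=int(rest[1]),
        pgrp=int(rest[2]),
        session=int(rest[3]),
        num_threads=int(rest[17]),  # field 20
    )


# The stat line quoted in the description, truncated after field 22 (starttime).
SAMPLE = (
    "16681 (ffsb) Z 1 15015 15013 0 -1 4244492 455 176 0 0 "
    "160 2678 0 0 20 0 31 0 466989"
)

if __name__ == "__main__":
    print(parse_stat(SAMPLE))
```

Applied to the line above, this reads state Z, PPID 1, process group 15015, session 15013, and 31 threads. A zombie thread-group leader that still has live threads would be consistent with the `Zl` state in the ps output, since the leader cannot be reaped until the remaining threads exit.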
Files
- Subject changed from Job hung during workunit suites/ffsb.sh with crashed to Job hung during workunit suites/ffsb.sh with crashed ceph process
Yeah, looks like it's just because the block I/O is hung. Also, this:
12432 ? Ss 0:00 cron
18720 ? S  0:00  \_ CRON
18722 ? Ss 0:00      \_ /bin/sh -c test -x /usr/sbin/anacron || ( cd / && run-parts --report /etc/cron.daily )
18723 ? S  0:00          \_ run-parts --report /etc/cron.daily
18829 ? S  0:00              \_ /bin/bash /etc/cron.daily/mlocate
18835 ? D  0:00                  \_ /usr/bin/updatedb.mlocate
which might be worth disabling somewhere for all teuthology runs...
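To pick out tasks stuck in uninterruptible sleep (state `D`) from a listing like the one above, something along these lines could work. It is only a sketch: it assumes the five-column layout shown (PID, TTY, STAT, TIME, COMMAND), and `uninterruptible` is a hypothetical helper name.

```python
def uninterruptible(ps_lines):
    """Return (pid, command) for rows whose STAT column starts with 'D'."""
    hits = []
    for line in ps_lines:
        # Split into at most five columns so the command keeps its spaces.
        parts = line.split(None, 4)
        if len(parts) < 5 or not parts[0].isdigit():
            continue  # skip headers and malformed rows
        pid, _tty, stat, _time, command = parts
        # Drop the forest-view prefix that ps prints in front of children.
        if command.startswith("\\_ "):
            command = command[3:].lstrip()
        if stat.startswith("D"):
            hits.append((int(pid), command))
    return hits


# Rows taken from the listing in this comment.
SAMPLE_PS = [
    r"12432 ? Ss 0:00 cron",
    r"18720 ? S 0:00 \_ CRON",
    r"18723 ? S 0:00 \_ run-parts --report /etc/cron.daily",
    r"18829 ? S 0:00 \_ /bin/bash /etc/cron.daily/mlocate",
    r"18835 ? D 0:00 \_ /usr/bin/updatedb.mlocate",
]

if __name__ == "__main__":
    for pid, command in uninterruptible(SAMPLE_PS):
        print(pid, command)
```

On the sample rows this reports only PID 18835, the updatedb.mlocate job started from cron.daily.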
- Project changed from teuthology to rbd
- Subject changed from Job hung during workunit suites/ffsb.sh with crashed ceph process to krbd: deadlock
- Assignee set to Ilya Dryomov
- Priority changed from Normal to Urgent
- Source changed from other to Q/A
I haven't seen this on nightly runs (the only place it seemed to pop up) in a while.
- Project changed from rbd to Linux kernel client
- Status changed from New to Resolved
OK, thanks everybody.
rbd: rework rbd_request_fn()