Project

General

Profile

Actions

Bug #1368

closed

mds crash after blogbench on cfuse

Added by Josh Durgin over 12 years ago. Updated over 7 years ago.

Status:
Can't reproduce
Priority:
Normal
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
Labels (FS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

This is from teuthology run 216 (logs in teuthology:~teuthworker/archive/full_suite_coverage_20110805/216/).
The relevant nodes, sepia{32,33,37} are still locked.
The backtrace is:

2011-08-05T13:42:22.404 INFO:teuthology.task.ceph.mds.0.err:*** Caught signal (Segmentation fault) **
2011-08-05T13:42:22.404 INFO:teuthology.task.ceph.mds.0.err: in thread 0x7f177ff15700
2011-08-05T13:42:22.405 INFO:teuthology.task.ceph.mds.0.err: ceph version 0.32-159-g66c3d8f (commit:66c3d8ff60ca585b97540daee942e2c5c6e5538f)
2011-08-05T13:42:22.406 INFO:teuthology.task.ceph.mds.0.err: 1: /tmp/cephtest/binary/usr/local/bin/cmds() [0x8cf1a4]
2011-08-05T13:42:22.406 INFO:teuthology.task.ceph.mds.0.err: 2: (()+0xfb40) [0x7f178378db40]
2011-08-05T13:42:22.406 INFO:teuthology.task.ceph.mds.0.err: 3: (Locker::scatter_writebehind_finish(ScatterLock*, Mutation*)+0x43c) [0x68402c]
2011-08-05T13:42:22.406 INFO:teuthology.task.ceph.mds.0.err: 4: (Locker::C_Locker_ScatterWB::finish(int)+0x1d) [0x69924d]
2011-08-05T13:42:22.406 INFO:teuthology.task.ceph.mds.0.err: 5: (Context::complete(int)+0x12) [0x4973e2]
2011-08-05T13:42:22.407 INFO:teuthology.task.ceph.mds.0.err: 6: (finish_contexts(CephContext*, std::list<Context*, std::allocator<Context*> >&, int)+0x12d) [0x63c8ed]
2011-08-05T13:42:22.407 INFO:teuthology.task.ceph.mds.0.err: 7: (Journaler::_finish_flush(int, unsigned long, utime_t)+0x206) [0x7d0f36]
2011-08-05T13:42:22.407 INFO:teuthology.task.ceph.mds.0.err: 8: (Journaler::C_Flush::finish(int)+0x1d) [0x7dfa0d]
2011-08-05T13:42:22.407 INFO:teuthology.task.ceph.mds.0.err: 9: (Objecter::handle_osd_op_reply(MOSDOpReply*)+0xd72) [0x7a4ef2]
2011-08-05T13:42:22.407 INFO:teuthology.task.ceph.mds.0.err: 10: (MDS::handle_core_message(Message*)+0xecf) [0x4bfebf]
2011-08-05T13:42:22.408 INFO:teuthology.task.ceph.mds.0.err: 11: (MDS::_dispatch(Message*)+0x3c) [0x4c001c]
2011-08-05T13:42:22.408 INFO:teuthology.task.ceph.mds.0.err: 12: (MDS::ms_dispatch(Message*)+0xa1) [0x4c26b1]
2011-08-05T13:42:22.408 INFO:teuthology.task.ceph.mds.0.err: 13: (SimpleMessenger::dispatch_entry()+0x9d2) [0x81b7b2]
2011-08-05T13:42:22.408 INFO:teuthology.task.ceph.mds.0.err: 14: (SimpleMessenger::DispatchThread::entry()+0x2c) [0x48e7fc]
2011-08-05T13:42:22.408 INFO:teuthology.task.ceph.mds.0.err: 15: (Thread::_entry_func(void*)+0x12) [0x808e52]
2011-08-05T13:42:22.409 INFO:teuthology.task.ceph.mds.0.err: 16: (()+0x7971) [0x7f1783785971]
2011-08-05T13:42:22.409 INFO:teuthology.task.ceph.mds.0.err: 17: (clone()+0x6d) [0x7f178221992d]

Actions #1

Updated by Sage Weil over 12 years ago

  • Target version set to v0.34
  • Translation missing: en.field_position set to 796
Actions #2

Updated by Sage Weil over 12 years ago

  • Translation missing: en.field_position deleted (802)
  • Translation missing: en.field_position set to 33
Actions #3

Updated by Sage Weil over 12 years ago

  • Assignee set to Sage Weil
Actions #4

Updated by Sage Weil over 12 years ago

The crash was on shutdown. Have core file but gitbuilder binaries were expired.

Running in a loop to reproduce.

Actions #5

Updated by Sage Weil over 12 years ago

unlocked the nodes

Actions #6

Updated by Sage Weil over 12 years ago

  • Target version changed from v0.34 to v0.35
Actions #7

Updated by Sage Weil over 12 years ago

  • Status changed from New to Can't reproduce
Actions #8

Updated by John Spray over 7 years ago

  • Project changed from Ceph to CephFS
  • Category deleted (1)
  • Target version deleted (v0.35)

Bulk updating project=ceph category=mds bugs so that I can remove the MDS category from the Ceph project to avoid confusion.

Actions

Also available in: Atom PDF