Actions
Bug #12437
closedMutex Assert from PipeConnection::try_get_pipe
Status:
Resolved
Priority:
Urgent
Assignee:
David Zafman
Category:
-
Target version:
-
% Done:
0%
Source:
other
Tags:
Backport:
firefly hammer
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
This occured during a trial run of cbt's ceph_test_rados benchmark while OSD 3 was marked out/down or up/in in a loop. State transitions occured when "ceph health" no longer reported degraded, peering, recovery_wait, stuck, inactive, unclean, or recovery warnings.
0> 2015-07-22 13:36:47.217698 7fed761ba700 -1 common/Mutex.cc: In function 'void Mutex::Lock(bool)' thread 7fed761ba700 time 2015-07-22 13:36:47.213562 common/Mutex.cc: 95: FAILED assert(r == 0) ceph version 0.94.2-108-g45beb86 (45beb86423c3bd74dbafd36c6822e71ad9680e17) 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x78) [0xbc9578] 2: (Mutex::Lock(bool)+0x105) [0xb79ff5] 3: (PipeConnection::try_get_pipe(Pipe**)+0x18) [0xca9828] 4: (SimpleMessenger::submit_message(Message*, PipeConnection*, entity_addr_t const&, int, bool)+0x66) [0xba5a96] 5: (SimpleMessenger::submit_message(Message*, PipeConnection*, entity_addr_t const&, int, bool)+0x427) [0xba5e57] 6: (SimpleMessenger::_send_message(Message*, Connection*)+0x97) [0xba7977] 7: (OSDService::send_message_osd_cluster(int, Message*, unsigned int)+0x1fe) [0x6aca9e] 8: (PG::share_pg_info()+0x4d1) [0x7ed341] 9: (ReplicatedPG::snap_trimmer()+0x603) [0x84f953] 10: (OSD::SnapTrimWQ::_process(PG*)+0x1a) [0x6d709a] 11: (ThreadPool::worker(ThreadPool::WorkThread*)+0xa56) [0xbba226] 12: (ThreadPool::WorkThread::entry()+0x10) [0xbbb2d0] 13: (()+0x7ee5) [0x7fed95ee7ee5] 14: (clone()+0x6d) [0x7fed949c5b8d] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Files
Actions