Project

General

Profile

Actions

Bug #10778

closed

Rollback+ReplicationPG -> Segmentation fault

Added by Irek Fasikhov about 9 years ago. Updated about 9 years ago.

Status:
Can't reproduce
Priority:
Urgent
Assignee:
-
Category:
OSD
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Morning found that some OSD dropped out of Tier Cache Pool. Maybe it's a coincidence, but at this point was rollback.

2015-02-05 23:23:18.231723 7fd747ff1700 -1 *** Caught signal (Segmentation fault) **
 in thread 7fd747ff1700

 ceph version 0.80.8 (69eaad7f8308f21573c604f121956e64679a52a7)
 1: /usr/bin/ceph-osd() [0x9bde51]
 2: (()+0xf710) [0x7fd766f97710]
 3: (std::_Rb_tree_decrement(std::_Rb_tree_node_base*)+0xa) [0x7fd7666c1eca]
 4: (ReplicatedPG::make_writeable(ReplicatedPG::OpContext*)+0x14c) [0x87cd5c]
 5: (ReplicatedPG::prepare_transaction(ReplicatedPG::OpContext*)+0x1db) [0x89d29b]
 6: (ReplicatedPG::execute_ctx(ReplicatedPG::OpContext*)+0xcd4) [0x89e0f4]
 7: (ReplicatedPG::do_op(std::tr1::shared_ptr<OpRequest>)+0x2ca5) [0x8a2a55]
 8: (ReplicatedPG::do_request(std::tr1::shared_ptr<OpRequest>, ThreadPool::TPHandle&)+0x5b1) [0x832251]
 9: (OSD::dequeue_op(boost::intrusive_ptr<PG>, std::tr1::shared_ptr<OpRequest>, ThreadPool::TPHandle&)+0x37c) [0x61344c]
 10: (OSD::OpWQ::_process(boost::intrusive_ptr<PG>, ThreadPool::TPHandle&)+0x63d) [0x6472ad]
 11: (ThreadPool::WorkQueueVal<std::pair<boost::intrusive_ptr<PG>, std::tr1::shared_ptr<OpRequest> >, boost::intrusive_ptr<PG> >::_void_process(void*, ThreadPool::TPHandle&)+0xae) [0x67dcde]
 12: (ThreadPool::worker(ThreadPool::WorkThread*)+0x551) [0xa2a181]
 13: (ThreadPool::WorkThread::entry()+0x10) [0xa2d260]
 14: (()+0x79d1) [0x7fd766f8f9d1]
 15: (clone()+0x6d) [0x7fd765f088fd]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Files

logs.tar.gz (438 KB) logs.tar.gz Irek Fasikhov, 02/06/2015 08:02 AM
Actions #1

Updated by Irek Fasikhov about 9 years ago

Attaching logs

Actions #2

Updated by Greg Farnum about 9 years ago

  • Priority changed from Immediate to Normal

What do you mean, "at this point was rollback"?

Actions #3

Updated by Irek Fasikhov about 9 years ago

Sorry.

Greg,At that time was a snapshot rollback rbd

Actions #4

Updated by Irek Fasikhov about 9 years ago

Greg, I suppose it's a patch to solve the problem?

https://github.com/ceph/ceph/commit/1228658871e53e350bdab3e72cdefd1caf33c291

Actions #5

Updated by Greg Farnum about 9 years ago

I don't know what this particular bug might mean, but that commit is included in v0.80.8, which you appear to be running. Sorry I can't be more help.

Actions #6

Updated by Samuel Just about 9 years ago

  • Priority changed from Normal to Urgent
Actions #7

Updated by Samuel Just about 9 years ago

  • Status changed from New to Can't reproduce
Actions

Also available in: Atom PDF