Project

General

Profile

Actions

Bug #10908

closed

"Crash: timed out waiting for admin_socket to appear after osd.11 restart" in upgrade:giant-x-wip-sam-testing-distro-basic-mult

Added by Yuri Weinstein about 9 years ago. Updated about 9 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Run: http://pulpito.front.sepia.ceph.com/teuthology-2015-02-17_12:36:03-upgrade:giant-x-wip-sam-testing-distro-basic-multi/
Job: ['762875']
Logs: http://qa-proxy.ceph.com/teuthology/teuthology-2015-02-17_12:36:03-upgrade:giant-x-wip-sam-testing-distro-basic-multi/762875/

Crash: timed out waiting for admin_socket to appear after osd.11 restart
ceph version 0.87-161-g4178e32 (4178e32dd085adeead84fb168ab8a8a121256259)
 1: ceph-osd() [0x9b75e5]
 2: (()+0xfcb0) [0x7fa2e443ccb0]
 3: (gsignal()+0x35) [0x7fa2e2d28425]
 4: (abort()+0x17b) [0x7fa2e2d2bb8b]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7fa2e367b69d]
 6: (()+0xb5846) [0x7fa2e3679846]
 7: (()+0xb5873) [0x7fa2e3679873]
 8: (()+0xb596e) [0x7fa2e367996e]
 9: (ObjectStore::Transaction::decode(ceph::buffer::list::iterator&)+0x219) [0x84ed19]
 10: (ReplicatedBackend::sub_op_modify(std::tr1::shared_ptr<OpRequest>)+0x5ea) [0x7eb83a]
 11: (ReplicatedBackend::handle_message(std::tr1::shared_ptr<OpRequest>)+0x55c) [0x9377bc]
 12: (ReplicatedPG::do_request(std::tr1::shared_ptr<OpRequest>&, ThreadPool::TPHandle&)+0x15a) [0x7ce93a]
 13: (OSD::dequeue_op(boost::intrusive_ptr<PG>, std::tr1::shared_ptr<OpRequest>, ThreadPool::TPHandle&)+0x17f) [0x64e4ef]
 14: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x65f) [0x64ef5f]
 15: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x652) [0xa921a2]
 16: (ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0xa938d0]
 17: (()+0x7e9a) [0x7fa2e4434e9a]
 18: (clone()+0x6d) [0x7fa2e2de63fd]

Related issues 1 (0 open1 closed)

Related to Ceph - Bug #10370: "MaxWhileTries: 'wait_until_healthy'reached maximum tries (150) after waiting for 900 seconds" in upgrade:dumpling-firefly-x:stress-split-next-distro-basic-vps runDuplicate12/18/2014

Actions
Actions #1

Updated by Samuel Just about 9 years ago

  • Status changed from New to 7
  • Assignee set to Samuel Just
Actions #2

Updated by Yuri Weinstein about 9 years ago

Same
Run: http://pulpito.front.sepia.ceph.com/teuthology-2015-02-18_17:05:01-upgrade:giant-x-hammer-distro-basic-multi/
Jobs: ['764940', '764941']
Logs: http://qa-proxy.ceph.com/teuthology/teuthology-2015-02-18_17:05:01-upgrade:giant-x-hammer-distro-basic-multi/764940/

ceph version 0.87-161-g4178e32 (4178e32dd085adeead84fb168ab8a8a121256259)
 1: ceph-osd() [0x9b75e5]
 2: (()+0xfcb0) [0x7f4e60e9dcb0]
 3: (gsignal()+0x35) [0x7f4e5f789425]
 4: (abort()+0x17b) [0x7f4e5f78cb8b]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7f4e600dc69d]
 6: (()+0xb5846) [0x7f4e600da846]
 7: (()+0xb5873) [0x7f4e600da873]
 8: (()+0xb596e) [0x7f4e600da96e]
 9: (ObjectStore::Transaction::decode(ceph::buffer::list::iterator&)+0x219) [0x84ed19]
 10: (ReplicatedBackend::sub_op_modify(std::tr1::shared_ptr<OpRequest>)+0x5ea) [0x7eb83a]
 11: (ReplicatedBackend::handle_message(std::tr1::shared_ptr<OpRequest>)+0x55c) [0x9377bc]
 12: (ReplicatedPG::do_request(std::tr1::shared_ptr<OpRequest>&, ThreadPool::TPHandle&)+0x15a) [0x7ce93a]
 13: (OSD::dequeue_op(boost::intrusive_ptr<PG>, std::tr1::shared_ptr<OpRequest>, ThreadPool::TPHandle&)+0x17f) [0x64e4ef]
 14: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x65f) [0x64ef5f]
 15: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x652) [0xa921a2]
 16: (ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0xa938d0]
 17: (()+0x7e9a) [0x7f4e60e95e9a]
 18: (clone()+0x6d) [0x7f4e5f8473fd]
Actions #3

Updated by Yuri Weinstein about 9 years ago

Run: http://pulpito.ceph.com/teuthology-2015-02-18_17:05:01-upgrade:giant-x-hammer-distro-basic-multi/
Jobs: ['764940', '764941']

ceph version 0.87-161-g4178e32 (4178e32dd085adeead84fb168ab8a8a121256259)
 1: ceph-osd() [0x9b75e5]
 2: (()+0xfcb0) [0x7f4e60e9dcb0]
 3: (gsignal()+0x35) [0x7f4e5f789425]
 4: (abort()+0x17b) [0x7f4e5f78cb8b]
 5: (__gnu_cxx::__verbose_terminate_handler()+0x11d) [0x7f4e600dc69d]
 6: (()+0xb5846) [0x7f4e600da846]
 7: (()+0xb5873) [0x7f4e600da873]
 8: (()+0xb596e) [0x7f4e600da96e]
 9: (ObjectStore::Transaction::decode(ceph::buffer::list::iterator&)+0x219) [0x84ed19]
 10: (ReplicatedBackend::sub_op_modify(std::tr1::shared_ptr<OpRequest>)+0x5ea) [0x7eb83a]
 11: (ReplicatedBackend::handle_message(std::tr1::shared_ptr<OpRequest>)+0x55c) [0x9377bc]
 12: (ReplicatedPG::do_request(std::tr1::shared_ptr<OpRequest>&, ThreadPool::TPHandle&)+0x15a) [0x7ce93a]
 13: (OSD::dequeue_op(boost::intrusive_ptr<PG>, std::tr1::shared_ptr<OpRequest>, ThreadPool::TPHandle&)+0x17f) [0x64e4ef]
 14: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x65f) [0x64ef5f]
 15: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x652) [0xa921a2]
 16: (ShardedThreadPool::WorkThreadSharded::entry()+0x10) [0xa938d0]
 17: (()+0x7e9a) [0x7f4e60e95e9a]
 18: (clone()+0x6d) [0x7f4e5f8473fd]
Actions #4

Updated by Sage Weil about 9 years ago

  • Status changed from 7 to Resolved
Actions

Also available in: Atom PDF