Bug #58940

closed

src/osd/PrimaryLogPG.cc: 4284: ceph_abort_msg("out of order op")

Added by Venky Shankar about 1 year ago. Updated 4 months ago.

Status: Resolved
Priority: Urgent
Category: -
Target version: -
% Done: 0%
Source:
Tags: backport_processed
Backport: reef
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

This looks similar to https://tracker.ceph.com/issues/48793, but since that tracker is a couple of years old, I am opening a fresh one.

I'm seeing the following OSD crash in a job from the fs suite: https://pulpito.ceph.com/vshankar-2023-03-08_15:12:36-fs-wip-vshankar-testing-20230308.112059-testing-default-smithi/7197018/

2023-03-08T15:46:23.413+0000 7f0d2c737700 -1 /home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/18.0.0-2861-ga2b8f27f/rpm/el8/BUILD/ceph-18.0.0-2861-ga2b8f27f/src/osd/PrimaryLogPG.cc: In function 'void PrimaryLogPG::execute_ctx(PrimaryLogPG::OpContext*)' thread 7f0d2c737700 time 2023-03-08T15:46:23.411114+0000
/home/jenkins-build/build/workspace/ceph-dev-new-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/18.0.0-2861-ga2b8f27f/rpm/el8/BUILD/ceph-18.0.0-2861-ga2b8f27f/src/osd/PrimaryLogPG.cc: 4284: ceph_abort_msg("out of order op")

ceph version 18.0.0-2861-ga2b8f27f (a2b8f27f8eef4b6c46c66813a64c178fb019988a) reef (dev)
1: (ceph::__ceph_abort(char const*, int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&)+0xd7) [0x55a3fdaddc85]
2: (PrimaryLogPG::execute_ctx(PrimaryLogPG::OpContext*)+0x13f4) [0x55a3fdd96694]
3: (PrimaryLogPG::do_op(boost::intrusive_ptr<OpRequest>&)+0x281e) [0x55a3fdd9cd0e]
4: (OSD::dequeue_op(boost::intrusive_ptr<PG>, boost::intrusive_ptr<OpRequest>, ThreadPool::TPHandle&)+0x1c6) [0x55a3fdbec936]
5: (ceph::osd::scheduler::PGOpItem::run(OSD*, OSDShard*, boost::intrusive_ptr<PG>&, ThreadPool::TPHandle&)+0x69) [0x55a3fdf026d9]
6: (OSD::ShardedOpWQ::_process(unsigned int, ceph::heartbeat_handle_d*)+0x115f) [0x55a3fdc01e6f]
7: (ShardedThreadPool::shardedthreadpool_worker(unsigned int)+0x435) [0x55a3fe2b3e65]
8: (ShardedThreadPool::WorkThreadSharded::entry()+0x14) [0x55a3fe2b7064]
9: /lib64/libpthread.so.0(+0x81ca) [0x7f0d554a31ca]
10: clone()
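
For readers unfamiliar with this abort, here is a rough sketch of the kind of ordering check that could produce an "out of order op" failure. It assumes the OSD remembers, per object and per client, the last transaction id (tid) it saw and treats a lower tid as fatal. That is an assumption about the intent of the check, not a copy of the code in PrimaryLogPG.cc. The class `OpOrderTracker` and its method `observe` are hypothetical names made up for illustration.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <utility>

// Hypothetical stand-in for a per-object, per-client "last seen tid" table.
// None of these names come from the Ceph tree.
class OpOrderTracker {
public:
  // Returns true when the op is in order (its tid is not lower than the last
  // tid recorded for this client on this object) and records the new tid.
  // Returns false for the case the sketch assumes the OSD treats as fatal.
  bool observe(const std::string& object, std::uint64_t client,
               std::uint64_t tid) {
    auto key = std::make_pair(object, client);
    auto it = last_tid_.find(key);
    if (it == last_tid_.end()) {
      last_tid_.emplace(key, tid);  // first op from this client on this object
      return true;
    }
    if (it->second > tid) {
      return false;  // a higher tid was already recorded: out of order
    }
    it->second = tid;
    return true;
  }

private:
  std::map<std::pair<std::string, std::uint64_t>, std::uint64_t> last_tid_;
};
```

In this sketch a resent op with the same tid is accepted, and ordering is tracked independently for each (object, client) pair, so ops from different clients never conflict with each other.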

Related issues 1 (0 open, 1 closed)

Copied to RADOS - Backport #61303: reef: src/osd/PrimaryLogPG.cc: 4284: ceph_abort_msg("out of order op") (Resolved, Aishwarya Mathuria)