Project

General

Profile

Bug #21754

mds: src/osdc/Journaler.cc: 402: FAILED assert(!r)

Added by Patrick Donnelly almost 2 years ago. Updated 7 months ago.

Status:
Rejected
Priority:
High
Assignee:
-
Category:
Correctness/Safety
Target version:
Start date:
10/10/2017
Due date:
% Done:

0%

Source:
Q/A
Tags:
Backport:
mimic,luminous
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(FS):
MDS
Labels (FS):
Pull request ID:

Description

2017-10-07T02:46:50.607 INFO:tasks.ceph.mds.b.pluto009.stderr:/builddir/build/BUILD/ceph-12.2.1/src/osdc/Journaler.cc: In function 'void Journaler::_finish_reread_head_and_probe(int, C_OnFinisher*)' thread 7f7e0a05a700 time 2017-10-07 06:48:56.751895
2017-10-07T02:46:50.607 INFO:tasks.ceph.mds.b.pluto009.stderr:/builddir/build/BUILD/ceph-12.2.1/src/osdc/Journaler.cc: 402: FAILED assert(!r)
2017-10-07T02:46:50.608 INFO:tasks.ceph.mds.b.pluto009.stderr: ceph version 12.2.1-10.el7cp (5ba1c3fa606d7bf16f72756b0026f04a40297673) luminous (stable)
2017-10-07T02:46:50.609 INFO:tasks.ceph.mds.b.pluto009.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x56380d90b390]
2017-10-07T02:46:50.609 INFO:tasks.ceph.mds.b.pluto009.stderr: 2: (()+0x54636c) [0x56380d87c36c]
2017-10-07T02:46:50.609 INFO:tasks.ceph.mds.b.pluto009.stderr: 3: (Context::complete(int)+0x9) [0x56380d5eafa9]
2017-10-07T02:46:50.609 INFO:tasks.ceph.mds.b.pluto009.stderr: 4: (Journaler::_finish_reread_head(int, ceph::buffer::list&, Context*)+0x297) [0x56380d87ac17]
2017-10-07T02:46:50.609 INFO:tasks.ceph.mds.b.pluto009.stderr: 5: (Context::complete(int)+0x9) [0x56380d5eafa9]
2017-10-07T02:46:50.610 INFO:tasks.ceph.mds.b.pluto009.stderr: 6: (Finisher::finisher_thread_entry()+0x1c5) [0x56380d90a335]
2017-10-07T02:46:50.610 INFO:tasks.ceph.mds.b.pluto009.stderr: 7: (()+0x7e25) [0x7f7e150a5e25]
2017-10-07T02:46:50.610 INFO:tasks.ceph.mds.b.pluto009.stderr: 8: (clone()+0x6d) [0x7f7e1418834d]
2017-10-07T02:46:50.610 INFO:tasks.ceph.mds.b.pluto009.stderr: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

From: https://bugzilla.redhat.com/show_bug.cgi?id=1500501

History

#1 Updated by Zheng Yan almost 2 years ago

    -1> 2017-10-07 06:48:56.751800 7f7e12fe1700  1 -- 10.8.129.109:6806/3045579401 <== osd.0 10.8.129.109:6801/706613 21 ==== osd_op_reply(23 200.00000000 [read 0~0] v0'0 uv0 ondisk = -2 ((2) No such file or directory)) v8 ==== 156+0+0 (948428715 0 0) 0x563818e89c00 con 0x563817ef8000
     0> 2017-10-07 06:48:56.753567 7f7e0a05a700 -1 /builddir/build/BUILD/ceph-12.2.1/src/osdc/Journaler.cc: In function 'void Journaler::_finish_reread_head_and_probe(int, C_OnFinisher*)' thread 7f7e0a05a700 time 2017-10-07 06:48:56.751895
/builddir/build/BUILD/ceph-12.2.1/src/osdc/Journaler.cc: 402: FAILED assert(!r)

 ceph version 12.2.1-10.el7cp (5ba1c3fa606d7bf16f72756b0026f04a40297673) luminous (stable)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x56380d90b390]
 2: (()+0x54636c) [0x56380d87c36c]
 3: (Context::complete(int)+0x9) [0x56380d5eafa9]
 4: (Journaler::_finish_reread_head(int, ceph::buffer::list&, Context*)+0x297) [0x56380d87ac17]
 5: (Context::complete(int)+0x9) [0x56380d5eafa9]
 6: (Finisher::finisher_thread_entry()+0x1c5) [0x56380d90a335]
 7: (()+0x7e25) [0x7f7e150a5e25]
 8: (clone()+0x6d) [0x7f7e1418834d]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

MDLog::_reformat_journal() deleted object 200.00000000 and created head object 300.00000000 for new journal. The standby replay mds found that object 200.00000000 is missing.

This should be test case bug.

#2 Updated by Patrick Donnelly 7 months ago

  • Priority changed from Normal to High
  • Target version set to v14.0.0
  • Source set to Q/A
  • Backport changed from luminous to mimic,luminous

#3 Updated by Patrick Donnelly 7 months ago

  • Status changed from New to Rejected

Seems this no longer happens anymore.

Also available in: Atom PDF