Project

General

Profile

Bug #22266

mgr/PyModuleRegistry.cc: 139: FAILED assert(map.epoch > 0)

Added by Sage Weil 18 days ago. Updated 16 days ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
Correctness/Safety
Target version:
-
Start date:
11/28/2017
Due date:
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
rados
Release:
Needs Doc:
No
Component(RADOS):

Description

2017-11-28T16:02:48.792 INFO:tasks.ceph.mgr.x.smithi106.stderr:/build/ceph-12.2.1-838-gacb0271/src/mgr/PyModuleRegistry.cc: In function 'int PyModuleRegistry::init(const MgrMap&)' thread 7f9148335700 time 2017-11-28 16:02:48.618200
2017-11-28T16:02:48.792 INFO:tasks.ceph.mgr.x.smithi106.stderr:/build/ceph-12.2.1-838-gacb0271/src/mgr/PyModuleRegistry.cc: 139: FAILED assert(map.epoch > 0)
2017-11-28T16:02:48.795 INFO:tasks.ceph.mgr.x.smithi106.stderr: ceph version 12.2.1-838-gacb0271 (acb02717f6e96f96d4128bbebd946238d3c79291) luminous (stable)
2017-11-28T16:02:48.795 INFO:tasks.ceph.mgr.x.smithi106.stderr: 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x102) [0x558bcc270fa2]
2017-11-28T16:02:48.795 INFO:tasks.ceph.mgr.x.smithi106.stderr: 2: (PyModuleRegistry::init(MgrMap const&)+0xd24) [0x558bcc113e34]
2017-11-28T16:02:48.795 INFO:tasks.ceph.mgr.x.smithi106.stderr: 3: (MgrStandby::handle_mgr_map(MMgrMap*)+0x1c6) [0x558bcc1279e6]
2017-11-28T16:02:48.796 INFO:tasks.ceph.mgr.x.smithi106.stderr: 4: (MgrStandby::ms_dispatch(Message*)+0x254) [0x558bcc1287d4]
2017-11-28T16:02:48.796 INFO:tasks.ceph.mgr.x.smithi106.stderr: 5: (DispatchQueue::entry()+0xf4a) [0x558bcc58c17a]
2017-11-28T16:02:48.796 INFO:tasks.ceph.mgr.x.smithi106.stderr: 6: (DispatchQueue::DispatchThread::entry()+0xd) [0x558bcc32ad7d]
2017-11-28T16:02:48.796 INFO:tasks.ceph.mgr.x.smithi106.stderr: 7: (()+0x76ba) [0x7f914efee6ba]
2017-11-28T16:02:48.796 INFO:tasks.ceph.mgr.x.smithi106.stderr: 8: (clone()+0x6d) [0x7f914e05a3dd]
2017-11-28T16:02:48.796 INFO:tasks.ceph.mgr.x.smithi106.stderr: NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

/a/yuriw-2017-11-28_15:44:54-rados-luminous-distro-basic-smithi/1901404

Related issues

Copied to RADOS - Backport #22275: mgr/PyModuleRegistry.cc: 139: FAILED assert(map.epoch > 0) Resolved

History

#1 Updated by John Spray 18 days ago

The monitor is sending the manager a MgrMap with epoch zero. It's happening immediately after a monitor restart, which correlates with the rebuild_mondb task executing.

I think the manager is correct to assert on this: the rebuild_mondb task should be stopping other services if it's going to make map epochs go back in time.

#2 Updated by Sage Weil 18 days ago

/a/yuriw-2017-11-27_23:31:26-rados-luminous-distro-basic-smithi/1897131

reproducible!

#3 Updated by Kefu Chai 18 days ago

  • Assignee set to Kefu Chai

#4 Updated by Kefu Chai 17 days ago

  • Status changed from Verified to Testing

#5 Updated by Kefu Chai 17 days ago

  • Status changed from Testing to Need Review

#6 Updated by Kefu Chai 17 days ago

  • Project changed from mgr to RADOS
  • Category set to Correctness/Safety
  • ceph-qa-suite rados added

#7 Updated by Kefu Chai 17 days ago

  • Copied to Backport #22275: mgr/PyModuleRegistry.cc: 139: FAILED assert(map.epoch > 0) added

#8 Updated by Josh Durgin 17 days ago

  • Status changed from Need Review to Pending Backport

#9 Updated by Kefu Chai 16 days ago

  • Status changed from Pending Backport to Resolved

Also available in: Atom PDF