Bug #61948


Failed assert "pg_upmap_primaries.empty()" in the read balancer

Added by Laura Flores 10 months ago. Updated 10 months ago.

Status: Need More Info
Priority: High
Assignee:
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Reported by Stefan:

This bug was hit on the RC (v18.1.2).

ceph crash info 2023-07-10T19:02:42.116150Z_8061077a-2257-49c7-8088-c94a5a580f6a
{
     "assert_condition": "pg_upmap_primaries.empty()",
     "assert_file": "/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/18.1.2/rpm/el8/BUILD/ceph-18.1.2/src/osd/OSDMap.cc",
     "assert_func": "void OSDMap::encode(ceph::buffer::v15_2_0::list&, uint64_t) const",
     "assert_line": 3239,
     "assert_msg": "/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/18.1.2/rpm/el8/BUILD/ceph-18.1.2/src/osd/OSDMap.cc: In function 'void OSDMap::encode(ceph::buffer::v15_2_0::list&, uint64_t) const' thread 7f95f67cb700 time 2023-07-10T19:02:42.106910+0000\n/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/18.1.2/rpm/el8/BUILD/ceph-18.1.2/src/osd/OSDMap.cc: 3239: FAILED ceph_assert(pg_upmap_primaries.empty())\n",
     "assert_thread_name": "ms_dispatch",
     "backtrace": [
         "/lib64/libpthread.so.0(+0x12cf0) [0x7f9601c12cf0]",
         "gsignal()",
         "abort()",
         "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x18f) [0x7f96043f0e15]",
         "/usr/lib64/ceph/libceph-common.so.2(+0x2a9f81) [0x7f96043f0f81]",
         "(OSDMap::encode(ceph::buffer::v15_2_0::list&, unsigned long) const+0x1229) [0x7f96048a4a19]",
         "(OSDMonitor::reencode_full_map(ceph::buffer::v15_2_0::list&, unsigned long)+0xe0) [0x55d71adf3d30]",
         "(OSDMonitor::get_version_full(unsigned long, unsigned long, ceph::buffer::v15_2_0::list&)+0x326) [0x55d71ae27616]",
         "(OSDMonitor::build_latest_full(unsigned long)+0x306) [0x55d71ae27986]",
         "(OSDMonitor::check_osdmap_sub(Subscription*)+0x7a) [0x55d71ae2b81a]",
         "(Monitor::handle_subscribe(boost::intrusive_ptr<MonOpRequest>)+0xeed) [0x55d71ac4b25d]",
         "(Monitor::dispatch_op(boost::intrusive_ptr<MonOpRequest>)+0x7d6) [0x55d71ac5fea6]",
         "(Monitor::_ms_dispatch(Message*)+0x406) [0x55d71ac610a6]",
         "(Dispatcher::ms_dispatch2(boost::intrusive_ptr<Message> const&)+0x5d) [0x55d71ac91bcd]",
         "(Messenger::ms_deliver_dispatch(boost::intrusive_ptr<Message> const&)+0x478) [0x7f9604669c08]",
         "(DispatchQueue::entry()+0x50f) [0x7f9604666daf]",
         "(DispatchQueue::DispatchThread::entry()+0x11) [0x7f960472d391]",
         "/lib64/libpthread.so.0(+0x81ca) [0x7f9601c081ca]",
         "clone()" 
     ],
     "ceph_version": "18.1.2",
     "crash_id": "2023-07-10T19:02:42.116150Z_8061077a-2257-49c7-8088-c94a5a580f6a",
     "entity_name": "mon.reef03",
     "os_id": "centos",
     "os_name": "CentOS Stream",
     "os_version": "8",
     "os_version_id": "8",
     "process_name": "ceph-mon",
     "stack_sig": "36e64bb49053b7de85c9cf46c6739fe42dd443b8320310aef2318e06c48ee87e",
     "timestamp": "2023-07-10T19:02:42.116150Z",
     "utsname_hostname": "reef03",
     "utsname_machine": "x86_64",
     "utsname_release": "5.4.0-147-generic",
     "utsname_sysname": "Linux",
     "utsname_version": "#164-Ubuntu SMP Tue Mar 21 14:23:17 UTC 2023" 
}

Error EINVAL: osd.7 is not in acting set for pg 2.23

But osd.7 was in the up and acting set:

2.23 36 0 0 0 0 150994944 0 0 53 0 53 active+clean 2023-07-10T14:33:28.050243+0000 48'53 76:178 [4,1,7] 4 [4,1,7] 4 0'0 2023-07-10T13:47:36.917296+0000 0'0 2023-07-10T13:47:36.917296+0000 0 0 periodic scrub scheduled @ 2023-07-11T15:01:20.025433+0000 0 0

Details that led to the bug:
- Installed v18.1.2 cephadm
- Deployed a Ceph cluster with cephadm, three nodes (VMs) with 3 OSDs each, which installed Ceph v18.0.0-4795-g2f6e4f7d by default
- Set the min_compat_client to reef
- Hit the crash
- Redeployed the cluster to v18.1.2
- Deleted the pool, recreated it, and repeated the above steps
- Mon daemon crashed again

Affected logs attached.


Files

osd.6.log (90.9 KB) Laura Flores, 07/10/2023 07:39 PM
reef.mon3.log (90.9 KB) Laura Flores, 07/10/2023 07:39 PM
om (8.17 KB) Stefan Kooman, 07/10/2023 07:55 PM
om_old (6.39 KB) Stefan Kooman, 07/10/2023 07:59 PM
Actions #1

Updated by Laura Flores 10 months ago

  • Assignee set to Laura Flores

Actions #2

Updated by Stefan Kooman 10 months ago

Attached are the first osd map I used (om_old) and the osd map used the second time, with version 18.1.2 (om).

Actions #3

Updated by Laura Flores 10 months ago

Thanks Stefan!

Actions #4

Updated by Laura Flores 10 months ago

The crash occurred at the step where the osdmap is encoded.
Since the read balancer is a new feature in Reef, encoding its pg upmap primary entries requires the most recent encoder version (10). This cluster, however, detected older features and settled on an older encoder version (<10), which assumes there are no pg upmap primary entries. The osdmap did contain such entries, so the assert fired and the mon crashed.
https://github.com/ceph/ceph/blob/v18.1.2/src/osd/OSDMap.cc#L3236-L3240

    if (v >= 10) {
      encode(pg_upmap_primaries, bl);
    } else {
      ceph_assert(pg_upmap_primaries.empty());
    }

What remains to be figured out is why this cluster detected older features and settled on an older encoder version.
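
To make the failure mode concrete, here is a minimal, self-contained sketch (this is not the actual OSDMap.cc code; the feature bit and helper names are invented for illustration) of how a feature-based downgrade of the encoder version combines with a non-empty pg_upmap_primaries map to trip this assert:

    #include <cassert>
    #include <cstdint>
    #include <map>

    // Hypothetical feature bit standing in for "peer understands reef encodings".
    constexpr uint64_t FAKE_FEATURE_REEF = 1ULL << 0;

    // Sketch: drop to a pre-reef struct version when the negotiated features
    // do not include reef support.
    static uint8_t pick_encode_version(uint64_t features) {
      return (features & FAKE_FEATURE_REEF) ? 10 : 9;
    }

    // Sketch of the tail of the encode path that contains the failed assert.
    static void encode_osdmap(uint64_t features,
                              const std::map<uint64_t, int>& pg_upmap_primaries) {
      uint8_t v = pick_encode_version(features);
      if (v >= 10) {
        // encode(pg_upmap_primaries, bl);  // reef-aware peers can decode this
      } else {
        // A pre-reef encoding has no slot for upmap primaries, so the map must
        // be empty -- this is the ceph_assert that fired on the mon.
        assert(pg_upmap_primaries.empty());
      }
    }

    int main() {
      std::map<uint64_t, int> primaries = {{0x223, 7}};  // e.g. pg 2.23 -> osd.7
      encode_osdmap(FAKE_FEATURE_REEF, primaries);       // fine: v = 10
      encode_osdmap(0, primaries);                       // aborts: v = 9, map non-empty
      return 0;
    }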

This may be relevant in the mon log:

  -341> 2023-07-10T19:02:16.674+0000 7f9604fb2c80  0 mon.reef03@-1(???).osd e184 crush map has features 3314933000854323200, adjusting msgr requires
  -340> 2023-07-10T19:02:16.674+0000 7f9604fb2c80  0 mon.reef03@-1(???).osd e184 crush map has features 2738472248550883328, adjusting msgr requires
  -339> 2023-07-10T19:02:16.674+0000 7f9604fb2c80  0 mon.reef03@-1(???).osd e184 crush map has features 2738472248550883328, adjusting msgr requires
  -338> 2023-07-10T19:02:16.674+0000 7f9604fb2c80  0 mon.reef03@-1(???).osd e184 crush map has features 2738472248550883328, adjusting msgr requires
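
Not a Ceph tool, but if it helps to see which raw bits differ between the two masks in that log, here is a small generic diff of the two values (it only prints bit positions; it does not map them to Ceph's named feature bits):

    #include <cstdint>
    #include <cstdio>

    int main() {
      // The two masks from the mon log above.
      const uint64_t before = 3314933000854323200ULL;  // first "crush map has features" value
      const uint64_t after  = 2738472248550883328ULL;  // value after "adjusting msgr requires"
      const uint64_t diff   = before ^ after;
      for (int bit = 0; bit < 64; ++bit) {
        if (diff & (1ULL << bit)) {
          std::printf("bit %2d: %s\n", bit,
                      ((before >> bit) & 1) ? "present before, cleared after"
                                            : "absent before, set after");
        }
      }
      return 0;
    }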

Actions #5

Updated by Stefan Kooman 10 months ago

I created a new cluster and upgraded it to 18.1.2 (from 18.0.0) before creating any OSDs (and pool/image), then redid the tests. The OSDs asserted again (same as before). The monitor(s) did hit an assert this time. I had debug logging for the monitors set to 20/20 ... but not for the OSDs. I will try again with debug set to 20/20 for the OSDs. Strangely, ceph-crash did not report that the OSDs had crashed (it did in the other cluster). But that might be another issue / bug...

I am pretty sure (I checked the timestamps) that the "crush map has features 2738472248550883328, adjusting msgr requires" logging is a result of "ceph osd set-require-min-compat-client reef".

Actions #6

Updated by Stefan Kooman 10 months ago

I deleted all OSDs and recreated them, then repeated the tests, but could not reproduce. So there must be some condition(s) that trigger it, but I am not sure what. I hope my observations prove to be helpful.

Actions #7

Updated by Laura Flores 10 months ago

Hey Stefan,

Thanks for your analysis. This failure indicates that there is a client-server version mismatch (could be a deeper issue, but this is what the failure points to). If you have mon and osd logs with a 20/20 logging level, that would definitely be helpful.

In the meantime, I am working to see if I can reproduce your issue on my end.

Actions #8

Updated by Stefan Kooman 10 months ago

I recreated yet another cluster and enabled debug logging for osd / mon, but I could not reproduce.

Actions #9

Updated by Laura Flores 10 months ago

Thanks Stefan. We're continuing to look into it. If you're able to reproduce the issue again, it may help to dump your mon_status, as it contains a feature map. There is some kind of feature incompatibility going on here, so that could be telling.

`ceph tell mon.* mon_status`

Or use this if the mons are not accessible:
`ceph --admin-daemon /var/run/ceph/<cluster id>/ceph-mon.<mon id>.asok mon_status`

Actions #10

Updated by Laura Flores 10 months ago

Hi Stefan,

Can you tell me whether just one mon crashed when you applied the upmaps, or did they all crash?

Thanks,
Laura

Actions #11

Updated by Stefan Kooman 10 months ago

> Can you tell me whether just one mon crashed when you applied the upmaps, or did they all crash?

The second cluster only had one monitor crash (I don't have logs anymore of the first cluster).

Actions #12

Updated by Radoslaw Zarzynski 10 months ago

  • Status changed from New to Need More Info

Marking as Need More Info for now, but in the longer term we might close this if reproduction proves impossible.
