Bug #40777
openhit assert in AuthMonitor::update_from_paxos
0%
0810347d2b7e07374c20a010e7ca411ca125d50c01528c53c2e7d8556e1ef3ee
0d54c46a0fb63aa4b54e651261bdf5fd8f1a40f76131cbee615df5e5b8d412c7
3fdd4ce4b0d6529edfd9648756a49bcc247aa294fddd59d16b79cbb5b5fc93a8
47453f0f1c52fc416989fa4834738e749589093ee2e023ce22ca034c17441e3f
5cc0dd2441a47d889aadcd5ab5fd98685bea94235e4c052c8eb2fe64819d6012
5e65c408af9278ef1649d782df52b4b944463a56a06c5f5a4628735c3ed71329
69e63e0e1ca4aadc0d3ce45b8820b2717654642518a78a41783b1010b47c2334
c961c55e7b3d0a454cbb0f8ba37201e2a25f07d575dfc1303cab2f7b211cbab5
d824e0db27c500cc0793b2f262a6d29db2cb075bb14b2162b72d382f232bec9a
e4de3f9b33f756f03d809432dc35f5e9bbef8eba284d91af5905a591eb680b92
ed678e97c3a893fd090c48f06f8931a04d01db69fcf059c0eed46070360cc8bc
Description
I created the ceph cluster by the rook(https://github.com/rook/rook), and ceph version is 12.2.7 stable.
After I reboot the host, 1 of 3 monitors hit the assert:
ceph version 12.2.7 (3ec878d1e53e1aeb47a9f619c49d9e7c0aa384d5) luminous (stable) 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x110) [0x7f4bf21c3160 2: (AuthMonitor::update_from_paxos(bool*)+0x159d) [0x7f4bf1fd50fd] 3: (PaxosService::refresh(bool*)+0x1ae) [0x7f4bf20a0d7e] 4: (Monitor::refresh_from_paxos(bool*)+0x19b) [0x7f4bf1f6a57b] 5: (Monitor::init_paxos()+0x115) [0x7f4bf1f6a9c5] 6: (Monitor::preinit()+0x9c6) [0x7f4bf1f6b3e6] 7: (main()+0x4012) [0x7f4bf1e9b042] 8: (__libc_start_main()+0xf5) [0x7f4bee24c445] 9: (()+0x3afd5e) [0x7f4bf1f3fd5e] NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.
Just found some similar but very old issues, and the fix are in the code already.