Project

General

Profile

Actions

Bug #17412

closed

Applying ruleset halts monitor

Added by Arvydas Opulskis over 7 years ago. Updated almost 7 years ago.

Status:
Duplicate
Priority:
Low
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hi,

I noticed strange behavior of monitors during changing pool's crush_ruleset parameter. In some cases monitor halts and if command is not canceled (ctrl+c) other monitors goes down after some time too. After some investigation I've found what caused this:

1. Monitor halts only when applying last rule in crush map
2. This happens only if rules are not numbered in sequence

I had this layout in my crush map:

 rule default {
    ruleset 0
    ...
  }

  rule xxx-sata {
    ruleset 2
    ...
  }

  rule sata {
    ruleset 3
    ...
  }

  rule ssd {
    ruleset 4
    ...
  }

  rule sas {
    ruleset 5
    ...
  }

So, when applying "ceph osd pool set rbd crush_ruleset 5" I got monitor halt.
After changing numbering of rulesets in crush map from 0 to 4, there were no issues assigning ruleset 4 (sas) to rbd pool.
I also tried to set not existing ruleset (6) to pool, but then I got error about not known ruleset, which is ok.

I guess "ceph osd pool set" command checks if that rule exist, but when applying it, it ignores ruleset number from crushmap and searches it by sequence.

I've tested it on Jewel 10.2.2 version.


Related issues 1 (0 open1 closed)

Is duplicate of Ceph - Bug #16653: ceph mon Segmentation fault after set crush_ruleset ceph 10.2.2ResolvedXiaoxi Chen

Actions
Actions #1

Updated by Igor Podoski over 7 years ago

This could be duplicate of http://tracker.ceph.com/issues/16653

@Arvydas, please check your monitor log, if you're seeing something similar to http://tracker.ceph.com/issues/16653#note-7, it's for sure a duplicate.

I can see the same behavior on 10.2.2 at my cluster.

Actions #2

Updated by Sage Weil almost 7 years ago

  • Status changed from New to Duplicate
Actions #3

Updated by Sage Weil almost 7 years ago

  • Is duplicate of Bug #16653: ceph mon Segmentation fault after set crush_ruleset ceph 10.2.2 added
Actions

Also available in: Atom PDF