Bug #8178

closed

0.79: feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000000000

Added by Dmitry Smirnov about 10 years ago. Updated about 10 years ago.

Status: Resolved
Priority: Normal
Assignee: -
Target version: -
% Done: 0%
Source: Community (user)
Tags:
Backport:
Regression:
Severity: 2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

For some weeks I had no trouble with RBD clients on Linux 3.13.10 x86_64.
Today, after I created a new erasure pool, all RBD clients suddenly stopped working:

libceph: mon1 {IP:6789} feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000000000

My attempt to recover with "ceph osd crush tunables default", which had worked previously, was unsuccessful.
Removing the new pool did not help either.

So far I have been able to recover some clients by upgrading the Linux kernel to version 3.14. That upgrade was undesirable, and I would prefer to recover by other means. Is that possible, and if so, how?
The other Linux 3.13 clients are still unable to use RBD devices from the replicated pool.

As my unfortunate experience showed, it is surprisingly easy to cause significant downtime through an isolated experiment with an erasure pool on a cluster whose RBD clients use other pool(s).

If possible, please prevent this kind of outcome.
Please advise how to recover without upgrading to Linux 3.14.

If my reading of "src/include/ceph_features.h" is correct, the missing feature 1000000000 translates to "CEPH_FEATURE_CRUSH_V2 (1ULL<<36)", right?
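
For anyone who wants to check the arithmetic, here is a minimal Python sketch that decodes the log line quoted above. It assumes the three values are hexadecimal bitmasks and that the reported "missing" mask is the set of server bits the client lacks. The helper names and the one-entry lookup table are illustrative only; the single name in the table comes from my reading of ceph_features.h, and every other bit is left unnamed.

```python
import re

# Kernel log line as quoted in this report.
LOG_LINE = ("libceph: mon1 {IP:6789} feature set mismatch, "
            "my 4a042a42 < server's 104a042a42, missing 1000000000")

# Illustrative table: only the bit discussed in this report is named.
KNOWN_FEATURE_BITS = {36: "CEPH_FEATURE_CRUSH_V2"}


def parse_mismatch(line):
    """Return (mine, server, missing) as integers parsed from the hex fields."""
    m = re.search(
        r"my ([0-9a-f]+) < server's ([0-9a-f]+), missing ([0-9a-f]+)", line)
    if m is None:
        raise ValueError("not a feature set mismatch line")
    return tuple(int(g, 16) for g in m.groups())


def set_bits(mask):
    """List the positions of the bits set in mask, lowest first."""
    return [i for i in range(mask.bit_length()) if (mask >> i) & 1]


mine, server, missing = parse_mismatch(LOG_LINE)

# Assumption: "missing" is the server bits that the client does not have.
assert missing == server & ~mine

for bit in set_bits(missing):
    print(f"bit {bit}: {KNOWN_FEATURE_BITS.get(bit, 'unknown')}")
```

For the line above this reports a single missing bit, 36, which matches (1ULL<<36).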

Thanks.
