Hi,
I have installed the following package versions from Sage's deb repo:
ceph-base/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph-common/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph-fuse/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph-mds/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph-mgr-dashboard/stable,now 14.2.4-1-gd592e56-1bionic all [installed]
ceph-mgr-diskprediction-cloud/stable,now 14.2.4-1-gd592e56-1bionic all [installed,automatic]
ceph-mgr-diskprediction-local/stable,now 14.2.4-1-gd592e56-1bionic all [installed,automatic]
ceph-mgr-rook/stable,now 14.2.4-1-gd592e56-1bionic all [installed,automatic]
ceph-mgr-ssh/stable,now 14.2.4-1-gd592e56-1bionic all [installed,automatic]
ceph-mgr/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph-mon/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph-osd/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
ceph/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
libcephfs1/oldstable,now 10.2.11-2 amd64 [installed]
libcephfs2/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
python-ceph-argparse/stable,now 14.2.4-1-gd592e56-1bionic all [installed]
python-cephfs/stable,now 14.2.4-1-gd592e56-1bionic amd64 [installed]
Is there a more recent version available?
In order to stabilize the cluster I have taken several measures:
1. setting options: noout nobackfill norecover norebalance nodown
2. stopping all OSDs
3. stopping all MGRs and MONs
4. setting in ceph.conf: cephx_require_signatures = false, cephx_cluster_require_signatures = false, cephx_sign_messages = false
5. starting all OSDs
6. starting all MGRs and MONs
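
For anyone who wants to retrace the sequence above, here is a rough dry-run sketch in Python that only prints the commands in order and executes nothing. The function names are made up for illustration. It also assumes the flags are set with `ceph osd set <flag>` and that the daemons are managed through the systemd targets ceph-osd.target, ceph-mgr.target and ceph-mon.target on every node; the post does not say how the daemons were actually stopped and started, so adjust as needed.

```python
# Flags named in step 1 of the post.
FLAGS = ["noout", "nobackfill", "norecover", "norebalance", "nodown"]


def flag_commands(action):
    """Return one `ceph osd set|unset <flag>` command per flag."""
    if action not in ("set", "unset"):
        raise ValueError("action must be 'set' or 'unset'")
    return [f"ceph osd {action} {flag}" for flag in FLAGS]


def service_commands(action, targets):
    """Return systemctl commands for the given targets.

    Assumption: daemons are controlled via systemd targets, and the
    commands have to be repeated on every node of the cluster.
    """
    return [f"systemctl {action} {target}" for target in targets]


def stabilization_plan():
    """Build the command list in the same order as steps 1 to 6."""
    plan = []
    plan += flag_commands("set")                                   # step 1
    plan += service_commands("stop", ["ceph-osd.target"])          # step 2
    plan += service_commands("stop", ["ceph-mgr.target",
                                      "ceph-mon.target"])          # step 3
    # Step 4 is a manual edit, so it is only recorded as a reminder.
    plan.append("# edit ceph.conf: set the three cephx signature options to false")
    plan += service_commands("start", ["ceph-osd.target"])         # step 5
    plan += service_commands("start", ["ceph-mgr.target",
                                       "ceph-mon.target"])         # step 6
    return plan


if __name__ == "__main__":
    # Dry run: print the plan instead of executing anything.
    for line in stabilization_plan():
        print(line)
```

Calling flag_commands("unset") gives the matching commands for clearing the flags again afterwards.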
With this, the cluster recovered to a state with some slow requests and a few stuck requests, but without the error in the MGR log.
Then I unset the options noout nobackfill norecover norebalance nodown again and deleted the cephx settings from ceph.conf.
Unfortunately, the cluster has still not fully recovered, but the error message no longer appears in the MGR log.