leveldb corruption leads to "Operation not permitted not handled" and assert
We have a Jewel OSD that doesn't start, apparently due to leveldb corruption. We raised the debug levels and can see filestore emitting an odd message, "Operation not permitted not handled", followed by "unexpected error code" and a dump of the operation, followed by assert(0 == "unexpected error").
Detailed log attached.
#4 Updated by Nathan Cutler 4 months ago
- Description updated (diff)
EPERM Operation not permitted; only the owner of the file (or other resource) or processes with special privileges can perform the operation.
Hm. This OSD had XFS corruption, which was repaired. Maybe xfs_repair restored a file but left it owned by root? I'll try "chown -R ceph.ceph /var/lib/osd/..." and report back.
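
Before running a recursive chown, it may help to confirm the ownership theory by listing what is not owned by the expected user. The following is only a minimal sketch: the function name find_foreign_owned is hypothetical, and it assumes the OSD daemon runs as a user whose UID you look up yourself (for example with pwd.getpwnam("ceph").pw_uid).

```python
import os


def find_foreign_owned(root, expected_uid):
    """Return every path under root (root included) whose owner UID
    differs from expected_uid. Symlinks are inspected, not followed."""
    mismatched = []
    # Check the top-level directory itself.
    if os.lstat(root).st_uid != expected_uid:
        mismatched.append(root)
    for dirpath, dirnames, filenames in os.walk(root):
        for name in dirnames + filenames:
            path = os.path.join(dirpath, name)
            try:
                st = os.lstat(path)
            except OSError:
                # Entry vanished or is unreadable; skip it in this sketch.
                continue
            if st.st_uid != expected_uid:
                mismatched.append(path)
    return mismatched
```

If the list comes back empty for the OSD data directory, the EPERM probably has another cause; if it names a few root-owned files, that would support the xfs_repair explanation and show exactly what the chown is going to change.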