Bug #18037
closedleveldb corruption leads to "Operation not permitted not handled" and assert
0%
Description
we have a jewel OSD that doesn't start due to leveldb corruption (looks like). We upped the debug levels and can see filestore emitting an odd message "Operation not permitted not handled", followed "unexpected error code" and a dump of the operation, followed by assert(0 == "unexpected error").
Detailed log attached.
Files
Updated by Nathan Cutler over 7 years ago
- Related to Bug #16257: Ceph random bug is killing osds (os/filestore/FileStore.cc: 2912: FAILED assert(0 == "unexpected error") added
Updated by Nathan Cutler over 7 years ago
- Description updated (diff)
EPERM Operation not permitted; only the owner of the file (or other resource) or processes with special privileges can perform the operation.
Hm. This OSD had XFS corruption which was repaired. Maybe the xfs_repair restored a file, but left it owned by root? I'll try "chown -R ceph.ceph /var/lib/osd/..." and report back.
Updated by Nathan Cutler over 7 years ago
Updated by Nathan Cutler over 7 years ago
- Status changed from New to In Progress
- Assignee set to Nathan Cutler
Updated by Nathan Cutler over 7 years ago
- Status changed from In Progress to Fix Under Review
- Backport set to jewel
This PR improves the error message: https://github.com/ceph/ceph/pull/12181
Updated by Nathan Cutler over 7 years ago
This OSD had XFS corruption which was repaired. Maybe the xfs_repair restored a file, but left it owned by root? I'll try "chown -R ceph.ceph /var/lib/osd/..." and report back.
I just got word that the recursive chown does not help.
Updated by Nathan Cutler over 7 years ago
- Status changed from Fix Under Review to Pending Backport
- Backport changed from jewel to jewel,kraken
Updated by Nathan Cutler over 7 years ago
- Copied to Backport #18417: jewel: leveldb corruption leads to "Operation not permitted not handled" and assert added
Updated by Nathan Cutler over 7 years ago
- Copied to Backport #18418: kraken: leveldb corruption leads to "Operation not permitted not handled" and assert added
Updated by Nathan Cutler about 7 years ago
- Status changed from Pending Backport to Resolved