Bug #62925: cephfs-journal-tool: Add preventive measures in the tool to avoid corruting a ceph file system - CephFS - Ceph

Actions

Copy link

#1

Updated by Prashant D 8 months ago

Description updated (diff)

Actions

Copy link

#2

Updated by Venky Shankar 8 months ago

Category set to Code Hygiene
Assignee set to Jos Collin
Target version set to v19.0.0
Backport set to reef,quincy
Component(FS) tools added

Prashant D wrote:

The cephfs-journal-tool should be used by expert who has the knowledge of CephFS internals. Though we have a clear warning message on https://docs.ceph.com/en/latest/cephfs/disaster-recovery-experts/#recovery-from-missing-metadata-objects doc to not to use cephfs-journal-tool to reset journal without cephfs team's advice, still some users venture out to try this tools without much thought which can result in MDS crash as observed in https://tracker.ceph.com/issues/58878.

[...]

We should have a warning message with a prompt to continue or not when we run this tool to reset the journal. Also cephfs-journal-tool should not be run when cephfs is online or we should have a clear warning message when user attempts to run against live cephfs, mostly when "event recover_dentries summary" command to write any inodes/dentries recoverable from the journal to the RADOS store.

Fair point.

Jos, please take this one,

Actions

Copy link

#3