Bug #50035
closed
cephfs-mirror: use sensible mount/shutdown timeouts
Added by Venky Shankar about 3 years ago.
Updated almost 3 years ago.
Category:
Administration/Usability
Description
The mirror daemon just relies on the defaults which are pretty high:
Option("client_mount_timeout", Option::TYPE_FLOAT, Option::LEVEL_ADVANCED)
.set_default(300.0)
.set_description("timeout for mounting CephFS (seconds)"),
Option("client_shutdown_timeout", Option::TYPE_SECS, Option::LEVEL_ADVANCED)
.set_flag(Option::FLAG_RUNTIME)
.set_default(30)
.set_min(0)
.set_description("timeout for shutting down CephFS")
.set_long_description("Timeout for shutting down CephFS via unmount or shutdown.")
.add_tag("client")
Especially `client_mount_timeout`. When a (remote) filesystem is not reachable (for various reason), this can stall the updater timer thread for 300 seconds! `cephfs-mirror` should define (in its config) and use sensible defaults. It's ok to fail mounting a remote file system since the mirror daemon will retry connecting periodically.
- Status changed from New to Triaged
- Assignee set to Venky Shankar
Mostly setting this as a config option (in ceph.conf) would suffice. However, cephfs-mirror can connect to the remote cluster with monitor addresss and does not require remote cluster ceph config file on the primary cluster. How would we want the apply remote cluster config in this case?
Venky Shankar wrote:
Mostly setting this as a config option (in ceph.conf) would suffice. However, cephfs-mirror can connect to the remote cluster with monitor addresss and does not require remote cluster ceph config file on the primary cluster. How would we want the apply remote cluster config in this case?
cephfs-mirror can update the defaults in its cct->_conf. It woudl do this for both CephContexts it holds for each cluster, right?
Patrick Donnelly wrote:
Venky Shankar wrote:
Mostly setting this as a config option (in ceph.conf) would suffice. However, cephfs-mirror can connect to the remote cluster with monitor addresss and does not require remote cluster ceph config file on the primary cluster. How would we want the apply remote cluster config in this case?
cephfs-mirror can update the defaults in its cct->_conf. It woudl do this for both CephContexts it holds for each cluster, right?
Right -- that's the reason to introduce a config option to override the override :/
- Status changed from Triaged to In Progress
- Status changed from In Progress to Fix Under Review
- Pull request ID set to 40885
- Related to Bug #50224: qa: test_mirroring_init_failure_with_recovery failure added
- Status changed from Fix Under Review to Pending Backport
- Copied to Backport #50871: pacific: cephfs-mirror: use sensible mount/shutdown timeouts added
- Status changed from Pending Backport to Resolved
While running with --resolve-parent, the script "backport-create-issue" noticed that all backports of this issue are in status "Resolved" or "Rejected".
Also available in: Atom
PDF