Project

General

Profile

Actions

Bug #63888

open

rbd/clone: parent image does not exist but child image still have clone-child tag

Added by yite gu 4 months ago. Updated 4 months ago.

Status:
Need More Info
Priority:
Normal
Assignee:
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

# rbd -p replicapool-ssd children csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b
2023-12-22T08:31:46.582+0000 7f7f55347500  5 librbd::io::ImageRequestWQ: 0x7f7f3c0015b0 ImageRequestWQ: ictx=0x563664c28db0
2023-12-22T08:31:46.582+0000 7f7f55347500 20 librbd::ImageState: 0x563664c659a0 open
2023-12-22T08:31:46.582+0000 7f7f55347500 10 librbd::ImageState: 0x563664c659a0 0x563664c659a0 send_open_unlock
2023-12-22T08:31:46.582+0000 7f7f55347500 10 librbd::image::OpenRequest: 0x563664c65c70 send_v2_detect_header
2023-12-22T08:31:46.583+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_detect_header: r=0
2023-12-22T08:31:46.583+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 send_v2_get_id
2023-12-22T08:31:46.583+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_get_id: r=0
2023-12-22T08:31:46.583+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 send_v2_get_initial_metadata
2023-12-22T08:31:46.584+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_get_initial_metadata: r=0
2023-12-22T08:31:46.584+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 send_v2_get_create_timestamp
2023-12-22T08:31:46.585+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 handle_v2_get_create_timestamp: r=0
2023-12-22T08:31:46.585+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 send_v2_get_access_modify_timestamp
2023-12-22T08:31:46.585+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 handle_v2_get_access_modify_timestamp: r=0
2023-12-22T08:31:46.585+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 send_v2_get_data_pool
2023-12-22T08:31:46.585+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 handle_v2_get_data_pool: r=0
2023-12-22T08:31:46.585+0000 7f7f2bfff700 10 librbd::ImageCtx: init_layout stripe_unit 4194304 stripe_count 1 object_size 4194304 prefix rbd_data.dddf572dfa189a format rbd_data.dddf572dfa189a.%016llx
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x563664c65c70 send_refresh
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::ConfigWatcher: init:
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_mutable_metadata
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_mutable_metadata: r=0
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_parent: legacy=0
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_parent: r=0
2023-12-22T08:31:46.586+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_metadata
2023-12-22T08:31:46.586+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c017150 metadata_list: start_key=conf_
2023-12-22T08:31:46.586+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c017150 handle_metadata_list: r=0
2023-12-22T08:31:46.586+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c017150 finish: r=0
2023-12-22T08:31:46.587+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_metadata: r=0
2023-12-22T08:31:46.587+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_pool_metadata
2023-12-22T08:31:46.587+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c016cd0 metadata_list: start_key=conf_
2023-12-22T08:31:46.588+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c016cd0 handle_metadata_list: r=0
2023-12-22T08:31:46.588+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c016cd0 finish: r=0
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_pool_metadata: r=0
2023-12-22T08:31:46.588+0000 7f7f2bfff700 20 librbd::ImageCtx: apply_metadata
2023-12-22T08:31:46.588+0000 7f7f2bfff700  5 librbd::ImageCtx: 0x563664c28db0: disabling zero-copy writes
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_op_features
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_op_features: r=0
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_group
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_group: r=0
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_get_snapshots
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 handle_v2_get_snapshots: r=0
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c012190 send_v2_refresh_parent
2023-12-22T08:31:46.588+0000 7f7f2bfff700 10 librbd::image::RefreshParentRequest: 0x7f7f1c0153e0 send_open_parent
2023-12-22T08:31:46.590+0000 7f7f2bfff700  5 librbd::io::ImageRequestWQ: 0x7f7f1c05d930 ImageRequestWQ: ictx=0x7f7f1c01cea0
2023-12-22T08:31:46.590+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_v2_get_name
2023-12-22T08:31:46.591+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_get_name: r=0
2023-12-22T08:31:46.591+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_v2_get_initial_metadata
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_get_initial_metadata: r=0
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_v2_get_create_timestamp
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 handle_v2_get_create_timestamp: r=0
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_v2_get_access_modify_timestamp
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 handle_v2_get_access_modify_timestamp: r=0
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_v2_get_data_pool
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 handle_v2_get_data_pool: r=0
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::ImageCtx: init_layout stripe_unit 4194304 stripe_count 1 object_size 4194304 prefix rbd_data.dddf57aa5a5a3a format rbd_data.dddf57aa5a5a3a.%016llx
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_refresh
2023-12-22T08:31:46.593+0000 7f7f2bfff700 10 librbd::ConfigWatcher: init:
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_mutable_metadata
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_mutable_metadata: r=0
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_parent: legacy=0
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_parent: r=0
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_metadata
2023-12-22T08:31:46.594+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c016cd0 metadata_list: start_key=conf_
2023-12-22T08:31:46.594+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c016cd0 handle_metadata_list: r=0
2023-12-22T08:31:46.594+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c016cd0 finish: r=0
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_metadata: r=0
2023-12-22T08:31:46.594+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_pool_metadata
2023-12-22T08:31:46.594+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c017150 metadata_list: start_key=conf_
2023-12-22T08:31:46.595+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c017150 handle_metadata_list: r=0
2023-12-22T08:31:46.595+0000 7f7f2bfff700 15 librbd::image::GetMetadataRequest: 0x7f7f1c017150 finish: r=0
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_pool_metadata: r=0
2023-12-22T08:31:46.595+0000 7f7f2bfff700 20 librbd::ImageCtx: apply_metadata
2023-12-22T08:31:46.595+0000 7f7f2bfff700  5 librbd::ImageCtx: 0x7f7f1c01cea0: disabling zero-copy writes
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_op_features
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_op_features: r=0
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_group
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_group: r=0
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_get_snapshots
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_get_snapshots: r=0
2023-12-22T08:31:46.595+0000 7f7f2bfff700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_v2_apply
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_v2_apply
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::image::RefreshRequest: 0x7f7f1c06cc70 apply
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::image::RefreshRequest: new snapshot id=1098 name=2e13d4ed-f447-41c0-8731-aad6bc522086 size=100931731456
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 send_flush_aio
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AsyncOperation: 0x7f7f24002668 start_op
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::ImageRequest: 0x7f7f2b7f8b10 send: aio_flush: ictx=0x7f7f1c01cea0, completion=0x7f7f24002570
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AioCompletion: 0x7f7f24002570 set_request_count: pending=1
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::ObjectDispatcher: 0x7f7f1c05bab0 send: object_dispatch_spec=0x7f7f24002790
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AioCompletion: 0x7f7f24002570 complete_request: cb=1, pending=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AioCompletion: 0x7f7f24002570 finalize: r=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::RefreshRequest: 0x7f7f1c06cc70 handle_flush_aio: r=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AsyncOperation: 0x7f7f24002668 finish_op
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::OpenRequest: handle_refresh: r=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::OpenRequest: send_parent_cache: r=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::OpenRequest: 0x7f7f1c05bf30 send_set_snap
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::SetSnapRequest: send_block_writes
2023-12-22T08:31:46.595+0000 7f7f2b7fe700  5 librbd::io::ImageRequestWQ: 0x7f7f1c05d930 block_writes: 0x7f7f1c01cea0, num=1
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AsyncOperation: 0x7f7f24002668 start_op
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::ImageRequest: 0x7f7f2b7f8900 send: aio_flush: ictx=0x7f7f1c01cea0, completion=0x7f7f24002570
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AioCompletion: 0x7f7f24002570 set_request_count: pending=1
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::ObjectDispatcher: 0x7f7f1c05bab0 send: object_dispatch_spec=0x7f7f24002790
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AioCompletion: 0x7f7f24002570 complete_request: cb=1, pending=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 20 librbd::io::AioCompletion: 0x7f7f24002570 finalize: r=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::SetSnapRequest: handle_block_writes: r=0
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::SetSnapRequest: send_refresh_parent
2023-12-22T08:31:46.595+0000 7f7f2b7fe700 10 librbd::image::RefreshParentRequest: 0x7f7f24004ae0 send_open_parent
2023-12-22T08:31:46.618+0000 7f7f2b7fe700  5 librbd::io::ImageRequestWQ: 0x7f7f24049f10 ImageRequestWQ: ictx=0x7f7f24004c10
2023-12-22T08:31:46.618+0000 7f7f2b7fe700 10 librbd::image::OpenRequest: 0x7f7f24047da0 send_v2_get_name
2023-12-22T08:31:46.618+0000 7f7f2b7fe700 20 librbd::io::AsyncOperation: 0x7f7f24002668 finish_op
2023-12-22T08:31:46.618+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_get_name: r=-2
2023-12-22T08:31:46.618+0000 7f7f2bfff700 10 librbd::image::OpenRequest: image id dddf576ad516ec does not exist in rbd directory, searching in rbd trash...
2023-12-22T08:31:46.618+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f24047da0 send_v2_get_name_from_trash
2023-12-22T08:31:46.619+0000 7f7f2bfff700 10 librbd::image::OpenRequest: handle_v2_get_name_from_trash: r=-2
2023-12-22T08:31:46.619+0000 7f7f2bfff700  5 librbd::image::OpenRequest: failed to retrieve name for image id dddf576ad516ec
2023-12-22T08:31:46.619+0000 7f7f2bfff700 10 librbd::image::OpenRequest: 0x7f7f24047da0 send_close_image
2023-12-22T08:31:46.619+0000 7f7f2bfff700 10 librbd::image::CloseRequest: 0x7f7f1c076fe0 send_shut_down_update_watchers
2023-12-22T08:31:46.619+0000 7f7f2bfff700 20 librbd::ImageState: 0x7f7f24048060 shut_down_update_watchers
2023-12-22T08:31:46.619+0000 7f7f2bfff700 20 librbd::ImageState: 0x7f7f24048330 ImageUpdateWatchers::shut_down
2023-12-22T08:31:46.619+0000 7f7f2bfff700 20 librbd::ImageState: 0x7f7f24048330 ImageUpdateWatchers::shut_down: completing shut down
Actions #1

Updated by yite gu 4 months ago

csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b clone from csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b

# rbd -p replicapool-ssd children csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b
replicapool-ssd/csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b

csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b info as below:
# rbd -p replicapool-ssd info csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b
rbd image 'csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b':
    size 94 GiB in 24064 objects
    order 22 (4 MiB objects)
    snapshot_count: 1
    id: dddf57aa5a5a3a
    block_name_prefix: rbd_data.dddf57aa5a5a3a
    format: 2
    features: layering, operations
    op_features: clone-parent, clone-child, snap-trash
    flags:
    create_timestamp: Thu Dec 21 03:53:03 2023
    access_timestamp: Thu Dec 21 03:53:03 2023
    modify_timestamp: Thu Dec 21 03:53:03 2023

I want to open csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b, but fail:
# rbd -p replicapool-ssd children csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b
2023-12-22T08:03:40.157+0000 7f71b4ff9700 -1 librbd::image::RefreshParentRequest: failed to open parent image: (2) No such file or directory
2023-12-22T08:03:40.157+0000 7f71b4ff9700 -1 librbd::image::SetSnapRequest: failed to refresh snapshot parent: (2) No such file or directory
2023-12-22T08:03:40.157+0000 7f71b4ff9700 -1 librbd::image::OpenRequest: failed to set image snapshot: (2) No such file or directory
2023-12-22T08:03:40.157+0000 7f71b4ff9700 -1 librbd::image::RefreshParentRequest: failed to open parent image: (2) No such file or directory
2023-12-22T08:03:40.157+0000 7f71b4ff9700 -1 librbd::image::RefreshRequest: failed to refresh parent image: (2) No such file or directory
2023-12-22T08:03:40.157+0000 7f71b4ff9700 -1 librbd::image::OpenRequest: failed to refresh image: (2) No such file or directory
rbd: error opening image csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b: (2) No such file or directory

I set debug_rbd to 20, it want to find csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b's parent, but it not exist in rbd directory and trash.

Actions #2

Updated by yite gu 4 months ago

Why csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b have clone-child, but parent image already not exist.

Actions #3

Updated by yite gu 4 months ago

Actions #4

Updated by yite gu 4 months ago

restore step:
1. remove parent key

# rados -p replicapool-ssd rmomapkey rbd_header.dddf57aa5a5a3a parent

2. open csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b success.
# rbd -p replicapool-ssd info csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b
rbd image 'csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b':
    size 94 GiB in 24064 objects
    order 22 (4 MiB objects)
    snapshot_count: 1
    id: dddf572dfa189a
    block_name_prefix: rbd_data.dddf572dfa189a
    format: 2
    features: layering, deep-flatten, operations
    op_features: clone-parent, clone-child
    flags:
    create_timestamp: Thu Dec 21 03:54:44 2023
    access_timestamp: Thu Dec 21 03:54:44 2023
    modify_timestamp: Thu Dec 21 03:54:44 2023
    parent: replicapool-ssd/csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b@2e13d4ed-f447-41c0-8731-aad6bc522086
    overlap: 94 GiB

Actions #5

Updated by Ilya Dryomov 4 months ago

  • Status changed from New to Need More Info
  • Assignee set to Ilya Dryomov

yite gu wrote:

Why csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b have clone-child, but parent image already not exist.

Hi Yite,

It looks like a bug, but it's hard to tell what lead to it -- you are just dealing with the consequence here. To investigate, we would need to know the entire history of this parent/child image chain (including trash and flatten operations), whether any of those returned an error or suffered from an abrupt interruption in the past, etc.

the same issue as https://tracker.ceph.com/issues/52810

I don't think it's the same issue. https://tracker.ceph.com/issues/52810 was a race condition where "rbd info" could fail sporadically but always complete successfully on rerun. This isn't the case here: "rbd info" on csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b succeeds, but doesn't show "parent" and "overlap" fields despite "parent" key apparently being there on the image header. IIRC this can happen when the overlap becomes 0. Was csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b ever resized, specifically shrunk to 0?

Actions #6

Updated by yite gu 4 months ago

Yes, Now it seems that I also don't think it's the same issue. sorry, I can't find the logs indicating whether there were any errors or interruptions at that time, but I can confirm the user's operational process at that time:

1. create vm a, vm a have system disk rbd a
2. shutdown vm a
3. clone rbd b from rbd a, and flatten b
4. delete rbd a
5. rbd b as new vm OS image
5. clone rbd c from rbd b, and rbd c as system disk of vm a 
6. start vm a

Actions #7

Updated by yite gu 4 months ago

Ilya Dryomov wrote:

I don't think it's the same issue. https://tracker.ceph.com/issues/52810 was a race condition where "rbd info" could fail sporadically but always complete successfully on rerun. This isn't the case here: "rbd info" on csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b succeeds, but doesn't show "parent" and "overlap" fields despite "parent" key apparently being there on the image header. IIRC this can happen when the overlap becomes 0. Was csi-vol-7136789b-9fb4-11ee-a46b-7ab9ec75308b ever resized, specifically shrunk to 0?

haven't resize operation.

Actions #8

Updated by yite gu 4 months ago

yite gu wrote:

restore step:
1. remove parent key
[...]
2. open csi-snap-ad37e254-9fb4-11ee-a46b-7ab9ec75308b success.
[...]

Fortunately, after doing this, RBD was restored.

Actions

Also available in: Atom PDF