Project

General

Profile

Actions

Bug #19119

closed

pre-jewel "osd rm" incrementals are misinterpreted

Added by Ilya Dryomov about 7 years ago. Updated almost 7 years ago.

Status:
Resolved
Priority:
Immediate
Assignee:
Category:
OSDMap
Target version:
-
% Done:

0%

Source:
Tags:
Backport:
kraken,jewel
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

I have a bunch of misdirected requests from a recent kernel client to a hammer cluster, triggered by osd rm:

2017-02-27 15:37:56.976845 osd.190 10.115.1.133:6808/3914 97 : cluster [WRN] client.9450549 10.115.1.35:0/1493770383 misdirected client.9450549.1:1379645861 pg 2.ec640804 to osd.190 in e241865, client e241865 pg 2.804 features 288863570635346

e241864 -> e241865 incremental:

{
    "epoch": 241865,
    "fsid": "9e3e9015-f626-4a44-83f7-0a939ef7ec02",
    "modified": "2017-02-27 11:07:56.497658",
    "new_pool_max": -1,
    "new_flags": -1,
    "new_max_osd": -1,
    "new_pools": [],
    "new_pool_names": [],
    "old_pools": [],
    "new_up_osds": [],
    "new_weight": [],
    "osd_state_xor": [
        {
            "osd": 204,
            "state_xor": [
                "autoout",
                "exists" 
            ]
        },
        [],
        [],
        [],
        [],
        [],
        [],
        [],
        [],
        [
            {
                "osd": 204,
                "uuid": "00000000-0000-0000-0000-000000000000" 
            }
        ],
        {},
        []
    ]

On master:

$ bin/osdmaptool --test-map-pg 2.ec640804 /tmp/map-241864.bin 
bin/osdmaptool: osdmap file '/tmp/map-241864.bin'
 parsed '2.ec640804' -> 2.ec640804
2.ec640804 raw ([197,201,1], p197) up ([197,201,1], p197) acting ([197,201,1], p197)
$ bin/osdmaptool --test-map-pg 2.ec640804 /tmp/map-241865.bin 
bin/osdmaptool: osdmap file '/tmp/map-241865.bin'
 parsed '2.ec640804' -> 2.ec640804
2.ec640804 raw ([197,201,1], p197) up ([197,201,1], p197) acting ([197,201,1], p197)

but (with osdmaptool patched to accept and apply incrementals):

$ bin/osdmaptool --test-map-pg 2.ec640804 /tmp/map-241864.bin /tmp/inc-241865.bin 
bin/osdmaptool: osdmap file '/tmp/map-241864.bin'
bin/osdmaptool: incremental file '/tmp/inc-241865.bin'
 parsed '2.ec640804' -> 2.ec640804
2.ec640804 raw ([190,1], p190) up ([190,1], p190) acting ([190,1], p190)

which is where the misdirected request was sent.

On hammer:

$ ./osdmaptool --test-map-pg 2.ec640804 /tmp/map-241864.bin
./osdmaptool: osdmap file '/tmp/map-241864.bin'
 parsed '2.ec640804' -> 2.ec640804
2.ec640804 raw ([197,201,1], p197) up ([197,201,1], p197) acting ([197,201,1], p197)
$ ./osdmaptool --test-map-pg 2.ec640804 /tmp/map-241865.bin
./osdmaptool: osdmap file '/tmp/map-241865.bin'
 parsed '2.ec640804' -> 2.ec640804
2.ec640804 raw ([197,201,1], p197) up ([197,201,1], p197) acting ([197,201,1], p197)

and (same osdmaptool patch):

$ ./osdmaptool --test-map-pg 2.ec640804 /tmp/map-241864.bin /tmp/inc-241865.bin
./osdmaptool: osdmap file '/tmp/map-241864.bin'
./osdmaptool: incremental file '/tmp/inc-241865.bin'
 parsed '2.ec640804' -> 2.ec640804
2.ec640804 raw ([197,201,1], p197) up ([197,201,1], p197) acting ([197,201,1], p197)

Files

osdmaptool.diff (2.08 KB) osdmaptool.diff Ilya Dryomov, 03/01/2017 06:45 PM

Related issues 3 (0 open3 closed)

Related to Ceph - Bug #13988: new OSD re-using old OSD id fails to bootResolvedLoïc Dachary12/05/2015

Actions
Copied to Ceph - Backport #19209: kraken: pre-jewel "osd rm" incrementals are misinterpretedResolvedShinobu KinjoActions
Copied to Ceph - Backport #19210: jewel: pre-jewel "osd rm" incrementals are misinterpretedResolvedShinobu KinjoActions
Actions

Also available in: Atom PDF