Bug #19744

closed

mon: pg_create not re-sent after primary osd changed

Added by Sage Weil about 7 years ago. Updated about 7 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

This happened during an upgrade test, but I think the issue has nothing to do with upgrades.

pg_create arrives while the osd is starting (pool create and osd restart race! this is probably why the upgrade suite triggered it):

2017-04-20 20:35:45.300933 7fef1a23e700  1 -- 172.21.15.67:6811/13837 <== mon.0 172.21.15.47:6789/0 10 ==== osd_pg_create(e6 3.2:6 3.4:6 3.7:6) v3 ==== 235+0+0 (1372823504 0 0) 0x7fef36d83200 con 0x7fef36c4a000
2017-04-20 20:35:45.300946 7fef1a23e700 20 osd.1 5 OSD::ms_dispatch: osd_pg_create(e6 3.2:6 3.4:6 3.7:6) v3
2017-04-20 20:35:45.300950 7fef1a23e700 10 osd.1 5 do_waiters -- start
2017-04-20 20:35:45.300952 7fef1a23e700 10 osd.1 5 do_waiters -- finish
2017-04-20 20:35:45.300954 7fef1a23e700 20 osd.1 5 _dispatch 0x7fef36d83200 osd_pg_create(e6 3.2:6 3.4:6 3.7:6) v3
2017-04-20 20:35:45.300969 7fef1a23e700 10 osd.1 5 handle_pg_create osd_pg_create(e6 3.2:6 3.4:6 3.7:6) v3
2017-04-20 20:35:45.300974 7fef1a23e700 15 osd.1 5 require_same_or_newer_map 6 (i am 5) 0x7fef36d83200
2017-04-20 20:35:45.300976 7fef1a23e700  7 osd.1 5 waiting for newer map epoch 6 > my 5 with 0x7fef36d83200

the osd goes active:

2017-04-20 20:35:47.446319 7fef1f17e700  1 osd.1 8 state: booting -> active

and then we reprocess the message, which is rejected because its epoch (6) is older than the epoch at which the osd came up (8):

2017-04-20 20:35:47.456488 7fef1923c700 10 osd.1 8 handle_pg_create osd_pg_create(e6 3.2:6 3.4:6 3.7:6) v3
2017-04-20 20:35:47.456494 7fef1923c700 15 osd.1 8 require_same_or_newer_map 6 (i am 8) 0x7fef36d83200
2017-04-20 20:35:47.456496 7fef1923c700  7 osd.1 8 from pre-up epoch 6 < 8
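
The sequence in the logs above can be reproduced with a small toy model. This is only a sketch of the epoch gating as it appears in the log lines, not Ceph's actual code: the class `ToyOSD`, its fields, and the method names `handle_pg_create` and `advance_map` are hypothetical, and the two checks are inferred from the "waiting for newer map epoch" and "from pre-up epoch" messages.

```python
from dataclasses import dataclass, field


@dataclass
class ToyOSD:
    """Toy model of the epoch checks seen in the log; not Ceph's real code."""
    epoch: int                 # newest OSDMap epoch this OSD has processed
    up_epoch: int = 0          # epoch at which the OSD was marked up (0 = booting)
    waiting: list = field(default_factory=list)   # messages parked for a newer map
    created: list = field(default_factory=list)   # PGs that were actually created

    def handle_pg_create(self, msg_epoch, pgs):
        # "waiting for newer map epoch 6 > my 5": park the message.
        if msg_epoch > self.epoch:
            self.waiting.append((msg_epoch, pgs))
            return "waiting"
        # "from pre-up epoch 6 < 8": message predates this boot, discard it.
        if msg_epoch < self.up_epoch:
            return "dropped"
        self.created.extend(pgs)
        return "created"

    def advance_map(self, new_epoch, up_epoch=None):
        # Consume newer maps, optionally going active, then replay parked messages.
        self.epoch = new_epoch
        if up_epoch is not None:
            self.up_epoch = up_epoch
        queued, self.waiting = self.waiting, []
        return [self.handle_pg_create(e, p) for e, p in queued]


# Replay the timeline from the log: osd.1 is at epoch 5 when the e6 message arrives.
osd = ToyOSD(epoch=5)
first = osd.handle_pg_create(6, ["3.2", "3.4", "3.7"])
# The osd then catches up to epoch 8 and goes active at that epoch.
replayed = osd.advance_map(8, up_epoch=8)
print(first, replayed, osd.created)
```

In this model the first delivery is parked, the replay is dropped, and no PG is ever created, so the PGs stay uncreated unless the sender re-sends pg_create.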
