Project

General

Profile

Actions

Bug #3050

closed

objecter: need to resend requests when we get first map

Added by Sage Weil over 11 years ago. Updated over 11 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description


2012-08-26 21:29:07.060609 7fd480998700  1 -- 10.3.64.22:6831/21296 <== mon.0 10.3.64.22:6789/0 3 ==== osd_map(5..5 src has 1..5) v3 ==== 1376+0+0 (2637291669 0 0) 0x38df400 con 0x38bda00
2012-08-26 21:29:07.061027 7fd480998700  3 mds.14.objecter handle_osd_map got epochs [5,5] > 0
2012-08-26 21:29:07.061033 7fd480998700  3 mds.14.objecter handle_osd_map decoding full epoch 5
2012-08-26 21:29:07.061511 7fd480998700 20 mds.14.objecter dump_active .. 21 homeless
2012-08-26 21:29:07.061537 7fd480998700 20 mds.14.objecter 1    0.0     osd.-1  20e.00000000    [writefull 0~84]
2012-08-26 21:29:07.061544 7fd480998700 20 mds.14.objecter 2    0.0     osd.-1  20e.00000001    [delete]
2012-08-26 21:29:07.061548 7fd480998700 20 mds.14.objecter 3    0.0     osd.-1  20e.00000002    [delete]
2012-08-26 21:29:07.061551 7fd480998700 20 mds.14.objecter 4    0.0     osd.-1  20e.00000003    [delete]
2012-08-26 21:29:07.061554 7fd480998700 20 mds.14.objecter 5    0.0     osd.-1  20e.00000004    [delete]
2012-08-26 21:29:07.061557 7fd480998700 20 mds.14.objecter 6    0.0     osd.-1  20e.00000005    [delete]
2012-08-26 21:29:07.061561 7fd480998700 20 mds.14.objecter 7    0.0     osd.-1  20e.00000006    [delete]
2012-08-26 21:29:07.061564 7fd480998700 20 mds.14.objecter 8    0.0     osd.-1  68c.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061574 7fd480998700 20 mds.14.objecter 9    0.0     osd.-1  68d.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061582 7fd480998700 20 mds.14.objecter 10   0.0     osd.-1  68e.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061590 7fd480998700 20 mds.14.objecter 11   0.0     osd.-1  68f.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061597 7fd480998700 20 mds.14.objecter 12   0.0     osd.-1  690.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061605 7fd480998700 20 mds.14.objecter 13   0.0     osd.-1  691.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061613 7fd480998700 20 mds.14.objecter 14   0.0     osd.-1  692.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061623 7fd480998700 20 mds.14.objecter 15   0.0     osd.-1  693.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061630 7fd480998700 20 mds.14.objecter 16   0.0     osd.-1  694.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061637 7fd480998700 20 mds.14.objecter 17   0.0     osd.-1  695.00000000    [setxattr path (13),setxattr parent (39),tmapup 0~0]
2012-08-26 21:29:07.061645 7fd480998700 20 mds.14.objecter 18   0.0     osd.-1  10e.00000000    [setxattr path (6),setxattr parent (13),tmapput 0~4089]
2012-08-26 21:29:07.061652 7fd480998700 20 mds.14.objecter 19   0.0     osd.-1  10e.00000000.inode      [writefull 0~399]
2012-08-26 21:29:07.061655 7fd480998700 20 mds.14.objecter 20   0.0     osd.-1  mds14_inotable  [writefull 0~29]
2012-08-26 21:29:07.061658 7fd480998700 20 mds.14.objecter 21   0.0     osd.-1  mds14_sessionmap        [writefull 0~17]

bug is in Objecter::handle_osd_map.. if our first map is a full map and there are already ops queued, they don't get evaluated.

reproduced this with

./stop.sh  ; CEPH_NUM_MDS=15 CEPH_NUM_MON=1 ./vstart.sh  -d -n -x -o 'debug monc = 20' -o 'debug objecter = 20' -o 'ms inject socket failures = 250'

Actions #1

Updated by Sage Weil over 11 years ago

  • Status changed from 12 to 7
Actions #2

Updated by Sage Weil over 11 years ago

  • Status changed from 7 to Resolved
Actions

Also available in: Atom PDF