Actions
Bug #3050
closedobjecter: need to resend requests when we get first map
% Done:
0%
Source:
Development
Tags:
Backport:
Regression:
Severity:
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):
Description
2012-08-26 21:29:07.060609 7fd480998700 1 -- 10.3.64.22:6831/21296 <== mon.0 10.3.64.22:6789/0 3 ==== osd_map(5..5 src has 1..5) v3 ==== 1376+0+0 (2637291669 0 0) 0x38df400 con 0x38bda00 2012-08-26 21:29:07.061027 7fd480998700 3 mds.14.objecter handle_osd_map got epochs [5,5] > 0 2012-08-26 21:29:07.061033 7fd480998700 3 mds.14.objecter handle_osd_map decoding full epoch 5 2012-08-26 21:29:07.061511 7fd480998700 20 mds.14.objecter dump_active .. 21 homeless 2012-08-26 21:29:07.061537 7fd480998700 20 mds.14.objecter 1 0.0 osd.-1 20e.00000000 [writefull 0~84] 2012-08-26 21:29:07.061544 7fd480998700 20 mds.14.objecter 2 0.0 osd.-1 20e.00000001 [delete] 2012-08-26 21:29:07.061548 7fd480998700 20 mds.14.objecter 3 0.0 osd.-1 20e.00000002 [delete] 2012-08-26 21:29:07.061551 7fd480998700 20 mds.14.objecter 4 0.0 osd.-1 20e.00000003 [delete] 2012-08-26 21:29:07.061554 7fd480998700 20 mds.14.objecter 5 0.0 osd.-1 20e.00000004 [delete] 2012-08-26 21:29:07.061557 7fd480998700 20 mds.14.objecter 6 0.0 osd.-1 20e.00000005 [delete] 2012-08-26 21:29:07.061561 7fd480998700 20 mds.14.objecter 7 0.0 osd.-1 20e.00000006 [delete] 2012-08-26 21:29:07.061564 7fd480998700 20 mds.14.objecter 8 0.0 osd.-1 68c.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061574 7fd480998700 20 mds.14.objecter 9 0.0 osd.-1 68d.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061582 7fd480998700 20 mds.14.objecter 10 0.0 osd.-1 68e.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061590 7fd480998700 20 mds.14.objecter 11 0.0 osd.-1 68f.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061597 7fd480998700 20 mds.14.objecter 12 0.0 osd.-1 690.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061605 7fd480998700 20 mds.14.objecter 13 0.0 osd.-1 691.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061613 7fd480998700 20 mds.14.objecter 14 0.0 osd.-1 692.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061623 7fd480998700 20 mds.14.objecter 15 0.0 osd.-1 693.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061630 7fd480998700 20 mds.14.objecter 16 0.0 osd.-1 694.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061637 7fd480998700 20 mds.14.objecter 17 0.0 osd.-1 695.00000000 [setxattr path (13),setxattr parent (39),tmapup 0~0] 2012-08-26 21:29:07.061645 7fd480998700 20 mds.14.objecter 18 0.0 osd.-1 10e.00000000 [setxattr path (6),setxattr parent (13),tmapput 0~4089] 2012-08-26 21:29:07.061652 7fd480998700 20 mds.14.objecter 19 0.0 osd.-1 10e.00000000.inode [writefull 0~399] 2012-08-26 21:29:07.061655 7fd480998700 20 mds.14.objecter 20 0.0 osd.-1 mds14_inotable [writefull 0~29] 2012-08-26 21:29:07.061658 7fd480998700 20 mds.14.objecter 21 0.0 osd.-1 mds14_sessionmap [writefull 0~17]
bug is in Objecter::handle_osd_map.. if our first map is a full map and there are already ops queued, they don't get evaluated.
reproduced this with
./stop.sh ; CEPH_NUM_MDS=15 CEPH_NUM_MON=1 ./vstart.sh -d -n -x -o 'debug monc = 20' -o 'debug objecter = 20' -o 'ms inject socket failures = 250'
Actions