Bug #13213

closed

stuck recovering apparently due to pg_query not sent by async messenger?

Added by Samuel Just over 8 years ago. Updated over 8 years ago.

Status: Resolved
Priority: Urgent
Assignee:
Category: -
Target version: -
% Done: 0%
Source: other
Tags:
Backport:
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

pg 0.b7 seems to be stuck due to a pg_query not getting sent by async messenger.

The interval starts around 2015-09-23 01:22:40.720925, in epoch 1134. osd.0 never seems to receive the pg_query message sent at:

2015-09-23 01:22:40.733947 7f12b5581700 1 -- 10.214.136.2:6812/1005806 >> 10.214.136.2:6803/46542 conn(0x7f12cdd41000 sd=40 :-1 s=STATE_CONNECTING_WAIT_CONNECT_REPLY pgs=777 cs=2 l=0). tx 0x7f12d041cc00 pg_query(0.37,0.b7,1.1 epoch 1134) v3

(search forward for '10.214.136.2:6803/46542.*pg_query\(0.37,0.b7,1.1 epoch 1134\)' in ceph-osd.sorted)

ubuntu@teuthology:/a/samuelj-2015-09-22_22:28:00-rados-wip-sam-testing-distro-basic-multi/1065726/remote/ceph-osd.sorted
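
For anyone repeating the search outside a pager, here is a rough Python sketch of the same forward search. The function name `find_pg_query_lines` and the `start_ts` parameter are made up for illustration; the regex is the one quoted above, unchanged, and the sketch assumes every line in ceph-osd.sorted begins with a `YYYY-MM-DD HH:MM:SS.ffffff` timestamp so that string comparison orders lines by time.

```python
import re

# Pattern copied verbatim from the report. The unescaped dots in the
# address and pg ids match any character, which is looser than a literal
# match but harmless for this search.
PATTERN = re.compile(
    r"10.214.136.2:6803/46542.*pg_query\(0.37,0.b7,1.1 epoch 1134\)"
)


def find_pg_query_lines(lines, start_ts="2015-09-23 01:22:40.733947"):
    """Return (line_number, line) pairs at or after start_ts matching PATTERN.

    Assumption: each line starts with a fixed-width timestamp, so comparing
    the leading characters as strings is a chronological comparison.
    """
    hits = []
    for number, line in enumerate(lines, start=1):
        if line[:len(start_ts)] >= start_ts and PATTERN.search(line):
            hits.append((number, line.rstrip("\n")))
    return hits
```

Passing an open file object for ceph-osd.sorted as `lines` should list the send at 01:22:40.733947 and any later line that mentions the same peer and message. If the send is the only hit, that would be consistent with the message never being delivered, though it does not prove it.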
