Project

General

Profile

Actions

Bug #15679

closed

ceph-qa-suite do_pg_scrub() does nothing due to scrub stamp change

Added by David Zafman almost 8 years ago. Updated almost 8 years ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

2016-05-01T02:38:57.946 INFO:tasks.ceph.ceph_manager:waiting for scrub type deep-scrub
2016-05-01T02:38:57.947 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph osd dump --format=json'
2016-05-01T02:38:58.199 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph pg deep-scrub 4.0'
2016-05-01T02:38:58.333 INFO:teuthology.orchestra.run.smithi067.stderr:instructing pg 4.0 on osd.4 to deep-scrub
2016-05-01T02:39:06.574 INFO:tasks.ceph.osd.4.smithi030.stderr:2016-05-01 09:39:06.575235 7fbe0baa2700 -1 log_channel(cluster) log [ERR] : 4.0 shard 0: soid 4:a0216fbc:::repair_test_obj:head candidate had a read error
2016-05-01T02:39:06.575 INFO:tasks.ceph.osd.4.smithi030.stderr:2016-05-01 09:39:06.575665 7fbe0baa2700 -1 log_channel(cluster) log [ERR] : 4.0 deep-scrub 0 missing, 1 inconsistent objects
2016-05-01T02:39:06.575 INFO:tasks.ceph.osd.4.smithi030.stderr:2016-05-01 09:39:06.575675 7fbe0baa2700 -1 log_channel(cluster) log [ERR] : 4.0 deep-scrub 1 errors

No repair happened below. Possibly because a schedule deep-scrub made do_pg_scrub() see a scrub stamp change

2016-05-01T02:39:09.072 INFO:tasks.repair_test:repairing
2016-05-01T02:39:09.072 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph osd dump --format=json'
2016-05-01T02:39:09.220 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph pg dump --format=json'
2016-05-01T02:39:09.357 INFO:teuthology.orchestra.run.smithi067.stderr:dumped all in format json
2016-05-01T02:39:09.370 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph osd dump --format=json'
2016-05-01T02:39:12.521 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph pg dump --format=json'
2016-05-01T02:39:12.667 INFO:teuthology.orchestra.run.smithi067.stderr:dumped all in format json
2016-05-01T02:39:12.679 INFO:tasks.repair_test:re-scrubbing

2016-05-01T02:39:13.585 INFO:teuthology.orchestra.run.smithi067.stderr:instructing pg 4.0 on osd.4 to deep-scrub
2016-05-01T02:39:16.576 INFO:tasks.ceph.osd.4.smithi030.stderr:2016-05-01 09:39:16.577236 7fbe0e2a7700 -1 log_channel(cluster) log [ERR] : 4.0 shard 0: soid 4:a0216fbc:::repair_test_obj:head candidate had a read error
2016-05-01T02:39:16.577 INFO:tasks.ceph.osd.4.smithi030.stderr:2016-05-01 09:39:16.577680 7fbe0e2a7700 -1 log_channel(cluster) log [ERR] : 4.0 deep-scrub 0 missing, 1 inconsistent objects
2016-05-01T02:39:16.578 INFO:tasks.ceph.osd.4.smithi030.stderr:2016-05-01 09:39:16.577690 7fbe0e2a7700 -1 log_channel(cluster) log [ERR] : 4.0 deep-scrub 1 errors

2016-05-01T02:39:23.596 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph osd dump --format=json'
2016-05-01T02:39:23.677 INFO:teuthology.orchestra.run.smithi067.stderr:2016-05-01 09:39:23.666872 7f7379ffb700 0 monclient: hunting for new mon
2016-05-01T02:39:23.753 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph pg dump --format=json'
2016-05-01T02:39:24.143 INFO:teuthology.orchestra.run.smithi067.stderr:dumped all in format json
2016-05-01T02:39:24.155 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph osd dump --format=json'
2016-05-01T02:39:24.307 INFO:teuthology.orchestra.run.smithi067:Running: 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 120 ceph pg dump --format=json'
2016-05-01T02:39:24.446 INFO:teuthology.orchestra.run.smithi067.stderr:dumped all in format json
2016-05-01T02:39:24.458 ERROR:teuthology.run_tasks:Saw exception from tasks.
Traceback (most recent call last):
File "/home/teuthworker/src/teuthology_master/teuthology/run_tasks.py", line 66, in run_tasks
manager = run_one_task(taskname, ctx=ctx, config=config)
File "/home/teuthworker/src/teuthology_master/teuthology/run_tasks.py", line 45, in run_one_task
return fn(**kwargs)
File "/var/lib/teuthworker/src/ceph-qa-suite_wip-8885/tasks/repair_test.py", line 304, in task
repair_test_1(ctx, dataerr, choose_replica, "deep-scrub")
File "/var/lib/teuthworker/src/ceph-qa-suite_wip-8885/tasks/repair_test.py", line 108, in repair_test_1
assert not ctx.manager.pg_inconsistent(pool, 0)
AssertionError


Related issues 1 (0 open1 closed)

Copied to Ceph - Bug #16847: hammer: ceph-qa-suite do_pg_scrub() does nothing due to scrub stamp changeResolvedNathan Cutler05/01/2016

Actions
Actions #1

Updated by David Zafman almost 8 years ago

  • Status changed from In Progress to 7
Actions #2

Updated by Sage Weil almost 8 years ago

  • Priority changed from Normal to High
Actions #3

Updated by David Zafman almost 8 years ago

  • Status changed from 7 to Resolved
Actions #4

Updated by Nathan Cutler over 7 years ago

  • Copied to Bug #16847: hammer: ceph-qa-suite do_pg_scrub() does nothing due to scrub stamp change added
Actions

Also available in: Atom PDF