Actions
Bug #53723
closedCephadm agent fails to report and causes a health timeout
% Done:
0%
Source:
Tags:
Backport:
quincy
Regression:
No
Severity:
3 - minor
Reviewed:
Description
/a/yuriw-2021-12-22_22:11:35-rados-wip-yuri3-testing-2021-12-22-1047-distro-default-smithi/6580439
Description: rados/cephadm/workunits/{agent/on mon_election/connectivity task/test_orch_cli}
Failure reason: timeout expired in wait_until_healthy
2021-12-23T06:18:45.300 INFO:teuthology.orchestra.run.smithi068.stdout:
2021-12-23T06:18:45.300 INFO:teuthology.orchestra.run.smithi068.stdout:{"status":"HEALTH_WARN","checks":{"CEPHADM_AGENT_DOWN":{"severity":"HEALTH_WARN","summary":{"message":"1 Cephadm Agent(s) are not reporting. Hosts may be offline","count":1},"muted":false},"CEPHADM_FAILED_DAEMON":{"severity":"HEALTH_WARN","summary":{"message":"1 failed cephadm daemon(s)","count":1},"muted":false}},"mutes":[]}
2021-12-23T06:18:45.323 INFO:journalctl@ceph.mon.a.smithi068.stdout:Dec 23 06:18:44 smithi068 bash[14626]: cluster 2021-12-23T06:18:43.922425+0000 mgr.a (mgr.14150) 357 : cluster [DBG] pgmap v343: 1 pgs: 1 active+clean; 577 KiB data, 17 MiB used, 268 GiB / 268 GiB avail
2021-12-23T06:18:46.323 INFO:journalctl@ceph.mon.a.smithi068.stdout:Dec 23 06:18:46 smithi068 bash[14626]: audit 2021-12-23T06:18:45.297779+0000 mon.a (mon.0) 347 : audit [DBG] from='client.? 172.21.15.68:0/3865627448' entity='client.admin' cmd=[{"prefix": "health", "format": "json"}]: dispatch
2021-12-23T06:18:46.726 INFO:tasks.cephadm:Teardown begin
2021-12-23T06:18:46.727 ERROR:teuthology.contextutil:Saw exception from nested tasks
Traceback (most recent call last):
File "/home/teuthworker/src/git.ceph.com_git_teuthology_95a7d4799b562f3bbb5ec66107094963abd62fa1/teuthology/contextutil.py", line 33, in nested
yield vars
File "/home/teuthworker/src/github.com_ceph_ceph-c_1121b3c9661a85cfbc852d654ea7d22c1d1be751/qa/tasks/cephadm.py", line 1548, in task
healthy(ctx=ctx, config=config)
File "/home/teuthworker/src/github.com_ceph_ceph-c_1121b3c9661a85cfbc852d654ea7d22c1d1be751/qa/tasks/ceph.py", line 1469, in healthy
manager.wait_until_healthy(timeout=300)
File "/home/teuthworker/src/github.com_ceph_ceph-c_1121b3c9661a85cfbc852d654ea7d22c1d1be751/qa/tasks/ceph_manager.py", line 3146, in wait_until_healthy
'timeout expired in wait_until_healthy'
AssertionError: timeout expired in wait_until_healthy
Actions