Bug #64343
openExpected warnings that need to be whitelisted cause rados/cephadm tests to fail
0%
Description
Follows the merge of https://github.com/ceph/ceph/pull/41479 and https://github.com/ceph/ceph/pull/54312, there are some warnings that we expect to see in testing environments that are causing some tests to fail. They need to be whitelisted.
See https://pulpito.ceph.com/yuriw-2024-02-06_00:23:51-rados-wip-yuri10-testing-2024-02-02-1149-pacific-distro-default-smithi/7548075/ for examples.
Warnings include:
- MON_DOWN
- PG_AVAILABILITY
- OSD_DOWN
Updated by Laura Flores 3 months ago
- Related to Bug #64344: rados/cephadm/dashboard: test that expects a HOST_MAINTENANCE_MODE scenario fails due to warning in cluster log added
Updated by Laura Flores 3 months ago
Example of a main run with these failures: https://pulpito.ceph.com/yuriw-2024-02-07_00:12:44-rados-wip-yuri2-testing-2024-02-06-1154-distro-default-smithi/
Updated by Laura Flores 3 months ago
- Status changed from New to Fix Under Review
- Pull request ID set to 55498
Updated by Laura Flores 3 months ago
An alternative to reverting: https://github.com/ceph/ceph/pull/55507
Updated by Mark Nelson 3 months ago
I grepped through the example suite 'yuriw-2024-02-07_00:12:44-rados-wip-yuri2-testing-2024-02-06-1154-distro-default-smithi':
find . -name "teuthology.log" -exec grep -l -H 'FAIL' {} \; | xargs grep -l -H 'MON_DOWN' | wc -l 35
Those should be covered hopefully by #55507. Then I took the list of failures that did not have MON_DOWN in them and looked for the other two keywords:
cat ~/whitelist-testing/other-failures.txt | xargs grep -l -H 'OSD_DOWN' ./7549140/teuthology.log ./7549184/teuthology.log ./7549497/teuthology.log ./7549527/teuthology.log ./7549106/teuthology.log ./7549481/teuthology.log ./7549519/teuthology.log ./7549369/teuthology.log ./7549611/teuthology.log ./7549330/teuthology.log ./7549381/teuthology.log ./7549284/teuthology.log ./7549427/teuthology.log ./7549665/teuthology.log ./7549588/teuthology.log ./7549625/teuthology.log
and
cat ~/whitelist-testing/other-failures.txt | xargs grep -l -H 'PG_AVAILABILITY' ./7549519/teuthology.log ./7549164/teuthology.log
I think Laura has already looked through these, but I thought it might be helpful to have a record of potential other failure conditions.
Updated by Radoslaw Zarzynski 3 months ago
- Pull request ID changed from 55498 to 55507
Updated by Radoslaw Zarzynski 3 months ago
- Status changed from Fix Under Review to Pending Backport
Updated by Backport Bot 3 months ago
- Copied to Backport #64407: pacific: Expected warnings that need to be whitelisted cause rados/cephadm tests to fail added
Updated by Backport Bot 3 months ago
- Copied to Backport #64408: quincy: Expected warnings that need to be whitelisted cause rados/cephadm tests to fail added
Updated by Backport Bot 3 months ago
- Copied to Backport #64409: reef: Expected warnings that need to be whitelisted cause rados/cephadm tests to fail added
Updated by Konstantin Shalygin about 1 month ago
- Assignee set to Laura Flores
- Source set to Development