Project

General

Profile

Bug #38702

test_selftest_cluster_log (tasks.mgr.test_module_selftest.TestModuleSelftest) fails in vstart

Added by Volker Theile over 1 year ago. Updated over 1 year ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
testing
Target version:
% Done:

0%

Source:
Development
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature:

Description

The QA cluster log test (https://github.com/ceph/ceph/blob/master/qa/tasks/mgr/test_module_selftest.py#L292) fails because the command 'ceph -w --watch-channel cluster|audit' does not display the events immediatelly. The QA backend defines a timeout of 15 seconds (https://github.com/ceph/ceph/blob/master/qa/tasks/ceph_test_case.py#L52) to allow the command to display the event, otherwise it fails.

If you do the tests manually, it sometimes takes 10 seconds until the event is displayed.

Start a vstart cluster and execute the following commands:

$ ceph --watch --watch-channel "*"

Run the following commands in a separate shell:

$ ceph mgr module enable selftest
$ ceph mgr self-test cluster-log audit info "foo bar info"
$ ceph mgr self-test cluster-log cluster error "foo bar err"
$ ceph mgr self-test cluster-log ...

In Ceph Mimic the events are displayed much faster, and the QA test does not fail. Increasing the timeout to 20seconds or more does not make sense IMHO.


Related issues

Related to mgr - Feature #38394: mgr/dashboard: QA - try to run mgr Teuthology tests as part of run-backend-api-tests.sh Resolved 02/20/2019
Related to mgr - Bug #46221: test_selftest_cluster_log (tasks.mgr.test_module_selftest.TestModuleSelftest): Expected log message not found New

History

#1 Updated by Volker Theile over 1 year ago

  • Description updated (diff)

#2 Updated by Alfonso Martínez over 1 year ago

  • Related to Feature #38394: mgr/dashboard: QA - try to run mgr Teuthology tests as part of run-backend-api-tests.sh added

#3 Updated by Alfonso Martínez over 1 year ago

The fact is that if you run vstart_runner.py with this watcher:

bin/ceph -w --watch-channel "*"

the tests pass (the default timeout is enough).

But if you run:

bin/ceph -w --watch-channel cluster

or just:
bin/ceph -w

and in another terminal:
bin/ceph mgr self-test cluster-log cluster error "foo bar err"

You don't see the notifcication.

Is this expected behaviour?

#4 Updated by Alfonso Martínez over 1 year ago

@Sage Weil pointed out that in python 3 the "watch_channel" ceph arg was not being parsed correctly (even default value).

This PR fixes it:
https://github.com/ceph/ceph/pull/26949

With the fix, the related tests pass.

So this issue can be focused on analyzing the increase of time when displaying events.

#5 Updated by Sebastian Wagner 3 months ago

  • Related to Bug #46221: test_selftest_cluster_log (tasks.mgr.test_module_selftest.TestModuleSelftest): Expected log message not found added

Also available in: Atom PDF