Bug #18016

cephtool-test-mds.sh waiting for an active MDS daemon (intermittent)

Added by Loïc Dachary over 7 years ago. Updated over 7 years ago.

Status: Duplicate
Priority: Normal
Assignee: -
Category: -
Target version: -
% Done: 0%
Regression: No
Severity: 3 - minor

Description

/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:673: wait_mds_active:  echo 'waiting for an active MDS daemon (300/300)'
/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:674: wait_mds_active:  sleep 5
/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:679: wait_mds_active:  check_mds_active cephfs
/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:663: check_mds_active:  fs_name=cephfs
/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:664: check_mds_active:  ceph fs get cephfs
/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:664: check_mds_active:  grep active
2016-11-24 03:15:55.049103 7fddb1437700 -1 WARNING: the following dangerous and experimental features are enabled: *
2016-11-24 03:15:55.069154 7fddb1437700 -1 WARNING: the following dangerous and experimental features are enabled: *
/home/jenkins-build/build/workspace/ceph-pull-requests/qa/workunits/cephtool/test.sh:1: check_mds_active:  rm -fr /tmp/cephtool.VHi
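
For readers without the script at hand, the trace above is consistent with a polling loop of roughly the following shape. This is a minimal sketch reconstructed from the trace only, not the actual test.sh source: the function names, the `ceph fs get ... | grep active` check, the 300-attempt limit and the 5-second sleep come from the trace, while the MAX_TRIES and INTERVAL variables and the use of `grep -q` are assumptions added for illustration.

```shell
# Sketch reconstructed from the xtrace output; NOT the actual test.sh source.

# Succeeds if `ceph fs get <fs_name>` prints a line containing "active".
# The trace shows plain `grep active`; -q is an assumption to keep output quiet.
check_mds_active() {
    local fs_name=$1
    ceph fs get "$fs_name" | grep -q active
}

# Polls check_mds_active, sleeping between attempts.
# MAX_TRIES and INTERVAL are hypothetical knobs; their defaults (300 and 5)
# match the "(300/300)" counter and the "sleep 5" seen in the trace.
wait_mds_active() {
    local fs_name=$1
    local max_tries=${MAX_TRIES:-300}
    local interval=${INTERVAL:-5}
    local i
    for i in $(seq 1 "$max_tries"); do
        if check_mds_active "$fs_name"; then
            return 0
        fi
        echo "waiting for an active MDS daemon ($i/$max_tries)"
        sleep "$interval"
    done
    # One last check after the final sleep, as in the trace; its exit
    # status becomes the status of the whole wait.
    check_mds_active "$fs_name"
}
```

With the defaults, the loop gives up after about 25 minutes (300 attempts, 5 seconds apart), which is the point the trace has reached at "(300/300)".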

mds.txt.gz (154 KB) Loïc Dachary, 11/24/2016 06:08 AM

consoleText.3.gz (147 KB) Loïc Dachary, 12/02/2016 04:50 PM

History

#1 Updated by Loïc Dachary over 7 years ago

#3 Updated by John Spray over 7 years ago

Can't tell what was going on here without logs from the services (which afaik we don't gather in these situations?)

One of the main reasons an MDS might not become active would be if the OSDs were not accepting writes for some reason, so I would say it's 50:50 that this isn't a cephfs issue at all.
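
To tell the two cases apart the next time this happens, the test could record cluster state when wait_mds_active gives up. A rough sketch, assuming a hypothetical helper named dump_cluster_state and an output directory chosen by the caller; neither exists in test.sh as far as this report shows. The `ceph` subcommands used are standard ones.

```shell
# Hypothetical helper: snapshot cluster state into a directory so that a
# timeout can be attributed to the OSDs or to the MDS after the fact.
dump_cluster_state() {
    local fs_name=$1
    local out_dir=$2
    mkdir -p "$out_dir"
    # Overall health, including any blocked or degraded placement groups.
    ceph health detail     > "$out_dir/health.txt"   2>&1
    # How many OSDs are up and in.
    ceph osd stat          > "$out_dir/osd-stat.txt" 2>&1
    # MDS daemon states as seen by the monitors.
    ceph mds stat          > "$out_dir/mds-stat.txt" 2>&1
    # The same filesystem map that check_mds_active greps.
    ceph fs get "$fs_name" > "$out_dir/fs-get.txt"   2>&1
    # Best effort: a failing diagnostic command should not mask the
    # original test failure.
    return 0
}
```

It could be called as `wait_mds_active cephfs || dump_cluster_state cephfs /tmp/mds-timeout`, where the directory name is only an example.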

#4 Updated by Loïc Dachary over 7 years ago

  • Status changed from New to Need More Info

I'll keep posting on this issue whenever I find such an error. If nothing happens in a month or two we can probably forget about it.

#6 Updated by Loïc Dachary over 7 years ago

  • Subject changed from cephtool-test-mds.sh transient failure to cephtool-test-mds.sh waiting for an active MDS daemon (intermittent)
  • Status changed from Need More Info to 12

#7 Updated by Loïc Dachary over 7 years ago

@jcsp have you run into this before?

#8 Updated by John Spray over 7 years ago

  • Status changed from 12 to Duplicate

Will assume this is a duplicate of the issue fixed by https://github.com/ceph/ceph/pull/12234 unless we see evidence otherwise -- this may crop up again on PRs that haven't been rebased on master since the fix landed.
