Project

General

Profile

Actions

Bug #41758

closed

Ceph status in some cases does not report slow ops

Added by Sridhar Seshasayee over 4 years ago. Updated over 4 years ago.

Status:
Duplicate
Priority:
High
Category:
Administration/Usability
Target version:
-
% Done:

0%

Source:
Development
Tags:
Backport:
mimic,nautilus
Regression:
No
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Monitor, OSD
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

In cases when only osds report slow ops, it is observed that ceph summary status doesn't report the same. This issue was reported on mimic #40993 and upon further investigation the same issue was also seen to be present on the master. This tracker will be used to track the fix on the master and then back-ported to all appropriate downstream branches where the issue is present.


Related issues 2 (0 open2 closed)

Is duplicate of mgr - Bug #41741: Slow op warning does not display correctlyResolved09/10/2019

Actions
Copied to RADOS - Backport #40993: mimic: Ceph status in some cases does not report slow opsRejectedActions
Actions #1

Updated by Sridhar Seshasayee over 4 years ago

  • Related to Backport #40993: mimic: Ceph status in some cases does not report slow ops added
Actions #2

Updated by Sridhar Seshasayee over 4 years ago

  • Pull request ID set to 30337
Actions #3

Updated by Sridhar Seshasayee over 4 years ago

After applying the fix, health warning pertaining to slow ops show up as shown below,

$ bin/ceph -s
*** DEVELOPER MODE: setting PATH, PYTHONPATH and LD_LIBRARY_PATH ***
2019-09-11T16:53:32.198+0530 7f65a12c1700 -1 WARNING: all dangerous and experimental features are enabled.
2019-09-11T16:53:32.217+0530 7f65a12c1700 -1 WARNING: all dangerous and experimental features are enabled.
  cluster:
    id:     e093c322-66ca-4a6c-b4e2-19b8c373472d
    health: HEALTH_WARN
            55 slow ops, oldest one blocked for 5 sec, daemons [osd,0,osd,1,osd,2,mon,a,mon,b,mon,c] have slow ops.

  services:
    mon: 3 daemons, quorum a,b,c (age 2m)
    mgr: x(active, since 2m)
    mds: a:1 {0=a=up:active} 2 up:standby
    osd: 3 osds: 3 up (since 66s), 3 in (since 66s)

  task status:
    scrub status:
        mds.0: idle

  data:
    pools:   3 pools, 34 pgs
    objects: 2.38k objects, 1.2 MiB
    usage:   6.1 GiB used, 3.0 TiB / 3.0 TiB avail
    pgs:     34 active+clean

  io:
    client:   103 KiB/s wr, 0 op/s rd, 103 op/s wr
Actions #4

Updated by Neha Ojha over 4 years ago

  • Status changed from In Progress to Fix Under Review
Actions #5

Updated by Neha Ojha over 4 years ago

  • Backport changed from mimic to mimic,nautilus
Actions #6

Updated by Nathan Cutler over 4 years ago

  • Related to deleted (Backport #40993: mimic: Ceph status in some cases does not report slow ops)
Actions #7

Updated by Nathan Cutler over 4 years ago

  • Copied to Backport #40993: mimic: Ceph status in some cases does not report slow ops added
Actions #8

Updated by Kefu Chai over 4 years ago

  • Status changed from Fix Under Review to Duplicate
Actions #9

Updated by Kefu Chai over 4 years ago

  • Is duplicate of Bug #41741: Slow op warning does not display correctly added
Actions

Also available in: Atom PDF