Bug #50248

closed

rgw-nfs daemons marked as stray

Added by Daniel Pivonka about 3 years ago. Updated almost 3 years ago.

Status: Resolved
Priority: Normal
Category: -
Target version: -
% Done: 0%
Source:
Tags:
Backport: pacific
Regression: No
Severity: 3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

[ceph: root@vm-00 /]# ceph -s
  cluster:
    id:     ede3e474-9890-11eb-9175-5254007bd8c8
    health: HEALTH_WARN
            1 stray daemon(s) not managed by cephadm

  services:
    mon:     3 daemons, quorum vm-00,vm-02,vm-01 (age 5m)
    mgr:     vm-00.gpoxjw(active, since 8m), standbys: vm-02.epkyvb
    osd:     3 osds: 3 up (since 5m), 3 in (since 5m)
    rgw:     2 daemons active (2 hosts, 1 zones)
    rgw-nfs: 1 daemon active (1 hosts, 1 zones)

  data:
    pools:   7 pools, 193 pgs
    objects: 349 objects, 28 KiB
    usage:   40 MiB used, 450 GiB / 450 GiB avail
    pgs:     193 active+clean

  io:
    client:   824 B/s rd, 1 op/s rd, 0 op/s wr

[ceph: root@vm-00 /]# ceph health detail
HEALTH_WARN 1 stray daemon(s) not managed by cephadm
[WRN] CEPHADM_STRAY_DAEMON: 1 stray daemon(s) not managed by cephadm
    stray daemon 14523 on host vm-01 not managed by cephadm
[ceph: root@vm-00 /]# ceph service status
{
    "rgw": {
        "14469": {
            "status_stamp": "2021-04-08T17:46:34.515799+0000",
            "last_beacon": "2021-04-08T17:46:34.515799+0000",
            "status": {
                "current_sync": "[]"
            }
        },
        "24272": {
            "status_stamp": "2021-04-08T17:46:29.569272+0000",
            "last_beacon": "2021-04-08T17:46:34.569691+0000",
            "status": {
                "current_sync": "[]"
            }
        }
    },
    "rgw-nfs": {
        "14523": {
            "status_stamp": "2021-04-08T17:46:31.862084+0000",
            "last_beacon": "2021-04-08T17:46:36.862415+0000",
            "status": {
                "current_sync": "[]"
            }
        }
    }
}
[ceph: root@vm-00 /]# ceph orch ps
NAME                                 HOST   PORTS          STATUS         REFRESHED  AGE  VERSION                IMAGE ID      CONTAINER ID  
alertmanager.vm-00                   vm-00  *:9093 *:9094  running (6m)   2m ago     9m   0.20.0                 0881eb8f169f  eab0ec8a7e2e  
crash.vm-00                          vm-00  -              running (9m)   2m ago     9m   17.0.0-2394-gc553763e  6376430f9659  df94ec196887  
crash.vm-01                          vm-01  -              running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  8b34f031b436  
crash.vm-02                          vm-02  -              running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  b8d161b39f56  
grafana.vm-00                        vm-00  *:3000         running (5m)   2m ago     8m   6.7.4                  80728b29ad3f  3049ae9860bc  
mgr.vm-00.gpoxjw                     vm-00  *:9283         running (10m)  2m ago     10m  17.0.0-2394-gc553763e  6376430f9659  67319da0d51b  
mgr.vm-02.epkyvb                     vm-02  *:8443 *:9283  running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  07017c8df192  
mon.vm-00                            vm-00  -              running (10m)  2m ago     10m  17.0.0-2394-gc553763e  6376430f9659  0cd485da81bd  
mon.vm-01                            vm-01  -              running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  0dd0ac6ef48e  
mon.vm-02                            vm-02  -              running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  9f2ec5545c33  
nfs.foo.vm-01                        vm-01  *:2049         running (3m)   3m ago     3m   3.5                    6376430f9659  4645204b975e  
node-exporter.vm-00                  vm-00  *:9100         running (8m)   2m ago     8m   0.18.1                 e5a616e4b9cf  b6b1b299f964  
node-exporter.vm-01                  vm-01  *:9100         running (6m)   3m ago     6m   0.18.1                 e5a616e4b9cf  77b15460b6b6  
node-exporter.vm-02                  vm-02  *:9100         running (6m)   3m ago     6m   0.18.1                 e5a616e4b9cf  0c00946ad0b0  
osd.0                                vm-02  -              running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  6851928922cf  
osd.1                                vm-01  -              running (6m)   3m ago     6m   17.0.0-2394-gc553763e  6376430f9659  4d9ea63707f8  
osd.2                                vm-00  -              running (6m)   2m ago     6m   17.0.0-2394-gc553763e  6376430f9659  454b4f605360  
prometheus.vm-00                     vm-00  *:9095         running (6m)   2m ago     8m   2.18.1                 de242295e225  e8644f0802f7  
rgw.example_service_id.vm-00.dlfbug  vm-00  *:80           running (3m)   2m ago     3m   17.0.0-2394-gc553763e  6376430f9659  e27b6ee4b0fe  
rgw.example_service_id.vm-01.oqigli  vm-01  *:80           running (3m)   3m ago     3m   17.0.0-2394-gc553763e  6376430f9659  16f7c564ab0f  
[ceph: root@vm-00 /]#

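The stray daemon 14523 reported by ceph health detail is the rgw-nfs entry shown in ceph service status, and ceph orch ps lists no daemon under that ID. It looks like the stray-daemon detection logic here is not handling this case: https://github.com/ceph/ceph/blob/master/src/pybind/mgr/cephadm/serve.py#L413-#L427

rgw-nfs needs to be included in the logic that converts the service map daemon_id to metadata['id'].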
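
The conversion described above can be sketched roughly as follows. This is a minimal illustration, not the code in serve.py: the names CONVERT_TYPES, managed_name, find_stray, and fake_get_metadata are hypothetical, the pairing of service map ID 14469 with the vm-00 rgw daemon is assumed for the example, and the rgw-nfs metadata id is a made-up placeholder because the report does not show it.

```python
from typing import AbstractSet, Callable, Dict, Iterable, List, Optional

Metadata = Optional[Dict[str, str]]

# Assumption: service types whose service-map id is a numeric handle that
# must be replaced by metadata['id'] before comparing with managed names.
CONVERT_TYPES: AbstractSet[str] = frozenset({"rgw", "rgw-nfs"})


def managed_name(daemon_type: str, daemon_id: str,
                 get_metadata: Callable[[str, str], Metadata],
                 convert_types: AbstractSet[str] = CONVERT_TYPES) -> str:
    """Build the '<type>.<id>' name to look up among managed daemons."""
    if daemon_type in convert_types:
        # Fall back to the raw id when no metadata (or no 'id' key) exists.
        metadata = get_metadata(daemon_type, daemon_id) or {}
        daemon_id = metadata.get("id", daemon_id)
    return f"{daemon_type}.{daemon_id}"


def find_stray(services: Iterable[Dict[str, str]],
               managed: AbstractSet[str],
               get_metadata: Callable[[str, str], Metadata],
               convert_types: AbstractSet[str] = CONVERT_TYPES) -> List[str]:
    """Return the names of daemons that are not in the managed set."""
    stray = []
    for svc in services:
        name = managed_name(svc["type"], svc["id"], get_metadata, convert_types)
        if name not in managed:
            stray.append(name)
    return stray


# Illustrative data only; the rgw-nfs metadata id is hypothetical.
FAKE_METADATA = {
    ("rgw", "14469"): {"id": "example_service_id.vm-00.dlfbug"},
    ("rgw-nfs", "14523"): {"id": "hypothetical-nfs-id"},
}


def fake_get_metadata(daemon_type: str, daemon_id: str) -> Metadata:
    return FAKE_METADATA.get((daemon_type, daemon_id))


services = [{"type": "rgw", "id": "14469"}, {"type": "rgw-nfs", "id": "14523"}]
managed = {"rgw.example_service_id.vm-00.dlfbug", "rgw-nfs.hypothetical-nfs-id"}

# Without rgw-nfs in the conversion set, the numeric id is compared as-is.
before = find_stray(services, managed, fake_get_metadata, frozenset({"rgw"}))
# With rgw-nfs included, the id is converted first and the daemon matches.
after = find_stray(services, managed, fake_get_metadata)
print(before)  # ['rgw-nfs.14523']
print(after)   # []
```

How the converted name lines up with the nfs.foo.vm-01 entry in ceph orch ps is not shown in this report, so the sketch leaves that mapping out.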

Related PRs: https://github.com/ceph/ceph/pull/40220 https://github.com/ceph/ceph/pull/37397

Actions #1

Updated by Daniel Pivonka about 3 years ago

  • Status changed from New to Fix Under Review
  • Backport set to pacific
  • Pull request ID set to 40711
Actions #2

Updated by Daniel Pivonka almost 3 years ago

  • Status changed from Fix Under Review to Resolved