Bug #9470

closed

daemon pid file is not being created when running service ceph

Added by Kenneth Waegeman over 9 years ago. Updated over 9 years ago.

Status:
Resolved
Priority:
High
Assignee:
Category:
common
Target version:
-
% Done:

100%

Source:
Community (user)
Tags:
Backport:
Regression:
Severity:
2 - major
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hi,

We have been seeing some strange issues with the latest version(s) of ceph. I'm testing on 0.85 right now, and neither the logging system nor the service commands are working properly:
- Logging stops after some time
- The service restart prints the expected output:
=== osd.39 ===
Stopping Ceph osd.39 on ceph003...done
=== osd.39 ===
Starting Ceph osd.39 on ceph003...
Running as unit run-60713.service.
BUT the service is not actually restarted (`ps` shows the same start time and process ID after the 'restart')

I think we found what could be causing this: all the processes are run with the --pid-file option, but the pid file is not created:

[root@ceph003 ~]# ps -ef | grep ceph
avahi 2035 1 0 Sep11 ? 00:00:00 avahi-daemon: running [ceph003.local]
root 5940 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-mon -i ceph003 --pid-file /var/run/ceph/mon.ceph003.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 5943 5940 0 Sep11 ? 00:21:30 /usr/bin/ceph-mon -i ceph003 --pid-file /var/run/ceph/mon.ceph003.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 21759 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 32 --pid-file /var/run/ceph/osd.32.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 21763 21759 1 Sep11 ? 01:09:16 /usr/bin/ceph-osd -i 32 --pid-file /var/run/ceph/osd.32.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 24415 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 33 --pid-file /var/run/ceph/osd.33.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 24419 24415 35 Sep11 ? 1-10:19:39 /usr/bin/ceph-osd -i 33 --pid-file /var/run/ceph/osd.33.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 27095 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 34 --pid-file /var/run/ceph/osd.34.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 27099 27095 1 Sep11 ? 01:07:34 /usr/bin/ceph-osd -i 34 --pid-file /var/run/ceph/osd.34.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 40530 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 39 --pid-file /var/run/ceph/osd.39.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 40533 40530 13 Sep11 ? 13:24:44 /usr/bin/ceph-osd -i 39 --pid-file /var/run/ceph/osd.39.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 43233 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 40 --pid-file /var/run/ceph/osd.40.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 43236 43233 8 Sep11 ? 08:16:53 /usr/bin/ceph-osd -i 40 --pid-file /var/run/ceph/osd.40.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 54164 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 44 --pid-file /var/run/ceph/osd.44.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 54168 54164 16 Sep11 ? 15:51:46 /usr/bin/ceph-osd -i 44 --pid-file /var/run/ceph/osd.44.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 60850 56258 0 13:48 pts/1 00:00:00 grep --color=auto ceph
root 62398 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-osd -i 47 --pid-file /var/run/ceph/osd.47.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 62402 62398 11 Sep11 ? 11:06:09 /usr/bin/ceph-osd -i 47 --pid-file /var/run/ceph/osd.47.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 62931 1 0 Sep11 ? 00:00:00 /bin/bash -c ulimit -n 32768; /usr/bin/ceph-mds -i ceph003 --pid-file /var/run/ceph/mds.ceph003.pid -c /etc/ceph/ceph.conf --cluster ceph -f
root 62935 62931 4 Sep11 ? 04:36:20 /usr/bin/ceph-mds -i ceph003 --pid-file /var/run/ceph/mds.ceph003.pid -c /etc/ceph/ceph.conf --cluster ceph -f
[root@ceph003 ~]# ls /var/run/ceph/osd.44.pid
ls: cannot access /var/run/ceph/osd.44.pid: No such file or directory
[root@ceph003 ~]# ls /var/run/ceph/*
/var/run/ceph/ceph-mds.ceph003.asok /var/run/ceph/ceph-osd.34.asok /var/run/ceph/ceph-osd.38.asok /var/run/ceph/ceph-osd.42.asok /var/run/ceph/ceph-osd.46.asok
/var/run/ceph/ceph-mon.ceph003.asok /var/run/ceph/ceph-osd.35.asok /var/run/ceph/ceph-osd.39.asok /var/run/ceph/ceph-osd.43.asok /var/run/ceph/ceph-osd.47.asok
/var/run/ceph/ceph-osd.32.asok /var/run/ceph/ceph-osd.36.asok /var/run/ceph/ceph-osd.40.asok /var/run/ceph/ceph-osd.44.asok
/var/run/ceph/ceph-osd.33.asok /var/run/ceph/ceph-osd.37.asok /var/run/ceph/ceph-osd.41.asok /var/run/ceph/ceph-osd.45.asok
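
The manual check above (compare the `--pid-file` arguments in the `ps` output against what actually exists in /var/run/ceph) could be automated along these lines. This is only a minimal sketch: the helper names `pid_files_from_ps` and `missing_pid_files` are hypothetical and not part of Ceph, and it assumes the daemons' command lines have the same shape as in the `ps -ef` listing above.

```python
import os
import re

# Matches "--pid-file <path>" or "--pid-file=<path>" on a command line.
PID_FILE_RE = re.compile(r"--pid-file[ =](\S+)")


def pid_files_from_ps(ps_output):
    """Return the sorted, de-duplicated pid-file paths named in `ps -ef` output.

    Hypothetical helper: the bash wrapper and the daemon itself both carry
    the same --pid-file argument, so duplicates are collapsed with a set.
    """
    paths = set()
    for line in ps_output.splitlines():
        match = PID_FILE_RE.search(line)
        if match:
            paths.add(match.group(1))
    return sorted(paths)


def missing_pid_files(ps_output, exists=os.path.exists):
    """Return the pid-file paths that processes were started with but that
    do not exist on disk. `exists` is injectable so the check can be tested
    without touching the real filesystem."""
    return [path for path in pid_files_from_ps(ps_output) if not exists(path)]
```

Fed the `ps -ef` output from an affected node, `missing_pid_files` would be expected to list every path under /var/run/ceph that a daemon was started with but that was never written, such as /var/run/ceph/osd.44.pid in the listing above.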

What could be the problem here?
This is running on CentOS 7, kernel 3.10.0-123.el7.x86_64

Many thanks!


Related issues 1 (0 open, 1 closed)

Has duplicate Ceph - Bug #9509: init script cannot stop OSDs (Resolved, 09/17/2014)

Actions #1

Updated by Greg Farnum over 9 years ago

  • Status changed from New to Resolved

This was fixed by bccb0eb64891f65fd475e96b6386494044cae8c1, which will be in Giant.

Actions #2

Updated by Loïc Dachary over 9 years ago

  • Status changed from Resolved to Pending Backport
  • Assignee set to Loïc Dachary
Actions #3

Updated by Loïc Dachary over 9 years ago

  • Status changed from Pending Backport to Resolved
  • % Done changed from 0 to 100