Bug #17148

Fault from device name change

Added by Tyler Benster over 7 years ago. Updated almost 3 years ago.

Status:
Closed
Priority:
Normal
Assignee:
-
Category:
-
Target version:
-
% Done:

0%

Description

I used ceph-deploy to set up several OSDs with a journal on a separate SSD (sde). I then attached a new drive, which inadvertently changed the SSD's device name to sdf. As a result, running ceph on the command line gave a very undescriptive error:

2016-08-25 21:03:48.738442 7f0d285d9700 0 -- :/3678840000 >> 192.168.29.240:6789/0 pipe(0x7f0d24059a20 sd=3 :0 s=1 pgs=0 cs=0 l=1 c=0x7f0d2405ad00).fault
2016-08-25 21:03:51.738604 7f0d284d8700 0 -- :/3678840000 >> 192.168.29.240:6789/0 pipe(0x7f0d18000cc0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f0d18002000).fault
2016-08-25 21:03:54.738908 7f0d285d9700 0 -- :/3678840000 >> 192.168.29.240:6789/0 pipe(0x7f0d180052c0 sd=4 :0 s=1 pgs=0 cs=0 l=1 c=0x7f0d180065a0).fault
^CTraceback (most recent call last):
  File "/usr/bin/ceph", line 948, in <module>
    retval = main()
  File "/usr/bin/ceph", line 852, in main
    prefix='get_command_descriptions')
  File "/usr/lib/python2.7/dist-packages/ceph_argparse.py", line 1291, in json_command
    raise RuntimeError('"{0}": exception {1}'.format(argdict, e))
RuntimeError: "None": exception "['{"prefix": "get_command_descriptions"}']": exception You cannot perform that operation on a Rados object in state configuring.

Two suggestions:
- Ideally ceph should not crash, since the monitor should still be able to run. Alternatively, a real error message would be great.
- ceph-deploy should use a stable identifier such as a UUID.
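
To illustrate the second suggestion, here is a minimal sketch of how a kernel device name such as /dev/sde1 could be mapped to a name that survives re-enumeration. It assumes a Linux host where udev maintains symlinks under /dev/disk/by-partuuid; the helper name stable_names is hypothetical, and this is not how ceph-deploy actually resolves journal devices.

```python
import os


def stable_names(device, by_dir="/dev/disk/by-partuuid"):
    """Return the symlinks under by_dir that resolve to the given device.

    Hypothetical helper for illustration only. by_dir is assumed to be a
    udev-maintained directory of symlinks pointing at kernel device nodes.
    """
    target = os.path.realpath(device)
    if not os.path.isdir(by_dir):
        # No such directory (non-Linux host, or udev has not populated it).
        return []
    matches = []
    for name in sorted(os.listdir(by_dir)):
        link = os.path.join(by_dir, name)
        # A symlink matches when it points at the same device node.
        if os.path.realpath(link) == target:
            matches.append(link)
    return matches


if __name__ == "__main__":
    # Example device name taken from this report; the partition number is assumed.
    for path in stable_names("/dev/sde1"):
        print(path)
```

Recording the returned path instead of /dev/sde1 would mean that attaching another drive, and the resulting shift from sde to sdf, no longer changes what the journal setting points at.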
