Project

General

Profile

Backport #11975

mon crashes when "ceph osd tree 85 --format json"

Added by Kefu Chai over 3 years ago. Updated over 3 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Target version:
Release:
hammer


Related issues

Copied from Ceph - Bug #11680: mon crashes when "ceph osd tree 85 --format json" Can't reproduce 05/19/2015

Associated revisions

Revision 2aaeea14 (diff)
Added by Loic Dachary over 3 years ago

tests: TEST_crush_reject_empty must not run a mon

  • Back in Hammer, the osd-crush.sh individual tests did not run the
    monitor, it was taken care of by the run() function. An attempt to run
    another mon fails with:

    error: IO lock testdir/osd-crush/a/store.db/LOCK: Resource temporarily
    unavailable

    This problem was introduced by cc1cc033930e8690a57674e842a003f6bbc7a242
    from https://github.com/ceph/ceph/pull/4936

  • replace test/mon/mon-test-helpers.sh with test/ceph-helpers.sh as
    we need run_osd() in this newly added test
  • update the run-dir of commands: ceph-helpers.sh use the different
    convention for the run-dir of daemons.

http://tracker.ceph.com/issues/11975 Refs: #11975

Signed-off-by: Loic Dachary <>

History

#1 Updated by Kefu Chai over 3 years ago

  • Tracker changed from Bug to Backport

#2 Updated by Kefu Chai over 3 years ago

  • Priority changed from Normal to High
  • Target version set to v0.94.3

#4 Updated by Kefu Chai over 3 years ago

  • Priority changed from High to Urgent
  • Release set to hammer

#6 Updated by Loic Dachary over 3 years ago

  • File deleted (file_1222509.log.txt)

#7 Updated by Loic Dachary over 3 years ago

  • File deleted (ceph-mon.tgz)

#8 Updated by Loic Dachary over 3 years ago

  • File deleted (1029402.log.gz)

#9 Updated by Loic Dachary over 3 years ago

  • File deleted (1029386.log.gz)

#10 Updated by Loic Dachary over 3 years ago

  • File deleted (CalAPITester.py)

#11 Updated by Loic Dachary over 3 years ago

  • Description updated (diff)
  • Status changed from Pending Backport to In Progress

#12 Updated by Loic Dachary over 3 years ago

https://github.com/ceph/ceph/pull/5195 is a fix for the gitbuilder / bot failure:

 run_mon: 43: ./ceph-mon --id a --mkfs --mon-data=testdir/osd-crush/a --run-dir=testdir/osd-crush/a --public-addr 127.0.0.1:7104
./ceph-mon: renaming mon.noname-a 127.0.0.1:7104/0 to mon.a
./ceph-mon: set fsid to d44c4bcd-c43d-4a0f-b5c2-1aee60666c41
./ceph-mon: created monfs at testdir/osd-crush/a for mon.a
run_mon: 49: ./ceph-mon --id a --mon-osd-full-ratio=.99 --mon-data-avail-crit=1 --paxos-propose-interval=0.1 --osd-crush-chooseleaf-type=0 --osd-pool-default-erasure-code-directory=.libs --debug-mon 20 --debug-ms 20 --debug-paxos 20 --chdir= --mon-data=testdir/osd-crush/a --log-file=testdir/osd-crush/a/log --mon-cluster-log-file=testdir/osd-crush/a/log --run-dir=testdir/osd-crush/a '--pid-file=testdir/osd-crush/a/$name.pid' --public-addr 127.0.0.1:7104
run: 32: TEST_crush_reject_empty testdir/osd-crush
TEST_crush_reject_empty: 192: local dir=testdir/osd-crush
TEST_crush_reject_empty: 193: run_mon testdir/osd-crush a
run_mon: 36: local dir=testdir/osd-crush
run_mon: 37: shift
run_mon: 38: local id=a
run_mon: 39: shift
run_mon: 40: dir+=/a
run_mon: 43: ./ceph-mon --id a --mkfs --mon-data=testdir/osd-crush/a --run-dir=testdir/osd-crush/a
'testdir/osd-crush/a' already exists and is not empty: monitor may already exist
run_mon: 49: ./ceph-mon --id a --mon-osd-full-ratio=.99 --mon-data-avail-crit=1 --paxos-propose-interval=0.1 --osd-crush-chooseleaf-type=0 --osd-pool-default-erasure-code-directory=.libs --debug-mon 20 --debug-ms 20 --debug-paxos 20 --chdir= --mon-data=testdir/osd-crush/a --log-file=testdir/osd-crush/a/log --mon-cluster-log-file=testdir/osd-crush/a/log --run-dir=testdir/osd-crush/a '--pid-file=testdir/osd-crush/a/$name.pid'
2015-07-10 11:06:10.996607 405e2700 -1 asok(0xc69c980) AdminSocketConfigObs::init: failed: AdminSocket::bind_and_listen: failed to bind the UNIX domain socket to 'testdir/osd-crush/a/ceph-mon.a.asok': (17) File exists

    error: IO lock testdir/osd-crush/a/store.db/LOCK: Resource temporarily unavailable 

2015-07-10 11:06:11.031421 405e2700 -1 error opening mon data directory at 'testdir/osd-crush/a': (22) Invalid argument
TEST_crush_reject_empty: 193: return 1 

#13 Updated by Loic Dachary over 3 years ago

  • Status changed from In Progress to Need Review

#14 Updated by Loic Dachary over 3 years ago

  • Status changed from Need Review to Resolved

Also available in: Atom PDF