Project

General

Profile

Bug #2075

osd: recover_got assert

Added by Sage Weil about 8 years ago. Updated over 7 years ago.

Status:
Resolved
Priority:
High
Assignee:
-
Category:
OSD
Target version:
% Done:

0%

Source:
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature:

Description


osd/ReplicatedPG.cc: In function 'void ReplicatedPG::recover_got(hobject_t, eversion_t)' thread 7ff9a05c8700 time 2012-02-16 14:38:03.928946
osd/ReplicatedPG.cc: 4753: FAILED assert(missing.num_missing() == 0)
 ceph version 0.41-369-g4b3bb5a (commit:4b3bb5ab37a05fa001d59f24da7d9c30d650321b)
 1: (ReplicatedPG::recover_got(hobject_t, eversion_t)+0x40c) [0x4c76ec]
 2: (ReplicatedPG::submit_push_complete(ObjectRecoveryInfo&, ObjectStore::Transaction*)+0x425) [0x4c7b65]
 3: (ReplicatedPG::handle_push(OpRequest*)+0x27a) [0x4c831a]
 4: (ReplicatedPG::sub_op_push(OpRequest*)+0x1d) [0x4db81d]
 5: (ReplicatedPG::do_sub_op(OpRequest*)+0x22b) [0x4dba8b]
 6: (OSD::dequeue_op(PG*)+0x121) [0x548641]
 7: (ThreadPool::worker()+0xa26) [0x61a556]
 8: (ThreadPool::WorkThread::entry()+0xd) [0x57b8dd]
 9: (()+0x7971) [0x7ff9b0b5b971]
 10: (clone()+0x6d) [0x7ff9af1e692d]

DEBUG:teuthology.run:Config:
  kernel:
    sha1: 07fd42934a53b8486709f7f866346a9e4bb6d5ce
  nuke-on-error: true
  overrides:
    ceph:
      conf:
        osd:
          osd op complaint time: 120
      coverage: true
      fs: btrfs
      log-whitelist:
      - clocks not synchronized
      - old request
      sha1: 4b3bb5ab37a05fa001d59f24da7d9c30d650321b
  roles:
  - - mon.a
    - mds.0
    - osd.0
  - - osd.1
  - - osd.2
  - - osd.3
  - - osd.4
  - - client.0
  tasks:
  - chef: null
  - ceph:
      log-whitelist:
      - wrongly marked me down or wrong addr
  - thrashosds:
      chance_down: 0.5
      clean_interval: 120
      op_delay: 30
  - cfuse: null
  - workunit:
      all:
      - rados/load-gen-mix-small.sh

ubuntu@teuthology:/a/nightly_coverage_2012-02-16-b/12290

ceph-osd.40.log.1.gz (223 KB) Oliver Francke, 09/19/2012 01:51 AM


Related issues

Duplicated by Ceph - Bug #2070: osd/ReplicatedPG.cc: 3627: FAILED assert(is_active()) Duplicate 02/16/2012

Associated revisions

Revision 344c2022 (diff)
Added by Sage Weil about 8 years ago

osd: fix up argument to PG::init()

Commit cefa55b288b40e17ade9875493dd94de52ac22bf moved PG initialization
into init(), but passed acting for both up and acting args. This lead to
confusion between primary and replica.

Also fix debug print so that the output is useful.

Fixes: #2075, #2070
Signed-off-by: Sage Weil <>

History

#1 Updated by Sage Weil about 8 years ago

ubuntu@teuthology:/a/nightly_coverage_2012-02-18-a/12489

  kernel:
    sha1: 07fd42934a53b8486709f7f866346a9e4bb6d5ce
  nuke-on-error: true
  overrides:
    ceph:
      conf:
        osd:
          osd op complaint time: 120
      coverage: true
      fs: btrfs
      log-whitelist:
      - clocks not synchronized
      - old request
      sha1: c1db9009c2cde9dc7ab8857b0d28a1b6d931e98a
  roles:
  - - mon.a
    - osd.0
    - osd.1
    - osd.2
  - - mds.a
    - client.0
    - osd.3
    - osd.4
    - osd.5
  tasks:
  - chef: null
  - ceph:
      log-whitelist:
      - wrongly marked me down or wrong addr
  - thrashosds: null
  - rados:
      clients:
      - client.0
      objects: 50
      op_weights:
        delete: 50
        read: 100
        snap_create: 50
        snap_remove: 50
        snap_rollback: 50
        write: 100
      ops: 4000

#2 Updated by Sage Weil about 8 years ago

  • Status changed from New to Resolved

#3 Updated by Oliver Francke over 7 years ago

Hi Sage,

as announced on the list, find the logfile attached. Don't know if it's exactly the same, at least almost, so feel free to open a new one ;)

Kind regards,

Oliver.

Also available in: Atom PDF