Bug #5507

closed

osd: ENOENT on clone

Added by Sage Weil almost 11 years ago. Updated almost 11 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
-
Category:
OSD
Target version:
-
% Done:
0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

    -3> 2013-07-05 01:24:04.554423 7fe90ff95700  0 filestore(/var/lib/ceph/osd/ceph-2)  error (2) No such file or directory not handled on operation 17 (4014.1.0, or op 0, counting from 0)
    -2> 2013-07-05 01:24:04.554449 7fe90ff95700  0 filestore(/var/lib/ceph/osd/ceph-2) ENOENT on clone suggests osd bug
    -1> 2013-07-05 01:24:04.554453 7fe90ff95700  0 filestore(/var/lib/ceph/osd/ceph-2)  transaction dump:
{ "ops": [
        { "op_num": 0,
          "op_name": "clone",
          "collection": "3.d_head",
          "src_oid": "105bd74d\/plana4931304-46\/head\/\/3",
          "dst_oid": "105bd74d\/plana4931304-46\/108\/\/3"},
        { "op_num": 1,
          "op_name": "setattr",
          "collection": "3.d_head",
          "oid": "105bd74d\/plana4931304-46\/108\/\/3",
          "name": "_",
          "length": 211},
        { "op_num": 2,
          "op_name": "rmattr",
          "collection": "3.d_head",
          "oid": "105bd74d\/plana4931304-46\/108\/\/3",
          "name": "snapset"},
        { "op_num": 3,
          "op_name": "write",
          "collection": "3.d_head",
          "oid": "105bd74d\/plana4931304-46\/head\/\/3",
          "length": 58,
          "offset": 0,
          "bufferlist length": 58},
        { "op_num": 4,
          "op_name": "setattr",
          "collection": "3.d_head",
          "oid": "105bd74d\/plana4931304-46\/head\/\/3",
          "name": "_",
          "length": 218},
        { "op_num": 5,
          "op_name": "setattr",
          "collection": "3.d_head",
          "oid": "105bd74d\/plana4931304-46\/head\/\/3",
          "name": "snapset",
          "length": 143}]}

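To read the log lines against the transaction dump: "error (2)" is the errno value for ENOENT, and "op 0, counting from 0" is a zero-based index into the "ops" array, which here is the clone of the head object. The following is only a minimal sketch of that lookup. The dict is an abbreviated hand copy of the dump above, and the helper name `failing_op` is hypothetical, not part of Ceph.

```python
import errno

# Abbreviated hand copy of the transaction dump above (JSON "\/" unescaped to "/").
transaction = {
    "ops": [
        {"op_num": 0, "op_name": "clone",
         "src_oid": "105bd74d/plana4931304-46/head//3",
         "dst_oid": "105bd74d/plana4931304-46/108//3"},
        {"op_num": 1, "op_name": "setattr"},
        {"op_num": 2, "op_name": "rmattr"},
        {"op_num": 3, "op_name": "write"},
        {"op_num": 4, "op_name": "setattr"},
        {"op_num": 5, "op_name": "setattr"},
    ]
}


def failing_op(txn, op_index):
    """Hypothetical helper: return the op at the zero-based index from the log."""
    return txn["ops"][op_index]


error_code = 2                    # from "error (2) No such file or directory"
op = failing_op(transaction, 0)   # from "op 0, counting from 0"

# errno.errorcode maps the numeric value back to its symbolic name.
summary = f"{errno.errorcode[error_code]} on {op['op_name']}"
print(summary)
print("missing source object:", op["src_oid"])
```

Under this reading, the object the filestore could not find is the clone source (the head object), not the clone destination.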
job was
ubuntu@teuthology:/a/teuthology-2013-07-05_01:00:13-rados-master-testing-basic/55413$ cat orig.config.yaml 
kernel:
  kdb: true
  sha1: ffa60fc0043054088da3fc7064a6f95f1e8b33e4
machine_type: plana
nuke-on-error: true
overrides:
  admin_socket:
    branch: master
  ceph:
    conf:
      global:
        ms inject socket failures: 5000
      mon:
        debug mon: 20
        debug ms: 20
        debug paxos: 20
    fs: xfs
    log-whitelist:
    - slow request
    sha1: 22227cd1c1bec01de4c5a3cfd46a52ce596e345a
  install:
    ceph:
      sha1: 22227cd1c1bec01de4c5a3cfd46a52ce596e345a
  s3tests:
    branch: master
  workunit:
    sha1: 22227cd1c1bec01de4c5a3cfd46a52ce596e345a
roles:
- - mon.a
  - mon.c
  - osd.0
  - osd.1
  - osd.2
- - mon.b
  - mds.a
  - osd.3
  - osd.4
  - osd.5
  - client.0
tasks:
- chef: null
- clock.check: null
- install: null
- ceph:
    log-whitelist:
    - wrongly marked me down
    - objects unfound and apparently lost
- thrashosds:
    chance_pgnum_grow: 1
    chance_pgpnum_fix: 1
    timeout: 1200
- rados:
    clients:
    - client.0
    objects: 50
    op_weights:
      delete: 50
      read: 100
      rollback: 50
      snap_create: 50
      snap_remove: 50
      write: 100
    ops: 4000


Related issues 1 (0 open, 1 closed)

Related to Ceph - Bug #5269: osd: EEXIST on mkcoll (Resolved, 06/06/2013)

Actions #1

Updated by Ian Colle almost 11 years ago

  • Assignee set to Samuel Just
Actions #2

Updated by Sage Weil almost 11 years ago

  • Status changed from New to 12
Actions #3

Updated by Samuel Just almost 11 years ago

  • Status changed from 12 to 7

This could be explained by a failure to resurrect a parent PG: recovery on an hobject would then perform writes on an object left in a parent PG collection that is being deleted.

Actions #4

Updated by Samuel Just almost 11 years ago

  • Assignee deleted (Samuel Just)
Actions #5

Updated by Samuel Just almost 11 years ago

  • Status changed from 7 to Resolved