Project

General

Profile

Bug #9740

FileJournal::do_write assert(0)

Added by Loïc Dachary over 9 years ago. Updated over 9 years ago.

Status:
Duplicate
Priority:
High
Assignee:
-
Category:
OSD
Target version:
-
% Done:

0%

Source:
other
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

http://pulpito.ceph.com/loic-2014-10-10_08:45:20-rados:thrash-erasure-code-isa-master-testing-basic-vps/536207/

 -2145> 2014-10-10 07:43:33.114362 7fb2f02be700 10 btrfsfilestorebackend(/var/lib/ceph/osd/ceph-5) destroy_checkpoint: 'snap_36597'

...
  -214> 2014-10-10 07:43:31.794464 7fb2efabd700 -1 journal FileJournal::write_bl : write_fd failed: (17) File exists
  -221> 2014-10-10 07:43:31.794365 7fb2efabd700 10 journal check_for_full at 8617984 : 4096 < 101834751
  -220> 2014-10-10 07:43:31.794367 7fb2efabd700 15 journal prepare_single_write 1 will write 8617984 : seq 36998 len 881 -> 4096 (head 40 pre_pad 0 ebl 881 post_pad 3135 tail 40) (ebl alignment -1)
  -219> 2014-10-10 07:43:31.794381 7fb2efabd700 10 journal room 101830655 max_size 104857600 pos 8622080 header.start 5599232 top 4096
  -218> 2014-10-10 07:43:31.794383 7fb2efabd700 10 journal check_for_full at 8622080 : 4096 < 101830655
  -217> 2014-10-10 07:43:31.794384 7fb2efabd700 15 journal prepare_single_write 2 will write 8622080 : seq 36999 len 881 -> 4096 (head 40 pre_pad 0 ebl 881 post_pad 3135 tail 40) (ebl alignment -1)
  -216> 2014-10-10 07:43:31.794392 7fb2efabd700 20 journal prepare_multi_write queue_pos now 8626176
  -215> 2014-10-10 07:43:31.794393 7fb2efabd700 15 journal do_write writing 8617984~8192
  -214> 2014-10-10 07:43:31.794464 7fb2efabd700 -1 journal FileJournal::write_bl : write_fd failed: (17) File exists
  -213> 2014-10-10 07:43:31.794481 7fb2efabd700 -1 journal FileJournal::do_write: write_bl(pos=8617984) failed
...
   -57> 2014-10-10 07:43:31.814872 7fb2ee2ba700 10 filestore(/var/lib/ceph/osd/ceph-5) remove 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2 = -2
   -56> 2014-10-10 07:43:31.814878 7fb2ee2ba700 15 filestore(/var/lib/ceph/osd/ceph-5) touch 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2
   -55> 2014-10-10 07:43:31.815053 7fb2ee2ba700 10 filestore(/var/lib/ceph/osd/ceph-5) touch 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2 = 0
   -54> 2014-10-10 07:43:31.815062 7fb2ee2ba700 15 filestore(/var/lib/ceph/osd/ceph-5) write 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2 0~524288
   -53> 2014-10-10 07:43:31.815834 7fb2ee2ba700 10 filestore(/var/lib/ceph/osd/ceph-5) write 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2 0~524288 = 524288
   -52> 2014-10-10 07:43:31.815950 7fb2ee2ba700 15 filestore(/var/lib/ceph/osd/ceph-5) setattrs 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2
   -51> 2014-10-10 07:43:31.816466 7fb2ee2ba700 20 filestore dbobjectmap: seq is 1955
   -50> 2014-10-10 07:43:31.816554 7fb2ee2ba700 20 filestore set_map_header: setting 1954 oid acdb5565/vpm0367513-41/189//1/18446744073709551615/2 parent seq 0
   -49> 2014-10-10 07:43:31.816625 7fb2ee2ba700 10 filestore oid: acdb5565/vpm0367513-41/189//1/18446744073709551615/2 not skipping op, *spos 36997.0.7
   -48> 2014-10-10 07:43:31.816660 7fb2ee2ba700 10 filestore  > header.spos 0.0.0
   -47> 2014-10-10 07:43:31.816974 7fb2ee2ba700 10 filestore(/var/lib/ceph/osd/ceph-5) setattrs 1.65s2_TEMP/acdb5565/vpm0367513-41/189//1/18446744073709551615/2 = 0
   -46> 2014-10-10 07:43:31.817038 7fb2ee2ba700 15 filestore(/var/lib/ceph/osd/ceph-5) remove 1.65s2_head/acdb5565/vpm0367513-41/199//1/18446744073709551615/2
   -45> 2014-10-10 07:43:31.817155 7fb2ee2ba700 20 filestore(/var/lib/ceph/osd/ceph-5) lfn_unlink: clearing omap on acdb5565/vpm0367513-41/199//1/18446744073709551615/2 in cid 1.65s2_head
   -44> 2014-10-10 07:43:31.817235 7fb2ee2ba700 10 filestore oid: acdb5565/vpm0367513-41/199//1/18446744073709551615/2 not skipping op, *spos 36997.0.8
   -43> 2014-10-10 07:43:31.817273 7fb2ee2ba700 10 filestore  > header.spos 36136.0.7
   -42> 2014-10-10 07:43:31.817304 7fb2ee2ba700 20 filestore remove_map_header: removing 1906 oid acdb5565/vpm0367513-41/199//1/18446744073709551615/2
   -41> 2014-10-10 07:43:31.817342 7fb2ee2ba700 20 filestore clear_header: clearing seq 1906
   -40> 2014-10-10 07:43:31.823339 7fb2efabd700 -1 os/FileJournal.cc: In function 'void FileJournal::do_write(ceph::bufferlist&)' thread 7fb2efabd700 time 2014-10-10 07:43:31.794493
os/FileJournal.cc: 1039: FAILED assert(0)

 ceph version 0.86-315-g78be62f (78be62f739e39d1be10648eea015a756d5e169b5)
 1: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x8b) [0xb82eab]
 2: (FileJournal::do_write(ceph::buffer::list&)+0xa5a) [0xa4b7ca]
 3: (FileJournal::write_thread_entry()+0x151) [0xa4ecd1]
 4: (FileJournal::Writer::entry()+0xd) [0x917bdd]
 5: (()+0x8182) [0x7fb2fbe73182]
 6: (clone()+0x6d) [0x7fb2fa3ddfbd]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Related issues

Duplicates Ceph - Bug #9443: btrfs pwrite returns EEXIST on journal FileJournal::write_bl Rejected 09/11/2014

History

#1 Updated by Loïc Dachary over 9 years ago

  • Description updated (diff)

#2 Updated by Loïc Dachary over 9 years ago

  • Description updated (diff)
  • Status changed from New to Duplicate

Also available in: Atom PDF