Project

General

Profile

Activity

From 10/28/2018 to 11/26/2018

11/26/2018

11:41 PM Backport #36754 (Resolved): mimic: _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
Nathan Cutler
08:49 PM Backport #36754: mimic: _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25062
merged
Yuri Weinstein

11/22/2018

11:25 AM Bug #37360: bluefs-bdev-expand aborts
Got it. Thanks, Mark!
So as I said before main device resize isn't supported at the moment.
Will probably start a...
Igor Fedotov
11:10 AM Bug #37360: bluefs-bdev-expand aborts
I decided to enlarge OSD backing store device to be able to store more data on this OSD without re-creating it.
Se...
Марк Коренберг
10:17 AM Bug #37360: bluefs-bdev-expand aborts
Actually there are 2 aspects for this ticket:
1) the tool improperly handles OSD deployments that lack DB and/or WAL...
Igor Fedotov
09:34 AM Bug #37360 (In Progress): bluefs-bdev-expand aborts
Igor Fedotov
09:04 AM Bug #37360: bluefs-bdev-expand aborts
Problem is still triggered every time. Марк Коренберг
09:04 AM Bug #37360: bluefs-bdev-expand aborts
... Марк Коренберг
09:03 AM Bug #37360: bluefs-bdev-expand aborts
... Марк Коренберг
08:46 AM Bug #37360: bluefs-bdev-expand aborts
Wondering if bluefs-bdev-sizes command works fine? What's about fsck? Igor Fedotov

11/21/2018

09:35 PM Bug #37360 (Resolved): bluefs-bdev-expand aborts
root@node1:~# ceph-bluestore-tool bluefs-bdev-expand --path /var/lib/ceph/osd/ceph-16
infering bluefs devices from b...
Марк Коренберг

11/16/2018

05:31 PM Bug #37282: rocksdb: submit_transaction_sync error: Corruption: block checksum mismatch code = 2
I have checked the kernel log and smartctl and do not see any errors. Jeff Smith
09:48 AM Bug #37282: rocksdb: submit_transaction_sync error: Corruption: block checksum mismatch code = 2
Firstly I suggest to verify the disk drive behind DB volume for physical errors. Igor Fedotov
05:28 AM Bug #37282 (Need More Info): rocksdb: submit_transaction_sync error: Corruption: block checksum m...
I have an OSD that will not start. It keep crashing. Not sure where to go from here. Unfortunately, it happened ri... Jeff Smith

11/14/2018

09:13 PM Bug #37090: BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Kjetil Joergensen wrote:
> Kjetil Joergensen wrote:
> > Ok - I think you can close this one. This is in all likelih...
Kjetil Joergensen
08:56 PM Bug #37090: BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Kjetil Joergensen wrote:
> Ok - I think you can close this one. This is in all likelihood a hardware error of some s...
Kjetil Joergensen
08:41 PM Bug #37090: BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Ok - I think you can close this one. This is in all likelihood a hardware error of some sort, on the same machine I h... Kjetil Joergensen
06:11 PM Bug #37090: BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Log posted with ceph-upload-file: fbc90b08-887d-40b9-99b9-0a843465a313
Console output below...
Kjetil Joergensen
09:47 AM Bug #37090: BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Could you please run fsck on this OSD with "debug bluestore" set to 20 and share the log? Igor Fedotov

11/13/2018

07:49 PM Bug #37090: BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Part of the osd log, should incude the first crash and maybe a couple of the subsequent ones, to make it fit within t... Kjetil Joergensen
07:27 PM Bug #37090 (Can't reproduce): BlueStore.cc: 3099: FAILED assert(0 == "uh oh, missing shared_blob")
Possibly a duplicate of #36303
What is slightly interesting, after setting the osd out and migrating off of it, it...
Kjetil Joergensen
06:38 PM Backport #36641 (Need More Info): mimic: Unable to recover from ENOSPC in BlueFS
Igor writes in the parent issue: "In fact previously mentioned PR is just a workaround to be able to manually fix the... Nathan Cutler
06:37 PM Backport #36640 (Need More Info): luminous: Unable to recover from ENOSPC in BlueFS
Igor writes in the parent issue: "In fact previously mentioned PR is just a workaround to be able to manually fix the... Nathan Cutler
10:36 AM Bug #36268 (In Progress): Unable to recover from ENOSPC in BlueFS
In fact previously mentioned PR is just a workaround to be able to manually fix the issue.
Working on the actual sol...
Igor Fedotov

11/12/2018

06:16 PM Backport #36755 (In Progress): luminous: _aio_log_start inflight overlap of 0x10000~1000 with [65...
Jonathan Brielmaier
04:26 PM Backport #36754 (In Progress): mimic: _aio_log_start inflight overlap of 0x10000~1000 with [65536...
Jonathan Brielmaier

11/10/2018

08:54 AM Backport #36755 (Rejected): luminous: _aio_log_start inflight overlap of 0x10000~1000 with [65536...
https://github.com/ceph/ceph/pull/25064 Nathan Cutler
08:54 AM Backport #36754 (Resolved): mimic: _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
https://github.com/ceph/ceph/pull/25062 Nathan Cutler

11/08/2018

11:04 PM Bug #36606 (Resolved): osd: checksum failure during upgrade test
Igor Fedotov
11:04 PM Bug #36606: osd: checksum failure during upgrade test
Sage, no, it's specific to Nautilus for now. We need it when/if we backport BlueFS migrate stuff. Igor Fedotov
10:28 PM Bug #36606 (Pending Backport): osd: checksum failure during upgrade test
Igor, we should backport this, right? Sage Weil
10:29 PM Bug #36625 (Pending Backport): _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
Sage Weil
01:56 PM Backport #26943 (In Progress): luminous: os/bluestore/BlueStore.cc: 1025: FAILED assert(buffer_by...
Jonathan Brielmaier
09:53 AM Backport #36638 (In Progress): luminous: rename does not old ref to replacement onode at old name
Jonathan Brielmaier

11/06/2018

03:37 PM Bug #36606: osd: checksum failure during upgrade test
https://github.com/ceph/ceph/pull/24948 Igor Fedotov
01:45 PM Bug #36606 (Fix Under Review): osd: checksum failure during upgrade test
Igor Fedotov
01:28 PM Bug #36606 (In Progress): osd: checksum failure during upgrade test
Igor Fedotov

11/05/2018

10:27 PM Bug #36526 (Resolved): segv in BlueStore::OldExtent::create
Nathan Cutler
10:26 PM Backport #36591 (Resolved): luminous: segv in BlueStore::OldExtent::create
Nathan Cutler
10:08 PM Backport #36591: luminous: segv in BlueStore::OldExtent::create
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24746
merged
Yuri Weinstein

11/02/2018

04:46 PM Bug #36606: osd: checksum failure during upgrade test
Here's my analysis:
reproducer: https://tracker.ceph.com/issues/36606#note-9
commit before https://github.com/c...
Neha Ojha

10/31/2018

07:49 PM Backport #36592 (Resolved): mimic: segv in BlueStore::OldExtent::create
Nathan Cutler
12:27 AM Bug #36606: osd: checksum failure during upgrade test
The following seem to be the relevant pieces for one osd leading to the failure:... Neha Ojha

10/30/2018

10:50 PM Bug #36606: osd: checksum failure during upgrade test
Yes, the mkfs suceeds. That part of the logs is also present in the successful run of this test.... Neha Ojha
10:03 PM Bug #36606: osd: checksum failure during upgrade test
The --no-mon-config splats or normal.. qa/tasks/ceph.py tries first with --no-mon-config and, if it fails, does the m... Sage Weil
07:46 PM Backport #36592: mimic: segv in BlueStore::OldExtent::create
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24745
merged
Yuri Weinstein
05:15 PM Backport #36641 (Rejected): mimic: Unable to recover from ENOSPC in BlueFS
Patrick Donnelly
05:15 PM Backport #36640 (Rejected): luminous: Unable to recover from ENOSPC in BlueFS
Patrick Donnelly
05:14 PM Backport #36639 (Resolved): mimic: rename does not old ref to replacement onode at old name
https://github.com/ceph/ceph/pull/25313 Patrick Donnelly
05:14 PM Backport #36638 (Resolved): luminous: rename does not old ref to replacement onode at old name
https://github.com/ceph/ceph/pull/24989 Patrick Donnelly
04:09 PM Bug #22534: Debian's bluestore *rocksdb* does not support neither fast CRC nor compression
Partly broken:... Марк Коренберг
03:25 PM Bug #22534: Debian's bluestore *rocksdb* does not support neither fast CRC nor compression
Is this still broken? Sage Weil
02:49 PM Bug #36567: Segmentation fault in BlueStore::Blob::discard_unallocated
Not that good ;-) it always happen, when we trigger a heavy backfill or recovery. But i don't want to pull that many ... Stefan Priebe
02:37 PM Bug #36567: Segmentation fault in BlueStore::Blob::discard_unallocated
Stefan Priebe wrote:
> Yes so my question is if all of those are may be just a result of the race mentioned here: ht...
Sage Weil
02:44 PM Bug #36268 (Pending Backport): Unable to recover from ENOSPC in BlueFS
also https://github.com/ceph/ceph/pull/23103 Sage Weil
02:41 PM Bug #36625 (In Progress): _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
Sage Weil
07:00 AM Bug #36625: _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
https://github.com/ceph/ceph/pull/24820 Honggang Yang
06:55 AM Bug #36625 (Resolved): _aio_log_start inflight overlap of 0x10000~1000 with [65536~4096]
h1. discription... Honggang Yang
02:40 PM Bug #36422 (Duplicate): ObjectStore/StoreTestSpecificAUSize.Many4KWritesTest/2 failure
Sage Weil
10:05 AM Bug #36284: Bluestore might be hanging OSD
My problem was fixed by:
https://github.com/ceph/ceph/commit/f755bed3e438d2e7d5ed0df30b8d5bebf2d0cf5a
I expect th...
Adam Kupczyk

10/29/2018

11:55 PM Bug #36606: osd: checksum failure during upgrade test
/a/nojha-2018-10-29_19:19:04-fs:upgrade-master-distro-basic-smithi/3201377/ Neha Ojha
06:21 PM Bug #36606: osd: checksum failure during upgrade test
Igor Fedotov wrote:
> I think mkfs doesn't run properly for bluestore since --no-mon-config param isn't recognized f...
Patrick Donnelly
08:55 AM Bug #36606: osd: checksum failure during upgrade test
I think mkfs doesn't run properly for bluestore since --no-mon-config param isn't recognized for unknown reason):
...
Igor Fedotov
08:10 PM Bug #22464: Bluestore: many checksum errors, always 0x6706be76 (which matches a zero block)
https://github.com/ceph/ceph/pull/24647 merged Yuri Weinstein
06:47 PM Bug #36604 (Rejected): osd-bluefs-volume-ops.sh test hangs

I ran cmake again and make then rebuilt the ceph-bluestore-tool and this problem went away.
David Zafman
02:16 PM Bug #36541 (Pending Backport): rename does not old ref to replacement onode at old name
Sage Weil

10/28/2018

12:14 AM Bug #36606: osd: checksum failure during upgrade test
Does not affect filestore. Only upgrade tests (fs:upgrade) with bluestore (replicated or EC). Patrick Donnelly
 

Also available in: Atom