Project

General

Profile

Activity

From 12/22/2022 to 01/20/2023

01/20/2023

10:05 PM Bug #58530 (Triaged): Pacific: Significant write amplification as compared to Nautilus
After upgrading multiple RBD clusters from 14.2.18 to 16.2.9, we've found that OSDs write significantly more to the u... Joshua Baergen
06:40 PM Bug #58113 (Fix Under Review): BLK/Kernel: Improve protection against running one OSD twice
Neha Ojha
05:38 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
Here is a runaway one I restarted 2 days ago.
ELAPSED CMD
2-00:09:13 /usr/bin/ceph-osd -n osd.3 -f --setuser...
Kevin Fox
05:06 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
I can get some more, but here's an initial bit.
osd.4 has been running away for a long time (at least 2 weeks. bas...
Kevin Fox
02:10 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
At the first step I'd like to see allocation stats probes from OSD logs. Here is an example:
2023-01-20T16:28:41.4...
Igor Fedotov
01:59 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
Vikhyat Umrao wrote:
> Igor/Adam - "But the behavior stops immediately on restart. So feels like some thread in the ...
Igor Fedotov

01/19/2023

10:52 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
If I strace a run away osd, it shows up with 59 threads. If I do it to one that is not run away, it shows up with 59 ... Kevin Fox

01/18/2023

05:16 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
Here's a picture. Kevin Fox
05:13 PM Bug #58022: Fragmentation score rising by seemingly stuck thread
We ended up slowly reformatting all of our osts and re-adding them. Things settled out to a fragmentation score of < ... Kevin Fox

01/17/2023

12:40 AM Bug #58463 (Fix Under Review): RocksDBTransactionImpl::rm_range_keys doesn't use bound iterator
Igor Fedotov

01/15/2023

11:24 PM Bug #58463 (Resolved): RocksDBTransactionImpl::rm_range_keys doesn't use bound iterator
Hence this might cause slow omap enumeration when rocksdb has got tons of tombstones.
Igor Fedotov

01/13/2023

04:35 PM Feature #58421: OSD metadata should show the min_alloc_size that each OSD was built with
Ideally this will be available both via `ceph osd metadata` and the admin socket so as to dovetail into common metric... Anthony D'Atri
12:14 PM Bug #53002 (Fix Under Review): crash BlueStore::Onode::put from BlueStore::TransContext::~TransCo...
Igor Fedotov
12:14 PM Bug #58439 (Duplicate): octopus osd crash
Igor Fedotov
09:57 AM Bug #58439 (Duplicate): octopus osd crash
Hi,
I was not able to find another bug which looks exactly like this (I found https://tracker.ceph.com/issues/2497...
Anonymous
11:59 AM Bug #58441 (New): ceph-bluestore-tool fsck crash with "FAILED ceph_assert(v.length() == p->shard_...
After OSD crashed with "FAILED ceph_assert(v.length() == p->shard_info->bytes)" (crash report here https://gist.githu... Changyuan Yu
10:46 AM Bug #58440 (Resolved): BlueFS spillover alert is broken
Apparently this has been removed by https://github.com/ceph/ceph/commit/d17cd6604b4031ca997deddc5440248aff451269#diff... Igor Fedotov

01/11/2023

10:53 PM Feature #58421 (Resolved): OSD metadata should show the min_alloc_size that each OSD was built with

To be very clear, the value the OSD was built with, *not* the prevailing value in `ceph.conf` or the central db.
...
Anthony D'Atri
06:42 AM Bug #53184: failed to start new osd due to SIGSEGV in BlueStore::read()
Hi Igor
I'm working with Satoru and Yuma, and I was trying to reproduce the problem with Ceph v17.2.5 and Rook v1....
Shinya Hayashi
02:01 AM Bug #58418 (Duplicate): unittest mempool always fail on Arm64 CI node
57: /root/ceph/src/test/test_mempool.cc:433: Failure
57: Expected: (missed) < (mempool::num_shards / 2), actual: 28 ...
Kevin Zhao

01/10/2023

11:40 AM Bug #56382: ONode ref counting is broken
Joshua Baergen wrote:
> All of the tickets related to each other for this problem are marked Duplicate. Which should...
Igor Fedotov
11:38 AM Bug #56382 (Fix Under Review): ONode ref counting is broken
Igor Fedotov

01/09/2023

02:40 PM Bug #56382: ONode ref counting is broken
All of the tickets related to each other for this problem are marked Duplicate. Which should be the main tracker for ... Joshua Baergen

01/03/2023

08:01 AM Bug #58274: BlueStore::collection_list becomes extremely slow due to unbounded rocksdb iteration
yixing hao wrote:
> Also observed from our HDD bluestore cluster with tens of billions of objects, the stack is like...
yixing hao
07:51 AM Bug #58274: BlueStore::collection_list becomes extremely slow due to unbounded rocksdb iteration
Also observed from our HDD bluestore cluster with tens of billions of objects, the stack is like the above.
7ffad9...
yixing hao

12/29/2022

07:26 AM Bug #53002: crash BlueStore::Onode::put from BlueStore::TransContext::~TransContext
(gdb) bt
#0 0x00007fc82cdb64aa in tc_newarray () from /lib64/libtcmalloc.so.4
#1 0x000055f6876050ba in ceph::buff...
王子敬 wang

12/24/2022

04:06 PM Bug #57895: OSD crash in Onode::put()
dongdong tao wrote:
> OK, thanks Igor for your confirmation, I'm reviewing your patch, we can discuss over there.
...
A. Saber Shenouda
 

Also available in: Atom