Project

General

Profile

Activity

From 10/30/2018 to 11/28/2018

11/28/2018

10:06 PM Bug #37439: Degraded PG does not discover remapped data on originating OSD
The first scenario definitely looks like an issue; perhaps we are improperly filtering for out rather than down durin... Greg Farnum
02:07 PM Bug #37439: Degraded PG does not discover remapped data on originating OSD
As I can't edit the post...
To clarify: With *missing* I mean the parts of the erasure coded object so the object ...
Jonas Jelten
02:00 PM Bug #37439 (Resolved): Degraded PG does not discover remapped data on originating OSD
There seems to be an issue that an OSD is not queried for *missing objects* that were *remapped*, but the OSD for thi... Jonas Jelten
05:22 PM Backport #37437: mimic: crushtool: add --reclassify operation to convert legacy crush maps to use...
h3. original description
The functionality has been added to master (nautilus) [1]. It would be nice to backport t...
Nathan Cutler
04:03 PM Backport #37437: mimic: crushtool: add --reclassify operation to convert legacy crush maps to use...
PR: https://github.com/ceph/ceph/pull/25306 Mykola Golub
01:39 PM Backport #37437 (Resolved): mimic: crushtool: add --reclassify operation to convert legacy crush ...
https://github.com/ceph/ceph/pull/25306 Mykola Golub
05:21 PM Backport #37438: luminous: crushtool: add --reclassify operation to convert legacy crush maps to ...
h3. original description
The functionality has been added to master (nautilus) [1]. It would be nice to backport t...
Nathan Cutler
04:02 PM Backport #37438: luminous: crushtool: add --reclassify operation to convert legacy crush maps to ...
PR: https://github.com/ceph/ceph/pull/25307 Mykola Golub
01:41 PM Backport #37438 (Resolved): luminous: crushtool: add --reclassify operation to convert legacy cru...
https://github.com/ceph/ceph/pull/25307 Mykola Golub
05:20 PM Bug #37443 (Resolved): crushtool: add --reclassify operation to convert legacy crush maps to use ...
The functionality has been added to master (nautilus) [1]. It would be nice to backport this.
[1] https://github.c...
Nathan Cutler
05:09 AM Bug #36732 (Resolved): tools/rados: fix segmentation fault
Kefu Chai

11/27/2018

08:40 PM Backport #36321 (In Progress): luminous: Add support for osd_delete_sleep configuration value
Nathan Cutler
08:39 PM Backport #36321: luminous: Add support for osd_delete_sleep configuration value
h3. original description
[RFE] Introduce an option or flag to throttle the pg deletion process
https://bugzilla.r...
Nathan Cutler
07:45 PM Bug #36250: ceph-osd process crashing
I believe this issue was due to a malfunctioning ceph-fuse client, although I don't have data to back that up as it w... Josh Haft
06:02 PM Fix #37410 (Duplicate): change default osd_objectstore to bluestore
duplicate of #36494 Douglas Fuller
05:53 PM Fix #37410 (Fix Under Review): change default osd_objectstore to bluestore
https://github.com/ceph/ceph/pull/25288 Douglas Fuller
05:38 PM Fix #37410 (Duplicate): change default osd_objectstore to bluestore
This way, the mon and associated tools know what the default actually is on the cluster. Douglas Fuller
06:01 PM Bug #36494: Change osd_objectstore default to bluestore
Can you set this for backport to mimic and luminous? Douglas Fuller
03:30 PM Backport #37341 (In Progress): luminous: doc: Add bluestore memory autotuning docs
Josh Durgin
03:26 PM Backport #37340 (In Progress): mimic: doc: Add bluestore memory autotuning docs
Josh Durgin
02:27 PM Bug #36525: osd-scrub-snaps.sh failure
/a/kchai-2018-11-27_11:44:27-rados-wip-kefu2-testing-2018-11-27-1724-distro-basic-smithi/3285226/teuthology.log Kefu Chai
11:45 AM Bug #37404 (Fix Under Review): OSD mkfs might assert when working agains bluestore disk that alre...
https://github.com/ceph/ceph/pull/25281/files Igor Fedotov
11:04 AM Bug #37404 (In Progress): OSD mkfs might assert when working agains bluestore disk that already h...
Igor Fedotov
11:01 AM Bug #37404 (Resolved): OSD mkfs might assert when working agains bluestore disk that already has ...
One might face an assert on collection's release which happens
after store destroy. For now is observable in some qa...
Igor Fedotov

11/26/2018

11:49 PM Bug #24612 (Resolved): FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
Nathan Cutler
11:49 PM Backport #35071 (Resolved): mimic: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::p...
Nathan Cutler
08:56 PM Backport #35071: mimic: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24918
merged
Yuri Weinstein
11:48 PM Bug #22544 (Resolved): objecter cannot resend split-dropped op when racing with con reset
Nathan Cutler
11:48 PM Backport #35843 (Resolved): mimic: objecter cannot resend split-dropped op when racing with con r...
Nathan Cutler
08:55 PM Backport #35843: mimic: objecter cannot resend split-dropped op when racing with con reset
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24970
merged
Yuri Weinstein
11:48 PM Bug #36358 (Resolved): Interactive mode CLI prints no output since Mimic
Nathan Cutler
11:47 PM Backport #36432 (Resolved): mimic: Interactive mode CLI prints no output since Mimic
Nathan Cutler
08:54 PM Backport #36432: mimic: Interactive mode CLI prints no output since Mimic
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24971
merged
Yuri Weinstein
11:47 PM Backport #36433 (Resolved): mimic: monstore tool rebuild does not generate creating_pgs
Nathan Cutler
08:54 PM Backport #36433: mimic: monstore tool rebuild does not generate creating_pgs
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25016
merged
Yuri Weinstein
11:46 PM Backport #36435 (Resolved): mimic: rados rm --force-full is blocked when cluster is in full status
Nathan Cutler
08:53 PM Backport #36435: mimic: rados rm --force-full is blocked when cluster is in full status
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25017
merged
Yuri Weinstein
11:45 PM Backport #36505 (Resolved): mimic: mon osdmap cash too small during upgrade to mimic
Nathan Cutler
08:53 PM Backport #36505: mimic: mon osdmap cash too small during upgrade to mimic
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25019
merged
Yuri Weinstein
11:44 PM Backport #36557 (Resolved): mimic: RBD client IOPS pool stats are incorrect (2x higher; includes ...
Nathan Cutler
08:52 PM Backport #36557: mimic: RBD client IOPS pool stats are incorrect (2x higher; includes IO hints as...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25024
merged
Yuri Weinstein
11:44 PM Backport #36637 (Resolved): mimic: osd: race condition opening heartbeat connection
Nathan Cutler
08:51 PM Backport #36637: mimic: osd: race condition opening heartbeat connection
Patrick Donnelly wrote:
> https://github.com/ceph/ceph/pull/25026
merged
Yuri Weinstein
11:43 PM Backport #36647 (Resolved): mimic: librados api aio tests race condition
Nathan Cutler
08:51 PM Backport #36647: mimic: librados api aio tests race condition
Patrick Donnelly wrote:
> https://github.com/ceph/ceph/pull/25027
merged
Yuri Weinstein
11:40 PM Backport #36658 (Resolved): mimic: Cache-tier forward mode hang in luminous (again)
Nathan Cutler
08:48 PM Backport #36658: mimic: Cache-tier forward mode hang in luminous (again)
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25075
merged
Yuri Weinstein
08:45 PM Bug #37393 (Resolved): mimic: osd-backfill-stats.sh fails in rados/standalone/osd.yaml
Run: http://pulpito.front.sepia.ceph.com/yuriw-2018-11-21_22:16:20-rados-wip-yuri5-testing-2018-11-21-1510-mimic-dist... Yuri Weinstein

11/25/2018

09:56 AM Bug #37326: Daily inconsistent objects
Anyone has any idea? Greg Smith

11/23/2018

04:52 PM Bug #22597: "sudo chown -R ceph:ceph /var/lib/ceph/osd/ceph-0'" fails in upgrade test
The problematic chown was introduced in mimic, so backporting only that far back.
See https://github.com/ceph/ceph...
Nathan Cutler
02:34 AM Backport #37288 (In Progress): mimic: "sudo chown -R ceph:ceph /var/lib/ceph/osd/ceph-0'" fails i...
https://github.com/ceph/ceph/pull/25227 Prashant D

11/22/2018

05:19 PM Backport #37273 (Resolved): mimic: debian: packaging need to reflect move of /etc/bash_completion...
Nathan Cutler
04:46 PM Backport #37273: mimic: debian: packaging need to reflect move of /etc/bash_completion.d/radosgw-...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/25115
merged
Yuri Weinstein
07:32 AM Bug #36767: OSD: unrecoverable heartbeat connections
see also: https://tracker.ceph.com/issues/36175 Yan Jun

11/21/2018

08:25 AM Backport #37340 (Need More Info): mimic: doc: Add bluestore memory autotuning docs
Nathan Cutler
07:19 AM Bug #37326: Daily inconsistent objects
It happens on different disks, even on different host nodes. Greg Smith
06:40 AM Bug #24676: FreeBSD/Linux integration - monitor map with wrong sa_family
Hello,
Just tested this and received the same "NetHandler create_socket couldn't create socket (97) Address family...
Richard Gallamore

11/20/2018

09:42 PM Bug #36725: luminous: Apparent Memory Leak in OSD
Upgraded one OSD server to 12.2.9. Clean reboot. Generating hourly report on memory and mempools. Three examples a... John Jaser
09:10 PM Backport #37340: mimic: doc: Add bluestore memory autotuning docs
This is blocked by mimic version of https://github.com/ceph/ceph/pull/24065 Neha Ojha
07:54 PM Backport #37340 (Resolved): mimic: doc: Add bluestore memory autotuning docs
https://github.com/ceph/ceph/pull/25283 Nathan Cutler
07:54 PM Backport #37343 (Resolved): luminous: Prioritize user specified scrubs
https://github.com/ceph/ceph/pull/25514 Nathan Cutler
07:54 PM Backport #37342 (Resolved): mimic: Prioritize user specified scrubs
https://github.com/ceph/ceph/pull/25513 Nathan Cutler
07:54 PM Backport #37341 (Resolved): luminous: doc: Add bluestore memory autotuning docs
https://github.com/ceph/ceph/pull/25284 Nathan Cutler
11:01 AM Bug #37289: Issue with overfilled OSD for cache-tier pools
Whithout cache tiering everything is good.
After reaching 95% utilization of OSD for my replicated pool (whithout...
Oleksandr Mykhalskyi

11/19/2018

10:57 PM Bug #36667: OSD object_map sync returned error
This might also indicate something screwe dup the file permissions or ownership in /var/lib/ceph/osd/ceph-10. maybe ... Sage Weil
10:56 PM Bug #36709 (Need More Info): OSD stuck while flushing rocksdb WAL
I'm not sure know rocksdb is what's stuck.. can you dump 'ceph daemon osd.NNN ops' to see what state teh oeprations a... Sage Weil
10:54 PM Bug #37264: scrub warning check incorrectly uses mon scrub interval
You should be able to get the pool info out of the monitor's OSDMap, if that was a question... :) Greg Farnum
10:51 PM Bug #37289: Issue with overfilled OSD for cache-tier pools
I think teh first question to answer is if this can be reproduced without cache tiering. It's not immediately clear ... Sage Weil
10:48 PM Bug #37326 (Need More Info): Daily inconsistent objects
Is this happening on the same disk all the time, or the same node? If so, that suggests a piece of hardware (e.g. con... Josh Durgin
10:31 AM Bug #37326 (Need More Info): Daily inconsistent objects
We have many Ceph mimic 13.2.1 installed with a similar configuration on ubuntu, but on one of them we get inconsiste... Greg Smith
10:48 PM Bug #36304 (Can't reproduce): FAILED ceph_assert(p != pg_slots.end()) in OSDShard::register_and_w...
I'm guessing this was fixed by 450f337d6fd048c8c95a0ec0dec0d97f5474922e Sage Weil
10:43 PM Bug #36598: osd: "bluestore(/var/lib/ceph/osd/ceph-6) ENOENT on clone suggests osd bug"
Sage thinks this might also be #36739. Greg Farnum
10:40 PM Bug #36686 (In Progress): osd: pg log hard limit can cause crash during upgrade
Sage Weil
10:40 PM Bug #36725 (Need More Info): luminous: Apparent Memory Leak in OSD
can you dump the mempools (ceph daemon osd.NNN dump_mempools) several times over the growht of the process so we can ... Sage Weil
07:15 PM Bug #37269 (Pending Backport): Prioritize user specified scrubs
Sage Weil
04:47 PM Bug #37329 (Pending Backport): doc: Add bluestore memory autotuning docs
Neha Ojha
04:44 PM Bug #37329 (Resolved): doc: Add bluestore memory autotuning docs
https://github.com/ceph/ceph/pull/25069 Neha Ojha

11/17/2018

03:45 AM Bug #37299 (New): ceph-disk: ceph osd start failed: Command '['/usr/bin/systemctl', 'disable', 'c...
Please see the details at:
https://bugzilla.redhat.com/show_bug.cgi?id=1649208#c0
Han Han

11/16/2018

12:47 PM Bug #37289 (New): Issue with overfilled OSD for cache-tier pools
We have bad issue in our ceph cluster.
Centos 7.5 (3.10.0-862.3.2.el7.x86_64)
Luminous 12.2.5, bluestore OSDs, us...
Oleksandr Mykhalskyi
11:35 AM Backport #37288 (Resolved): mimic: "sudo chown -R ceph:ceph /var/lib/ceph/osd/ceph-0'" fails in u...
https://github.com/ceph/ceph/pull/25227 Nathan Cutler
10:34 AM Bug #16500 (Resolved): ceph_erasure_code_benchmark parameter checking error for LRC plugin
Kefu Chai
06:22 AM Bug #22597 (Pending Backport): "sudo chown -R ceph:ceph /var/lib/ceph/osd/ceph-0'" fails in upgra...
Sage Weil
04:53 AM Bug #36767 (Fix Under Review): OSD: unrecoverable heartbeat connections
Kefu Chai
02:53 AM Feature #23493: config: strip/escape single-quotes in values when setting them via conf file/assi...
Joao,
Could you take a look at https://github.com/ceph/ceph/pull/20610 and see whether you consider it something t...
Brad Hubbard
01:59 AM Bug #37264: scrub warning check incorrectly uses mon scrub interval

The scrub warning also doesn't consider the pool specific scrub interval if specified. The scrub code gets the p...
David Zafman

11/15/2018

01:16 PM Bug #25146 (Resolved): "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:paralle...
Kefu Chai
11:36 AM Backport #37273 (In Progress): mimic: debian: packaging need to reflect move of /etc/bash_complet...
Nathan Cutler
10:47 AM Backport #37273: mimic: debian: packaging need to reflect move of /etc/bash_completion.d/radosgw-...
PR with this backport is https://github.com/ceph/ceph/pull/25115 Matthew Vernon
09:44 AM Backport #37273 (Resolved): mimic: debian: packaging need to reflect move of /etc/bash_completion...
https://github.com/ceph/ceph/pull/25115 Nathan Cutler
10:36 AM Backport #37274 (In Progress): luminous: debian: packaging need to reflect move of /etc/bash_comp...
Nathan Cutler
09:45 AM Backport #37274 (Resolved): luminous: debian: packaging need to reflect move of /etc/bash_complet...
https://github.com/ceph/ceph/pull/24997 Nathan Cutler
09:38 AM Bug #36725: luminous: Apparent Memory Leak in OSD
raising priority since this might be a regression in 12.2.9 Nathan Cutler
06:31 AM Bug #36741 (Pending Backport): debian: packaging need to reflect move of /etc/bash_completion.d/r...
https://github.com/ceph/ceph/pull/24996 Kefu Chai
06:20 AM Bug #37269 (Resolved): Prioritize user specified scrubs

When scrubs start backing up, when a user asks for a scrub it doesn't get priority compared to overdue scrubs. The...
David Zafman
06:14 AM Bug #37264 (Resolved): scrub warning check incorrectly uses mon scrub interval

When checking the mon_warn_not_scrubbed the mon_scrub_interval is used instead of osd_scrub_max_interval.
David Zafman

11/14/2018

08:01 PM Bug #36725: luminous: Apparent Memory Leak in OSD
Note: Downgrading both OSD servers to v12.2.8 returned memory usage to normal. John Jaser
11:43 AM Backport #36636: luminous: osd: race condition opening heartbeat connection
std::lock_guard is a C++11 feature: https://en.cppreference.com/w/cpp/header/mutex Patrick Donnelly

11/13/2018

02:23 PM Backport #36658 (In Progress): mimic: Cache-tier forward mode hang in luminous (again)
Jonathan Brielmaier
02:15 PM Backport #36657 (In Progress): luminous: Cache-tier forward mode hang in luminous (again)
Jonathan Brielmaier
11:57 AM Bug #36388: osd: "out of order op"
This looks like the dup op entries were exceeded so the op was not detected as a dup. Perhaps we should increase the ... Josh Durgin
04:55 AM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
https://github.com/ceph/ceph/pull/25070 Kefu Chai

11/12/2018

03:41 PM Bug #36767: OSD: unrecoverable heartbeat connections
Pull request:
https://github.com/ceph/ceph/pull/25061
Yury Z
03:09 PM Bug #36767 (Fix Under Review): OSD: unrecoverable heartbeat connections
There are several unrecoverable heartbeat connections according to logs.
They usually appears after problems/reprodu...
Yury Z
07:05 AM Bug #36758 (Duplicate): aborts in rocksdb::TableFileName() in mimic-x upgrade test suite
Brad Hubbard
05:26 AM Bug #36758: aborts in rocksdb::TableFileName() in mimic-x upgrade test suite
i think it's a dup of #25146 Kefu Chai
02:57 AM Bug #16500 (Fix Under Review): ceph_erasure_code_benchmark parameter checking error for LRC plugin
https://github.com/ceph/ceph/pull/25046 Kefu Chai

11/10/2018

10:01 PM Bug #36758: aborts in rocksdb::TableFileName() in mimic-x upgrade test suite
marking it "urgent", as it can be consistently reproducible. and it renders the cluster unusable after upgrading from... Kefu Chai
06:11 PM Bug #36758 (Duplicate): aborts in rocksdb::TableFileName() in mimic-x upgrade test suite
... Kefu Chai
02:33 PM Backport #36636 (In Progress): luminous: osd: race condition opening heartbeat connection
Nathan Cutler
11:46 AM Backport #36636 (Need More Info): luminous: osd: race condition opening heartbeat connection
The master commit uses std::lock_guard, which is a C++17-ism, and this makes the backport non-trivial (?) Nathan Cutler
12:42 PM Subtask #36091 (Resolved): [rbd top] collect client perf stats when query is enabled
*PR*: https://github.com/ceph/ceph/pull/24265 Jason Dillaman
11:56 AM Backport #36646 (In Progress): luminous: librados api aio tests race condition
Nathan Cutler
11:52 AM Backport #36647 (In Progress): mimic: librados api aio tests race condition
Nathan Cutler
11:40 AM Backport #36637 (In Progress): mimic: osd: race condition opening heartbeat connection
Nathan Cutler
11:38 AM Backport #36556 (In Progress): luminous: RBD client IOPS pool stats are incorrect (2x higher; inc...
Nathan Cutler
11:37 AM Backport #36557 (In Progress): mimic: RBD client IOPS pool stats are incorrect (2x higher; includ...
Nathan Cutler
10:19 AM Backport #36506 (In Progress): luminous: mon osdmap cash too small during upgrade to mimic
Nathan Cutler
10:05 AM Backport #36505 (In Progress): mimic: mon osdmap cash too small during upgrade to mimic
Nathan Cutler
09:59 AM Backport #36436 (In Progress): luminous: rados rm --force-full is blocked when cluster is in full...
Nathan Cutler
09:54 AM Backport #36435 (In Progress): mimic: rados rm --force-full is blocked when cluster is in full st...
Nathan Cutler
09:02 AM Backport #36433 (In Progress): mimic: monstore tool rebuild does not generate creating_pgs
Nathan Cutler

11/09/2018

10:08 PM Bug #36667: OSD object_map sync returned error
Check dmesg for hardware errors, this is leveldb/rocksdb returning an error writing to disk. You may want to ask the ... Josh Durgin
10:05 PM Bug #36677 (Resolved): /usr/include/rados/buffer.h:657:61: error: expected ',' before ')' token
Josh Durgin
10:05 PM Bug #36732 (Fix Under Review): tools/rados: fix segmentation fault
https://github.com/ceph/ceph/pull/24990 Josh Durgin
08:55 PM Bug #36610 (Resolved): filestore merge collection replay problem
Sage Weil
08:54 PM Bug #36748 (New): ms_deliver_verify_authorizer no AuthAuthorizeHandler found for protocol 0
... Sage Weil
05:18 PM Bug #36746 (New): Ignore osd_find_best_info_ignore_history_les for erasure-coded PGs

The only case that osd_find_best_info_ignore_history_les would work for erasure coded pools is if an interval didn'...
David Zafman
09:29 AM Bug #36741 (Resolved): debian: packaging need to reflect move of /etc/bash_completion.d/radosgw-a...
Hi,
Between version 12.0.2 and 12.0.3, the file /etc/bash_completion.d/radosgw-admin moved from the radosgw packag...
Matthew Vernon

11/08/2018

11:34 PM Bug #36739: ENOENT in collection_move_rename on EC backfill target
we create a gen object normally, on a backfill target,... Sage Weil
10:25 PM Bug #36739: ENOENT in collection_move_rename on EC backfill target
Sage Weil
10:24 PM Bug #36739 (Resolved): ENOENT in collection_move_rename on EC backfill target
... Sage Weil
09:13 PM Feature #36737: Allow multi instances of "make tests" on the same machine
@Kefu pls take a look, IIRC you mentioned that this may not be a big effort. Yuri Weinstein
09:12 PM Feature #36737 (Resolved): Allow multi instances of "make tests" on the same machine
Currently it's only possible to run `...make; make tests -j8; ctest ...` on the same machine.
Please consider chan...
Yuri Weinstein
10:02 AM Bug #36732 (Resolved): tools/rados: fix segmentation fault
when connected to ceph cluster, if call exit(1) directly, will
cause the finisher thread segmentation fault as follo...
Li Wang

11/07/2018

11:37 PM Feature #24917: Gracefully deal with upgrades when bluestore skipping of data_digest becomes active

Josh, this code needs to be written. It needs a feature bit AND a mon flag that can only be set when all OSDs are ...
David Zafman
10:07 PM Backport #36729 (Resolved): mimic: Add support for osd_delete_sleep configuration value
https://github.com/ceph/ceph/pull/25507 David Zafman
10:06 PM Feature #36474 (Pending Backport): Add support for osd_delete_sleep configuration value
David Zafman
04:40 PM Bug #36686: osd: pg log hard limit can cause crash during upgrade
Tests added:
https://github.com/ceph/ceph/pull/24954
https://github.com/ceph/ceph/pull/24938
Yuri Weinstein
04:27 PM Bug #36725 (Closed): luminous: Apparent Memory Leak in OSD
Since last update (late October), been experiencing apparent memory leak in OSD process on two ceph servers in small ... John Jaser
11:44 AM Backport #36432 (In Progress): mimic: Interactive mode CLI prints no output since Mimic
Nathan Cutler
11:42 AM Backport #35843 (In Progress): mimic: objecter cannot resend split-dropped op when racing with co...
Nathan Cutler

11/06/2018

01:22 PM Bug #20798: LibRadosLockECPP.LockExclusiveDurPP gets EEXIST
/a/sage-2018-11-05_22:04:25-rados-wip-sage3-testing-2018-11-05-1406-distro-basic-smithi/3227352 Sage Weil
11:54 AM Support #36326: Huge traffic spike and assert(is_primary())
Thanks for the answer! It looks like traffic spike was caused by another issue: ceph-mon's db grows up to 15GB and it... Aleksei Zakharov
10:07 AM Bug #36709 (Closed): OSD stuck while flushing rocksdb WAL
Hi all,
We use:
ceph version 12.2.8 (ae699615bac534ea496ee965ac6192cb7e0e07c0) luminous (stable)
Clients work on:
...
Aleksei Zakharov
01:30 AM Bug #36686: osd: pg log hard limit can cause crash during upgrade
Quoting my reply to ceph-devel for reference:
"Nathan, I don't think we want to revert it for 13.2.2.
This is b...
Neha Ojha

11/05/2018

10:42 PM Bug #22902 (Resolved): src/osd/PG.cc: 6455: FAILED assert(0 == "we got a bad state machine event")
Nathan Cutler
10:32 PM Bug #36686: osd: pg log hard limit can cause crash during upgrade
So, the luminous revert was merged. Neha, will there be a mimic revert as well? Since the pg hard limit patches are p... Nathan Cutler
10:13 PM Bug #36686: osd: pg log hard limit can cause crash during upgrade
https://github.com/ceph/ceph/pull/24903 merged Yuri Weinstein
10:28 PM Bug #36508 (Resolved): gperftools-libs-2.6.1-1 or newer required for binaries linked against corr...
Nathan Cutler
10:28 PM Backport #36552 (Resolved): luminous: gperftools-libs-2.6.1-1 or newer required for binaries link...
Nathan Cutler
10:10 PM Backport #36552: luminous: gperftools-libs-2.6.1-1 or newer required for binaries linked against ...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24706
merged
Yuri Weinstein
10:25 PM Bug #34541 (Resolved): deep scrub cannot find the bitrot if the object is cached
Nathan Cutler
10:25 PM Backport #35067 (Resolved): luminous: deep scrub cannot find the bitrot if the object is cached
Nathan Cutler
10:08 PM Backport #35067: luminous: deep scrub cannot find the bitrot if the object is cached
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/24802
merged
Yuri Weinstein
10:18 PM Backport #36678 (Resolved): luminous: src/osd/PG.cc: 6455: FAILED assert(0 == "we got a bad state...
David Zafman
05:20 PM Feature #24917: Gracefully deal with upgrades when bluestore skipping of data_digest becomes active
Let's include this with any other feature bit addition. David Zafman
01:30 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
> I suspect it shouldn't.
But it does exactly that.
> That's will only re-copy the data to the HEAD revision.
...
Vitaliy Filippov

11/04/2018

06:55 PM Bug #36677 (Fix Under Review): /usr/include/rados/buffer.h:657:61: error: expected ',' before ')'...
A fix is already available. See Sage's PR: https://github.com/ceph/ceph/pull/24835. Radoslaw Zarzynski

11/03/2018

11:27 PM Bug #24923 (Resolved): doc: http://docs.ceph.com/docs/mimic/rados/operations/pg-states/
Nathan Cutler
11:27 PM Backport #25055 (Resolved): mimic: doc: http://docs.ceph.com/docs/mimic/rados/operations/pg-states/
Nathan Cutler
11:26 PM Backport #35071 (In Progress): mimic: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor...
Nathan Cutler
04:42 AM Backport #23670 (In Progress): luminous: auth: ceph auth add does not sanity-check caps
Kefu Chai
04:24 AM Backport #23670 (New): luminous: auth: ceph auth add does not sanity-check caps
Kefu did the jewel backport, so assigning this to him in hopes he'll pick it up. Nathan Cutler
04:00 AM Bug #36686: osd: pg log hard limit can cause crash during upgrade
-Also, is this bug reproducible in master and mimic as well? If not, the Backport field should probably be modified..... Nathan Cutler
03:58 AM Bug #36686: osd: pg log hard limit can cause crash during upgrade
Neha, 12.2.9 has already been cut, so we'll need to expedite 12.2.10 to push the revert out to users. Nathan Cutler
03:52 AM Backport #36678 (In Progress): luminous: src/osd/PG.cc: 6455: FAILED assert(0 == "we got a bad st...
Nathan Cutler

11/02/2018

11:57 PM Bug #36686: osd: pg log hard limit can cause crash during upgrade
The immediate fix is to revert this for luminous before 12.2.9: https://github.com/ceph/ceph/pull/24903
Neha Ojha
11:51 PM Bug #36686 (Resolved): osd: pg log hard limit can cause crash during upgrade
During an upgrade from an earlier version, a primary running the new code will send a trim_to value to a replica that... Josh Durgin
05:14 PM Bug #36677: /usr/include/rados/buffer.h:657:61: error: expected ',' before ')' token
Ceph has already moved to C++17. The main question is: have we transitioned to C++17 also our public headers xor put ... Radoslaw Zarzynski
04:58 PM Bug #36677: /usr/include/rados/buffer.h:657:61: error: expected ',' before ')' token
The no-message-taking-variant of *static_assert* has been introduced in C++17. The code is being compiled with *-std=... Radoslaw Zarzynski
04:55 PM Bug #36677 (In Progress): /usr/include/rados/buffer.h:657:61: error: expected ',' before ')' token
Radoslaw Zarzynski
05:14 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
Back-and-forth question answering like this is probably better for the mailing list (the ticket is currently closed F... Jason Dillaman
04:57 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
since you've identified that this is an RBD workload, assigning it to that project so that RBD team notices it. HTH. Ben England
02:37 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
Oops. That's more than 2 questions. But anyway :) Vitaliy Filippov
02:36 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
OK, I looked into OSD datastore using ceph-objectstore-tool and I see that for almost every object there are two copi... Vitaliy Filippov
01:39 PM Bug #24835: osd daemon spontaneous segfault
We do use some configuration set by "ceph config set" or "ceph config-key set":... Soenke Schippmann

11/01/2018

11:46 PM Backport #36678 (Resolved): luminous: src/osd/PG.cc: 6455: FAILED assert(0 == "we got a bad state...
https://github.com/ceph/ceph/pull/24902 David Zafman
11:19 PM Bug #22902 (Pending Backport): src/osd/PG.cc: 6455: FAILED assert(0 == "we got a bad state machin...
Based on similar failures seen in luminous: http://pulpito.ceph.com/yuriw-2018-10-31_22:45:22-rados-wip-yuri4-testing... Neha Ojha
09:10 PM Bug #36677: /usr/include/rados/buffer.h:657:61: error: expected ',' before ')' token
... Neha Ojha
09:06 PM Bug #36677 (Resolved): /usr/include/rados/buffer.h:657:61: error: expected ',' before ')' token
... Neha Ojha
04:44 PM Bug #36289: Converting Filestore OSD from leveldb to rocksdb backend on CentOS
Looking through the ceph/rocksdb repo I don't see how it's possible for rocksdb to be compiled without snappy support... David Turner
03:35 PM Bug #36289: Converting Filestore OSD from leveldb to rocksdb backend on CentOS
This seems to be a problem where rocksdb on CentOS doesn't support snappy compression but the ceph-kvstore-tool is co... David Turner
06:14 AM Bug #36667 (New): OSD object_map sync returned error
i deploy a cephfs and the used the vdbench tool to wirte data in cephfs mount point,after a while osd appears down.
...
yp dai

10/31/2018

09:21 PM Bug #36411 (Closed): OSD crash starting recovery/backfill with EC pool
It's my current belief that these objects were broken as a result of intentional metadata manipulation when some of t... Greg Farnum
09:18 PM Bug #36572: ceph-in: --connect-timeout doesn't work while pinging mon
New PR: https://github.com/ceph/ceph/pull/24733 Greg Farnum
09:17 PM Support #36584 (Closed): OSD Anomaly behaviour in ceph-reweight
Are you running the command repeatedly? reweight-by-utilization does not provide a stable balance; it's really just a... Greg Farnum
08:43 PM Bug #21496: doc: Manually editing a CRUSH map, Word 'type' missing.
https://github.com/ceph/ceph/pull/24868 Sage Weil
05:35 PM Feature #36661: osd: add sanity check on startup to compare osd memory target to available memory...
- in OSD::handle_conf_change, we should sanity check this against current memory available on the system and refuse t... Sage Weil
04:59 PM Feature #36661 (New): osd: add sanity check on startup to compare osd memory target to available ...
This is needed so that we do not fail due to osd_memomory_target being set too high compared to the amount of memory ... Neha Ojha
11:42 AM Backport #36658 (Resolved): mimic: Cache-tier forward mode hang in luminous (again)
https://github.com/ceph/ceph/pull/25075 Nathan Cutler
11:42 AM Backport #36657 (Resolved): luminous: Cache-tier forward mode hang in luminous (again)
https://github.com/ceph/ceph/pull/25074 Nathan Cutler

10/30/2018

08:08 PM Bug #36345 (Resolved): librados C API aio read empty buffer
Sage Weil
08:07 PM Bug #36406 (Pending Backport): Cache-tier forward mode hang in luminous (again)
Sage Weil
05:16 PM Backport #36647 (Resolved): mimic: librados api aio tests race condition
https://github.com/ceph/ceph/pull/25027 Patrick Donnelly
05:16 PM Backport #36646 (Resolved): luminous: librados api aio tests race condition
https://github.com/ceph/ceph/pull/25028 Patrick Donnelly
05:14 PM Backport #36637 (Resolved): mimic: osd: race condition opening heartbeat connection
https://github.com/ceph/ceph/pull/25026 Patrick Donnelly
05:14 PM Backport #36636 (Resolved): luminous: osd: race condition opening heartbeat connection
https://github.com/ceph/ceph/pull/25035 Patrick Donnelly
04:06 PM Bug #36634 (New): LibRadosWatchNotify.WatchNotify2Timeout failure
... Sage Weil
03:33 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
Yes, I'm using EC with RBD and partial overwrites enabled. CephFS pools are only created recently for tests and do no... Vitaliy Filippov
01:05 PM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
How are you writing these objects? Most sites that used EC were using RGW, but I don't see all the pools that go wit... Ben England
10:31 AM Support #36614: Cluster uses substantially more space after rebalance (erasure codes)
In fact it doesn't seem that it will self-heal, and nobody seems to care about it in the mailing list by now...)
C...
Vitaliy Filippov
02:33 PM Bug #36631 (In Progress): potential deadlock in PG::_scan_snaps when repairing snap mapper
If during a pg scrub a snap mapper error is detected in PG::_scan_snaps, on repair `ObjectStore::apply_transactions` ... Mykola Golub
02:28 PM Backport #36630 (Resolved): luminous: potential deadlock in PG::_scan_snaps when repairing snap m...
If during a pg scrub a snap mapper error is detected in PG::_scan_snaps, on repair `ObjectStore::apply_transactions` ... Mykola Golub
02:00 PM Bug #36629 (New): osd:the new file was stored in cache pool which mode was none
ceph version:13.2.1
kernel client 4.17
I created the cache data pool as ceph's instructions:
(1) ceph osd tier add...
qinglong li
01:41 AM Bug #36620: osd:the vim will be hanged when I saved the file
the client: 4.17 kernel client qinglong li
01:36 AM Bug #36620 (New): osd:the vim will be hanged when I saved the file
ceph version: 13.2.1
situtation: the data pool tiered by a cache data pool and the cache tier pool's mode was read...
qinglong li
 

Also available in: Atom