Activity

From 08/03/2018 to 09/01/2018

09/01/2018

08:49 PM Bug #22544 (Fix Under Review): objecter cannot resend split-dropped op when racing with con reset
https://github.com/ceph/ceph/pull/23850 Sage Weil
08:43 PM Bug #22544: objecter cannot resend split-dropped op when racing with con reset
Here, it happened:... Sage Weil
07:20 AM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
Some steps tried to reproduce the bug:
1. Create a luminous cluster running in Kubernetes using hostNetwork and th...
Dexter John Genterone

08/31/2018

10:08 PM Bug #35076 (Resolved): mon: mgr options not parse propertly
... Sage Weil
05:17 PM Bug #35075 (New): copy-get stuck sending osd_op
... Sage Weil
11:07 AM Backport #35071 (Resolved): mimic: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::p...
https://github.com/ceph/ceph/pull/24918 Nathan Cutler
11:06 AM Backport #35068 (Resolved): mimic: deep scrub cannot find the bitrot if the object is cached
https://github.com/ceph/ceph/pull/23873 Nathan Cutler
11:06 AM Backport #35067 (Resolved): luminous: deep scrub cannot find the bitrot if the object is cached
https://github.com/ceph/ceph/pull/24802 Nathan Cutler
08:53 AM Bug #34541 (Pending Backport): deep scrub cannot find the bitrot if the object is cached
https://github.com/ceph/ceph/pull/23629 Kefu Chai
08:53 AM Bug #34541 (Resolved): deep scrub cannot find the bitrot if the object is cached
quote from https://github.com/ceph/ceph/pull/23629
> Say a object who has data caches, but in a while later, cache...
Kefu Chai

08/30/2018

03:20 PM Backport #34532 (Resolved): mimic: force-create-pg broken
https://github.com/ceph/ceph/pull/23872 Nathan Cutler
01:53 PM Bug #26940 (Pending Backport): force-create-pg broken
Sage Weil
12:06 PM Bug #34529 (Resolved): cbt tests in rados qa suite fails
seems http://drop.ceph.com/qa/cosbench-0.4.2.c3.1.zip is not reachable anymore.... Kefu Chai
05:10 AM Backport #26992 (In Progress): luminous: discover_all_missing() not always called during activating
https://github.com/ceph/ceph/pull/23817 Prashant D

08/29/2018

09:51 PM Bug #25076 (Duplicate): MON crash when upgrading luminous v12.2.7 -> mimic v13.2.0 during ceph-fu...
Sage Weil
09:29 PM Bug #34321 (New): OSD crash because of DBObjectMap.cc: 662: FAILED assert(state.legacy)
Version: 12.2.7
The following crash is observed during normal operation of the cluster, so no particular steps to ...
Maks Kowalik
08:08 PM Bug #27988: Warn if queue of scrubs ready to run exceeds some threshold

I want to fix 3 things here. First, user-submitted scrubs are queued as due to occur immediately, but overdue sc...
David Zafman
05:25 PM Bug #24612 (Pending Backport): FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune...
Sage Weil
03:13 PM Bug #26994 (Resolved): test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) f...
Kefu Chai

08/28/2018

08:23 PM Bug #24033 (Resolved): rados: not all exceptions accept keyargs
Nathan Cutler
08:22 PM Backport #25178 (Resolved): mimic: rados: not all exceptions accept keyargs
Nathan Cutler
07:53 PM Backport #25178: mimic: rados: not all exceptions accept keyargs
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/23335
merged
Yuri Weinstein
12:42 PM Bug #33561 (New): PG repair doesn't start on an inconsistent group
Version: 12.2.7
Issue timeline:
1. Deep-scrub discovered inconsistency in one group on a pool with 4 replicas - the ...
Maks Kowalik
12:33 PM Bug #33420 (New): Forced deep-scrub doesn't start
Version: 12.2.7
Issue timeline:
1. Cyclic deep-scrub discovered inconsistency:
2018-08-23 17:21:07.933458 osd....
Maks Kowalik
11:11 AM Backport #32108 (Resolved): mimic: object errors found in be_select_auth_object() aren't logged t...
https://github.com/ceph/ceph/pull/23870 Nathan Cutler
11:11 AM Backport #32106 (Resolved): luminous: object errors found in be_select_auth_object() aren't logge...
https://github.com/ceph/ceph/pull/23871 Nathan Cutler
05:23 AM Bug #27988: Warn if queue of scrubs ready to run exceeds some threshold
Talking with Sage, he believes there is already a warning status if you have scrubs that haven't run for more than 2x... David Turner

08/27/2018

09:21 PM Bug #20775 (Resolved): ceph_test_rados parameter error
Brad Hubbard
07:55 PM Bug #25182: Upmaps forgotten after restarting OSDs
I believe these log messages explain why the upmaps are being removed, but I'll attach the relevant section of the lo... Bryan Stillwell
06:39 PM Bug #25182: Upmaps forgotten after restarting OSDs
Bryan Stillwell wrote:
> What debugging logs would be helpful in figuring this out? I just restarted an OSD on my 1...
Sage Weil
06:07 PM Bug #25182: Upmaps forgotten after restarting OSDs
What debugging logs would be helpful in figuring this out? I just restarted an OSD on my 13.2.1-based cluster and al... Bryan Stillwell
06:44 PM Bug #23576: osd: active+clean+inconsistent pg will not scrub or repair
Created tracker https://tracker.ceph.com/issues/27988 to add warning about too many scrubs pending. David Zafman
04:26 PM Bug #23576: osd: active+clean+inconsistent pg will not scrub or repair
David Turner wrote:
> I came across this again as well and I did some more testing. As it turns out what resolved t...
David Turner
04:26 PM Bug #23576: osd: active+clean+inconsistent pg will not scrub or repair
I came across this again as well and I did some more testing. As it turns out, what resolved this issue for me was inc... David Turner
01:33 PM Bug #23576: osd: active+clean+inconsistent pg will not scrub or repair
Hi - we are still experiencing this issue on 12.2.7 (so latest Luminous version)... Jacek S.
06:43 PM Bug #27988 (Rejected): Warn if queue of scrubs ready to run exceeds some threshold

The sched_scrub_pg set could be scanned during a new insert and the number of scrubs that are ready to be run could...
David Zafman
05:18 PM Bug #27985 (Resolved): force-backfill sets forced_recovery instead of forced_backfill in 13.2.1
I've noticed that using force-backfill in Mimic seems to be broken. It sets forced_recovery instead of forced_backfi... Bryan Stillwell
04:17 AM Support #27203: osd down while bucket is deleting
Actually, this issue still bothers me
-2> 2018-08-23 16:14:52.673287 7f3aeb536700 1 heartbeat_map is_healthy 'OS...
伟杰 谭

08/26/2018

12:50 PM Bug #24612: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
https://github.com/ceph/ceph/pull/23742
Currently missing: a reproducer. Reproducing may not be trivial because th...
Joao Eduardo Luis

08/25/2018

08:42 PM Bug #27363 (New): 'rbd rm' does not clean tiered pool completly
mimic (13.2.1)
linux kernel: 4.18.3-1.el7.elrepo.x86_64
ceph osd crush rule create-replicated hddreplrule default...
Fyodor Ustinov
05:26 PM Bug #27362 (New): Wrong erasure pool MAX AVAIL size calculation with technique=reed_sol_r6_op
... Fyodor Ustinov
05:53 AM Bug #24022: "ceph tell osd.x bench" writes resulting JSON to stderr instead of stdout.
luminous backport https://github.com/ceph/ceph/pull/23680 Konstantin Shalygin

08/24/2018

05:14 PM Bug #25084 (Resolved): Attempt to read object that can't be repaired loops forever
David Zafman
05:13 PM Bug #25108 (Pending Backport): object errors found in be_select_auth_object() aren't logged the same
David Zafman
05:12 PM Bug #24801: PG num_bytes becomes huge

So far with assert added to object_stat_sum_t::add() we saw this. Still not sure why the num_bytes is off.
<pr...
David Zafman
12:54 PM Bug #24612 (In Progress): FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
Joao Eduardo Luis
02:00 AM Backport #26931 (In Progress): mimic: scrub livelock
https://github.com/ceph/ceph/pull/23722 Prashant D

08/23/2018

09:22 PM Backport #27213 (Resolved): mimic: libradosstriper conditional compile
https://github.com/ceph/ceph/pull/23869 Nathan Cutler
09:21 PM Backport #27212 (Resolved): mimic: rpm: should change ceph-mgr package depency from py-bcrypt to ...
https://github.com/ceph/ceph/pull/23868 Nathan Cutler
09:20 PM Bug #25057 (Resolved): jewel->luminous: osdmap crc mismatch
Nathan Cutler
09:20 PM Backport #25101 (Resolved): mimic: jewel->luminous: osdmap crc mismatch
Nathan Cutler
11:31 AM Feature #22750 (Pending Backport): libradosstriper conditional compile
Nathan Cutler
11:21 AM Feature #22750 (Resolved): libradosstriper conditional compile
Kefu Chai
11:28 AM Bug #27206 (Pending Backport): rpm: should change ceph-mgr package depency from py-bcrypt to pyth...
https://github.com/ceph/ceph/pull/23648 Kefu Chai
11:27 AM Bug #27206 (Resolved): rpm: should change ceph-mgr package depency from py-bcrypt to python2-bcrypt
The current dependency list of the ceph-mgr rpm package contains a py-bcrypt dependency, which conflicts with python2-bcrypt needed for pyt... Kefu Chai
11:23 AM Bug #26998 (Resolved): IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
Kefu Chai
08:19 AM Support #27203: osd down while bucket is deleting
The formatting is ugly, my fault. 伟杰 谭
07:59 AM Support #27203 (New): osd down while bucket is deleting
My environment is
[tanweijie@gz-ceph-52-202 ~]$ ceph --version
ceph version 12.2.5 (cad919881333ac92274171586c827e0...
伟杰 谭

08/22/2018

10:20 PM Feature #26975: Rados level IO priority for OSD operations
Do note that
1) "Messages" can already have priority, although its utility at this point is quite limited it's not t...
Greg Farnum
09:32 PM Bug #26880 (Resolved): ceph-base debian package compiled on ubuntu/xenial has unmet runtime depen...
Nathan Cutler
09:31 PM Backport #26881 (Resolved): mimic: ceph-base debian package compiled on ubuntu/xenial has unmet r...
Nathan Cutler
09:19 PM Bug #26971: failed to become clean before timeout expired
Looks like a PG is in active+undersized state. Maybe the balancer screwed up? Greg Farnum
09:14 PM Backport #24359 (Resolved): mimic: osd: leaked Session on osd.7
Nathan Cutler
09:00 PM Bug #24875 (Resolved): OSD: still returning EIO instead of recovering objects on checksum errors
Nathan Cutler
09:00 PM Backport #25226 (Resolved): mimic: OSD: still returning EIO instead of recovering objects on chec...
Nathan Cutler
08:46 PM Backport #25101: mimic: jewel->luminous: osdmap crc mismatch
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/23226
merged
Yuri Weinstein
05:37 PM Bug #27053: qa: thrashosds: "[ERR] : 2.0 has 1 objects unfound and apparently lost"
Similar failure seen in mimic: /a/yuriw-2018-08-21_23:27:39-rados-wip-yuri5-testing-2018-08-21-2033-mimic-distro-basi... Neha Ojha
03:39 PM Bug #27053 (New): qa: thrashosds: "[ERR] : 2.0 has 1 objects unfound and apparently lost"
This is for 12.2.8
Run: http://pulpito.ceph.com/yuriw-2018-08-21_16:17:40-rados-luminous-distro-basic-smithi/
Job...
Yuri Weinstein
05:26 PM Bug #27055 (New): mimic: FAILED assert((uint64_t)buf.st_size == expected) in SyntheticWorkloadSta...
... Neha Ojha
08:51 AM Bug #24956: osd: parent process need to restart log service after fork, or ceph-osd will not work...
PR:https://github.com/ceph/ceph/pull/23685 Hsiao-Yin Tseng
06:28 AM Bug #26994 (Fix Under Review): test_module_commands (tasks.mgr.test_module_selftest.TestModuleSel...
https://github.com/ceph/ceph/pull/23681 Kefu Chai
03:45 AM Bug #23352 (Resolved): osd: segfaults under normal operation
The patch is only relevant to the osds. Brad Hubbard
02:56 AM Bug #26998: IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
Kefu Chai
02:14 AM Bug #26998: IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
- https://github.com/ceph/dmclock/pull/58
- https://github.com/ceph/ceph/pull/23643
Kefu Chai
02:13 AM Bug #26998 (Resolved): IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
For more details on this issue, please refer to https://github.com/ceph/dmclock/pull/58. In short, if "osd op queue"... Kefu Chai

08/21/2018

08:22 PM Bug #25146 (In Progress): "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:para...
Very early fix: https://github.com/rzarzynski/rocksdb/tree/wip-bug-25146.
The case appears more complicated as the...
Radoslaw Zarzynski
07:58 PM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
https://github.com/ceph/ceph/pull/23490 merged Yuri Weinstein
07:30 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
Something like this will probably fix it... Noah Watkins
06:49 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
Here's the culprit: hello isn't packaged so it can't announce its commands.... Noah Watkins
06:45 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
The manager logs show all the modules except for `hello` being loaded... Noah Watkins
05:55 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
I can't reproduce this... it is as if the monitor has not received a summary of commands from the manager at the ... Noah Watkins
04:39 PM Bug #26994 (Resolved): test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) f...
in https://github.com/ceph/ceph/pull/23558/commits/00223d2364b5a6cc32eb5f83f5a642b5aef2c946 , hello is used for testi... Kefu Chai
04:03 PM Backport #26992 (Resolved): luminous: discover_all_missing() not always called during activating
https://github.com/ceph/ceph/pull/23817 Nathan Cutler
04:01 PM Feature #26975: Rados level IO priority for OSD operations
By "Rados level" I mean the librados API at least, and an implementation in the OSD too. Марк Коренберг
03:59 PM Feature #26975 (New): Rados level IO priority for OSD operations
What I mean:
Suppose a busy Ceph cluster.
Every OSD has many IO requests from clients in its queue. Today, all r...
Марк Коренберг
12:56 AM Bug #26972 (Resolved): cluster [ERR] Error -2 reading object

http://qa-proxy.ceph.com/teuthology/dzafman-2018-08-17_08:14:49-rados-wip-zafman-testing4-distro-basic-smithi/29146...
David Zafman
12:42 AM Bug #26971 (Duplicate): failed to become clean before timeout expired

http://qa-proxy.ceph.com/teuthology/dzafman-2018-08-16_17:35:08-rados:thrash-wip-zafman-testing4-distro-basic-smith...
David Zafman
12:32 AM Bug #26970 (Resolved): src/osd/OSDMap.h: 1065: FAILED assert(__null != pool)

http://qa-proxy.ceph.com/teuthology/dzafman-2018-08-16_17:35:08-rados:thrash-wip-zafman-testing4-distro-basic-smith...
David Zafman

08/20/2018

11:19 PM Bug #22837 (Pending Backport): discover_all_missing() not always called during activating

Based on information from http://lists.ceph.com/pipermail/ceph-users-ceph.com/2017-October/021512.html I'm marking ...
David Zafman
05:53 PM Feature #24232 (Fix Under Review): Add new command ceph mon status
Nathan Cutler

08/19/2018

03:12 PM Feature #26948 (Resolved): librados: add a way to get a count of omap vals in an iterator
https://github.com/ceph/ceph/pull/23593 Kefu Chai
01:58 PM Bug #24485: LibRadosTwoPoolsPP.ManifestUnset failure
/a/kchai-2018-08-19_13:01:23-rados-wip-kefu-testing-2018-08-19-1812-distro-basic-mira/2925024/ Kefu Chai

08/17/2018

09:10 PM Backport #24359: mimic: osd: leaked Session on osd.7
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22339
merged
Yuri Weinstein
02:27 PM Bug #26958 (Resolved): osd/ReplicatedBackend.cc: 1321: FAILED assert(get_parent()->get_log().get_...
... Sage Weil
09:36 AM Bug #26880 (Pending Backport): ceph-base debian package compiled on ubuntu/xenial has unmet runti...
Nathan Cutler
03:20 AM Feature #26955: os/filestore: Add switch to turn on/off filestore dir splitting
https://github.com/ceph/ceph/pull/23460
1. Refined HashIndex::must_split() to be more readable.
2. Introduced a h...
Zhi Zhang
03:19 AM Feature #26955 (New): os/filestore: Add switch to turn on/off filestore dir splitting
We had done pre-split and increased split multiple, etc, at the beginning of building cluster in order to reduce the ... Zhi Zhang
12:16 AM Bug #25108: object errors found in be_select_auth_object() aren't logged the same
David Zafman

08/16/2018

10:46 PM Backport #26870 (Resolved): mimic: osd: segfaults under normal operation
Brad Hubbard
05:58 PM Bug #24612: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
/a/sage-2018-08-15_15:49:39-rados-wip-sage2-testing-2018-08-15-0731-distro-basic-smithi/2908178
Sage Weil

08/15/2018

11:40 PM Bug #25084 (Fix Under Review): Attempt to read object that can't be repaired loops forever
David Zafman
11:35 PM Backport #25227 (Resolved): luminous: OSD: still returning EIO instead of recovering objects on c...
David Zafman
02:36 PM Feature #26948 (Resolved): librados: add a way to get a count of omap vals in an iterator
We currently have functions like rados_read_op_omap_get_vals2 that hand back an iterator to a userland caller. There ... Jeff Layton

08/14/2018

10:43 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
Generally yes, but I haven't been able to reproduce it to test a solution. I take it this has happened to you?
I'm h...
Sage Weil
01:34 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
Guys, is there a way for an OSD to recover from this error? Kuba Stańczak
09:58 PM Bug #26947 (Resolved): ENOENT on collection_move_rename from divergent activate
... Sage Weil
04:56 PM Bug #26940 (Fix Under Review): force-create-pg broken
https://github.com/ceph/ceph/pull/23572 Sage Weil
03:53 PM Bug #26940 (Resolved): force-create-pg broken
This commit -
https://github.com/ceph/ceph/commit/7797ed67d2f9140b7eb9f182b06d04233e1e309c
has introduced regressio...
Sage Weil
04:33 AM Backport #26908 (Need More Info): luminous: kv: MergeOperator name() returns string, and caller c...
Prashant D
04:33 AM Backport #26908 (In Progress): luminous: kv: MergeOperator name() returns string, and caller call...
https://github.com/ceph/ceph/pull/23566 Prashant D

08/13/2018

06:46 PM Backport #26932 (Resolved): luminous: scrub livelock
https://github.com/ceph/ceph/pull/24396 (initial backport)
https://github.com/ceph/ceph/pull/24659 (follow-on fix)
Nathan Cutler
06:46 PM Backport #26931 (Resolved): mimic: scrub livelock
https://github.com/ceph/ceph/pull/23722 Nathan Cutler
06:01 PM Bug #26890 (Pending Backport): scrub livelock
Sage Weil
07:38 AM Bug #20059: miscounting degraded objects
Just adding another reference to #21803 here — this fix was meant to fix that issue as well, which it apparently did ... Florian Haas
03:14 AM Bug #23352: osd: segfaults under normal operation
Brad Hubbard wrote:
> I've created a test package here based on 12.2.7 and including the one line patch above.
>
...
lin zhou
12:59 AM Feature #24232: Add new command ceph mon status
PR: https://github.com/ceph/ceph/pull/23525 Hsiao-Yin Tseng

08/12/2018

10:32 PM Backport #26871 (Resolved): luminous: osd: segfaults under normal operation
Brad Hubbard
09:16 PM Backport #26910 (Resolved): luminous: PGLog.cc: saw valgrind issues while accessing complete_to->...
https://github.com/ceph/ceph/pull/23211 Patrick Donnelly
09:16 PM Backport #26909 (Resolved): mimic: PGLog.cc: saw valgrind issues while accessing complete_to->ver...
https://github.com/ceph/ceph/pull/23403 Patrick Donnelly
09:16 PM Backport #26908 (Resolved): luminous: kv: MergeOperator name() returns string, and caller calls c...
https://github.com/ceph/ceph/pull/23566 Patrick Donnelly
09:16 PM Backport #26907 (Resolved): mimic: kv: MergeOperator name() returns string, and caller calls c_st...
https://github.com/ceph/ceph/pull/23865 Patrick Donnelly
08:38 PM Bug #21592: LibRadosCWriteOps.CmpExt got 0 instead of -4095-1
/a/sage-2018-08-11_18:40:58-rados-wip-sage-testing-2018-08-11-1120-distro-basic-smithi/2893875... Sage Weil

08/10/2018

08:13 PM Bug #23352: osd: segfaults under normal operation
https://github.com/ceph/ceph/pull/23459 merged Yuri Weinstein
04:54 AM Bug #12615: Repair of Erasure Coded pool with an unrepairable object causes pg state to lose clea...
In a replicated case in which all copies are bad, a rep_repair_primary_object() can cause loss of clean and ins... David Zafman
04:44 AM Bug #25084: Attempt to read object that can't be repaired loops forever
I don't think we should backport this change. In Luminous and possibly upgraded to Mimic there is a possibility that... David Zafman
12:01 AM Bug #25084: Attempt to read object that can't be repaired loops forever
https://github.com/ceph/ceph/pull/23518 David Zafman
02:54 AM Bug #26875 (Pending Backport): kv: MergeOperator name() returns string, and caller calls c_str() ...
Kefu Chai
02:47 AM Bug #24485: LibRadosTwoPoolsPP.ManifestUnset failure
/a/kchai-2018-08-09_12:29:04-rados-wip-kefu-testing-2018-08-08-1144-distro-basic-smithi/2885459/ Kefu Chai
12:04 AM Bug #19753: Deny reservation if expected backfill size would put us over backfill_full_ratio
https://github.com/ceph/ceph/pull/22797 David Zafman

08/09/2018

11:52 PM Bug #25084 (In Progress): Attempt to read object that can't be repaired loops forever
What I actually ran into is that when do_read() fails because of the CRC mismatch, the recovery repair can pull from ... David Zafman
07:56 PM Backport #24333 (In Progress): luminous: local_reserver double-reservation of backfilled pg
PR: https://github.com/ceph/ceph/pull/23493 Victor Denisov
06:37 PM Feature #21366 (Resolved): tools/ceph-objectstore-tool: split filestore directories offline to ta...
David Zafman
06:37 PM Backport #24845 (Resolved): luminous: tools/ceph-objectstore-tool: split filestore directories of...
David Zafman
02:00 PM Bug #26891 (New): backfill reservation deadlock/stall

on backfill target:
- get backfill request, queue RequestBackfillPrio...
Sage Weil
01:34 PM Bug #26890: scrub livelock
https://github.com/ceph/ceph/pull/23512 Sage Weil
01:32 PM Bug #26890 (Resolved): scrub livelock
- both osds locally reserve a scrub slot
- both osds send a scrub schedule request
- both scrub requests are reject...
Sage Weil
08:03 AM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
Full info for ceph-base package:... Piotr Dalek
08:00 AM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
Tried on fresh Ubuntu 16.04 vm to build Ceph packages for master branch, resulting .debs still depend on libstdc++6 (... Piotr Dalek
07:57 AM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
As per Piotr Dałek, we can reproduce this issue on master even with the fix. Kefu Chai

08/08/2018

09:42 PM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
I think we need to fix this sooner rather than later. My suggestion is to incorporate enough of the original rocksdb... Sage Weil
09:10 PM Bug #26878 (Closed): `osd destroy` command hangs
NOTABUG. :)
Presumably will have to update the ceph-volume tests but the louder notification PR is well on its way t...
Greg Farnum
06:17 PM Bug #26878: `osd destroy` command hangs
master PR https://github.com/ceph/ceph/pull/23492 Alfredo Deza
12:03 PM Bug #26878 (Closed): `osd destroy` command hangs
Running latest master without a manager daemon makes `osd destroy` commands hang.
ceph version 14.0.0-1906-g637bb2...
Alfredo Deza
06:52 PM Feature #1126 (Rejected): crush: extend rule definition
Actually, you can do the above: just set size=3 and you'll get 2 in the first rack and 1 in the second rack. Sage Weil
06:49 PM Feature #85 (Fix Under Review): osd: pg_num shrink
https://github.com/ceph/ceph/pull/20469 Sage Weil
06:33 PM Feature #84 (In Progress): mon: auto adjust pg_num as pool grows
Sage Weil
05:16 PM Backport #24845: luminous: tools/ceph-objectstore-tool: split filestore directories offline to ta...
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/23418
merged
Yuri Weinstein
12:50 PM Backport #26881 (In Progress): mimic: ceph-base debian package compiled on ubuntu/xenial has unme...
Kefu Chai
12:44 PM Backport #26881 (Resolved): mimic: ceph-base debian package compiled on ubuntu/xenial has unmet r...
https://github.com/ceph/ceph/pull/23490 Kefu Chai
12:44 PM Bug #26880 (Pending Backport): ceph-base debian package compiled on ubuntu/xenial has unmet runti...
https://github.com/ceph/ceph/pull/22990
https://github.com/ceph/ceph/pull/23432
Kefu Chai
12:30 PM Bug #26880 (Resolved): ceph-base debian package compiled on ubuntu/xenial has unmet runtime depen...
... Kefu Chai
08:34 AM Backport #26839 (In Progress): mimic: librados application's symbol could conflict with the libce...
-https://github.com/ceph/ceph/pull/23484- Prashant D
08:32 AM Backport #26840 (In Progress): luminous: librados application's symbol could conflict with the li...
https://github.com/ceph/ceph/pull/23483 Prashant D
03:35 AM Bug #25209 (Resolved): cls/test_cls_numops.sh aborts
Kefu Chai
01:53 AM Bug #26875 (Fix Under Review): kv: MergeOperator name() returns string, and caller calls c_str() ...
https://github.com/ceph/ceph/pull/23477 Kefu Chai

08/07/2018

11:06 PM Bug #23857: flush (manifest) vs async recovery causes out of order op
/a/yuriw-2018-08-06_20:38:17-rados-wip_master_8_6_2018-distro-basic-smithi/2873966/
the order of events here:
<...
Neha Ojha
10:02 PM Bug #26875 (Resolved): kv: MergeOperator name() returns string, and caller calls c_str() on the t...
On Tue, 7 Aug 2018, Réka Nikolett Kovács wrote:
> Hi,
>
> I am working on a bug finding tool that looks for a ...
Sage Weil
07:26 PM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
ubuntu@mastercontroller01:~$ ceph -s
cluster:
id: dc00b525-7dca-435a-bfa6-c0b9b216e1f2
health: HEALT...
Dexter John Genterone
07:24 PM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
attaching new osd log. Dexter John Genterone
06:12 PM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
We've encountered this again when we were adding a new OSD. Couldn't get the gdb as there was none installed and the ... Dexter John Genterone
06:21 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
attaching OSD log. Dexter John Genterone
06:19 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
We encountered this issue after trying out a patch for https://tracker.ceph.com/issues/21142.
Is it safe to bypas...
Dexter John Genterone
08:50 AM Bug #25108 (Fix Under Review): object errors found in be_select_auth_object() aren't logged the same
https://github.com/ceph/ceph/pull/23376/ Kefu Chai
03:32 AM Bug #26868 (Pending Backport): PGLog.cc: saw valgrind issues while accessing complete_to->version
Neha Ojha
02:02 AM Backport #26871 (In Progress): luminous: osd: segfaults under normal operation
https://github.com/ceph/ceph/pull/23459 Brad Hubbard
01:25 AM Backport #26871 (Resolved): luminous: osd: segfaults under normal operation
https://github.com/ceph/ceph/pull/23459 Brad Hubbard
02:01 AM Backport #26870 (In Progress): mimic: osd: segfaults under normal operation
https://github.com/ceph/ceph/pull/23458 Brad Hubbard
01:24 AM Backport #26870 (Resolved): mimic: osd: segfaults under normal operation
https://github.com/ceph/ceph/pull/23458 Brad Hubbard

08/06/2018

08:30 PM Backport #24495: luminous: osd: segv in Session::have_backoff
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22729
merged
Yuri Weinstein
08:24 PM Backport #24501: luminous: osd: eternal stuck PG in 'unfound_recovery'
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22546
merged
Yuri Weinstein
06:49 PM Bug #24613: luminous: rest/test.py fails with expected 200, got 400
This one looks similar.
/a/yuriw-2018-08-03_19:54:05-rados-wip-yuri-testing-2018-08-03-1639-luminous-distro-basic-...
Neha Ojha
06:46 PM Bug #26868 (Fix Under Review): PGLog.cc: saw valgrind issues while accessing complete_to->version
https://github.com/ceph/ceph/pull/23450 Neha Ojha
06:35 PM Bug #26868 (In Progress): PGLog.cc: saw valgrind issues while accessing complete_to->version
Neha Ojha
06:28 PM Bug #26868 (Resolved): PGLog.cc: saw valgrind issues while accessing complete_to->version
This occurred during a rados run of https://tracker.ceph.com/issues/24988. This failure has not been seen on master o... Neha Ojha
02:52 PM Bug #23352 (Pending Backport): osd: segfaults under normal operation
Kefu Chai
02:51 PM Bug #24875 (Pending Backport): OSD: still returning EIO instead of recovering objects on checksum...
Kefu Chai

08/04/2018

09:58 PM Bug #24174: PrimaryLogPG::try_flush_mark_clean mixplaced ctx release
This was seen in luminous. Could this be related?... Neha Ojha

08/03/2018

11:45 PM Feature #24917: Gracefully deal with upgrades when bluestore skipping of data_digest becomes active

We need to wait to turn off data_digest once all OSDs are running bluestore AND we must disallow a filestore OSD to...
David Zafman
10:42 PM Bug #23492 (Resolved): Abort in OSDMap::decode() during qa/standalone/erasure-code/test-erasure-e...
David Zafman
10:42 PM Backport #24864 (Resolved): luminous: Abort in OSDMap::decode() during qa/standalone/erasure-code...
David Zafman
03:11 PM Backport #24864: luminous: Abort in OSDMap::decode() during qa/standalone/erasure-code/test-erasu...
Patrick Donnelly wrote:
> https://github.com/ceph/ceph/pull/23025
merged
Yuri Weinstein
10:40 PM Feature #24949 (Resolved): luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_digest
David Zafman
10:39 PM Backport #25128 (Resolved): mimic: Allow scrub to fix Luminous 12.2.6 corruption of data_digest
David Zafman
10:39 PM Backport #26841 (Closed): mimic: luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_...
David Zafman
04:02 PM Backport #26841 (Closed): mimic: luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_...
Patrick Donnelly
10:35 PM Backport #25126 (Resolved): mimic: Allow repair of an object with a bad data_digest in object_inf...
David Zafman
10:35 PM Feature #25085 (Resolved): Allow repair of an object with a bad data_digest in object_info on all...
David Zafman
10:18 PM Bug #24875 (In Progress): OSD: still returning EIO instead of recovering objects on checksum errors
David Zafman
05:59 PM Backport #24888: luminous: osd: crash in OpTracker::unregister_inflight_op via OSD::get_health_me...
Radek, can you take a look at backporting this? Josh Durgin
04:02 PM Backport #26840 (Resolved): luminous: librados application's symbol could conflict with the libce...
https://github.com/ceph/ceph/pull/23483 Patrick Donnelly
04:02 PM Backport #26839 (Resolved): mimic: librados application's symbol could conflict with the libceph-...
https://github.com/ceph/ceph/pull/24708 Patrick Donnelly
03:24 PM Backport #23772: luminous: ceph status shows wrong number of objects
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22680
merged
Yuri Weinstein
03:22 PM Backport #24471: luminous: Ceph-osd crash when activate SPDK
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22686
merged
Yuri Weinstein
03:15 PM Backport #24772: luminous: osd: may get empty info at recovery
Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22862
merged
Yuri Weinstein
12:57 PM Bug #25154 (Pending Backport): librados application's symbol could conflict with the libceph-common
Kefu Chai
08:10 AM Bug #24835: osd daemon spontaneous segfault
The problem still persists with Mimic 13.2.1 (on the same cluster as above). Errors in ceph::buffer::list appear to h... Soenke Schippmann
12:09 AM Backport #25199 (In Progress): luminous: FAILED assert(trim_to <= info.last_complete) in PGLog::t...
Neha Ojha
12:08 AM Backport #25219 (In Progress): luminous: osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
Neha Ojha
12:07 AM Backport #25200 (In Progress): mimic: FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
Neha Ojha
12:07 AM Backport #25220 (In Progress): mimic: osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
Neha Ojha
12:07 AM Backport #24989 (In Progress): mimic: Limit pg log length during recovery/backfill so that we don...
Neha Ojha
12:06 AM Bug #23352: osd: segfaults under normal operation
I've created a test package here based on 12.2.7 and including the one line patch above.
https://shaman.ceph.com/r...
Brad Hubbard
 
