Activity
From 07/27/2018 to 08/25/2018
08/25/2018
- 08:42 PM Bug #27363 (New): 'rbd rm' does not clean tiered pool completly
- mimic (13.2.1)
linux kernel: 4.18.3-1.el7.elrepo.x86_64
ceph osd crush rule create-replicated hddreplrule default... - 05:26 PM Bug #27362 (New): Wrong erasure pool MAX AVAIL size calculation with technique=reed_sol_r6_op
- ...
- 05:53 AM Bug #24022: "ceph tell osd.x bench" writes resulting JSON to stderr instead of stdout.
- luminous backport https://github.com/ceph/ceph/pull/23680
08/24/2018
- 05:14 PM Bug #25084 (Resolved): Attempt to read object that can't be repaired loops forever
- 05:13 PM Bug #25108 (Pending Backport): object errors found in be_select_auth_object() aren't logged the same
- 05:12 PM Bug #24801: PG num_bytes becomes huge
So far with assert added to object_stat_sum_t::add() we saw this. Still not sure why the num_bytes is off.
<pr...- 12:54 PM Bug #24612 (In Progress): FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
- 02:00 AM Backport #26931 (In Progress): mimic: scrub livelock
- https://github.com/ceph/ceph/pull/23722
08/23/2018
- 09:22 PM Backport #27213 (Resolved): mimic: libradosstriper conditional compile
- https://github.com/ceph/ceph/pull/23869
- 09:21 PM Backport #27212 (Resolved): mimic: rpm: should change ceph-mgr package depency from py-bcrypt to ...
- https://github.com/ceph/ceph/pull/23868
- 09:20 PM Bug #25057 (Resolved): jewel->luminous: osdmap crc mismatch
- 09:20 PM Backport #25101 (Resolved): mimic: jewel->luminous: osdmap crc mismatch
- 11:31 AM Feature #22750 (Pending Backport): libradosstriper conditional compile
- 11:21 AM Feature #22750 (Resolved): libradosstriper conditional compile
- 11:28 AM Bug #27206 (Pending Backport): rpm: should change ceph-mgr package depency from py-bcrypt to pyth...
- https://github.com/ceph/ceph/pull/23648
- 11:27 AM Bug #27206 (Resolved): rpm: should change ceph-mgr package depency from py-bcrypt to python2-bcrypt
- Current deplist of ceph-mgr rpm package contains py-bcrypt depency which conflicts with python2-bcrypt needed for pyt...
- 11:23 AM Bug #26998 (Resolved): IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
- 08:19 AM Support #27203: osd down while bucket is deleting
- Format is ugly,my fault
- 07:59 AM Support #27203 (New): osd down while bucket is deleting
- My environment is
[tanweijie@gz-ceph-52-202 ~]$ ceph --version
ceph version 12.2.5 (cad919881333ac92274171586c827e0...
08/22/2018
- 10:20 PM Feature #26975: Rados level IO priority for OSD operations
- Do note that
1) "Messages" can already have priority, although its utility at this point is quite limited it's not t... - 09:32 PM Bug #26880 (Resolved): ceph-base debian package compiled on ubuntu/xenial has unmet runtime depen...
- 09:31 PM Backport #26881 (Resolved): mimic: ceph-base debian package compiled on ubuntu/xenial has unmet r...
- 09:19 PM Bug #26971: failed to become clean before timeout expired
- Looks like a PG is active+undersized state. Maybe the balancer screwed up?
- 09:14 PM Backport #24359 (Resolved): mimic: osd: leaked Session on osd.7
- 09:00 PM Bug #24875 (Resolved): OSD: still returning EIO instead of recovering objects on checksum errors
- 09:00 PM Backport #25226 (Resolved): mimic: OSD: still returning EIO instead of recovering objects on chec...
- 08:46 PM Backport #25101: mimic: jewel->luminous: osdmap crc mismatch
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/23226
merged - 05:37 PM Bug #27053: qa: thrashosds: "[ERR] : 2.0 has 1 objects unfound and apparently lost"
- Similar failure seen in mimic: /a/yuriw-2018-08-21_23:27:39-rados-wip-yuri5-testing-2018-08-21-2033-mimic-distro-basi...
- 03:39 PM Bug #27053 (New): qa: thrashosds: "[ERR] : 2.0 has 1 objects unfound and apparently lost"
- This is for 12.2.8
Run: http://pulpito.ceph.com/yuriw-2018-08-21_16:17:40-rados-luminous-distro-basic-smithi/
Job... - 05:26 PM Bug #27055 (New): mimic: FAILED assert((uint64_t)buf.st_size == expected) in SyntheticWorkloadSta...
- ...
- 08:51 AM Bug #24956: osd: parent process need to restart log service after fork, or ceph-osd will not work...
- PR:https://github.com/ceph/ceph/pull/23685
- 06:28 AM Bug #26994 (Fix Under Review): test_module_commands (tasks.mgr.test_module_selftest.TestModuleSel...
- https://github.com/ceph/ceph/pull/23681
- 03:45 AM Bug #23352 (Resolved): osd: segfaults under normal operation
- The patch is only relevant to the osds.
- 02:56 AM Bug #26998: IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
- 02:14 AM Bug #26998: IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
- - https://github.com/ceph/dmclock/pull/58
- https://github.com/ceph/ceph/pull/23643 - 02:13 AM Bug #26998 (Resolved): IOPS churn with "osd op queue" = "mclock_opclass" or "mclock_client"
- for more details on this issue, please refer to https://github.com/ceph/dmclock/pull/58 . in short, if "osd op queue"...
08/21/2018
- 08:22 PM Bug #25146 (In Progress): "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:para...
- Very early fix: https://github.com/rzarzynski/rocksdb/tree/wip-bug-25146.
The case appears more complicated as the... - 07:58 PM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
- https://github.com/ceph/ceph/pull/23490 merged
- 07:30 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
- Something like this will probably fix it...
- 06:49 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
- Here's the culprit: hello isn't packaged so it can't announce its commands....
- 06:45 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
- The manager logs show all the modules except for `hello` being loaded...
- 05:55 PM Bug #26994: test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) fails
- I can't reproduce this... it is as if the monitor has not received a summary of commands from the manager at the the ...
- 04:39 PM Bug #26994 (Resolved): test_module_commands (tasks.mgr.test_module_selftest.TestModuleSelftest) f...
- in https://github.com/ceph/ceph/pull/23558/commits/00223d2364b5a6cc32eb5f83f5a642b5aef2c946 , hello is used for testi...
- 04:03 PM Backport #26992 (Resolved): luminous: discover_all_missing() not always called during activating
- https://github.com/ceph/ceph/pull/23817
- 04:01 PM Feature #26975: Rados level IO priority for OSD operations
- For "Rados level" I mean librados API at least, and implementation in OSD too.
- 03:59 PM Feature #26975 (New): Rados level IO priority for OSD operations
- What I mean:
Suppose busy Ceph cluster.
Every OSD has many IO requests from clients in it's queue. Today, all r... - 12:56 AM Bug #26972 (Resolved): cluster [ERR] Error -2 reading object
http://qa-proxy.ceph.com/teuthology/dzafman-2018-08-17_08:14:49-rados-wip-zafman-testing4-distro-basic-smithi/29146...- 12:42 AM Bug #26971 (Duplicate): failed to become clean before timeout expired
http://qa-proxy.ceph.com/teuthology/dzafman-2018-08-16_17:35:08-rados:thrash-wip-zafman-testing4-distro-basic-smith...- 12:32 AM Bug #26970 (Resolved): src/osd/OSDMap.h: 1065: FAILED assert(__null != pool)
http://qa-proxy.ceph.com/teuthology/dzafman-2018-08-16_17:35:08-rados:thrash-wip-zafman-testing4-distro-basic-smith...
08/20/2018
- 11:19 PM Bug #22837 (Pending Backport): discover_all_missing() not always called during activating
Based on information from http://lists.ceph.com/pipermail/ceph-users-ceph.com/2017-October/021512.html I'm marking ...- 05:53 PM Feature #24232 (Fix Under Review): Add new command ceph mon status
08/19/2018
- 03:12 PM Feature #26948 (Resolved): librados: add a way to get a count of omap vals in an iterator
- https://github.com/ceph/ceph/pull/23593
- 01:58 PM Bug #24485: LibRadosTwoPoolsPP.ManifestUnset failure
- /a/kchai-2018-08-19_13:01:23-rados-wip-kefu-testing-2018-08-19-1812-distro-basic-mira/2925024/
08/17/2018
- 09:10 PM Backport #24359: mimic: osd: leaked Session on osd.7
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22339
merged - 02:27 PM Bug #26958 (Resolved): osd/ReplicatedBackend.cc: 1321: FAILED assert(get_parent()->get_log().get_...
- ...
- 09:36 AM Bug #26880 (Pending Backport): ceph-base debian package compiled on ubuntu/xenial has unmet runti...
- 03:20 AM Feature #26955: os/filestore: Add switch to turn on/off filestore dir splitting
- https://github.com/ceph/ceph/pull/23460
1. Refined HashIndex::must_split() to be more readable.
2. Introduced a h... - 03:19 AM Feature #26955 (New): os/filestore: Add switch to turn on/off filestore dir splitting
- We had done pre-split and increased split multiple, etc, at the beginning of building cluster in order to reduce the ...
- 12:16 AM Bug #25108: object errors found in be_select_auth_object() aren't logged the same
08/16/2018
- 10:46 PM Backport #26870 (Resolved): mimic: osd: segfaults under normal operation
- 05:58 PM Bug #24612: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
- /a/sage-2018-08-15_15:49:39-rados-wip-sage2-testing-2018-08-15-0731-distro-basic-smithi/2908178
08/15/2018
- 11:40 PM Bug #25084 (Fix Under Review): Attempt to read object that can't be repaired loops forever
- 11:35 PM Backport #25227 (Resolved): luminous: OSD: still returning EIO instead of recovering objects on c...
- 02:36 PM Feature #26948 (Resolved): librados: add a way to get a count of omap vals in an iterator
- We currently have functions like rados_read_op_omap_get_vals2 that hand back an iterator to a userland caller. There ...
08/14/2018
- 10:43 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
- Generally yes, but I havne't been able to reproduce to test a solution. I take it this has happened to you?
I'm h... - 01:34 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
- Guys, is there a way for an OSD to recover from this error?
- 09:58 PM Bug #26947 (Resolved): ENOENT on collection_move_rename from divergent activate
- ...
- 04:56 PM Bug #26940 (Fix Under Review): force-create-pg broken
- https://github.com/ceph/ceph/pull/23572
- 03:53 PM Bug #26940 (Resolved): force-create-pg broken
- This commit -
https://github.com/ceph/ceph/commit/7797ed67d2f9140b7eb9f182b06d04233e1e309c
has introduced regressio... - 04:33 AM Backport #26908 (Need More Info): luminous: kv: MergeOperator name() returns string, and caller c...
- 04:33 AM Backport #26908 (In Progress): luminous: kv: MergeOperator name() returns string, and caller call...
- https://github.com/ceph/ceph/pull/23566
08/13/2018
- 06:46 PM Backport #26932 (Resolved): luminous: scrub livelock
- https://github.com/ceph/ceph/pull/24396 (initial backport)
https://github.com/ceph/ceph/pull/24659 (follow-on fix) - 06:46 PM Backport #26931 (Resolved): mimic: scrub livelock
- https://github.com/ceph/ceph/pull/23722
- 06:01 PM Bug #26890 (Pending Backport): scrub livelock
- 07:38 AM Bug #20059: miscounting degraded objects
- Just adding another reference to #21803 here — this fix was meant to fix that issue as well, which it apparently did ...
- 03:14 AM Bug #23352: osd: segfaults under normal operation
- Brad Hubbard wrote:
> I've created a test package here based on 12.2.7 and including the one line patch above.
>
... - 12:59 AM Feature #24232: Add new command ceph mon status
- PR: https://github.com/ceph/ceph/pull/23525
08/12/2018
- 10:32 PM Backport #26871 (Resolved): luminous: osd: segfaults under normal operation
- 09:16 PM Backport #26910 (Resolved): luminous: PGLog.cc: saw valgrind issues while accessing complete_to->...
- https://github.com/ceph/ceph/pull/23211
- 09:16 PM Backport #26909 (Resolved): mimic: PGLog.cc: saw valgrind issues while accessing complete_to->ver...
- https://github.com/ceph/ceph/pull/23403
- 09:16 PM Backport #26908 (Resolved): luminous: kv: MergeOperator name() returns string, and caller calls c...
- https://github.com/ceph/ceph/pull/23566
- 09:16 PM Backport #26907 (Resolved): mimic: kv: MergeOperator name() returns string, and caller calls c_st...
- https://github.com/ceph/ceph/pull/23865
- 08:38 PM Bug #21592: LibRadosCWriteOps.CmpExt got 0 instead of -4095-1
- /a/sage-2018-08-11_18:40:58-rados-wip-sage-testing-2018-08-11-1120-distro-basic-smithi/2893875...
08/10/2018
- 08:13 PM Bug #23352: osd: segfaults under normal operation
- https://github.com/ceph/ceph/pull/23459 merged
- 04:54 AM Bug #12615: Repair of Erasure Coded pool with an unrepairable object causes pg state to lose clea...
- In a replicated case which in which all copies are bad, a rep_repair_primary_object() can cause loss of clean and ins...
- 04:44 AM Bug #25084: Attempt to read object that can't be repaired loops forever
- I don't think we should backport this change. In Luminous and possibly upgraded to Mimic there is a possibility that...
- 12:01 AM Bug #25084: Attempt to read object that can't be repaired loops forever
- https://github.com/ceph/ceph/pull/23518
- 02:54 AM Bug #26875 (Pending Backport): kv: MergeOperator name() returns string, and caller calls c_str() ...
- 02:47 AM Bug #24485: LibRadosTwoPoolsPP.ManifestUnset failure
- /a/kchai-2018-08-09_12:29:04-rados-wip-kefu-testing-2018-08-08-1144-distro-basic-smithi/2885459/
- 12:04 AM Bug #19753: Deny reservation if expected backfill size would put us over backfill_full_ratio
- https://github.com/ceph/ceph/pull/22797
08/09/2018
- 11:52 PM Bug #25084 (In Progress): Attempt to read object that can't be repaired loops forever
- What I actually ran into is that when do_read() fails because of the CRC mismatch, the recovery repair can pull from ...
- 07:56 PM Backport #24333 (In Progress): luminous: local_reserver double-reservation of backfilled pg
- PR: https://github.com/ceph/ceph/pull/23493
- 06:37 PM Feature #21366 (Resolved): tools/ceph-objectstore-tool: split filestore directories offline to ta...
- 06:37 PM Backport #24845 (Resolved): luminous: tools/ceph-objectstore-tool: split filestore directories of...
- 02:00 PM Bug #26891 (New): backfill reservation deadlock/stall
on backfill target:
- get backfill request, queue RequestBackfillPrio...- 01:34 PM Bug #26890: scrub livelock
- https://github.com/ceph/ceph/pull/23512
- 01:32 PM Bug #26890 (Resolved): scrub livelock
- - both osds locally reserve a scrub slot
- both osds send a scrub schedule request
- both scrub requests are reject... - 08:03 AM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
- Full info for ceph-base package:...
- 08:00 AM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
- Tried on fresh Ubuntu 16.04 vm to build Ceph packages for master branch, resulting .debs still depend on libstdc++6 (...
- 07:57 AM Bug #26880: ceph-base debian package compiled on ubuntu/xenial has unmet runtime dependencies
- as per Piotr Dałek we can reproduce this issue on master even with the fix .
08/08/2018
- 09:42 PM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
- I think we need to fix this sooner rather than later. My suggestion is to incorporate enough of the original rocksdb...
- 09:10 PM Bug #26878 (Closed): `osd destroy` command hangs
- NOTABUG. :)
Presumably will have to update the ceph-volume tests but the louder notification PR is well on its way t... - 06:17 PM Bug #26878: `osd destroy` command hangs
- master PR https://github.com/ceph/ceph/pull/23492
- 12:03 PM Bug #26878 (Closed): `osd destroy` command hangs
- Running latest master without a manager daemon makes `osd destroy` commands hang.
ceph version 14.0.0-1906-g637bb2... - 06:52 PM Feature #1126 (Rejected): crush: extend rule definition
- actually, you can do the above, just set size=3 and you'll get 2 in first rack and 1 in second rack.
- 06:49 PM Feature #85 (Fix Under Review): osd: pg_num shrink
- https://github.com/ceph/ceph/pull/20469
- 06:33 PM Feature #84 (In Progress): mon: auto adjust pg_num as pool grows
- 05:16 PM Backport #24845: luminous: tools/ceph-objectstore-tool: split filestore directories offline to ta...
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/23418
merged - 12:50 PM Backport #26881 (In Progress): mimic: ceph-base debian package compiled on ubuntu/xenial has unme...
- 12:44 PM Backport #26881 (Resolved): mimic: ceph-base debian package compiled on ubuntu/xenial has unmet r...
- https://github.com/ceph/ceph/pull/23490
- 12:44 PM Bug #26880 (Pending Backport): ceph-base debian package compiled on ubuntu/xenial has unmet runti...
- https://github.com/ceph/ceph/pull/22990
https://github.com/ceph/ceph/pull/23432 - 12:30 PM Bug #26880 (Resolved): ceph-base debian package compiled on ubuntu/xenial has unmet runtime depen...
- ...
- 08:34 AM Backport #26839 (In Progress): mimic: librados application's symbol could conflict with the libce...
- -https://github.com/ceph/ceph/pull/23484-
- 08:32 AM Backport #26840 (In Progress): luminous: librados application's symbol could conflict with the li...
- https://github.com/ceph/ceph/pull/23483
- 03:35 AM Bug #25209 (Resolved): cls/test_cls_numops.sh aborts
- 01:53 AM Bug #26875 (Fix Under Review): kv: MergeOperator name() returns string, and caller calls c_str() ...
- https://github.com/ceph/ceph/pull/23477
08/07/2018
- 11:06 PM Bug #23857: flush (manifest) vs async recovery causes out of order op
- /a/yuriw-2018-08-06_20:38:17-rados-wip_master_8_6_2018-distro-basic-smithi/2873966/
the order of events here:
<... - 10:02 PM Bug #26875 (Resolved): kv: MergeOperator name() returns string, and caller calls c_str() on the t...
- On Tue, 7 Aug 2018, Réka Nikolett Kovács wrote:
> Hi,
>
> I am working on a bug finding tool that looks for a ... - 07:26 PM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
- ubuntu@mastercontroller01:~$ ceph -s
cluster:
id: dc00b525-7dca-435a-bfa6-c0b9b216e1f2
health: HEALT... - 07:24 PM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
- attaching new osd log.
- 06:12 PM Bug #21142: OSD crashes when loading pgs with "FAILED assert(interval.last > last)"
- We've encountered this again when we were adding a new OSD. Couldn't get the gdb as there was none installed and the ...
- 06:21 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
- attaching OSD log.
- 06:19 PM Bug #24866: FAILED assert(0 == "past_interval start interval mismatch") in check_past_interval_bo...
- We encountered this issue after trying out a patch for https://tracker.ceph.com/issues/21142.
Is it safe to bypas... - 08:50 AM Bug #25108 (Fix Under Review): object errors found in be_select_auth_object() aren't logged the same
- https://github.com/ceph/ceph/pull/23376/
- 03:32 AM Bug #26868 (Pending Backport): PGLog.cc: saw valgrind issues while accessing complete_to->version
- 02:02 AM Backport #26871 (In Progress): luminous: osd: segfaults under normal operation
- https://github.com/ceph/ceph/pull/23459
- 01:25 AM Backport #26871 (Resolved): luminous: osd: segfaults under normal operation
- https://github.com/ceph/ceph/pull/23459
- 02:01 AM Backport #26870 (In Progress): mimic: osd: segfaults under normal operation
- https://github.com/ceph/ceph/pull/23458
- 01:24 AM Backport #26870 (Resolved): mimic: osd: segfaults under normal operation
- https://github.com/ceph/ceph/pull/23458
08/06/2018
- 08:30 PM Backport #24495: luminous: osd: segv in Session::have_backoff
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22729
merged - 08:24 PM Backport #24501: luminous: osd: eternal stuck PG in 'unfound_recovery'
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22546
merged - 06:49 PM Bug #24613: luminous: rest/test.py fails with expected 200, got 400
- This one looks similar.
/a/yuriw-2018-08-03_19:54:05-rados-wip-yuri-testing-2018-08-03-1639-luminous-distro-basic-... - 06:46 PM Bug #26868 (Fix Under Review): PGLog.cc: saw valgrind issues while accessing complete_to->version
- https://github.com/ceph/ceph/pull/23450
- 06:35 PM Bug #26868 (In Progress): PGLog.cc: saw valgrind issues while accessing complete_to->version
- 06:28 PM Bug #26868 (Resolved): PGLog.cc: saw valgrind issues while accessing complete_to->version
- This occurred during a rados run of https://tracker.ceph.com/issues/24988. This failure has not been seen on master o...
- 02:52 PM Bug #23352 (Pending Backport): osd: segfaults under normal operation
- 02:51 PM Bug #24875 (Pending Backport): OSD: still returning EIO instead of recovering objects on checksum...
08/04/2018
- 09:58 PM Bug #24174: PrimaryLogPG::try_flush_mark_clean mixplaced ctx release
- This was seen in luminous. Could this be related?...
08/03/2018
- 11:45 PM Feature #24917: Gracefully deal with upgrades when bluestore skipping of data_digest becomes active
We need to wait to turn off data_digest once all OSDs are running bluestore AND we must disallow a filestore OSD to...- 10:42 PM Bug #23492 (Resolved): Abort in OSDMap::decode() during qa/standalone/erasure-code/test-erasure-e...
- 10:42 PM Backport #24864 (Resolved): luminous: Abort in OSDMap::decode() during qa/standalone/erasure-code...
- 03:11 PM Backport #24864: luminous: Abort in OSDMap::decode() during qa/standalone/erasure-code/test-erasu...
- Patrick Donnelly wrote:
> https://github.com/ceph/ceph/pull/23025
merged - 10:40 PM Feature #24949 (Resolved): luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_digest
- 10:39 PM Backport #25128 (Resolved): mimic: Allow scrub to fix Luminous 12.2.6 corruption of data_digest
- 10:39 PM Backport #26841 (Closed): mimic: luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_...
- 04:02 PM Backport #26841 (Closed): mimic: luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_...
- 10:35 PM Backport #25126 (Resolved): mimic: Allow repair of an object with a bad data_digest in object_inf...
- 10:35 PM Feature #25085 (Resolved): Allow repair of an object with a bad data_digest in object_info on all...
- 10:18 PM Bug #24875 (In Progress): OSD: still returning EIO instead of recovering objects on checksum errors
- 05:59 PM Backport #24888: luminous: osd: crash in OpTracker::unregister_inflight_op via OSD::get_health_me...
- Radek, can you take a look at backporting this?
- 04:02 PM Backport #26840 (Resolved): luminous: librados application's symbol could conflict with the libce...
- https://github.com/ceph/ceph/pull/23483
- 04:02 PM Backport #26839 (Resolved): mimic: librados application's symbol could conflict with the libceph-...
- https://github.com/ceph/ceph/pull/24708
- 03:24 PM Backport #23772: luminous: ceph status shows wrong number of objects
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22680
merged - 03:22 PM Backport #24471: luminous: Ceph-osd crash when activate SPDK
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22686
merged - 03:15 PM Backport #24772: luminous: osd: may get empty info at recovery
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22862
merged - 12:57 PM Bug #25154 (Pending Backport): librados application's symbol could conflict with the libceph-common
- 08:10 AM Bug #24835: osd daemon spontaneous segfault
- The problem still persists with Mimic 13.2.1 (on the same cluster as above). Errors in ceph::buffer::list appear to h...
- 12:09 AM Backport #25199 (In Progress): luminous: FAILED assert(trim_to <= info.last_complete) in PGLog::t...
- 12:08 AM Backport #25219 (In Progress): luminous: osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
- 12:07 AM Backport #25200 (In Progress): mimic: FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- 12:07 AM Backport #25220 (In Progress): mimic: osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
- 12:07 AM Backport #24989 (In Progress): mimic: Limit pg log length during recovery/backfill so that we don...
- 12:06 AM Bug #23352: osd: segfaults under normal operation
- I've created a test package here based on 12.2.7 and including the one line patch above.
https://shaman.ceph.com/r...
08/02/2018
- 11:57 PM Bug #23352 (In Progress): osd: segfaults under normal operation
- https://github.com/ceph/ceph/pull/23404
- 08:56 PM Bug #23352: osd: segfaults under normal operation
- Brad - you can just use kjetil@medallia.com
- 08:26 AM Bug #23352: osd: segfaults under normal operation
- Thanks Kjetil,
I think you are right, we should hold the lock in update_osd_health(). Not sure how we all missed tha... - 08:24 PM Bug #22330: ec: src/common/interval_map.h: 161: FAILED assert(len > 0)
- /ceph/teuthology-archive/pdonnell-2018-08-02_13:06:29-multimds-wip-pdonnell-testing-20180802.044402-testing-basic-smi...
- 08:21 PM Bug #21931: osd: src/osd/ECBackend.cc: 2164: FAILED assert((offset + length) <= (range.first.get_...
- Run with cores/logs: /ceph/teuthology-archive/pdonnell-2018-08-02_13:06:29-multimds-wip-pdonnell-testing-20180802.044...
- 03:57 PM Bug #25182: Upmaps forgotten after restarting OSDs
- Hmm, I wasn't able to reproduce this...
- 03:30 PM Bug #25182: Upmaps forgotten after restarting OSDs
- It is expected that the upmaps may evaporate if the "raw" CRUSH mapping changes. This shouldn't happen for osd up/do...
- 03:59 AM Bug #24875: OSD: still returning EIO instead of recovering objects on checksum errors
- *master PR*: https://github.com/ceph/ceph/pull/23377
- 03:57 AM Backport #25227 (In Progress): luminous: OSD: still returning EIO instead of recovering objects o...
- 03:56 AM Backport #25226 (In Progress): mimic: OSD: still returning EIO instead of recovering objects on c...
08/01/2018
- 11:37 PM Backport #25227 (Resolved): luminous: OSD: still returning EIO instead of recovering objects on c...
- https://github.com/ceph/ceph/pull/23379
- 11:32 PM Backport #25226 (Resolved): mimic: OSD: still returning EIO instead of recovering objects on chec...
- https://github.com/ceph/ceph/pull/23378
- 11:26 PM Bug #25211 (Fix Under Review): bug in PerfCounters
- 12:23 PM Bug #25211: bug in PerfCounters
- https://github.com/ceph/ceph/pull/23362
- 12:16 PM Bug #25211 (Resolved): bug in PerfCounters
- when we call PerfCounters::inc() and read_avg() at the same time, maybe the result is not what we want.
show the c... - 10:58 PM Bug #24875: OSD: still returning EIO instead of recovering objects on checksum errors
- 10:18 PM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
- another option would be to only partially revert, and keep just the bits that ignore the older deleted log files.
- 02:03 PM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
- an alternative option is to whip up a tool to rebuild the manifest to remove the dummy File4 with kDeletedLogNumberHa...
- 12:45 PM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
- it's a regression in rocksdb. the rocksdb in mimic (eaee6d3beab3429232ceb188377a3f94e844fca7) is f4a857da0b720691effc...
- 06:28 AM Bug #25146: "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:parallel-master-di...
- i create a vstart.sh cluster using mimic branch, and ceph-monstore-tool from master is able to open it just fine.
... - 09:54 PM Feature #24949: luminous: Allow scrub to fix Luminous 12.2.6 corruption of data_digest
- mimic "backport" is actually a forward port from luminous
- 05:37 PM Feature #24949 (Pending Backport): luminous: Allow scrub to fix Luminous 12.2.6 corruption of dat...
- 09:50 PM Backport #25220 (Resolved): mimic: osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
- https://github.com/ceph/ceph/pull/23403
- 09:50 PM Backport #25219 (Resolved): luminous: osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
- https://github.com/ceph/ceph/pull/23211
- 09:47 PM Bug #24484 (Resolved): osdc: wrong offset in BufferHead
- 09:47 PM Backport #24584 (Resolved): luminous: osdc: wrong offset in BufferHead
- 03:37 PM Backport #24584: luminous: osdc: wrong offset in BufferHead
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22865
merged - 05:56 PM Bug #23352: osd: segfaults under normal operation
- For MgrClient::update_osd_health, does the move-assignment compile into updating a pointer to a std::vector, or does ...
- 05:41 AM Bug #23352: osd: segfaults under normal operation
- Just adding another "me too" on this. I've hit this on Luminous 12.2.7 also under Ubuntu 16.04.4 with 4.15.0-24-gener...
- 02:08 AM Bug #23352: osd: segfaults under normal operation
- ...
- 05:43 PM Bug #25108: object errors found in be_select_auth_object() aren't logged the same
- Kefu:
my concern is that, we don't reset object_error before moving to another ScrubMap. so once we identify an erro... - 05:41 PM Bug #25108 (In Progress): object errors found in be_select_auth_object() aren't logged the same
- 05:38 PM Feature #25085 (Pending Backport): Allow repair of an object with a bad data_digest in object_inf...
- 05:38 PM Backport #25127 (Resolved): luminous: Allow repair of an object with a bad data_digest in object_...
- 03:44 PM Bug #25184 (Pending Backport): osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
- 02:07 PM Bug #25209 (Fix Under Review): cls/test_cls_numops.sh aborts
- -https://github.com/ceph/ceph/pull/23364-
i think https://github.com/ceph/ceph/pull/23432 is a better fix. - 05:46 AM Bug #25209: cls/test_cls_numops.sh aborts
- i think we should revert https://github.com/ceph/ceph/pull/22990
- 05:44 AM Bug #25209: cls/test_cls_numops.sh aborts
- ...
- 05:28 AM Bug #25209 (Resolved): cls/test_cls_numops.sh aborts
- ...
- 01:06 PM Bug #25181 (Duplicate): /mon/OSDMonitor.cc: 1821: FAILED assert(osdmap_manifest.pinned.empty())
- 01:06 PM Bug #24612: FAILED assert(osdmap_manifest.pinned.empty()) in OSDMonitor::prune_init()
- /a/sage-2018-07-31_21:57:28-rados-wip-sage-testing-2018-07-31-1436-distro-basic-smithi/2844443
/a/sage-2018-07-30_13...
07/31/2018
- 10:53 PM Bug #25174: osd: assert failure with FAILED assert(repop_queue.front() == repop) In function 'vo...
- Do we have logs for this failure somewhere?
- 10:47 PM Backport #25199: luminous: FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- This is dependent on a couple of other backports. Assigning it to myself.
- 10:45 PM Backport #25199 (Resolved): luminous: FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- https://github.com/ceph/ceph/pull/23211
- 10:45 PM Backport #24068 (Resolved): luminous: osd sends op_reply out of order
- 07:48 PM Backport #24068: luminous: osd sends op_reply out of order
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/23137
merged - 10:45 PM Backport #25204 (Resolved): mimic: rados python bindings use prval from stack
- https://github.com/ceph/ceph/pull/23863
- 10:45 PM Backport #25203 (Resolved): luminous: rados python bindings use prval from stack
- https://github.com/ceph/ceph/pull/23864
- 10:45 PM Backport #25200 (Resolved): mimic: FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- https://github.com/ceph/ceph/pull/23403
- 09:24 PM Bug #25198 (Pending Backport): FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- 06:27 PM Bug #25198 (Fix Under Review): FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- https://github.com/ceph/ceph/pull/23354
- 05:48 PM Bug #25198 (Resolved): FAILED assert(trim_to <= info.last_complete) in PGLog::trim()
- ...
- 08:02 PM Bug #23352: osd: segfaults under normal operation
- Latest crash just happened here, no messages not in the OSD log, but crash dump is generated and dmesg shows:
[Tue... - 07:25 PM Bug #24485: LibRadosTwoPoolsPP.ManifestUnset failure
- /a/sage-2018-07-31_14:52:20-rados:thrash-wip-sage2-testing-2018-07-30-1049-distro-basic-smithi/2843268
- 07:08 PM Bug #25175 (Pending Backport): rados python bindings use prval from stack
- https://github.com/ceph/ceph/pull/23334
- 03:03 PM Bug #25194 (Can't reproduce): Negative stats found by deep-scrub
http://pulpito.ceph.com/dzafman-2018-07-30_12:09:07-rados-wip-zafman-testing-distro-basic-smithi/2839428
log_cha...- 02:52 PM Feature #21710: add wildcard for namespaces
- Not at all.
- 12:43 AM Feature #21710: add wildcard for namespaces
- Hi Douglas, I started in on this and forgot to reassign the ticket! Mind if I take this one?
- 03:57 AM Tasks #25186 (In Progress): setup repo for building dependencies like boost, rocksdb, which are n...
- we need to build boost, spdk, dpdk, fio, rocksdb, gperftools, seastar for preparing the build dependencies for each P...
- 12:08 AM Bug #25184 (Fix Under Review): osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
07/30/2018
- 11:50 PM Bug #25184 (Resolved): osd/PGLog.cc: use lgeneric_subdout instead of generic_dout
- https://github.com/ceph/ceph/pull/23340
- 09:36 PM Bug #25182 (Resolved): Upmaps forgotten after restarting OSDs
- Problem:
I have a small cluster at home and I noticed that during the upgrade from 12.2.5 -> 12.2.7 and the upgrade ... - 08:47 PM Backport #25178 (In Progress): mimic: rados: not all exceptions accept keyargs
- 07:23 PM Backport #25178 (Resolved): mimic: rados: not all exceptions accept keyargs
- https://github.com/ceph/ceph/pull/23335
- 08:17 PM Bug #24485: LibRadosTwoPoolsPP.ManifestUnset failure
- /a/sage-2018-07-30_13:46:50-rados-wip-sage3-testing-2018-07-28-1512-distro-basic-smithi/2838971
- 08:16 PM Bug #25181 (Duplicate): /mon/OSDMonitor.cc: 1821: FAILED assert(osdmap_manifest.pinned.empty())
- ...
- 07:26 PM Bug #25112: osd,mon: increase mon_max_pg_per_osd to 250
- Please note that the value has been changed from 300->250 for this tracker. The PR reflects the correct value.
- 02:23 PM Bug #25112 (Pending Backport): osd,mon: increase mon_max_pg_per_osd to 250
- 07:23 PM Backport #25177 (Resolved): luminous: osd,mon: increase mon_max_pg_per_osd to 300
- https://github.com/ceph/ceph/pull/23862
- 07:23 PM Backport #25176 (Resolved): mimic: osd,mon: increase mon_max_pg_per_osd to 300
- https://github.com/ceph/ceph/pull/23861
- 07:15 PM Bug #25175 (Resolved): rados python bindings use prval from stack
- these methods include
- omap_get_vals
- omap_get_keys
- omap-get-vals-by-keys - 06:53 PM Bug #24686 (Resolved): change default filestore_merge_threshold to -10
- 06:53 PM Backport #24748 (Resolved): luminous: change default filestore_merge_threshold to -10
- 04:45 PM Backport #24748: luminous: change default filestore_merge_threshold to -10
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22814
merged - 06:51 PM Backport #24083 (Resolved): luminous: rados: not all exceptions accept keyargs
- 04:43 PM Backport #24083: luminous: rados: not all exceptions accept keyargs
- Nathan Cutler wrote:
> https://github.com/ceph/ceph/pull/22979
merged - 06:35 PM Bug #25174 (Can't reproduce): osd: assert failure with FAILED assert(repop_queue.front() == repop...
- branch: luminous
description: rados:downstream:singleton/{all/ec-lost-unfound.yaml msgr-failures/many.yaml
... - 05:07 PM Bug #25153 (Fix Under Review): output format is invalid of the crush tree json dumper
- 11:56 AM Bug #25153: output format is invalid of the crush tree json dumper
- Reference the pull request: https://github.com/ceph/ceph/pull/23319
- 11:50 AM Bug #25153 (Resolved): output format is invalid of the crush tree json dumper
- The output json string is invalid for "ceph osd crush tree --format=json" command. It contains an array of "nodes" an...
- 01:42 PM Bug #25155 (Can't reproduce): mon crash from 'ceph osd erasure-code-profile set lrcprofile name=l...
- ...
- 12:15 PM Bug #25154: librados application's symbol could conflict with the libceph-common
- https://github.com/ceph/ceph/pull/23320
- 12:15 PM Bug #25154 (Resolved): librados application's symbol could conflict with the libceph-common
- quoting from Zongyou Yao's mail from ceph-devel ML
> Internally, we have a program using librados C++ api to perio... - 07:20 AM Bug #24785 (Resolved): mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev=dm-0 and dm-1
- 07:20 AM Backport #25143 (Resolved): luminous: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd ...
- Merged.
- 07:19 AM Backport #25142 (Resolved): mimic: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev...
- Merged.
07/28/2018
- 07:55 PM Bug #20798: LibRadosLockECPP.LockExclusiveDurPP gets EEXIST
- /a/sage-2018-07-27_22:50:28-rados-wip-sage-testing-2018-07-27-0744-distro-basic-smithi/2826326
- 02:46 PM Bug #25146 (Resolved): "rocksdb: Corruption: Can't access /000000.sst" in upgrade:mimic-x:paralle...
- This is on mew mimic-x suite https://github.com/ceph/ceph/pull/23292
Run: http://pulpito.ceph.com/yuriw-2018-07-27_2... - 09:12 AM Backport #25143 (In Progress): luminous: mimic selinux denials comm="tp_fstore_op / comm="ceph-o...
- 09:11 AM Backport #25143 (Resolved): luminous: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd ...
- https://github.com/ceph/ceph/pull/23296
- 09:11 AM Backport #25142 (In Progress): mimic: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd ...
- 09:10 AM Backport #25142 (Resolved): mimic: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev...
- https://github.com/ceph/ceph/pull/23295
- 09:11 AM Backport #25145 (Resolved): luminous: Automatically set expected_num_objects for new pools with >...
- https://github.com/ceph/ceph/pull/24395
- 09:11 AM Backport #25144 (Resolved): mimic: Automatically set expected_num_objects for new pools with >=10...
- https://github.com/ceph/ceph/pull/23860
07/27/2018
- 11:55 PM Bug #24785: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev=dm-0 and dm-1
- Mimic back-port:
https://github.com/ceph/ceph/pull/23295
Luminous back-port:
https://github.com/ceph/ceph/pu... - 10:18 PM Bug #24785 (Pending Backport): mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev=dm-...
- 07:01 AM Bug #24785 (Fix Under Review): mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev=dm-...
- 07:00 AM Bug #24785: mimic selinux denials comm="tp_fstore_op / comm="ceph-osd dev=dm-0 and dm-1
- The manual testing suggests this should fix this issue:
https://github.com/ceph/ceph/pull/23278 - 03:03 AM Bug #23352: osd: segfaults under normal operation
- Dan van der Ster wrote:
> Can we see that state from the coredump somehow? Basically none of our clusters should hav...
Also available in: Atom