Activity
From 02/07/2017 to 03/08/2017
03/08/2017
- 09:38 PM Bug #19237 (New): "PG.cc: 3100: FAILED assert(e.version > info.last_update)" in upgrade:kraken-x-...
- Run: http://pulpito.ceph.com/teuthology-2017-03-08_02:25:22-upgrade:kraken-x-master-distro-basic-vps/
Job: 894399
L... - 12:17 PM Feature #18943 (Resolved): crush: add devices class that rules can use as a filter
03/07/2017
- 07:24 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
- Do we actually want to clear the missing set here, or just filter it for the correct child PG?
...I presume killing ... - 07:19 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
- Summary:
During pg split we are resetting last_backfill but not clearing the
local missing set. This comes back t... - 01:19 AM Bug #19199: Odd OSD failure path; ERROR: osd init failed: (110) Connection timed out
- Earlier in the log the root cause appears:...
03/06/2017
- 06:48 PM Bug #19199 (New): Odd OSD failure path; ERROR: osd init failed: (110) Connection timed out
- See attached OSD log for more details.
commit 6f8e4b38103d6f519e6661acc97a47ceccf5e5fc was the latest master
Interm... - 05:05 PM Bug #19198 (Closed): Bluestore doubles mem usage when caching object content
- When trying to cache object content BlueStore uses twice as much memory than it really caches.
The root cause for ... - 09:48 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
- yuri, the backtrace you posted is another issue. i am building your branch of wip-yuri-testing_2017_3_4 to see if "ce...
- 01:22 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
- Also see during PRs testing https://trello.com/c/il60a5yB
http://qa-proxy.ceph.com/teuthology/yuriw-2017-03-05_23:...
03/04/2017
- 07:24 PM Bug #19191 (Resolved): osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missi...
- ...
02/27/2017
- 10:03 AM Bug #19092 (New): cluster [ERR] scrub 2.1 ... is an unexpected clone" in cluster log
- see http://pulpito.ceph.com/kchai-2017-02-27_04:13:29-rados-wip-kefu-testing---basic-smithi/862801/
after evicting... - 06:53 AM Feature #18943: crush: add devices class that rules can use as a filter
- https://github.com/ceph/ceph/pull/13444
- 03:27 AM Bug #19086 (Need More Info): BlockDevice::create should add check for readlink result instead of ...
02/26/2017
- 11:19 AM Bug #19086: BlockDevice::create should add check for readlink result instead of raise error until...
- https://github.com/ceph/ceph/pull/13654
- 09:49 AM Bug #19086 (Rejected): BlockDevice::create should add check for readlink result instead of raise ...
- ...
02/24/2017
- 10:12 PM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- Something like: https://github.com/athanatos/ceph/tree/wip-19023
02/23/2017
- 11:00 PM Bug #19067 (Need More Info): missing set not persisted
- ...
- 02:45 PM Bug #19058: osd: backfill failed to remove racing evict
- /a/sage-2017-02-21_20:58:58-rados-wip-sage-testing---basic-smithi/844754
- 02:43 PM Bug #19058 (New): osd: backfill failed to remove racing evict
- we are backfilling......
02/22/2017
- 11:44 PM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- Well, sort of. last_epoch_clean is really about when we can forget OSDMaps. Should we retain OSDMaps on the mon (an...
- 11:33 PM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- 2017-02-20 20:45:59.104093 7f75c93f8700 10 osd.3 pg_epoch: 284 pg[1.16( v 278'379 (0'0,278'379] local-les=277 n=1 ec=...
- 12:09 AM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- 2017-02-20 20:46:28.567065 7ffa3242c700 10 osd.4 pg_epoch: 255 pg[1.16( v 254'369 (0'0,254'369] local-les=164 n=3 ec=...
- 12:05 AM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- 2017-02-20 20:46:40.165108 7f9e2ffc3700 10 osd.0 pg_epoch: 300 pg[1.16( DNE empty local-les=0 n=0 ec=0 les/c/f 0/0/0 ...
- 12:03 AM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- 2017-02-20 20:46:41.743173 7f9e277b2700 10 osd.0 pg_epoch: 301 pg[1.16( empty local-les=0 n=0 ec=141 les/c/f 164/164/...
- 07:46 AM Bug #18926: Why osds do not release memory?
- Hello,
Version: L12.0.0, bluestore, two replication.
Memory size:16GB
OSD number:12
After I trying to...
02/21/2017
- 11:39 PM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- Notably, when it goes active at the end there, it's missing the 10 commits which happened during the [3,1] interval.
- 11:38 PM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- At epoch 255, 1.16 is on [4,3] and is active+clean
2017-02-20 20:45:10.962790 7fd9b7cba700 10 osd.4 pg_epoch: 255 ... - 01:35 AM Bug #19023: ceph_test_rados invalid read caused apparently by lost intervals due to mons trimming...
- I assume from your description that this was a dirty interval the monitor shouldn't have trimmed? Or did osd.4 perhap...
- 01:27 AM Bug #19023 (Resolved): ceph_test_rados invalid read caused apparently by lost intervals due to mo...
- samuelj@teuthology:/a/samuelj-2017-02-20_18:45:04-rados-wip-18937---basic-smithi/839771/remote
If you look back in...
02/20/2017
- 11:32 AM Bug #18996: api_misc: [ FAILED ] LibRadosMiscConnectFailure.ConnectFailure
- the authenticate would time out at "15:59:24.639011"....
- 10:28 AM Bug #18996 (New): api_misc: [ FAILED ] LibRadosMiscConnectFailure.ConnectFailure
- ...
02/18/2017
- 09:51 PM Documentation #18986 (New): Need to document monitor health configuration values
All configuration variables referenced in OSDMonitor::get_health() need to be documented. These values affect the ...
02/16/2017
- 11:20 AM Bug #18924: kraken-bluestore 11.2.0 memory leak issue
- This was discussed during Yesterday's performance meeting and Sage suggested that this is indeed a memory leak.
Al... - 11:17 AM Bug #18926: Why osds do not release memory?
- Seems to be related to #18924 doesn't it?
Machines seem to be running out of memory with BlueStore.
02/15/2017
- 10:47 PM Feature #18943: crush: add devices class that rules can use as a filter
- <loicd> sage: I'm confused by how we should handle the weights with the device classes. The weight of the generated b...
- 03:00 PM Feature #18943: crush: add devices class that rules can use as a filter
- Instead of ...
- 12:24 PM Feature #18943 (Resolved): crush: add devices class that rules can use as a filter
- h3. Problem
1. We want to have different types of devices (SSD, HDD, NVMe) backing different OSDs within the same ...
02/14/2017
- 05:16 PM Bug #18930 (New): received Segmentation fault in PGLog::IndexedLog::add
- 2017-02-15 00:12:04.566736 7fee7b9ec700 -1 *** Caught signal (Segmentation fault) **
in thread 7fee7b9ec700 thread_... - 12:09 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue
- Marek Panek wrote:
> We observe the same effect in 11.2 with bluestore. After some time OSDs consume ~6G RAM memory ... - 12:07 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue
- We observe the same effect in 11.2. After some time OSDs consume ~6G RAM memory (12 OSDs per 64G RAM server) and fina...
- 08:58 AM Bug #18924 (Resolved): kraken-bluestore 11.2.0 memory leak issue
- Hi All,
On all our 5 node cluster with ceph 11.2.0 we encounter memory leak issues.
Cluster details : 5 node wi... - 11:40 AM Bug #18926 (Duplicate): Why osds do not release memory?
- Version: K11.2.0, bluestore, two replication.
test: testing cluster with fio, with parmeters "-direct=1 -iodepth 6... - 10:38 AM Bug #18925 (Can't reproduce): Leak_DefinitelyLost in KernelDevice::aio_write
See on fs test branch based on master.
http://pulpito.ceph.com/jspray-2017-02-14_02:39:19-fs-wip-jcsp-testing-20...
02/10/2017
- 10:05 PM Bug #18749: OSD: allow EC PGs to do recovery below min_size
- See https://www.mail-archive.com/ceph-users@lists.ceph.com/msg35273.html for user discovery.
02/09/2017
- 11:30 PM Cleanup #18875 (New): osd: give deletion ops a cost when performing backfill
- From PrimaryLogPG, line 11134 (at time of writing)...
- 02:15 PM Bug #18871 (New): problem about create pool with expected-num-objects does not cause collection s...
- i create a pool and want the PG folder splitting happen at the pool creation time,but i found it not happend
1、...
02/08/2017
- 08:10 PM Bug #18859 (Closed): kraken monitor fails to bootstrap off jewel monitors if it has booted before
- To reproduce; bootstrap a quorum off of jewel. Stop one of the monitors, remove it's filesystem contents, re-create i...
- 05:00 PM Bug #18162: osd/ReplicatedPG.cc: recover_replicas: object added to missing set for backfill, but ...
- Sorry, github is off-limits for me (it tries to run non-Free Software on my browser, and it refuses to work if I don'...
02/07/2017
- 01:33 AM Backport #17445: jewel: list-snap cache tier missing promotion logic (was: rbd cli segfault when ...
- FWIW: in our case, the rbd pool is tiered in write-back mode.
- 01:27 AM Backport #17445: jewel: list-snap cache tier missing promotion logic (was: rbd cli segfault when ...
- This particular snapshot were created on the 20th of January, and I'm relatively certain clients/osds/monitors/etc. r...
- 01:15 AM Backport #17445: jewel: list-snap cache tier missing promotion logic (was: rbd cli segfault when ...
- I suspect we're hitting the same....
Also available in: Atom