Activity

From 03/02/2017 to 03/31/2017

03/31/2017

08:09 PM Bug #13385: cephx: verify_authorizer could not decrypt ticket info: error: NSS AES final round fa...
Just saw this bug. The cluster was working normally one minute, and the next it's doing this on most of the OSDs. Never... Ben England
03:41 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue
I'm experiencing this runaway memory issue as well. It only appeared a couple of days ago. I tried setting the bluest... Aaron T
01:56 PM Bug #19440: osd: trims maps that pgs haven't consumed yet when there are gaps
https://github.com/ceph/ceph/pull/14270 Sage Weil
01:56 PM Bug #19440 (Fix Under Review): osd: trims maps that pgs haven't consumed yet when there are gaps
Sage Weil
01:52 PM Bug #19440 (New): osd: trims maps that pgs haven't consumed yet when there are gaps
... Sage Weil
01:21 PM Feature #19384: ceph_objectstore_tool (set|clear)-missing-item command
Chang Liu wrote:
> Hi, Sam
>
> I looked at this problem and found another problem.
>
> [...]
>
> We don't use...
Chang Liu
12:59 PM Feature #19384: ceph_objectstore_tool (set|clear)-missing-item command
Hi, Sam
I looked at this problem and found another problem....
Chang Liu

03/29/2017

01:07 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue
Sorry, wrong window. Ignore my previous comment. Nokia ceph-users
01:00 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue

This is a bug in the ceph-mgr service (http://tracker.ceph.com/issues/19407), currently set to need review ...
Nokia ceph-users
06:42 AM Bug #18924: kraken-bluestore 11.2.0 memory leak issue
Hi Jaime,
The issue is not fixed by this workaround, and we will address this workaround in another issue related t...
Muthusamy Muthiah
12:34 AM Feature #15835: filestore: randomize split threshold
This one is more about performance testing, and at this point I think effort there is better spent on bluestore than ... Josh Durgin

03/28/2017

04:39 PM Bug #19400 (Resolved): add more info during pool delete error
In Luminous, mon_allow_pool_delete defaults to false, which may be confusing for any admin who
tries to delete ...
Vasu Kulkarni
02:55 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue

We decided to stop the ceph-mgr service on all the nodes because it is using a lot of CPU and we understood that this se...
Jaime Ruiz
02:51 PM Bug #18924: kraken-bluestore 11.2.0 memory leak issue

Fixed with the following commands:
The memory is released by applying the following commands in a content no...
Jaime Ruiz
09:40 AM Documentation #18986: Need to document monitor health configuration values
The descriptions of "mon warn osd usage percent" and "mon_osd_min_in_ratio" can be found at https://github.com/ceph/... Kefu Chai

03/27/2017

09:09 PM Bug #19267: rados list-inconsistent-obj sometimes doesn't flag that all 3 copies are bad
Oh I see, it's missing the error string.
I'm not sure if in this case it's just taking one of them as authoritativ...
Greg Farnum
08:59 AM Bug #19320: Pg inconsistent make ceph osd down
backtrace in the attached log_inconsistent.txt... Kefu Chai

03/24/2017

10:01 PM Feature #19384 (New): ceph_objectstore_tool (set|clear)-missing-item command
This one is only relevant for kraken and later. It would be good to have a command for directly manipulating a pg's ... Samuel Just
09:59 PM Feature #19383 (New): ceph_objectstore_tool: set-version op to allow setting the prior_version an...
The motivation for this one is to be able to manually do part of what mark_unfound_lost revert does automatically and... Samuel Just
09:49 PM Bug #19380 (New): only sort of a bug: it's possible to get an unfound object without losing min_s...
Fundamentally, ReplicatedBackend does destructive updates. That makes the following sequence possible. Assume that ... Samuel Just
09:33 PM Bug #19379 (Resolved): bluestore: crc mismatch after recent overwrite
... Sage Weil
08:37 PM Bug #19377: mark_unfound_lost revert won't actually recover the objects unless there are some fou...
There is a very clumsy workaround to this issue. Once the mark_unfound_lost revert command claims to have completed... Samuel Just
08:34 PM Bug #19377 (Duplicate): mark_unfound_lost revert won't actually recover the objects unless there ...
See ReplicatedPG::start_recovery_ops. If the num_missing==num_unfound, we don't try to do recovery. This is problem... Samuel Just
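The condition mentioned in this entry can be illustrated with a small model. This is a hedged sketch in Python, not the actual C++ in ReplicatedPG::start_recovery_ops: the function name `should_start_recovery` and its plain integer arguments are hypothetical, and it models only the single check the comment describes.

```python
def should_start_recovery(num_missing: int, num_unfound: int) -> bool:
    """Simplified, hypothetical model of the check described above.

    Recovery is skipped when every missing object is also unfound,
    that is, when num_missing == num_unfound.
    """
    return num_missing != num_unfound


# All missing objects are unfound: no recovery is attempted.
all_unfound = should_start_recovery(num_missing=5, num_unfound=5)

# Some missing objects have a known location: recovery is attempted.
some_found = should_start_recovery(num_missing=5, num_unfound=2)
```

Under this model, a PG whose missing objects are all unfound never enters recovery, which matches the behaviour the ticket title describes.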

03/23/2017

01:36 PM Bug #18698: BlueFS FAILED assert(0 == "allocate failed... wtf")
Hi!
I also got that issue. I also added "bluefs_allocator = stupid" in my /etc/ceph/ceph.conf. Worked.
Issue happe...
François Blondel
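As a rough illustration of the setting quoted in this comment, the sketch below parses a ceph.conf-style fragment with Python's standard `configparser`. The `[osd]` section is an assumption (the comment names only the option and its value), and real ceph.conf files are not guaranteed to parse cleanly as INI, so treat this as a minimal example rather than a tool for production configs.

```python
import configparser

# Hypothetical ceph.conf fragment. The [osd] section is an assumption;
# the comment above only gives the option name and value.
CEPH_CONF = """
[osd]
bluefs_allocator = stupid
"""

parser = configparser.ConfigParser()
parser.read_string(CEPH_CONF)

# Read back the allocator setting from the assumed section.
allocator = parser.get("osd", "bluefs_allocator")
print(allocator)
```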

03/22/2017

10:14 PM Bug #18162: osd/ReplicatedPG.cc: recover_replicas: object added to missing set for backfill, but ...
I've created https://github.com/ceph/ceph/pull/14054 to track Alexandre's changes.
I'm working on handling out of ...
David Zafman
09:10 AM Bug #19348 (Can't reproduce): "ceph ping mon.c" cli prints assertion failure on timeout
# start a cluster with 3 monitors: mon.a, mon.b and mon.c
# stop mon.c
# ceph ping mon.c --connect-timeout=5
it ...
Kefu Chai

03/21/2017

10:09 AM Bug #19320 (New): Pg inconsistent make ceph osd down
Hi all.
I am running a ceph cluster.
There is an inconsistent pg:
pg 3.aff is active+recovery_wait+degraded+incon...
hoan nv

03/20/2017

04:26 AM Feature #15835: filestore: randomize split threshold
Hi! I am an undergrad student wishing to contribute to Ceph, and I would like to work on this issue. Please let me kn... Peng Chen

03/18/2017

06:03 AM Bug #19267: rados list-inconsistent-obj sometimes doesn't flag that all 3 copies are bad
Greg Farnum wrote:
> I don't understand. What about this output says that two copies are bad and one isn't?
Thank...
cheng li

03/17/2017

09:33 PM Bug #19300 (Can't reproduce): "Segmentation fault ceph_test_objectstore --gtest_filter=-*/3"
Run: http://pulpito.ceph.com/yuriw-2017-03-16_15:10:12-rados-wip-yuri-testing_2017_3_16-distro-basic-smithi/
Logs: h...
Yuri Weinstein
09:11 PM Bug #19267: rados list-inconsistent-obj sometimes doesn't flag that all 3 copies are bad
I don't understand. What about this output says that two copies are bad and one isn't? Greg Farnum
08:11 PM Bug #19299: Jewel -> Kraken: OSD boot takes 1+ hours, unusually high CPU
As far as I know that was with -f passed. It may be relevant that I grepped out 'madvise' calls because they occur at... Ben Meekhof
07:59 PM Bug #19299 (Need More Info): Jewel -> Kraken: OSD boot takes 1+ hours, unusually high CPU
Sage Weil
07:58 PM Bug #19299: Jewel -> Kraken: OSD boot takes 1+ hours, unusually high CPU
The strace doesn't include child processes. Can you repeat with -f passed to strace? Sage Weil
07:48 PM Bug #19299 (Can't reproduce): Jewel -> Kraken: OSD boot takes 1+ hours, unusually high CPU
Since upgrading to Kraken we've had severe problems with OSD startup. Though this ticket mentions bootup specificall... Ben Meekhof

03/13/2017

09:48 AM Bug #19267 (New): rados list-inconsistent-obj sometimes doesn't flag that all 3 copies are bad
I tested ceph 10.2.3 with cluster of 3 osd nodes.
I uploaded a text file to the Ceph cluster, then manually changed the t...
cheng li

03/09/2017

11:55 PM Bug #18926: Why osds do not release memory?

I don't know if this is the same issue, but bluestore got EROFS (30) and asserted.
http://pulpito.ceph.com/dzaf...
David Zafman
03:54 PM Bug #18687 (Resolved): bluestore: ENOSPC writing to XFS block file on smithi
Fixed as part of https://github.com/ceph/ceph/pull/13425 for tracker 15912 David Zafman

03/08/2017

09:38 PM Bug #19237 (New): "PG.cc: 3100: FAILED assert(e.version > info.last_update)" in upgrade:kraken-x-...
Run: http://pulpito.ceph.com/teuthology-2017-03-08_02:25:22-upgrade:kraken-x-master-distro-basic-vps/
Job: 894399
L...
Yuri Weinstein
12:17 PM Feature #18943 (Resolved): crush: add devices class that rules can use as a filter
Loïc Dachary

03/07/2017

07:24 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
Do we actually want to clear the missing set here, or just filter it for the correct child PG?
...I presume killing ...
Greg Farnum
07:19 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
Summary:
During pg split we are resetting last_backfill but not clearing the
local missing set. This comes back t...
Josh Durgin
01:19 AM Bug #19199: Odd OSD failure path; ERROR: osd init failed: (110) Connection timed out
Earlier in the log the root cause appears:... Josh Durgin

03/06/2017

06:48 PM Bug #19199 (New): Odd OSD failure path; ERROR: osd init failed: (110) Connection timed out
See attached OSD log for more details.
commit 6f8e4b38103d6f519e6661acc97a47ceccf5e5fc was the latest master
Interm...
J. Eric Ivancich
05:05 PM Bug #19198 (Closed): Bluestore doubles mem usage when caching object content
When trying to cache object content, BlueStore uses twice as much memory as it actually caches.
The root cause for ...
Igor Fedotov
09:48 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
Yuri, the backtrace you posted is another issue. I am building your branch of wip-yuri-testing_2017_3_4 to see if "ce... Kefu Chai
01:22 AM Bug #19191: osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missing().is_mis...
Also seen during PR testing https://trello.com/c/il60a5yB
http://qa-proxy.ceph.com/teuthology/yuriw-2017-03-05_23:...
Yuri Weinstein

03/04/2017

07:24 PM Bug #19191 (Resolved): osd/ReplicatedBackend.cc: 1109: FAILED assert(!parent->get_log().get_missi...
... Sage Weil
 
