Project

General

Profile

Activity

From 08/28/2012 to 09/26/2012

09/26/2012

05:46 PM Bug #3228 (Resolved): osd: capspec parsing is broken
It's always returning ANY, even for simple cases like "allow r". Test cases are in wip-osd-caps. Josh Durgin
05:43 PM Bug #3227 (Resolved): mon: usage for 'ceph auth' is outdated
Currently the output is:... Josh Durgin
05:06 PM CephFS Bug #3186 (Resolved): mds: mds crash on client shutdown
Sage Weil
04:27 PM Bug #3226: osd: invalid capability string can allow arbitrary access
The first commit in wip-osd-caps fixes this. Josh Durgin
04:27 PM Bug #3226 (Resolved): osd: invalid capability string can allow arbitrary access
If you use the cap osd 'allow rwx pool=bar', the parser will add the grant for 'allow rwx', but fail to parse the 'fo... Josh Durgin
04:15 PM rgw Bug #3224 (Resolved): rgw: copy object with alternative attrs does not set ETAG
commit:f031046c4d600ce0a0c2c9ad5856dc100b0901fb fixes that. Yehuda Sadeh
03:47 PM rgw Bug #3224 (Resolved): rgw: copy object with alternative attrs does not set ETAG
When using copy object with alternative attrs, ETAG is not getting set, Yehuda Sadeh
04:03 PM rgw Feature #3225 (Resolved): rgw: stub for get bucket logging
Even if not implementing bucket logging api, we can still have a simple stub that returns that bucket logging is disa... Yehuda Sadeh
01:54 PM Documentation #3084: doc: Why and when should I use latest stable kernel and not the 3.2.0 from U...
Some more details:
The kernel that comes with 12.04 is old. This ticket requests a place in the documentation (FA...
Ross Turk
10:51 AM rgw Bug #2527: RGW may return 409 Conflict when deleting a bucket
The bucket index update when doing object removal is async. That means that when we get to remove the bucket, the ind... Yehuda Sadeh
10:20 AM rgw Feature #3216 (Fix Under Review): rgw: RESTful usage api
Yehuda Sadeh
08:25 AM CephFS Bug #1677 (Resolved): mds interval_set.h: 385: FAILED assert(p->first <= start)
Sage Weil
08:20 AM Bug #2919 (Resolved): ceph kernel module looks for :/ in path, but / stripped by precise mountall
it's fixed in our testing branch, but i didn't send it to linus in for 3.6; it was late in the -rc cycle to be fixing... Sage Weil

09/25/2012

09:53 PM Bug #2919: ceph kernel module looks for :/ in path, but / stripped by precise mountall
Was this not fixed? Dan Mick
09:51 PM Documentation #3223 (Resolved): Doc: ceph osd map not documented
> $ ceph osd getmap -o osdmap
> got osdmap epoch 59
> $ osdmaptool --test-map-object dmick.rbd osdmap
> osdmaptool...
Dan Mick
06:27 PM Feature #3219 (In Progress): Run tests in valgrind on gitbuilder
It turns out this handled by teuthology tests, we just need to update the test suite to include running valgrind on c... Sam Lang
03:31 PM Feature #3219 (Resolved): Run tests in valgrind on gitbuilder
This probably means modifying the scripts to run the test binaries within valgrind and using the --valgrind option in... Sam Lang
06:24 PM CephFS Bug #2285 (Fix Under Review): libcephfs: failure with empty name components
This was probably caused by the memory error in filepath (encoded member not getting initialized), which may explain ... Sam Lang
06:22 PM CephFS Bug #3186 (Fix Under Review): mds: mds crash on client shutdown
Proposed fix in wip-3186, specifically c92e1dd7b0106fa986985d8179a8ed66c61e6709. Needs review. Sam Lang
06:20 PM CephFS Bug #3184 (Fix Under Review): ceph-fuse should not segfault without a keyring
Sam Lang
05:55 PM Documentation #3222 (Resolved): DOC: Get an Object from a Primary OSD
ceph/doc/cluster-ops/control.rst should have usage for `ceph osd map {pool-name} {object-name}`
A knowledgebase ar...
John Wilkins
05:16 PM Bug #3221 (Resolved): disconnect_session_watchers missing pg
/a/teuthology-2012-09-24_04:00:04-regression-stable-master-basic/28971/remote
--- end dump of recent events ---
2...
Samuel Just
05:03 PM rbd Documentation #3220 (Resolved): doc: more detail on QEMU+RBD page
This page could use a bit more detail on the nature of the integration and how to use it. For example, why would som... Ross Turk
03:00 PM Bug #3002: ceph-authtool: --print does not work
commit 4e40a7849442d633c9801639fbdaf27a97f39888
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-08-02 13:02:04...
Anonymous
02:19 PM RADOS Documentation #3218 (Closed): Doc: osdmaptool manpage out of date with code *and* usage
Dan Mick
01:31 PM Feature #3191 (In Progress): client: coverity clean
Sam Lang
01:05 PM rgw Feature #3216 (In Progress): rgw: RESTful usage api
Yehuda Sadeh
01:03 PM rgw Feature #3216 (Resolved): rgw: RESTful usage api
Yehuda Sadeh
01:03 PM rgw Documentation #3217 (Closed): rgw: document RESTful usage api
Yehuda Sadeh
12:39 PM Feature #3215 (Resolved): qa: fix up system tests and add them to regression suite
These tests produce the testrados_* binaries, and their source is in src/test/system/.
There are a few tests we ha...
Josh Durgin
12:39 PM Bug #2525 (Resolved): librados: some functions are not thread-safe
Sage Weil
11:47 AM Bug #3214 (Closed): osdmaptool's usage is incomplete
--test-map-pg is shown, but not --test-map-object. In fact, there are a pile
of options missing from usage:
--du...
Dan Mick
11:18 AM Bug #3208: osd/PG.cc: 1346: FAILED assert(info.last_complete >= log.tail)
wip_last_complete_assert has a fix/explanation.
#3213 also needs to be handled in this case.
Samuel Just
11:16 AM Bug #3213 (Resolved): PG: we do not persist missing set elements not in the log
Normally the missing set is reconstructed on OSD startup by scanning the log and adding objects which are missing or ... Samuel Just
09:46 AM rbd Bug #3204: rbd client kernel panic when osd connection is lost
Here is an excerpt from the log that Chris originally provided.... Alex Elder
09:42 AM rbd Bug #3204: rbd client kernel panic when osd connection is lost
I believe have reproduced this problem (or perhaps something
like it) using stock stable kernel 3.5.4. I'm not 100% ...
Alex Elder
06:27 AM Bug #2476: osd: watch timeout depends on operations to an object
Have you made any progress with this issue? It is very annoying and breaks my CI. Is there any way to avoid or workar... Maciej Galkiewicz
02:42 AM Bug #3212: librados: failed to decode message of type 59 v1: buffer::end_of_buffer
Ok, with packages 0.51-700-g1a9c8c7-1precise from wip-3212 and back to 0.41-1ubuntu2.1 on the client, it now refuses ... John Leach

09/24/2012

08:30 PM rbd Feature #2788: rbd: add -l to rbd ls, show snapshots
Dan Mick
06:47 PM rbd Feature #3198: librbd: coverity clean
we'll see if coverity detects anything else Josh Durgin
06:12 PM rbd Feature #3198 (Fix Under Review): librbd: coverity clean
wip-rbd-coverity Josh Durgin
03:24 PM rbd Feature #3198 (In Progress): librbd: coverity clean
Josh Durgin
06:47 PM Feature #3197: librados: coverity clean
we'll see if coverity detects anything else Josh Durgin
06:12 PM Feature #3197 (Fix Under Review): librados: coverity clean
wip-rbd-coverity Josh Durgin
03:24 PM Feature #3197 (In Progress): librados: coverity clean
Josh Durgin
06:36 PM Bug #3212: librados: failed to decode message of type 59 v1: buffer::end_of_buffer
see wip-3212
we weren't encoding using the pre-v0.42 pool_stat_t.. or at least that's what it looks like, i wasn't...
Sage Weil
06:35 PM Bug #3212 (Resolved): librados: failed to decode message of type 59 v1: buffer::end_of_buffer
from ML:... Sage Weil
06:08 PM rgw Bug #3211 (Resolved): rgw: crash on armhf architecture
Ubuntu, launchpad issue #149582.... Yehuda Sadeh
05:57 PM CephFS Bug #3210 (Resolved): mds crashed and segfault at unlink_local_finish
Logs: /a/teuthology-2012-09-23_19:00:07-regression-master-testing-gcov/28428... Tamilarasi muthamizhan
05:55 PM Bug #3209 (Resolved): rbd_fsx_cache_writethrough test failure in the nightlies
Logs: ubuntu@teuthology:/a/teuthology-2012-09-23_19:00:07-regression-master-testing-gcov/28530... Tamilarasi muthamizhan
04:29 PM Bug #3208 (Resolved): osd/PG.cc: 1346: FAILED assert(info.last_complete >= log.tail)
... Sage Weil
03:22 PM rgw Feature #3207 (Resolved): qa: swift functional tests in nightly
Sage Weil
03:19 PM rgw Feature #3206 (Resolved): rgw: resolve vanity CNAMEs
Sage Weil
10:58 AM rbd Feature #3205 (New): doc: pool permissions for cloning use-case
Sage Weil
10:50 AM rbd Feature #2326 (Resolved): krbd: use new class interfaces, new image format
Sage Weil
10:00 AM CephFS Bug #3184: ceph-fuse should not segfault without a keyring
client->init() wasn't getting checked for error. Proposed fix in wip-3184 branch. Sam Lang
09:21 AM rbd Bug #3204 (Resolved): rbd client kernel panic when osd connection is lost
From ML:... Sage Weil

09/23/2012

10:07 PM Documentation #2968 (In Progress): doc: complete architecture section
John Wilkins
10:03 PM Documentation #2998 (Can't reproduce): doc: validate install docs on ubuntu server
I could use additional information. I have tried installing and re-installing Ubuntu 12.04 followed by Ceph. I did no... John Wilkins
10:00 PM Documentation #155 (Resolved): document ceph auth
This can be closed now, as we have had several iterations. There will be some new additions with a high level overvie... John Wilkins
09:58 PM Documentation #3185 (Resolved): doc: Differences from POSIX is useful content
Added it back in, but to the dev section as suggested. John Wilkins
05:47 PM Feature #3203: Add support for Ubuntu 12.10 (Quantal Quetzal) in deb repository
Gitbuilder and release scripts Sage Weil
03:58 PM Feature #3203 (Resolved): Add support for Ubuntu 12.10 (Quantal Quetzal) in deb repository
Quantal Quetzal is in beta now, it can't be used with the current repository.
Kernel 3.5 will be great for btrfs.
...
Guilhem Lettron
09:42 AM Feature #3202 (Resolved): tools: coverity clean
Sage Weil
09:42 AM rgw Feature #3201 (Resolved): rgw: coverity clean
Sage Weil
09:42 AM Feature #3200 (Resolved): os: coverity clean
Sage Weil
09:41 AM Feature #3199 (Resolved): msg: coverity clean
Sage Weil
09:39 AM rbd Feature #3198 (Resolved): librbd: coverity clean
Sage Weil
09:39 AM Feature #3197 (Resolved): librados: coverity clean
Sage Weil
09:39 AM Feature #3196 (Resolved): include: coverity clean
Sage Weil
09:39 AM Feature #3195 (Resolved): crush: coverity clean
Sage Weil
09:38 AM Feature #3194 (Resolved): common: coverity clean
Sage Weil
09:38 AM Feature #3193 (Resolved): cls: coverity clean
Sage Weil
09:26 AM Feature #3192 (Resolved): osd: coverity clean
Sage Weil
09:25 AM Feature #3191 (Resolved): client: coverity clean
Sage Weil
09:23 AM rbd Feature #3190 (Resolved): qa: test fencing
Sage Weil
09:18 AM Feature #3053 (Duplicate): osd: (de)prioritize recovery
Sage Weil
09:18 AM Feature #3189 (Resolved): osd: use reservations for non-backfill recovery too
Sage Weil
09:11 AM Fix #3188 (New): osd: close read hole
client and now-marked-down osd with old map may continue to read.
solution probably goes something like this:
*...
Sage Weil
08:50 AM Bug #3142: osd: crash induced by fsx workload
heap corruption? this hardly narrows it down, but from ubuntu@teuthology:/a/teuthology-2012-09-22_19:00:05-regressio... Sage Weil

09/22/2012

09:16 AM Bug #3142: osd: crash induced by fsx workload
ubuntu@teuthology:/a/teuthology-2012-09-21_19:00:08-regression-master-testing-gcov/27383 Sage Weil

09/21/2012

11:21 PM Bug #2536 (Can't reproduce): librados crashed while getting stat of an object
Sage Weil
11:20 PM Bug #2525 (Fix Under Review): librados: some functions are not thread-safe
Sage Weil
11:08 PM CephFS Bug #1677 (Fix Under Review): mds interval_set.h: 385: FAILED assert(p->first <= start)
wip-mds Sage Weil
10:39 PM CephFS Bug #3111 (Need More Info): Multiple MDS inevitable appear assert
Can you describe how you're causing the crash in more detail so we can reproduce it here? What commands are you runn... Sage Weil
10:37 PM CephFS Bug #1775 (Resolved): mds startup: _replay journaler got error -22, aborting, possible regresion?
chalking this up to a msgr failure due to one of the zillions of bugs we've fixed in the last few months. Sage Weil
10:36 PM CephFS Bug #2596 (Can't reproduce): mds: spinning on restart
Sage Weil
10:35 PM CephFS Bug #2288: libcephfs: setxattr returns EEXIST following removexattr
the problem is that it's not looking at the projected xattrs when it should. the logically simplest thing would be t... Sage Weil
10:32 PM CephFS Bug #1748 (Can't reproduce): mds segfault CDir::project_fnode
Sage Weil
09:44 PM Feature #3046 (Resolved): mon: factor osd failure reporters into heartbeat grace
Sage Weil
09:44 PM Feature #3047 (Resolved): mon: apply heartbeat grace adjustment to down_out_interval
Sage Weil
09:44 PM Feature #3043 (Resolved): mon: track osd laggy rate/interval
Sage Weil
09:44 PM Feature #3045 (Resolved): mon: factor osd laggy interval into heartbeat grace
Sage Weil
06:46 PM CephFS Bug #2778: uclient: failed asserts if mounting with a non-existent directory
As of last week, I cannot reproduce either, but I didn't look for a specific commit that fixed it. Noah Watkins
04:49 PM CephFS Bug #2778 (Can't reproduce): uclient: failed asserts if mounting with a non-existent directory
Was the error code from ceph_mount() getting checked? I'm not able to reproduce this with either fuse or the followi... Sam Lang
05:45 PM Bug #1975 (Won't Fix): btrfs: EINVAL on snap create
Sage Weil
05:44 PM Bug #2673: ReplicatedPG::prepare_transaction: don't crash on empty ops
this should be part of the librados api tests Sage Weil
05:43 PM Bug #2687 (Resolved): FileStore crashes when "osd_journal_size" is larger than the filesystem
this has called fallocate(2) for a while now. Sage Weil
04:29 PM CephFS Bug #2285: libcephfs: failure with empty name components
I tested this by working around #3186, first creating the file and then in a separate test running ceph_open as the o... Sam Lang
08:02 AM CephFS Bug #2285: libcephfs: failure with empty name components
I suspect that differs from his failure because the paths are already cached on the client. Try creating the file, u... Sage Weil
07:50 AM CephFS Bug #2285: libcephfs: failure with empty name components
I noticed that the above test doesn't have O_CREAT, not sure if the file already existed in previous tests. The foll... Sam Lang
03:47 PM Linux kernel client Bug #3187: ceph fs: crash/hang on 32-bit architecture
OK, mounting with the "nocrc" mount option makes the hang
when writing problem go away. The fix is pretty easy, and...
Alex Elder
03:34 PM Linux kernel client Bug #3187: ceph fs: crash/hang on 32-bit architecture
After some digging, I'm pretty sure the workqueue is from the
ceph messenger, because I have pretty good confidence ...
Alex Elder
02:55 PM Linux kernel client Bug #3187: ceph fs: crash/hang on 32-bit architecture
Below is the stack dump taken from gdb that I've been seeing
on the 32-bit system running inside a VM. Note this li...
Alex Elder
01:45 PM Linux kernel client Bug #3187 (Resolved): ceph fs: crash/hang on 32-bit architecture
I was hitting this while attempting to write files on a 32-bit system
running inside a VM, trying to reproduce bug 3...
Alex Elder
11:31 AM CephFS Bug #3186: mds: mds crash on client shutdown
The getattr segfault is actually from the second mount call in the client test. Sam Lang
11:00 AM CephFS Bug #3186: mds: mds crash on client shutdown
Verified that this is only an issue with an empty component in the path (related to #2285). Sam Lang
10:52 AM CephFS Bug #3186: mds: mds crash on client shutdown
can you reproduce with debug mds = 20 ? Sage Weil
09:53 AM CephFS Bug #3186: mds: mds crash on client shutdown
Just verified the same error occurs with one mds. Sam Lang
09:44 AM CephFS Bug #3186 (Resolved): mds: mds crash on client shutdown
With this libcephfs test: https://gist.github.com/3762534
The first mds crashes during shutdown. The ceph cluster...
Sam Lang
09:03 AM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
I created a program to write out patterned data to a file,
and in a separate read mode, verify the data in a file
c...
Alex Elder
05:32 AM Documentation #3185 (Resolved): doc: Differences from POSIX is useful content
removing appendix/differences-from-posix.rst is a bad idea i think. perhaps put it in dev/ or something? people want ... Anonymous

09/20/2012

03:41 PM Feature #3097 (Resolved): create a teuthology task to run class unitests
Sage Weil
02:06 PM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
I have set up a VM running a 32-bit kernel. It reports
via arch(1) that it is an i686 architecture. It is not
run...
Alex Elder
12:56 PM CephFS Bug #3184 (Resolved): ceph-fuse should not segfault without a keyring
Saw this at a customer on-site — it segfaults if it can't find the keyring. Greg Farnum
11:54 AM CephFS Bug #1957 (Resolved): ceph-fuse: have "." and ".." entries consistently
Sage Weil
11:16 AM Bug #3182: No JSON object could be decoded - failure in the nightly run
the rados bench process hasn't started up yet when we try the admin socket... it's a problem with teh test script Sage Weil
08:04 AM rbd Bug #3151 (Resolved): krbd: possible circular locking dependency (sysfs_lock and ctl_mutex) (test...
This has been resolved. The culprit was the committed patch
entitled "rbd: expand lock protection in rbd_add()" whi...
Alex Elder

09/19/2012

06:38 PM CephFS Bug #1957: ceph-fuse: have "." and ".." entries consistently
It looks like the trick is to use a fake inode number when filling in the parent (..) entry. I chose 2, as 1 is alre... Sam Lang
04:23 PM Bug #3182 (Can't reproduce): No JSON object could be decoded - failure in the nightly run
Logs: ubuntu@teuthology:/a/teuthology-2012-09-19_00:00:13-regression-next-testing-basic/25484... Tamilarasi muthamizhan
11:30 AM Fix #3180 (Rejected): use of strerror() for possibly-negative return values
System calls and similar routines throughout Ceph use negative constant return codes
to indicate error; strerror() i...
Dan Mick
01:51 AM Bug #2075: osd: recover_got assert
Hi Sage,
as announced on the list, find the logfile attached. Don't know if it's exactly the same, at least almost...
Oliver Francke

09/18/2012

09:08 PM rbd Bug #3151: krbd: possible circular locking dependency (sysfs_lock and ctl_mutex) (testing branch)
Strike that. I have now learned that the "copy.sh" script
was buggy (which I've now fixed) and have been unable to
...
Alex Elder
08:29 PM rbd Bug #3151: krbd: possible circular locking dependency (sysfs_lock and ctl_mutex) (testing branch)
FINALLY. After going through a bunch of different tests
to try to narrow it down, I have found that simply running
...
Alex Elder
04:43 PM rbd Feature #2951 (Resolved): cls_rbd: remove locking methods
Josh Durgin
04:42 PM rbd Feature #2686 (Resolved): rbd: let users specify a usage for shared locks
Josh Durgin
04:42 PM rbd Feature #2952 (Resolved): librbd: use generic rados locking class
Josh Durgin
04:42 PM rbd Feature #2556 (Resolved): rbd tool: break image locks
Josh Durgin
01:45 PM rgw Bug #3127 (Resolved): rgw: negative bucket stats
Also wip-3127-stable contains the required fix and an additional band aid solution for argonaut. Yehuda Sadeh
01:42 PM rgw Bug #3127: rgw: negative bucket stats
merged to master at commit:9b628efee1fc31fcb1f46818af4f6b262b460840 Yehuda Sadeh
07:44 AM Documentation #3174 (Resolved): Cephx with RBD/Qemu/libvirt
With cephx turned on by default. Cephx descriptions must be solid for:
- Client admin
- Ensuring Qemu can access...
John Wilkins
07:41 AM Documentation #3173 (Resolved): Cephx Overview
We need a stronger overview of Cephx for bobtail. The overview should include:
- Creating Admin Users. This will ...
John Wilkins
03:07 AM Bug #3172 (Resolved): ceph::buffer::bad_alloc downloading a large object using rados
got a 1gig object, when I get it I get a bad_alloc error.
This is on a 64bit Ubuntu 12.04.1 LTS box, with packages...
John Leach

09/17/2012

07:01 PM Documentation #3098 (Resolved): doc: ceph.conf host= field still confuses people
I updated the General Settings Reference and the Ceph Configuration section to point this out explicitly. John Wilkins
05:26 PM CephFS Documentation #3077 (Resolved): doc: Apparently people still don't realize we don't recommend Cep...
Added notes to http://ceph.com/docs/master/cephfs/ and http://ceph.com/docs/master/start/quick-cephfs/ John Wilkins
04:28 PM Documentation #3155 (Resolved): doc: ceph --admin-daemon ... show config
Added a section after http://ceph.com/docs/master/config-cluster/ceph-conf/#runtime-changes. John Wilkins
08:44 AM Documentation #3155 (Resolved): doc: ceph --admin-daemon ... show config
Seems to be completely undocumented currently.
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/8944/foc...
Anonymous
03:35 PM Documentation #3099 (In Progress): doc: write install ceph+openstack with crowbar
Ross Turk
02:49 PM rbd Documentation #3171 (Resolved): doc: how to setup OpenStack (Glance and Cinder|Nova) to use rbd c...
This can probably be an addendum to the current OpenStack docs page for releases Folsom and beyond. Josh Durgin
02:41 PM rbd Documentation #3170 (Closed): doc: document image locking
Some resources:
The original discussion:
http://comments.gmane.org/gmane.comp.file-systems.ceph.devel/7094
The...
Josh Durgin
02:27 PM rbd Bug #2597 (Resolved): Import of image from file appears to succeed, but image not present in the ...
This was unintentionally solved by commit:fd4b294280cc2238289dbd7b37236d8a8cb10160. The --pool parameter to rbd impor... Josh Durgin
02:20 PM Feature #3167 (Resolved): auth: separate read/write capabilities for classes
Currently class operations can only be allowed in an all-or-nothing manner, with the x capability.
It would be use...
Josh Durgin
02:11 PM Feature #3097: create a teuthology task to run class unitests
We don't need a separate teuthology task for this, it could just be another workunit. A localfs task that sets up the... Josh Durgin
02:04 PM Bug #2563: leveldb corruption
Experiencing the same issue on a production ceph cluster.
ceph version 0.48.1argonaut (commit:a7ad701b9bd479f204...
Matt Garner
12:34 PM Feature #3080: rados: support asynchronous replication of some sort
Here's an write-up of what the proprietary XenServer product does: http://blogs.citrix.com/2012/08/24/storage_xenmotion/ Anonymous
10:35 AM RADOS Documentation #3166 (Closed): doc: Explain OSD up/down, in/out: what does it mean, where does it ...
Something like: up/down is observed by peers, spread via gossip, reaches monitors that make a decision to declare an ... Anonymous
10:33 AM Documentation #3165 (Rejected): doc: what to do after install to validate core / rbd / radosgw / ...
Anonymous
10:32 AM Documentation #3164 (Rejected): doc: "ceph auth add" syntax: * and ... are magic
Anonymous
10:22 AM Bug #3163 (Won't Fix): doc: explain meaning of pg dump output
Anonymous
10:16 AM devops Feature #3161 (Rejected): make gcov website public, via proxy on gitbuilder.sepia.ceph.com
Anonymous
09:38 AM devops Feature #3160 (Rejected): upstart: ceph-osd job leaves disks mounted
Anonymous
09:27 AM devops Feature #3158 (Closed): downburst: add --wait based on cd-rom eject detection kludge
Anonymous
09:26 AM devops Feature #3082 (In Progress): ceph-deploy: Support deployments without a framework like Chef, Pupp...
Anonymous
09:24 AM devops Feature #3157 (Resolved): upstart: move mds scripts to ceph-mds package.
These are currently in ceph.deb, but the ceph-mds binary is in ceph-mds. The upstart scripts should move to the packa... Anonymous
09:07 AM Messengers Bug #3156 (Resolved): port range is hard-coded
Right now the port range for daemons is hard-coded as 6800-6900, which means if you get more daemons on a box than ca... Greg Farnum

09/15/2012

03:22 PM rgw Feature #2923 (Resolved): rgw: non hard-coded pool names
Merged, commit:ff2ce1cbd868a28b95f5a53e7004718630931fbf.
This still does not provide a tool to modify pool names, ...
Yehuda Sadeh
01:41 PM rgw Bug #2915 (Resolved): rgw: copy of large object times out
Yehuda Sadeh
01:09 PM rgw Bug #2915 (Duplicate): rgw: copy of large object times out
merged, commit:a16fa0492eaa1f03b8676f8793eaba6fdad21c89 Yehuda Sadeh
01:14 PM rgw Cleanup #3154 (Rejected): rgw: configurable auid when creating pools
We should revise the defaults we use when we create pools, e.g. auid. Maybe we should avoid creating pools altogether? Yehuda Sadeh

09/14/2012

04:35 PM rgw Feature #3153 (New): Add --preserve-basic-acl option to obsync
A DHO Request:
The --no-preserve-acls flag is currently used to ensure that policies that are specific to external...
JuanJose Galvez
03:29 PM devops Feature #2548: ceph-disk-activate: take mount options from ceph.conf
Defaults for XFS, from an email from Sage:
When I talked to Christoph at Linuxcon a couple weeks back he suggested...
Anonymous
11:34 AM rgw Tasks #3152 (Resolved): rgw: document usage testing
Yehuda Sadeh

09/13/2012

11:38 PM rbd Bug #3151 (Resolved): krbd: possible circular locking dependency (sysfs_lock and ctl_mutex) (test...
Detected when unmapping an image:... Josh Durgin
04:28 PM rgw Bug #3127 (Fix Under Review): rgw: negative bucket stats
There were a few issues in the mechanism that recovers from failed operations. When we write/remove an object we go t... Yehuda Sadeh
03:42 PM rbd Bug #3133: perfcounters: sigfpe when rbd caching is enabled
The floating point exception in this case was due to "inexact result". This kind of exception is turned off by defaul... Josh Durgin
02:32 PM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
Recent logs: ubuntu@teuthology:/a/teuthology-2012-09-13_04:00:05-regression-stable-master-basic/22002 Tamilarasi muthamizhan
08:34 AM Bug #3135 (New): mon_host without ports leads to failure
Anonymous

09/12/2012

11:01 PM Bug #3142: osd: crash induced by fsx workload
Attempting a bisect from master to stable. Using
cd ~/src/ceph/ceph
git describe
make distclean && ./do_autoge...
Dan Mick
02:57 PM Bug #3142: osd: crash induced by fsx workload
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-09-11_02:00:03-regression-testing-testing-basic/20743 Sage Weil
02:56 PM Bug #3142 (Resolved): osd: crash induced by fsx workload
... Sage Weil
04:45 PM Bug #3124 (Resolved): Rados api test failure in the nightly run
d697b54c9bed80d9231f8f20fd22245d6f9d91f3 Samuel Just
02:44 PM Bug #3124: Rados api test failure in the nightly run
Recent logs: ubuntu@teuthology:/a/teuthology-2012-09-11_19:00:29-regression-master-testing-gcov/21043
Tamilarasi muthamizhan
04:28 PM Bug #3145 (Resolved): ceph-mon: mon_host insists on commas, does not understand spaces
Everything else seems happy with either of... Anonymous
04:11 PM Bug #3135: mon_host without ports leads to failure
I stumbled on this again. The trigger is this:
- no mon hosts or mon addr specified
- mon finds its own ip via pu...
Anonymous
10:06 AM Bug #3135 (Can't reproduce): mon_host without ports leads to failure
I swear this failed for me when I was at a customer site in NYC. That may have been when they were still using the ol... Anonymous
03:51 PM rgw Feature #3144 (New): Make obsync able to tolerate N failures in migrating a bucket before abortin...
DHO Request:
Currently if any one object fails to migrate, obsync aborts the entire bucket, making it very difficu...
JuanJose Galvez
03:23 PM Bug #3143 (Won't Fix): Obsync object verification takes too long
In Summary:
obsync verification for each objects takes about 3 seconds - with a large amount of objects per bucket...
JuanJose Galvez
02:17 PM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
ubuntu@teuthology:/a/teuthology-2012-09-12_02:00:04-regression-testing-testing-basic/21369 Tamilarasi muthamizhan
01:16 PM rgw Tasks #3118: QA: radosgw upstart: test the radosgw upstart script respawn functionality
There's no restarting ever with sysvinit, so I really don't expect Carl to have made that sort of a mistake. The "ser... Anonymous
11:17 AM rgw Tasks #3118: QA: radosgw upstart: test the radosgw upstart script respawn functionality
Using ceph version 0.51-516-gdfcb48d
My cluster is configured based off the examples in the web docs.
Following the...
Ken Franklin

09/11/2012

05:41 PM rgw Bug #3127: rgw: negative bucket stats
managed to reproduce with some code changes to simulate identified scenario. The code changes:
1. Randomly skip the b...
Yehuda Sadeh
05:15 PM Feature #3137 (Fix Under Review): osd: make default osd_journal_size 1GB, if existing (block or f...
wip-osd-journal Sage Weil
04:55 PM Feature #3137 (Resolved): osd: make default osd_journal_size 1GB, if existing (block or file) use...
This avoids having a "osd journal size = 1000" in every chef/ceph-deploy etc ceph.conf Anonymous
05:00 PM Bug #3135: mon_host without ports leads to failure
the code appears to set it to 6789. is it easy for you to reproduce? this must be some strange interaction with the... Sage Weil
04:53 PM Bug #3135 (Resolved): mon_host without ports leads to failure
ceph-mon should default to :6789 if port is not mentioned, however using just... Anonymous
04:56 PM Feature #3138 (Rejected): osd: Make filestore_xattr_use_omap=true the default? Or something?
Sam said this is not desired on btrfs, but I think the typical ceph.conf should not need to set this. Make the logic ... Anonymous
04:54 PM Feature #3136 (Resolved): for bobtail: Set auth_supported=cephx as default
Well, set it via the non-deprecated options, but still. Anonymous
04:34 PM Bug #3134 (Can't reproduce): osd: hang in journal?
... Sage Weil
04:12 PM Bug #3052: ceph-osd --mkfs fails on ext3
IRC user mrjack_ confirms that this (from Greg) fixes the problem:
diff --git a/src/os/FileStore.cc b/src/os/Fil...
Dan Mick
04:05 PM Feature #3101 (Resolved): allow "ceph osd crush set" without overwriting weight
Sage Weil
03:36 PM rbd Bug #3133 (Duplicate): perfcounters: sigfpe when rbd caching is enabled
On irc, Damien Churchill reported this backtrace from qemu-kvm with rbd caching enabled:... Josh Durgin
03:32 PM Bug #2787 (Resolved): RBD test copy fails when thrashing
I think this was fixed shortly after it was discovered. Josh Durgin
02:54 PM Bug #3128 (Resolved): Obsync doesn't appear to handle permanent redirect with specified endpoint ...
Sage Weil
02:36 PM Bug #3128 (Fix Under Review): Obsync doesn't appear to handle permanent redirect with specified e...
wip-obsync Sage Weil
02:10 PM Bug #2761 (Resolved): osd: failed to recover before timeout expired
Now in master/next 4e5283d476fc3405f30c91f1447e6059a1b52c8c Samuel Just
11:23 AM Bug #2761: osd: failed to recover before timeout expired
Testing fix in wip_push_fix c56b8a8bde6242e8b485a5d87798edf068d22642.
recovery_op leak.
Samuel Just
01:11 PM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
Recent logs: ubuntu@teuthology:/a/teuthology-2012-09-11_02:00:03-regression-testing-testing-basic/20743 Tamilarasi muthamizhan
11:29 AM Bug #3124: Rados api test failure in the nightly run
Recent logs: ubuntu@teuthology:/a/teuthology-2012-09-10_19:00:08-regression-master-testing-gcov/20355 Tamilarasi muthamizhan
10:48 AM Bug #2784: osd hit suicide timeout
Recent logs: /a/teuthology-2012-09-08_04:00:03-regression-stable-master-basic/19039 Tamilarasi muthamizhan
10:44 AM Bug #3130 (Resolved): osd: crash in SimpleMessenger::reaper
Logs: ubuntu@teuthology:/a/teuthology-2012-09-08_02:00:03-regression-testing-testing-basic/18829... Tamilarasi muthamizhan
10:38 AM Bug #3129 (Can't reproduce): valgrind reported error on uninitialized osd
logs: ubuntu@teuthology:/a/teuthology-2012-09-09_02:00:03-regression-testing-testing-basic/19388... Tamilarasi muthamizhan
04:31 AM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
we have created two files( 4GB and 5 GB) using dd and if=/dev/urandom in our local machine for 32 kernel client issue... Mohamed Pakkeer

09/10/2012

06:51 PM Feature #3101 (Fix Under Review): allow "ceph osd crush set" without overwriting weight
Sage Weil
10:55 AM Feature #3101: allow "ceph osd crush set" without overwriting weight
Sage proposed this would be "ceph osd crush create-or-move".. I think the args could be "ID INITIAL_WEIGHT VAR=VAL.."... Anonymous
05:41 PM rgw Bug #3127: rgw: negative bucket stats
DHO reported the following details regarding this issue:
root@peon7935:~# radosgw-admin bucket stats --uid=andfar3...
JuanJose Galvez
05:11 PM rgw Bug #3127 (Resolved): rgw: negative bucket stats
We see a case where after all the objects were removed out of a bucket (40TB of 64k objects) the bucket index stats e... Yehuda Sadeh
05:18 PM Bug #3128 (Resolved): Obsync doesn't appear to handle permanent redirect with specified endpoint ...
It seems that accessing the bucket from http://s3.amazonaws.com/mikesbackup3or results in a PermanentRedirect error. ... JuanJose Galvez
04:24 PM Bug #3126 (Can't reproduce): mds crashed bool CDir::check_rstats()
Logs: ubuntu@teuthology:/a/teuthology-2012-09-09_00:00:04-regression-next-testing-basic/19242... Tamilarasi muthamizhan
04:19 PM Bug #3125 (Resolved): Assertion Error in peer.py - failure from the nightly run
Logs: ubuntu@teuthology:/a/teuthology-2012-09-07_19:00:06-regression-master-testing-gcov/18598... Tamilarasi muthamizhan
04:11 PM Bug #3124 (Resolved): Rados api test failure in the nightly run
Logs: ubuntu@teuthology:/a/teuthology-2012-09-08_19:00:04-regression-master-testing-gcov/19111... Tamilarasi muthamizhan
12:51 PM Documentation #3123 (Closed): doc: Where to see "ceph osd tell ID bench" results (ceph -w)
http://www.spinics.net/lists/ceph-devel/msg08789.html Anonymous
12:38 PM rgw Feature #1712 (Resolved): rgw: support swift manifest objects
Done, merged commit:63c090939951a8c5a6c0abb41d811f62c6fc04b2. Yehuda Sadeh
11:31 AM devops Feature #3122: chef: Use "argonaut"/"bobtail" in apt sources.list, not "stable"
debian is now a symlink to debian-argonaut on ceph.com. docs just need to be updated. Sage Weil
11:30 AM devops Feature #3122 (Resolved): chef: Use "argonaut"/"bobtail" in apt sources.list, not "stable"
Depends: #3083 Anonymous
11:29 AM rgw Feature #3060 (Resolved): rgw: different usage info break down
Done, merged, commit:a4bc7953a72fa2828e2aecc1d81f41d25222e0b1. Also update teuthology to reflect new radosgw-admin ou... Yehuda Sadeh
11:22 AM Documentation #2971 (Resolved): doc: expand/complete mon settings reference
Sage Weil
11:18 AM devops Feature #3121 (Rejected): chef: allow specifying exact version for "package" LWRP
This lets an admin have stricter control over what versions of packages are installed; decreases the chances of accid... Anonymous
11:10 AM devops Feature #3120 (Resolved): provide command "ceph-disk-list" that lists osd data/journal disks and ...
This should be easy after #3078 and #3079.
Something like...
Anonymous
10:59 AM devops Feature #3119 (Duplicate): chef: support destroying an osd
DH cookbooks do this by setting a node attribute that maps osd.id -> desired action, one of the actions is destroy.
...
Anonymous
10:44 AM rbd Feature #2686 (Fix Under Review): rbd: let users specify a usage for shared locks
Josh Durgin
09:57 AM devops Feature #2554: chef: open question: How do we discover what disks we should use as Ceph data disks?
What DH does: each Chef node has an attribute that's a list of block device pairs (data, journal). Anonymous
09:52 AM rgw Tasks #3118 (Closed): QA: radosgw upstart: test the radosgw upstart script respawn functionality
I have a vague bug report from Carl that radosgw, using the upstart script, is not restarting properly. Let's reprodu... Anonymous
06:47 AM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
I tried your dd command with /dev/random
echo foo > /mnt/ceph/foo
dd if=/dev/random of=/mnt/ceph/foo bs=1M count=...
Mohamed Pakkeer
03:45 AM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
I tried with the following command to create a file with random data
dd if=/dev/random of=cephtest bs=1M count=1 s...
Mohamed Pakkeer
03:15 AM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
Hi Sage,
I tried with dd, it shows foo. But I have tried nearly 100 more video files, but i am getting same erro...
Mohamed Pakkeer

09/08/2012

09:44 AM Bug #3115: Cannot compile ceph 0.48.1
Apparently this is an issue with the tarballs from github. With the tarball from the blog post this error doesn't occur. Dennis Jacobfeuerborn
09:27 AM Bug #3115 (Won't Fix): Cannot compile ceph 0.48.1
I'm trying to compile ceph on a Fedora 15 system but that fails when I try to run autogen.sh:... Dennis Jacobfeuerborn

09/07/2012

07:52 PM rgw Feature #3114 (Resolved): rgw: support cache-control header field
Yehuda Sadeh
07:20 PM CephFS Bug #3088 (Need More Info): NULL pointer dereference at ceph_d_prune
Sage Weil
07:20 PM Linux kernel client Bug #3087 (Need More Info): Hung ceph_msg_kfree
Sage Weil
07:19 PM Linux kernel client Bug #3112: ceph 32 bit kernel client issue with file size more than 4GB.
Hi,
A real simple test is to see if a write to a file offset > 4gb inappropriately wraps to a low file offset. Ca...
Sage Weil
01:19 AM Linux kernel client Bug #3112 (Resolved): ceph 32 bit kernel client issue with file size more than 4GB.
We have been using 32 bit and 64 bit ceph cluster and kernel client. we have mounted ceph cluster using 32 bit and 64... Mohamed Pakkeer
07:16 PM rbd Feature #2556 (Fix Under Review): rbd tool: break image locks
wip-librbd-locking Josh Durgin
07:16 PM rbd Feature #2951 (Fix Under Review): cls_rbd: remove locking methods
wip-librbd-locking Josh Durgin
07:15 PM rbd Feature #2952 (Fix Under Review): librbd: use generic rados locking class
wip-librbd-locking Josh Durgin
04:18 PM rgw Feature #1776 (Resolved): rgw: swift auth prefix should be configurable (and optional)
Fixed, commit:06290f6dffec33f4a9f47e4c3733f6779173f595. Yehuda Sadeh
04:03 PM Bug #2834: osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == repop->v)
Recent log: ubuntu@teuthology:/a/teuthology-2012-09-07_00:00:07-regression-next-testing-basic/17906... Tamilarasi muthamizhan
03:57 PM Bug #2834: osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == repop->v)
Recent log: ubuntu@teuthology:/a/teuthology-2012-09-07_00:00:07-regression-next-testing-basic/17906 Tamilarasi muthamizhan
03:35 PM Bug #3072 (Resolved): osd/ReplicatedPG.cc: 3548: FAILED assert(waiting_for_ondisk.begin()->first ...
commit:03136d057f0048e9cd840a6e83efedfc20969247 Sage Weil
09:29 AM CephFS Documentation #3113 (Rejected): Ceph FS Options Could Use Some Additional Information
Documentation for mounting Ceph FS doesn't reflect some available options. Also, there are disparities -- e.g., cephf... John Wilkins

09/06/2012

11:38 PM CephFS Bug #3111 (Can't reproduce): Multiple MDS inevitable appear assert
I use two sets of equipment installed suse11sp2, deployment of CEPH double MDS active?I do LOOKUP/GETATTR/CREATE/SYML... zhiyuan duan
09:06 PM Documentation #3096 (Resolved): doc: Hardware recommendations sort of look like ranges
I also added a considerable amount of commentary, since people have asked for more guidance. John Wilkins
09:08 AM Documentation #3096 (Resolved): doc: Hardware recommendations sort of look like ranges
http://ceph.com/docs/master/install/hardware-recommendations/ has entries like "2-1GB Ethernet NICs" and "1-500GB dri... Anonymous
06:08 PM rgw Bug #2652 (Resolved): Segmentation fault in rest-bench
This should have been fixed. Yehuda Sadeh
06:06 PM rgw Feature #3102 (Need More Info): rgw: dedup
Yehuda Sadeh
04:45 PM rgw Feature #3060 (Fix Under Review): rgw: different usage info break down
Yehuda Sadeh
04:22 PM Feature #3101 (Resolved): allow "ceph osd crush set" without overwriting weight
Currently, a ceph-osd coming up gets its weight reset by src/upstart/ceph-osd.conf. This is not good. A node reboot s... Anonymous
01:30 PM Feature #2894: cli: help command for ceph subsystems
It also has "ceph --help", which gives a list of many commands, but not all. Remove the list of commands from --help ... Anonymous
01:10 PM Feature #1783 (Resolved): osd: scrub incrementally across hash range using MOSDPGScan
Sage Weil
01:10 PM RADOS Feature #3011 (Resolved): Remove "pool" terminology from CRUSH
Sage Weil
01:09 PM rbd Feature #3100 (Resolved): rbd-fuse prototype
Sage Weil
11:03 AM rgw Feature #1776 (Fix Under Review): rgw: swift auth prefix should be configurable (and optional)
Yehuda Sadeh
11:01 AM Documentation #3099 (Resolved): doc: write install ceph+openstack with crowbar
Ross Turk
10:59 AM Documentation #2981 (Resolved): doc: write add/remove a monitor
Ross Turk
10:15 AM Documentation #3098 (Resolved): doc: ceph.conf host= field still confuses people
I saw one group put in IP addresses in host=. I saw one person put in long hostnames (with domain).
The sysvinit i...
Anonymous
10:12 AM rgw Cleanup #3085 (Resolved): rgw: remove 'bucket info' from radosgw-admin usage dump
Fixed, commit:a5b9ba77737cbd7bf604492a40b291b2811f034f Yehuda Sadeh
09:59 AM Feature #3097 (Resolved): create a teuthology task to run class unitests
We have some class unitests now (lock, refcount). We should be able to run these through a teuthology task. Yehuda Sadeh
08:46 AM Feature #3095 (Resolved): rbd tool resize improvements
It might be handy if the rbd CLI tool could warn an admin when performing a resize operation that would ultimately en... Damien Churchill
07:18 AM rbd Bug #3094 (Resolved): krbd: race between finding existing client and creating new one
In rbd_get_client(), an attempt is made to reuse an existing
rbd client if there is already one with matching option...
Alex Elder

09/05/2012

04:20 PM Bug #2761: osd: failed to recover before timeout expired
Recent logs: ubuntu@teuthology:/a/teuthology-2012-09-05_04:00:05-regression-stable-master-basic/16530 Tamilarasi muthamizhan
01:03 PM Feature #3047 (Fix Under Review): mon: apply heartbeat grace adjustment to down_out_interval
Sage Weil
12:29 PM Linux kernel client Bug #3058 (Won't Fix): xfs: possible circular locking dependency detected
Sage Weil
12:28 PM CephFS Bug #2444: null pointer deference in ceph_d_prune inside kvm
Can you try... Sage Weil
12:28 PM CephFS Bug #3088: NULL pointer dereference at ceph_d_prune
Can you try... Sage Weil
11:56 AM CephFS Bug #3088: NULL pointer dereference at ceph_d_prune
Matt Garner wrote:
> Probably a duplicate of BUG #2444.
** Possibly
Matt Garner
11:55 AM CephFS Bug #3088: NULL pointer dereference at ceph_d_prune
Probably a duplicate of BUG #2444. Matt Garner
11:52 AM CephFS Bug #3088 (Resolved): NULL pointer dereference at ceph_d_prune
when I mount (via fstab) a specific folder
10.10.140.210:6789:/pool-hs-san-1 /mnt/ceph1-pool-hs-san-1 ceph ...
Matt Garner
11:03 AM Linux kernel client Bug #3087 (Can't reproduce): Hung ceph_msg_kfree
Sep 4 00:37:00 rmi-orem-ceph1-mds1 kernel: [233040.316059] INFO: task smbd:31483 blocked for more than 120 seconds.
...
Matt Garner
08:52 AM Feature #3086 (Resolved): workqueue: dynamically adjust number of threads
Add a config observer, and logic to start new threads and let extra threads die off. Sage Weil
08:32 AM Bug #2176: dependencies not checked by autoconf
Gary, want to quickly check if configure.ac is checking these things and see if this is still a problem? IIRC Mark s... Sage Weil
08:30 AM CephFS Bug #1510 (Resolved): fsx failure on cfuse
this has been back in the qa suite for weeks now, no problems. Sage Weil

09/04/2012

08:52 PM Linux kernel client Bug #2754: cephfs show_location produces kernel "divide error: 0000 [#1]" when run against a dire...
ceph_calc_file_object_mapping() does no divide-by-zero checking. Sage Weil
08:45 PM CephFS Bug #1509 (Can't reproduce): cfuse sometimes hangs after unmount
Sage Weil
08:44 PM CephFS Bug #1603 (Can't reproduce): ceph-fuse crash during unmount
Sage Weil
08:44 PM CephFS Bug #1806 (Can't reproduce): MDS won't start
logs indicate an unresponsive osd. Sage Weil
08:41 PM Bug #2151 (Resolved): mon: pg creation retry is slow
Sage Weil
08:40 PM CephFS Bug #1640 (Resolved): mds: failed assert(trim_to > trimming_pos)
pretty sure this is out of order osd replies, due to #2947 and similar bugs. Sage Weil
08:09 PM rgw Feature #2923 (Fix Under Review): rgw: non hard-coded pool names
Yehuda Sadeh
04:28 PM CephFS Documentation #3077 (In Progress): doc: Apparently people still don't realize we don't recommend ...
Added comments in Quick Start and Ceph FS sections. d233271434d1daa640273efe99ef3ee193da1de6 John Wilkins
02:50 PM CephFS Documentation #3077 (Resolved): doc: Apparently people still don't realize we don't recommend Cep...
Not sure what more we can or should do, but I run across this recently again. Anonymous
04:18 PM rgw Cleanup #3085 (Resolved): rgw: remove 'bucket info' from radosgw-admin usage dump
Yehuda Sadeh
04:13 PM Documentation #3084 (Resolved): doc: Why and when should I use latest stable kernel and not the 3...
This will be a living document, more reasons added as time progresses.
Another similar one will probably be starte...
Anonymous
04:11 PM Feature #3083 (Resolved): Provide separate APT repos for argonaut, bobtail, etc; stable would alw...
I think we should by default make installations use the "argonaut" style name, so they don't accidentally upgrade jus... Anonymous
03:51 PM devops Feature #3082 (Closed): ceph-deploy: Support deployments without a framework like Chef, Puppet, J...
Anonymous
03:34 PM rgw Feature #3081 (Duplicate): radosgw: asynchronous read-only replication mechanism
See #3080 for a similar RADOS-level feature. Proposed separately for radosgw as it can offer different tradeoffs, and... Anonymous
03:31 PM Feature #3080 (New): rados: support asynchronous replication of some sort
Probably easiest to construct using pool-level snapshots, and computing a delta between two snapshots on the sending ... Anonymous
03:07 PM devops Feature #3079 (Resolved): support journal on any accessible block device
Anonymous
03:06 PM devops Feature #3078 (Resolved): support journal in 2nd partition on same disk as osd
Anonymous
02:22 PM Documentation #3076: doc: Explain how loopback mounts (using kclient, ceph-fuse should be immune)...
this definitely qualifies as a faq Sage Weil
02:19 PM Documentation #3076: doc: Explain how loopback mounts (using kclient, ceph-fuse should be immune)...
The discussion in this thread https://lkml.org/lkml/2004/7/26/68 is interesting for a more in-depth discussion of the... Josh Durgin
02:00 PM Documentation #3076 (Resolved): doc: Explain how loopback mounts (using kclient, ceph-fuse should...
Currently, consuming CephFS/RBD services provided by the same machine can lead to deadlock. People in general don't r... Anonymous
02:08 PM rbd Cleanup #1761: krbd: make block/segment naming consistent
I just committed the following to the ceph-client/testing
branch. It renames field "block_name" to be "object_prefi...
Alex Elder
10:49 AM rbd Cleanup #1761 (Resolved): krbd: make block/segment naming consistent
Sage Weil
02:02 PM Bug #3071 (Resolved): osd: crash in ~ReplicatedPG
40fd6ba8ed9ba70c8d20a79936f53f10f2dfe839 should take care of this one. Samuel Just
01:49 PM Bug #3072: osd/ReplicatedPG.cc: 3548: FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
0aad5462eb79be0427004f2442903bb56c2057c1 should take care of this one. Samuel Just
12:06 PM Bug #3072: osd/ReplicatedPG.cc: 3548: FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
ubuntu@teuthology:/a/sage-h/15556 and 15540 Sage Weil
08:01 AM Bug #3072 (Resolved): osd/ReplicatedPG.cc: 3548: FAILED assert(waiting_for_ondisk.begin()->first ...
... Sage Weil
01:48 PM Bug #3025 (Resolved): WaitActingChange
1db67c443d84dc5d1ff53cc820fdfd4a2128b680 Samuel Just
01:38 PM Feature #3045 (Fix Under Review): mon: factor osd laggy interval into heartbeat grace
Sage Weil
01:38 PM Feature #3046 (Fix Under Review): mon: factor osd failure reporters into heartbeat grace
Sage Weil
12:04 PM Bug #3070 (Resolved): client/mds: ms failure injection causes problems
Sage Weil
11:29 AM Bug #3075 (Resolved): rados python tests occasionally hang with ms failures
commit:e09b26555c6132ffce08b565780a39e4177cbc1c Sage Weil
10:04 AM Bug #3075: rados python tests occasionally hang with ms failures
... Sage Weil
09:36 AM Bug #3075 (Resolved): rados python tests occasionally hang with ms failures
... Sage Weil
11:20 AM Bug #2954 (Resolved): osd: scrub stat mismatch, got 18/19 objects, 14/15 clones, 22478527/2538528...
Most likely fixed in b273c376ca6455f1e36be82cbc91606debd5fb1e. Samuel Just
09:13 AM rgw Feature #3074 (Resolved): radosgw needs --help support
User reports he only discovered radosgw supports -d via radosgw-admin (which shouldn't even have that option).
Needs...
Anonymous
09:12 AM Feature #3044 (Resolved): osd: include fail stamp in failure messages
Sage Weil
09:12 AM rgw Bug #3073 (Resolved): radosgw-admin: is not a daemon, should not have -d/-f options
... Anonymous

09/03/2012

08:24 PM Documentation #2983: doc: write performance monitoring
From Sage:
-- osds --
overall throughput
osd.op
osd.op_r
osd.op_w
osd.op_in...
Ross Turk
12:12 PM Bug #3071: osd: crash in ~ReplicatedPG
ubuntu@teuthology:/a/sage-fuse2/14470 Sage Weil
12:12 PM Bug #3071 (Resolved): osd: crash in ~ReplicatedPG
i suspect this was from the coverity fixes?... Sage Weil

09/02/2012

11:53 AM Bug #3070 (Resolved): client/mds: ms failure injection causes problems
most recently, ubuntu@teuthology:/a/teuthology-2012-09-01_19:00:04-regression-master-testing-gcov/13271 Sage Weil

09/01/2012

09:54 AM Subtask #2614 (Closed): Single Paxos instance shared across the existing services
Joao Eduardo Luis
09:53 AM Subtask #2745 (Closed): mon: Single-Paxos: Sync: Add new message support to the Monitor class
Joao Eduardo Luis
09:52 AM Subtask #2746 (Rejected): mon: Single-Paxos: Sync: Test message passing
We tested the message passing using a ill-built bash script based on grepping logs on both sides.
Further testing ...
Joao Eduardo Luis
09:51 AM Subtask #3069 (Rejected): mon: Single-Paxos: messaging: log MMonSync messages for offline matching
Keep a log of sent/received MMonSync messages, obtainable through the admin socket, for offline matching.
Completi...
Joao Eduardo Luis
09:44 AM Subtask #2737 (Resolved): mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Joao Eduardo Luis
09:43 AM Subtask #2757 (Resolved): mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transac...
We now synchronize the stores of two different monitors, being able to checksum the transferred contents on both side... Joao Eduardo Luis

08/31/2012

04:53 PM Feature #2943 (Resolved): mon: norecovery and/or nobackfill
Sage Weil
04:05 PM Bug #3068 (Resolved): rados: Can't pipe to the rados command
commit:fb379174bfa32ea5123a451fc5ddc30ecf15d2f1
Dan Mick
02:11 PM Bug #3068: rados: Can't pipe to the rados command
It appears this was broken by the reorganization in some past delta; it shouldn't
be difficult to fix.
Dan Mick
12:42 PM Bug #3068 (Resolved): rados: Can't pipe to the rados command
while "rados -p poolname put objectname filename" works, "cat filename | rados -p poolname put objectname -" doesn't.... Stefan Kleijkers
01:36 PM Feature #3044 (Fix Under Review): osd: include fail stamp in failure messages
Sage Weil
01:36 PM Feature #3043 (Fix Under Review): mon: track osd laggy rate/interval
Sage Weil
01:19 PM Linux kernel client Cleanup #2432 (Resolved): ceph-client: messenger: refactor to simplify state model
Sage Weil
01:17 PM Linux kernel client Bug #3063 (Resolved): rbd: snapshots must be added to end of list
Sage Weil
12:46 PM Bug #3059 (Resolved): osd:FAILED assert(waiting_for_ondisk.empty())
by commit:5f36b8d78416b7a1d1bbefecddfcee00b7bfcfa3 Sage Weil
11:11 AM rbd Feature #2952 (In Progress): librbd: use generic rados locking class
Josh Durgin
09:43 AM rgw Feature #1712 (Fix Under Review): rgw: support swift manifest objects
Yehuda Sadeh
08:23 AM Bug #3067: mon: runaway memory
If you can't ssh you probably need to power cycle the machine and restart the daemons. It sounds like there is some ... Sage Weil
08:20 AM Bug #3067 (Resolved): mon: runaway memory
... Sage Weil
08:21 AM Bug #2026: osd: ceph::HeartbeatMap::check_touch_file
This is unrelated to the heartbeta problem; opened new bug #3067! Sage Weil

08/30/2012

10:23 PM rgw Bug #2915 (Fix Under Review): rgw: copy of large object times out
Yehuda Sadeh
10:20 PM rgw Feature #1712: rgw: support swift manifest objects
Yehuda Sadeh
09:32 PM Bug #2026: osd: ceph::HeartbeatMap::check_touch_file
Ceph cluster is totally unreachable at this point, the only interesting in the log (on one of the machines, not the o... Xiaopong Tran
08:58 PM Bug #2026: osd: ceph::HeartbeatMap::check_touch_file
Some more information:
OS: Debian Wheezy 3.2.0-3-amd64
Ceph: ceph version 0.48argonaut (commit:c2b20ca74249892c8e...
Xiaopong Tran
08:36 PM Linux kernel client Bug #3040 (Resolved): btrfs: recursive locking of sb_internal#2
Sage Weil
08:36 PM Linux kernel client Bug #3031 (Resolved): btrfs: lock returned to userspace
Sage Weil
08:36 PM Bug #2842 (Won't Fix): mon: health detail lists pgs multiple times
Sage Weil
05:00 PM rbd Feature #2720 (Resolved): rbd: add children command
Added in commit:465fcca1febb62d8871d482b5bc079aa21704454 Josh Durgin
05:00 PM rbd Feature #2719 (Resolved): librbd: provide functions for listing parents and their children
Added in commit:78f69752a44f0f754ebdba2c06399c493f2be99f Josh Durgin
03:45 PM Bug #3059: osd:FAILED assert(waiting_for_ondisk.empty())
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-30_00:00:06-regression-next-testing-basic/11576 Tamilarasi muthamizhan
03:44 PM rgw Bug #2527: RGW may return 409 Conflict when deleting a bucket
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-30_00:00:06-regression-next-testing-basic/11588 Tamilarasi muthamizhan
03:40 PM rgw Bug #2527: RGW may return 409 Conflict when deleting a bucket
Logs:ubuntu@teuthology:/a/teuthology-2012-08-29_19:00:06-regression-master-testing-gcov/11434 Tamilarasi muthamizhan
03:38 PM Linux kernel client Bug #3058: xfs: possible circular locking dependency detected
Logs: ubuntu@teuthology:/a/teuthology-2012-08-29_19:00:06-regression-master-testing-gcov/11363 Tamilarasi muthamizhan
01:59 PM Bug #3066 (Resolved): impermissible OSD ops get dropped on the floor
Right now, an OSD operation that has insufficient permissions gets dropped without an EPERM error being sent back. I ... Greg Farnum
07:32 AM Bug #3065 (Won't Fix): don't always default to "client.admin"; perhaps client.$USER or something?
It would be desirable to have a read-only, status-view-only key that is able to run trivial operations like "ceph -s"... Anonymous
06:19 AM rbd Feature #3064 (Resolved): librbd: A way to read from nearby replicas
In the style of CEPH_OSD_FLAG_LOCALIZE_READS, there are cases when an rbd image is stored in a pool where some of the... Anonymous
12:44 AM Documentation #2970 (Resolved): doc: expand/complete osd settings reference
This has been largely resolved. However, there are still a few fields that could use some attention. Updates will occ... John Wilkins
12:42 AM rgw Documentation #2972 (Resolved): doc: expand/complete rgw settings reference
This was reviewed with Yehuda's help, and has been staged. John Wilkins
12:41 AM Documentation #3062 (In Progress): doc: osd tuning config options
Added filestore section to configuration. Should be reviewed. John Wilkins

08/29/2012

10:05 PM Linux kernel client Bug #3063 (Resolved): rbd: snapshots must be added to end of list
I just found and fixed this bug and I'm going to try to make a
practice of at least documenting the fixes to actual ...
Alex Elder
06:36 PM Documentation #3062 (Resolved): doc: osd tuning config options
Copy-paste from recent ceph-devel email from Sam:
filestore [min|max] sync interval:
Periodically, the filestor...
Anonymous
01:15 PM Bug #3061 (Resolved): osd crash during shutdown
Logs: ubuntu@teuthology:/a/teuthology-2012-08-28_19:00:05-regression-master-testing-gcov/10931... Tamilarasi muthamizhan
01:05 PM rgw Bug #2366: rgw: bucket index update rely on pg state
I think a simple solution for that would be attaching the pool id to the version number. Then, assuming that differen... Yehuda Sadeh
12:46 PM rgw Bug #2935 (Resolved): rgw: radosgw-admin bucket link clobbers index
Fix merged at commit:e3834f023be6d897995a2a7f6829f442be0795ff. Yehuda Sadeh
12:34 PM Bug #2761: osd: failed to recover before timeout expired
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-28_19:00:05-regression-master-testing-gcov/10872 Tamilarasi muthamizhan
12:32 PM Bug #3059: osd:FAILED assert(waiting_for_ondisk.empty())
Logs: ubuntu@teuthology:/a/teuthology-2012-08-28_19:00:05-regression-master-testing-gcov/10868 Tamilarasi muthamizhan
11:25 AM rgw Feature #3060 (Resolved): rgw: different usage info break down
We should be able to provide usage break down by the requests types, e.g. put_obj, get_obj, etc. Yehuda Sadeh

08/28/2012

05:49 PM Bug #3042: monitor hangs when osds are shut down
ubuntu@teuthology:/a/teuthology-2012-08-28_00:00:10-regression-next-testing-basic/10259 Tamilarasi muthamizhan
05:41 PM Bug #3059: osd:FAILED assert(waiting_for_ondisk.empty())
Logs: ubuntu@teuthology:/a/teuthology-2012-08-28_00:00:10-regression-next-testing-basic/10315... Tamilarasi muthamizhan
05:29 PM Bug #3059 (Resolved): osd:FAILED assert(waiting_for_ondisk.empty())
Logs: ubuntu@teuthology: /a/teuthology-2012-08-28_00:00:10-regression-next-testing-basic/10302... Tamilarasi muthamizhan
05:17 PM Linux kernel client Bug #3058 (Won't Fix): xfs: possible circular locking dependency detected
Logs: ubuntu@teuthology:/a/teuthology-2012-08-28_02:00:05-regression-testing-testing-basic/10406... Tamilarasi muthamizhan
04:45 PM rgw Bug #3057 (Resolved): rgw: usage show show duplicate info when there's a large number of buckets
Fixed, commit:82c62bd977c74c22385b18791943cb2054920f47 Yehuda Sadeh
03:46 PM rgw Bug #3057 (Resolved): rgw: usage show show duplicate info when there's a large number of buckets
Apparently we don't reset the aggregated usage before each iteration. Yehuda Sadeh
04:30 PM rbd Bug #2937: btrfs filesystem on rbd device kernel BUG writing large file
This smells a bit like it's related to this discussion:
https://patchwork.kernel.org/patch/1271871/
I terminated ...
Alex Elder
03:17 PM rgw Bug #2916 (Resolved): radosgw does not check command line options for correctness
The rados tool alone is affected by this issue and due to the nature of valid input to the command this will not be f... caleb miles
10:01 AM rgw Bug #2916: radosgw does not check command line options for correctness
caleb miles
01:41 PM rgw Feature #2797 (Resolved): rgw: support multi-objects delete
Fixed, merged at commit:0ab623250344705f55d0c249ad977f43bed8fd54. Yehuda Sadeh
01:30 PM Bug #3048 (Resolved): rados bench: use after free?
fixed in 2d79a62a11ee4c18cbc8619da7921a0707ec401c Anonymous
12:51 PM rgw Bug #3056 (Resolved): rgw: usage show doesn't show anything
Fixed, commit:5507665a58d909c7c659b233d472c3d3a868130a. Yehuda Sadeh
12:48 PM rgw Bug #3056 (Resolved): rgw: usage show doesn't show anything
Problem following cls rgw cleanup work. Affects current master only.
Yehuda Sadeh
08:33 AM Bug #3055: ceph command stuck until all predefined monitors are up
Holger Macht wrote:
> One thing that still bothers me is how the correct way of doing this looks like. '/sbin/mkceph...
Sage Weil
08:29 AM Bug #3055: ceph command stuck until all predefined monitors are up
One thing that still bothers me is how the correct way of doing this looks like. '/sbin/mkcephfs -d /tmp/mon-init -c ... Holger Macht
07:45 AM Bug #3055: ceph command stuck until all predefined monitors are up
Thanks for the clarification, that's all I wanted to know. Holger Macht
07:29 AM Bug #3055 (Won't Fix): ceph command stuck until all predefined monitors are up
By putting all three monitors in the monmap, you've declared them to all be part of the monitor group, and you'll nee... Greg Farnum
07:05 AM Bug #3055 (Won't Fix): ceph command stuck until all predefined monitors are up
I'm using ceph-0.48.1 and encountered a problem from which I'm not sure if it's a bug or a wrong setup sequence. So p... Holger Macht
07:45 AM Bug #2026: osd: ceph::HeartbeatMap::check_touch_file
It would be great if we could keep this issue going, and if there is anything I could provide, I'd be happy to do so.... Xiaopong Tran
06:39 AM RADOS Documentation #3054 (Closed): doc: omap, tmap, xattrs
Just about everything in this thread belongs in docs, though some of it should go under dev/
http://thread.gmane.o...
Anonymous
 

Also available in: Atom