Project

General

Profile

Activity

From 06/18/2012 to 07/17/2012

07/17/2012

09:42 PM Bug #2067 (Resolved): librados: we leak CephContext from rados_create()
Sage Weil
09:42 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
Sage Weil
07:21 PM CephFS Bug #1737 (Resolved): ceph-fuse crash in xlist::remove
commit:d78235be1b5c20440f3a55abb3f4b980aa3f3596 Sage Weil
07:11 PM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
Sage Weil
08:57 AM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
this is 100% reproducible on my notebook, but not on my other boxes. debugging it there. :(
Sage Weil
07:11 PM Bug #2796: osd: watch state not reestablished when registration op resent
Sage Weil
12:45 PM Bug #2796 (Fix Under Review): osd: watch state not reestablished when registration op resent
Sage Weil
09:04 AM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
if the client doesn't get the watch ack and resends, the osd will ignore it as a dup op, and the watch session state ... Sage Weil
04:23 PM Bug #2798 (Resolved): handle_osd_ping assert
ceph version 0.48argonaut-404-gabe05a3 (commit:abe05a3fbbb120d8d354623258d9104584db66f7)
1: (OSDMap::get_cluster_in...
Samuel Just
02:36 PM devops Feature #2782: chef: easy scripts for running in vercoi vms
Anonymous
02:20 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Done as of barclamp-ceph.git commit 36842bd3e3db9fea65e2921bb7b57e9088964621. Anonymous
02:03 PM rgw Feature #2797 (Resolved): rgw: support multi-objects delete
Support a multi-objects delete S3 api call. Yehuda Sadeh
10:39 AM Subtask #2758 (Fix Under Review): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyVal...
From Task #2756:
> * We changed KeyValueDB:
> > * Added a Whole-space iterator interface
> > * Implemented the p...
Joao Eduardo Luis
10:32 AM Subtask #2756 (Fix Under Review): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
10:32 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Just a couple of notes on this regard:
* We changed KeyValueDB:
> * Added a Whole-space iterator interface
> * I...
Joao Eduardo Luis
08:56 AM RADOS Subtask #2793 (Resolved): osd: require tunable feature if current osdmap uses non-default tunables
Sage Weil
08:56 AM RADOS Subtask #2792 (Resolved): mon: require tunable feature bit if current osdmap uses non-default tun...
Sage Weil
08:52 AM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
08:48 AM Tasks #2789: pkgconfig files for libraries
There is a librados-config in librados-dev. Does that mean we just need a librbd-config too? Sage Weil
06:12 AM Tasks #2789 (New): pkgconfig files for libraries
From the libvirt mailinglist I got the message that they'd like to see pkg-config for at least librbd and librados.
...
Wido den Hollander
08:29 AM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
... Sage Weil

07/16/2012

10:45 PM rbd Feature #2788 (Resolved): rbd: add -l to rbd ls, show snapshots
Add -l to rbd ls to show snapshots, perhaps size of parent Dan Mick
09:58 PM rbd Feature #2717: cls_rbd: add methods for maintaining mapping from parent to child images
Dan Mick
09:58 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
Dan Mick
09:57 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
From the design doc:
In addition to knowing which parent a given image has, we want to be able to tell if a protec...
Dan Mick
06:34 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Currently have fsid generated via uuidgen, monitor is generated via the python script which is residing in barclamp-c... Tyler Brekke
06:28 PM rbd Feature #2561 (Fix Under Review): rbd: copyup command
Implemented at the moment as "flatten". Dan Mick
06:26 PM rbd Feature #2561 (In Progress): rbd: copyup command
Dan Mick
06:27 PM rbd Feature #2559: cls_rbd: copyup method
Dan Mick
06:26 PM rbd Subtask #2604 (In Progress): librbd layering: read path
Josh Durgin
06:26 PM rbd Subtask #2605 (In Progress): librbd layering: guard writes
Josh Durgin
04:07 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
Sage Weil
03:04 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Samuel Just
03:04 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Probably fixed in c7fb964c077d369943bd5c066c5f99da6bd5f37c.
Backported to stable: ec5cd6def9817039704b6cc010f2797a70...
Samuel Just
11:17 AM Bug #2761: osd: failed to recover before timeout expired
Logs: ubuntu@teuthology: /a/teuthology-2012-07-14_00:00:03-regression-next-testing-basic/11248
ubuntu@teuthology:/...
Tamilarasi muthamizhan
11:01 AM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
Samuel Just
11:01 AM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
11:00 AM Bug #2787 (Resolved): RBD test copy fails when thrashing
Logs: ubuntu@teuthology:/a/teuthology-2012-07-15_00:00:04-regression-next-testing-basic/11629
Attached below an ex...
Tamilarasi muthamizhan
10:55 AM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Samuel Just
10:48 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
Sage Weil
10:47 AM Bug #2022 (Resolved): osd: misdirectect request
YAY Sage Weil
10:47 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Sage Weil

07/15/2012

09:40 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Sage Weil
09:40 PM Bug #2593 (Resolved): logmonitor: decode failure
aie. commit:fe57681892b0bf3155ddeaeaa79ddbf997e27af9 Sage Weil
08:43 PM Feature #2330 (Resolved): dump open files, sockets when we run out of fds
commit:bf9a85ade6f7b06ed28ff7930b9e33a41acf0e34 Sage Weil

07/14/2012

05:39 PM Bug #2022 (Fix Under Review): osd: misdirectect request
FINALLY tracked this one down. tests consistently now pass. bugfix-2022 Sage Weil
05:29 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
Need to be able to remove objects/buckets. Bucket removal can optionally remove all objects. Another option is to set... Yehuda Sadeh

07/13/2012

04:58 PM Bug #2748: include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
https://github.com/athanatos/ceph/commits/wip_biginfo Samuel Just
04:57 PM Subtask #825 (Resolved): osd: remove pg map updating from handle_osd_map
Samuel Just
04:57 PM Bug #2461 (Resolved): DBObjectMap is incompatible with collection_rename
Samuel Just
03:57 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
for the record, this is being done in this sprint for two reasons:
(a) we are under some pressure to give up the old...
Anonymous
09:39 AM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
A few possible kinks:
* The fabfile currently pulls the magic ssh keys for rsync from cephbooter. we need a new ...
Sage Weil
09:34 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
12:40 PM devops Feature #2785 (In Progress): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:40 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:38 PM devops Feature #2662 (In Progress): crowbar: Make barclamp-ceph set mon initial members, monitor-secret,...
Anonymous
12:37 PM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Tyler says the google doc is good to go. Anonymous
12:32 PM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
11:08 AM Bug #2784 (Resolved): osd hit suicide timeout
Log: ubuntu@teuthology:/a/teuthology-2012-07-12_19:00:15-regression-master-testing-gcov/10615
ubuntu@teuthology:/a...
Tamilarasi muthamizhan
10:22 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
5924f8e4a8c29e6de326a9e8576c30109cdc0e07 Samuel Just
09:50 AM devops Bug #2783 (Won't Fix): Chef-client fails on oneiric cluster
While verifying the web docs for Chef deployment of cepf, chef-client fails with the following error on an oneiric cl... Ken Franklin
09:45 AM Bug #2779 (Fix Under Review): mon: [near]full status doesn't get purged when osds are removed
tag! Sage Weil
07:12 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Date: Fri, 13 Jul 2012 12:17:47 +0400
From: Andrey Korolyov <andrey@xdel.ru>
To: ceph-devel@vger.kernel.org
Subjec...
Sage Weil
09:43 AM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Anonymous
09:42 AM devops Feature #2781 (Closed): crowbar: easy scripts for running in vercoi vms
Anonymous
09:33 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Works well enough for first iteration. Anonymous
09:32 AM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:12 AM rbd Bug #2777: qemu: report discard support
It works for me, I test ext4 with discard and ext4 with fstrim. Martin Mailand

07/12/2012

05:25 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
wip_2769 should fix it. Samuel Just
04:55 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
bcfa573f5f615f3403ff71da0212cd1cee7e7d9c
The core provided pretty good support for this sequence of events, markin...
Samuel Just
04:40 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Sage Weil
02:31 PM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
*next is enough to figure out whether we ended on a boundary. Samuel Just
02:07 PM Feature #2680 (Resolved): osd: report backfill progress via query
5450567a6769b64ace385140ffb645d9d805e354 Samuel Just
12:22 PM rbd Bug #2777: qemu: report discard support
It looks like this is handled by what qemu calls a 'device property', so you have to use a command line like:
<pre...
Josh Durgin
08:14 AM rbd Bug #2777: qemu: report discard support
Yeah, QUEUE_FLAG_DISCARD is as far as I looked yesterday too. The question now is how to set that within qemu. Josh Durgin
11:23 AM CephFS Bug #2778 (Can't reproduce): uclient: failed asserts if mounting with a non-existent directory
From Noah.
If Ceph is mounted with a non-existent directory, the mount call returns a success, but on unmount does t...
Greg Farnum
11:23 AM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-12_05:00:04-regression-stable-master-basic/9637
ubuntu@teutho...
Tamilarasi muthamizhan
09:58 AM RADOS Feature #2101 (Resolved): crushtool: check for weight overflow on reweight
Sage Weil

07/11/2012

11:18 PM rbd Bug #2777: qemu: report discard support
I don't know what I'm talking about, but stumbling around in the kernel seems to point at QUEUE_FLAG_DISCARD as the f... Dan Mick
02:19 PM rbd Bug #2777 (Resolved): qemu: report discard support
todin reported problems enabling discard inside qemu.
I'm not sure if this is rbd-specific, but there seem to be s...
Josh Durgin
11:14 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9430
ubuntu@teutho...
Tamilarasi muthamizhan
09:01 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
... Sage Weil
11:10 PM rbd Bug #2715: krbd: spinlock wrong CPU
latest logs:ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9371
12-07-11T19:36...
Tamilarasi muthamizhan
09:10 PM Bug #2772 (Resolved): rados bench -t broken
Sage Weil
10:16 AM Bug #2772: rados bench -t broken
we can't change bench flags.
just don't make a short one-char flag for the target... use --target or whatever instead
Sage Weil
10:12 AM Bug #2772: rados bench -t broken
FWIW, I hacked up a patch last night (changed -t to -n to avoid conflicts, and made some changes to the usage message... Dan Mick
02:23 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
Sage Weil
01:54 PM Bug #2387 (Duplicate): mon: could not get service secret for auth subsystem
Sage Weil
11:31 AM Feature #2776 (Resolved): rados tool: bulk removal of objects
Provide an option to remove multiple objects in a single command. Yehuda Sadeh
10:08 AM Bug #2771: mon|osd: fail to create pgs on startup
i think this is related:... Sage Weil
09:19 AM Feature #2775 (Rejected): rest-bench man page
Sage Weil
09:19 AM devops Documentation #2774 (Resolved): doc: ceph-disk man page
commit:a450cab2b8148cb8a9b043d629feccf89e5aabac Sage Weil
09:11 AM rbd Subtask #2603 (Fix Under Review): librbd layering: open parent on open
In branch wip-rbd-parent Josh Durgin
08:16 AM Feature #2666 (Resolved): rados tool: copy pool
Sage Weil
04:35 AM rbd Feature #2773 (Resolved): libvirt: incorrect snapshot behavior
Since consistent(disk and memory state) snapshots currently supported only with qcow images, libvirt should:
- all...
Andrey Korolyov

07/10/2012

10:13 PM Bug #2772: rados bench -t broken
actually the -t param is for 'rados cp' which is for an object, not for cppool. Yehuda Sadeh
10:02 PM Bug #2772: rados bench -t broken
let's just make it --target-pool or similar. this is an uncommon enough command that it doesn't really need a single... Sage Weil
10:00 PM Bug #2772 (Resolved): rados bench -t broken
The -t parameter has being assigned to rados cppool (target pool), but apparently it was already being used for rados... Yehuda Sadeh
09:56 PM Feature #2714 (In Progress): teuthology: store monitor data in archives
Sage Weil
09:25 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
... Sage Weil
07:22 PM rbd Feature #2770 (Resolved): krbd: define tasks to add osd_client compound class op support
Sage Weil
04:38 PM Bug #2387 (Can't reproduce): mon: could not get service secret for auth subsystem
Sage Weil
04:26 PM Bug #2600 (Resolved): osd: crazy long watch timeout?
Sage Weil
09:28 AM Bug #2600 (In Progress): osd: crazy long watch timeout?
Sage Weil
02:52 PM Bug #2765: Extremely Long Object Latencies
I noticed one 15 second lag on an OSD caused by the filestore requiring 15 seconds to drain open ops before a sync. Samuel Just
02:52 PM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
Merged in commit:96ca508dcd227db11597cd4963208872af30fcc4 Josh Durgin
02:24 PM rbd Feature #2585 (Resolved): rbd: clone command
commit:83339a0cbb925022c44507ab0ea3da7b399f511e
Dan Mick
11:06 AM Bug #2767 (Resolved): rbd workunit failing
The real problem is that the workunit task only grabs things from master, since github only seems to create tarballs ... Josh Durgin
09:24 AM Bug #2767 (Resolved): rbd workunit failing
... Sage Weil
10:47 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
Log location: ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8335
ubuntu@teuth...
Tamilarasi muthamizhan
10:33 AM Feature #2768 (Resolved): teuthology: make workunit task work on different branch/sha1 etc
This involves making it download scripts individually instead of from a single tarball, since github only creates tar... Josh Durgin
10:14 AM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8328 Sage Weil
10:13 AM Bug #2453 (In Progress): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
ubuntu@teuthology:/a/teuthology-2012-07-10_05:00:02-regression-stable-master-basic/8546 Sage Weil
09:06 AM Subtask #2757: mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transactions
Pending testing Joao Eduardo Luis

07/09/2012

09:59 PM Bug #2067: librados: we leak CephContext from rados_create()
librados does not destroy the CephContext created in rados_create() in rados_destroy() (or the c++ equivalents). Thi... Sage Weil
09:59 PM Bug #2766 (Resolved): cephcontext: needs refcount
CephContext needs to be refcounted, so that it can be properly shared. Sage Weil
09:37 PM Bug #2600: osd: crazy long watch timeout?
Oh.... Sage Weil
06:41 PM Bug #2765 (Closed): Extremely Long Object Latencies
Saw this during testing on one of the aging clusters.
example object tid: 312322
Full logs available here:
...
Mark Nelson
06:03 PM Bug #2508 (Resolved): osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobje...
645ac87b84ae348483c7d52b1968af5f44378dea Samuel Just
06:02 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
67a5e75ed6a9b3689a984d6f7eb6a18e1242125e Samuel Just
04:12 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
Remove the loud warnings? Like, the mismatch warnings? Greg Farnum
04:08 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
Ok, so this is actually caused by not updating the backfill peer's partial stats while recovering the backfill portio... Samuel Just
03:10 PM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
Merged as 90c42837696d252be40348fdd4499e15aa835aeb. Anonymous
01:56 PM Linux kernel client Bug #2764 (Closed): xfstest hang; osd socket closed messages
... Sage Weil
01:51 PM Cleanup #2763 (Resolved): move rbd locking infrastructure to a separate objclass
locks are generics, can serve other uses (e.g., rgw). We should separate them from rbd code. Yehuda Sadeh
12:52 PM Bug #2762 (Resolved): mon crash ceph::__ceph_assert_fail (assertion=0x63d150 "begin->last_committ...
#0 0x00007fb9b32b5b7b in raise (sig=<optimized out>) at ../nptl/sysdeps/unix/sysv/linux/pt-raise.c:42
#1 0x0000000...
Samuel Just
12:07 PM Bug #2761 (Resolved): osd: failed to recover before timeout expired
Logs: ubuntu@teuthology:/a/teuthology-2012-07-09_05:00:08-regression-stable-master-basic/8039
Attaching below an e...
Tamilarasi muthamizhan
11:54 AM Bug #2022: osd: misdirectect request
Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
Attaching the...
Tamilarasi muthamizhan
11:50 AM Bug #2022: osd: misdirectect request
Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7824
Attaching the...
Tamilarasi muthamizhan
11:45 AM Bug #2022: osd: misdirectect request
latest log: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
and
ubuntu@t...
Tamilarasi muthamizhan
11:33 AM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
... Anonymous
11:21 AM Linux kernel client Bug #2759 (Resolved): libceph: crush tree algorithm is not understood
After changing the crush algorithm for a bucket to tree, the kernel client cannot parse the OSDMap:... Josh Durgin
10:31 AM RADOS Feature #2101: crushtool: check for weight overflow on reweight
caleb miles
09:38 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
Extend the in-memory mock-up of KeyValueDB to support the safe iterator, and create an iterator to iterate over all t... Joao Eduardo Luis
09:16 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
09:05 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
The current iterator implementation iterates directly over the database state, even if there are concurrent updates.
...
Joao Eduardo Luis
09:15 AM Subtask #2757 (Resolved): mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transac...
Using the LevelDBStore's safe iterator, obtain the existing key/values and pack them into transactions.
These tran...
Joao Eduardo Luis
08:57 AM Feature #2755 (Resolved): ceph-conftool: optionally return the default for a config option if no ...
Presently, the only way to figure out what the default for a specific configuration option is, is to consult the sour... Florian Haas

07/07/2012

06:19 AM Linux kernel client Bug #2754 (Resolved): cephfs show_location produces kernel "divide error: 0000 [#1]" when run aga...
Originally reported in http://marc.info/?l=ceph-devel&m=134151028212170&w=2:
Really easy to reproduce on my 3.2.0 ...
Florian Haas
06:16 AM CephFS Bug #2753 (Resolved): Writes to mounted Ceph FS fail silently if client has no write capability o...
Originally reported in http://marc.info/?l=ceph-devel&m=134151023912148&w=2:
How to reproduce (this is on a 3.2.0 ...
Florian Haas
06:12 AM Bug #2752: Setting large maxosd kills all mons
Original ML response from Greg (http://marc.info/?l=ceph-devel&m=134151077312444&w=2):
Ungh. Can you file a bug repo...
Florian Haas
06:10 AM Bug #2752 (Resolved): Setting large maxosd kills all mons
Originally reported in http://marc.info/?l=ceph-devel&m=134151018912133&w=2:
When setting a large maxosd, all mons...
Florian Haas

07/06/2012

08:03 PM Feature #2743 (Resolved): new cond assertions, associated cleanups
Sage Weil
09:41 AM Feature #2743 (Resolved): new cond assertions, associated cleanups
Sage Weil
05:05 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
Sage Weil
03:23 PM Feature #2702: gitbuilder: sync each build as it completes
Sage Weil
09:43 AM Feature #2702 (In Progress): gitbuilder: sync each build as it completes
Sage Weil
05:02 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
Yehuda Sadeh
04:35 PM rgw Feature #2735: rgw: support response-* request params in GET obj
Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a. Yehuda Sadeh
04:48 PM Feature #2680: osd: report backfill progress via query
not yet tested https://github.com/athanatos/ceph/tree/wip_recovery_query Samuel Just
04:45 PM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
in progress already https://github.com/athanatos/ceph/tree/wip_biginfo
blocked by 2748
Samuel Just
04:35 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
lavor: basic
owner: scheduled_sage@metropolis
success: false
ubuntu@teuthology:/a/sage-2012-07-06_11:03:15-regress...
Samuel Just
04:35 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a. Yehuda Sadeh
10:58 AM rgw Bug #2734: rgw: fails to authenticate preauthenticated requests with sub resource
Fails only with specific sub resources (e.g., response-*) that were not hard coded. Yehuda Sadeh
04:34 PM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
roles:
- - mon.a
- osd.0
- osd.1
- osd.2
- - mds.a
- osd.3
- osd.4
- osd.5
- - client.0
targets...
Samuel Just
04:32 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Saw on master
osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ceph version 0.48argona...
Samuel Just
04:23 PM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
03:51 PM Bug #2600: osd: crazy long watch timeout?
another instance:... Sage Weil
03:47 PM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
... Sage Weil
01:23 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
Fixed, commit:3df51040b19b9c110e08f42befa0cae18cbb984c. Yehuda Sadeh
11:34 AM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
Thanks, updated the doc to increase the VM size. Tyler Brekke
10:24 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Currently, Paxos own trimming is going through the Paxos proposal mechanism itself.
However, services do not do it...
Joao Eduardo Luis
09:29 AM Subtask #2737 (Resolved): mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
This includes Paxos' own trimming, so it done at the same time in all the quorum members. Furthermore, it allows the ... Joao Eduardo Luis
10:18 AM Subtask #2738: mon: Single-Paxos: Sync: Add snapshot support to the monitor store
Implemented, although it has not been tested yet. Joao Eduardo Luis
09:31 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
Having snapshot support, and being able to iterate over it instead of iterating directly over the store's keys, allow... Joao Eduardo Luis
10:09 AM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
latest logs:
/a/teuthology-2012-07-06_00:00:03-regression-next-testing-basic/6655
Tamilarasi muthamizhan
10:01 AM Subtask #2746 (Rejected): mon: Single-Paxos: Sync: Test message passing
Make sure the message passing is working before starting to work on the store synchronization.
From the monitor's ...
Joao Eduardo Luis
09:58 AM Subtask #2745 (Closed): mon: Single-Paxos: Sync: Add new message support to the Monitor class
Three different "roles" on a monitor cluster regarding synchronization:
* Leader - responsible for disabling the P...
Joao Eduardo Luis
09:53 AM Subtask #2744 (Resolved): mon: Single-Paxos: Sync: Create new Message type
Create a new message type MMonSync, to dissociate it from the MMonProbe messages, and to allow us to use fields speci... Joao Eduardo Luis
09:43 AM RADOS Feature #2706 (In Progress): crush: update kernel code to decode tunables
Sage Weil
09:40 AM Feature #2742 (Resolved): qa: ms socket inject failures in regression suite
Sage Weil
09:40 AM Subtask #2741 (Resolved): mon: Single-Paxos: Sync: Assess requirements for QA tests
TBD Joao Eduardo Luis
09:37 AM CephFS Bug #2740 (Resolved): mds: crash in Objecter when shutting down too early
Jimmy Tang reported this on the mailing list.... Greg Farnum
09:36 AM Subtask #2739 (Resolved): mon: Single-Paxos: Sync: Synchronize the store of a drifted monitor
Synchronize the store of a monitor (mon.X) that drifted away from the system (say, it failed for a considerable amoun... Joao Eduardo Luis
09:22 AM Subtask #2736 (Resolved): mon: Single-Paxos: Sync: Implement message passing
See subtasks. Joao Eduardo Luis
01:37 AM Bug #2267: Ceph client crashed after shutting down one mds and osd
To which version do you plan to add this patch? Maciej Galkiewicz

07/05/2012

10:30 PM Feature #2692 (Resolved): stable testing debian repos
Sage Weil
10:16 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
bad con->private = NULL in monc __close_session Sage Weil
09:38 PM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
easy to reproduce with... Sage Weil
08:44 AM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
I just reproduced this several times on master using the socket failure injection. This is probably socket errors br... Sage Weil
09:37 PM Bug #2022: osd: misdirectect request
ugh, wrong bug.. ignore last two messages! Sage Weil
05:19 PM Bug #2022: osd: misdirectect request
this is easily reproduced with... Sage Weil
05:18 PM Bug #2022 (In Progress): osd: misdirectect request
... Sage Weil
06:18 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
The size of a VM is adjustable:... Greg Farnum
05:49 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
Tested on vercoi/downburst
https://docs.google.com/a/inktank.com/document/d/1C1QLD54sbpRJ7o6M9ML2TrexXC5JIMecHJxOM...
Tyler Brekke
05:21 PM Cleanup #2041 (Resolved): osd: move peering into worker threads
Sage Weil
05:08 PM Linux kernel client Bug #2523 (Resolved): xfs: xfs_iolock_reclaimable
patches from hch fix this up. added them to testing branch. Sage Weil
04:18 PM rbd Bug #2488 (Can't reproduce): rbd: EIO from rbd map
Sage Weil
04:18 PM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
Sage Weil
04:18 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Sage Weil
04:13 PM Messengers Bug #2569 (Resolved): msgr: connect_rank crash
Sage Weil
04:12 PM devops Feature #2698: crowbar: Guide for using "front" network
I verified that this causes no trouble before crowbar is installed and that eth1 does come up automatically after reb... JuanJose Galvez
03:11 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
Yehuda Sadeh
03:10 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
Yehuda Sadeh
02:11 PM Feature #2733 (Resolved): osd: fadvise replica data don't use
This should help quite a bit with cache pressure. Samuel Just
01:56 PM Feature #2485 (Resolved): crush: validate hash function
Sage Weil
01:29 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
We can also keep data about the last N slow requests on the OSD to allow easier debugging. Samuel Just
01:16 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
syntax should be something along the lines of:
ceph tell osd <num> query_ops
Result should be a json formatted ...
Samuel Just
01:20 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
The apt-get update command I added said "apt-key update", which isn't the same at all. Fixed in
commit:6d351b0b2...
Dan Mick
01:18 PM Feature #2732 (Resolved): Idempotency tester
Add proper support for:
* omap
* xattr
* collection_rename
Samuel Just
01:17 PM Cleanup #2731 (Closed): ceph tell osd <num> should be ceph osd <subcommand> to match ceph pg <sub...
Samuel Just
11:41 AM Feature #2727: filestore: add split
On second thought, that would be hard to encode into a transaction, it'll probably just take a
[(hash_prefix, bits...
Samuel Just
11:06 AM Feature #2727 (Resolved): filestore: add split
Add split functionality to filestore. The existing collection structure should be already in place.
Interface coul...
Samuel Just
11:20 AM Documentation #2729 (Resolved): doc: OSD internals
Broad overview of OSD internals:
* Threading/wq model
* Message handling
* Message ordering (wait for map, pg wait...
Samuel Just
11:18 AM Feature #2728 (Resolved): OSD: handle split
OSD message handling needs to handle messages for pgs between the map epoch creating them and the actual split (shoul... Samuel Just
11:17 AM rbd Documentation #2670 (In Progress): Docs shouldn't direct users to echo to /sys/bus/rbd for normal...
Sage Weil
11:06 AM Bug #2725 (Duplicate): filestore: add split
Samuel Just
11:04 AM Bug #2725 (Duplicate): filestore: add split
Add split functionality to filestore. The existing collection structure should be already in place.
Interface cou...
Samuel Just
11:05 AM rbd Feature #2609 (In Progress): librbd: new image name -> image head indirection
Sage Weil
11:05 AM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
Sage Weil
11:04 AM rbd Feature #2726 (Resolved): krbd: clean up bio_pair leak/whatever
Sage Weil
11:03 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
I have restarted ceph mon with invoke-rc.d ceph restart mon and now it works! However I am 100% sure that I did it af... Maciej Galkiewicz
10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
you can just restart, or to verify in the future, 'ceph --admin-daemon /var/run/ceph/ceph-mon.a.asok version' (adjust... Sage Weil
10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
Oh! The monitor is still running the old version. Sage Weil
10:57 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
... Maciej Galkiewicz
10:55 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
Weird! Two things:
- does 'ceph status' work? (this is identical to -s now)
- can you try with commit:b53cdb9...
Sage Weil
10:45 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
# ceph -v
ceph version 0.48argonaut (commit:c2b20ca74249892c8e5e40c12aa14446a2bf2030)
Maciej Galkiewicz
10:44 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
can you include 'ceph -v' output? this works for me...... Sage Weil
10:05 AM Bug #2721 (Resolved): Ceph status does not work in 0.48 even if it is still documented
... Maciej Galkiewicz
10:22 AM rbd Feature #2724 (Resolved): rbd: add protect/unprotect command
See http://ceph.com/docs/master/dev/rbd-layering/#protection Josh Durgin
10:12 AM rbd Feature #2723 (Resolved): librbd: protect/unprotect as appropiate during cloning
See http://ceph.com/docs/master/dev/rbd-layering/#protection Josh Durgin
10:10 AM rbd Feature #2722 (Resolved): cls_rbd: add class methods to get/set protected status
The possible settings are: protected, unprotected, and protecting. See http://ceph.com/docs/master/dev/rbd-layering/ ... Josh Durgin
10:02 AM rbd Feature #2720 (Resolved): rbd: add children command
This should list all direct children of a given snapshot. Josh Durgin
10:01 AM rbd Feature #2719 (Resolved): librbd: provide functions for listing parents and their children
Wrap the class methods for listing parents and children of parents. Josh Durgin
09:57 AM rbd Feature #2718 (Resolved): librbd: map parent -> child in a per-pool rbd_children object when cloning
Josh Durgin
09:55 AM rbd Feature #2717 (Resolved): cls_rbd: add methods for maintaining mapping from parent to child images
From http://ceph.com/docs/master/dev/rbd-layering/#cls-rbd:... Josh Durgin

07/04/2012

10:23 PM Linux kernel client Bug #2716: crash when cluster goes down and new one comes up
... Sage Weil
10:21 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
- vstart cluster
- mount uml
- do some stuff (dbench, control-c, sync)
- stop vstart cluster
- (waited a while)
...
Sage Weil
09:24 PM rbd Bug #2715 (Resolved): krbd: spinlock wrong CPU
... Sage Weil
03:34 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
Sage suggested we might want to have teuthology archive the monitor's data directory, since that information is often... Greg Farnum
02:52 PM Bug #2593: logmonitor: decode failure
nope.
i wonder if we should make teuthology archive the mon data directory... that info is often useful, even for ...
Sage Weil
02:49 PM Bug #2675 (Resolved): osd: segfault during log trim
Sage Weil
02:31 PM Bug #2712 (Resolved): osd: small write performance regression from 0.47.*
Several reports on the ML about small writes going slower than before, generating 'slow request' warnings where previ... Sage Weil
02:02 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
i saw this on at least one other node as well. why? (this manifests itself has a 100-continue s3test failure in qa.... Sage Weil
01:45 PM Linux kernel client Bug #2710 (Resolved): freeze from dbench in uml
Sage Weil
01:16 PM Linux kernel client Bug #2710: freeze from dbench in uml
this was a problem with 3df5c88d0d469c71c43f2379f6a375fc2d314eb7, fixed and re-pushed testing.
the fix f28dfe91f5f...
Sage Weil
11:32 AM Linux kernel client Bug #2710 (Resolved): freeze from dbench in uml
Current testing and about 3 seconds of dbench 2 locks up Sage Weil
08:19 AM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
07:22 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
... Sage Weil

07/03/2012

10:36 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Firstly, well done guys on achieving this version milestone. I successfully upgraded to the 0.48 format uneventfully ... Paul Pettigrew
04:45 PM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:44 PM RADOS Feature #2705 (Resolved): crush: graceful transition to new default tunables
Sage Weil
04:44 PM RADOS Bug #2214 (Resolved): crush: pgs only mapped to 2 devices with replication level 3
Sage Weil
04:44 PM RADOS Bug #2047 (Resolved): crush: with a rack->host->device hierarchy, several down devices are likely...
Sage Weil
04:43 PM RADOS Bug #187 (Rejected): crush: high variance, latency for straw buckets
Sage Weil
04:43 PM RADOS Feature #2422 (Resolved): crush: test that mapping result is uncorrelated
Sage Weil
04:39 PM rgw Bug #2106: failed s3tests.functional.test_s3.test_100_continue
recent logs from the nightly run: /a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5054
Tamilarasi muthamizhan
04:34 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
Tamilarasi muthamizhan wrote:
> latest logs:
> /a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5019...
Tamilarasi muthamizhan
04:33 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
latest logs:
/a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5019
config.yaml:
++++++++++++
k...
Tamilarasi muthamizhan
03:53 PM devops Feature #2704 (Closed): sepia: Use ``names`` as resolver on plana, burnupi, vercoi
Anonymous
03:45 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
Sage Weil
03:27 PM devops Feature #2549: ceph-disk-prepare: take fstype, mkfs and mount options from ceph.conf
As of commit ad97415ef72b55934adfa5024fd9af8fd1f0f82d, this now needs mount options too. Anonymous
03:26 PM devops Feature #2547 (Resolved): ceph-disk-prepare: handle partitioning and mkfs
commit ad97415ef72b55934adfa5024fd9af8fd1f0f82d
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-07-03 15:24:26...
Anonymous
02:24 PM rbd Bug #2457 (Resolved): libvirt: migration fails with rbd in 0.9.11 and 0.9.12
Fixed by upstream libvirt commit 78290b1641e95304c862062ee0aca95395c5926c. Josh Durgin
02:08 PM rbd Bug #2457: libvirt: migration fails with rbd in 0.9.11 and 0.9.12
Fixed in 0.9.12-3(debian naming) and later. Also recently in-list reports told the same, so issue may be closed safely. Andrey Korolyov
02:17 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
Yehuda Sadeh
02:15 PM rbd Bug #2700 (Resolved): blkdeviotune method at libvirt doesn`t work on RBD volumes
Since qemu implemented its own i/o limiting mechanism rather than cgroups, all block backends may be controlled over ... Andrey Korolyov
12:17 PM Messengers Bug #2569: msgr: connect_rank crash
i've merged fix for this into master, commit:204bc594be1a6046d1b362693d086b49294c2a27 (with possible side-effects fro... Sage Weil
12:16 PM Bug #2682 (Resolved): config lockdep error (recursive lock?) in LibRadosAio.SimpleWritePP
Sage Weil
10:48 AM devops Feature #2699 (Rejected): crowbar: change barclamp-glance to use rbd
Anonymous
10:38 AM devops Feature #2698: crowbar: Guide for using "front" network
We need an easy way to drop a "dhclient eth1" upstart job into a crowbar server installation. Just a sudo tee /etc/in... Anonymous
10:28 AM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
Anonymous
10:26 AM devops Feature #2697 (In Progress): crowbar: ISO generation, reproducible in a cloud image vm
Anonymous
10:16 AM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Anonymous
10:12 AM devops Feature #2696 (Rejected): chef: Automated QA
Use downburst vms on vercoi to automatically bring up ceph clusters, do basic RADOS/RBD functionality testing, tear d... Anonymous
10:11 AM devops Feature #2695 (Closed): crowbar: Automated QA
Use downburst vms on vercoi to automatically bring up ceph clusters, do basic RADOS/RBD functionality testing and Ope... Anonymous
10:10 AM rgw Bug #2642 (Resolved): rgw: show/trim usage using also time (not just date)
Done, commit:80a939a99db64f7802a4a3c1320316c91720f5d9 Yehuda Sadeh
10:08 AM rgw Bug #2658 (Resolved): rgw-admin: usage show fails when specifying hour > 12
Fixed, commit:c5d19b6df0bcb238e5e68732b4d252b06f2d9e56. Yehuda Sadeh
10:05 AM devops Feature #2584 (Resolved): sepia: provide networking, DHCP for dynamic virtual machines
Anonymous
10:05 AM devops Feature #2584: sepia: provide networking, DHCP for dynamic virtual machines
Split the DNS part to #2694, this is already providing value to users. Anonymous
09:59 AM devops Feature #2584: sepia: provide networking, DHCP for dynamic virtual machines
Status update: missing DNS updates, all the strictly required components are there; vms attached to the front network... Anonymous
10:04 AM devops Feature #2553: crowbar: open question: What's the correct way to add RBD support to the Nova barc...
(Wrong ticket, ignore) Anonymous
10:04 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Anonymous
09:24 AM devops Feature #2546 (Resolved): ceph-disk-prepare: take fsid from ceph.conf (support --cluster=name)
commit 4e774fbcb38fd6883232b72352512a5f8e4a66e8
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-07-03 09:22:28...
Anonymous
08:04 AM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
... Sage Weil

07/02/2012

09:25 PM Feature #2692 (Resolved): stable testing debian repos
Sage Weil
06:49 PM rbd Bug #2689 (In Progress): qemu iozone test hangs
Josh Durgin
02:51 PM rbd Bug #2689 (Resolved): qemu iozone test hangs
... Sage Weil
05:07 PM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
took down osd.2 and osd.3 with same crash. coredumps are on the hosts.. Sage Weil
05:06 PM Bug #2691 (Won't Fix): osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
... Sage Weil
04:40 PM Bug #2690 (Won't Fix): mon: persist quorum features
currently the non-leaders do not know the quorum features, and encode everything with a minimal (0) feature set.
...
Sage Weil
02:26 PM Linux kernel client Bug #2688 (Duplicate): lockup on ffsb + thrashing
... Sage Weil
12:54 PM Bug #2687: FileStore crashes when "osd_journal_size" is larger than the filesystem
for files, i think the right approach is to fallocate(), which will reserve the space. we shouldn't have to look at ... Sage Weil
12:47 PM Bug #2687 (Resolved): FileStore crashes when "osd_journal_size" is larger than the filesystem
See: http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7282
If a user (on tmpfs, in this case) specifies...
Greg Farnum
12:49 PM Bug #2476: osd: watch timeout depends on operations to an object
fix qa/workunits/rbd/copy.sh when this is fixed !!! Sage Weil
12:36 PM rbd Feature #2556: rbd tool: break image locks
The current progress in is wip-rbd-locking. Still needs tests and docs, plus a small cleanup as noted on github. Josh Durgin
12:32 PM rbd Feature #2686 (Resolved): rbd: let users specify a usage for shared locks
If existing lockers have the same usage, the lock succeeds. Otherwise, it fails. This could let you use locks with e.... Josh Durgin
11:28 AM rbd Feature #2685 (Rejected): Support QEMU migration with caching enabled
This is a libvirt problem, it's not related to qemu at all. I already looked into and tested whether qemu was doing f... Josh Durgin
11:21 AM rbd Feature #2685 (Rejected): Support QEMU migration with caching enabled
See http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7524
Apparently newer versions of QEMU refuse to...
Greg Farnum
09:44 AM Documentation #2684 (Won't Fix): doc: ceph and all daemons take --show-config
Quoting Sage:
For future reference, you can get a dump of all these values with
ceph-osd -i 123 --show-...
Anonymous
09:30 AM Bug #2593: logmonitor: decode failure
Do we know if the log in question actually existed on disk or not? Greg Farnum
07:28 AM Bug #2593: logmonitor: decode failure
saw this again on next:... Sage Weil
07:37 AM Bug #2683: ceph-fuse: crash during fsstress
... Sage Weil
07:31 AM Bug #2022 (Need More Info): osd: misdirectect request
apparently there is a different cause for this:... Sage Weil
05:57 AM Subtask #2621 (In Progress): mon: Single-Paxos: synchronize the MonitorDBStore of oblivious monitor
Joao Eduardo Luis

07/01/2012

09:46 PM Feature #2651: mon: race calling tick() when doing slurping
making this a cleanup so that it stops confusing me :) Sage Weil
08:57 PM Bug #2683 (Can't reproduce): ceph-fuse: crash during fsstress
... Sage Weil
07:48 PM Bug #2682 (Resolved): config lockdep error (recursive lock?) in LibRadosAio.SimpleWritePP
... Sage Weil
03:06 PM CephFS Bug #2681: client: got push without mds session
this was with 'ms inject socket failure = 200' Sage Weil
03:06 PM CephFS Bug #2681 (Resolved): client: got push without mds session
... Sage Weil
02:41 PM Bug #2599 (Can't reproduce): osd: crash in ReplicatedPG::C_OSD_OndiskWriteUnlock::finish
chalking this up to the bugs in next a couple weeks back Sage Weil
09:22 AM Feature #2680 (Resolved): osd: report backfill progress via query
... Sage Weil
07:09 AM CephFS Bug #2679 (Can't reproduce): POSIX file lock not released on process termination
I obtained a POSIX file lock with the following code:
> --- snip ---
>
> ...
> std::string x = "/tmp/ceph_mount...
Daniel Godas-Lopez

06/30/2012

10:52 PM rbd Documentation #2670: Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
Sage Weil
10:51 PM rbd Feature #2279 (Resolved): rbd: trivial layering design doc
Sage Weil
11:34 AM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3450 Tamilarasi muthamizhan

06/29/2012

09:44 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3441... Tamilarasi muthamizhan
03:39 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3435 Sage Weil
03:37 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3437 Sage Weil
03:33 PM Bug #2675: osd: segfault during log trim
also:... Sage Weil
03:30 PM Bug #2675 (Resolved): osd: segfault during log trim
... Sage Weil
09:02 PM Feature #2471 (Resolved): osd: add prefix match to OSDCaps
Sage Weil
09:00 PM Feature #2678 (Rejected): osd, objecter: redirect misdirected requests
Generic mechanism to refer the client to the correct osd when they misdirect their requests. This will allow the clu... Sage Weil
08:59 PM Bug #2676 (Resolved): mon: cannot create pool with old renamed name
commit:5a9355091296121823156de7d3160de45328a0cc Sage Weil
04:46 PM Bug #2676 (Resolved): mon: cannot create pool with old renamed name
renaming a pool name, and then trying to create a new pool with the old name fails. Yehuda Sadeh
07:27 PM rbd Bug #2677 (Resolved): librbd: create does not clean up well
A create that fails part way through does not remove objects it created or undo modifications it does, for example ad... Josh Durgin
07:23 PM rbd Feature #2279 (Fix Under Review): rbd: trivial layering design doc
See wip-rbd-layering-doc Josh Durgin
03:26 PM Messengers Bug #2569: msgr: connect_rank crash
fix for this is in wip-msgr, still testing Sage Weil
02:16 PM RADOS Feature #2541 (Resolved): crush: move command to adjust non-leaf node position
Sage Weil
12:54 PM Feature #2575 (Resolved): perf: 0.48 numbers
Mark Nelson
12:53 PM Feature #2582 (Resolved): set up chart.io + mysql (or equivalent) infrastructure for tracking perf
Mark Nelson
12:51 PM Feature #2577 (Resolved): teuthology: blktrace task
Mark Nelson
12:29 PM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
Tried this on master. Although at first I triggered something else, the bottom line is that this works, and the monit... Joao Eduardo Luis
12:14 PM Subtask #2674 (Rejected): mon: Single-Paxos: mon commits suicide after remove&add
Yep. Makes sense. I was afraid this was cause by my changes.
Rejecting it then.
Joao Eduardo Luis
11:30 AM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
Yeah.. basically we're changing the mon's ip by removing and re-adding it, and the mon isn't smart enough to realize ... Sage Weil
11:12 AM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
I believe this is intended behavior, note the last line:... Greg Farnum
03:07 AM Subtask #2674 (Rejected): mon: Single-Paxos: mon commits suicide after remove&add
Pre-conditions:
3 mons: a=127.0.0.1:6789 ; b=127.0.0.1:6790 ; c=127.0.0.1:6791
* remove 'c' with ./ceph mon rem...
Joao Eduardo Luis
11:09 AM Bug #2646: mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
commit:840ae244499496d543d634713bdee7c7884ce527
The tick happened at the same time as slurping, which meant the di...
Greg Farnum
10:54 AM Bug #2646 (Resolved): mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
Sage Weil
10:53 AM Bug #2264 (Can't reproduce): mon: failed assert in bump_epoch
Sage Weil
06:19 AM Bug #2618: error: unable to open OSD superblock
Thanks, but that didn't help.
I did notice that drives get mounted a little weird.
Don't know if that's a problem...
John S

06/28/2012

10:06 PM Bug #2664 (Resolved): osd: extra attr _path, extra attr snapset from scrub
Sage Weil
11:29 AM Bug #2673 (Resolved): ReplicatedPG::prepare_transaction: don't crash on empty ops
Samuel Just
11:26 AM Cleanup #2672 (Rejected): PG::find_best_info cleanup
see 253033cd720db86e7c8372fd4184de7d4c43bce2 Samuel Just
11:26 AM Cleanup #2671 (Resolved): buffer.h: do efficient buffer comparisons
Samuel Just
10:15 AM rbd Documentation #2670 (Resolved): Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
A naive user looking for "rbd map" will instead find this:
http://ceph.com/docs/master/rbd/rados-rbd-cmds/
with...
Anonymous
10:04 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Lots of work on the messenger client, but still not completely
clear this particular bug is fixed. There are a few ...
Alex Elder
09:42 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
I suppose this really ought to get fixed at some point.
For now, it looks like Sage has implemented a workaround
th...
Alex Elder
09:41 AM rbd Bug #1070: krbd: ^C doesn't work
No progress on this. None expected unless it gets
reprioritized and planned.
Alex Elder
09:40 AM Linux kernel client Feature #1699: debug symbols in autobuilt (sepia) kernels
No progress on this. I have a vague memory that someone
else might have looked at this problem a while back (Dan?)....
Alex Elder
09:39 AM Feature #2127: Save kernel core dumps on all of our test machines
My work on this was pretty much complete a few months ago.
It included a shell script that leverages Ubuntu kdump
...
Alex Elder
09:32 AM Linux kernel client Bug #2261 (Can't reproduce): paging error in libceph after crashed osd comes back online
the osd_client refcounting bug fix may explain this one, too... commit:0d47766f14211a73eaf54cab234db134ece79f49
an...
Sage Weil
09:16 AM Linux kernel client Bug #2261: paging error in libceph after crashed osd comes back online
No progress on this.
There has been a lot of work on the messenger code since this bug was
reported. One change ...
Alex Elder
09:31 AM Linux kernel client Cleanup #2130: ceph: xattr: complete cleanups following review
No progress on this, but I still have the patches. I'll
try to sneak them in as I'm working on RBD. I believe
the...
Alex Elder
09:29 AM Linux kernel client Cleanup #2131: ceph: xattr: use the generic kernel xattr code
No progress on this. It should be put on our roadmap as a task
to complete, maybe within the next 6 months.
Alex Elder
09:12 AM Bug #2267 (Closed): Ceph client crashed after shutting down one mds and osd
A recent fix supplied by Zheng Yan of Intel seems to have fixed
this problem, so I'm closing this bug.
rbd: C...
Alex Elder
09:05 AM rbd Feature #2326 (In Progress): krbd: use new class interfaces, new image format
I've finally begun work on this, following some in-person discussion
with Josh, Dan, and Sage this week.
I will u...
Alex Elder
09:00 AM Linux kernel client Feature #2374: ceph-client: start laying the groundwork for Linux tracepoints
No progress on this yet.
However, I got this e-mail from Jim Schutt shortly after creating
this bug, and just wan...
Alex Elder
08:44 AM Bug #2386: xfstests: failed #34
I've been trying to find out whether this is still a problem or
if it was transient. But teuthology has had a strin...
Alex Elder
07:41 AM Linux kernel client Bug #2424 (Resolved): ceph-client: messenger: badness in prepare_write_connect()
This bug was fixed in May, by a small series of changes that
culminated in this one:
commit 3da54776e2c0385c3...
Alex Elder
07:37 AM Linux kernel client Cleanup #2432: ceph-client: messenger: refactor to simplify state model
I had worked out on paper some notes about a longer-term state/event
model that could be used for the client messeng...
Alex Elder
07:33 AM Linux kernel client Cleanup #2432: ceph-client: messenger: refactor to simplify state model
I worked on doing this for a good month but the job really isn't
complete. Nevertheless I think there was some prog...
Alex Elder
07:23 AM Linux kernel client Cleanup #2438: ceph-client: use BUG_ON() for null auth_client->ops pointers
Touching all my bugs today. This one's a good idea but
very low priority.
Alex Elder
07:20 AM rbd Bug #2608: rbd: hung xfstest 270
Just to summarize what I just added...
There are some recent XFS problems that might explain this,
irrespective o...
Alex Elder
07:16 AM rbd Bug #2608: rbd: hung xfstest 270
I looked at this on Tuesday, and sent a note to Sage that should
have instead been put here. Here it is.
I w...
Alex Elder
04:54 AM Feature #2668 (Resolved): Build linux-tools-common package for perf
It'd be really nice if we built linux-tools-common with our gitbuilder kernels so we can install perf on our test box... Mark Nelson

06/27/2012

06:10 PM Bug #2618: error: unable to open OSD superblock
I noticed an issue in your ceph.conf - you have keyring = /etc/ceph/keyring.admin in the global section, and the osd ... Josh Durgin
05:19 PM rbd Bug #2667 (Won't Fix): librbd: create_snap on a closed image segfaults
I wrote silly code, and in reordering it, managed to attempt rbd_snap_create() on an
image that I had rbd_close()d. ...
Dan Mick
05:13 PM Feature #2651: mon: race calling tick() when doing slurping
oops, stronger fix, yes! Sage Weil
05:13 PM Feature #2651 (Resolved): mon: race calling tick() when doing slurping
Sage Weil
05:01 PM Feature #2661 (Resolved): mon: do not allow monitors to be added to the map with port 0
Merged into dho and next. Thanks Joao! Greg Farnum
11:25 AM Feature #2661 (Resolved): mon: do not allow monitors to be added to the map with port 0
Last week, somebody used the "ceph mon add" command without specifying a port, and it defaulted to port 0. This cause... Greg Farnum
04:48 PM Feature #2666 (Resolved): rados tool: copy pool
A new operation to copy the entire content of a pool into a different pool. For each object we'd copy the locator, da... Yehuda Sadeh
04:04 PM rgw Bug #2665 (Resolved): rest-bench hangs periodically
rest-bench seems to hang periodically with the following spit out the console on a regular basis:
plana83: 2012-06...
Mark Nelson
04:04 PM Bug #2656 (Rejected): rados-bench hangs periodically
Mark Nelson
04:03 PM Bug #2656: rados-bench hangs periodically
gah,
this is what I get for submitting bugs at the end of the day. You are correct, rest-bench.
Mark Nelson
03:29 PM devops Feature #2587 (Resolved): sepia: isolated networking on vercoi (manual, a handful)
Anonymous
03:28 PM devops Feature #2587: sepia: isolated networking on vercoi (manual, a handful)
Confirmed: isolated0..isolated9 work even if Crowbar wants to put VLANs in them. They pass between vercoi as packets ... Anonymous
02:17 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
More on where that snippet should live:
- for standalone chef deployment, we want the admin run something similar,...
Anonymous
02:14 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
This python snippet creates ceph keys in the right format (for now). Where it should live is still an open question.
...
Anonymous
01:38 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Without this, multi-mon bring-up is racy.
At proposal save time, the barclamp should inspect the roles, and assign...
Anonymous
02:12 PM Bug #2664: osd: extra attr _path, extra attr snapset from scrub
full logs at metropolis:~sage/bug-2664 Sage Weil
02:11 PM Bug #2664 (Resolved): osd: extra attr _path, extra attr snapset from scrub
... Sage Weil
01:43 PM devops Feature #2663 (Closed): crowbar: UI for setting generic ceph.conf values
This needs to be some sort of an extensible list of key: value pairs.
Do we need to support sections too? Probably...
Anonymous
01:17 PM devops Feature #2589 (Resolved): crowbar: Update barclamp-ceph for Essex, new ceph-cookbooks
Tyler reported success as of b2c5d3307eef0ca44fd4b001136e9af043b322bd. Anonymous
01:16 PM devops Feature #2588: downburst: multiple, configurable networks to libvirt
For historical value: https://github.com/ceph/downburst/commit/de494eeefad0f0c72916d5dab8ba015b441a94f0 Anonymous
11:30 AM devops Feature #2588 (Resolved): downburst: multiple, configurable networks to libvirt
Anonymous
11:26 AM Linux kernel client Bug #2590: possible irq lock inversion dependency with con->mutex and osdc->request_mutex
Recent log location: /a/teuthology-2012-06-27_00:00:07-regression-next-testing-basic/3076
2012-06-27T01:25:05.11...
Tamilarasi muthamizhan
10:17 AM rbd Feature #2660 (New): qa: test resizing an rbd image while a vm has it open
Make sure the resize is visible to the guest. This works with the virtio driver after doing e.g. 'echo 1 | sudo tee /... Josh Durgin
10:02 AM Subtask #2659 (Can't reproduce): mon: Single-Paxos: ceph tool -w subscriptions not being updated
how to reproduce:... Joao Eduardo Luis

06/26/2012

05:16 PM rgw Bug #2658 (Resolved): rgw-admin: usage show fails when specifying hour > 12
using wrong modifier on for parsing it. Yehuda Sadeh
05:11 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
possibly fixed by commit:0d8970fc813b33e7c6ba2484fbc43cce947d3f4d Sage Weil
04:31 PM CephFS Bug #2657 (Resolved): kclient: direct io write larger than 8MiB fails
Writes larger than 8MiB get EFAULT, e.g.:... Josh Durgin
02:13 PM Bug #2656: rados-bench hangs periodically
rados-bench or rest-bench? Yehuda Sadeh
01:27 PM Bug #2656 (Rejected): rados-bench hangs periodically
rados-bench seems to hang periodically with the following spit out the console on a regular basis:
plana83: 2012-0...
Mark Nelson
01:45 PM Bug #2563 (Can't reproduce): leveldb corruption
It looks like one of the leveldb store files was corrupted, possibly by the filesystem. It may be possible to recove... Samuel Just
09:36 AM Bug #2655 (Resolved): scrub slows writes more than it should
Samuel Just
09:34 AM Subtask #2616 (Closed): mon: Single-Paxos: AuthMonitor: key_server has no entries
Joao Eduardo Luis
09:34 AM Subtask #2616 (Resolved): mon: Single-Paxos: AuthMonitor: key_server has no entries
Joao Eduardo Luis
09:33 AM Subtask #2620 (Closed): mon: Single-Paxos: MDSMonitor: MMDSBeacon from entity with insufficient p...
Note: turns out this was the same bug as #2643
Had to do with the AuthMonitor losing some infos when reading versi...
Joao Eduardo Luis
09:32 AM Subtask #2643 (Closed): mon: Single-Paxos: mds: Strange message behavior on peon
Had to do with the AuthMonitor losing some infos when reading versions from the store.
This is fixed.
Joao Eduardo Luis
09:01 AM Linux kernel client Bug #2523: xfs: xfs_iolock_reclaimable
... Sage Weil
06:15 AM rbd Bug #2654 (Won't Fix): Stale rbd volume cannot be unmaped
/dev/rbd0 exists in system but /dev/rbd/winnie-test/postgresql not... Maciej Galkiewicz

06/25/2012

10:01 PM rbd Bug #2608: rbd: hung xfstest 270
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-23_00:00:02-regression-next-testing-basic/1471
m...
Sage Weil
09:56 PM Bug #2536 (Need More Info): librados crashed while getting stat of an object
Sage Weil
09:56 PM Bug #2536: librados crashed while getting stat of an object
Have you seen this problem since then? It looks like it could be due to racing with rados startup or shutdown... Sage Weil
09:41 PM Bug #2346 (Resolved): xfs filesystem on top of rbd volume corrupts
No news is good news! Sage Weil
09:40 PM Bug #2602 (Resolved): osd: push failed because local copy is X
Sage Weil
05:09 PM Messengers Bug #2569: msgr: connect_rank crash
All three mon nodes and a client node on the second aging cluster died over the weekend (kernel and all). Looks like ... Mark Nelson
10:25 AM Messengers Bug #2569: msgr: connect_rank crash
Saw the following while debugging my aging test scripts. Seems to have happened when the mon was started. No core d... Mark Nelson
03:33 PM Bug #2649: osd: log bound mismatch
... Sage Weil
03:31 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
moved test to marginal suite; move back to regression when this is resolved! Sage Weil
03:31 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
ubuntu@teuthology:/a/teuthology-2012-06-24_00:00:07-regression-next-testing-basic$ ... Sage Weil
03:28 PM Bug #2593: logmonitor: decode failure
I wonder if this is also due to tick() colliding with slurping — the first one definitely could be (not sure about th... Greg Farnum
03:27 PM Bug #2653 (Resolved): Web docs point to obsolete "fusermount" page
The page http://ceph.com/docs/master/man/8/mount.ceph/ has a link at the bottome that points to "fusermount" descript... Ken Franklin
03:21 PM Bug #2618: error: unable to open OSD superblock
attaching my ceph.conf.
Can't get to IRC from work - I'll try in the evenings.
thanks
John S
02:54 PM rgw Bug #2652 (Resolved): Segmentation fault in rest-bench
This happened while running rest-bench during aging tests on the burnupi cluster.
--
plana83: *** Caught signal...
Mark Nelson
02:48 PM Bug #2022 (Resolved): osd: misdirectect request
Sage Weil
02:40 PM Feature #2651 (Rejected): mon: race calling tick() when doing slurping
Right now the monitor calls tick() on all the PaxosService implementations when it's doing slurping. This introduces ... Greg Farnum
09:19 AM rgw Bug #2650 (Resolved): rgw: swift key creation overrides subuser access mask
# radosgw-admin subuser create --uid=johndoe --subuser=johndoe:swift
--access=full
{ "user_id": "johndoe",
"rados...
Yehuda Sadeh

06/23/2012

04:56 PM Bug #2649 (Resolved): osd: log bound mismatch
... Sage Weil

06/22/2012

07:14 PM Bug #2648 (Resolved): removing a monitor from the map while it's running causes a crash
... Greg Farnum
05:27 PM Bug #2647 (Can't reproduce): osd: old request, waiting for subops
primary:... Yehuda Sadeh
11:43 AM Bug #2618: error: unable to open OSD superblock
John, can we see your ceph.conf file? If you have time, try chatting in #ceph on irc.oftc.net as well; perhaps we ca... Dan Mick
11:30 AM Bug #2646 (Resolved): mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
... Yehuda Sadeh
08:17 AM Subtask #2645 (Rejected): mon: Single-Paxos: Could not decrypt ticket info (immediately after run...
There was a lingering monitor still running, from a previous install.
Apparently, holding the wrong keys will lead...
Joao Eduardo Luis
08:09 AM Subtask #2645 (Rejected): mon: Single-Paxos: Could not decrypt ticket info (immediately after run...
... Joao Eduardo Luis
12:24 AM Bug #2602: osd: push failed because local copy is X
Hi Sage,
just updated to your wip_rolling_upgrade branch.
FileStore update worked ( 100GB => 30 minutes on XFS ) ...
Simon Frerichs

06/21/2012

06:55 PM rbd Feature #2566 (Duplicate): teuthology: task to run rbd workunits in a vm
Same as #1713. Josh Durgin
06:53 PM rbd Feature #1713 (Resolved): teuthology: qemu tasks, tests
Basic teuthology task is done in 38f6a78c71910a39b7f1890316c0a134ced8b0ec. Making a gitbuilder for qemu seems less im... Josh Durgin
06:52 PM rbd Feature #2644 (Rejected): qa: gitbuilder for qemu
This should build qemu with rbd support for regression testing new versions of qemu. Josh Durgin
06:49 PM rbd Feature #2567 (Resolved): qa: add qemu+rbd jobs to qa suite
Added in 94a6ab8ff3637f68c03261cf845b402d6bfa8e76 Josh Durgin
04:30 PM Subtask #2643: mon: Single-Paxos: mds: Strange message behavior on peon
This is what can be seen on the Leader:... Joao Eduardo Luis
04:24 PM Subtask #2643 (Closed): mon: Single-Paxos: mds: Strange message behavior on peon
Just for future reference.
When checking how things were going with the monitors, we noticed that the following sn...
Joao Eduardo Luis
03:37 PM Subtask #2633: mon: Single-Paxos: ceph tool unable to connect to monitor
Has something changed in the last five hours that you think fixed this? Greg Farnum
03:28 PM Subtask #2633 (Closed): mon: Single-Paxos: ceph tool unable to connect to monitor
It appears to be fixed.
The ceph tool is able to obtain the status from the monitors.
The 'watch' command doesn...
Joao Eduardo Luis
10:11 AM Subtask #2633 (Closed): mon: Single-Paxos: ceph tool unable to connect to monitor
This is what usually happens on the monitor side. Every now and then, the ceph tool is able to connect, but we haven'... Joao Eduardo Luis
01:54 PM rgw Bug #2642 (Resolved): rgw: show/trim usage using also time (not just date)
Yehuda Sadeh
01:42 PM Feature #2577 (In Progress): teuthology: blktrace task
Sage Weil
01:41 PM Feature #2581 (Resolved): perf: investigate 0.47.2 precise vs 0.46 oneiric discrepancy
Sage Weil
01:40 PM Feature #2576 (Resolved): perf: 0.48 on long-term clusters
Sage Weil
01:17 PM Linux kernel client Bug #2302 (Can't reproduce): xfs: warning at mutex_remove_waiter
Sage Weil
12:38 PM Bug #2550 (Resolved): logrotate: SIGHUP upstart jobs too, not just sysvinit
Sage Weil
12:06 PM rbd Feature #2641 (Duplicate): qa: regression tests for rbd openstack volume driver
This should include:
* booting a vm from an rbd device
* attaching/detaching an rbd device to a running guest
* ad...
Josh Durgin
11:30 AM rbd Feature #2640 (Duplicate): qa: regression tests for rbd glance backend
This should run against development versions of openstack to verify that the glance backend continues to work. Namely... Josh Durgin
11:26 AM Bug #2042 (Duplicate): mon: crash in LogMonitor::update_from_paxos
Indeed! Sage Weil
11:21 AM Bug #2042: mon: crash in LogMonitor::update_from_paxos
Hrm, I think that this is duplicated by #2593? Greg Farnum
11:16 AM Bug #2042 (Can't reproduce): mon: crash in LogMonitor::update_from_paxos
Sage Weil
11:13 AM Cleanup #2623 (Resolved): filestore btrfs trans should be removed
Sage Weil
11:07 AM Feature #1494 (Resolved): openstack: vm can boot off rbd
This has been possible for a long time. Josh Durgin
11:03 AM Bug #2638 (Resolved): mon: make pool ops idempotent
for example, deleting a pool fails with ENOENT (or ENODATA :/) if the pool doesn't exist, but if we lose our mon sess... Sage Weil
11:02 AM rbd Feature #2637 (New): teuthology: task for running a vm using libvirt
This should have similar semantics to the qemu task that runs qemu directly, but configure and run the vm via libvirt... Josh Durgin
10:59 AM rbd Feature #2636 (New): qa: regression tests for qemu monitor commands
Test attach/detach of rbd devices and snapshot operations executed directly by the qemu monitor. This is probably eas... Josh Durgin
10:40 AM Bug #2602 (Need More Info): osd: push failed because local copy is X
Hi Simon-
This looks like something that could be caused by the broken rolling osd upgrade support in the branch y...
Sage Weil
10:14 AM rgw Feature #2635 (New): benchmark for measuring rgw metadata operations
We need to come up with a benchmark that will measure the following operations:
* Service:
1. List buckets
*...
Yehuda Sadeh
10:11 AM rbd Feature #2634 (Resolved): teuthology: add networking to qemu task
Let the guest speak to the outside world so test scripts can e.g. check out git repos and download test programs to c... Josh Durgin
06:32 AM Bug #2618: error: unable to open OSD superblock
I manually created the directory.
Then I ran the mkcephfs command.
The directory has some files in it (journal, mag...
John S

06/20/2012

09:39 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
Sage Weil
09:02 PM Bug #2593: logmonitor: decode failure
... Sage Weil
06:55 PM rbd Feature #2630 (Resolved): teuthology: add task to run qemu-iotests against rbd
qemu-iotests are included in upstream qemu.git. They exercise qemu's block layer to test correctness. They use existi... Josh Durgin
06:44 PM rbd Feature #2629 (New): qa: test performance during live migration
This could be done after #2628 by running iozone during the migration, parsing its output, and checking that throughp... Josh Durgin
06:41 PM rbd Feature #2628 (New): qa: test live migration with qemu
Run something like fsstress in the vm during the migration, and verify that it completes successfully. To do this we'... Josh Durgin
06:37 PM rbd Feature #2627 (New): qa: regression tests for libvirt rbd storage pool
Libvirt storage pools allow you to create, delete, and list volumes. Wido wrote a backend that uses librbd to do this... Josh Durgin
06:30 PM rbd Feature #2626 (New): qa: regression tests for basic rbd libvirt integration (disks)
Test using rbd disks with vms through libvirt.
This includes:
* booting a vm backed only by rbd
* attaching rb...
Josh Durgin
06:15 PM rbd Feature #2625 (Rejected): qa: gitbuilder for libvirt
Create a gitbuilder for libvirt packages so we can regression test rbd against upstream releases. Base this on the ub... Josh Durgin
05:36 PM Bug #2600: osd: crazy long watch timeout?
In another recurrence, there are no objecter requests:... Josh Durgin
04:34 PM Bug #2524 (Won't Fix): librados crashed while connecting to cluster
Sage Weil
04:34 PM Bug #2456 (Resolved): librbd: failed LibRBD.TestIOToSnapshot
Haven't seen this in a while. Maybe some of the race cleanups fixed it... Sage Weil
04:32 PM Documentation #2624: OpenStack creation instructions should recommend non-default number of pg's ...
It'll have to be ceph osd pool create <pool> <num_pgs> until #2519 is done. Josh Durgin
04:25 PM Documentation #2624 (Resolved): OpenStack creation instructions should recommend non-default numb...
http://ceph.com/docs/master/rbd/rbd-openstack/ recommends
sudo rados mkpool nova
This should probably be
su...
Dan Mick
03:46 PM Cleanup #2623 (Resolved): filestore btrfs trans should be removed
On Wed, 20 Jun 2012, Stefan Priebe - Profihost AG wrote:
> Hello list,
>
> i've looked at the wiki (http://ceph.co...
Dan Mick
03:01 PM Subtask #2622 (Resolved): mon: Single-Paxos: convert existing, old MonitorStore to a brand new Mo...
The new monitor design does not support the old MonitorStore, nor does it store the versions and their values in the ... Joao Eduardo Luis
02:58 PM Subtask #2621 (Resolved): mon: Single-Paxos: synchronize the MonitorDBStore of oblivious monitor
*Objective:* synchronize monitor stores over the network whenever a given monitor mon.X falls too far behind.
*Sol...
Joao Eduardo Luis
02:50 PM Subtask #2615 (Closed): mon: Single-Paxos: MDSMap::get_health() asserting
Joao Eduardo Luis
02:49 PM Subtask #2615: mon: Single-Paxos: MDSMap::get_health() asserting
This issue stopped popping up after we changed the criteria to propose queued proposals and restarted testing with a ... Joao Eduardo Luis
03:59 AM Subtask #2615 (Closed): mon: Single-Paxos: MDSMap::get_health() asserting
MDSMap infos, dumped on MDSMap::get_health() just before the assert is triggered:... Joao Eduardo Luis
02:47 PM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
Appears to be fixed.
The ceph tool is able to connect to the cluster and obtain status information.
However, th...
Joao Eduardo Luis
11:01 AM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
Although this appears to be fixed, we still are unable to authenticate clients.
My current suspicion is that we ar...
Joao Eduardo Luis
09:00 AM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
We were encoding an empty "full version" of the key server during AuthMonitor::encode_pending(), along side with the ... Joao Eduardo Luis
08:36 AM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
The problem appears to affect all mon clients, and it may be the reason why our OSDs do not work as well.
Log snip...
Joao Eduardo Luis
08:00 AM Subtask #2616 (Closed): mon: Single-Paxos: AuthMonitor: key_server has no entries
The Monitor's key_server has no entries, even though we made sure to populate mon.X/keyring with every single service... Joao Eduardo Luis
02:45 PM Subtask #2614: Single Paxos instance shared across the existing services
Joao Eduardo Luis
03:48 AM Subtask #2614 (Closed): Single Paxos instance shared across the existing services
One Paxos to propose them all. Joao Eduardo Luis
02:44 PM Subtask #2620 (Closed): mon: Single-Paxos: MDSMonitor: MMDSBeacon from entity with insufficient p...
... Joao Eduardo Luis
02:06 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
Please mention https://bugs.launchpad.net/upstart/+bug/1012938 in the "sucks" comment, so someone can some day nicely... Anonymous
01:47 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
repushed upstart-vs-logrotate branch Sage Weil
12:25 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
yeah, that'll work. only solves the logrotate case, but that's fine by me. Sage Weil
11:39 AM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
That killall thing is hideous, and I'm utterly unconvinced having even more upstart jobs for Ceph is helpful in any w... Anonymous
12:30 PM Feature #2619 (Resolved): filejournal: instrument with perfcounters
Sage Weil
12:09 PM Bug #2618: error: unable to open OSD superblock
Hi John,
Did you create the /data/ceph/osd0 directory? mkcephfs doesn't do it for you because of the potential for...
Josh Durgin
11:31 AM Bug #2618 (Can't reproduce): error: unable to open OSD superblock
I am new at this.
I installed ceph.
When I do a service ceph start, mon.0 and mds.(machine name) seem ok.
When it ...
John S
11:13 AM Bug #2022: osd: misdirectect request
here is the smoking gun. note that teh pgid goes to 0.0 when linger tid 1 is resending the watch op 4:... Sage Weil
03:45 AM Subtask #2613: Sandbox PaxosServices accesses to the store
I messed up the formatting and don't seem to be able to edit it. So here goes a decent version of it.... Joao Eduardo Luis
03:41 AM Subtask #2613 (Resolved): Sandbox PaxosServices accesses to the store
Each service used to have direct access to the MonitorStore, and they could mess around wherever they wanted, allowin... Joao Eduardo Luis
03:25 AM Subtask #2612 (Resolved): Monitor key/value store
Create a key/value store, with transaction support, to be used on the monitor subsystem.
Its interface should refl...
Joao Eduardo Luis
03:21 AM Feature #2611 (Resolved): mon: Single-Paxos
The ceph-mon is (roughly) composed by a Monitor class, responsible for all things monitor-ish, and several monitor se... Joao Eduardo Luis

06/19/2012

07:04 PM rbd Feature #2556: rbd tool: break image locks
Argh. I don't seem to be getting my email notifications from you and Josh on Github, and I don't know why. Greg Farnum
06:57 PM rbd Feature #2556: rbd tool: break image locks
https://github.com/ceph/ceph/commit/3c05629691deb800e3c6e62e81f444a748e8857c#src-rbd-cc-P108
just making sure i un...
Sage Weil
06:48 PM rbd Feature #2556: rbd tool: break image locks
Your commits look good to me (sorry I missed the cli tests; I need to get into the habit of running those), but I don... Greg Farnum
05:46 PM rbd Feature #2556: rbd tool: break image locks
rebase, fixed up ENOENT vs ENOEXEC behavior. one clarification about the purpose/scope of 'rbd lock', but otherwise ... Sage Weil
03:13 PM rbd Feature #2556 (Fix Under Review): rbd tool: break image locks
wip-rbd-locking has this now, but it also merges in wip-clsrbd for an unrelated change, so you might want to wait to ... Greg Farnum
05:06 PM Bug #2610 (Resolved): osd: pg stuck at scrubbing
Happened on congress, pg was stuck at scrubbing state for two and a half days.... Yehuda Sadeh
04:20 PM rbd Feature #2558 (Resolved): cls_rbd: child/parent methods
Sage Weil
04:05 PM devops Feature #2584 (In Progress): sepia: provide networking, DHCP for dynamic virtual machines
Sage Weil
04:04 PM Feature #2576 (In Progress): perf: 0.48 on long-term clusters
Sage Weil
04:04 PM Feature #2575 (In Progress): perf: 0.48 numbers
Sage Weil
03:52 PM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
To prevent rename from disrupting clients with images open,
* put header in rbd_head.$id
* put $id in rbd_id.$nam...
Sage Weil
02:32 PM rgw Feature #2516 (Resolved): rgw: new bandwidth-only per-user log
Sage Weil
02:28 PM rbd Bug #2608 (Closed): rbd: hung xfstest 270
Logs are available in ubuntu@teuthology:/a/teuthology-2012-06-19_00:00:09-regression-next-testing-basic/1792
2012-...
Tamilarasi muthamizhan
01:25 PM Bug #2022: osd: misdirectect request
latest run log: ubuntu@teuthology:/a/teuthology-2012-06-18_19:00:05-regression-master-testing-gcov/1586 Tamilarasi muthamizhan
12:54 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
ubuntu@teuthology:/a/teuthology-2012-06-18_19:00:05-regression-master-testing-gcov/1579 Tamilarasi muthamizhan
11:31 AM Messengers Bug #1985: msgr: creating new Pipe for pre-existing connection leaks Pipe if they don't replace
I've still got this sitting around in my workspace. Since we seem to have pushed back a messenger re-do, perhaps we s... Greg Farnum
09:57 AM rbd Feature #2607 (Resolved): librbd: copyup helper
copyup helper to perform a copyup from parent to child. will be used by both the rbd command-line copyup command, an... Sage Weil
09:57 AM rbd Subtask #2606 (Resolved): librbd layering: copyup on missing child object
Sage Weil
09:57 AM rbd Subtask #2605 (Resolved): librbd layering: guard writes
Sage Weil
09:56 AM rbd Subtask #2604 (Resolved): librbd layering: read path
Sage Weil
09:56 AM rbd Subtask #2603 (Resolved): librbd layering: open parent on open
Sage Weil

06/18/2012

10:07 PM Bug #2550 (Fix Under Review): logrotate: SIGHUP upstart jobs too, not just sysvinit
Sigh. See branch upstart-vs-logrotate. Sage Weil
08:57 PM rbd Feature #2556: rbd tool: break image locks
Greg Farnum wrote:
> Team RBD needs more to do! Pulling this forward. :)
Go team! :)
Sage Weil
06:26 PM rbd Feature #2556 (In Progress): rbd tool: break image locks
Team RBD needs more to do! Pulling this forward. :) Greg Farnum
05:56 PM rbd Feature #2585 (In Progress): rbd: clone command
Dan Mick
05:34 PM rbd Feature #2585: rbd: clone command
Dan Mick
05:35 PM rbd Feature #2559: cls_rbd: copyup method
Dan Mick
01:50 PM rbd Feature #2601: rbd: Show image size with an "ls"
We've also heard from others that having a better estimate of rbd usage and expected usage would be good; taking into... Dan Mick
06:09 AM rbd Feature #2601 (Resolved): rbd: Show image size with an "ls"
On the mailinglist the request came if the "rbd" tool could be modified to not only show image names when doing an ls... Wido den Hollander
01:34 PM rgw Bug #2542 (Resolved): rgw: support S3 update of metadata
Yehuda Sadeh
01:32 PM rgw Bug #2542: rgw: support S3 update of metadata
Resolved, commit:343cc792e847ca8901f6c08e41799a2fbbd2ca92 Yehuda Sadeh
11:04 AM Bug #2602: osd: push failed because local copy is X
Updated another osd to 'next' and same errors happened.
I've attached the log with debug osd = 20 set.
Simon Frerichs
08:46 AM Bug #2602: osd: push failed because local copy is X
Is this reproducible with 'debug osd = 20'? Sage Weil
08:44 AM Bug #2602 (Resolved): osd: push failed because local copy is X
Hi,
filestore updated completed.
When i start the "updated" OSD the whole cluster starts lagging.
Is the next br...
Sage Weil
08:45 AM Bug #2598: filestore: error during upgrade
Simon Frerichs wrote:
> Hi,
>
> filestore updated completed.
> When i start the "updated" OSD the whole cluster ...
Sage Weil
08:42 AM Bug #2598 (Resolved): filestore: error during upgrade
THanks! Sage Weil
01:29 AM Bug #2598: filestore: error during upgrade
Hi,
filestore updated completed.
When i start the "updated" OSD the whole cluster starts lagging.
Is the next br...
Simon Frerichs
12:56 AM Bug #2598: filestore: error during upgrade
Thanks.
The bug seems to be fixed.
Simon Frerichs
08:43 AM Bug #2595: filestore: error creating filestore during mkcephfs
2012-06-18 17:42:16.232924 7f54292fb780 -1 filestore(/srv/osd.20) could not find 23c2fcde/osd_superblock/0//-1 in ind... Stefan Priebe
08:29 AM Bug #2599: osd: crash in ReplicatedPG::C_OSD_OndiskWriteUnlock::finish
commit:5efaa8d7799347dfae38333b1fd6e1a87dc76b28 Sage Weil
07:25 AM CephFS Bug #2596: mds: spinning on restart
gdb is not helpful here, process seems to be spinning in syscall:
(gdb) thread apply all bt
Thread 1 (process 148...
Amon Ott
 

Also available in: Atom