Activity

From 06/26/2012 to 07/25/2012

07/25/2012

09:54 PM Bug #2843 (Can't reproduce): filestore: replay failure on xfs
congress osd.328 crashed with... Sage Weil
05:55 PM Bug #2842: mon: health detail lists pgs multiple times
This and #2827 may be related? Greg Farnum
05:52 PM Bug #2842 (Won't Fix): mon: health detail lists pgs multiple times
... Sage Weil
05:16 PM rgw Bug #1855: Creation of a subuser that appears to own an s3 key is possible, and removing the subu...
This bug can be reproduced by using the following options
./radosgw-admin -c {'ceph.conf'} --rgw-socket-path=/tmp...
caleb miles
05:16 PM rgw Bug #2841 (Resolved): rgw: fix usage trim
looking at the code, it seems that we don't encode the user in usage-trim (and also encode the wrong structure). Yehuda Sadeh
04:52 PM CephFS Bug #2187: pjd chown/00.t failed test 97
2012-07-23T19:16:10.185 INFO:teuthology.task.workunit.client.0.out:not ok 43
2012-07-23T19:16:10.186 INFO:teuthology...
Tamilarasi muthamizhan
04:51 PM CephFS Bug #2187: pjd chown/00.t failed test 97
Latest log: ubuntu@teuthology:/a/teuthology-2012-07-23_19:00:03-regression-master-testing-gcov/16530
Tamilarasi muthamizhan
04:30 PM Feature #2840 (Resolved): mon: $mon_data/cluster_fsid file
maybe written/verified by mkfs! Sage Weil
04:23 PM rgw Feature #2839 (Resolved): rgw: garbage collection
Provide a garbage collection mechanism, along the lines of what was described in a post to the mailing list. Yehuda Sadeh
04:20 PM rgw Bug #2652: Segmentation fault in rest-bench
is it still happening? Yehuda Sadeh
04:20 PM rgw Bug #2665: rest-bench hangs periodically
is that still happening? Yehuda Sadeh
04:10 PM devops Feature #2574 (Resolved): crowbar: use data disks automatically, journal inside data directory
There were bugs and the history was wrecked by github pull requests again, so I redid some commits, but this function... Anonymous
03:45 PM rgw Feature #2039 (Rejected): rgw: keep more than one bucket marker object
That's not the case anymore. We use the unique client id and a running counter instead. Yehuda Sadeh
02:08 PM Bug #2838 (Resolved): mon: json version of 'osd tree'
Sage Weil
01:52 PM Bug #2824: ceph-fuse; hang mounting with ms failures
Sage Weil
01:52 PM Bug #2835 (Resolved): osd: do not send alive/upthru until booted
Sage Weil
01:52 PM Bug #2836 (Resolved): osd: boot condition check incorrect
Sage Weil
10:52 AM Bug #2836 (Resolved): osd: boot condition check incorrect
commit:5979351ef3d3d03bced9286f79cbc22524c4a8de Sage Weil
11:04 AM Bug #2837 (Resolved): osd: past_interval calculation inefficient
It is still possible for osds to get pgs without past intervals and need to recalculate them, and that calculation ca... Sage Weil

07/24/2012

08:58 PM Bug #2835 (Resolved): osd: do not send alive/upthru until booted
Sage Weil
04:02 PM Bug #2833: LibRBD.TestClone failure with ms failure injection
I'm pretty sure this is due to the op being done twice due to replay. That error (-EBADF) means the id was already ad... Josh Durgin
03:49 PM Bug #2833 (Resolved): LibRBD.TestClone failure with ms failure injection
... Sage Weil
03:54 PM Bug #2834 (Resolved): osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == ...
... Sage Weil
03:10 PM devops Feature #2574: crowbar: use data disks automatically, journal inside data directory
The most recent pull request for the cookbook has been tested by Tyler and myself. I've set up the following situation... JuanJose Galvez
02:57 PM Bug #2832: [argonaut] config set foo bar broken?
What was the resolution? Greg Farnum
02:33 PM Bug #2832 (Resolved): [argonaut] config set foo bar broken?
Sage Weil
01:32 PM Bug #2832 (Resolved): [argonaut] config set foo bar broken?
... Sage Weil
01:08 PM Bug #2831 (Duplicate): librados: LibRadosWatchNotify.WatchNotifyTest segfault
never mind, I cherry-picked a bad version of the linger op fix for #2796 Sage Weil
12:48 PM Bug #2831 (Duplicate): librados: LibRadosWatchNotify.WatchNotifyTest segfault
On the stable-next branch, commit:7e9336e7cb36c6640fd5c8509c9c0ffd87b15d7a,
I hit this sometime last week or the w...
Sage Weil
11:59 AM Bug #2824 (In Progress): ceph-fuse; hang mounting with ms failures
Sage Weil
10:58 AM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
commit:f565ace62a1235dbe6a4ee9db5c5233cea395446 Sage Weil
08:44 AM Bug #2799: osd: pg log trimming zeroing broken
this was a bug in pg log trimming/zeroing. thankfully it was only enabled in master! will backport the fix all over... Sage Weil
08:25 AM Bug #2799: osd: pg log trimming zeroing broken
... Sage Weil
10:14 AM Subtask #2805: mon: Single-Paxos: Sync: Create a test unit to verify the correctness of the whole...
Current available test units:
* Removing keys:
> * Using the whole-space iterator, both in LevelDBStore and KeyVa...
Joao Eduardo Luis
09:19 AM Bug #2830 (Duplicate): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
0> 2012-07-24 14:38:09.918736 7fe53cede700 -1 osd/OSD.cc: In function 'OSDMapRef OSD::get_map(epoch_t)' thread 7... Sage Weil

07/23/2012

05:17 PM Bug #2822 (Resolved): rbd: ASSIGN_BID both mutates object and returns data
commit:48bd839b1e25b063c675416a8f6233463f1af115 on master
commit:7dfdf4f8de16155edd434534e161e06ba7c79d7d for argonaut
Sage Weil
05:03 PM Feature #2829 (Resolved): report on cluster size/status (for service billing purposes)
The plan is to base support contract pricing on capacity,
which means we need a quick, easy (and moderately trusted)...
Anonymous
04:59 PM Bug #2828 (Resolved): osd: assign_bid was allowed to mutate and return data
The osd shouldn't have allowed this. Figure out why, and then see if there is a graceful way to fix it without break... Sage Weil
04:52 PM Bug #2823 (Resolved): osd: out of order ACKs
commit:67832c34a228c5cff1941eab3225969b5e66e44b Sage Weil
08:47 AM Bug #2823: osd: out of order ACKs
specifically, this is out of order ACKs when ops are resent. trivially reproducible with the thrasher + radosmodel t... Sage Weil
02:44 PM Bug #2827 (Rejected): mon: ceph health string doesn't match "ceph -s" output
I've seen several ceph -s reports lately where the first line on the PGs doesn't match the output of the PG line. My ... Greg Farnum
11:11 AM devops Feature #2553 (Closed): crowbar: open question: What's the correct way to add RBD support to the ...
This ticket was an arbitrary reminder to send upstream an email with questions, and we have answers & sort of underst... Anonymous
11:04 AM Feature #2826 (Resolved): osd: backfill target reservations
When osds are marked out, the recovery ops per primary throttling works fine, because the backfill targets tend to be... Sage Weil
10:45 AM CephFS Bug #2187: pjd chown/00.t failed test 97
also,
ubuntu@teuthology: /a/teuthology-2012-07-23_05:00:03-regression-stable-master-basic/16425
from teuthology...
Tamilarasi muthamizhan
10:42 AM CephFS Bug #2187: pjd chown/00.t failed test 97
Recent logs: ubuntu@teuthology: /a/teuthology-2012-07-23_05:00:03-regression-stable-master-basic/16416
From teutho...
Tamilarasi muthamizhan
10:14 AM Bug #2787: RBD test copy fails when thrashing
Recent logs: ubuntu@teuthology:/a/teuthology-2012-07-23_00:00:08-regression-next-testing-basic/16393
ubuntu@teutholo...
Tamilarasi muthamizhan
08:59 AM CephFS Bug #2825: File lock doesn't work properly
Here is the code included in *pre* ;)... Jean-Sébastien Frerot
08:56 AM CephFS Bug #2825 (Resolved): File lock doesn't work properly
Hi,
I've been testing the locking feature on cephfs and it seems that the locking doesn't work as expected when locki...
Jean-Sébastien Frerot

07/22/2012

10:46 PM Bug #2824 (Resolved): ceph-fuse; hang mounting with ms failures
... Sage Weil
10:36 PM Bug #2823 (Duplicate): osd: out of order ACKs
... Sage Weil
10:34 PM Bug #2822: rbd: ASSIGN_BID both mutates object and returns data
ubuntu@teuthology:/a/sage-2012-07-22_21:26:14-regression-wip-msgr-testing-basic/16193 too
Sage Weil
10:33 PM Bug #2822 (Resolved): rbd: ASSIGN_BID both mutates object and returns data
It's not okay to mutate data *and* return data, because when resending the operation (e.g., due to a connection fault... Sage Weil
09:40 PM Bug #2502 (Resolved): osd: do not mark self back up if internal heartbeats are failing
Sage Weil
09:34 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
Sage Weil
09:32 PM Bug #2799: osd: pg log trimming zeroing broken
Sage Weil
09:29 PM CephFS Bug #1677: mds interval_set.h: 385: FAILED assert(p->first <= start)
Sage Weil

07/21/2012

02:12 PM CephFS Bug #2187: pjd chown/00.t failed test 97
Latest logs: ubuntu@teuthology:/a/teuthology-2012-07-21_05:00:03-regression-stable-master-basic/15237
ubuntu@teuth...
Tamilarasi muthamizhan
09:17 AM Bug #2638 (Fix Under Review): mon: make pool ops idempotent
Sage Weil
09:17 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
commit:b497bdacf5d5732285d280772ab9ce6ae88f59e8 Sage Weil
08:15 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
... Sage Weil

07/20/2012

11:09 PM Bug #2820: osd: crash in handle_osd_ping
also ubuntu@teuthology:/a/sage-2012-07-20_18:17:10-regression-wip-msgr-cleanup-testing-basic/14885
- chef: null
-...
Sage Weil
11:08 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
... Sage Weil
10:45 PM RADOS Subtask #2793 (Fix Under Review): osd: require tunable feature if current osdmap uses non-default...
Sage Weil
05:22 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Samuel Just
04:30 PM Bug #2819 (Won't Fix): krbd: lockup on large writes, msgr fault injection
krbd + osd socket failure injection + iozone -> crash in uml, without any useful debugging. uml itself either locks ... Sage Weil
04:07 PM rbd Feature #2559 (Fix Under Review): cls_rbd: copyup method
Sage Weil
04:07 PM rbd Feature #2607 (Fix Under Review): librbd: copyup helper
Sage Weil
03:45 PM Bug #2638 (In Progress): mon: make pool ops idempotent
Sage Weil
03:40 PM Documentation #2729 (Resolved): doc: OSD internals
Sage Weil
03:25 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
Responding to Sage's possible kinks:
1. the cephbooter issue only comes up with new installs, existing vms with th...
Anonymous
03:19 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
I moved http://gitbuilder-precise-i386.front.sepia.ceph.com/ and wrote a doc on how to move whole images: https://doc... Anonymous
02:37 PM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
09:57 AM RADOS Subtask #2791 (In Progress): crush: feature bit for crush tunables
Sage Weil
02:20 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
Sage Weil
09:59 AM Messengers Bug #2802 (In Progress): msgr: mds session hangs on direct_io test
just a misleading message; cleaned up. the direct_io test (which leaves the mds connection mostly idle) still fails ... Sage Weil
01:50 PM Feature #2812 (Resolved): automated CentOS testing
(1) testing on anything other than Precise means we need the ability to re-image test machines or to run teuthology o... Anonymous
01:46 PM devops Feature #2808: crowbar: upgrade to fred3 (get bind bug fix)
Test that the bind problem has been fixed; nodes should be able to resolve other node names to IP addresses without ... Anonymous
01:45 PM devops Feature #2808 (Rejected): crowbar: upgrade to fred3 (get bind bug fix)
Anonymous
01:41 PM Feature #2807 (Resolved): RPM git-builder
We have customers who want to run Ceph on RHEL.
A lot of people run RHEL (or CentOS) and we should be building
(an...
Anonymous
01:26 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
The doc is fine as it is; it seems the new upstream makes this a lot nicer. Anonymous
10:46 AM devops Feature #2785: crowbar: custom ISO builder that includes barclamp-ceph
Crowbar cleaned up the build process so I was able to go through and build an ISO with our barclamp rather easily. Th... JuanJose Galvez
01:19 PM devops Feature #2574 (In Progress): crowbar: use data disks automatically, journal inside data directory
Anonymous
11:17 AM rbd Bug #2535 (Resolved): rbd: random data corruption in vm
Thanks for reminding us, Florian. There are no open questions that I know of; the only problem with this was fiemap, s... Josh Durgin
11:11 AM rbd Bug #2535: rbd: random data corruption in vm
Just so this doesn't fall off the table, can this be considered resolved with the changed @filestore fiemap@ default? ... Florian Haas
10:06 AM Bug #2761: osd: failed to recover before timeout expired
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-20_05:00:03-regression-stable-master-basic/14803
from teuthol...
Tamilarasi muthamizhan
10:05 AM rbd Bug #2715: krbd: spinlock wrong CPU
hoping this was caused by the mutex-less con_open, or something similar. will keep this open for a few more days to ... Sage Weil
09:29 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Trimming through Paxos on the services is the way to go, no doubt. However, it's a whole world of pain when it comes ... Joao Eduardo Luis
08:33 AM Bug #2799: osd: pg log trimming zeroing broken
again today: ubuntu@teuthology:/a/teuthology-2012-07-19_19:00:08-regression-master-testing-gcov/14585 Sage Weil
07:13 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
We have added support for a whole-space iterator and a snapshot iterator (task #2758) on KeyValueDB and LevelDB, and ... Joao Eduardo Luis

07/19/2012

10:06 PM rgw Feature #2804 (Rejected): rgw: disallow running multiple gateways on the same fastcgi socket
We saw an incident where multiple gateways were running on the same machine, both pointing to the same apache <-> rad... Yehuda Sadeh
09:35 PM rbd Feature #2788 (Fix Under Review): rbd: add -l to rbd ls, show snapshots
wip-rbd-ls-R Dan Mick
03:55 PM rbd Feature #2788 (In Progress): rbd: add -l to rbd ls, show snapshots
Dan Mick
09:13 PM Bug #2502 (Fix Under Review): osd: do not mark self back up if internal heartbeats are failing
wip-osd-hb Sage Weil
09:02 PM Bug #2638: mon: make pool ops idempotent
this needs to be fixed in order to get the msgr failure injection stuff in the qa suite Sage Weil
09:01 PM Bug #2771: mon|osd: fail to create pgs on startup
commit:ca9f71300471e1947be63e5dfad1a4800c1ffc9c and a few commits that follow Sage Weil
09:00 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
Sage Weil
05:52 PM Bug #2803 (Resolved): filer: probe crash
... Sage Weil
05:50 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
... Sage Weil
05:21 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
still working on part that dumps slowest ops over past N seconds Samuel Just
04:49 PM Linux kernel client Bug #2801 (Resolved): msgr crash in ceph_msg_new
... Sage Weil
04:15 PM Linux kernel client Feature #1699 (Resolved): debug symbols in autobuilt (sepia) kernels
the symbols are there. the trick is for objdump to find them in the original path. do this:
sudo mkdir -p /srv/a...
Sage Weil
04:13 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
... Sage Weil
03:18 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-07-19_08:03:32-marginal-master-testing-basic/14125 Sage Weil
03:55 PM rbd Subtask #2606 (Fix Under Review): librbd layering: copyup on missing child object
Dan Mick
03:54 PM rbd Subtask #2606 (In Progress): librbd layering: copyup on missing child object
Dan Mick
02:45 PM Bug #2800: osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
Might be a result of reordered or dropped messages. Samuel Just
02:31 PM Bug #2800 (Resolved): osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first ...
ceph-sha1: 508bf3fb96929dc6e91ee83a915a7f30df23eeeb
description: collection:thrash clusters:6-osd-3-machine.yaml fs:...
Samuel Just
02:26 PM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
... Sage Weil
12:31 PM devops Feature #2780 (In Progress): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
12:30 PM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Good enough for now. Anonymous
05:09 AM Tasks #2789: pkgconfig files for libraries
No, I'm talking about: http://en.wikipedia.org/wiki/Pkg-config
I added a patch which adds these files, but I'm not...
Wido den Hollander

07/18/2012

02:17 PM Bug #2798 (Resolved): handle_osd_ping assert
Samuel Just
10:40 AM Bug #2798: handle_osd_ping assert
Also,
ubuntu@teuthology: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13039
ubuntu@teuthology...
Tamilarasi muthamizhan
10:38 AM Bug #2798: handle_osd_ping assert
Recent logs: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13020
ubuntu@teuthology:/a/teuthol...
Tamilarasi muthamizhan
01:46 PM rbd Bug #2478 (Can't reproduce): krbd: unmap on 3.4.0: scheduling while atomic...
Sage Weil
01:32 PM Bug #2022: osd: misdirected request
Writing down things discovered due to conversation on irc:
The log message happens with v0.48argonaut, it seems 41...
Anonymous
12:55 PM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
Sage Weil
10:17 AM CephFS Bug #2187: pjd chown/00.t failed test 97
... Sage Weil

07/17/2012

09:42 PM Bug #2067 (Resolved): librados: we leak CephContext from rados_create()
Sage Weil
09:42 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
Sage Weil
07:21 PM CephFS Bug #1737 (Resolved): ceph-fuse crash in xlist::remove
commit:d78235be1b5c20440f3a55abb3f4b980aa3f3596 Sage Weil
07:11 PM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
Sage Weil
08:57 AM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
this is 100% reproducible on my notebook, but not on my other boxes. debugging it there. :(
Sage Weil
07:11 PM Bug #2796: osd: watch state not reestablished when registration op resent
Sage Weil
12:45 PM Bug #2796 (Fix Under Review): osd: watch state not reestablished when registration op resent
Sage Weil
09:04 AM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
if the client doesn't get the watch ack and resends, the osd will ignore it as a dup op, and the watch session state ... Sage Weil
04:23 PM Bug #2798 (Resolved): handle_osd_ping assert
ceph version 0.48argonaut-404-gabe05a3 (commit:abe05a3fbbb120d8d354623258d9104584db66f7)
1: (OSDMap::get_cluster_in...
Samuel Just
02:36 PM devops Feature #2782: chef: easy scripts for running in vercoi vms
Anonymous
02:20 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Done as of barclamp-ceph.git commit 36842bd3e3db9fea65e2921bb7b57e9088964621. Anonymous
02:03 PM rgw Feature #2797 (Resolved): rgw: support multi-objects delete
Support a multi-objects delete S3 api call. Yehuda Sadeh
10:39 AM Subtask #2758 (Fix Under Review): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyVal...
From Task #2756:
> * We changed KeyValueDB:
> > * Added a Whole-space iterator interface
> > * Implemented the p...
Joao Eduardo Luis
10:32 AM Subtask #2756 (Fix Under Review): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
10:32 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Just a couple of notes on this regard:
* We changed KeyValueDB:
> * Added a Whole-space iterator interface
> * I...
Joao Eduardo Luis
08:56 AM RADOS Subtask #2793 (Resolved): osd: require tunable feature if current osdmap uses non-default tunables
Sage Weil
08:56 AM RADOS Subtask #2792 (Resolved): mon: require tunable feature bit if current osdmap uses non-default tun...
Sage Weil
08:52 AM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
08:48 AM Tasks #2789: pkgconfig files for libraries
There is a librados-config in librados-dev. Does that mean we just need a librbd-config too? Sage Weil
06:12 AM Tasks #2789 (New): pkgconfig files for libraries
From the libvirt mailing list I got the message that they'd like to see pkg-config for at least librbd and librados.
...
Wido den Hollander
08:29 AM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
... Sage Weil

07/16/2012

10:45 PM rbd Feature #2788 (Resolved): rbd: add -l to rbd ls, show snapshots
Add -l to rbd ls to show snapshots, perhaps size of parent Dan Mick
09:58 PM rbd Feature #2717: cls_rbd: add methods for maintaining mapping from parent to child images
Dan Mick
09:58 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
Dan Mick
09:57 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
From the design doc:
In addition to knowing which parent a given image has, we want to be able to tell if a protec...
Dan Mick
06:34 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Currently have fsid generated via uuidgen, monitor is generated via the python script which is residing in barclamp-c... Tyler Brekke
06:28 PM rbd Feature #2561 (Fix Under Review): rbd: copyup command
Implemented at the moment as "flatten". Dan Mick
06:26 PM rbd Feature #2561 (In Progress): rbd: copyup command
Dan Mick
06:27 PM rbd Feature #2559: cls_rbd: copyup method
Dan Mick
06:26 PM rbd Subtask #2604 (In Progress): librbd layering: read path
Josh Durgin
06:26 PM rbd Subtask #2605 (In Progress): librbd layering: guard writes
Josh Durgin
04:07 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
Sage Weil
03:04 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Samuel Just
03:04 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Probably fixed in c7fb964c077d369943bd5c066c5f99da6bd5f37c.
Backported to stable: ec5cd6def9817039704b6cc010f2797a70...
Samuel Just
11:17 AM Bug #2761: osd: failed to recover before timeout expired
Logs: ubuntu@teuthology: /a/teuthology-2012-07-14_00:00:03-regression-next-testing-basic/11248
ubuntu@teuthology:/...
Tamilarasi muthamizhan
11:01 AM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
Samuel Just
11:01 AM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
11:00 AM Bug #2787 (Resolved): RBD test copy fails when thrashing
Logs: ubuntu@teuthology:/a/teuthology-2012-07-15_00:00:04-regression-next-testing-basic/11629
Attached below an ex...
Tamilarasi muthamizhan
10:55 AM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Samuel Just
10:48 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
Sage Weil
10:47 AM Bug #2022 (Resolved): osd: misdirected request
YAY Sage Weil
10:47 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Sage Weil

07/15/2012

09:40 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Sage Weil
09:40 PM Bug #2593 (Resolved): logmonitor: decode failure
aie. commit:fe57681892b0bf3155ddeaeaa79ddbf997e27af9 Sage Weil
08:43 PM Feature #2330 (Resolved): dump open files, sockets when we run out of fds
commit:bf9a85ade6f7b06ed28ff7930b9e33a41acf0e34 Sage Weil

07/14/2012

05:39 PM Bug #2022 (Fix Under Review): osd: misdirected request
FINALLY tracked this one down. tests consistently now pass. bugfix-2022 Sage Weil
05:29 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
Need to be able to remove objects/buckets. Bucket removal can optionally remove all objects. Another option is to set... Yehuda Sadeh

07/13/2012

04:58 PM Bug #2748: include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
https://github.com/athanatos/ceph/commits/wip_biginfo Samuel Just
04:57 PM Subtask #825 (Resolved): osd: remove pg map updating from handle_osd_map
Samuel Just
04:57 PM Bug #2461 (Resolved): DBObjectMap is incompatible with collection_rename
Samuel Just
03:57 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
for the record, this is being done in this sprint for two reasons:
(a) we are under some pressure to give up the old...
Anonymous
09:39 AM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
A few possible kinks:
* The fabfile currently pulls the magic ssh keys for rsync from cephbooter. we need a new ...
Sage Weil
09:34 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
12:40 PM devops Feature #2785 (In Progress): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:40 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:38 PM devops Feature #2662 (In Progress): crowbar: Make barclamp-ceph set mon initial members, monitor-secret,...
Anonymous
12:37 PM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Tyler says the google doc is good to go. Anonymous
12:32 PM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
11:08 AM Bug #2784 (Resolved): osd hit suicide timeout
Log: ubuntu@teuthology:/a/teuthology-2012-07-12_19:00:15-regression-master-testing-gcov/10615
ubuntu@teuthology:/a...
Tamilarasi muthamizhan
10:22 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
5924f8e4a8c29e6de326a9e8576c30109cdc0e07 Samuel Just
09:50 AM devops Bug #2783 (Won't Fix): Chef-client fails on oneiric cluster
While verifying the web docs for Chef deployment of ceph, chef-client fails with the following error on an oneiric cl... Ken Franklin
09:45 AM Bug #2779 (Fix Under Review): mon: [near]full status doesn't get purged when osds are removed
tag! Sage Weil
07:12 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Date: Fri, 13 Jul 2012 12:17:47 +0400
From: Andrey Korolyov <andrey@xdel.ru>
To: ceph-devel@vger.kernel.org
Subjec...
Sage Weil
09:43 AM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Anonymous
09:42 AM devops Feature #2781 (Closed): crowbar: easy scripts for running in vercoi vms
Anonymous
09:33 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Works well enough for first iteration. Anonymous
09:32 AM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:12 AM rbd Bug #2777: qemu: report discard support
It works for me; I tested ext4 with discard and ext4 with fstrim. Martin Mailand

07/12/2012

05:25 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
wip_2769 should fix it. Samuel Just
04:55 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
bcfa573f5f615f3403ff71da0212cd1cee7e7d9c
The core provided pretty good support for this sequence of events, markin...
Samuel Just
04:40 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Sage Weil
02:31 PM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
*next is enough to figure out whether we ended on a boundary. Samuel Just
02:07 PM Feature #2680 (Resolved): osd: report backfill progress via query
5450567a6769b64ace385140ffb645d9d805e354 Samuel Just
12:22 PM rbd Bug #2777: qemu: report discard support
It looks like this is handled by what qemu calls a 'device property', so you have to use a command line like:
<pre...
Josh Durgin
08:14 AM rbd Bug #2777: qemu: report discard support
Yeah, QUEUE_FLAG_DISCARD is as far as I looked yesterday too. The question now is how to set that within qemu. Josh Durgin
11:23 AM CephFS Bug #2778 (Can't reproduce): uclient: failed asserts if mounting with a non-existent directory
From Noah.
If Ceph is mounted with a non-existent directory, the mount call returns a success, but on unmount does t...
Greg Farnum
11:23 AM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-12_05:00:04-regression-stable-master-basic/9637
ubuntu@teutho...
Tamilarasi muthamizhan
09:58 AM RADOS Feature #2101 (Resolved): crushtool: check for weight overflow on reweight
Sage Weil

07/11/2012

11:18 PM rbd Bug #2777: qemu: report discard support
I don't know what I'm talking about, but stumbling around in the kernel seems to point at QUEUE_FLAG_DISCARD as the f... Dan Mick
02:19 PM rbd Bug #2777 (Resolved): qemu: report discard support
todin reported problems enabling discard inside qemu.
I'm not sure if this is rbd-specific, but there seem to be s...
Josh Durgin
11:14 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9430
ubuntu@teutho...
Tamilarasi muthamizhan
09:01 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
... Sage Weil
11:10 PM rbd Bug #2715: krbd: spinlock wrong CPU
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9371
12-07-11T19:36...
Tamilarasi muthamizhan
09:10 PM Bug #2772 (Resolved): rados bench -t broken
Sage Weil
10:16 AM Bug #2772: rados bench -t broken
we can't change bench flags.
just don't make a short one-char flag for the target... use --target or whatever instead
Sage Weil
10:12 AM Bug #2772: rados bench -t broken
FWIW, I hacked up a patch last night (changed -t to -n to avoid conflicts, and made some changes to the usage message... Dan Mick
02:23 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
Sage Weil
01:54 PM Bug #2387 (Duplicate): mon: could not get service secret for auth subsystem
Sage Weil
11:31 AM Feature #2776 (Resolved): rados tool: bulk removal of objects
Provide an option to remove multiple objects in a single command. Yehuda Sadeh
10:08 AM Bug #2771: mon|osd: fail to create pgs on startup
i think this is related:... Sage Weil
09:19 AM Feature #2775 (Rejected): rest-bench man page
Sage Weil
09:19 AM devops Documentation #2774 (Resolved): doc: ceph-disk man page
commit:a450cab2b8148cb8a9b043d629feccf89e5aabac Sage Weil
09:11 AM rbd Subtask #2603 (Fix Under Review): librbd layering: open parent on open
In branch wip-rbd-parent Josh Durgin
08:16 AM Feature #2666 (Resolved): rados tool: copy pool
Sage Weil
04:35 AM rbd Feature #2773 (Resolved): libvirt: incorrect snapshot behavior
Since consistent (disk and memory state) snapshots are currently supported only with qcow images, libvirt should:
- all...
Andrey Korolyov

07/10/2012

10:13 PM Bug #2772: rados bench -t broken
actually the -t param is for 'rados cp' which is for an object, not for cppool. Yehuda Sadeh
10:02 PM Bug #2772: rados bench -t broken
let's just make it --target-pool or similar. this is an uncommon enough command that it doesn't really need a single... Sage Weil
10:00 PM Bug #2772 (Resolved): rados bench -t broken
The -t parameter has been assigned to rados cppool (target pool), but apparently it was already being used for rados... Yehuda Sadeh
09:56 PM Feature #2714 (In Progress): teuthology: store monitor data in archives
Sage Weil
09:25 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
... Sage Weil
07:22 PM rbd Feature #2770 (Resolved): krbd: define tasks to add osd_client compound class op support
Sage Weil
04:38 PM Bug #2387 (Can't reproduce): mon: could not get service secret for auth subsystem
Sage Weil
04:26 PM Bug #2600 (Resolved): osd: crazy long watch timeout?
Sage Weil
09:28 AM Bug #2600 (In Progress): osd: crazy long watch timeout?
Sage Weil
02:52 PM Bug #2765: Extremely Long Object Latencies
I noticed one 15 second lag on an OSD caused by the filestore requiring 15 seconds to drain open ops before a sync. Samuel Just
02:52 PM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
Merged in commit:96ca508dcd227db11597cd4963208872af30fcc4 Josh Durgin
02:24 PM rbd Feature #2585 (Resolved): rbd: clone command
commit:83339a0cbb925022c44507ab0ea3da7b399f511e
Dan Mick
11:06 AM Bug #2767 (Resolved): rbd workunit failing
The real problem is that the workunit task only grabs things from master, since github only seems to create tarballs ... Josh Durgin
09:24 AM Bug #2767 (Resolved): rbd workunit failing
... Sage Weil
10:47 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
Log location: ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8335
ubuntu@teuth...
Tamilarasi muthamizhan
10:33 AM Feature #2768 (Resolved): teuthology: make workunit task work on different branch/sha1 etc
This involves making it download scripts individually instead of from a single tarball, since github only creates tar... Josh Durgin
10:14 AM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8328 Sage Weil
10:13 AM Bug #2453 (In Progress): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
ubuntu@teuthology:/a/teuthology-2012-07-10_05:00:02-regression-stable-master-basic/8546 Sage Weil
09:06 AM Subtask #2757: mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transactions
Pending testing Joao Eduardo Luis

07/09/2012

09:59 PM Bug #2067: librados: we leak CephContext from rados_create()
librados does not destroy the CephContext created in rados_create() in rados_destroy() (or the c++ equivalents). Thi... Sage Weil
09:59 PM Bug #2766 (Resolved): cephcontext: needs refcount
CephContext needs to be refcounted, so that it can be properly shared. Sage Weil
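The idea behind #2766 and #2067 (a shared context that is destroyed only when its last user lets go) can be sketched as follows. This is a minimal illustration in Python, not the Ceph implementation: the class name `RefCountedContext` and the `get()`/`put()` method names are assumptions made for this example.

```python
import threading


class RefCountedContext:
    """Hypothetical stand-in for a shared context object (not Ceph code)."""

    def __init__(self):
        self._lock = threading.Lock()
        self._refs = 1          # the creator holds the first reference
        self.destroyed = False

    def get(self):
        """Take an additional reference and return the context."""
        with self._lock:
            if self._refs == 0:
                raise RuntimeError("get() on a destroyed context")
            self._refs += 1
        return self

    def put(self):
        """Drop one reference; tear the context down when none remain."""
        with self._lock:
            if self._refs == 0:
                raise RuntimeError("put() on a destroyed context")
            self._refs -= 1
            last = self._refs == 0
        if last:
            self._destroy()

    def _destroy(self):
        # Real teardown (threads, sockets, logs) would happen here.
        self.destroyed = True


if __name__ == "__main__":
    ctx = RefCountedContext()
    shared = ctx.get()      # a second owner takes a reference
    ctx.put()               # the creator lets go; context stays alive
    print(ctx.destroyed)
    shared.put()            # last owner lets go; context is destroyed
    print(ctx.destroyed)
```

With this shape, a create call would hold the initial reference and the matching destroy call would simply `put()` it, so the context is freed exactly once regardless of how many other holders exist.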
09:37 PM Bug #2600: osd: crazy long watch timeout?
Oh.... Sage Weil
06:41 PM Bug #2765 (Closed): Extremely Long Object Latencies
Saw this during testing on one of the aging clusters.
example object tid: 312322
Full logs available here:
...
Mark Nelson
06:03 PM Bug #2508 (Resolved): osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobje...
645ac87b84ae348483c7d52b1968af5f44378dea Samuel Just
06:02 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
67a5e75ed6a9b3689a984d6f7eb6a18e1242125e Samuel Just
04:12 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
Remove the loud warnings? Like, the mismatch warnings? Greg Farnum
04:08 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
Ok, so this is actually caused by not updating the backfill peer's partial stats while recovering the backfill portio... Samuel Just
03:10 PM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
Merged as 90c42837696d252be40348fdd4499e15aa835aeb. Anonymous
01:56 PM Linux kernel client Bug #2764 (Closed): xfstest hang; osd socket closed messages
... Sage Weil
01:51 PM Cleanup #2763 (Resolved): move rbd locking infrastructure to a separate objclass
Locks are generic and can serve other uses (e.g., rgw). We should separate them from the rbd code. Yehuda Sadeh
12:52 PM Bug #2762 (Resolved): mon crash ceph::__ceph_assert_fail (assertion=0x63d150 "begin->last_committ...
#0 0x00007fb9b32b5b7b in raise (sig=<optimized out>) at ../nptl/sysdeps/unix/sysv/linux/pt-raise.c:42
#1 0x0000000...
Samuel Just
12:07 PM Bug #2761 (Resolved): osd: failed to recover before timeout expired
Logs: ubuntu@teuthology:/a/teuthology-2012-07-09_05:00:08-regression-stable-master-basic/8039
Attaching below an e...
Tamilarasi muthamizhan
11:54 AM Bug #2022: osd: misdirectect request
Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
Attaching the...
Tamilarasi muthamizhan
11:50 AM Bug #2022: osd: misdirectect request
Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7824
Attaching the...
Tamilarasi muthamizhan
11:45 AM Bug #2022: osd: misdirectect request
latest log: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
and
ubuntu@t...
Tamilarasi muthamizhan
11:33 AM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
... Anonymous
11:21 AM Linux kernel client Bug #2759 (Resolved): libceph: crush tree algorithm is not understood
After changing the crush algorithm for a bucket to tree, the kernel client cannot parse the OSDMap:... Josh Durgin
10:31 AM RADOS Feature #2101: crushtool: check for weight overflow on reweight
caleb miles
09:38 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
Extend the in-memory mock-up of KeyValueDB to support the safe iterator, and create an iterator to iterate over all t... Joao Eduardo Luis
09:16 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
09:05 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
The current iterator implementation iterates directly over the database state, even if there are concurrent updates.
...
Joao Eduardo Luis
09:15 AM Subtask #2757 (Resolved): mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transac...
Using the LevelDBStore's safe iterator, obtain the existing key/values and pack them into transactions.
These tran...
Joao Eduardo Luis
08:57 AM Feature #2755 (Resolved): ceph-conftool: optionally return the default for a config option if no ...
Presently, the only way to figure out what the default for a specific configuration option is, is to consult the sour... Florian Haas

07/07/2012

06:19 AM Linux kernel client Bug #2754 (Resolved): cephfs show_location produces kernel "divide error: 0000 [#1]" when run aga...
Originally reported in http://marc.info/?l=ceph-devel&m=134151028212170&w=2:
Really easy to reproduce on my 3.2.0 ...
Florian Haas
06:16 AM CephFS Bug #2753 (Resolved): Writes to mounted Ceph FS fail silently if client has no write capability o...
Originally reported in http://marc.info/?l=ceph-devel&m=134151023912148&w=2:
How to reproduce (this is on a 3.2.0 ...
Florian Haas
06:12 AM Bug #2752: Setting large maxosd kills all mons
Original ML response from Greg (http://marc.info/?l=ceph-devel&m=134151077312444&w=2):
Ungh. Can you file a bug repo...
Florian Haas
06:10 AM Bug #2752 (Resolved): Setting large maxosd kills all mons
Originally reported in http://marc.info/?l=ceph-devel&m=134151018912133&w=2:
When setting a large maxosd, all mons...
Florian Haas

07/06/2012

08:03 PM Feature #2743 (Resolved): new cond assertions, associated cleanups
Sage Weil
09:41 AM Feature #2743 (Resolved): new cond assertions, associated cleanups
Sage Weil
05:05 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
Sage Weil
03:23 PM Feature #2702: gitbuilder: sync each build as it completes
Sage Weil
09:43 AM Feature #2702 (In Progress): gitbuilder: sync each build as it completes
Sage Weil
05:02 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
Yehuda Sadeh
04:35 PM rgw Feature #2735: rgw: support response-* request params in GET obj
Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a. Yehuda Sadeh
04:48 PM Feature #2680: osd: report backfill progress via query
not yet tested https://github.com/athanatos/ceph/tree/wip_recovery_query Samuel Just
04:45 PM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
in progress already https://github.com/athanatos/ceph/tree/wip_biginfo
blocked by 2748
Samuel Just
04:35 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
flavor: basic
owner: scheduled_sage@metropolis
success: false
ubuntu@teuthology:/a/sage-2012-07-06_11:03:15-regress...
Samuel Just
04:35 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a. Yehuda Sadeh
10:58 AM rgw Bug #2734: rgw: fails to authenticate preauthenticated requests with sub resource
Fails only with specific sub resources (e.g., response-*) that were not hard coded. Yehuda Sadeh
04:34 PM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
roles:
- - mon.a
- osd.0
- osd.1
- osd.2
- - mds.a
- osd.3
- osd.4
- osd.5
- - client.0
targets...
Samuel Just
04:32 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Saw on master
osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ceph version 0.48argona...
Samuel Just
04:23 PM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
03:51 PM Bug #2600: osd: crazy long watch timeout?
another instance:... Sage Weil
03:47 PM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
... Sage Weil
01:23 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
Fixed, commit:3df51040b19b9c110e08f42befa0cae18cbb984c. Yehuda Sadeh
11:34 AM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
Thanks, updated the doc to increase the VM size. Tyler Brekke
10:24 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Currently, Paxos' own trimming goes through the Paxos proposal mechanism itself.
However, services do not do it...
Joao Eduardo Luis
09:29 AM Subtask #2737 (Resolved): mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
This includes Paxos' own trimming, so it is done at the same time on all the quorum members. Furthermore, it allows the ... Joao Eduardo Luis
10:18 AM Subtask #2738: mon: Single-Paxos: Sync: Add snapshot support to the monitor store
Implemented, although it has not been tested yet. Joao Eduardo Luis
09:31 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
Having snapshot support, and being able to iterate over it instead of iterating directly over the store's keys, allow... Joao Eduardo Luis
10:09 AM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
latest logs:
/a/teuthology-2012-07-06_00:00:03-regression-next-testing-basic/6655
Tamilarasi muthamizhan
10:01 AM Subtask #2746 (Rejected): mon: Single-Paxos: Sync: Test message passing
Make sure the message passing is working before starting to work on the store synchronization.
From the monitor's ...
Joao Eduardo Luis
09:58 AM Subtask #2745 (Closed): mon: Single-Paxos: Sync: Add new message support to the Monitor class
Three different "roles" on a monitor cluster regarding synchronization:
* Leader - responsible for disabling the P...
Joao Eduardo Luis
09:53 AM Subtask #2744 (Resolved): mon: Single-Paxos: Sync: Create new Message type
Create a new message type MMonSync, to dissociate it from the MMonProbe messages, and to allow us to use fields speci... Joao Eduardo Luis
09:43 AM RADOS Feature #2706 (In Progress): crush: update kernel code to decode tunables
Sage Weil
09:40 AM Feature #2742 (Resolved): qa: ms socket inject failures in regression suite
Sage Weil
09:40 AM Subtask #2741 (Resolved): mon: Single-Paxos: Sync: Assess requirements for QA tests
TBD Joao Eduardo Luis
09:37 AM CephFS Bug #2740 (Resolved): mds: crash in Objecter when shutting down too early
Jimmy Tang reported this on the mailing list.... Greg Farnum
09:36 AM Subtask #2739 (Resolved): mon: Single-Paxos: Sync: Synchronize the store of a drifted monitor
Synchronize the store of a monitor (mon.X) that drifted away from the system (say, it failed for a considerable amoun... Joao Eduardo Luis
09:22 AM Subtask #2736 (Resolved): mon: Single-Paxos: Sync: Implement message passing
See subtasks. Joao Eduardo Luis
01:37 AM Bug #2267: Ceph client crashed after shutting down one mds and osd
To which version do you plan to add this patch? Maciej Galkiewicz

07/05/2012

10:30 PM Feature #2692 (Resolved): stable testing debian repos
Sage Weil
10:16 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
bad con->private = NULL in monc __close_session Sage Weil
09:38 PM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
easy to reproduce with... Sage Weil
08:44 AM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
I just reproduced this several times on master using the socket failure injection. This is probably socket errors br... Sage Weil
09:37 PM Bug #2022: osd: misdirectect request
ugh, wrong bug.. ignore last two messages! Sage Weil
05:19 PM Bug #2022: osd: misdirectect request
this is easily reproduced with... Sage Weil
05:18 PM Bug #2022 (In Progress): osd: misdirectect request
... Sage Weil
06:18 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
The size of a VM is adjustable:... Greg Farnum
05:49 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
Tested on vercoi/downburst
https://docs.google.com/a/inktank.com/document/d/1C1QLD54sbpRJ7o6M9ML2TrexXC5JIMecHJxOM...
Tyler Brekke
05:21 PM Cleanup #2041 (Resolved): osd: move peering into worker threads
Sage Weil
05:08 PM Linux kernel client Bug #2523 (Resolved): xfs: xfs_iolock_reclaimable
patches from hch fix this up. added them to testing branch. Sage Weil
04:18 PM rbd Bug #2488 (Can't reproduce): rbd: EIO from rbd map
Sage Weil
04:18 PM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
Sage Weil
04:18 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Sage Weil
04:13 PM Messengers Bug #2569 (Resolved): msgr: connect_rank crash
Sage Weil
04:12 PM devops Feature #2698: crowbar: Guide for using "front" network
I verified that this causes no trouble before crowbar is installed and that eth1 does come up automatically after reb... JuanJose Galvez
03:11 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
Yehuda Sadeh
03:10 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
Yehuda Sadeh
02:11 PM Feature #2733 (Resolved): osd: fadvise replica data don't use
This should help quite a bit with cache pressure. Samuel Just
01:56 PM Feature #2485 (Resolved): crush: validate hash function
Sage Weil
01:29 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
We can also keep data about the last N slow requests on the OSD to allow easier debugging. Samuel Just
01:16 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
syntax should be something along the lines of:
ceph tell osd <num> query_ops
Result should be a json formatted ...
Samuel Just
01:20 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
The apt-get update command I added said "apt-key update", which isn't the same at all. Fixed in
commit:6d351b0b2...
Dan Mick
01:18 PM Feature #2732 (Resolved): Idempotency tester
Add proper support for:
* omap
* xattr
* collection_rename
Samuel Just
01:17 PM Cleanup #2731 (Closed): ceph tell osd <num> should be ceph osd <subcommand> to match ceph pg <sub...
Samuel Just
11:41 AM Feature #2727: filestore: add split
On second thought, that would be hard to encode into a transaction, it'll probably just take a
[(hash_prefix, bits...
Samuel Just
11:06 AM Feature #2727 (Resolved): filestore: add split
Add split functionality to filestore. The existing collection structure should be already in place.
Interface coul...
Samuel Just
11:20 AM Documentation #2729 (Resolved): doc: OSD internals
Broad overview of OSD internals:
* Threading/wq model
* Message handling
* Message ordering (wait for map, pg wait...
Samuel Just
11:18 AM Feature #2728 (Resolved): OSD: handle split
OSD message handling needs to handle messages for pgs between the map epoch creating them and the actual split (shoul... Samuel Just
11:17 AM rbd Documentation #2670 (In Progress): Docs shouldn't direct users to echo to /sys/bus/rbd for normal...
Sage Weil
11:06 AM Bug #2725 (Duplicate): filestore: add split
Samuel Just
11:04 AM Bug #2725 (Duplicate): filestore: add split
Add split functionality to filestore. The existing collection structure should be already in place.
Interface cou...
Samuel Just
11:05 AM rbd Feature #2609 (In Progress): librbd: new image name -> image head indirection
Sage Weil
11:05 AM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
Sage Weil
11:04 AM rbd Feature #2726 (Resolved): krbd: clean up bio_pair leak/whatever
Sage Weil
11:03 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
I have restarted ceph mon with invoke-rc.d ceph restart mon and now it works! However I am 100% sure that I did it af... Maciej Galkiewicz
10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
you can just restart, or to verify in the future, 'ceph --admin-daemon /var/run/ceph/ceph-mon.a.asok version' (adjust... Sage Weil
10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
Oh! The monitor is still running the old version. Sage Weil
10:57 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
... Maciej Galkiewicz
10:55 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
Weird! Two things:
- does 'ceph status' work? (this is identical to -s now)
- can you try with commit:b53cdb9...
Sage Weil
10:45 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
# ceph -v
ceph version 0.48argonaut (commit:c2b20ca74249892c8e5e40c12aa14446a2bf2030)
Maciej Galkiewicz
10:44 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
can you include 'ceph -v' output? this works for me...... Sage Weil
10:05 AM Bug #2721 (Resolved): Ceph status does not work in 0.48 even if it is still documented
... Maciej Galkiewicz
10:22 AM rbd Feature #2724 (Resolved): rbd: add protect/unprotect command
See http://ceph.com/docs/master/dev/rbd-layering/#protection Josh Durgin
10:12 AM rbd Feature #2723 (Resolved): librbd: protect/unprotect as appropiate during cloning
See http://ceph.com/docs/master/dev/rbd-layering/#protection Josh Durgin
10:10 AM rbd Feature #2722 (Resolved): cls_rbd: add class methods to get/set protected status
The possible settings are: protected, unprotected, and protecting. See http://ceph.com/docs/master/dev/rbd-layering/ ... Josh Durgin
10:02 AM rbd Feature #2720 (Resolved): rbd: add children command
This should list all direct children of a given snapshot. Josh Durgin
10:01 AM rbd Feature #2719 (Resolved): librbd: provide functions for listing parents and their children
Wrap the class methods for listing parents and children of parents. Josh Durgin
09:57 AM rbd Feature #2718 (Resolved): librbd: map parent -> child in a per-pool rbd_children object when cloning
Josh Durgin
09:55 AM rbd Feature #2717 (Resolved): cls_rbd: add methods for maintaining mapping from parent to child images
From http://ceph.com/docs/master/dev/rbd-layering/#cls-rbd:... Josh Durgin

07/04/2012

10:23 PM Linux kernel client Bug #2716: crash when cluster goes down and new one comes up
... Sage Weil
10:21 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
- vstart cluster
- mount uml
- do some stuff (dbench, control-c, sync)
- stop vstart cluster
- (waited a while)
...
Sage Weil
09:24 PM rbd Bug #2715 (Resolved): krbd: spinlock wrong CPU
... Sage Weil
03:34 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
Sage suggested we might want to have teuthology archive the monitor's data directory, since that information is often... Greg Farnum
02:52 PM Bug #2593: logmonitor: decode failure
nope.
i wonder if we should make teuthology archive the mon data directory... that info is often useful, even for ...
Sage Weil
02:49 PM Bug #2675 (Resolved): osd: segfault during log trim
Sage Weil
02:31 PM Bug #2712 (Resolved): osd: small write performance regression from 0.47.*
Several reports on the ML about small writes going slower than before, generating 'slow request' warnings where previ... Sage Weil
02:02 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
i saw this on at least one other node as well. why? (this manifests itself as a 100-continue s3test failure in qa.... Sage Weil
01:45 PM Linux kernel client Bug #2710 (Resolved): freeze from dbench in uml
Sage Weil
01:16 PM Linux kernel client Bug #2710: freeze from dbench in uml
this was a problem with 3df5c88d0d469c71c43f2379f6a375fc2d314eb7, fixed and re-pushed testing.
the fix f28dfe91f5f...
Sage Weil
11:32 AM Linux kernel client Bug #2710 (Resolved): freeze from dbench in uml
Current testing and about 3 seconds of dbench 2 locks up Sage Weil
08:19 AM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
07:22 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
... Sage Weil

07/03/2012

10:36 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Firstly, well done guys on achieving this version milestone. I successfully upgraded to the 0.48 format uneventfully ... Paul Pettigrew
04:45 PM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:44 PM RADOS Feature #2705 (Resolved): crush: graceful transition to new default tunables
Sage Weil
04:44 PM RADOS Bug #2214 (Resolved): crush: pgs only mapped to 2 devices with replication level 3
Sage Weil
04:44 PM RADOS Bug #2047 (Resolved): crush: with a rack->host->device hierarchy, several down devices are likely...
Sage Weil
04:43 PM RADOS Bug #187 (Rejected): crush: high variance, latency for straw buckets
Sage Weil
04:43 PM RADOS Feature #2422 (Resolved): crush: test that mapping result is uncorrelated
Sage Weil
04:39 PM rgw Bug #2106: failed s3tests.functional.test_s3.test_100_continue
recent logs from the nightly run: /a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5054
Tamilarasi muthamizhan
04:34 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
Tamilarasi muthamizhan wrote:
> latest logs:
> /a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5019...
Tamilarasi muthamizhan
04:33 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
latest logs:
/a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5019
config.yaml:
++++++++++++
k...
Tamilarasi muthamizhan
03:53 PM devops Feature #2704 (Closed): sepia: Use ``names`` as resolver on plana, burnupi, vercoi
Anonymous
03:45 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
Sage Weil
03:27 PM devops Feature #2549: ceph-disk-prepare: take fstype, mkfs and mount options from ceph.conf
As of commit ad97415ef72b55934adfa5024fd9af8fd1f0f82d, this now needs mount options too. Anonymous
03:26 PM devops Feature #2547 (Resolved): ceph-disk-prepare: handle partitioning and mkfs
commit ad97415ef72b55934adfa5024fd9af8fd1f0f82d
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-07-03 15:24:26...
Anonymous
02:24 PM rbd Bug #2457 (Resolved): libvirt: migration fails with rbd in 0.9.11 and 0.9.12
Fixed by upstream libvirt commit 78290b1641e95304c862062ee0aca95395c5926c. Josh Durgin
02:08 PM rbd Bug #2457: libvirt: migration fails with rbd in 0.9.11 and 0.9.12
Fixed in 0.9.12-3 (Debian naming) and later. Recent reports on the list say the same, so the issue may be closed safely. Andrey Korolyov
02:17 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
Yehuda Sadeh
02:15 PM rbd Bug #2700 (Resolved): blkdeviotune method at libvirt doesn`t work on RBD volumes
Since qemu implemented its own i/o limiting mechanism rather than cgroups, all block backends may be controlled over ... Andrey Korolyov
12:17 PM Messengers Bug #2569: msgr: connect_rank crash
i've merged fix for this into master, commit:204bc594be1a6046d1b362693d086b49294c2a27 (with possible side-effects fro... Sage Weil
12:16 PM Bug #2682 (Resolved): config lockdep error (recursive lock?) in LibRadosAio.SimpleWritePP
Sage Weil
10:48 AM devops Feature #2699 (Rejected): crowbar: change barclamp-glance to use rbd
Anonymous
10:38 AM devops Feature #2698: crowbar: Guide for using "front" network
We need an easy way to drop a "dhclient eth1" upstart job into a crowbar server installation. Just a sudo tee /etc/in... Anonymous
10:28 AM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
Anonymous
10:26 AM devops Feature #2697 (In Progress): crowbar: ISO generation, reproducible in a cloud image vm
Anonymous
10:16 AM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Anonymous
10:12 AM devops Feature #2696 (Rejected): chef: Automated QA
Use downburst vms on vercoi to automatically bring up ceph clusters, do basic RADOS/RBD functionality testing, tear d... Anonymous
10:11 AM devops Feature #2695 (Closed): crowbar: Automated QA
Use downburst vms on vercoi to automatically bring up ceph clusters, do basic RADOS/RBD functionality testing and Ope... Anonymous
10:10 AM rgw Bug #2642 (Resolved): rgw: show/trim usage using also time (not just date)
Done, commit:80a939a99db64f7802a4a3c1320316c91720f5d9 Yehuda Sadeh
10:08 AM rgw Bug #2658 (Resolved): rgw-admin: usage show fails when specifying hour > 12
Fixed, commit:c5d19b6df0bcb238e5e68732b4d252b06f2d9e56. Yehuda Sadeh
10:05 AM devops Feature #2584 (Resolved): sepia: provide networking, DHCP for dynamic virtual machines
Anonymous
10:05 AM devops Feature #2584: sepia: provide networking, DHCP for dynamic virtual machines
Split the DNS part to #2694, this is already providing value to users. Anonymous
09:59 AM devops Feature #2584: sepia: provide networking, DHCP for dynamic virtual machines
Status update: missing DNS updates, all the strictly required components are there; vms attached to the front network... Anonymous
10:04 AM devops Feature #2553: crowbar: open question: What's the correct way to add RBD support to the Nova barc...
(Wrong ticket, ignore) Anonymous
10:04 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Anonymous
09:24 AM devops Feature #2546 (Resolved): ceph-disk-prepare: take fsid from ceph.conf (support --cluster=name)
commit 4e774fbcb38fd6883232b72352512a5f8e4a66e8
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-07-03 09:22:28...
Anonymous
08:04 AM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
... Sage Weil

07/02/2012

09:25 PM Feature #2692 (Resolved): stable testing debian repos
Sage Weil
06:49 PM rbd Bug #2689 (In Progress): qemu iozone test hangs
Josh Durgin
02:51 PM rbd Bug #2689 (Resolved): qemu iozone test hangs
... Sage Weil
05:07 PM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
took down osd.2 and osd.3 with same crash. coredumps are on the hosts.. Sage Weil
05:06 PM Bug #2691 (Won't Fix): osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
... Sage Weil
04:40 PM Bug #2690 (Won't Fix): mon: persist quorum features
currently the non-leaders do not know the quorum features, and encode everything with a minimal (0) feature set.
...
Sage Weil
02:26 PM Linux kernel client Bug #2688 (Duplicate): lockup on ffsb + thrashing
... Sage Weil
12:54 PM Bug #2687: FileStore crashes when "osd_journal_size" is larger than the filesystem
for files, i think the right approach is to fallocate(), which will reserve the space. we shouldn't have to look at ... Sage Weil
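The fallocate() approach suggested above for #2687 can be illustrated with a small sketch. It assumes a Unix system where Python's `os.posix_fallocate` is available; the helper name `reserve_journal` and the sizes used are made up for this example and are not part of FileStore.

```python
import errno
import os
import tempfile


def reserve_journal(path: str, size_bytes: int) -> bool:
    """Try to reserve size_bytes for a journal file.

    Returns False if the filesystem cannot provide the space, instead of
    failing later in the middle of a write.
    """
    fd = os.open(path, os.O_RDWR | os.O_CREAT, 0o600)
    try:
        # posix_fallocate allocates the blocks up front and extends the
        # file to size_bytes if it was shorter.
        os.posix_fallocate(fd, 0, size_bytes)
        return True
    except OSError as e:
        # Not enough space, or the size exceeds what the filesystem allows.
        if e.errno in (errno.ENOSPC, errno.EFBIG):
            return False
        raise
    finally:
        os.close(fd)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as d:
        journal = os.path.join(d, "journal")
        ok = reserve_journal(journal, 1 << 20)   # ask for 1 MiB
        print(ok, os.path.getsize(journal))
```

The point of reserving up front is that an oversized journal setting is reported once, at creation time, as a clean error rather than surfacing as a crash when the journal is first written.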
12:47 PM Bug #2687 (Resolved): FileStore crashes when "osd_journal_size" is larger than the filesystem
See: http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7282
If a user (on tmpfs, in this case) specifies...
Greg Farnum
12:49 PM Bug #2476: osd: watch timeout depends on operations to an object
fix qa/workunits/rbd/copy.sh when this is fixed !!! Sage Weil
12:36 PM rbd Feature #2556: rbd tool: break image locks
The current progress is in wip-rbd-locking. Still needs tests and docs, plus a small cleanup as noted on github. Josh Durgin
12:32 PM rbd Feature #2686 (Resolved): rbd: let users specify a usage for shared locks
If existing lockers have the same usage, the lock succeeds. Otherwise, it fails. This could let you use locks with e.... Josh Durgin
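The rule described in #2686 (a shared lock succeeds only when the existing lockers declared the same usage) can be sketched as below. This is an in-memory illustration only; the class `UsageLock`, its methods, and the locker and usage strings are hypothetical and do not reflect the rbd or cls interfaces.

```python
class UsageLock:
    """Hypothetical shared lock whose holders must agree on a usage tag."""

    def __init__(self):
        self.usage = None       # usage tag of the current holders, if any
        self.lockers = set()    # identifiers of the current holders

    def lock_shared(self, locker: str, usage: str) -> bool:
        """Acquire a shared lock; fail if holders exist with another usage."""
        if self.lockers and self.usage != usage:
            return False
        self.usage = usage
        self.lockers.add(locker)
        return True

    def unlock(self, locker: str) -> None:
        """Release the lock; forget the usage once nobody holds it."""
        self.lockers.discard(locker)
        if not self.lockers:
            self.usage = None


if __name__ == "__main__":
    lock = UsageLock()
    print(lock.lock_shared("client.a", "backup"))     # first holder sets the usage
    print(lock.lock_shared("client.b", "backup"))     # same usage, allowed
    print(lock.lock_shared("client.c", "migration"))  # different usage, refused
```

Once every holder has released the lock, the usage tag is cleared, so a later locker is free to declare a different usage.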
11:28 AM rbd Feature #2685 (Rejected): Support QEMU migration with caching enabled
This is a libvirt problem, it's not related to qemu at all. I already looked into and tested whether qemu was doing f... Josh Durgin
11:21 AM rbd Feature #2685 (Rejected): Support QEMU migration with caching enabled
See http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7524
Apparently newer versions of QEMU refuse to...
Greg Farnum
09:44 AM Documentation #2684 (Won't Fix): doc: ceph and all daemons take --show-config
Quoting Sage:
For future reference, you can get a dump of all these values with
ceph-osd -i 123 --show-...
Anonymous
09:30 AM Bug #2593: logmonitor: decode failure
Do we know if the log in question actually existed on disk or not? Greg Farnum
07:28 AM Bug #2593: logmonitor: decode failure
saw this again on next:... Sage Weil
07:37 AM Bug #2683: ceph-fuse: crash during fsstress
... Sage Weil
07:31 AM Bug #2022 (Need More Info): osd: misdirectect request
apparently there is a different cause for this:... Sage Weil
05:57 AM Subtask #2621 (In Progress): mon: Single-Paxos: synchronize the MonitorDBStore of oblivious monitor
Joao Eduardo Luis

07/01/2012

09:46 PM Feature #2651: mon: race calling tick() when doing slurping
making this a cleanup so that it stops confusing me :) Sage Weil
08:57 PM Bug #2683 (Can't reproduce): ceph-fuse: crash during fsstress
... Sage Weil
07:48 PM Bug #2682 (Resolved): config lockdep error (recursive lock?) in LibRadosAio.SimpleWritePP
... Sage Weil
03:06 PM CephFS Bug #2681: client: got push without mds session
this was with 'ms inject socket failure = 200' Sage Weil
03:06 PM CephFS Bug #2681 (Resolved): client: got push without mds session
... Sage Weil
02:41 PM Bug #2599 (Can't reproduce): osd: crash in ReplicatedPG::C_OSD_OndiskWriteUnlock::finish
chalking this up to the bugs in next a couple weeks back Sage Weil
09:22 AM Feature #2680 (Resolved): osd: report backfill progress via query
... Sage Weil
07:09 AM CephFS Bug #2679 (Can't reproduce): POSIX file lock not released on process termination
I obtained a POSIX file lock with the following code:
> --- snip ---
>
> ...
> std::string x = "/tmp/ceph_mount...
Daniel Godas-Lopez

06/30/2012

10:52 PM rbd Documentation #2670: Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
Sage Weil
10:51 PM rbd Feature #2279 (Resolved): rbd: trivial layering design doc
Sage Weil
11:34 AM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3450 Tamilarasi muthamizhan

06/29/2012

09:44 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3441... Tamilarasi muthamizhan
03:39 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3435 Sage Weil
03:37 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3437 Sage Weil
03:33 PM Bug #2675: osd: segfault during log trim
also:... Sage Weil
03:30 PM Bug #2675 (Resolved): osd: segfault during log trim
... Sage Weil
09:02 PM Feature #2471 (Resolved): osd: add prefix match to OSDCaps
Sage Weil
09:00 PM Feature #2678 (Rejected): osd, objecter: redirect misdirected requests
Generic mechanism to refer the client to the correct osd when they misdirect their requests. This will allow the clu... Sage Weil
08:59 PM Bug #2676 (Resolved): mon: cannot create pool with old renamed name
commit:5a9355091296121823156de7d3160de45328a0cc Sage Weil
04:46 PM Bug #2676 (Resolved): mon: cannot create pool with old renamed name
renaming a pool name, and then trying to create a new pool with the old name fails. Yehuda Sadeh
07:27 PM rbd Bug #2677 (Resolved): librbd: create does not clean up well
A create that fails part way through does not remove objects it created or undo modifications it does, for example ad... Josh Durgin
07:23 PM rbd Feature #2279 (Fix Under Review): rbd: trivial layering design doc
See wip-rbd-layering-doc Josh Durgin
03:26 PM Messengers Bug #2569: msgr: connect_rank crash
fix for this is in wip-msgr, still testing Sage Weil
02:16 PM RADOS Feature #2541 (Resolved): crush: move command to adjust non-leaf node position
Sage Weil
12:54 PM Feature #2575 (Resolved): perf: 0.48 numbers
Mark Nelson
12:53 PM Feature #2582 (Resolved): set up chart.io + mysql (or equivalent) infrastructure for tracking perf
Mark Nelson
12:51 PM Feature #2577 (Resolved): teuthology: blktrace task
Mark Nelson
12:29 PM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
Tried this on master. Although at first I triggered something else, the bottom line is that this works, and the monit... Joao Eduardo Luis
12:14 PM Subtask #2674 (Rejected): mon: Single-Paxos: mon commits suicide after remove&add
Yep. Makes sense. I was afraid this was caused by my changes.
Rejecting it then.
Joao Eduardo Luis
11:30 AM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
Yeah.. basically we're changing the mon's ip by removing and re-adding it, and the mon isn't smart enough to realize ... Sage Weil
11:12 AM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
I believe this is intended behavior, note the last line:... Greg Farnum
03:07 AM Subtask #2674 (Rejected): mon: Single-Paxos: mon commits suicide after remove&add
Pre-conditions:
3 mons: a=127.0.0.1:6789 ; b=127.0.0.1:6790 ; c=127.0.0.1:6791
* remove 'c' with ./ceph mon rem...
Joao Eduardo Luis
11:09 AM Bug #2646: mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
commit:840ae244499496d543d634713bdee7c7884ce527
The tick happened at the same time as slurping, which meant the di...
Greg Farnum
10:54 AM Bug #2646 (Resolved): mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
Sage Weil
10:53 AM Bug #2264 (Can't reproduce): mon: failed assert in bump_epoch
Sage Weil
06:19 AM Bug #2618: error: unable to open OSD superblock
Thanks, but that didn't help.
I did notice that drives get mounted a little weird.
Don't know if that's a problem...
John S

06/28/2012

10:06 PM Bug #2664 (Resolved): osd: extra attr _path, extra attr snapset from scrub
Sage Weil
11:29 AM Bug #2673 (Resolved): ReplicatedPG::prepare_transaction: don't crash on empty ops
Samuel Just
11:26 AM Cleanup #2672 (Rejected): PG::find_best_info cleanup
see 253033cd720db86e7c8372fd4184de7d4c43bce2 Samuel Just
11:26 AM Cleanup #2671 (Resolved): buffer.h: do efficient buffer comparisons
Samuel Just
10:15 AM rbd Documentation #2670 (Resolved): Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
A naive user looking for "rbd map" will instead find this:
http://ceph.com/docs/master/rbd/rados-rbd-cmds/
with...
Anonymous
10:04 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Lots of work on the messenger client, but still not completely
clear this particular bug is fixed. There are a few ...
Alex Elder
09:42 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
I suppose this really ought to get fixed at some point.
For now, it looks like Sage has implemented a workaround
th...
Alex Elder
09:41 AM rbd Bug #1070: krbd: ^C doesn't work
No progress on this. None expected unless it gets
reprioritized and planned.
Alex Elder
09:40 AM Linux kernel client Feature #1699: debug symbols in autobuilt (sepia) kernels
No progress on this. I have a vague memory that someone
else might have looked at this problem a while back (Dan?)....
Alex Elder
09:39 AM Feature #2127: Save kernel core dumps on all of our test machines
My work on this was pretty much complete a few months ago.
It included a shell script that leverages Ubuntu kdump
...
Alex Elder
09:32 AM Linux kernel client Bug #2261 (Can't reproduce): paging error in libceph after crashed osd comes back online
the osd_client refcounting bug fix may explain this one, too... commit:0d47766f14211a73eaf54cab234db134ece79f49
an...
Sage Weil
09:16 AM Linux kernel client Bug #2261: paging error in libceph after crashed osd comes back online
No progress on this.
There has been a lot of work on the messenger code since this bug was
reported. One change ...
Alex Elder
09:31 AM Linux kernel client Cleanup #2130: ceph: xattr: complete cleanups following review
No progress on this, but I still have the patches. I'll
try to sneak them in as I'm working on RBD. I believe
the...
Alex Elder
09:29 AM Linux kernel client Cleanup #2131: ceph: xattr: use the generic kernel xattr code
No progress on this. It should be put on our roadmap as a task
to complete, maybe within the next 6 months.
Alex Elder
09:12 AM Bug #2267 (Closed): Ceph client crashed after shutting down one mds and osd
A recent fix supplied by Zheng Yan of Intel seems to have fixed
this problem, so I'm closing this bug.
rbd: C...
Alex Elder
09:05 AM rbd Feature #2326 (In Progress): krbd: use new class interfaces, new image format
I've finally begun work on this, following some in-person discussion
with Josh, Dan, and Sage this week.
I will u...
Alex Elder
09:00 AM Linux kernel client Feature #2374: ceph-client: start laying the groundwork for Linux tracepoints
No progress on this yet.
However, I got this e-mail from Jim Schutt shortly after creating
this bug, and just wan...
Alex Elder
08:44 AM Bug #2386: xfstests: failed #34
I've been trying to find out whether this is still a problem or
if it was transient. But teuthology has had a strin...
Alex Elder
07:41 AM Linux kernel client Bug #2424 (Resolved): ceph-client: messenger: badness in prepare_write_connect()
This bug was fixed in May, by a small series of changes that
culminated in this one:
commit 3da54776e2c0385c3...
Alex Elder
07:37 AM Linux kernel client Cleanup #2432: ceph-client: messenger: refactor to simplify state model
I had worked out on paper some notes about a longer-term state/event
model that could be used for the client messeng...
Alex Elder
07:33 AM Linux kernel client Cleanup #2432: ceph-client: messenger: refactor to simplify state model
I worked on doing this for a good month but the job really isn't
complete. Nevertheless I think there was some prog...
Alex Elder
07:23 AM Linux kernel client Cleanup #2438: ceph-client: use BUG_ON() for null auth_client->ops pointers
Touching all my bugs today. This one's a good idea but
very low priority.
Alex Elder
07:20 AM rbd Bug #2608: rbd: hung xfstest 270
Just to summarize what I just added...
There are some recent XFS problems that might explain this,
irrespective o...
Alex Elder
07:16 AM rbd Bug #2608: rbd: hung xfstest 270
I looked at this on Tuesday, and sent a note to Sage that should
have instead been put here. Here it is.
I w...
Alex Elder
04:54 AM Feature #2668 (Resolved): Build linux-tools-common package for perf
It'd be really nice if we built linux-tools-common with our gitbuilder kernels so we can install perf on our test box... Mark Nelson

06/27/2012

06:10 PM Bug #2618: error: unable to open OSD superblock
I noticed an issue in your ceph.conf - you have keyring = /etc/ceph/keyring.admin in the global section, and the osd ... Josh Durgin
05:19 PM rbd Bug #2667 (Won't Fix): librbd: create_snap on a closed image segfaults
I wrote silly code, and in reordering it, managed to attempt rbd_snap_create() on an
image that I had rbd_close()d. ...
Dan Mick
05:13 PM Feature #2651: mon: race calling tick() when doing slurping
oops, stronger fix, yes! Sage Weil
05:13 PM Feature #2651 (Resolved): mon: race calling tick() when doing slurping
Sage Weil
05:01 PM Feature #2661 (Resolved): mon: do not allow monitors to be added to the map with port 0
Merged into dho and next. Thanks Joao! Greg Farnum
11:25 AM Feature #2661 (Resolved): mon: do not allow monitors to be added to the map with port 0
Last week, somebody used the "ceph mon add" command without specifying a port, and it defaulted to port 0. This cause... Greg Farnum
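As an illustration of the check this feature asks for, here is a minimal sketch that rejects a monitor address with a missing or zero port. It is not the monitor's actual code: `parse_mon_addr` is a hypothetical helper, and it assumes plain IPv4 `host:port` strings like the 127.0.0.1:6789 form used elsewhere in this log.

```python
from typing import Tuple


def parse_mon_addr(addr: str) -> Tuple[str, int]:
    """Split 'host:port' and refuse a missing or zero port.

    Hypothetical helper for illustration only; assumes IPv4 'host:port'.
    """
    host, sep, port_text = addr.rpartition(":")
    if not sep or not host:
        # No explicit port given: refuse rather than silently defaulting to 0.
        raise ValueError("monitor address %r has no port" % addr)
    port = int(port_text)  # raises ValueError on non-numeric text
    if not 0 < port < 65536:
        raise ValueError("monitor address %r has invalid port %d" % (addr, port))
    return host, port


if __name__ == "__main__":
    print(parse_mon_addr("127.0.0.1:6789"))
    for bad in ("127.0.0.1:0", "127.0.0.1"):
        try:
            parse_mon_addr(bad)
        except ValueError as err:
            print("rejected:", err)
```

Failing loudly on a missing port, instead of filling in a default, is a design choice of this sketch and not something the entry above specifies.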
04:48 PM Feature #2666 (Resolved): rados tool: copy pool
A new operation to copy the entire content of a pool into a different pool. For each object we'd copy the locator, da... Yehuda Sadeh
04:04 PM rgw Bug #2665 (Resolved): rest-bench hangs periodically
rest-bench seems to hang periodically, with the following spit out to the console on a regular basis:
plana83: 2012-06...
Mark Nelson
04:04 PM Bug #2656 (Rejected): rados-bench hangs periodically
Mark Nelson
04:03 PM Bug #2656: rados-bench hangs periodically
gah,
this is what I get for submitting bugs at the end of the day. You are correct, rest-bench.
Mark Nelson
03:29 PM devops Feature #2587 (Resolved): sepia: isolated networking on vercoi (manual, a handful)
Anonymous
03:28 PM devops Feature #2587: sepia: isolated networking on vercoi (manual, a handful)
Confirmed: isolated0..isolated9 work even if Crowbar wants to put VLANs in them. They pass between vercoi as packets ... Anonymous
02:17 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
More on where that snippet should live:
- for standalone chef deployment, we want the admin to run something similar,...
Anonymous
02:14 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
This python snippet creates ceph keys in the right format (for now). Where it should live is still an open question.
...
Anonymous
01:38 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Without this, multi-mon bring-up is racy.
At proposal save time, the barclamp should inspect the roles, and assign...
Anonymous
02:12 PM Bug #2664: osd: extra attr _path, extra attr snapset from scrub
full logs at metropolis:~sage/bug-2664 Sage Weil
02:11 PM Bug #2664 (Resolved): osd: extra attr _path, extra attr snapset from scrub
... Sage Weil
01:43 PM devops Feature #2663 (Closed): crowbar: UI for setting generic ceph.conf values
This needs to be some sort of an extensible list of key: value pairs.
Do we need to support sections too? Probably...
Anonymous
01:17 PM devops Feature #2589 (Resolved): crowbar: Update barclamp-ceph for Essex, new ceph-cookbooks
Tyler reported success as of b2c5d3307eef0ca44fd4b001136e9af043b322bd. Anonymous
01:16 PM devops Feature #2588: downburst: multiple, configurable networks to libvirt
For historical value: https://github.com/ceph/downburst/commit/de494eeefad0f0c72916d5dab8ba015b441a94f0 Anonymous
11:30 AM devops Feature #2588 (Resolved): downburst: multiple, configurable networks to libvirt
Anonymous
11:26 AM Linux kernel client Bug #2590: possible irq lock inversion dependency with con->mutex and osdc->request_mutex
Recent log location: /a/teuthology-2012-06-27_00:00:07-regression-next-testing-basic/3076
2012-06-27T01:25:05.11...
Tamilarasi muthamizhan
10:17 AM rbd Feature #2660 (New): qa: test resizing an rbd image while a vm has it open
Make sure the resize is visible to the guest. This works with the virtio driver after doing e.g. 'echo 1 | sudo tee /... Josh Durgin
10:02 AM Subtask #2659 (Can't reproduce): mon: Single-Paxos: ceph tool -w subscriptions not being updated
how to reproduce:... Joao Eduardo Luis

06/26/2012

05:16 PM rgw Bug #2658 (Resolved): rgw-admin: usage show fails when specifying hour > 12
using the wrong modifier for parsing it. Yehuda Sadeh
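The entry does not say which modifier was wrong. One common way to get exactly this symptom is parsing a 24-hour value with a 12-hour format specifier, so the sketch below assumes that cause and shows it with Python's `strptime` (not the rgw code itself). The `parse_hour` helper and the sample timestamp are hypothetical.

```python
from datetime import datetime


def parse_hour(text, fmt):
    """Return the parsed hour, or None if the text does not match the format."""
    try:
        return datetime.strptime(text, fmt).hour
    except ValueError:
        return None


# %I is the 12-hour clock specifier, so an hour above 12 fails to parse.
twelve_hour = parse_hour("2012-06-26 17", "%Y-%m-%d %I")

# %H is the 24-hour clock specifier and accepts the same input.
twenty_four = parse_hour("2012-06-26 17", "%Y-%m-%d %H")

if __name__ == "__main__":
    print(twelve_hour, twenty_four)
```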
05:11 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
possibly fixed by commit:0d8970fc813b33e7c6ba2484fbc43cce947d3f4d Sage Weil
04:31 PM CephFS Bug #2657 (Resolved): kclient: direct io write larger than 8MiB fails
Writes larger than 8MiB get EFAULT, e.g.:... Josh Durgin
02:13 PM Bug #2656: rados-bench hangs periodically
rados-bench or rest-bench? Yehuda Sadeh
01:27 PM Bug #2656 (Rejected): rados-bench hangs periodically
rados-bench seems to hang periodically, with the following spit out to the console on a regular basis:
plana83: 2012-0...
Mark Nelson
01:45 PM Bug #2563 (Can't reproduce): leveldb corruption
It looks like one of the leveldb store files was corrupted, possibly by the filesystem. It may be possible to recove... Samuel Just
09:36 AM Bug #2655 (Resolved): scrub slows writes more than it should
Samuel Just
09:34 AM Subtask #2616 (Closed): mon: Single-Paxos: AuthMonitor: key_server has no entries
Joao Eduardo Luis
09:34 AM Subtask #2616 (Resolved): mon: Single-Paxos: AuthMonitor: key_server has no entries
Joao Eduardo Luis
09:33 AM Subtask #2620 (Closed): mon: Single-Paxos: MDSMonitor: MMDSBeacon from entity with insufficient p...
Note: turns out this was the same bug as #2643
Had to do with the AuthMonitor losing some info when reading versi...
Joao Eduardo Luis
09:32 AM Subtask #2643 (Closed): mon: Single-Paxos: mds: Strange message behavior on peon
Had to do with the AuthMonitor losing some info when reading versions from the store.
This is fixed.
Joao Eduardo Luis
09:01 AM Linux kernel client Bug #2523: xfs: xfs_iolock_reclaimable
... Sage Weil
06:15 AM rbd Bug #2654 (Won't Fix): Stale rbd volume cannot be unmaped
/dev/rbd0 exists in system but /dev/rbd/winnie-test/postgresql not... Maciej Galkiewicz
 
