Project

General

Profile

Activity

From 07/28/2012 to 08/26/2012

08/26/2012

10:16 PM Bug #3050 (Resolved): objecter: need to resend requests when we get first map
... Sage Weil
10:10 PM Bug #3049 (Fix Under Review): mds: startup+suicide failure, MDLog::handle_journaler_write_error
was able to reproduce after a few attempts with... Sage Weil
09:30 AM Bug #3049 (Resolved): mds: startup+suicide failure, MDLog::handle_journaler_write_error
... Sage Weil
09:31 AM Bug #2947 (In Progress): osd: out of order reply
nooo!... Sage Weil
09:27 AM Bug #3048 (Resolved): rados bench: use after free?
... Sage Weil

08/25/2012

09:28 PM RADOS Bug #2874 (Resolved): apparent CRUSH mapping failure
Glad to hear the tunables resolved this for you, Alex! Sage Weil
05:42 PM Feature #2944 (Duplicate): mon: dynamically adjust heartbeat grace
#3044 #3043 #3047 #3045 #3046 Sage Weil
05:40 PM Feature #3047 (Resolved): mon: apply heartbeat grace adjustment to down_out_interval
Sage Weil
05:39 PM Feature #3046 (Resolved): mon: factor osd failure reporters into heartbeat grace
When a sufficient number of failure reports come in to mark an OSD
down, additionally compute the laggy probability ...
Sage Weil
05:39 PM Feature #3045 (Resolved): mon: factor osd laggy interval into heartbeat grace
Adjust the "heartbeat grace" locally on the monitor according to
the following formula:
adjusted_heartbeat_grace = ...
Sage Weil
05:38 PM Feature #3044 (Resolved): osd: include fail stamp in failure messages
Sage Weil
05:36 PM Feature #3043 (Resolved): mon: track osd laggy rate/interval
1) Keep track of when an OSD boots if it reports itself as fresh or as
wrongly-marked-down. Maintain the probability...
Sage Weil
04:15 PM Feature #2742 (Resolved): qa: ms socket inject failures in regression suite
Sage Weil

08/24/2012

07:24 PM Linux kernel client Bug #3040: btrfs: recursive locking of sb_internal#2
disabled this check in teuthology/tasks/internal.py for now... revert that when this is resolved! Sage Weil
03:43 PM Linux kernel client Bug #3040 (Resolved): btrfs: recursive locking of sb_internal#2
... Sage Weil
03:54 PM Bug #2827 (Rejected): mon: ceph health string doesn't match "ceph -s" output
i don't think theres' anything wrong here.. the "stuck" stuff is based on times they changed away from active or clea... Sage Weil
03:52 PM Bug #3042 (Can't reproduce): monitor hangs when osds are shut down
Logs: ubuntu@teuthology:/a/teuthology-2012-08-22_19:00:05-regression-master-testing-gcov/6876... Tamilarasi muthamizhan
03:52 PM Bug #3014 (Fix Under Review): ceph mds set_data_pool pool doesn't fail
Sage Weil
01:47 PM Bug #3014: ceph mds set_data_pool pool doesn't fail
all of the other atoi() users should be switched, while we're at it. Sage Weil
03:52 PM Bug #2858 (Fix Under Review): mon: osd id parsing returns 0 when passed 'osd.1234'
Sage Weil
03:47 PM Bug #3041 (Resolved): ceph manager down during osd recovery
Logs: ubuntu@teuthology:/a/teuthology-2012-08-23_19:00:08-regression-master-testing-gcov/7533... Tamilarasi muthamizhan
02:38 PM Bug #2876 (Resolved): mon: pg stuck peering (for example) broken?
commit:d9bd61304b14085deafc4835b4d35c7a58d096b3 Sage Weil
02:34 PM Bug #2761: osd: failed to recover before timeout expired
Recent logs:ubuntu@teuthology: /a/teuthology-2012-08-23_19:00:08-regression-master-testing-gcov/7594 Tamilarasi muthamizhan
01:48 PM Linux kernel client Bug #3031: btrfs: lock returned to userspace
Sage Weil
08:05 AM Bug #3038 (Resolved): objectcacher: segv in bh_write_commit -> close_object
... Sage Weil

08/23/2012

05:59 PM rgw Feature #2797: rgw: support multi-objects delete
Yehuda Sadeh
05:58 PM rgw Feature #2839: rgw: garbage collection
Yehuda Sadeh
05:58 PM rgw Feature #3037 (Resolved): rgw: unit test for rgw objclass
Yehuda Sadeh
04:10 PM Feature #2829 (Resolved): report on cluster size/status (for service billing purposes)
at some point we need the receiving end of this: extract the json, validate the crc, and stick it in some database or... Sage Weil
03:55 PM Feature #2477 (Resolved): rados bench cleanup
Sage Weil
01:16 PM rbd Bug #2948 (Resolved): rbd: fails to close image on error
commit:fed8aea662bf919f35a5a72e4e2a2a685af2b2ed in master
Dan Mick
12:59 PM Feature #2840 (Resolved): mon: $mon_data/cluster_fsid file
Sage Weil
10:12 AM Linux kernel client Bug #3031 (Resolved): btrfs: lock returned to userspace
[19490.018682]
[19490.038366] ================================================
[19490.063495] [ BUG: lock held whe...
Sage Weil
07:27 AM Subtask #2745: mon: Single-Paxos: Sync: Add new message support to the Monitor class
Currently, most timeout callbacks simply assert. This has been allowing us to successfully debug some unforeseen situ... Joao Eduardo Luis
07:13 AM Subtask #2757: mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transactions
Must still test how it behaves when we are only interested in synchronizing part of the store. Joao Eduardo Luis
07:11 AM Subtask #2744 (Resolved): mon: Single-Paxos: Sync: Create new Message type
Only thing missing: adjusting the commit message to fully describe the message in detail. Joao Eduardo Luis

08/22/2012

01:57 PM Bug #2784: osd hit suicide timeout
This test hung in the nightlies.
Logs: ubuntu@teuthology:/a/teuthology-2012-08-22_00:00:07-regression-next-testing...
Tamilarasi muthamizhan
01:51 PM Bug #3030: config/option parser: Avoid needing to list command line options in a global config list
Another example: daemonize. Anonymous
01:46 PM Bug #3030 (Won't Fix): config/option parser: Avoid needing to list command line options in a glob...
Having "monmap" in config_opts, when it's only really used by ceph-osd --mkfs, is pretty confusing. This should be be... Anonymous
01:44 PM Bug #3029 (Won't Fix): config/option parser: Avoid needing to list obscure one-use options in glo...
num_client is only used by ceph-syn, but still needs to be listed in the config_opts list, which a horribly generic n... Anonymous
11:40 AM rgw Documentation #2991: doc: expand/complete RGW Swift API reference
Sorry. Previous update intended for RGW config. This is checked in. Location is: ceph/doc/radosgw/swift. Accessible v... John Wilkins
11:34 AM rgw Documentation #2991 (In Progress): doc: expand/complete RGW Swift API reference
Yehuda needs to review the doc and sign off. Updated doc sent via email. Current location is ceph/doc/radosgw/config-... John Wilkins
10:52 AM Bug #2947 (Resolved): osd: out of order reply
commit:1113a6c56739a56871f01fa13da881dab36a32c4 Sage Weil

08/21/2012

10:49 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
logs: ubuntu@teuthology:/a/teuthology-2012-08-21_02:00:04-regression-testing-testing-basic/5691 Tamilarasi muthamizhan
10:57 AM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
added debugging to kernel ffsb task Sage Weil
06:34 PM Bug #2947 (In Progress): osd: out of order reply
ooof, the saga continues: ubuntu@teuthology:/a/sage-gfoo2/5974 Sage Weil
10:50 AM Bug #2947 (Resolved): osd: out of order reply
commit:4a0704e64a733b7bb14fb4103cd1cd54e4e7da8a Sage Weil
06:03 PM Bug #2954: osd: scrub stat mismatch, got 18/19 objects, 14/15 clones, 22478527/25385282 bytes.
another one. ms failure injection may have contributed.
ubuntu@teuthology:/a/sage-gfoo2/5925
Sage Weil
05:43 PM Bug #3026 (Resolved): ref counting error argonaut
(11:36:55 AM) Sage Weil: -1> 2012-08-21 07:00:24.285153 7ff5abba6700 1 -- 10.214.131.24:6806/20124 --> 10.214.13... Samuel Just
05:42 PM Bug #3025 (Resolved): WaitActingChange
We should not transition to WaitActingChange from Acting due to recovery complete. Samuel Just
05:32 PM rbd Feature #2720: rbd: add children command
First implementation from Josh has edges sanded off, sorta running. Needs better testing and manpage updates. Dan Mick
05:11 PM Feature #1515 (Duplicate): osd: pg split
Sage Weil
04:17 PM rbd Feature #2560: rbd: safe parent deletion
I *think* this is more or less implemented. The commands are "snap protect" and
"snap unprotect", but they behave a...
Dan Mick
03:46 PM RADOS Feature #3011 (Fix Under Review): Remove "pool" terminology from CRUSH
Sage Weil
09:38 AM RADOS Feature #3011: Remove "pool" terminology from CRUSH
agreed.
i'll stick this in the backlog!
Sage Weil
09:32 AM RADOS Feature #3011: Remove "pool" terminology from CRUSH
Since it's a hierarchy of nodes, I'd vote for "root." Also, the term "bucket" is confusing, because we use that term ... John Wilkins
08:15 AM RADOS Feature #3011: Remove "pool" terminology from CRUSH
You're talking about 'pool=default', right? I agree. What term should be use instead for the root of the tree?
'...
Sage Weil
02:38 PM devops Feature #3023 (Closed): juju: automated QA of OpenStack RBD integration
Anonymous
02:38 PM devops Feature #3022 (Closed): juju: automated QA of Ceph
Anonymous
02:36 PM devops Feature #3021 (Closed): juju: change glance to use rbd
Anonymous
02:36 PM devops Feature #3020 (Closed): juju: change nova to use rbd
Anonymous
02:36 PM devops Feature #3019 (Closed): juju: modernize ceph charm, mon & osd bootstrap
Anonymous
02:35 PM devops Feature #3018 (Closed): juju: test deploy of openstack
Anonymous
02:35 PM devops Feature #3017 (Closed): juju: dev env setup
Anonymous
02:13 PM CephFS Bug #2863: client: does not tolerate traceless replies from mds
Sage Weil
02:13 PM CephFS Bug #2863 (Resolved): client: does not tolerate traceless replies from mds
Sage Weil
02:10 PM Feature #2829 (Fix Under Review): report on cluster size/status (for service billing purposes)
Sage Weil
01:46 PM Feature #2829: report on cluster size/status (for service billing purposes)
see wip-mon-report Sage Weil
01:31 PM Cleanup #3016 (Resolved): make ceph osd crush set ${id} osd.${id} not require the ID twice
That is lame and confusing. Greg Farnum
01:01 PM CephFS Bug #1945: blogbench hang on caps
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-08-21_02:00:04-regression-testing-testing-basic/5675 Sage Weil
12:45 PM devops Cleanup #3015 (Resolved): order of arguments should not matter for init-ceph
-c ceph.conf start works
start -c ceph.conf does not.
Boo!
Greg Farnum
12:45 PM Bug #3014 (Resolved): ceph mds set_data_pool pool doesn't fail
If you specify a name instead of a pool ID, it just adds pool id 0! Greg Farnum
12:27 PM Bug #2762: mon crash ceph::__ceph_assert_fail (assertion=0x63d150 "begin->last_committed == last_...
May be clearly reproduced with >500 active clients, e.g. booting vms, and one monitor. Andrey Korolyov
11:21 AM RADOS Bug #3013 (New): doc: Document ceph-osd --mkfs --osd-uuid, --get-osd-uuid, and friends
ceph-osd --mkfs --osd-uuid <uuid> -i 123 ...
--get-osd-fsid and --get-cluster-fsid
Go through the source and lo...
Anonymous
11:02 AM rgw Bug #2961 (Resolved): rgw: bad content range
Sage Weil
08:04 AM rgw Bug #2961: rgw: bad content range
it only with >4G objects. A test like that would just take too long. Maybe it's possible to put it as an optional tes... Yehuda Sadeh
10:14 AM Feature #2668 (Resolved): Build linux-tools-common package for perf
Sage Weil
09:55 AM RADOS Feature #3012 (New): come up with some way to do gossip among daemons on a host
In discussion, it occurred to me that really OSDs on a host ought to gossip about certain kinds of information (altho... Greg Farnum
09:07 AM Feature #3010 (In Progress): Make it easy to find a list of data locations from a cephfs file
is this what they're after?... Sage Weil
08:50 AM Bug #3005 (Resolved): bootstrapped mon crashes after win_standalone_election
logs on #3006
also reproduced w/ vstart by doing 'ceph log foo &' every .01 seconds in a loop, and then removing m...
Sage Weil
08:50 AM Bug #3006 (Duplicate): mon: removing a running monitor can crash ceph
see #3005 Sage Weil

08/20/2012

09:42 PM rbd Bug #2937: btrfs filesystem on rbd device kernel BUG writing large file
This reproduces on plana. Details: two machine cluster, one monitor, two OSDs:
roles:
- [mon.0, osd.0]
- [osd.1...
Dan Mick
09:28 PM RADOS Feature #3011 (Resolved): Remove "pool" terminology from CRUSH
Users get confused and conflate RADOS pools and CRUSH pools. I don't think we actually use that term in many places i... Greg Farnum
08:58 PM Feature #3010 (Resolved): Make it easy to find a list of data locations from a cephfs file
Large cluster designers would like to be able to get as much information about a CephFS file's location as possible. ... Greg Farnum
07:12 PM Bug #3009 (Resolved): if you mkfs an OSD with --filestore-xattr-use-omap and then don't start the...
Apparently we auto-detect filestore-xattr-use-omap, but we don't store it anywhere in the OSD's data directory. Which... Greg Farnum
06:57 PM RADOS Cleanup #3008 (New): Consider making MLog messages not require MON_CAP_X
Right now, the permissions for an incoming MLog are checked against PAXOS_LOG, MON_CAP_X. This means that the MDS and... Greg Farnum
06:01 PM Bug #3006 (Duplicate): mon: removing a running monitor can crash ceph
While rewriting the ceph add/remove monitor documentation (http://ceph.com/docs/master/ops/manage/grow/mon/), I added... John Wilkins
05:18 PM Bug #3005 (Resolved): bootstrapped mon crashes after win_standalone_election
I created the mon from #3004 and got it running correctly. It crashed since it won without being rank 0.... Greg Farnum
05:17 PM Bug #3001 (Resolved): mkcephfs: -a fails if only "host=localhost" sections seen in ceph.conf
... Anonymous
04:22 PM Bug #3001: mkcephfs: -a fails if only "host=localhost" sections seen in ceph.conf
And for the record, for now I'm recommending this: don't use "host=localhost", put in the actual host name. Anonymous
04:21 PM Bug #3001 (Resolved): mkcephfs: -a fails if only "host=localhost" sections seen in ceph.conf
This was reported earlier on the list as http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/8051/focus=8092 a... Anonymous
05:13 PM Bug #3004 (Resolved): bootstrapped initial monitor can't find its own keyring with relative paths
I ran the following sequence of commands, which I sourced from vstart (while extracting the ceph.conf):... Greg Farnum
05:05 PM rbd Documentation #2992 (In Progress): doc: RBD parent/child snapshot
Ross Turk
11:31 AM rbd Documentation #2992 (Need More Info): doc: RBD parent/child snapshot
Ross Turk
04:54 PM Bug #3002 (Resolved): ceph-authtool: --print does not work
this already got fixed in master, it looks like (--print-key instead of --print). don't think it's worth backporting... Sage Weil
04:42 PM Bug #3002 (Resolved): ceph-authtool: --print does not work
... Greg Farnum
04:53 PM Bug #3003 (Resolved): mon: race/crash after removing monitors
commit:d521dde9b565098765a20dd001d8650ad02c2bef Sage Weil
04:47 PM Bug #3003 (Resolved): mon: race/crash after removing monitors
... Sage Weil
03:51 PM Bug #2691 (In Progress): osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
Recent log: ubuntu@teuthology:/a/teuthology-2012-08-20_00:00:04-regression-next-testing-basic/4822... Tamilarasi muthamizhan
03:36 PM Feature #3000 (Resolved): osd: balance recovery vs client io
Sage Weil
03:35 PM Linux kernel client Bug #1347 (Can't reproduce): forced unmount kernel bug
Sage Weil
03:34 PM Bug #2451 (Can't reproduce): qa: networking doesn't always start after reboot
i havne't seen this in a long time. Sage Weil
03:26 PM rgw Bug #2961 (In Progress): rgw: bad content range
Can we add an s3tests for this? Sage Weil
03:26 PM rgw Bug #2961 (Resolved): rgw: bad content range
Sage Weil
03:16 PM Feature #2668 (In Progress): Build linux-tools-common package for perf
Sage Weil
03:06 PM Bug #2999 (Resolved): osd: msgr crash in OSD::complete_notify
Logs: ubuntu@teuthology:/a/teuthology-2012-08-17_19:00:07-regression-master-testing-gcov/3549... Tamilarasi muthamizhan
03:05 PM Bug #2956 (Resolved): osd:FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
commit:dd4c1dc9f9dae43e4761caca049bfe7361d9ebfb Sage Weil
12:35 PM Bug #2956: osd:FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
Sage Weil
11:17 AM Bug #2956: osd:FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
... Sage Weil
02:28 PM Documentation #2998 (Can't reproduce): doc: validate install docs on ubuntu server
Ross Turk
02:00 PM RADOS Bug #2874: apparent CRUSH mapping failure
I'd like to report that I was seeing what I believe to be the same issue (at least the symptoms were the same: a 3-OS... Alex Moore
01:58 PM Bug #2761: osd: failed to recover before timeout expired
just reproduced this one with osd and msgr logs:... Sage Weil
12:56 PM Bug #2761: osd: failed to recover before timeout expired
Logs: ubuntu@teuthology:/a/teuthology-2012-08-20_04:00:05-regression-stable-master-basic/5044 Tamilarasi muthamizhan
01:31 PM Feature #2911 (Duplicate): osd: Restrict recovery when the OSD full list is nonempty
Sage Weil
01:31 PM Feature #1637 (Duplicate): OSDs running full take down other OSDs
Sage Weil
01:09 PM Bug #2924 (Resolved): doc: Adjust for mon. key being in external keyring
ceph auth get mon. > /tmp/monkey Sage Weil
01:04 PM Bug #2947: osd: out of order reply
More Logs: ubuntu@teuthology:/a/teuthology-2012-08-19_02:00:05-regression-testing-testing-basic/4288 Tamilarasi muthamizhan
11:38 AM Bug #2947: osd: out of order reply
... Sage Weil
12:35 PM rbd Bug #2967 (Resolved): librbd: cls_rbd.parents unit test failure
Sage Weil
11:36 AM rbd Bug #2967: librbd: cls_rbd.parents unit test failure
Sage Weil
10:59 AM rbd Bug #2967: librbd: cls_rbd.parents unit test failure
I think this is resolved by the about-to-be-merged layering code; testing is in progress Dan Mick
11:53 AM Bug #2997 (Resolved): ceph-mon --mkfs allows you to create one without an id which then crashes o...
And that sucks, especially when it crashes in a demo and you don't know why. Greg Farnum
11:41 AM rgw Documentation #2483 (Fix Under Review): doc: radosgw api diffs to swift
Ross Turk
11:40 AM rgw Documentation #2483: doc: radosgw api diffs to swift
Can you check the latest master build of docs and see if this has been updated to your satisfaction? Thanks! Ross Turk
11:30 AM Documentation #2978 (Need More Info): doc: write RADOS restore from backup procedure
Ross Turk
11:30 AM Documentation #2977 (Need More Info): doc: write RADOS backup procedure
Ross Turk
11:30 AM Documentation #2979 (Need More Info): doc: write doc on how to use / rollback to RADOS snapshots
Ross Turk
11:30 AM devops Documentation #2975 (Need More Info): doc: update docs to match new ceph-disk-prepare syntax
Ross Turk
11:29 AM Documentation #2995 (In Progress): doc: restructure documentation (its getting messy!)
Ross Turk
11:23 AM Documentation #2981 (In Progress): doc: write add/remove a monitor
Ross Turk
11:22 AM Documentation #2970 (In Progress): doc: expand/complete osd settings reference
Ross Turk
11:22 AM Documentation #2971 (In Progress): doc: expand/complete mon settings reference
Ross Turk
11:22 AM Documentation #2973 (In Progress): doc: expand/complete ceph general settings
Ross Turk
10:56 AM Feature #2840: mon: $mon_data/cluster_fsid file
wip-mon-mkfs Sage Weil
10:55 AM Feature #2840 (Fix Under Review): mon: $mon_data/cluster_fsid file
Sage Weil
09:22 AM Bug #2803 (Can't reproduce): filer: probe crash
Sage Weil
09:21 AM CephFS Bug #2959 (Resolved): mds: returns null dentry on getattr
Sage Weil
09:20 AM Linux kernel client Bug #2936 (Resolved): Remounting cephfs with non-existing path causes kernel panic
Sage Weil

08/19/2012

04:10 PM Documentation #2996 (Resolved): doc: write install Ceph with RPMs doc
Ross Turk
04:09 PM Documentation #2995 (Resolved): doc: restructure documentation (its getting messy!)
Ross Turk
04:07 PM Documentation #2994 (Resolved): doc: expand/complete librados API doc
Ross Turk
04:04 PM rgw Documentation #2993 (Resolved): doc: write quick RGW guide (if feasible)
Ross Turk
04:03 PM rbd Documentation #2992 (Resolved): doc: RBD parent/child snapshot
Ross Turk
03:59 PM rgw Documentation #2991 (Resolved): doc: expand/complete RGW Swift API reference
The reference for the [client.radosgw.gateway] sections of ceph.conf need to be completed by John Wilkins and reviewe... Ross Turk
03:58 PM rgw Documentation #2990 (Resolved): doc: expand/complete RGW S3 API reference
Ross Turk
03:57 PM rgw Documentation #2989 (Resolved): doc: write RGW troubleshooting
Ross Turk
03:57 PM CephFS Documentation #2988 (Resolved): doc: write MDS troubleshooting
Ross Turk
03:57 PM Documentation #2987 (Rejected): doc: write MON troubleshooting
Ross Turk
03:57 PM Documentation #2986 (Rejected): doc: write OSD troubleshooting
Ross Turk
03:56 PM Documentation #2985 (Rejected): doc: write install troubleshooting
Ross Turk
03:56 PM Documentation #2984 (Rejected): doc: write performance tuning
Ross Turk
03:56 PM Documentation #2983 (Rejected): doc: write performance monitoring
Ross Turk
03:56 PM CephFS Documentation #2982 (Resolved): doc: write add/remove a metadata server
Ross Turk
03:52 PM Documentation #2981 (Resolved): doc: write add/remove a monitor
Ross Turk
03:52 PM Documentation #2980 (Resolved): doc: write upgrading Ceph version
Ross Turk
03:52 PM Documentation #2979 (Closed): doc: write doc on how to use / rollback to RADOS snapshots
Ross Turk
03:51 PM Documentation #2978 (Closed): doc: write RADOS restore from backup procedure
Ross Turk
03:51 PM Documentation #2977 (Closed): doc: write RADOS backup procedure
Ross Turk
03:51 PM devops Documentation #2976 (Closed): doc: update chef doc to git clone with http, not ssh
Ross Turk
03:50 PM devops Documentation #2975 (Rejected): doc: update docs to match new ceph-disk-prepare syntax
Ross Turk
03:50 PM devops Documentation #2974 (Resolved): doc: update chef docs for mon key distribution
Ross Turk
03:50 PM Documentation #2973 (Resolved): doc: expand/complete ceph general settings
Ross Turk
03:49 PM rgw Documentation #2972 (Resolved): doc: expand/complete rgw settings reference
Ross Turk
03:49 PM Documentation #2971 (Resolved): doc: expand/complete mon settings reference
Ross Turk
03:48 PM Documentation #2970 (Resolved): doc: expand/complete osd settings reference
Ross Turk
03:47 PM CephFS Documentation #2969 (Resolved): doc: expand/complete mds settings reference
Ross Turk
03:46 PM Documentation #2968 (Resolved): doc: complete architecture section
Ross Turk
02:23 PM rbd Feature #2850 (Duplicate): libceph: support multi-operation transactions
Sage Weil
01:07 PM Bug #2784 (Can't reproduce): osd hit suicide timeout
Sage Weil
12:49 PM Bug #2856 (Resolved): osd: bound size of transactions trimming old osdmaps
Sage Weil
09:13 AM Linux kernel client Bug #2936: Remounting cephfs with non-existing path causes kernel panic
There are patches to do that pending, but i haven't pushed them to the tree yet because a regression in 3.6-rc1 break... Sage Weil
09:10 AM Linux kernel client Bug #2936: Remounting cephfs with non-existing path causes kernel panic
I see the change for #2959 is in the mds.
However, the kernel still shouldn't hang on bad data from the mds, so I ...
Bartek Kania
08:32 AM Linux kernel client Bug #2936: Remounting cephfs with non-existing path causes kernel panic
this is the same issue Yan hit, #2959. Sage Weil
09:09 AM rbd Bug #2532 (Resolved): rbd command allows passing in -K </path/to/secret>, but long version of (--...
Sage Weil
09:05 AM rbd Bug #2967 (Resolved): librbd: cls_rbd.parents unit test failure
... Sage Weil

08/18/2012

03:29 PM Feature #2428 (Resolved): auth: revise auth config params
Sage Weil

08/17/2012

04:18 PM Bug #2954: osd: scrub stat mismatch, got 18/19 objects, 14/15 clones, 22478527/25385282 bytes.
logs: ubuntu@teuthology:/a/teuthology-2012-08-17_00:00:25-regression-next-testing-basic/2877 Tamilarasi muthamizhan
04:09 PM rbd Bug #2958 (Resolved): librbd: discard can return -ENOENT
Sage Weil
04:08 PM rbd Bug #2958 (Fix Under Review): librbd: discard can return -ENOENT
Sage Weil
03:35 PM Bug #2960 (Resolved): ceph osd create claims you can specify '<osd-id>'; really means UUID. Could...
just merged a fix for this Sage Weil
11:38 AM Bug #2960 (Resolved): ceph osd create claims you can specify '<osd-id>'; really means UUID. Could...
I think we should consider a global pass making "id" clearer in context, but the
ceph osd create usage message, name...
Dan Mick
03:08 PM rgw Bug #2961 (Resolved): rgw: bad content range
Partial download of large file (> 4G), the content range is bad:... Yehuda Sadeh
11:59 AM Bug #2947 (In Progress): osd: out of order reply
Tamilarasi muthamizhan
11:28 AM Bug #2761: osd: failed to recover before timeout expired
logs: ubuntu@teuthology:/a/teuthology-2012-08-17_02:00:04-regression-testing-testing-basic/3038 Tamilarasi muthamizhan
11:27 AM Bug #2955: monitors failed to open new election
logs: ubuntu@teuthology:/a/teuthology-2012-08-17_02:00:04-regression-testing-testing-basic/2973 Tamilarasi muthamizhan
08:50 AM CephFS Bug #2959 (Resolved): mds: returns null dentry on getattr
the kclient open_root_dentry issues a getattr request like #1/some/path, but the mds must not return a dentry in the ... Sage Weil

08/16/2012

09:14 PM CephFS Bug #1945: blogbench hang on caps
... Sage Weil
05:10 PM rbd Documentation #2670 (Resolved): Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
Sage Weil
05:01 PM rbd Bug #2958 (Resolved): librbd: discard can return -ENOENT
Sometimes discard tries to remove nonexistent objects, and does not translate the -ENOENT to 0 for its callers. This ... Josh Durgin
04:55 PM Bug #2957 (Resolved): osd: crash in PG::gen_prefix()
Sage Weil
03:30 PM Bug #2957 (Resolved): osd: crash in PG::gen_prefix()
... Sage Weil
04:45 PM rbd Feature #2719 (In Progress): librbd: provide functions for listing parents and their children
Josh Durgin
04:43 PM rbd Feature #2723 (Resolved): librbd: protect/unprotect as appropiate during cloning
Josh Durgin
04:43 PM rbd Subtask #2606 (Resolved): librbd layering: copyup on missing child object
Josh Durgin
04:43 PM rbd Feature #2722 (Resolved): cls_rbd: add class methods to get/set protected status
Josh Durgin
04:43 PM rbd Subtask #2605 (Resolved): librbd layering: guard writes
Josh Durgin
04:43 PM rbd Subtask #2604 (Resolved): librbd layering: read path
Josh Durgin
04:43 PM rbd Subtask #2603 (Resolved): librbd layering: open parent on open
Josh Durgin
04:43 PM rbd Feature #2562 (Resolved): librbd: open parent images, read path, write path
Josh Durgin
04:43 PM rbd Feature #2607 (Resolved): librbd: copyup helper
Josh Durgin
04:43 PM rbd Feature #2561 (Resolved): rbd: copyup command
Josh Durgin
04:42 PM rbd Feature #2559 (Resolved): cls_rbd: copyup method
Josh Durgin
02:15 PM Bug #2954: osd: scrub stat mismatch, got 18/19 objects, 14/15 clones, 22478527/25385282 bytes.
several more failures in /a/sage-a3 to look at. Sage Weil
10:11 AM Bug #2954 (Resolved): osd: scrub stat mismatch, got 18/19 objects, 14/15 clones, 22478527/2538528...
... Sage Weil
01:46 PM rbd Bug #2948: rbd: fails to close image on error
This affects operations that fail partway through. One example is:
rbd export <image> <existing-file>
export err...
Dan Mick
10:41 AM rbd Bug #2948: rbd: fails to close image on error
Josh Durgin
01:30 PM Bug #2946 (Resolved): osd: build fails on g++ 4.7
Sage Weil
01:29 PM Bug #2823 (Duplicate): osd: out of order ACKs
Sage Weil
01:21 PM Bug #2947: osd: out of order reply
Sage Weil
12:04 PM Bug #2761: osd: failed to recover before timeout expired
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-16_02:00:06-regression-testing-testing-basic/2211 Tamilarasi muthamizhan
11:32 AM Bug #2956 (Resolved): osd:FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
Logs: ubuntu@teuthology:/a/teuthology-2012-08-15_19:00:16-regression-master-testing-gcov/1878... Tamilarasi muthamizhan
11:29 AM Bug #2873 (Resolved): Stack trace thrown when using obsync
commit:47b24c0562bcb44964a0b8f6c4847bb0f05924e0 in stable-next
commit:5962a9dde051c95b7f39e60dcd16b339392685b8 in ne...
Dan Mick
11:18 AM Bug #2955 (Can't reproduce): monitors failed to open new election
logs: ubuntu@teuthology:/a/teuthology-2012-08-16_00:00:15-regression-next-testing-basic/2077 Tamilarasi muthamizhan

08/15/2012

06:33 PM rbd Bug #2950 (Resolved): ObjectCacher: leaks memory
commit:825f7334eef7cc69c6f439c21dd0bbb215dbf09d
it wasn't the buffers, it was some BufferHeads that had references...
Sage Weil
11:41 AM rbd Bug #2950 (Resolved): ObjectCacher: leaks memory
As reported in http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7746 Josh Durgin
06:06 PM Bug #2922: mkcephfs fails with error "read: arg count"
Hmm, my testing of the modifications has a little buggy itself sorry. But after more careful analysis I can confirm t... Mark Kirkwood
05:55 PM Bug #2922 (Resolved): mkcephfs fails with error "read: arg count"
ah, it's just a stupid bash vs dash thing with the 'read' command. i tested on debian (bash), breaks on ubuntu. pus... Sage Weil
05:41 PM Bug #2922: mkcephfs fails with error "read: arg count"
just fyi you can enclose things in pre tags to make redmine skip its own formatting:... Josh Durgin
05:39 PM Bug #2922 (In Progress): mkcephfs fails with error "read: arg count"
building right now to test this out... i could have sworn i tested the directory exists situation, but i guess not!
...
Sage Weil
05:36 PM Bug #2922: mkcephfs fails with error "read: arg count"
Sorry that last one does *not* work properly either. Mark Kirkwood
05:29 PM Bug #2922: mkcephfs fails with error "read: arg count"
This might be cleaner (I'll avoid a diff as they seem to get mangled):
Replacing:
if test -d $mon_data && ! f...
Mark Kirkwood
03:19 PM Bug #2922: mkcephfs fails with error "read: arg count"
Hmm - I don't think so:
The amended code works ok if the directory does not exist, but fails if it exists and is e...
Mark Kirkwood
02:13 PM Feature #2953 (Resolved): append() in librados is not exposed to python API
the append to an object is not available at the pyton API level and needs to be implemented. Evan Felix
11:51 AM rbd Feature #2952 (Resolved): librbd: use generic rados locking class
Replace calls to cls_rbd's locking methods with calls to the generic lock class. Josh Durgin
11:49 AM rbd Feature #2951 (Resolved): cls_rbd: remove locking methods
Remove the unused cls_rbd locking methods, and merge the tests with the cls_lock tests. Josh Durgin
10:27 AM rbd Bug #2948 (Resolved): rbd: fails to close image on error
calling exit() doesn't run the Image destructor, which leads to the watch on the header sticking around. After that, ... Josh Durgin
10:10 AM rbd Feature #2723 (Fix Under Review): librbd: protect/unprotect as appropiate during cloning
Josh Durgin
10:09 AM rbd Feature #2722 (Fix Under Review): cls_rbd: add class methods to get/set protected status
Josh Durgin
10:09 AM rbd Feature #2718 (Fix Under Review): librbd: map parent -> child in a per-pool rbd_children object w...
Josh Durgin
10:09 AM rbd Feature #2717 (Fix Under Review): cls_rbd: add methods for maintaining mapping from parent to chi...
Josh Durgin
10:09 AM rbd Feature #2562 (Fix Under Review): librbd: open parent images, read path, write path
Josh Durgin
10:09 AM rbd Feature #2562 (Need More Info): librbd: open parent images, read path, write path
Josh Durgin
10:08 AM rbd Subtask #2605 (Fix Under Review): librbd layering: guard writes
Josh Durgin
10:08 AM rbd Subtask #2604 (Fix Under Review): librbd layering: read path
Josh Durgin

08/14/2012

05:41 PM Bug #2947 (Resolved): osd: out of order reply
triggered by thrashing by this job:... Sage Weil
04:45 PM Bug #2922 (Resolved): mkcephfs fails with error "read: arg count"
commit:24a26c627400d191bbb07cdd3ecfa644c9e313eb Sage Weil
04:28 PM Bug #2946 (Resolved): osd: build fails on g++ 4.7
... Sage Weil
04:06 PM Feature #2918 (Resolved): OSD ID numbers determine OSD count and thus default pg_cnt
Sage Weil
02:14 PM Feature #2918 (Fix Under Review): OSD ID numbers determine OSD count and thus default pg_cnt
Sage Weil
02:58 PM Feature #2942 (Resolved): mon: throttle client, server connections
Sage Weil
02:34 PM Feature #2619 (Resolved): filejournal: instrument with perfcounters
commit:9fc79584728f87938d13757d5176c5d19d3ca2cb Sage Weil
02:07 PM Feature #2940 (Resolved): daemons do not print out version to log on startup
Sage Weil
12:18 PM Feature #2940: daemons do not print out version to log on startup
Sage Weil
01:58 PM rbd Bug #2777 (Resolved): qemu: report discard support
Josh Durgin
01:18 PM Bug #2945 (Won't Fix): package upgrade from v0.46 to v0.48argonaut fails
I saw this once but assumed I had broken dependencies with my version mangling, but then it came up during a third pa... Greg Farnum
01:13 PM RADOS Subtask #2793 (Resolved): osd: require tunable feature if current osdmap uses non-default tunables
Sage Weil
01:13 PM RADOS Subtask #2792 (Resolved): mon: require tunable feature bit if current osdmap uses non-default tun...
Sage Weil
01:13 PM RADOS Feature #2705 (Resolved): crush: graceful transition to new default tunables
Sage Weil
12:18 PM RADOS Feature #2705 (In Progress): crush: graceful transition to new default tunables
Sage Weil
12:19 PM Feature #2320 (Duplicate): mon: detect and throttle osd flapping
Sage Weil
12:18 PM Feature #2742 (In Progress): qa: ms socket inject failures in regression suite
Sage Weil
12:14 PM Feature #1754 (Resolved): qa: run other suites nightly as well
Sage Weil
12:13 PM Feature #1514 (Duplicate): filestore: api to repartition a collection
Sage Weil
12:12 PM Feature #2440: osd: understand btrfs performance
Sage Weil
12:12 PM Feature #2440 (Won't Fix): osd: understand btrfs performance
Sage Weil
12:12 PM Feature #2564 (Resolved): teuthology: install kernels from local dir
Sage Weil
11:45 AM Feature #2944 (Duplicate): mon: dynamically adjust heartbeat grace
Basically:
1) Keep track of when an OSD boots if it reports itself as fresh or as
wrongly-marked-down. Maintain the...
Sage Weil
11:44 AM Feature #2943 (Resolved): mon: norecovery and/or nobackfill
Sage Weil
11:42 AM Cleanup #2763 (Resolved): move rbd locking infrastructure to a separate objclass
Sage Weil
11:42 AM Feature #2768 (Resolved): teuthology: make workunit task work on different branch/sha1 etc
Sage Weil
11:41 AM Feature #2857 (Resolved): compile non-production builds with -fno-omit-frame-pointer
Sage Weil
09:37 AM Bug #2761: osd: failed to recover before timeout expired
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-13_19:00:07-regression-master-testing-gcov/108 Tamilarasi muthamizhan

08/13/2012

09:48 PM Bug #2922: mkcephfs fails with error "read: arg count"
FWIW - this seems to happen even if the mon directory does not exist - there should probably be a check of the form:
...
Mark Kirkwood
07:54 PM Bug #2938 (Resolved): ceph-osd --mkfs failure to create journal is logged with dout(0), probably ...
commit:294c25bb37aa39caacee51cc405a1f2deebb6331
Dan Mick
11:09 AM Feature #2942 (Resolved): mon: throttle client, server connections
Sage Weil
10:57 AM rgw Feature #2941 (Resolved): rgw: improve streaming read performance
Sage Weil
10:51 AM Bug #2823: osd: out of order ACKs
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-11_00:00:08-regression-next-testing-basic/6401 Tamilarasi muthamizhan
10:46 AM Feature #2940 (Resolved): daemons do not print out version to log on startup
I imagine this applies to the other daemons too, but maybe not. Make it print out the version so we can be sure it's ... Greg Farnum
09:28 AM devops Feature #2939 (Rejected): chef: Write up how cluster shrinking should work
Expanding the cluster is pretty trivial, and practically identical with initial install, but shrinking needs a little... Anonymous

08/12/2012

10:30 AM CephFS Bug #2444: null pointer deference in ceph_d_prune inside kvm
problem doesent seem to be reproductible after upgrading to 3.5.0-9-generic (Ubuntu Quantal) Alexandre Dupouy
03:38 AM rbd Bug #2937: btrfs filesystem on rbd device kernel BUG writing large file
I activated some extra debugging.
This appears just before the BUG:...
Bartek Kania

08/11/2012

06:33 PM Bug #2887: pjd open/08.t failed test 2
this is an upstream fuse regression in the 3.6-rc1 kernel. reported to miklos and the fuse list. Sage Weil
06:28 PM Linux kernel client Bug #2868 (Resolved): kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
Sage Weil

08/10/2012

08:31 PM Bug #2919 (Fix Under Review): ceph kernel module looks for :/ in path, but / stripped by precise ...
Sage Weil
08:29 PM Bug #2938: ceph-osd --mkfs failure to create journal is logged with dout(0), probably should be derr
yeah, just change it to derr Sage Weil
06:34 PM Bug #2938 (Resolved): ceph-osd --mkfs failure to create journal is logged with dout(0), probably ...
A customer mistakenly named a directory as his osd journal location; the failure printed to his terminal with no hint... Dan Mick
08:25 PM Linux kernel client Bug #2801 (Resolved): msgr crash in ceph_msg_new
Sage Weil
08:24 PM Linux kernel client Bug #2392 (Resolved): First read of symlink after ceph filesystem mounted gives error
Sage Weil
04:26 PM Bug #2887: pjd open/08.t failed test 2
ubuntu@teuthology:/a/teuthology-2012-08-09_00:00:04-regression-next-testing-basic/5752 Tamilarasi muthamizhan
03:59 PM Bug #2887: pjd open/08.t failed test 2
ubuntu@teuthology:/a/teuthology-2012-08-09_02:00:13-regression-testing-testing-basic/5857 Tamilarasi muthamizhan
01:59 PM rbd Bug #2937 (Duplicate): btrfs filesystem on rbd device kernel BUG writing large file
Writing a large file with dd on btrfs filesystem mounted from rbd device causes kernel bug
Stock kernel 3.5.1, con...
Bartek Kania
01:48 PM Linux kernel client Bug #2936 (Resolved): Remounting cephfs with non-existing path causes kernel panic
Steps to reproduce:
First mount the root somewhere...
Bartek Kania
10:38 AM Bug #2913 (Resolved): monclient: asserts when no monitor addresses found due to dns failure
Fortunately I was wrong about the string splitting - that was just a confusing message from the parsing stage.
The...
Josh Durgin
10:09 AM rgw Feature #771: rgw: POST
Support the S3 POST object operation referenced in
http://docs.amazonwebservices.com/AmazonS3/latest/API/RESTObje...
caleb miles
09:40 AM rgw Bug #2935 (Resolved): rgw: radosgw-admin bucket link clobbers index
radosgw-admin bucket unlink, then radosgw-admin bucket link overrides the bucket index, so objects cannot be listed a... Yehuda Sadeh

08/09/2012

04:06 PM Feature #2934: crush: create a visualizer for crush maps
'ceph osd tree' provides a good start on the command line, but it'd be nice to have that in the crushtool as well if ... Josh Durgin
04:04 PM Feature #2934 (New): crush: create a visualizer for crush maps
The language used in crush maps is very well defined and
hierarchical. I don't know how to do this sort of thing,
...
Alex Elder
03:55 PM rbd Bug #2933 (Resolved): rbd: bio_pair leak in bio_chain_clone()
Guangliang Zhao <gzhao@suse.com> pointed out this problem on the
mailing list. Here's the latest edition of his pro...
Alex Elder
02:18 PM devops Feature #2932 (Rejected): chef: logstash integration
Anonymous
02:18 PM devops Feature #2931 (Rejected): chef: StatsD integration
Anonymous
01:54 PM rgw Feature #2499 (Resolved): rgw: ability to delete users without first emptying and deleting all bu...
done, commit:45f7f0602c90073af27041f92166724ca9472197. Yehuda Sadeh
01:53 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
object removal done, commit:cc8eac2427c745e154ad40eeb84ef28dbed99d36
bucket removal done, commit:45f7f0602c90073af27...
Yehuda Sadeh
01:32 PM rgw Bug #2504 (Resolved): rgw: use multiple notifications objects
Done, commit:b28db08ea8b84ec9f1d2df88ac4edd6aea0ba7d4 Yehuda Sadeh
12:29 PM Bug #2924 (Resolved): doc: Adjust for mon. key being in external keyring
This doc is outdated
http://ceph.com/docs/master/ops/manage/grow/mon/#adding-a-monitor
as per
http://thread.gmane....
Anonymous
11:13 AM CephFS Bug #2444: null pointer deference in ceph_d_prune inside kvm
same bug here with Ceph 0.49 on Ubuntu 12.04 LTS (GNU/Linux 3.2.0-27-generic x86_64) Alexandre Dupouy
10:58 AM rgw Feature #2923 (Resolved): rgw: non hard-coded pool names
Don't have pool names hard coded, make them configurable. Yehuda Sadeh
10:44 AM rgw Bug #2665 (Resolved): rest-bench hangs periodically
This was fixed a while ago. Yehuda Sadeh

08/08/2012

04:58 PM Bug #2922 (Resolved): mkcephfs fails with error "read: arg count"
Branch: wip-auth
ceph version 0.49-306-gfc3681f (commit:fc3681f59c4f49298f5a7a5172c30be63068c330)
tamil@tamil-Vir...
Tamilarasi muthamizhan
04:08 PM rgw Bug #2841 (Resolved): rgw: fix usage trim
Fixed, commit:6bc1067fc878cbfb6761146cb154c2985c9d9bd7 and commit:04a0eacd92b0c923cb9d1efc7d751a05d544dc85 Yehuda Sadeh
03:35 PM rgw Feature #2869 (Resolved): rgw: expand date format support
Fixed, commit:074c3c0fe0c005e54f4776c60463a16305dbab10 Yehuda Sadeh
03:34 PM rgw Bug #2879 (Resolved): rgw: xml parser doesn't work correctly with escape sequences
Fixed, commit:03b787e0ee1d94e054cfb17059e5e108a7162d7b Yehuda Sadeh
03:34 PM rgw Bug #2878 (Resolved): rgw: chunked encoding for POST requests (e.g., complete multipart uploads)
Fixed, commit:d39ea1d4b51afdbbd51254ff41c8285e8f5697df. Yehuda Sadeh
03:33 PM rgw Bug #2877 (Resolved): rgw: ETag parsing in complete multipart upload should xml decode ETag
Fixed, commit:3809e34448e47d7baa02d7a0f9240494aba0e337. Yehuda Sadeh
02:06 PM Bug #2845 (Resolved): mkcephfs hasn't learned about new default keyring locations in argonaut
fixed, commit:96b1a496cdfda34a5efdb6686becf0d2e7e3a1c0 Sage Weil
12:48 PM Bug #2875 (Resolved): osd: pg stuck in GetLog
Sage Weil
12:48 PM Bug #2834 (Resolved): osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == ...
hasn't come up recently Sage Weil
11:11 AM Bug #2887: pjd open/08.t failed test 2
Logs: ubuntu@teuthology:/a/teuthology-2012-08-06_00:00:02-regression-next-testing-basic/5012 Tamilarasi muthamizhan
10:03 AM Bug #2887: pjd open/08.t failed test 2
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-08_00:00:07-regression-next-testing-basic/5542 Tamilarasi muthamizhan
10:48 AM Bug #2761: osd: failed to recover before timeout expired
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-08_00:00:07-regression-next-testing-basic/5616... Tamilarasi muthamizhan
10:33 AM rgw Bug #2915: rgw: copy of large object times out
workaround: bump up fastcgi timeout Yehuda Sadeh
10:11 AM Feature #2921 (Rejected): doc: Provide epub docs
Sphinx supports it. Current output seems to only include the top-level file and indexes, skipping most of the content... Anonymous
10:04 AM Feature #2920 (Rejected): doc: Provide PDF docs
Sphinx supports it, but we'd need to fix other parts of our toolchain.
To see where we are:
1. add this patch:
...
Anonymous

08/07/2012

05:56 PM Bug #2919 (Resolved): ceph kernel module looks for :/ in path, but / stripped by precise mountall
I think this is really a bug in mountall (see https://bugs.launchpad.net/ubuntu/+source/mountall/+bug/809221), but it... Dan Mick
05:42 PM Feature #2918 (Resolved): OSD ID numbers determine OSD count and thus default pg_cnt
An IRC user (maelfius) had a problem with a 1-monitor, 3-OSD cluster; the monitor chewed up all memory before it star... Dan Mick
12:50 PM rgw Bug #2916: radosgw does not check command line options for correctness
that's a generic ceph command lines parsing issue Yehuda Sadeh
11:52 AM rgw Bug #2916 (Resolved): radosgw does not check command line options for correctness
It is possible to pass any command line option to radosgw without error. For example
./radosgw -c /home/caleb/cep...
caleb miles
12:49 PM rgw Bug #2915: rgw: copy of large object times out
The problem is that apache is timing out. We should return an early 200 and encode any error in the response code, as... Yehuda Sadeh
10:34 AM rgw Bug #2915 (Resolved): rgw: copy of large object times out
Yehuda Sadeh
09:26 AM devops Feature #2808 (Rejected): crowbar: upgrade to fred3 (get bind bug fix)
Looks like upstream Crowbar is still buggy with regard to DNS. Anonymous
06:37 AM Bug #2913: monclient: asserts when no monitor addresses found due to dns failure
I am using 0.48argonaut-1precise. Jeff Strunk

08/06/2012

04:22 PM Bug #2914 (Resolved): librados set_complete_callback, set_safe_callback clobber each other's argu...
Samuel Just
02:46 PM Bug #2913: monclient: asserts when no monitor addresses found due to dns failure
hmm, looking closer that's a second bug - it's not splitting 'thinkmate3:6789;thinkmate4:6789' into separate addresse... Josh Durgin
02:28 PM Bug #2913: monclient: asserts when no monitor addresses found due to dns failure
I'm not so sure this is a DNS issue. Here is how name service is set up on my ceph/kvm test cluster.
On each node,...
Jeff Strunk
01:58 PM Bug #2913 (Resolved): monclient: asserts when no monitor addresses found due to dns failure
This should be an error returned up to the user, not an assert.
From https://www.redhat.com/archives/libvirt-users...
Josh Durgin
02:16 PM Bug #2887: pjd open/08.t failed test 2
recent logs: ubuntu@teuthology:/a/teuthology-2012-08-06_02:00:02-regression-testing-testing-basic/5117 Tamilarasi muthamizhan
09:53 AM Feature #2911 (Duplicate): osd: Restrict recovery when the OSD full list is nonempty
See the conversation at http://www.spinics.net/lists/ceph-devel/msg08010.html
It would be nice if we could somehow...
Greg Farnum

08/03/2012

06:31 PM Bug #2909 (Resolved): the rados_trunc function did not get implemented in rados.py Python
Thanks! Added in commit:43291951fad241a6d3f8b8daa37d3665c9d842d6, with a simple test and spacing normalized to the re... Josh Durgin
03:58 PM Bug #2909: the rados_trunc function did not get implemented in rados.py Python
Yes you may. Evan Felix
03:35 PM Bug #2909: the rados_trunc function did not get implemented in rados.py Python
That looks good to me. Can I add your signed-off-by to the patch? Josh Durgin
01:53 PM Bug #2909 (Resolved): the rados_trunc function did not get implemented in rados.py Python
This code seems to work in the Ioctx class:
def trunc(self,key,size):
self.require_ioctx_open()
...
Evan Felix
02:23 PM devops Feature #2910: crowbar: Use JBOD mode for ceph-osd
There's a map in the deployer object from role name to BIOS and RAID configuration to set on the node. We can add cep... Anonymous
02:22 PM devops Feature #2910 (Closed): crowbar: Use JBOD mode for ceph-osd
Anonymous
11:38 AM Bug #2908 (Resolved): ceph osd crush remove <name>
(11:34:50 AM) Kyle Bader: so it looks like ceph -h is missing crush rm
(11:34:54 AM) Kyle Bader: could we add
(11:...
Samuel Just
07:56 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
This task was superseded by task #2756, which provides a much more broad implementation using directly the available ... Joao Eduardo Luis
07:45 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Joao Eduardo Luis
07:44 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
Joao Eduardo Luis
07:44 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
Joao Eduardo Luis
07:43 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis

08/02/2012

04:51 PM Bug #2907: rados benchmarking tool which does not always do creates
Why not record the raw data and let other tools produce percentiles and other statistics? Josh Durgin
04:44 PM Bug #2907 (Resolved): rados benchmarking tool which does not always do creates
Features:
Pluggable distribution for choosing objects (zipifan?, random?, sequential?)
configurable numbe...
Samuel Just
04:09 PM Bug #2904 (Resolved): ceph-authtool: Adds keys on typos, expected error message
... Anonymous
03:50 PM CephFS Feature #2903 (Resolved): ceph-fuse: Support -o noallow_other
Currently, ceph-fuse hardcodes the -o allow_other option to FUSE_ARGS_INIT.
https://github.com/ceph/ceph/blob/5db3...
Anonymous
01:35 PM rgw Bug #2841 (Fix Under Review): rgw: fix usage trim
Yehuda Sadeh
01:31 PM rgw Bug #1855 (Resolved): Creation of a subuser that appears to own an s3 key is possible, and removi...
Commit 5db3a9e71c6b757660d0702efada40af6be63eb8 pushed. We disallow creating s3 key when subuser is created in order ... Yehuda Sadeh
01:27 PM devops Feature #2398: chef: external osd journal support
Shuffling old notes here:
see if "osd journal" was overridden in $cluster.conf; if yes, do not attempt discovery
...
Anonymous
12:59 PM rgw Feature #2869 (Fix Under Review): rgw: expand date format support
Yehuda Sadeh
12:59 PM rgw Bug #2877 (Fix Under Review): rgw: ETag parsing in complete multipart upload should xml decode ETag
Yehuda Sadeh
12:59 PM rgw Bug #2878 (Fix Under Review): rgw: chunked encoding for POST requests (e.g., complete multipart u...
Yehuda Sadeh
12:59 PM rgw Bug #2879 (Fix Under Review): rgw: xml parser doesn't work correctly with escape sequences
Yehuda Sadeh
11:50 AM Bug #2902 (Resolved): common lib tries to open literal ~/.ceph/ceph.conf
... Anonymous
11:48 AM Bug #2901 (Resolved): librados-config should not read ceph.conf
... Anonymous
11:38 AM Bug #2900 (Resolved): ceph fuse crashed
Logs: ubuntu@teuthology: /a/teuthology-2012-07-27_19:00:07-regression-master-testing-gcov/1581
Core file: /a/teuthol...
Tamilarasi muthamizhan
11:09 AM Bug #2897 (Resolved): ceph fuse error segfault
... Tamilarasi muthamizhan
11:03 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
10:52 AM Bug #2823: osd: out of order ACKs
Log location: ubuntu@teuthology:/a/teuthology-2012-08-01_19:00:04-regression-master-testing-gcov/4196
ubuntu@teuth...
Tamilarasi muthamizhan
10:50 AM Bug #2823: osd: out of order ACKs
(10:46:42 AM) tamil.muthamizhan@newdream.net: 4196: (1138s) collection:rados-thrash clusters:6-osd-3-machine.yaml fs:... Samuel Just
10:45 AM Bug #2823 (New): osd: out of order ACKs
Samuel Just
10:10 AM Bug #2887: pjd open/08.t failed test 2
recent logs: ubuntu@teuthology:/a/teuthology-2012-08-01_19:00:04-regression-master-testing-gcov/4126 Tamilarasi muthamizhan
10:00 AM Bug #2896 (Won't Fix): ceph pg dump has empty hb_out field
I was looking at "ceph pg dump" output today on a patched argonaut build and saw that while all the osd stat outputs ... Greg Farnum

08/01/2012

06:31 PM Bug #2895: cli: non-existent command returns confusing error message
...and ceph osd map rbd/rbd_info returns "unknown command map', which is just wrong;
the problem is the argument nee...
Dan Mick
05:33 PM Bug #2895 (Resolved): cli: non-existent command returns confusing error message
'ceph osd crush get' returns 'unknown command crush', instead of the full command.
http://www.spinics.net/lists/ce...
Josh Durgin
05:28 PM Feature #2894 (Resolved): cli: help command for ceph subsystems
To make commands and their usage discoverable and easy to look up, each subsystem could provide a help command
that ...
Josh Durgin
04:50 PM Bug #2887: pjd open/08.t failed test 2
Also, ubuntu@teuthology:/a/teuthology-2012-07-31_19:00:04-regression-master-testing-gcov/3654 Tamilarasi muthamizhan
04:47 PM Bug #2887: pjd open/08.t failed test 2
Also, ubuntu@teuthology:/a/teuthology-2012-08-01_00:01:38-regression-next-testing-basic/3784 Tamilarasi muthamizhan
04:43 PM Bug #2887: pjd open/08.t failed test 2
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-01_02:00:04-regression-testing-testing-basic/3909 Tamilarasi muthamizhan
01:51 PM Bug #2887 (Resolved): pjd open/08.t failed test 2
pjd open/08.t failed test 2 on both ceph-fuse and kclient.
Logs:- ubuntu@teuthology:/a/teuthology-2012-07-31_02:00...
Tamilarasi muthamizhan
04:34 PM devops Feature #2893 (Closed): crowbar: Nested virtualization for running OpenStack in vercoi vm
Anonymous
04:34 PM devops Feature #2893 (Closed): crowbar: Nested virtualization for running OpenStack in vercoi vm
Anonymous
04:30 PM Bug #2892 (Resolved): ceph health detail kills monitor
Executed the following:
ubuntu@burnupi30:~$ sudo ceph health detail
Wait awhile and ceph will start to output t...
JuanJose Galvez
03:10 PM Bug #2891 (Can't reproduce): heap profiler hangs when trying to start it up on the mon
We tried to turn heap profiling on the mon (congress), however the last thing we see in the logs is the message that ... Yehuda Sadeh
02:24 PM Bug #2890 (Resolved): monitor: "recognize" heap commands
The monitor accepts the standard heap profiler commands, but it tells the user it doesn't due to not setting return c... Greg Farnum
01:56 PM devops Feature #2889 (Closed): crowbar: script for easily packaging ceph barclamp
Anonymous
01:53 PM devops Feature #2888 (Closed): crowbar: Make VM disk IO cache writes for performance
Anonymous
01:38 PM devops Documentation #2886 (Rejected): doc: crush location tricks, ceph.conf, automatic host=
- how it autoupdates on osd startup
- how hosts won't migrate from container to another automatically
Anonymous
01:37 PM devops Feature #2885 (Resolved): doc: mon initial members requirements, functioning, admin steps to take
Anonymous
01:36 PM devops Feature #2884 (Rejected): doc: osd hotplugging
Anonymous
01:34 PM devops Feature #2883 (Rejected): chef: union lists and maps in env vs node ceph.conf json
As an admin, I want to specify in environment
"osd crush location": {
"datacenter": "westcoast",
}
an...
Anonymous
01:33 PM devops Documentation #2882 (Rejected): doc: chef environment ceph.conf content tricks
Anonymous
01:32 PM devops Feature #2881 (Rejected): doc: chef cookbook better README, internal structure, assumptions
Anonymous
01:29 PM devops Feature #2704 (Closed): sepia: Use ``names`` as resolver on plana, burnupi, vercoi
dnscache01 and dnscache02 are happily serving anything that uses DHCP to get its configuration.
There may be stati...
Anonymous
01:20 PM devops Feature #2880 (Rejected): chef: use get-or-create instead of get-or-create-key
ceph.git commit 4551808fa00b812fee6e0c196fd333eca0b06de9 adds "ceph auth get-or-create". Switch to using it in ceph-c... Anonymous
01:10 PM rgw Bug #2877: rgw: ETag parsing in complete multipart upload should xml decode ETag
There are two different issues here. The first one is that we don't remove the quotes when comparing the etags. The s... Yehuda Sadeh
12:55 PM rgw Bug #2879 (Resolved): rgw: xml parser doesn't work correctly with escape sequences
e.g., when providing data with "&quot;", the entity is getting clobbered. Yehuda Sadeh

07/31/2012

09:44 PM Bug #2873 (Fix Under Review): Stack trace thrown when using obsync
Dan Mick
06:18 PM Bug #2873: Stack trace thrown when using obsync
Figured out what the problem is, it appears that on L111, it should go from being... Matthew Wodrich
11:27 AM Bug #2873 (Resolved): Stack trace thrown when using obsync
... Matthew Wodrich
03:38 PM RADOS Bug #2874: apparent CRUSH mapping failure
check if setting the tunables all to 0 makes it go away Sage Weil
11:40 AM RADOS Bug #2874 (Resolved): apparent CRUSH mapping failure
While doing crowbar tests, I created a 3-OSD cluster (on separate VMs) that ended up with 6 degraded PGs.... Greg Farnum
03:36 PM rgw Bug #2504 (In Progress): rgw: use multiple notifications objects
Yehuda Sadeh
03:35 PM rgw Bug #2878 (Resolved): rgw: chunked encoding for POST requests (e.g., complete multipart uploads)
We shouldn't require length passed for these requests. Yehuda Sadeh
03:28 PM rgw Bug #2877 (Resolved): rgw: ETag parsing in complete multipart upload should xml decode ETag
Should be able to accept both:... Yehuda Sadeh
03:08 PM Bug #2876 (Resolved): mon: pg stuck peering (for example) broken?
... Sage Weil
02:01 PM Bug #2875 (Resolved): osd: pg stuck in GetLog
we weren't checking if newest_update_osd went down (it could be outside the prior set) Sage Weil
12:43 PM Linux kernel client Bug #2573 (Resolved): libceph: many "socket closed" messages
I was seeing this too, but with the latest code and all (knock wood) the races closed I'm not anymore. Going to opti... Sage Weil
11:49 AM Bug #2846 (Resolved): Malformed keyring file causes kernel null pointer deref on "rbd map"
userland fixes applied to stable, next.
thanks!
Sage Weil
11:42 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
kernel patch is in testing branch. Sage Weil
06:23 AM Subtask #2805 (Fix Under Review): mon: Single-Paxos: Sync: Create a test unit to verify the corre...
Joao Eduardo Luis
06:22 AM Subtask #2805: mon: Single-Paxos: Sync: Create a test unit to verify the correctness of the whole...
Currently available tests:
* Removing keys:
> * Using both the whole-space iterator and the whole-space snapshot ...
Joao Eduardo Luis

07/30/2012

06:46 PM Linux kernel client Bug #2868: kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
hoping this was the messenger locking stuff, let's see if it pops up again Sage Weil
06:45 PM rbd Bug #2715 (Resolved): krbd: spinlock wrong CPU
Sage Weil
06:45 PM Linux kernel client Bug #2867 (Resolved): kclient: crash from ffsb in con_work -> kernel_sendmsg
Sage Weil
06:45 PM Linux kernel client Bug #2392: First read of symlink after ceph filesystem mounted gives error
Sage Weil
04:52 PM rbd Bug #2872 (Resolved): RBD resize command allows image size -1
Ceph Version : 0.48
Resize rbd image to size -1 allows rbd image to be resized to 15 Exabytes, which is incorrect....
Tamilarasi muthamizhan
03:52 PM rbd Bug #2871 (Resolved): rbd export command hangs when trying to export an image of size 0 to a loca...
Ceph Version: 0.48
Steps followed:
1. create a rbd image of size 1000 mb in rbd pool
2. resize the rbd image t...
Tamilarasi muthamizhan
10:52 AM Bug #2866 (Resolved): osd: pg stuck with unfound
commit:9e5d4e61a73343397e67e918e87f1e6dcb8ec72d and commit:7b9d37c662313929b52011ddae47cc8abab99095 Sage Weil
10:51 AM Bug #2860 (Resolved): osd: stuck waiting for pg acting set to change
commit:bae837010b6b486011b06dd97664fb54c3f3ff44 and commit:96feca450c5505a06868bc012fe998a03371b77f Sage Weil
09:14 AM Bug #2819: krbd: lockup on large writes, msgr fault injection
i'm unable to reproduce this on a real kernel.. it only happens on uml.
here is a full backtrace:...
Sage Weil
08:01 AM Bug #2638 (Resolved): mon: make pool ops idempotent
Sage Weil
08:01 AM Bug #2830 (Duplicate): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
Sage Weil

07/29/2012

09:31 PM Linux kernel client Bug #2688 (Duplicate): lockup on ffsb + thrashing
Sage Weil
09:31 PM Linux kernel client Bug #2260 (Resolved): libceph: null pointer dereference at try_write+0x638+0xfb0
this is either #2867, or a similar issue that is since resolved. Sage Weil
09:28 PM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
Sage Weil
09:24 PM Linux kernel client Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg
*sigh of relief* Sage Weil
08:22 PM Linux kernel client Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg
This appears to be a regression, so it is effectively blocking sending the pull request to Linus. Sage Weil

07/28/2012

05:52 PM Feature #2280 (Resolved): improve gitbuilder infrastructure
Sage Weil
05:50 PM RADOS Subtask #2792 (Fix Under Review): mon: require tunable feature bit if current osdmap uses non-def...
Sage Weil
03:49 PM rgw Feature #2869 (Resolved): rgw: expand date format support
should be able to parse the following:
Sat, 28 Jul 2012 20:35:55 UTC
Which uses UTC instead of GMT.
Yehuda Sadeh
03:30 PM Feature #2477 (Fix Under Review): rados bench cleanup
Sage Weil
03:30 PM Feature #1783 (Fix Under Review): osd: scrub incrementally across hash range using MOSDPGScan
Sage Weil
07:37 AM Linux kernel client Bug #2868 (Resolved): kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
... Sage Weil
 

Also available in: Atom