Project

General

Profile

Activity

From 07/12/2012 to 08/10/2012

08/10/2012

08:31 PM Bug #2919 (Fix Under Review): ceph kernel module looks for :/ in path, but / stripped by precise ...
Sage Weil
08:29 PM Bug #2938: ceph-osd --mkfs failure to create journal is logged with dout(0), probably should be derr
yeah, just change it to derr Sage Weil
06:34 PM Bug #2938 (Resolved): ceph-osd --mkfs failure to create journal is logged with dout(0), probably ...
A customer mistakenly named a directory as his osd journal location; the failure printed to his terminal with no hint... Dan Mick
08:25 PM Linux kernel client Bug #2801 (Resolved): msgr crash in ceph_msg_new
Sage Weil
08:24 PM Linux kernel client Bug #2392 (Resolved): First read of symlink after ceph filesystem mounted gives error
Sage Weil
05:39 PM Revision d4849f2f (ceph): rgw: fix usage trim call encoding
Fixes: #2841.
Usage trim operation was encoding the wrong op structure (usage read).
Since the structures somewhat ov...
Yehuda Sadeh
05:39 PM Revision 515952d0 (ceph): cls_rgw: fix rgw_cls_usage_log_trim_op encode/decode
It was not encoding user, adding that and reset version
compatibility.
This changes affects command interface, makes ...
Yehuda Sadeh
05:39 PM Revision 2e77130d (ceph): rgw: expand date format support
Relaxing the date format parsing function to allow UTC
instead of GMT.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:39 PM Revision 14fa77d9 (ceph): rgw: complete multipart upload can handle chunked encoding
Fixes: #2878
We now allow complete multipart upload to use chunked encoding
when sending request data. With chunked e...
Yehuda Sadeh
05:39 PM Revision a06f7783 (ceph): rgw_xml: xml_handle_data() appends data string
Fixes: #2879.
xml_handle_data() appends data to the object instead of just
replacing it. Parsed data can arrive in pi...
Yehuda Sadeh
05:39 PM Revision a8b224b9 (ceph): rgw: ETag is unquoted in multipart upload complete
Fixes #2877.
Removing quotes from ETag before comparing it to what we
have when completing a multipart upload.
Backp...
Yehuda Sadeh
04:26 PM Bug #2887: pjd open/08.t failed test 2
ubuntu@teuthology:/a/teuthology-2012-08-09_00:00:04-regression-next-testing-basic/5752 Tamilarasi muthamizhan
03:59 PM Bug #2887: pjd open/08.t failed test 2
ubuntu@teuthology:/a/teuthology-2012-08-09_02:00:13-regression-testing-testing-basic/5857 Tamilarasi muthamizhan
03:59 PM Revision 98286b49 (ceph): MonMap: return error on failure in build_initial
If mon_host fails to parse, return an error instead of success.
This avoids failing later on an assert monmap.size() ...
Josh Durgin
03:59 PM Revision 5c116bf8 (ceph): addr_parsing: report correct error message
getaddrinfo uses its return code to report failures.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
03:58 PM Revision 22259c6e (ceph): MonMap: return error on failure in build_initial
If mon_host fails to parse, return an error instead of success.
This avoids failing later on an assert monmap.size() ...
Josh Durgin
03:58 PM Revision 49b2c7b5 (ceph): addr_parsing: report correct error message
getaddrinfo uses its return code to report failures.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
01:59 PM rbd Bug #2937 (Duplicate): btrfs filesystem on rbd device kernel BUG writing large file
Writing a large file with dd on btrfs filesystem mounted from rbd device causes kernel bug
Stock kernel 3.5.1, con...
Bartek Kania
01:48 PM Linux kernel client Bug #2936 (Resolved): Remounting cephfs with non-existing path causes kernel panic
Steps to reproduce:
First mount the root somewhere...
Bartek Kania
10:38 AM Bug #2913 (Resolved): monclient: asserts when no monitor addresses found due to dns failure
Fortunately I was wrong about the string splitting - that was just a confusing message from the parsing stage.
The...
Josh Durgin
10:09 AM rgw Feature #771: rgw: POST
Support the S3 POST object operation referenced in
http://docs.amazonwebservices.com/AmazonS3/latest/API/RESTObje...
caleb miles
09:40 AM rgw Bug #2935 (Resolved): rgw: radosgw-admin bucket link clobbers index
radosgw-admin bucket unlink, then radosgw-admin bucket link overrides the bucket index, so objects cannot be listed a... Yehuda Sadeh

08/09/2012

11:41 PM Revision 1fd7acd8 (ceph): rbd: fix test compilation on 32-bit
32-bit can't implicitly convert size_t or unsigned to uint64_t
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
10:12 PM Revision be6b5e1b (ceph): test_librbd_fsx: fix warning
fstat was included implicitly before.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:53 PM Revision 4d71ca63 (ceph): rbd: fix cli tests
add flatten, snap protect, and snap unprotect
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
08:54 PM Revision 677934b1 (ceph): rgw: modify some error messages
Make them more correct, clearer.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:39 PM Revision 86be6fe7 (ceph): Merge remote-tracking branch 'origin/wip-rgw-admin'
Yehuda Sadeh
08:33 PM Revision b28db08e (ceph): rgw: use multiple notification objects
Issue #2504. This makes us listen and notify on more than
a single object, which reduces the contention of cache
noti...
Yehuda Sadeh
08:27 PM Revision b15d6392 (ceph): radosgw-admin.rst: Misc doccumentation update.
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
08:27 PM Revision 45f7f060 (ceph): rgw_admin.cc: Allow removal of a user's buckets during user removal.
Allow the buckets, and any child objects, of a user to be deleted when the
user is deleted through radosgw-admin. In ...
caleb miles
06:14 PM Revision c77f0fb7 (ceph): rbd: add snap [un]protect commands
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
06:14 PM Revision cd5d7241 (ceph): librbd: fix memory leak on error in clone
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
06:14 PM Revision 53d77502 (ceph): librados: add method to get the fsid of a cluster
This will be used by OpenStack to check whether two components
have access to the same backend cluster.
Signed-off-b...
Josh Durgin
06:14 PM Revision 8d5f1e93 (ceph): librbd: add methods for protecting/unprotecting snapshots
Unprotect will be more useful once integrated with the rbd_children object.
Right now we just check that a snapshot i...
Josh Durgin
04:55 PM Revision cc8eac24 (ceph): rgw_admin.cc: Allow for deletion of objects through radosgw-admin.
Allow objects to be deleted through radosgw-admin with an optional flag
to delete the tail of that object during the ...
caleb miles
04:42 PM Revision 273a43ed (ceph): Flush data to temp file before reading it in another process.
Tommi Virtanen
04:42 PM Revision 8aaf21d5 (ceph): Oops tempfile now gives us file objects not fds.
Tommi Virtanen
04:21 PM Revision f9359f0c (ceph): doc: New example usage.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:06 PM Feature #2934: crush: create a visualizer for crush maps
'ceph osd tree' provides a good start on the command line, but it'd be nice to have that in the crushtool as well if ... Josh Durgin
04:04 PM Feature #2934 (New): crush: create a visualizer for crush maps
The language used in crush maps is very well defined and
hierarchical. I don't know how to do this sort of thing,
...
Alex Elder
03:55 PM rbd Bug #2933 (Resolved): rbd: bio_pair leak in bio_chain_clone()
Guangliang Zhao <gzhao@suse.com> pointed out this problem on the
mailing list. Here's the latest edition of his pro...
Alex Elder
02:18 PM devops Feature #2932 (Rejected): chef: logstash integration
Anonymous
02:18 PM devops Feature #2931 (Rejected): chef: StatsD integration
Anonymous
01:54 PM rgw Feature #2499 (Resolved): rgw: ability to delete users without first emptying and deleting all bu...
done, commit:45f7f0602c90073af27041f92166724ca9472197. Yehuda Sadeh
01:53 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
object removal done, commit:cc8eac2427c745e154ad40eeb84ef28dbed99d36
bucket removal done, commit:45f7f0602c90073af27...
Yehuda Sadeh
01:32 PM rgw Bug #2504 (Resolved): rgw: use multiple notifications objects
Done, commit:b28db08ea8b84ec9f1d2df88ac4edd6aea0ba7d4 Yehuda Sadeh
12:29 PM Bug #2924 (Resolved): doc: Adjust for mon. key being in external keyring
This doc is outdated
http://ceph.com/docs/master/ops/manage/grow/mon/#adding-a-monitor
as per
http://thread.gmane....
Anonymous
11:13 AM CephFS Bug #2444: null pointer deference in ceph_d_prune inside kvm
same bug here with Ceph 0.49 on Ubuntu 12.04 LTS (GNU/Linux 3.2.0-27-generic x86_64) Alexandre Dupouy
10:58 AM rgw Feature #2923 (Resolved): rgw: non hard-coded pool names
Don't have pool names hard coded, make them configurable. Yehuda Sadeh
10:44 AM rgw Bug #2665 (Resolved): rest-bench hangs periodically
This was fixed a while ago. Yehuda Sadeh

08/08/2012

11:09 PM Revision 6bc1067f (ceph): rgw: fix usage trim call encoding
Fixes: #2841.
Usage trim operation was encoding the wrong op structure (usage read).
Since the structures somewhat ov...
Yehuda Sadeh
11:05 PM Revision 04a0eacd (ceph): cls_rgw: fix rgw_cls_usage_log_trim_op encode/decode
It was not encoding user, adding that and reset version
compatibility.
This changes affects command interface, makes ...
Yehuda Sadeh
10:44 PM Revision 3499c0e5 (ceph): cls_rbd: add methods for getting/setting protection status
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
10:37 PM Revision 074c3c0f (ceph): rgw: expand date format support
Relaxing the date format parsing function to allow UTC
instead of GMT.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
10:34 PM Revision d39ea1d4 (ceph): rgw: complete multipart upload can handle chunked encoding
Fixes: #2878
We now allow complete multipart upload to use chunked encoding
when sending request data. With chunked e...
Yehuda Sadeh
10:34 PM Revision 03b787e0 (ceph): rgw_xml: xml_handle_data() appends data string
Fixes: #2879.
xml_handle_data() appends data to the object instead of just
replacing it. Parsed data can arrive in pi...
Yehuda Sadeh
10:34 PM Revision 3809e344 (ceph): rgw: ETag is unquoted in multipart upload complete
Fixes #2877.
Removing quotes from ETag before comparing it to what we
have when completing a multipart upload.
Backp...
Yehuda Sadeh
10:06 PM Revision 52f03dcc (ceph): doc: Added debug and logging reference.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:48 PM Revision 99e99758 (ceph): In teuthology-worker, shuffle the child stdout/stderr into our log.
Otherwise, child can suffer a failure that does not get logged by
it's own exception handling machinery, and we have ...
Tommi Virtanen
09:45 PM Revision 05007f7e (ceph): Minimize scope of try-except.
os.write and list.append won't raise CalledProcessError, and now
we don't need to try to contain them for temp file c...
Tommi Virtanen
09:44 PM Revision 4b9e1762 (ceph): Use tempfile.NamedTemporaryFile instead of mkstemp.
Simpler code, no manual cleanup needed. We see a littering of
zero-length temp files from teuthology-worker, and this...
Tommi Virtanen
09:07 PM Revision d78dfe5d (ceph): mkcephfs: use default osd_data, _journal values
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
09:07 PM Revision 3c90ff4e (ceph): mkcephfs: use new default keyring locations
The ceph-conf command only parses the conf; it does not apply default
config values. This breaks mkcephfs if values ...
Sage Weil
09:01 PM Revision 7084f295 (ceph): mkcephfs: use default osd_data, _journal values
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
09:01 PM Revision 96b1a496 (ceph): mkcephfs: use new default keyring locations
The ceph-conf command only parses the conf; it does not apply default
config values. This breaks mkcephfs if values ...
Sage Weil
08:05 PM Revision 18391059 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
04:58 PM Bug #2922 (Resolved): mkcephfs fails with error "read: arg count"
Branch: wip-auth
ceph version 0.49-306-gfc3681f (commit:fc3681f59c4f49298f5a7a5172c30be63068c330)
tamil@tamil-Vir...
Tamilarasi muthamizhan
04:08 PM rgw Bug #2841 (Resolved): rgw: fix usage trim
Fixed, commit:6bc1067fc878cbfb6761146cb154c2985c9d9bd7 and commit:04a0eacd92b0c923cb9d1efc7d751a05d544dc85 Yehuda Sadeh
03:35 PM rgw Feature #2869 (Resolved): rgw: expand date format support
Fixed, commit:074c3c0fe0c005e54f4776c60463a16305dbab10 Yehuda Sadeh
03:34 PM rgw Bug #2879 (Resolved): rgw: xml parser doesn't work correctly with escape sequences
Fixed, commit:03b787e0ee1d94e054cfb17059e5e108a7162d7b Yehuda Sadeh
03:34 PM rgw Bug #2878 (Resolved): rgw: chunked encoding for POST requests (e.g., complete multipart uploads)
Fixed, commit:d39ea1d4b51afdbbd51254ff41c8285e8f5697df. Yehuda Sadeh
03:33 PM rgw Bug #2877 (Resolved): rgw: ETag parsing in complete multipart upload should xml decode ETag
Fixed, commit:3809e34448e47d7baa02d7a0f9240494aba0e337. Yehuda Sadeh
03:15 PM Revision e4e4882d (ceph): mon: differentiate cluster and client auth requirements
- require 'cluster required' of osds, mdss, and other mons
- require 'service required' of clients
Signed-off-by: Sa...
Sage Weil
03:15 PM Revision e5368891 (ceph): monclient: differentiate clients vs daemons
If we are a daemon, require 'auth cluster required'. Otherwise, use the
client requirement.
As always, fall back to...
Sage Weil
03:15 PM Revision 5e13fd67 (ceph): keyring: check client requirement list too when loading keyring
Check client requirements when loading a keyring from the cct. This is
only used to succeed when no keyring is found...
Sage Weil
03:15 PM Revision ac9d429d (ceph): keyring: simplify from_ceph_context() arguments
Allocate the KeyRing in the caller, and just pass a pointer.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:15 PM Revision 35e79f23 (ceph): keyring: simplify from_ceph_context()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:15 PM Revision 69130d7c (ceph): monclient: move keyring requirement where it belongs
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:15 PM Revision ece78c86 (ceph): doc: some basic cephx documentation
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:15 PM Revision c03f744e (ceph): keyring: make from_ceph_context() a member
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:09 PM Revision d5704bd7 (ceph): buffer: make release() private
This should only be called by ~ptr or when we are replacing the current
target with something new. It is not suitabl...
Sage Weil
02:06 PM Bug #2845 (Resolved): mkcephfs hasn't learned about new default keyring locations in argonaut
fixed, commit:96b1a496cdfda34a5efdb6686becf0d2e7e3a1c0 Sage Weil
12:48 PM Bug #2875 (Resolved): osd: pg stuck in GetLog
Sage Weil
12:48 PM Bug #2834 (Resolved): osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == ...
hasn't come up recently Sage Weil
11:11 AM Bug #2887: pjd open/08.t failed test 2
Logs: ubuntu@teuthology:/a/teuthology-2012-08-06_00:00:02-regression-next-testing-basic/5012 Tamilarasi muthamizhan
10:03 AM Bug #2887: pjd open/08.t failed test 2
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-08_00:00:07-regression-next-testing-basic/5542 Tamilarasi muthamizhan
10:48 AM Bug #2761: osd: failed to recover before timeout expired
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-08_00:00:07-regression-next-testing-basic/5616... Tamilarasi muthamizhan
10:33 AM rgw Bug #2915: rgw: copy of large object times out
workaround: bump up fastcgi timeout Yehuda Sadeh
10:11 AM Feature #2921 (Rejected): doc: Provide epub docs
Sphinx supports it. Current output seems to only include the top-level file and indexes, skipping most of the content... Anonymous
10:04 AM Feature #2920 (Rejected): doc: Provide PDF docs
Sphinx supports it, but we'd need to fix other parts of our toolchain.
To see where we are:
1. add this patch:
...
Anonymous
02:33 AM Revision bf4c5e7e (ceph): mds, osd: require auth_service_required, if defined
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil

08/07/2012

05:56 PM Bug #2919 (Resolved): ceph kernel module looks for :/ in path, but / stripped by precise mountall
I think this is really a bug in mountall (see https://bugs.launchpad.net/ubuntu/+source/mountall/+bug/809221), but it... Dan Mick
05:42 PM Feature #2918 (Resolved): OSD ID numbers determine OSD count and thus default pg_cnt
An IRC user (maelfius) had a problem with a 1-monitor, 3-OSD cluster; the monitor chewed up all memory before it star... Dan Mick
12:50 PM rgw Bug #2916: radosgw does not check command line options for correctness
that's a generic ceph command lines parsing issue Yehuda Sadeh
11:52 AM rgw Bug #2916 (Resolved): radosgw does not check command line options for correctness
It is possible to pass any command line option to radosgw without error. For example
./radosgw -c /home/caleb/cep...
caleb miles
12:49 PM rgw Bug #2915: rgw: copy of large object times out
The problem is that apache is timing out. We should return an early 200 and encode any error in the response code, as... Yehuda Sadeh
10:34 AM rgw Bug #2915 (Resolved): rgw: copy of large object times out
Yehuda Sadeh
09:26 AM devops Feature #2808 (Rejected): crowbar: upgrade to fred3 (get bind bug fix)
Looks like upstream Crowbar is still buggy with regard to DNS. Anonymous
06:37 AM Bug #2913: monclient: asserts when no monitor addresses found due to dns failure
I am using 0.48argonaut-1precise. Jeff Strunk

08/06/2012

04:22 PM Bug #2914 (Resolved): librados set_complete_callback, set_safe_callback clobber each other's argu...
Samuel Just
02:46 PM Bug #2913: monclient: asserts when no monitor addresses found due to dns failure
hmm, looking closer that's a second bug - it's not splitting 'thinkmate3:6789;thinkmate4:6789' into separate addresse... Josh Durgin
02:28 PM Bug #2913: monclient: asserts when no monitor addresses found due to dns failure
I'm not so sure this is a DNS issue. Here is how name service is set up on my ceph/kvm test cluster.
On each node,...
Jeff Strunk
01:58 PM Bug #2913 (Resolved): monclient: asserts when no monitor addresses found due to dns failure
This should be an error returned up to the user, not an assert.
From https://www.redhat.com/archives/libvirt-users...
Josh Durgin
02:16 PM Bug #2887: pjd open/08.t failed test 2
recent logs: ubuntu@teuthology:/a/teuthology-2012-08-06_02:00:02-regression-testing-testing-basic/5117 Tamilarasi muthamizhan
09:53 AM Feature #2911 (Duplicate): osd: Restrict recovery when the OSD full list is nonempty
See the conversation at http://www.spinics.net/lists/ceph-devel/msg08010.html
It would be nice if we could somehow...
Greg Farnum

08/04/2012

01:49 AM Revision 4fd393f3 (ceph): librbd: fix bounds of memcpy in info()
Caught by valgrind.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
01:49 AM Revision d48d1ffb (ceph): test_librbd_fsx: add cloning
5% of the time a clone op is chosen, create a snapshot, save the
current contents to a file, and continue operating o...
Josh Durgin
01:48 AM Revision 90dc5650 (ceph): librbd: implement read/write from parent images
Put the completion handling logic into new subclases of
librbd::AioRequest, so the caching/non-caching paths can shar...
Josh Durgin
01:48 AM Revision beccac93 (ceph): librbd: don't open parent again during get_parent_info
The parent was already opened when the original ImageCtx
was initialized. We just need to lookup the pool and image
n...
Josh Durgin
01:48 AM Revision 12c9a7d5 (ceph): test_rbd.py: refactor cloning tests
Move into a separate class that requires layering to be enabled,
so the common step of creating and deleting a clone ...
Josh Durgin
01:48 AM Revision 965271db (ceph): test_librbd: add a simple io to clone test
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
01:48 AM Revision 53947ed8 (ceph): librbd: check new snapc for validity, not the old one
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
01:48 AM Revision 9bf3d486 (ceph): librbd: move flush on new snap outside of snap_lock
snap_lock needs to be taken during writeback.
This is still protected by md_lock. The altered snapc doesn't
affect in...
Josh Durgin
01:48 AM Revision 12e20709 (ceph): librbd: close parent after shutting down cache
Requests that haven't been flushed yet might need the parent still.
Signed-off-by: Josh Durgin <josh.durgin@inktank....
Josh Durgin
01:48 AM Revision 745783c1 (ceph): librbd: expose feature bits through librbd.h
Move them to a separate header so we don't expose
rbd_types.h to the librbd user.
Signed-off-by: Josh Durgin <josh.d...
Josh Durgin
12:59 AM Revision 43291951 (ceph): rados.py: add binding for rados_trunc
Fixes: #2909
Signed-off-by: Evan Felix <evan.felix@pnnl.gov>
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Evan Felix

08/03/2012

06:31 PM Bug #2909 (Resolved): the rados_trunc function did not get implemented in rados.py Python
Thanks! Added in commit:43291951fad241a6d3f8b8daa37d3665c9d842d6, with a simple test and spacing normalized to the re... Josh Durgin
03:58 PM Bug #2909: the rados_trunc function did not get implemented in rados.py Python
Yes you may. Evan Felix
03:35 PM Bug #2909: the rados_trunc function did not get implemented in rados.py Python
That looks good to me. Can I add your signed-off-by to the patch? Josh Durgin
01:53 PM Bug #2909 (Resolved): the rados_trunc function did not get implemented in rados.py Python
This code seems to work in the Ioctx class:
def trunc(self,key,size):
self.require_ioctx_open()
...
Evan Felix
02:23 PM devops Feature #2910: crowbar: Use JBOD mode for ceph-osd
There's a map in the deployer object from role name to BIOS and RAID configuration to set on the node. We can add cep... Anonymous
02:22 PM devops Feature #2910 (Closed): crowbar: Use JBOD mode for ceph-osd
Anonymous
11:38 AM Bug #2908 (Resolved): ceph osd crush remove <name>
(11:34:50 AM) Kyle Bader: so it looks like ceph -h is missing crush rm
(11:34:54 AM) Kyle Bader: could we add
(11:...
Samuel Just
07:56 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
This task was superseded by task #2756, which provides a much more broad implementation using directly the available ... Joao Eduardo Luis
07:45 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Joao Eduardo Luis
07:44 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
Joao Eduardo Luis
07:44 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
Joao Eduardo Luis
07:43 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis

08/02/2012

11:03 PM Revision 3671a37e (ceph): doc: Explicitly state mount.ceph two first arguments are fixed.
I've hurt myself too many times with this.
Signed-off-by: Tommi Virtanen <tv@inktank.com>
Tommi Virtanen
11:03 PM Revision 17ad4e18 (ceph): doc: mount.ceph is also needed for cephx.
Signed-off-by: Tommi Virtanen <tv@inktank.com> Tommi Virtanen
10:56 PM Revision c961a20e (ceph): doc: Fix toctree structure for man obsync(1).
Signed-off-by: Tommi Virtanen <tv@inktank.com> Tommi Virtanen
08:31 PM Revision 5db3a9e7 (ceph): rgw_admin.cc: Disallow addition of S3 keys with subuser creation
Fixes: #1855
It is no longer possible to create a subuser and new S3 key associated
with that user through the radosg...
caleb miles
08:02 PM Revision 4e40a784 (ceph): ceph-authtool: Fix usage, it's --print-key not --print.
Signed-off-by: Tommi Virtanen <tv@inktank.com> Tommi Virtanen
06:14 PM Revision 8665bdc1 (ceph): task: scrub OSDs periodically
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
06:14 PM Revision 3b85b231 (ceph): task: verify scrub detects files whose contents changed
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
06:00 PM Revision 25de5e59 (ceph): Revert "osd: peering: detect when log source osd goes down"
This reverts commit 0d3d75ec0d17e2f6bc5d42058f51eb2ff19ffb71.
Accidental double push.
Samuel Just
04:51 PM Bug #2907: rados benchmarking tool which does not always do creates
Why not record the raw data and let other tools produce percentiles and other statistics? Josh Durgin
04:44 PM Bug #2907 (Resolved): rados benchmarking tool which does not always do creates
Features:
Pluggable distribution for choosing objects (zipifan?, random?, sequential?)
configurable numbe...
Samuel Just
04:09 PM Bug #2904 (Resolved): ceph-authtool: Adds keys on typos, expected error message
... Anonymous
03:50 PM CephFS Feature #2903 (Resolved): ceph-fuse: Support -o noallow_other
Currently, ceph-fuse hardcodes the -o allow_other option to FUSE_ARGS_INIT.
https://github.com/ceph/ceph/blob/5db3...
Anonymous
03:35 PM Revision 93cbab73 (ceph): doc: Simplify submodules explanation.
``git clone --recursive`` does ``init`` & ``update`` for us. Also
avoids incorrect language; there never were submodu...
Tommi Virtanen
03:35 PM Revision 48de9b5b (ceph): doc: Say what to do if submodules are out of date.
Signed-off-by: Tommi Virtanen <tv@inktank.com> Tommi Virtanen
03:35 PM Revision 203dffad (ceph): doc: cd to repository before running git status.
Signed-off-by: Tommi Virtanen <tv@inktank.com> Tommi Virtanen
03:33 PM Revision 6af560de (ceph): doc: Correct Git URL for clone
Using git@github.com would mean SSHing in, and your average open
source consumer probably won't have a Github account...
Wido den Hollander
01:35 PM rgw Bug #2841 (Fix Under Review): rgw: fix usage trim
Yehuda Sadeh
01:31 PM rgw Bug #1855 (Resolved): Creation of a subuser that appears to own an s3 key is possible, and removi...
Commit 5db3a9e71c6b757660d0702efada40af6be63eb8 pushed. We disallow creating s3 key when subuser is created in order ... Yehuda Sadeh
01:27 PM devops Feature #2398: chef: external osd journal support
Shuffling old notes here:
see if "osd journal" was overridden in $cluster.conf; if yes, do not attempt discovery
...
Anonymous
12:59 PM rgw Feature #2869 (Fix Under Review): rgw: expand date format support
Yehuda Sadeh
12:59 PM rgw Bug #2877 (Fix Under Review): rgw: ETag parsing in complete multipart upload should xml decode ETag
Yehuda Sadeh
12:59 PM rgw Bug #2878 (Fix Under Review): rgw: chunked encoding for POST requests (e.g., complete multipart u...
Yehuda Sadeh
12:59 PM rgw Bug #2879 (Fix Under Review): rgw: xml parser doesn't work correctly with escape sequences
Yehuda Sadeh
11:50 AM Bug #2902 (Resolved): common lib tries to open literal ~/.ceph/ceph.conf
... Anonymous
11:48 AM Bug #2901 (Resolved): librados-config should not read ceph.conf
... Anonymous
11:38 AM Bug #2900 (Resolved): ceph fuse crashed
Logs: ubuntu@teuthology: /a/teuthology-2012-07-27_19:00:07-regression-master-testing-gcov/1581
Core file: /a/teuthol...
Tamilarasi muthamizhan
11:09 AM Bug #2897 (Resolved): ceph fuse error segfault
... Tamilarasi muthamizhan
11:03 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
10:52 AM Bug #2823: osd: out of order ACKs
Log location: ubuntu@teuthology:/a/teuthology-2012-08-01_19:00:04-regression-master-testing-gcov/4196
ubuntu@teuth...
Tamilarasi muthamizhan
10:50 AM Bug #2823: osd: out of order ACKs
(10:46:42 AM) tamil.muthamizhan@newdream.net: 4196: (1138s) collection:rados-thrash clusters:6-osd-3-machine.yaml fs:... Samuel Just
10:45 AM Bug #2823 (New): osd: out of order ACKs
Samuel Just
10:10 AM Bug #2887: pjd open/08.t failed test 2
recent logs: ubuntu@teuthology:/a/teuthology-2012-08-01_19:00:04-regression-master-testing-gcov/4126 Tamilarasi muthamizhan
10:00 AM Bug #2896 (Won't Fix): ceph pg dump has empty hb_out field
I was looking at "ceph pg dump" output today on a patched argonaut build and saw that while all the osd stat outputs ... Greg Farnum
12:17 AM Revision 0d3d75ec (ceph): osd: peering: detect when log source osd goes down
The Peering state has a generic check based on the prior set osds that
will restart peering if one of them goes down ...
Sage Weil

08/01/2012

11:34 PM Revision 4bd466d6 (ceph): osd: peering: detect when log source osd goes down
The Peering state has a generic check based on the prior set osds that
will restart peering if one of them goes down ...
Sage Weil
06:31 PM Bug #2895: cli: non-existent command returns confusing error message
...and ceph osd map rbd/rbd_info returns "unknown command map', which is just wrong;
the problem is the argument nee...
Dan Mick
05:33 PM Bug #2895 (Resolved): cli: non-existent command returns confusing error message
'ceph osd crush get' returns 'unknown command crush', instead of the full command.
http://www.spinics.net/lists/ce...
Josh Durgin
05:28 PM Feature #2894 (Resolved): cli: help command for ceph subsystems
To make commands and their usage discoverable and easy to look up, each subsystem could provide a help command
that ...
Josh Durgin
04:55 PM Revision ca2c3813 (ceph): osd: peering: detect when log source osd goes down
The Peering state has a generic check based on the prior set osds that
will restart peering if one of them goes down ...
Sage Weil
04:50 PM Bug #2887: pjd open/08.t failed test 2
Also, ubuntu@teuthology:/a/teuthology-2012-07-31_19:00:04-regression-master-testing-gcov/3654 Tamilarasi muthamizhan
04:47 PM Bug #2887: pjd open/08.t failed test 2
Also, ubuntu@teuthology:/a/teuthology-2012-08-01_00:01:38-regression-next-testing-basic/3784 Tamilarasi muthamizhan
04:43 PM Bug #2887: pjd open/08.t failed test 2
Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-01_02:00:04-regression-testing-testing-basic/3909 Tamilarasi muthamizhan
01:51 PM Bug #2887 (Resolved): pjd open/08.t failed test 2
pjd open/08.t failed test 2 on both ceph-fuse and kclient.
Logs:- ubuntu@teuthology:/a/teuthology-2012-07-31_02:00...
Tamilarasi muthamizhan
04:34 PM devops Feature #2893 (Closed): crowbar: Nested virtualization for running OpenStack in vercoi vm
Anonymous
04:34 PM devops Feature #2893 (Closed): crowbar: Nested virtualization for running OpenStack in vercoi vm
Anonymous
04:30 PM Bug #2892 (Resolved): ceph health detail kills monitor
Executed the following:
ubuntu@burnupi30:~$ sudo ceph health detail
Wait awhile and ceph will start to output t...
JuanJose Galvez
03:10 PM Bug #2891 (Can't reproduce): heap profiler hangs when trying to start it up on the mon
We tried to turn heap profiling on the mon (congress), however the last thing we see in the logs is the message that ... Yehuda Sadeh
02:24 PM Bug #2890 (Resolved): monitor: "recognize" heap commands
The monitor accepts the standard heap profiler commands, but it tells the user it doesn't due to not setting return c... Greg Farnum
01:56 PM devops Feature #2889 (Closed): crowbar: script for easily packaging ceph barclamp
Anonymous
01:53 PM devops Feature #2888 (Closed): crowbar: Make VM disk IO cache writes for performance
Anonymous
01:38 PM devops Documentation #2886 (Rejected): doc: crush location tricks, ceph.conf, automatic host=
- how it autoupdates on osd startup
- how hosts won't migrate from container to another automatically
Anonymous
01:37 PM devops Feature #2885 (Resolved): doc: mon initial members requirements, functioning, admin steps to take
Anonymous
01:36 PM devops Feature #2884 (Rejected): doc: osd hotplugging
Anonymous
01:34 PM devops Feature #2883 (Rejected): chef: union lists and maps in env vs node ceph.conf json
As an admin, I want to specify in environment
"osd crush location": {
"datacenter": "westcoast",
}
an...
Anonymous
01:33 PM devops Documentation #2882 (Rejected): doc: chef environment ceph.conf content tricks
Anonymous
01:32 PM devops Feature #2881 (Rejected): doc: chef cookbook better README, internal structure, assumptions
Anonymous
01:29 PM devops Feature #2704 (Closed): sepia: Use ``names`` as resolver on plana, burnupi, vercoi
dnscache01 and dnscache02 are happily serving anything that uses DHCP to get its configuration.
There may be stati...
Anonymous
01:20 PM devops Feature #2880 (Rejected): chef: use get-or-create instead of get-or-create-key
ceph.git commit 4551808fa00b812fee6e0c196fd333eca0b06de9 adds "ceph auth get-or-create". Switch to using it in ceph-c... Anonymous
01:10 PM rgw Bug #2877: rgw: ETag parsing in complete multipart upload should xml decode ETag
There are two different issues here. The first one is that we don't remove the quotes when comparing the etags. The s... Yehuda Sadeh
12:55 PM rgw Bug #2879 (Resolved): rgw: xml parser doesn't work correctly with escape sequences
e.g., when providing data with "&quot;", the entity is getting clobbered. Yehuda Sadeh
03:37 AM Revision c2ff66eb (ceph): crank up pjd debugging
Sage Weil

07/31/2012

10:49 PM Revision bb6e0d0e (ceph): wireshark: update patch
Update to latest source tree (svn 43768).
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
09:44 PM Bug #2873 (Fix Under Review): Stack trace thrown when using obsync
Dan Mick
06:18 PM Bug #2873: Stack trace thrown when using obsync
Figured out what the problem is, it appears that on L111, it should go from being... Matthew Wodrich
11:27 AM Bug #2873 (Resolved): Stack trace thrown when using obsync
... Matthew Wodrich
09:26 PM Revision deec81b4 (ceph): ReplicatedPG: clear waiting_for_ack when we send the commit
Otherwise, we might send the ack anyway later, after a subsequent
commit is sent resulting in an out-of-order op.
Th...
Samuel Just
08:51 PM Revision e0e72d78 (ceph): Merge remote-tracking branch 'upstream/wip-leveldb-iterators'
Samuel Just
06:55 PM Revision 87defa88 (ceph): rbd: fix off-by-one error in key name
Fixes: #2846
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Sylvain Munaut
06:54 PM Revision 37d5b462 (ceph): secret: return error on empty secret
Signed-off-by: Sylvain Munaut <tnt@246tNt.com> Sylvain Munaut
06:48 PM Revision f22b95da (ceph): rbd: fix off-by-one error in key name
Fixes: #2846
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
Sylvain Munaut
06:48 PM Revision e775ce52 (ceph): secret: return error on empty secret
Signed-off-by: Sylvain Munaut <tnt@246tNt.com> Sylvain Munaut
04:55 PM Revision cd0b5278 (ceph): separate regression suite into topical categories rados, rbd, fs
Sage Weil
03:38 PM RADOS Bug #2874: apparent CRUSH mapping failure
check if setting the tunables all to 0 makes it go away Sage Weil
11:40 AM RADOS Bug #2874 (Resolved): apparent CRUSH mapping failure
While doing crowbar tests, I created a 3-OSD cluster (on separate VMs) that ended up with 6 degraded PGs.... Greg Farnum
03:36 PM rgw Bug #2504 (In Progress): rgw: use multiple notifications objects
Yehuda Sadeh
03:35 PM rgw Bug #2878 (Resolved): rgw: chunked encoding for POST requests (e.g., complete multipart uploads)
We shouldn't require length passed for these requests. Yehuda Sadeh
03:28 PM rgw Bug #2877 (Resolved): rgw: ETag parsing in complete multipart upload should xml decode ETag
Should be able to accept both:... Yehuda Sadeh
03:08 PM Bug #2876 (Resolved): mon: pg stuck peering (for example) broken?
... Sage Weil
02:01 PM Bug #2875 (Resolved): osd: pg stuck in GetLog
we weren't checking if newest_update_osd went down (it could be outside the prior set) Sage Weil
12:43 PM Linux kernel client Bug #2573 (Resolved): libceph: many "socket closed" messages
I was seeing this too, but with the latest code and all (knock wood) the races closed I'm not anymore. Going to opti... Sage Weil
11:49 AM Bug #2846 (Resolved): Malformed keyring file causes kernel null pointer deref on "rbd map"
userland fixes applied to stable, next.
thanks!
Sage Weil
11:42 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
kernel patch is in testing branch. Sage Weil
06:23 AM Subtask #2805 (Fix Under Review): mon: Single-Paxos: Sync: Create a test unit to verify the corre...
Joao Eduardo Luis
06:22 AM Subtask #2805: mon: Single-Paxos: Sync: Create a test unit to verify the correctness of the whole...
Currently available tests:
* Removing keys:
> * Using both the whole-space iterator and the whole-space snapshot ...
Joao Eduardo Luis

07/30/2012

08:43 PM Revision cda5e8e0 (ceph): PG,ReplicatedPG: clarify scrub state clearing
scrub_clear_state takes care of clearing the SCRUB and REPAIR
flags. Thus, PG::scrub() needn't clear them again sinc...
Samuel Just
08:39 PM Revision 6d464a21 (ceph): PG::mark_clean(): queue_snap_trim if snap_trimq is not empty
Currently, we won't queue for snap trim until the next map
update.
Noticed while reviewing another patch, this would...
Samuel Just
08:38 PM Revision 1041b92c (ceph): ReplicatedPG::snap_trimmer: requeue if scrub_block_writes
Otherwise, we do not continue snap_trimming once scrub is
complete.
Noticed while revewing another patch. This woul...
Samuel Just
06:46 PM Linux kernel client Bug #2868: kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
hoping this was the messenger locking stuff, let's see if it pops up again Sage Weil
06:45 PM rbd Bug #2715 (Resolved): krbd: spinlock wrong CPU
Sage Weil
06:45 PM Linux kernel client Bug #2867 (Resolved): kclient: crash from ffsb in con_work -> kernel_sendmsg
Sage Weil
06:45 PM Linux kernel client Bug #2392: First read of symlink after ceph filesystem mounted gives error
Sage Weil
06:19 PM Revision 9071764a (ceph): qa: remove rbd image when done
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
06:19 PM Revision 4111501a (ceph): qa: add script to run rbd/librbd tests
Run on old and new style images, with different features. This is
intended to ease development, as opposed to being p...
Josh Durgin
06:19 PM Revision 902bbd79 (ceph): cls_rbd: don't duplicate encoding
Just skip the parts before encoding instead of returning early.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
06:19 PM Revision 72dc665b (ceph): librbd: make ImageCtx methods take snap_id parameters
This makes it easier to use without racing with snap_set.
Requests in the cache, for example, store their snap_id
and...
Josh Durgin
06:19 PM Revision bd119908 (ceph): librbd: add method to get the size of an image
To go with the other individual properties instead of stat,
add a method that only gets the size.
Signed-off-by: Jos...
Josh Durgin
06:19 PM Revision ea75d1ff (ceph): librbd: check for new snapshots in snap_set
This allows callers to create a snapshot and then read from
it immediately. No one uses this other than the tests,
fo...
Josh Durgin
06:19 PM Revision ff2a96c7 (ceph): rbd: fix copy-pasted snap remove error message
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
06:19 PM Revision c6bc3e10 (ceph): librbd: refactor synchronous I/O
Write in terms of the asynchronous functions, so all the logic
is not duplicated. Now there's only a single point whe...
Josh Durgin
06:19 PM Revision d3b5ee75 (ceph): librbd: split into multiple files
This was long overdue. There's still a bunch of stuff in internal.cc
that should be moved elsewhere, but this is a go...
Josh Durgin
06:17 PM Revision bbc2a4e1 (ceph): librbd: round #blocks in image_info() for edge cases
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
06:17 PM Revision 237f2409 (ceph): rbd: flatten command (copyup an entire image)
Fixes: #2561
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
06:17 PM Revision 26ed3841 (ceph): librbd, rbd.py, test_rbd.py: flatten methods
librbd:
- copyup_block for individual blocks (for read/write path)
- flatten for iterating across whole image (for rb...
Dan Mick
05:49 PM Revision 48038002 (ceph): Merge branch 'wip-osd'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
05:49 PM Revision 9e5d4e61 (ceph): osd: initialize send_notify on pg load
When the PG is loaded, we need to set send_notify if we are not the
primary. Otherwise, if the PG does not go throug...
Sage Weil
05:49 PM Revision f9ff8dd3 (ceph): osd: replace STRAY bit with bool
We were setting a bit in pg->state that is private to the non-primary
PG. The other bits get shared with the mon etc...
Sage Weil
05:47 PM Revision fb1d5495 (ceph): os: KeyValueDB: re-implement (prefix) iter in terms of whole-space iter
In-a-nutshell-version: Create a whole-space iterator interface, and
implement the already existing, prefix-based iter...
Joao Eduardo Luis
05:47 PM Revision 9dd8a333 (ceph): os: KeyValueDB: implement snapshot iterators
Create a set of functions, to be implemented by derivative classes of
KeyValueDB, responsible for returning an iterat...
Joao Eduardo Luis
05:47 PM Revision 8e404772 (ceph): test: test_keyvaluedb_iterators: Test KeyValueDB implementations iterators
This set of tests focus on testing the expected behavior of LevelDBStore's
and KeyValueDBMemory's iterators.
We test...
Joao Eduardo Luis
04:52 PM rbd Bug #2872 (Resolved): RBD resize command allows image size -1
Ceph Version : 0.48
Resize rbd image to size -1 allows rbd image to be resized to 15 Exabytes, which is incorrect....
Tamilarasi muthamizhan
03:52 PM rbd Bug #2871 (Resolved): rbd export command hangs when trying to export an image of size 0 to a loca...
Ceph Version: 0.48
Steps followed:
1. create a rbd image of size 1000 mb in rbd pool
2. resize the rbd image t...
Tamilarasi muthamizhan
10:52 AM Bug #2866 (Resolved): osd: pg stuck with unfound
commit:9e5d4e61a73343397e67e918e87f1e6dcb8ec72d and commit:7b9d37c662313929b52011ddae47cc8abab99095 Sage Weil
10:51 AM Bug #2860 (Resolved): osd: stuck waiting for pg acting set to change
commit:bae837010b6b486011b06dd97664fb54c3f3ff44 and commit:96feca450c5505a06868bc012fe998a03371b77f Sage Weil
09:14 AM Bug #2819: krbd: lockup on large writes, msgr fault injection
i'm unable to reproduce this on a real kernel.. it only happens on uml.
here is a full backtrace:...
Sage Weil
08:01 AM Bug #2638 (Resolved): mon: make pool ops idempotent
Sage Weil
08:01 AM Bug #2830 (Duplicate): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
Sage Weil
02:27 AM Revision 4a0a7e2f (ceph): auth: (ordered) list of auth methods, not a set
This lets us express a preference.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:27 AM Revision ec6ecc19 (ceph): auth: AuthSupported -> AuthMethodList
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:27 AM Revision effdec9e (ceph): auth: introduce cluster, service, and client auth settings
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil

07/29/2012

09:31 PM Linux kernel client Bug #2688 (Duplicate): lockup on ffsb + thrashing
Sage Weil
09:31 PM Linux kernel client Bug #2260 (Resolved): libceph: null pointer dereference at try_write+0x638+0xfb0
this is either #2867, or a similar issue that is since resolved. Sage Weil
09:28 PM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
Sage Weil
09:24 PM Linux kernel client Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg
*sigh of relief* Sage Weil
08:22 PM Linux kernel client Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg
This appears to be a regression, so it is effectively blocking sending the pull request to Linus. Sage Weil
07:40 PM Revision e4e239e2 (ceph): kernel: push a local .deb instead of using gitbuilder
This lets you specify a path to an existing kernel deb
to be pushed and installed on the remote node.
Limitations:
...
Sage Weil
07:15 PM Revision 1c93d5ab (ceph): syslog check: fix false-positive BUG matches in random strings
Sage Weil

07/28/2012

08:53 PM Revision 9d43c8a4 (ceph): test: workloadgen: Don't linearly iterate over a map to obtain a collec...
We were iterating over the collections map a certain amount of times, in
order to obtain the collection in that posit...
Joao Eduardo Luis
05:54 PM Revision 85187edd (ceph): add osd-recovery-incomplete
Sage Weil
05:53 PM Revision a0847694 (ceph): osd_recovery: also test unfound discovery
This tests for bug #2866. Sage Weil
05:52 PM Feature #2280 (Resolved): improve gitbuilder infrastructure
Sage Weil
05:50 PM RADOS Subtask #2792 (Fix Under Review): mon: require tunable feature bit if current osdmap uses non-def...
Sage Weil
05:23 PM Revision 731d5209 (ceph): ceph_manager: count 'incomplete' as 'down'
Sage Weil
05:23 PM Revision a9f2bf62 (ceph): ceph_manager: wait_for_active
Sage Weil
05:23 PM Revision 8dd09cb2 (ceph): osd_recovery: test incomplete pg recovery
4-osd test to reproduce #2860 and confirm the fix. Sage Weil
05:05 PM Revision 7b9d37c6 (ceph): osd: set STRAY on pg load when non-primary
The STRAY bit indicates that we should annouce ourselves to the primary,
but it is only set in start_peering_interval...
Sage Weil
04:04 PM Revision bae83701 (ceph): osd: peering: make Incomplete a Peering substate
This allows us to still catch changes in the prior set that would affect
our conclusions (that we are incomplete) and...
Sage Weil
04:04 PM Revision d1602ee2 (ceph): osd: peering: move to Incomplete when.. incomplete
PG::choose_acting() may return false and *not* request an acting set change
if it can't find any suitable peers with ...
Sage Weil
03:49 PM rgw Feature #2869 (Resolved): rgw: expand date format support
should be able to parse the following:
Sat, 28 Jul 2012 20:35:55 UTC
Which uses UTC instead of GMT.
Yehuda Sadeh
03:30 PM Feature #2477 (Fix Under Review): rados bench cleanup
Sage Weil
03:30 PM Feature #1783 (Fix Under Review): osd: scrub incrementally across hash range using MOSDPGScan
Sage Weil
02:39 PM Revision d6126940 (ceph): config: send warnings to a ostream* argument
We shouldn't always send these to stderr. (Among other things, the
warning: prefix breaks the gitbuilder error detec...
Sage Weil
02:21 PM Revision 1fc19df8 (ceph): Merge remote-tracking branch 'gh/wip-msgr-masterbits'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
07:37 AM Linux kernel client Bug #2868 (Resolved): kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
... Sage Weil

07/27/2012

11:03 PM Revision 96feca45 (ceph): osd: peering: make Incomplete a Peering substate
This allows us to still catch changes in the prior set that would affect
our conclusions (that we are incomplete) and...
Sage Weil
10:39 PM Revision a71e442f (ceph): osd: peering: move to Incomplete when.. incomplete
PG::choose_acting() may return false and *not* request an acting set change
if it can't find any suitable peers with ...
Sage Weil
09:28 PM Revision de4474ac (ceph): vstart.sh: apply extra conf after the defaults
This let's you do e.g., -o 'debug ms = 100' and it will apply after
the default logging levels.
Signed-off-by: Sage ...
Sage Weil
09:00 PM Revision 623026d9 (ceph): Merge remote-tracking branch 'gh/stable' into stable-next
Sage Weil
08:42 PM Revision 0b8b58ff (ceph): fix adminsocket test
Sage Weil
05:52 PM Linux kernel client Bug #2867 (Resolved): kclient: crash from ffsb in con_work -> kernel_sendmsg
... Sage Weil
05:45 PM Revision bbc49179 (ceph): msg/Pipe: if we send a wait, make sure we follow through
Mark our outgoing connection attempt if we send a WAIT in accept(). This
ensures we don't go to standby or closed in...
Sage Weil
05:44 PM Revision 6c01d46e (ceph): client: handle fault during session teardown
We may have a sequence like:
- client does REQUEST_CLOSE
- mds sends reply
- connection faults, client does get r...
Sage Weil
05:44 PM Revision a879425b (ceph): msg/Pipe: make STANDBY behavior optional
In particular, lossless_peers should use STANDBY, but lossless_clients
should reconnect immediately since they are al...
Sage Weil
05:43 PM Revision f69d025b (ceph): conf: make dup lines override previous value
If you put
[some section]
foo = 1
...
foo = 2
in a .conf file, make the second key override the first.
Generate...
Sage Weil
05:43 PM Revision 4788567e (ceph): qa: simple test for pool create/delete commands
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:43 PM Revision 6f7837a9 (ceph): mon: make 'osd rm ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:43 PM Revision 640e5fde (ceph): qa: simple tests for 'ceph osd create|rm' commands
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:43 PM Revision 3715d205 (ceph): mon: check for invalid pool snap creates in preprocess_op, too
This avoids waiting for a paxos commit just to return an error.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:43 PM Revision 507f99e9 (ceph): librados: make snap create/destroy handle client-side errors
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:43 PM Revision 53aa9593 (ceph): objecter: return ENOENT/EEXIST on pool snap delete/create
Do these checks on the client to mask monitor idempotency from the user.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:43 PM Revision 5f3ef77d (ceph): mon: make pool snap creation ops idempotent
Return 0 if the snap already exists, or is already deleted.
Also, avoid updating the pg_pool if we are just waiting ...
Sage Weil
05:43 PM Revision 4dfc14c4 (ceph): mon: remove superfluous "can't delete except on master" comments
That's what 'return false' means for preprocess_*().
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:43 PM Revision 6ec9555b (ceph): cond: reorder asserts
Make the more specific checks assert before the less specific ones, so we
are more likely to crash with useful inform...
Sage Weil
05:43 PM Revision c2e1c629 (ceph): mutex: assert we are unlocked by the same thread that locked
This only works for non-recursive locks. (Which is probably all of them?)
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:43 PM Revision 46e819ed (ceph): objecter: fix mon command resends
The monitor session is lossy. Send these when the op is initiated, or
when we reconnect. The timeout/cutoff was pre...
Sage Weil
05:43 PM Revision 358d6b61 (ceph): librados: make pool create/destroy handle client-side errors
Add tests!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:43 PM Revision 597f14ab (ceph): objecter: make pool create/delete return EEXIST/ENOENT
Do these checks on the client side to mask monitor idempotency from
the user.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
05:43 PM Revision 5503376f (ceph): mon: make pool removal idempotent
Return success if pool does not exist. Part of #2638.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:43 PM Revision a01e22d2 (ceph): mon: make pool creation idempotent
Return success if the pool already exists. Part of #2638.
Also, fix this so we wait until a creating pool is create...
Sage Weil
05:18 PM Bug #2866 (Fix Under Review): osd: pg stuck with unfound
Sage Weil
04:29 PM Bug #2866 (Resolved): osd: pg stuck with unfound
on congress, observed pg stuck with unfound objects. kicking peering (marking primary down) resolved it.
in testi...
Sage Weil
05:15 PM Bug #2860 (Fix Under Review): osd: stuck waiting for pg acting set to change
Sage Weil
03:17 PM Bug #2860: osd: stuck waiting for pg acting set to change
i can reproduce this with:... Sage Weil
12:39 PM Bug #2860 (Resolved): osd: stuck waiting for pg acting set to change
... Sage Weil
03:26 PM rbd Bug #2865 (Resolved): rbd import fails for directory but creates rbd image
Ceph Version: 0.48
Created a local directory t_dir.
when tried to import directory t_dir to rbd/rbd_image, it rep...
Tamilarasi muthamizhan
02:36 PM rgw Bug #2864 (Won't Fix): rados leaves behind references to old buckets
As this behavior can only be reproduced through deleting objects directly through rados, and not radosgw-admin or API... caleb miles
02:09 PM rgw Bug #2864 (Won't Fix): rados leaves behind references to old buckets
It is possible to create an inconsistent state by following this procedure:
1. create a bucket through an API call...
caleb miles
01:58 PM Bug #2824 (Resolved): ceph-fuse; hang mounting with ms failures
Sage Weil
01:46 PM CephFS Bug #2863 (Resolved): client: does not tolerate traceless replies from mds
In at least one case (_create's _mknod) we do not tolerate a (write) reply from the mds with no trace. This happens ... Sage Weil
01:21 PM rbd Bug #2862 (Resolved): CLI: rbd create command throws inappropriate error messages
Ceph Version: 0.48
When tried a few negative test cases using "rbd create command", found that the command display...
Tamilarasi muthamizhan
12:57 PM rbd Bug #2861 (Won't Fix): CLI: rbd create command requires validation for image-name
Ceph version: 0.48
When trying to create a rbd image, the image name seems to accept empty string and special char...
Tamilarasi muthamizhan
11:45 AM Bug #2462: osd/PG.cc: 402: FAILED assert(log.head >= olog.tail && olog.head >= log.tail)
just swa this on congress during a huge crush restructure:... Sage Weil
11:31 AM rgw Tasks #2859 (New): Make add subuser in radosgw-admin idempotent
Currently, attempting to create a subuser that already exists returns an error; it has been suggested that this behav... caleb miles
11:25 AM Bug #2858: mon: osd id parsing returns 0 when passed 'osd.1234'
Not sure exactly what scenario you're looking at here or what the bug is, but there are lots of places in the monitor... Greg Farnum
11:08 AM Bug #2858 (Resolved): mon: osd id parsing returns 0 when passed 'osd.1234'
Sage Weil
10:54 AM Bug #2752: Setting large maxosd kills all mons
Thanks Yehuda! Florian Haas
10:51 AM Feature #2857 (Resolved): compile non-production builds with -fno-omit-frame-pointer
This will let us get much more useful profiling data out of various tools with relatively minimal CPU overhead. Mark Nelson
08:17 AM Bug #2856 (Resolved): osd: bound size of transactions trimming old osdmaps
The monitor can arbitrarily advance it's oldest map. The osd should avoid sending down an arbitrarily large transacti... Sage Weil
04:55 AM Revision 9553c6ed (ceph): osd: fixing sharing of past_intervals on backfill restart
We need to share past_intervals whenever we instantiate the PG on a peer.
In the PG activation case, this is based on...
Sage Weil
04:55 AM Revision 29aa1cf4 (ceph): filestore: check for EIO in read path
Check for EIO in read methods and helpers. Try to do checks in low-level
methods (e.g., lfn_*()) to avoid duplicatio...
Sage Weil
04:29 AM Revision 0891948e (ceph): filestore: add 'filestore fail eio' option, default true
By default we will assert/fail/crash on EIO from the underlying fs. We
already do this in the write path, but not th...
Sage Weil

07/26/2012

11:35 PM Revision 9db78090 (ceph): osd: fixing sharing of past_intervals on backfill restart
We need to share past_intervals whenever we instantiate the PG on a peer.
In the PG activation case, this is based on...
Sage Weil
11:29 PM Revision aef10e72 (ceph): librbd: fix id initialization in new format
48bd839b1e25b063c675416a8f6233463f1af115 should have included this.
I misread it due to the use of bid instead of id ...
Josh Durgin
10:36 PM Bug #2830 (Need More Info): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
this may duplicate #2843.. sadly didn't take note of the osd id :( Sage Weil
10:34 PM Bug #2837 (Resolved): osd: past_interval calculation inefficient
Sage Weil
10:34 PM Bug #2849 (Resolved): osd: past_intervals not shared on backfill restart
Sage Weil
04:25 PM Bug #2849 (Resolved): osd: past_intervals not shared on backfill restart
peer info value is clobbered by backfill block prior to the dne() check in PG::activate()
this explains a lot!
Sage Weil
10:29 PM Revision 17bb78a2 (ceph): librbd: fix id initialization in new format
48bd839b1e25b063c675416a8f6233463f1af115 should have included this.
I misread it due to the use of bid instead of id ...
Josh Durgin
10:26 PM Revision 5601ae27 (ceph): mon: set a configurable max osd cap
Don't allow setting a higher osd num through the
ceph control util.
Fixes: #2752
Signed-off-by: Yehuda Sadeh <yehuda...
Yehuda Sadeh
10:04 PM Revision 87b6e804 (ceph): Merge remote-tracking branch 'gh/wip-rbd-bid' into stable-next
Sage Weil
10:04 PM Revision 81d72e5d (ceph): mon: make 'ceph osd rm ...' wipe out all state bits, not just EXISTS
This ensures that when a new osd reclaims that id it behaves as if it were
really new.
Backport: argonaut
Signed-off...
Sage Weil
10:03 PM Revision ad9c37f2 (ceph): test_stress_watch: just one librados instance
This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets...
Sage Weil
10:03 PM Revision c60afe18 (ceph): Merge commit '35b13266923f8095650f45562d66372e618c8824' into stable-next
First batch of msgr fixes. Sage Weil
10:03 PM Revision d2d40dc3 (ceph): client: fix locking for SafeCond users
Need to wait on flock, not client_lock.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:03 PM Revision 6ed01df4 (ceph): workqueue: kick -> wake or _wake, depending on locking
Break kick() into wake() and _wake() methods, depending on whether the
lock is already held. (The rename ensures tha...
Sage Weil
10:03 PM Revision 7fbbe465 (ceph): librados: take lock when signaling notify cond
When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the po...
Sage Weil
10:03 PM Revision 4d7d3e27 (ceph): osd: guard class call decoding
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:03 PM Revision 682609a9 (ceph): objecter: always resend linger registrations
If a linger op (watch) is sent to the OSD and updates the object, and then
the client loses the reply, it will resend...
Sage Weil
10:03 PM Revision 88330505 (ceph): ReplicatedPG: fix replay op ordering
After a client reconnect, the client replays outstanding ops. The
OSD then immediately responds with success if the ...
Samuel Just
10:02 PM Revision c963a21a (ceph): filestore: check for EIO in read path
Check for EIO in read methods and helpers. Try to do checks in low-level
methods (e.g., lfn_*()) to avoid duplicatio...
Sage Weil
10:01 PM Revision 6bd89aeb (ceph): filestore: add 'filestore fail eio' option, default true
By default we will assert/fail/crash on EIO from the underlying fs. We
already do this in the write path, but not th...
Sage Weil
08:48 PM Revision 0d6ce424 (ceph): Fixed the code to pass 'yes' during mkfs
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> tamil
06:08 PM rbd Subtask #2855 (Closed): krbd: copy-up on write to clone
Anonymous
06:07 PM rbd Subtask #2854 (Closed): krbd: write path
verify the target object exists in write requests. if we fail with ENOENT, trigger a copy-up. Anonymous
06:07 PM rbd Tasks #2853 (Resolved): krbd: read path
Anonymous
06:06 PM rbd Subtask #2852 (Closed): krbd: open parent on open
Anonymous
06:05 PM rbd Feature #2851 (Duplicate): krbd: RBD layering support
Kernel client should support all the layering functionality of the usermode client. Anonymous
06:00 PM rbd Feature #2850 (Duplicate): libceph: support multi-operation transactions
Anonymous
03:23 PM Bug #2752 (Resolved): Setting large maxosd kills all mons
Fixed, commit:5601ae27d6daf167dd83b3fc91b7b9591ca0cea6. Yehuda Sadeh
12:28 PM Bug #2848 (Won't Fix): OSDMap: pool_id is 64-bit, but pool_max is 32-bit
A large number of pools will overflow pool_max before using the full range of pool ids. Josh Durgin
12:26 PM Linux kernel client Cleanup #2847 (Resolved): libceph: osdmap definition is out of date
In particular, pool_id is an int instead of a 64-bit integer. There are probably other important differences as well. Josh Durgin
10:53 AM rbd Feature #2562 (In Progress): librbd: open parent images, read path, write path
Anonymous
10:46 AM rbd Feature #2726 (In Progress): krbd: clean up bio_pair leak/whatever
Guangliang Zhao sent a patch to fix that, however, I had some concerns about it, and I'm waiting for him to respond t... Yehuda Sadeh
08:26 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
Ok, I finally know the failing path.
So when you call add_key with an invalid payload, it will be parsed by ceph_k...
Sylvain Munaut
08:07 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
Damnit ... first it didn't take the formatting and second I pasted the wrong code :p... Sylvain Munaut
08:05 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
wrt to kernel crash, here's a minimal test case that will crash any machine that has rbd module loaded (works as user... Sylvain Munaut
04:28 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
I was pointing to a keyring file directly that happened to start with an empty line. So in rbd.cc, the function read_... Sylvain Munaut
03:53 AM Bug #2846 (Resolved): Malformed keyring file causes kernel null pointer deref on "rbd map"
Reported by Sylvain Munaut ("tnt" on OFTC):
(12:30:27) tnt: Is mounting a RBD on a machine that has an OSD suppose...
Florian Haas
02:32 AM Bug #2845 (Resolved): mkcephfs hasn't learned about new default keyring locations in argonaut
In 0.48, when running @mkcephfs@ in a @cephx@ authentication enabled cluster, the per-daemon keys for MDSs and OSDs a... Florian Haas
01:13 AM Revision e9b5a289 (ceph): config: fix 'config set' admin socket command
Fixes: #2832
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

07/25/2012

11:38 PM Revision 2b75ddef (ceph): Added '-y' option for mkfs.ext4
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> tamil
11:35 PM Revision 1a6cd965 (ceph): osd: break potentially large transaction into pieces
We do a similar trick elsewhere. Control this via a tunable. Eventually
we'll control the others (in a non-stable b...
Sage Weil
10:57 PM Revision bcb9ab8b (ceph): doc: updates to fix problem with ceph-cookbooks appearing in chef-server.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:54 PM Bug #2843 (Can't reproduce): filestore: replay failure on xfs
congress osd.328 crashed with... Sage Weil
09:53 PM Revision 15e16229 (ceph): osd: only commit past intervals at end of parallel build
We don't check for gaps in the past intervals, so we should only commit
this when we are completely done. Otherwise ...
Sage Weil
08:28 PM Revision 9767146f (ceph): osd: generate past intervals in parallel on boot
Even though we aggressively share past_intervals with notifies etc, it is
still possible for an osd to get buried beh...
Sage Weil
08:28 PM Revision d45929f4 (ceph): osd: move calculation of past_interval range into helper
PG::generate_past_intervals() first calculates the range over which it
needs to generate past intervals. Do this in ...
Sage Weil
08:27 PM Revision 18d5fc41 (ceph): osd: fix map epoch boot condition
We only want to join the cluster if we can catch up to the latest
osdmap with a small number of maps, in this case a ...
Sage Weil
05:58 PM Revision fca65ff5 (ceph): osd: move calculation of past_interval range into helper
PG::generate_past_intervals() first calculates the range over which it
needs to generate past intervals. Do this in ...
Sage Weil
05:58 PM Revision 16302ace (ceph): osd: generate past intervals in parallel on boot
Even though we aggressively share past_intervals with notifies etc, it is
still possible for an osd to get buried beh...
Sage Weil
05:58 PM Revision 5979351e (ceph): osd: fix map epoch boot condition
We only want to join the cluster if we can catch up to the latest
osdmap with a small number of maps, in this case a ...
Sage Weil
05:55 PM Bug #2842: mon: health detail lists pgs multiple times
This and #2827 may be related? Greg Farnum
05:52 PM Bug #2842 (Won't Fix): mon: health detail lists pgs multiple times
... Sage Weil
05:16 PM rgw Bug #1855: Creation of a subuser that appears to own an s3 key is possible, and removing the subu...
This bug can be reproduced by using the following options
./radosgw-admin -c {'ceph.conf'} --rgw-socket-path=/tmp...
caleb miles
05:16 PM rgw Bug #2841 (Resolved): rgw: fix usage trim
looking at the code, it seems that we don't encode the user in usage-trim (and also encode the wrong structure). Yehuda Sadeh
04:52 PM CephFS Bug #2187: pjd chown/00.t failed test 97
2012-07-23T19:16:10.185 INFO:teuthology.task.workunit.client.0.out:not ok 43
2012-07-23T19:16:10.186 INFO:teuthology...
Tamilarasi muthamizhan
04:51 PM CephFS Bug #2187: pjd chown/00.t failed test 97
Latest log: ubuntu@teuthology:/a/teuthology-2012-07-23_19:00:03-regression-master-testing-gcov/16530
Tamilarasi muthamizhan
04:30 PM Feature #2840 (Resolved): mon: $mon_data/cluster_fsid file
maybe written/verified by mkfs! Sage Weil
04:23 PM rgw Feature #2839 (Resolved): rgw: garbage collection
Provide a garbage collection mechanism, along the lines of what was described in a post to the mailing list. Yehuda Sadeh
04:20 PM rgw Bug #2652: Segmentation fault in rest-bench
is it still happening? Yehuda Sadeh
04:20 PM rgw Bug #2665: rest-bench hangs periodically
is that still happening? Yehuda Sadeh
04:10 PM devops Feature #2574 (Resolved): crowbar: use data disks automatically, journal inside data directory
There were bugs and the history was wrecked by github pull requests again, so I redid some commits, but this function... Anonymous
03:45 PM rgw Feature #2039 (Rejected): rgw: keep more than one bucket marker object
That's not the case anymore. We use the unique client id and a running counter instead. Yehuda Sadeh
02:08 PM Bug #2838 (Resolved): mon: json version of 'osd tree'
Sage Weil
01:52 PM Bug #2824: ceph-fuse; hang mounting with ms failures
Sage Weil
01:52 PM Bug #2835 (Resolved): osd: do not send alive/upthru until booted
Sage Weil
01:52 PM Bug #2836 (Resolved): osd: boot condition check incorrect
Sage Weil
10:52 AM Bug #2836 (Resolved): osd: boot condition check incorrect
commit:5979351ef3d3d03bced9286f79cbc22524c4a8de Sage Weil
11:04 AM Bug #2837 (Resolved): osd: past_interval calculation inefficient
It is still possible for osds to get pgs without past intervals and need to recalculate them, and that calculation ca... Sage Weil
04:36 AM Revision 61ff894c (ceph): admin-socket: test generic admin socket commands
Sage Weil
03:54 AM Revision 11b275a0 (ceph): osd: avoid misc work before we're active
If we're booting, we shouldn't scrub, or send reports to the montior,
or send heartbeats, or any of that.
Signed-off...
Sage Weil
03:51 AM Revision 278b5f58 (ceph): mon: ignore pgtemp messages from down osds
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:51 AM Revision 08e2ecac (ceph): mon: ignore osd_alive messages from down osds
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:18 AM Revision 8c7186d0 (ceph): mon: ignore pgtemp messages from down osds
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:16 AM Revision b17f5467 (ceph): mon: ignore osd_alive messages from down osds
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:23 AM Revision 404a7f52 (ceph): admin_socket: json output, always
If the perfcounters stuff were refactored to use the Formatter, we could
put the JSONFormatter in the admin_socket co...
Sage Weil
12:23 AM Revision 0133392b (ceph): admin_socket: dump config in json; add test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:22 AM Revision 8c3b4907 (ceph): Merge branch 'next'
Sage Weil

07/24/2012

10:26 PM Revision 9bc86171 (ceph): admin_socket: make test optional
If it's not there, we just verify the output is valid json. Sage Weil
08:58 PM Bug #2835 (Resolved): osd: do not send alive/upthru until booted
Sage Weil
08:53 PM Revision 0ef8cd3c (ceph): config: fix 'config set' admin socket command
Fixes: #2832
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:49 PM Revision 186a595c (ceph): Merge branch 'next'
Sage Weil
06:02 PM Revision f565ace6 (ceph): osd: fix pg log zeroing
Zero the right number of bytes. Fixes a bug where we clobber legit log
data. Fortunately this is only triggered wit...
Sage Weil
05:10 PM Revision 3e886799 (ceph): Merge branch 'wip-2763'
Yehuda Sadeh
05:09 PM Revision 52f51a24 (ceph): wireshar/ceph/packet-ceph.c: fix eol
Removing extra char from dos eol format.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:09 PM Revision d67ad0db (ceph): Wireshark dissector updated, work with the current development tree of ...
Pierre Rognant
04:02 PM Bug #2833: LibRBD.TestClone failure with ms fialure injection
I'm pretty sure this is due to the op being done twice due to replay. That error (-EBADF) means the id was already ad... Josh Durgin
03:49 PM Bug #2833 (Resolved): LibRBD.TestClone failure with ms fialure injection
... Sage Weil
03:54 PM Bug #2834 (Resolved): osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == ...
... Sage Weil
03:10 PM devops Feature #2574: crowbar: use data disks automatically, journal inside data directory
The most recent pull request for the cookbook has been tested by Tyler and myself. I've setup the following situation... JuanJose Galvez
02:57 PM Bug #2832: [argonaut] config set foo bar broken?
What was the resolution? Greg Farnum
02:33 PM Bug #2832 (Resolved): [argonaut] config set foo bar broken?
Sage Weil
01:32 PM Bug #2832 (Resolved): [argonaut] config set foo bar broken?
... Sage Weil
01:08 PM Bug #2831 (Duplicate): librados: LibRadosWatchNotify.WatchNotifyTest segfault
nevermind, i cherry-pick a bad version of the linger op fix for #2796 Sage Weil
12:48 PM Bug #2831 (Duplicate): librados: LibRadosWatchNotify.WatchNotifyTest segfault
On the stable-next branch, commit:7e9336e7cb36c6640fd5c8509c9c0ffd87b15d7a,
I hit this sometime last week or the w...
Sage Weil
11:59 AM Bug #2824 (In Progress): ceph-fuse; hang mounting with ms failures
Sage Weil
10:58 AM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
commit:f565ace62a1235dbe6a4ee9db5c5233cea395446 Sage Weil
08:44 AM Bug #2799: osd: pg log trimming zeroing broken
this was a bug in pg log trimming/zeroing. thankfully it was only enabled in master! will backport the fix all over... Sage Weil
08:25 AM Bug #2799: osd: pg log trimming zeroing broken
... Sage Weil
10:14 AM Subtask #2805: mon: Single-Paxos: Sync: Create a test unit to verify the correctness of the whole...
Current available test units:
* Removing keys:
> * Using the whole-space iterator, both in LevelDBStore and KeyVa...
Joao Eduardo Luis
09:19 AM Bug #2830 (Duplicate): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
0> 2012-07-24 14:38:09.918736 7fe53cede700 -1 osd/OSD.cc: In function 'OSDMapRef OSD::get_map(epoch_t)' thread 7... Sage Weil
01:30 AM Revision a16d9c64 (ceph): os: KeyValueDB: allow finer-grained control of transaction operations
This patch introduces the possibility of using single key/value
modification operations into the transaction interfac...
Joao Eduardo Luis
01:30 AM Revision a3d57a6e (ceph): os: KeyValueDB: Add virtual raw_key() function to return (prefix,key) pair
If we were to use solely the key() function, whenever we had a key with,
say, prefix 'Foo' and key 'Bar', the key() f...
Joao Eduardo Luis
12:39 AM Revision 6c0fa509 (ceph): doc: update information about stable vs development releases
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:16 AM Revision 48bd839b (ceph): librbd: replace assign_bid with client id and random number
The assign_bid method has issues with replay because it is a write
that also returns data. This means that the replay...
Josh Durgin

07/23/2012

11:51 PM Revision 67832c34 (ceph): osd: fix ACK ordering on resent ops
The wait_for_ondisk handling fixed COMMIT ordering, but the ACKs need to
go back in the same order too. For example:...
Sage Weil
11:01 PM Revision 96dbc412 (ceph): rados::cls::lock: move api types into namespace
By popular demand, moved public api into namespace. This
required some changes to ceph_dencoder to get some template
...
Yehuda Sadeh
09:09 PM Revision 7dfdf4f8 (ceph): librbd: replace assign_bid with client id and random number
The assign_bid method has issues with replay because it is a write
that also returns data. This means that the replay...
Josh Durgin
07:51 PM Revision dc2d6711 (ceph): librados: add new constructor to form a Rados object from IoCtx
This creates a separate reference to an existing connection, for
use when a client holding IoCtx needs to consult ano...
Dan Mick
07:43 PM Revision d9bfe954 (ceph): Merge tag 'v0.49'
v0.49 Sage Weil
06:28 PM Revision ca6265d0 (ceph): v0.49
Sage Weil
05:47 PM Revision c8f13119 (ceph): mon: make 'ceph osd rm ...' wipe out all state bits, not just EXISTS
This ensures that when a new osd reclaims that id it behaves as if it were
really new.
Backport: argonaut
Signed-off...
Sage Weil
05:17 PM Bug #2822 (Resolved): rbd: ASSIGN_BID both mutates object and returns data
commit:48bd839b1e25b063c675416a8f6233463f1af115 on master
commit:7dfdf4f8de16155edd434534e161e06ba7c79d7d for argonaut
Sage Weil
05:03 PM Feature #2829 (Resolved): report on cluster size/status (for service billing purposes)
The plan is to base support contract pricing on capacity,
which means we need a quick, easy (and moderately trusted)...
Anonymous
04:59 PM Bug #2828 (Resolved): osd: assign_bid was allowed to mutate and return data
The osd shouldn't have allowed this. Figure out why, and then see if there is a graceful way to fix it without break... Sage Weil
04:56 PM Revision f42e1875 (ceph): cls_rbd, cls_rbd_client, test_cls_rbd: copyup method
Fixes: #2559
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
04:55 PM Revision 9b9efe7e (ceph): cls_rbd_client: fix locking function indentation
Greg's editor was set to spaces only.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
04:55 PM Revision d5e45418 (ceph): cls_rbd: return negative pool id if parent does not exist
This lets us use get_parent in a multi-op transaction without getting
-ENOENT for the entire transaction.
Signed-off...
Josh Durgin
04:55 PM Revision 56bc3699 (ceph): cls_rbd: make get_parent return valid data when layering is disabled
This means clients can treat an error in their multi-object
transaction as a failure for all of them. This makes the ...
Josh Durgin
04:55 PM Revision 02aba7bc (ceph): librbd: rename variables for more consistency
Rename: poolname -> pool_name
snapname -> snap_name
snapid -> snap_id
Signed-off-by: Josh Durgin <...
Josh Durgin
04:55 PM Revision 4af1f981 (ceph): librbd: move watch/unwatch to ImageCtx methods
This fixes close_image() on a partially initialized ImageCtx, and
paves the way for opening parent images without wat...
Josh Durgin
04:55 PM Revision 233d44a1 (ceph): librbd: allow an image to be opened without watching
Watching the header of a parent image could produce unreasonable
delays. If hundreds of child images watch the same p...
Josh Durgin
04:55 PM Revision df71c2d3 (ceph): librbd: store parent info in snapshot metadata
If an image is open to a snapshot, return the features or overlap from
that snapshot.
Signed-off-by: Josh Durgin <jo...
Josh Durgin
04:55 PM Revision 8014d932 (ceph): librbd: allow opening an image by id
If we have an ImageCtx with an id set, don't bother setting the name.
Just go directly to the header object in ImageC...
Josh Durgin
04:55 PM Revision b82f5958 (ceph): librbd: move all side-effects to snap_{un}set()
All callers need to do this anyway.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
04:55 PM Revision c18c4b86 (ceph): librbd: open parent image when opening a child
Extract a helper out of get_parent_info. The parent may become unset
while the child is open, so detect changes in it...
Josh Durgin
04:55 PM Revision 127ff618 (ceph): librbd: drop unnecessary std:: and struct prefixes
Most of this file do not use them.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
04:52 PM Bug #2823 (Resolved): osd: out of order ACKs
commit:67832c34a228c5cff1941eab3225969b5e66e44b Sage Weil
08:47 AM Bug #2823: osd: out of order ACKs
specifically, this is out of order ACKs when ops are resent. trivially reproducible with the thrasher + radosmodel t... Sage Weil
04:21 PM Revision 5fcb22f0 (ceph): mkcephfs: add sync between btrfs scan and mount
This appears to fix problems with mount failing for at least one user.
Reported-by: Paul Pettigrew <Paul.Pettigrew@m...
Sage Weil
02:44 PM Bug #2827 (Rejected): mon: ceph health string doesn't match "ceph -s" output
I've seen several ceph -s reports lately where the first line on the PGs doesn't match the output of the PG line. My ... Greg Farnum
11:11 AM devops Feature #2553 (Closed): crowbar: open question: What's the correct way to add RBD support to the ...
This ticket was an arbitrary reminder to send upstream an email with questions, and we have answers & sort of underst... Anonymous
11:04 AM Feature #2826 (Resolved): osd: backfill target reservations
When osds are marked out, the recovery ops per primary throttling works fine, because the backfill targets tend to be... Sage Weil
10:45 AM CephFS Bug #2187: pjd chown/00.t failed test 97
also,
ubuntu@teuthology: /a/teuthology-2012-07-23_05:00:03-regression-stable-master-basic/16425
from teuthology...
Tamilarasi muthamizhan
10:42 AM CephFS Bug #2187: pjd chown/00.t failed test 97
Recent logs: ubuntu@teuthology: /a/teuthology-2012-07-23_05:00:03-regression-stable-master-basic/16416
From teutho...
Tamilarasi muthamizhan
10:14 AM Bug #2787: RBD test copy fails when thrashing
Recent logs: ubuntu@teuthology:/a/teuthology-2012-07-23_00:00:08-regression-next-testing-basic/16393
ubuntu@teutholo...
Tamilarasi muthamizhan
08:59 AM CephFS Bug #2825: File lock doesn't work properly
Here is the code included in *pre* ;)... Jean-Sébastien Frerot
08:56 AM CephFS Bug #2825 (Resolved): File lock doesn't work properly
Hi,
I've been testing the locking feature on cephfs and seems that the locking doesn't work as expected when locki...
Jean-Sébastien Frerot
05:03 AM Revision d0698f9d (ceph): schedule_suite.sh: put coverage option in ceph section
Sage Weil
03:50 AM Revision deb7a54a (ceph): marginal kclient+ffsb: enable mds logging to catch badess
See #1947 Sage Weil
03:47 AM Revision 32353f13 (ceph): move misc, blogbench back into active kernel suite
these were removed from regression ages ago, and only recently put back in
marginal. they seem fine.
Sage Weil
03:47 AM Revision 000fec3b (ceph): move all kernel tests to kernel suite; symlink collections from regression
Make regression a union of other topical suites. Sage Weil

07/22/2012

10:46 PM Bug #2824 (Resolved): ceph-fuse; hang mounting with ms failures
... Sage Weil
10:36 PM Bug #2823 (Duplicate): osd: out of order ACKs
... Sage Weil
10:34 PM Bug #2822: rbd: ASSIGN_BID both mutates object and returns data
ubuntu@teuthology:/a/sage-2012-07-22_21:26:14-regression-wip-msgr-testing-basic/16193 too
Sage Weil
10:33 PM Bug #2822 (Resolved): rbd: ASSIGN_BID both mutates object and returns data
It's not okay to mutate data *and* return data, because when resending the operation (e.g., due to a connection fault... Sage Weil
09:40 PM Bug #2502 (Resolved): osd: do not mark self back up if internal heartbeats are failing
Sage Weil
09:34 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
Sage Weil
09:32 PM Bug #2799: osd: pg log trimming zeroing broken
Sage Weil
09:29 PM CephFS Bug #1677: mds interval_set.h: 385: FAILED assert(p->first <= start)
Sage Weil
03:59 AM Revision 17e4f751 (ceph): this fails reliably
Sage Weil
03:18 AM Revision f70b8250 (ceph): ceph: fix mkfs/mount option defaults
Later code expects a list, not None. Sage Weil

07/21/2012

04:15 PM Revision 2d7e2cbf (ceph): crush: fix name map encoding
We screwed up and encoded using the name 'int' type instead of int32_t.
That means people have systems encoding this ...
Sage Weil
03:50 PM Revision b497bdac (ceph): osd/OpTracker: fix use-after-free
And formatting.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
02:12 PM CephFS Bug #2187: pjd chown/00.t failed test 97
Latest logs: ubuntu@teuthology:/a/teuthology-2012-07-21_05:00:03-regression-stable-master-basic/15237
ubuntu@teuth...
Tamilarasi muthamizhan
09:17 AM Bug #2638 (Fix Under Review): mon: make pool ops idempotent
Sage Weil
09:17 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
commit:b497bdacf5d5732285d280772ab9ce6ae88f59e8 Sage Weil
08:15 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
... Sage Weil
01:36 AM Revision 7cf1f1fb (ceph): msg/Pipe: go to STANDBY on lossless accept fault
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 2e67b7a3 (ceph): msgr: do not reopen failed lossy Connections
There was a race where:
- sending stuff to a lossy Connection
- it fails, and queues itself for reap, queues a RES...
Sage Weil
01:36 AM Revision adce6df2 (ceph): msgr: simplify submit_message()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 04fde518 (ceph): msgr: drop CLOSED checks during queueing
AFAICS these checks are pointless. There should be no harm in queueing
messages on a closed connection; they'll get ...
Sage Weil
01:36 AM Revision 60eb36ef (ceph): msgr: close get_connection() race
This could null deref if the Pipe is registered but failed.
We need to loop here because the Pipe vs Connection stuf...
Sage Weil
01:36 AM Revision ea7511b8 (ceph): msg/Pipe: fix leak of Connection in ctor
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 9348bb38 (ceph): osd: reopen heartbeat connections when they fail
If we have an active peer whose Connection fails, open a new one. This
is necessary now that a lossy client connecti...
Sage Weil
01:36 AM Revision ef3fd1c3 (ceph): msg/Pipe: go to standby on lossless server connection faults
Go directly to the STANDBY state, and print a more accurate message.
Otherwise, we do the same check in writer() and ...
Sage Weil
01:36 AM Revision 8966f71a (ceph): msg/IncomingQueue: make the pipe parent informational only
Use this pointer only for debug output prefix; do not dereference, as we
may live beyond the original parent.
Signed...
Sage Weil
01:36 AM Revision 525830cd (ceph): msgr: do not discard_queue in Pipe reaper
The IncomingQueue can live beyond the Pipe. In particular, there is no
reason not to deliver messages we've received...
Sage Weil
01:36 AM Revision 7b398a5d (ceph): msgr: indicate whether clients are lossy
We need to know whether the client is lossy before we connect to the peer
in order to know whether to deliver a RESET...
Sage Weil
01:36 AM Revision 9d94ed1c (ceph): test_stress_watch: verify that the watch operation isn't slow
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 472d14f7 (ceph): msg/DispatchQueue: fix locking in dispatch thread
The locking was awkward with locally delivered messages.. we dropped dq
lock, inq lock, re-took dq lock, etc. We wo...
Sage Weil
01:36 AM Revision cef85105 (ceph): msg/Connection: add failed flag for lossy Connections
If a lossy Connection fails and we disconnect the Pipe, set a failed flag.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:36 AM Revision 541694f7 (ceph): msg/Pipe: disconnect Pipe from lossy Connection immediately on failure
When we have a lossy connection failure, immediately disconnect the Pipe
and set the Connection failed flag. There i...
Sage Weil
01:36 AM Revision 9a4e7027 (ceph): msg/Pipe: unregister pipe immediately on fault; fix mark_down
This fixes a problem where:
- pipe faults, con->pipe is cleared
- ms_handle_reset tries to mark_down, but it doesn...
Sage Weil
01:36 AM Revision 35b7bca3 (ceph): msg: kill tcp.{cc,h}
Move the remaining comparator into msg_types.h and kill this off.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:36 AM Revision 5a62dfef (ceph): msg/DispachQueue: hold lock in IncomingQueue::discard_queue()
This prevents races with the dispatch thread, among other things.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:36 AM Revision 999c506d (ceph): msg/DispatchQueue: give IncomingQueue ref to queue
We want to be able to queue an event (e.g., RESET) and deliver it even
after the Pipe is destroyed.
Signed-off-by: S...
Sage Weil
01:22 AM Revision 5ecc5bce (ceph): msg/DispatchQueue: cleanup debug prefix
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:22 AM Revision 89b07f47 (ceph): msg/Pipe: move tcp_* functions into Pipe class
This lets us print nice debug prefixes. It also calls BS on the
Pipe vs tcp.cc separation.
Signed-off-by: Sage Weil...
Sage Weil
01:22 AM Revision d034e46d (ceph): msgr: move Accepter into separate .cc
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:22 AM Revision 3e98617c (ceph): msg/Pipe: get_state_name()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:22 AM Revision f78a4010 (ceph): msgr: rework accept() connect_seq/race handling
We change a couple of key things here:
* If there is a matching connect_seq and the existing connection is in OPEN ...
Sage Weil
12:36 AM Revision ec4ce8bf (ceph): regression: do some tests on ext4
Sage Weil
12:20 AM Revision a6735ab0 (ceph): OpRequest,OSD: track recent slow ops
This should be helpful while investigating slow performance.
OpRequests now track events with timestamp in addition
...
Samuel Just

07/20/2012

11:09 PM Bug #2820: osd: crash in handle_osd_ping
also ubuntu@teuthology:/a/sage-2012-07-20_18:17:10-regression-wip-msgr-cleanup-testing-basic/14885
- chef: null
-...
Sage Weil
11:08 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
... Sage Weil
10:45 PM RADOS Subtask #2793 (Fix Under Review): osd: require tunable feature if current osdmap uses non-default...
Sage Weil
09:32 PM Revision d624f343 (ceph): Merge branch 'next'
Samuel Just
08:59 PM Revision 9e207aa8 (ceph): test/store_test.cc: verify collection_list_partial results are sorted
Synthetic test now also varies snapshots and uses a small variety of
hashes.
Signed-off-by: Samuel Just <sam.just@in...
Samuel Just
08:41 PM Revision 49877cde (ceph): cls_lock: cls_lock_id_t -> cls_lock_locker_id_t
Renamed type to make more sense.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:28 PM Revision 315bbea5 (ceph): cls_lock: document lock properties
Added some comments about different lock properties.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:16 PM Revision 056d42cf (ceph): cls_log: update a comment
Was missing output param description.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:14 PM Revision da770146 (ceph): move cfuse+dbench back to regression for verify, too
Sage Weil
08:11 PM Revision 2c7d7821 (ceph): rados: lock info keeps expiration, not duration
We pass duration in the request, but internally we keep
the expiration.
Signed-off-by: Yehuda Sadeh <yehuda@inktank....
Yehuda Sadeh
08:00 PM Revision d16844c8 (ceph): rados tool: add advisory lock control commands
Can now lock, break lock, list locks and show lock
info.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:59 PM Revision 2f8de894 (ceph): cls_lock: objclass for advisory locking
Providing an objclass to create and manipulate advisory
locking. Also providing a client api to control it. A lock
ma...
Yehuda Sadeh
07:55 PM Revision 9c5c3edf (ceph): objclass: add api calls to get/set xattrs
added the following functions:
cls_cxx_getxattr
cls_cxx_getxattrs
cls_cxx_setxattr
Signed-off-by: Yehuda Sadeh...
Yehuda Sadeh
07:29 PM Revision adc9b91f (ceph): os/HashIndex: use set<pair<string, hobject_t>> rather than multimap
Multimap does not make any guarantees about ordering of different
values with the same key. list_by_hash, however, a...
Samuel Just
05:22 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Samuel Just
04:30 PM Bug #2819 (Won't Fix): krbd: lockup on large writes, msgr fault injection
krbd + osd socket failure injection + iozone -> crash in uml, without any useful debugging. uml itself either locks ... Sage Weil
04:07 PM rbd Feature #2559 (Fix Under Review): cls_rbd: copyup method
Sage Weil
04:07 PM rbd Feature #2607 (Fix Under Review): librbd: copyup helper
Sage Weil
03:45 PM Bug #2638 (In Progress): mon: make pool ops idempotent
Sage Weil
03:40 PM Documentation #2729 (Resolved): doc: OSD internals
Sage Weil
03:25 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
Responding to Sage's possible kinks:
1. the cephbooter issue only comes up with new installs, existing vms with th...
Anonymous
03:19 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
I moved http://gitbuilder-precise-i386.front.sepia.ceph.com/ and wrote a doc on how to move whole images: https://doc... Anonymous
02:37 PM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
09:57 AM RADOS Subtask #2791 (In Progress): crush: feature bit for crush tunables
Sage Weil
02:20 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
Sage Weil
09:59 AM Messengers Bug #2802 (In Progress): msgr: mds session hangs on direct_io test
just a misleading message; cleaned up. the direct_io test (which leaves the mds connection mostly idle) still fails ... Sage Weil
01:50 PM Feature #2812 (Resolved): automated CentOS testing
(1) testing on anything other than Precise means we need the ability to re-image test machines or to run teuthology o... Anonymous
01:46 PM devops Feature #2808: crowbar: upgrade to fred3 (get bind bug fix)
Test that the bind problem has been fixed; nodes should be able to resolver other node names to IP addresses without ... Anonymous
01:45 PM devops Feature #2808 (Rejected): crowbar: upgrade to fred3 (get bind bug fix)
Anonymous
01:41 PM Feature #2807 (Resolved): RPM git-builder
We have customers who want to run Ceph on RHEL.
A lot of people run RHEL (or CentOS) and we should be building
(an...
Anonymous
01:26 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
The doc is fine as it is; it seems the new upstream makes this a lot nicer. Anonymous
10:46 AM devops Feature #2785: crowbar: custom ISO builder that includes barclamp-ceph
Crowbar cleaned up the build process so I was able to go through and build an ISO with our barclamp rather easily. Th... JuanJose Galvez
01:19 PM devops Feature #2574 (In Progress): crowbar: use data disks automatically, journal inside data directory
Anonymous
11:17 AM rbd Bug #2535 (Resolved): rbd: random data corruption in vm
Thanks for reminding us Florian. There are no open questions that I know of, the only problem with this was fiemap, s... Josh Durgin
11:11 AM rbd Bug #2535: rbd: random data corruption in vm
Just so this doesn't fall of the table, can this be considered resolved with the changed @filestore fiemap@ default? ... Florian Haas
10:06 AM Bug #2761: osd: failed to recover before timeout expired
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-20_05:00:03-regression-stable-master-basic/14803
from teuthol...
Tamilarasi muthamizhan
10:05 AM rbd Bug #2715: krbd: spinlock wrong CPU
hoping this was caused by the mutex-less con_open, or something similar. will keep this open for a few more days to ... Sage Weil
09:29 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Trimming through Paxos on the services is the way to go, no doubt. However, it's a whole world of pain when it comes ... Joao Eduardo Luis
08:33 AM Bug #2799: osd: pg log trimming zeroing broken
again today: ubuntu@teuthology:/a/teuthology-2012-07-19_19:00:08-regression-master-testing-gcov/14585 Sage Weil
07:13 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
We have added support for a whole-space iterator and a snapshot iterator (task #2758) on KeyValueDB and LevelDB, and ... Joao Eduardo Luis
05:14 AM Revision 0b84384f (ceph): mon: shut up about sessionless MPGStats messages
If the mon gets a reset on the client connection, it clears the session
on the connection. This is perfectly normal ...
Sage Weil
04:27 AM Revision 6580450f (ceph): osd: clean up boot method names
Prefix subsequent steps with _. Better names.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Yehuda Sadeh...
Sage Weil
04:27 AM Revision 369fbf61 (ceph): osd: defer boot if heartbeatmap indicates we are unhealthy
If the OSD is bogged down or unresponsive, we should not try to join
the cluster. This was observed on congress (slo...
Sage Weil
03:22 AM Revision d76df212 (ceph): Merge branch 'next'
Conflicts:
src/include/ceph_features.h
Sage Weil
12:13 AM Revision 7f58b9be (ceph): mon: track pg creations by osd
Track the pending pg creations by osd, and use a helper to send out that
messages.
Signed-off-by: Sage Weil <sage@in...
Sage Weil
12:13 AM Revision dec93692 (ceph): osd/mon: subscribe (onetime) to pg creations on connect
Ask the monitor for pending pg creations each time we connect.
Normally, this is a freebie check. If there are pend...
Sage Weil

07/19/2012

11:48 PM Revision 42de6873 (ceph): rbd: fix usage for snap commands
Snap commands take '--snap <snapname> <imagename>'.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:45 PM Revision 4c6c927b (ceph): Revert "rbd: fix usage for snap commands"
This reverts commit 42de6873f9ca33fc20e70176d9a422635a6f0152.
Actually, these are fine! Dan made them all kinds of ...
Sage Weil
10:51 PM Revision e1c98e7d (ceph): tasks: add multibench task for testing pool creation
Also adds support for specifying a pool for radosbench
to create and then cleanup instead of "data".
Signed-off-by: ...
Samuel Just
10:06 PM rgw Feature #2804 (Rejected): rgw: disallow running multiple gateways on the same fastcgi socket
We saw an incident where multiple gateways were running on the same machine, both pointing to the same apache <-> rad... Yehuda Sadeh
09:35 PM rbd Feature #2788 (Fix Under Review): rbd: add -l to rbd ls, show snapshots
wip-rbd-ls-R Dan Mick
03:55 PM rbd Feature #2788 (In Progress): rbd: add -l to rbd ls, show snapshots
Dan Mick
09:13 PM Bug #2502 (Fix Under Review): osd: do not mark self back up if internal heartbeats are failing
wip-osd-hb Sage Weil
09:02 PM Bug #2638: mon: make pool ops idempotent
this needs to be fixed in order to get the msgr failure injection stuff in the qa suite Sage Weil
09:01 PM Bug #2771: mon|osd: fail to create pgs on startup
commit:ca9f71300471e1947be63e5dfad1a4800c1ffc9c and a few commits that follow Sage Weil
09:00 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
Sage Weil
06:29 PM Revision 58cd27fd (ceph): doc: add missing dependencies to README
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
05:52 PM Bug #2803 (Resolved): filer: probe crash
... Sage Weil
05:50 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
... Sage Weil
05:21 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
still working on part that dumps slowest ops over past N seconds Samuel Just
04:49 PM Linux kernel client Bug #2801 (Resolved): msgr crash in ceph_msg_new
... Sage Weil
04:15 PM Linux kernel client Feature #1699 (Resolved): debug symbols in autobuilt (sepia) kernels
the symbols are there. the trick is for objdump to find them in the original path. do this:
sudo mkdir -p /srv/a...
Sage Weil
04:13 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
... Sage Weil
03:18 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-07-19_08:03:32-marginal-master-testing-basic/14125 Sage Weil
03:55 PM rbd Subtask #2606 (Fix Under Review): librbd layering: copyup on missing child object
Dan Mick
03:54 PM rbd Subtask #2606 (In Progress): librbd layering: copyup on missing child object
Dan Mick
02:45 PM Bug #2800: osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
Might be a result of reordered or dropped messages. Samuel Just
02:31 PM Bug #2800 (Resolved): osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first ...
ceph-sha1: 508bf3fb96929dc6e91ee83a915a7f30df23eeeb
description: collection:thrash clusters:6-osd-3-machine.yaml fs:...
Samuel Just
02:26 PM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
... Sage Weil
12:31 PM devops Feature #2780 (In Progress): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
12:30 PM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Good enough for now. Anonymous
05:09 AM Tasks #2789: pkgconfig files for libraries
No, I'm talking about: http://en.wikipedia.org/wiki/Pkg-config
I added a patch which adds these files, but I'm not...
Wido den Hollander
02:50 AM Revision c9967120 (ceph): add CRUSH_TUNABLES feature bit
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:49 AM Revision 6f381aff (ceph): add CRUSH_TUNABLES feature bit
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil

07/18/2012

10:37 PM Revision e3349a2a (ceph): OSD::handle_osd_map: don't lock pgs while advancing maps
We no longer do anything with the pgs here. PG map
advancing is now handled in OSD::advance_pg asyncronously.
Signe...
Samuel Just
09:26 PM Revision ca9f7130 (ceph): OSD: actually send queries during handle_pg_create
During the osd threading refactor, we lost the do_queries
call in favor of dispatch_context. However, this did not
i...
Samuel Just
09:26 PM Revision 8f5562ff (ceph): OSD: write_if_dirty during get_or_create_pg after handle_create
In the case that the pg is newly created, we will activate during
that call, so the info and log will be dirty.
Sign...
Samuel Just
09:26 PM Revision c8ee3016 (ceph): osd: add osd_debug_drop_pg_create_{probability,duration} options
This will let us exercise more of the pg creation code.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:44 PM Revision c49daeca (ceph): clock: print skew with ntp servers to log to help debug time issues
Sage Weil
07:59 PM Revision 0b579546 (ceph): ObjectCacher: fix cache_bytes_hit accounting
Misses are not hits!
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:58 PM Revision 0d0b4689 (ceph): Merge branch 'next'
Josh Durgin
07:55 PM Revision 5dd68b95 (ceph): objecter: always resend linger registrations
If a linger op (watch) is sent to the OSD and updates the object, and then
the client loses the reply, it will resend...
Sage Weil
06:04 PM Revision 55847fc2 (ceph): nuke: log what pid we are killing when we kill it
Sage Weil
05:52 PM Revision 7586cde9 (ceph): qa/workunits/suites/pjd.sh: bash -x
This will let us see what test is failing, exactly, and what its inputs
were. Hoping to help find #2187.
Signed-off...
Sage Weil
05:44 PM Revision 76efd977 (ceph): OSD: publish_map in init to initialize OSDService map
Other areas rely on OSDService::get_map() to function, possibly before
activate_map is first called. In particular, ...
Samuel Just
05:25 PM Revision 675d6302 (ceph): ObjectCacher: fix cache_bytes_hit accounting
Misses are not hits!
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
02:35 PM Revision 4e1d973e (ceph): doc: Fixed heading text.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:35 PM Revision ebc57736 (ceph): doc: favicon.ico should be new Ceph icon.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:17 PM Bug #2798 (Resolved): handle_osd_ping assert
Samuel Just
10:40 AM Bug #2798: handle_osd_ping assert
Also,
ubuntu@teuthology: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13039
ubuntu@teuthology...
Tamilarasi muthamizhan
10:38 AM Bug #2798: handle_osd_ping assert
Recent logs: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13020
ubuntu@teuthology:/a/teuthol...
Tamilarasi muthamizhan
01:46 PM rbd Bug #2478 (Can't reproduce): krbd: unmap on 3.4.0: scheduling while atomic...
Sage Weil
01:32 PM Bug #2022: osd: misdirectect request
Writing down things discovered due to conversation on irc:
The log message happens with v0.48argonaut, it seems 41...
Anonymous
12:55 PM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
Sage Weil
10:17 AM CephFS Bug #2187: pjd chown/00.t failed test 97
... Sage Weil
04:28 AM Revision 3a377c44 (ceph): doc: Overhauled Swift API documentation.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:05 AM Revision 5c5ca4b8 (ceph): move cfuse + dbench from marginal to regression
Fixed #1737, yay! Sage Weil
02:20 AM Revision aecf0031 (ceph): Merge branch 'next'
Sage Weil
02:19 AM Revision d78235be (ceph): client: fix readdir locking
Several of the readdir-related methods were not taking client_lock.
Fixes: #1737
Backport: argonaut
Signed-off-by: S...
Sage Weil
02:18 AM Revision 82a575c9 (ceph): client: fix leak of client_lock when not initialized
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

07/17/2012

11:58 PM Revision 58e81c82 (ceph): OSD: handle_osd_ping: use service->get_osdmap()
This way, we avoid grabbing the map_lock. Furthermore,
get curmap at the beginning of the method to ensure that
we s...
Samuel Just
11:58 PM Revision 90ddc5ae (ceph): OSD: use service.get_osdmap() in heartbeat(), don't grab map_lock
service.get_osdmap() gives us sufficiently consist
access to the map state.
Signed-off-by: Samuel Just <sam.just@ink...
Samuel Just
11:51 PM Revision 32892c1e (ceph): doc/dev/osd_internals: add newlines before numbered lists
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:02 PM Revision fe4c658b (ceph): librados: simplify locking slightly
No reason to hold mylock_all here.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:42 PM Bug #2067 (Resolved): librados: we leak CephContext from rados_create()
Sage Weil
09:42 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
Sage Weil
07:40 PM Revision 199397dc (ceph): osd: default 'osd_preserve_trimmed_log = false'
This option makes the osd skip zeroing old trimmed regions of the log. The
data is never read, since the xattrs indi...
Sage Weil
07:21 PM CephFS Bug #1737 (Resolved): ceph-fuse crash in xlist::remove
commit:d78235be1b5c20440f3a55abb3f4b980aa3f3596 Sage Weil
07:11 PM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
Sage Weil
08:57 AM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
this is 100% reproducible on my notebook, but not on my other boxes. debugging it there. :(
Sage Weil
07:11 PM Bug #2796: osd: watch state not reestablished when registration op resent
Sage Weil
12:45 PM Bug #2796 (Fix Under Review): osd: watch state not reestablished when registration op resent
Sage Weil
09:04 AM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
if the client doesn't get the watch ack and resends, the osd will ignore it as a dup op, and the watch session state ... Sage Weil
05:00 PM Revision 12dc0ad1 (ceph): ceph: archive mon data to a .tgz
Saves bandwidth, time, and space. Sage Weil
04:54 PM Revision 24df8b1d (ceph): doc/dev: add osd_internals to toc
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:31 PM Revision 5a27f071 (ceph): doc/internals/osd_internals: fix indentation errors
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:23 PM Bug #2798 (Resolved): handle_osd_ping assert
ceph version 0.48argonaut-404-gabe05a3 (commit:abe05a3fbbb120d8d354623258d9104584db66f7)
1: (OSDMap::get_cluster_in...
Samuel Just
03:36 PM Revision 36d0a355 (ceph): log: simplify log logic a bit
Whether an entry is eligible to log/dump is independent of the channel it
is sent to. Some channels impose additiona...
Sage Weil
03:36 PM Revision 6490c84f (ceph): doc: discuss choice of pg_num
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:36 PM devops Feature #2782: chef: easy scripts for running in vercoi vms
Anonymous
02:20 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Done as of barclamp-ceph.git commit 36842bd3e3db9fea65e2921bb7b57e9088964621. Anonymous
02:03 PM rgw Feature #2797 (Resolved): rgw: support multi-objects delete
Support a multi-objects delete S3 api call. Yehuda Sadeh
10:39 AM Subtask #2758 (Fix Under Review): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyVal...
From Task #2756:
> * We changed KeyValueDB:
> > * Added a Whole-space iterator interface
> > * Implemented the p...
Joao Eduardo Luis
10:32 AM Subtask #2756 (Fix Under Review): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
10:32 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Just a couple of notes on this regard:
* We changed KeyValueDB:
> * Added a Whole-space iterator interface
> * I...
Joao Eduardo Luis
08:56 AM RADOS Subtask #2793 (Resolved): osd: require tunable feature if current osdmap uses non-default tunables
Sage Weil
08:56 AM RADOS Subtask #2792 (Resolved): mon: require tunable feature bit if current osdmap uses non-default tun...
Sage Weil
08:52 AM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
08:48 AM Tasks #2789: pkgconfig files for libraries
There is a librados-config in librados-dev. Does that mean we just need a librbd-config too? Sage Weil
06:12 AM Tasks #2789 (New): pkgconfig files for libraries
From the libvirt mailinglist I got the message that they'd like to see pkg-config for at least librbd and librados.
...
Wido den Hollander
08:29 AM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
... Sage Weil
12:36 AM Revision abe05a3f (ceph): Merge branch 'next'
Josh Durgin
12:35 AM Revision 2869039b (ceph): Robustify ceph-rbdnamer and adapt udev rules
Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.
On ou...
Pascal de Bruijn | Unilogic Networks B.V
12:34 AM Revision 96587f39 (ceph): Robustify ceph-rbdnamer and adapt udev rules
Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.
On ou...
Pascal de Bruijn | Unilogic Networks B.V

07/16/2012

11:30 PM Revision b0465496 (ceph): doc/radosgw/config.rst: mended small typo
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
11:15 PM Revision 2a8c4db7 (ceph): Merge branch 'wip-mon-mkfs'
Reviewed-by: Tommi Virtanen <tv@inktank.com> Sage Weil
11:14 PM Revision 54be9d09 (ceph): mon: stop doing rm -rf on mon mkfs
Simply verify that the directory exists, or if it doesn't, create it.
Do nothing about its content.
Signed-off-by: S...
Sage Weil
11:14 PM Revision 6b1835a9 (ceph): vstart.sh: blow away mon directory on creation/start
Now that ceph-mon doesn't blow away the mon data content, we need to.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:14 PM Revision 4e66a3b9 (ceph): mkcephfs: error out if mon data directory is not empty
The ceph-mon --mkfs function no longer wipes out the directory; it is in
fact mostly a no-op that just verifies the d...
Sage Weil
11:14 PM Revision 4eec4fc5 (ceph): mkcephfs: nicer empty directory check
From TV.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:14 PM Revision 426384f6 (ceph): log: apply log_level to stderr/syslog logic
In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below th...
Sage Weil
11:13 PM Revision f9c1a6fb (ceph): Merge branch 'next'
Sage Weil
11:02 PM Revision 52f96b9f (ceph): log: apply log_level to stderr/syslog logic
In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below th...
Sage Weil
10:53 PM Revision de524abd (ceph): log: dump logging levels in crash dump
So you know what you are/are not seeing.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:53 PM Revision d3c76f75 (ceph): Merge branch 'next'
Sage Weil
10:45 PM rbd Feature #2788 (Resolved): rbd: add -l to rbd ls, show snapshots
Add -l to rbd ls to show snapshots, perhaps size of parent Dan Mick
10:43 PM Revision 3821f6c4 (ceph): PG: grab reference to pg in C_OSD_AppliedRecoveredObject
Otherwise, accessing the pg via _applied_recovered_object
isn't safe. Using intrusive_ptr clarifies the reference
ow...
Samuel Just
10:37 PM Revision 8dafcc5c (ceph): log: fix event gather condition
We should gather an event if it is below the log or gather threshold.
Previously we were only gathering if we were g...
Sage Weil
10:36 PM Revision 64f74500 (ceph): log: fix event gather condition
We should gather an event if it is below the log or gather threshold.
Previously we were only gathering if we were g...
Sage Weil
09:58 PM rbd Feature #2717: cls_rbd: add methods for maintaining mapping from parent to child images
Dan Mick
09:58 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
Dan Mick
09:57 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
From the design doc:
In addition to knowing which parent a given image has, we want to be able to tell if a protec...
Dan Mick
09:18 PM Revision 4afa8925 (ceph): PG: use stats from primary after rewinding divergent entries
If the osd recieving the info has divergent entries, it will
also have a "divergent" stat structure.
Probably fixes ...
Samuel Just
09:18 PM Revision d4410e4a (ceph): PG::RecoveryState::Stray::react(LogEvt&): set dirty_info/log
We adjust the info and the log, so we must set dirty_info and
dirty_log to force writes.
Signed-off-by: Samuel Just ...
Samuel Just
09:18 PM Revision 5f602366 (ceph): Merge remote-tracking branch 'upstream/next'
Samuel Just
09:11 PM Revision ec5cd6de (ceph): PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub
We need to reset the last_pg_scrub data in the osd since we
are replacing the info.
Probably fixes #2453
In cases l...
Samuel Just
09:07 PM Revision c7fb964c (ceph): PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub
We need to reset the last_pg_scrub data in the osd since we
are replacing the info.
Probably fixes #2453
In cases l...
Samuel Just
06:34 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Currently have fsid generated via uuidgen, monitor is generated via the python script which is residing in barclamp-c... Tyler Brekke
06:33 PM Revision 248cfadd (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill....
Samuel Just
06:28 PM rbd Feature #2561 (Fix Under Review): rbd: copyup command
Implemented at the moment as "flatten". Dan Mick
06:26 PM rbd Feature #2561 (In Progress): rbd: copyup command
Dan Mick
06:27 PM rbd Feature #2559: cls_rbd: copyup method
Dan Mick
06:26 PM rbd Subtask #2604 (In Progress): librbd layering: read path
Josh Durgin
06:26 PM rbd Subtask #2605 (In Progress): librbd layering: guard writes
Josh Durgin
06:11 PM Revision 5d82a770 (ceph): doc/dev/osd_internals: OSD overview, pg removal, map/message handling
This is a start on some osd internals documentation for new
developers.
Signed-off-by: Samuel Just <sam.just@inktank...
Samuel Just
05:59 PM Revision 470796b5 (ceph): CompatSet: users pass bit indices rather than masks
CompatSet users number the Feature objects rather than
providing masks. Thus, we should do
mask |= (1 << f.id) rath...
Samuel Just
05:59 PM Revision 90381dc9 (ceph): OSD: set superblock compat_features on boot and mkfs
Previously, we did not actually persist the osd compatibility
mask. Without persisting the current compat mask, a pr...
Samuel Just
05:59 PM Revision 12d70738 (ceph): PG: use write_info to set snap_collections in make_snap_collections
At one point, snap_collections were written to a pg collection
attribute. Subsequently, they were moved to the bigin...
Samuel Just
05:59 PM Revision 1b8819bb (ceph): PG: Place info in biginfo object
The purged_snaps set can grow without bound as snaps are
created and removed. Because the filestore doesn't
provide ...
Samuel Just
05:58 PM Revision bcb1073f (ceph): mon/MonitorStore: always O_TRUNC when writing states
It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
Sage Weil
05:57 PM Revision 41a57077 (ceph): osd: based misdirected op role calc on acting set
We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin...
Sage Weil
05:57 PM Revision b7814dbe (ceph): osd: based misdirected op role calc on acting set
We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin...
Sage Weil
05:57 PM Revision 14d2efc4 (ceph): mon/MonitorStore: always O_TRUNC when writing states
It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
Sage Weil
05:53 PM Revision ff0f4742 (ceph): set machine description to ctx.archive when auto-locking machines for a...
Sage Weil
05:48 PM Revision e429da34 (ceph): Merge remote-tracking branch 'gh/bugfix-2022'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
05:35 PM Revision 811665a8 (ceph): move cfuse + ffsb from marginal to regression
This has had no failures. Sage Weil
04:41 PM Revision 88e2ad35 (ceph): move cfuse + fsx back into regression suite
No failures in marginal. The objectcacher fixes that came out of the
rbd_fsx stuff probably fixed the original problem?
Sage Weil
04:12 PM Revision 47b38dd0 (ceph): Merge remote-tracking branch 'gh/bugfix-2779'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
04:07 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
Sage Weil
03:04 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Samuel Just
03:04 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Probably fixed in c7fb964c077d369943bd5c066c5f99da6bd5f37c.
Backported to stable: ec5cd6def9817039704b6cc010f2797a70...
Samuel Just
11:17 AM Bug #2761: osd: failed to recover before timeout expired
Logs: ubuntu@teuthology: /a/teuthology-2012-07-14_00:00:03-regression-next-testing-basic/11248
ubuntu@teuthology:/...
Tamilarasi muthamizhan
11:01 AM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
Samuel Just
11:01 AM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
11:00 AM Bug #2787 (Resolved): RBD test copy fails when thrashing
Logs: ubuntu@teuthology:/a/teuthology-2012-07-15_00:00:04-regression-next-testing-basic/11629
Attached below an ex...
Tamilarasi muthamizhan
10:55 AM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Samuel Just
10:48 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
Sage Weil
10:47 AM Bug #2022 (Resolved): osd: misdirectect request
YAY Sage Weil
10:47 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Sage Weil
05:03 AM Revision f94c7646 (ceph): mon: remove osds from [near]full sets when their stats are removed from...
Greg points out that we could have a situation like:
- mon recovers..
- goes through osdmaps, notes an osd was rem...
Sage Weil
04:38 AM Revision fe576818 (ceph): mon/MonitorStore: always O_TRUNC when writing states
It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
Sage Weil

07/15/2012

11:31 PM Revision bf9a85ad (ceph): filestore: dump open fds when we hit EMFILE
Use a helper to dump /proc/self/fd when we hit EMFILE in the filestore.
Ideally, we should trigger this in other appr...
Sage Weil
09:40 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Sage Weil
09:40 PM Bug #2593 (Resolved): logmonitor: decode failure
aie. commit:fe57681892b0bf3155ddeaeaa79ddbf997e27af9 Sage Weil
08:43 PM Feature #2330 (Resolved): dump open files, sockets when we run out of fds
commit:bf9a85ade6f7b06ed28ff7930b9e33a41acf0e34 Sage Weil
12:39 AM Revision a278ea13 (ceph): osdmap: drop useless and unused get_pg_role() method
Users probably want get_pg_acting_rank(). If they don't, they can probably
have the mapping and can calculate the ra...
Sage Weil
12:39 AM Revision 6faeedac (ceph): osd: simplify helper usage for misdirected ops
Make the helper exclusively for the PG != NULL cases, and open-code the
one PG == NULL caller. This is simpler, and ...
Sage Weil
12:39 AM Revision 38962abd (ceph): osd: based misdirected op role calc on acting set
We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin...
Sage Weil
12:39 AM Revision ed4f80f9 (ceph): vstart: use absolute path for keyring
Stores absolute path to the generated keyring so that tests running in
other directories (e.g. src/java/test) can sim...
Noah Watkins

07/14/2012

08:51 PM Revision 042edcbe (ceph): schedule/suite: schedule job, suite N times
Sage Weil
05:39 PM Bug #2022 (Fix Under Review): osd: misdirectect request
FINALLY tracked this one down. tests consistently now pass. bugfix-2022 Sage Weil
05:29 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
Need to be able to remove objects/buckets. Bucket removal can optionally remove all objects. Another option is to set... Yehuda Sadeh

07/13/2012

11:09 PM Revision 117b2868 (ceph): OSD: add config options to fake missed pings
In order to test monitor and osd failure detection and false
positive correction, this patch adds the following optio...
Samuel Just
10:14 PM Revision ce20e020 (ceph): crushtool: allow information generated during testing to be dumped
to a set of CSV files for off-line analysis.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
caleb miles
09:16 PM Revision 8a89d40e (ceph): doc: remove last reference to ceph-cookbooks.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:08 PM Revision 20119567 (ceph): doc: cookbooks issue resolved, so changed 'ceph-cookbooks' back to 'ceph.'
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:57 PM Revision f37214e1 (ceph): kernel: fix kernel installation when kdb: is specified
This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all...
Sage Weil
08:35 PM Revision b3d077c6 (ceph): qa: download tests from specified branch
These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
08:35 PM Revision 5a5597f6 (ceph): qa: download tests from specified branch
These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:36 PM Revision f5d1a328 (ceph): schedule_suite.sh: use workunits from ceph commit
Use the workunits from the same ceph branch we are testing. Sage Weil
07:18 PM Revision 53600798 (ceph): OSD: send_still_alive when we get a reply if we reported failure
When we get a ping reply, remove the peer from the failure_queue
and send a still alive message if the peer is in the...
Samuel Just
06:30 PM Revision b0985e44 (ceph): ceph: add default btrfs mkfs options
Sage Weil
06:30 PM Revision 014fb97f (ceph): ceph: cleanup/simplify mount/mkfs options
Sage Weil
06:13 PM Revision a50bf672 (ceph): workunit: allow branch/sha1/tag to be specified
Pull the workunit(s) from the branch/tag/sha1 specified in the config. Sage Weil
06:13 PM Revision d0c9e49b (ceph): workunit: allow overrides
Pull top-level overrides into our config. This lets you do:
overrides:
workunit:
branch: foo
tasks:
...
- wor...
Sage Weil
05:19 PM Revision 5924f8e4 (ceph): PG: merge_log always use stats from authoritative replica
If the osd recieving the log has divergent entries, it will
also have a "divergent" stat structure. In general, it s...
Samuel Just
05:01 PM Revision 58126b01 (ceph): workunit: pass branch/sha1 to test
Some tests download things from the ceph repo. Let them know which
version to use through the CEPH_REF environment va...
Josh Durgin
04:58 PM Bug #2748: include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
https://github.com/athanatos/ceph/commits/wip_biginfo Samuel Just
04:57 PM Subtask #825 (Resolved): osd: remove pg map updating from handle_osd_map
Samuel Just
04:57 PM Bug #2461 (Resolved): DBObjectMap is incompatible with collection_rename
Samuel Just
04:42 PM Revision 3dd65a89 (ceph): qa: download tests from specified branch
These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
03:57 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
for the record, this is being done in this sprint for two reasons:
(a) we are under some pressure to give up the old...
Anonymous
09:39 AM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
A few possible kinks:
* The fabfile currently pulls the magic ssh keys for rsync from cephbooter. we need a new ...
Sage Weil
09:34 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
02:28 PM Revision ce7e0be1 (ceph): mon: use single helper for [near]full sets
Use a single helper to add/remove osds from the [near]full sets. This
keeps the logic in a single place, and simplif...
Sage Weil
02:28 PM Revision 30b3dd1d (ceph): mon: purge removed osds from [near]full sets
The [near]full sets are volatile state. Remove removed (or created)
osds from the set when we process a map.
Fixes:...
Sage Weil
12:40 PM devops Feature #2785 (In Progress): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:40 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:38 PM devops Feature #2662 (In Progress): crowbar: Make barclamp-ceph set mon initial members, monitor-secret,...
Anonymous
12:37 PM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Tyler says the google doc is good to go. Anonymous
12:32 PM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
11:08 AM Bug #2784 (Resolved): osd hit suicide timeout
Log: ubuntu@teuthology:/a/teuthology-2012-07-12_19:00:15-regression-master-testing-gcov/10615
ubuntu@teuthology:/a...
Tamilarasi muthamizhan
10:22 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
5924f8e4a8c29e6de326a9e8576c30109cdc0e07 Samuel Just
09:50 AM devops Bug #2783 (Won't Fix): Chef-client fails on oneiric cluster
While verifying the web docs for Chef deployment of cepf, chef-client fails with the following error on an oneiric cl... Ken Franklin
09:45 AM Bug #2779 (Fix Under Review): mon: [near]full status doesn't get purged when osds are removed
tag! Sage Weil
07:12 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Date: Fri, 13 Jul 2012 12:17:47 +0400
From: Andrey Korolyov <andrey@xdel.ru>
To: ceph-devel@vger.kernel.org
Subjec...
Sage Weil
09:43 AM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Anonymous
09:42 AM devops Feature #2781 (Closed): crowbar: easy scripts for running in vercoi vms
Anonymous
09:33 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Works well enough for first iteration. Anonymous
09:32 AM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:12 AM rbd Bug #2777: qemu: report discard support
It works for me, I test ext4 with discard and ext4 with fstrim. Martin Mailand
01:02 AM Revision 1741cb6c (ceph): Added functionality to get mkfs and mount options for file systems
from the config file,if present. Otherwise, default options are used.
The default value for inode size is changed to...
tamil

07/12/2012

11:52 PM Revision bcfa573f (ceph): ReplicatedPG: don't mark repop done until apply completes
Consider the following sequence:
1. issue, apply repop
2. replicas and primary commit
Here, repop->waitfor_(ack|dis...
Samuel Just
11:36 PM Revision 353d9ccf (ceph): fixed typo
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> tamil
11:14 PM Revision 10ec5926 (ceph): test_librbd: fix warnings
test/test_librbd.cc: In member function ‘virtual void LibRBD_TestClone_Test::TestBody()’:
warning: test/test_librbd.c...
Sage Weil
11:05 PM Revision f6a16f74 (ceph): fix wrongly marked down whitelist
This used to have '...or wrong addr' but it doesn't any more. Sage Weil
09:06 PM Revision 5450567a (ceph): ReplicatedPG,PG: dump recovery/backfill state on pg query
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:11 PM Revision b133c490 (ceph): Merge remote-tracking branch 'gh/wip-2101'
Sage Weil
06:46 PM Revision dfe29aff (ceph): doc: reverted file and role names.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:27 PM Revision 508bf3fb (ceph): rbd: enable layering when using the new format
We'll add options for different features later.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
05:47 PM Revision f8478d4c (ceph): upstart: Make ceph-osd always set the crush location.
This used to be conditional on config having osd_crush_location set,
but with that, minimal configuration left the OS...
Tommi Virtanen
05:25 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
wip_2769 should fix it. Samuel Just
05:00 PM Revision d1054df6 (ceph): doc: perf counters
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:00 PM Revision 5ceb7c73 (ceph): doc: fix config metavariables discussion
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:55 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
bcfa573f5f615f3403ff71da0212cd1cee7e7d9c
The core provided pretty good support for this sequence of events, markin...
Samuel Just
04:42 PM Revision e855cb24 (ceph): rgw: don't override subuser perm mask if perm not specified
Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.
Sig...
Yehuda Sadeh
04:41 PM Revision f33c0bee (ceph): rgw: don't override subuser perm mask if perm not specified
Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.
Sig...
Yehuda Sadeh
04:40 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Sage Weil
04:00 PM Revision 09c60b43 (ceph): doc: added :: to code example.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:55 PM Revision ad8beeb4 (ceph): doc: minor edits.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:47 PM Revision 63a17998 (ceph): doc: cookbook name change broke some things in doc. Fixed.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:33 PM Revision 51148b81 (ceph): radosgw-admin: use --bucket instead of old --bucket-id
The --bucket-id support was removed. Sage Weil
02:31 PM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
*next is enough to figure out whether we ended on a boundary. Samuel Just
02:07 PM Feature #2680 (Resolved): osd: report backfill progress via query
5450567a6769b64ace385140ffb645d9d805e354 Samuel Just
01:58 PM Revision 65c43e34 (ceph): debian: fix ceph-fs-common-dbg depends
Signed-off-by: James Page <james.page@ubuntu.com> James Page
01:58 PM Revision d6c766ea (ceph): debian: fix ceph-fs-common-dbg depends
Signed-off-by: James Page <james.page@ubuntu.com> James Page
12:22 PM rbd Bug #2777: qemu: report discard support
It looks like this is handled by what qemu calls a 'device property', so you have to use a command line like:
<pre...
Josh Durgin
08:14 AM rbd Bug #2777: qemu: report discard support
Yeah, QUEUE_FLAG_DISCARD is as far as I looked yesterday too. The question now is how to set that within qemu. Josh Durgin
11:23 AM CephFS Bug #2778 (Can't reproduce): uclient: failed asserts if mounting with a non-existent directory
From Noah.
If Ceph is mounted with a non-existent directory, the mount call returns a success, but on unmount does t...
Greg Farnum
11:23 AM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-12_05:00:04-regression-stable-master-basic/9637
ubuntu@teutho...
Tamilarasi muthamizhan
09:58 AM RADOS Feature #2101 (Resolved): crushtool: check for weight overflow on reweight
Sage Weil
03:06 AM Revision cc8df29e (ceph): rados tool: bulk objects removal
Issue #2776. Allow the removal of multiple objects in a single
rados tool command:
# rados -p pool rm obj1 [obj2 [...
Yehuda Sadeh
02:59 AM Revision 762a5b63 (ceph): Merge remote-tracking branch 'gh/wip-cct'
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
01:56 AM Revision f20b6022 (ceph): Merge branch 'next'
Conflicts:
src/rados.cc
Sage Weil
01:54 AM Revision 99a048d8 (ceph): rados: more usage cleanup
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:53 AM Revision 0081c8e4 (ceph): rados: usage message
Bad linebreaks, wrapping, stringification, missing doc for bench args
Signed-off-by: Dan Mick <dan.mick@inktank....
Dan Mick
12:35 AM Revision 0782db36 (ceph): doc: changed role file names as part of update to roles.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:35 AM Revision e5997f4e (ceph): doc: added DHO config.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:11 AM Revision 173d592a (ceph): rados tool: remove -t param option for target pool
Bug #2772. This fixes an issue that was introduced when we
added the 'rados cp' command. The -t param was already use...
Yehuda Sadeh
 

Also available in: Atom