Activity

From 06/22/2012 to 07/21/2012

07/21/2012

04:15 PM Revision 2d7e2cbf (ceph): crush: fix name map encoding
We screwed up and encoded using the name 'int' type instead of int32_t.
That means people have systems encoding this ...
Sage Weil
03:50 PM Revision b497bdac (ceph): osd/OpTracker: fix use-after-free
And formatting.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
02:12 PM CephFS Bug #2187: pjd chown/00.t failed test 97
Latest logs: ubuntu@teuthology:/a/teuthology-2012-07-21_05:00:03-regression-stable-master-basic/15237
ubuntu@teuth...
Tamilarasi muthamizhan
09:17 AM Bug #2638 (Fix Under Review): mon: make pool ops idempotent
Sage Weil
09:17 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
commit:b497bdacf5d5732285d280772ab9ce6ae88f59e8 Sage Weil
08:15 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
... Sage Weil
01:36 AM Revision 7cf1f1fb (ceph): msg/Pipe: go to STANDBY on lossless accept fault
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 2e67b7a3 (ceph): msgr: do not reopen failed lossy Connections
There was a race where:
- sending stuff to a lossy Connection
- it fails, and queues itself for reap, queues a RES...
Sage Weil
01:36 AM Revision adce6df2 (ceph): msgr: simplify submit_message()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 04fde518 (ceph): msgr: drop CLOSED checks during queueing
AFAICS these checks are pointless. There should be no harm in queueing
messages on a closed connection; they'll get ...
Sage Weil
01:36 AM Revision 60eb36ef (ceph): msgr: close get_connection() race
This could null deref if the Pipe is registered but failed.
We need to loop here because the Pipe vs Connection stuf...
Sage Weil
01:36 AM Revision ea7511b8 (ceph): msg/Pipe: fix leak of Connection in ctor
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 9348bb38 (ceph): osd: reopen heartbeat connections when they fail
If we have an active peer whose Connection fails, open a new one. This
is necessary now that a lossy client connecti...
Sage Weil
01:36 AM Revision ef3fd1c3 (ceph): msg/Pipe: go to standby on lossless server connection faults
Go directly to the STANDBY state, and print a more accurate message.
Otherwise, we do the same check in writer() and ...
Sage Weil
01:36 AM Revision 8966f71a (ceph): msg/IncomingQueue: make the pipe parent informational only
Use this pointer only for debug output prefix; do not dereference, as we
may live beyond the original parent.
Signed...
Sage Weil
01:36 AM Revision 525830cd (ceph): msgr: do not discard_queue in Pipe reaper
The IncomingQueue can live beyond the Pipe. In particular, there is no
reason not to deliver messages we've received...
Sage Weil
01:36 AM Revision 7b398a5d (ceph): msgr: indicate whether clients are lossy
We need to know whether the client is lossy before we connect to the peer
in order to know whether to deliver a RESET...
Sage Weil
01:36 AM Revision 9d94ed1c (ceph): test_stress_watch: verify that the watch operation isn't slow
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:36 AM Revision 472d14f7 (ceph): msg/DispatchQueue: fix locking in dispatch thread
The locking was awkward with locally delivered messages.. we dropped dq
lock, inq lock, re-took dq lock, etc. We wo...
Sage Weil
01:36 AM Revision cef85105 (ceph): msg/Connection: add failed flag for lossy Connections
If a lossy Connection fails and we disconnect the Pipe, set a failed flag.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:36 AM Revision 541694f7 (ceph): msg/Pipe: disconnect Pipe from lossy Connection immediately on failure
When we have a lossy connection failure, immediately disconnect the Pipe
and set the Connection failed flag. There i...
Sage Weil
01:36 AM Revision 9a4e7027 (ceph): msg/Pipe: unregister pipe immediately on fault; fix mark_down
This fixes a problem where:
- pipe faults, con->pipe is cleared
- ms_handle_reset tries to mark_down, but it doesn...
Sage Weil
01:36 AM Revision 35b7bca3 (ceph): msg: kill tcp.{cc,h}
Move the remaining comparator into msg_types.h and kill this off.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:36 AM Revision 5a62dfef (ceph): msg/DispatchQueue: hold lock in IncomingQueue::discard_queue()
This prevents races with the dispatch thread, among other things.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:36 AM Revision 999c506d (ceph): msg/DispatchQueue: give IncomingQueue ref to queue
We want to be able to queue an event (e.g., RESET) and deliver it even
after the Pipe is destroyed.
Signed-off-by: S...
Sage Weil
01:22 AM Revision 5ecc5bce (ceph): msg/DispatchQueue: cleanup debug prefix
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:22 AM Revision 89b07f47 (ceph): msg/Pipe: move tcp_* functions into Pipe class
This lets us print nice debug prefixes. It also calls BS on the
Pipe vs tcp.cc separation.
Signed-off-by: Sage Weil...
Sage Weil
01:22 AM Revision d034e46d (ceph): msgr: move Accepter into separate .cc
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:22 AM Revision 3e98617c (ceph): msg/Pipe: get_state_name()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:22 AM Revision f78a4010 (ceph): msgr: rework accept() connect_seq/race handling
We change a couple of key things here:
* If there is a matching connect_seq and the existing connection is in OPEN ...
Sage Weil
12:36 AM Revision ec4ce8bf (ceph): regression: do some tests on ext4
Sage Weil
12:20 AM Revision a6735ab0 (ceph): OpRequest,OSD: track recent slow ops
This should be helpful while investigating slow performance.
OpRequests now track events with timestamp in addition
...
Samuel Just

07/20/2012

11:09 PM Bug #2820: osd: crash in handle_osd_ping
also ubuntu@teuthology:/a/sage-2012-07-20_18:17:10-regression-wip-msgr-cleanup-testing-basic/14885
- chef: null
-...
Sage Weil
11:08 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
... Sage Weil
10:45 PM RADOS Subtask #2793 (Fix Under Review): osd: require tunable feature if current osdmap uses non-default...
Sage Weil
09:32 PM Revision d624f343 (ceph): Merge branch 'next'
Samuel Just
08:59 PM Revision 9e207aa8 (ceph): test/store_test.cc: verify collection_list_partial results are sorted
Synthetic test now also varies snapshots and uses a small variety of
hashes.
Signed-off-by: Samuel Just <sam.just@in...
Samuel Just
08:41 PM Revision 49877cde (ceph): cls_lock: cls_lock_id_t -> cls_lock_locker_id_t
Renamed type to make more sense.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:28 PM Revision 315bbea5 (ceph): cls_lock: document lock properties
Added some comments about different lock properties.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:16 PM Revision 056d42cf (ceph): cls_log: update a comment
Was missing output param description.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:14 PM Revision da770146 (ceph): move cfuse+dbench back to regression for verify, too
Sage Weil
08:11 PM Revision 2c7d7821 (ceph): rados: lock info keeps expiration, not duration
We pass duration in the request, but internally we keep
the expiration.
Signed-off-by: Yehuda Sadeh <yehuda@inktank....
Yehuda Sadeh
08:00 PM Revision d16844c8 (ceph): rados tool: add advisory lock control commands
Can now lock, break lock, list locks and show lock
info.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:59 PM Revision 2f8de894 (ceph): cls_lock: objclass for advisory locking
Providing an objclass to create and manipulate advisory
locking. Also providing a client api to control it. A lock
ma...
Yehuda Sadeh
07:55 PM Revision 9c5c3edf (ceph): objclass: add api calls to get/set xattrs
added the following functions:
cls_cxx_getxattr
cls_cxx_getxattrs
cls_cxx_setxattr
Signed-off-by: Yehuda Sadeh...
Yehuda Sadeh
07:29 PM Revision adc9b91f (ceph): os/HashIndex: use set<pair<string, hobject_t>> rather than multimap
Multimap does not make any guarantees about ordering of different
values with the same key. list_by_hash, however, a...
Samuel Just
05:22 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Samuel Just
04:30 PM Bug #2819 (Won't Fix): krbd: lockup on large writes, msgr fault injection
krbd + osd socket failure injection + iozone -> crash in uml, without any useful debugging. uml itself either locks ... Sage Weil
04:07 PM rbd Feature #2559 (Fix Under Review): cls_rbd: copyup method
Sage Weil
04:07 PM rbd Feature #2607 (Fix Under Review): librbd: copyup helper
Sage Weil
03:45 PM Bug #2638 (In Progress): mon: make pool ops idempotent
Sage Weil
03:40 PM Documentation #2729 (Resolved): doc: OSD internals
Sage Weil
03:25 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
Responding to Sage's possible kinks:
1. the cephbooter issue only comes up with new installs, existing vms with th...
Anonymous
03:19 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
I moved http://gitbuilder-precise-i386.front.sepia.ceph.com/ and wrote a doc on how to move whole images: https://doc... Anonymous
02:37 PM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
09:57 AM RADOS Subtask #2791 (In Progress): crush: feature bit for crush tunables
Sage Weil
02:20 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
Sage Weil
09:59 AM Messengers Bug #2802 (In Progress): msgr: mds session hangs on direct_io test
just a misleading message; cleaned up. the direct_io test (which leaves the mds connection mostly idle) still fails ... Sage Weil
01:50 PM Feature #2812 (Resolved): automated CentOS testing
(1) testing on anything other than Precise means we need the ability to re-image test machines or to run teuthology o... Anonymous
01:46 PM devops Feature #2808: crowbar: upgrade to fred3 (get bind bug fix)
Test that the bind problem has been fixed; nodes should be able to resolve other node names to IP addresses without ... Anonymous
01:45 PM devops Feature #2808 (Rejected): crowbar: upgrade to fred3 (get bind bug fix)
Anonymous
01:41 PM Feature #2807 (Resolved): RPM git-builder
We have customers who want to run Ceph on RHEL.
A lot of people run RHEL (or CentOS) and we should be building
(an...
Anonymous
01:26 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
The doc is fine as it is; it seems the new upstream makes this a lot nicer. Anonymous
10:46 AM devops Feature #2785: crowbar: custom ISO builder that includes barclamp-ceph
Crowbar cleaned up the build process so I was able to go through and build an ISO with our barclamp rather easily. Th... JuanJose Galvez
01:19 PM devops Feature #2574 (In Progress): crowbar: use data disks automatically, journal inside data directory
Anonymous
11:17 AM rbd Bug #2535 (Resolved): rbd: random data corruption in vm
Thanks for reminding us Florian. There are no open questions that I know of, the only problem with this was fiemap, s... Josh Durgin
11:11 AM rbd Bug #2535: rbd: random data corruption in vm
Just so this doesn't fall off the table, can this be considered resolved with the changed @filestore fiemap@ default? ... Florian Haas
10:06 AM Bug #2761: osd: failed to recover before timeout expired
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-20_05:00:03-regression-stable-master-basic/14803
from teuthol...
Tamilarasi muthamizhan
10:05 AM rbd Bug #2715: krbd: spinlock wrong CPU
hoping this was caused by the mutex-less con_open, or something similar. will keep this open for a few more days to ... Sage Weil
09:29 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Trimming through Paxos on the services is the way to go, no doubt. However, it's a whole world of pain when it comes ... Joao Eduardo Luis
08:33 AM Bug #2799: osd: pg log trimming zeroing broken
again today: ubuntu@teuthology:/a/teuthology-2012-07-19_19:00:08-regression-master-testing-gcov/14585 Sage Weil
07:13 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
We have added support for a whole-space iterator and a snapshot iterator (task #2758) on KeyValueDB and LevelDB, and ... Joao Eduardo Luis
05:14 AM Revision 0b84384f (ceph): mon: shut up about sessionless MPGStats messages
If the mon gets a reset on the client connection, it clears the session
on the connection. This is perfectly normal ...
Sage Weil
04:27 AM Revision 6580450f (ceph): osd: clean up boot method names
Prefix subsequent steps with _. Better names.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Yehuda Sadeh...
Sage Weil
04:27 AM Revision 369fbf61 (ceph): osd: defer boot if heartbeatmap indicates we are unhealthy
If the OSD is bogged down or unresponsive, we should not try to join
the cluster. This was observed on congress (slo...
Sage Weil
03:22 AM Revision d76df212 (ceph): Merge branch 'next'
Conflicts:
src/include/ceph_features.h
Sage Weil
12:13 AM Revision 7f58b9be (ceph): mon: track pg creations by osd
Track the pending pg creations by osd, and use a helper to send out those
messages.
Signed-off-by: Sage Weil <sage@in...
Sage Weil
12:13 AM Revision dec93692 (ceph): osd/mon: subscribe (onetime) to pg creations on connect
Ask the monitor for pending pg creations each time we connect.
Normally, this is a freebie check. If there are pend...
Sage Weil

07/19/2012

11:48 PM Revision 42de6873 (ceph): rbd: fix usage for snap commands
Snap commands take '--snap <snapname> <imagename>'.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:45 PM Revision 4c6c927b (ceph): Revert "rbd: fix usage for snap commands"
This reverts commit 42de6873f9ca33fc20e70176d9a422635a6f0152.
Actually, these are fine! Dan made them all kinds of ...
Sage Weil
10:51 PM Revision e1c98e7d (ceph): tasks: add multibench task for testing pool creation
Also adds support for specifying a pool for radosbench
to create and then cleanup instead of "data".
Signed-off-by: ...
Samuel Just
10:06 PM rgw Feature #2804 (Rejected): rgw: disallow running multiple gateways on the same fastcgi socket
We saw an incident where multiple gateways were running on the same machine, both pointing to the same apache <-> rad... Yehuda Sadeh
09:35 PM rbd Feature #2788 (Fix Under Review): rbd: add -l to rbd ls, show snapshots
wip-rbd-ls-R Dan Mick
03:55 PM rbd Feature #2788 (In Progress): rbd: add -l to rbd ls, show snapshots
Dan Mick
09:13 PM Bug #2502 (Fix Under Review): osd: do not mark self back up if internal heartbeats are failing
wip-osd-hb Sage Weil
09:02 PM Bug #2638: mon: make pool ops idempotent
this needs to be fixed in order to get the msgr failure injection stuff in the qa suite Sage Weil
09:01 PM Bug #2771: mon|osd: fail to create pgs on startup
commit:ca9f71300471e1947be63e5dfad1a4800c1ffc9c and a few commits that follow Sage Weil
09:00 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
Sage Weil
06:29 PM Revision 58cd27fd (ceph): doc: add missing dependencies to README
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
05:52 PM Bug #2803 (Resolved): filer: probe crash
... Sage Weil
05:50 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
... Sage Weil
05:21 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
still working on part that dumps slowest ops over past N seconds Samuel Just
04:49 PM Linux kernel client Bug #2801 (Resolved): msgr crash in ceph_msg_new
... Sage Weil
04:15 PM Linux kernel client Feature #1699 (Resolved): debug symbols in autobuilt (sepia) kernels
the symbols are there. the trick is for objdump to find them in the original path. do this:
sudo mkdir -p /srv/a...
Sage Weil
04:13 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
... Sage Weil
03:18 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-07-19_08:03:32-marginal-master-testing-basic/14125 Sage Weil
03:55 PM rbd Subtask #2606 (Fix Under Review): librbd layering: copyup on missing child object
Dan Mick
03:54 PM rbd Subtask #2606 (In Progress): librbd layering: copyup on missing child object
Dan Mick
02:45 PM Bug #2800: osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
Might be a result of reordered or dropped messages. Samuel Just
02:31 PM Bug #2800 (Resolved): osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first ...
ceph-sha1: 508bf3fb96929dc6e91ee83a915a7f30df23eeeb
description: collection:thrash clusters:6-osd-3-machine.yaml fs:...
Samuel Just
02:26 PM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
... Sage Weil
12:31 PM devops Feature #2780 (In Progress): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
12:30 PM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Good enough for now. Anonymous
05:09 AM Tasks #2789: pkgconfig files for libraries
No, I'm talking about: http://en.wikipedia.org/wiki/Pkg-config
I added a patch which adds these files, but I'm not...
Wido den Hollander
02:50 AM Revision c9967120 (ceph): add CRUSH_TUNABLES feature bit
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:49 AM Revision 6f381aff (ceph): add CRUSH_TUNABLES feature bit
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil

07/18/2012

10:37 PM Revision e3349a2a (ceph): OSD::handle_osd_map: don't lock pgs while advancing maps
We no longer do anything with the pgs here. PG map
advancing is now handled in OSD::advance_pg asynchronously.
Signe...
Samuel Just
09:26 PM Revision ca9f7130 (ceph): OSD: actually send queries during handle_pg_create
During the osd threading refactor, we lost the do_queries
call in favor of dispatch_context. However, this did not
i...
Samuel Just
09:26 PM Revision 8f5562ff (ceph): OSD: write_if_dirty during get_or_create_pg after handle_create
In the case that the pg is newly created, we will activate during
that call, so the info and log will be dirty.
Sign...
Samuel Just
09:26 PM Revision c8ee3016 (ceph): osd: add osd_debug_drop_pg_create_{probability,duration} options
This will let us exercise more of the pg creation code.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:44 PM Revision c49daeca (ceph): clock: print skew with ntp servers to log to help debug time issues
Sage Weil
07:59 PM Revision 0b579546 (ceph): ObjectCacher: fix cache_bytes_hit accounting
Misses are not hits!
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:58 PM Revision 0d0b4689 (ceph): Merge branch 'next'
Josh Durgin
07:55 PM Revision 5dd68b95 (ceph): objecter: always resend linger registrations
If a linger op (watch) is sent to the OSD and updates the object, and then
the client loses the reply, it will resend...
Sage Weil
06:04 PM Revision 55847fc2 (ceph): nuke: log what pid we are killing when we kill it
Sage Weil
05:52 PM Revision 7586cde9 (ceph): qa/workunits/suites/pjd.sh: bash -x
This will let us see what test is failing, exactly, and what its inputs
were. Hoping to help find #2187.
Signed-off...
Sage Weil
05:44 PM Revision 76efd977 (ceph): OSD: publish_map in init to initialize OSDService map
Other areas rely on OSDService::get_map() to function, possibly before
activate_map is first called. In particular, ...
Samuel Just
05:25 PM Revision 675d6302 (ceph): ObjectCacher: fix cache_bytes_hit accounting
Misses are not hits!
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
02:35 PM Revision 4e1d973e (ceph): doc: Fixed heading text.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:35 PM Revision ebc57736 (ceph): doc: favicon.ico should be new Ceph icon.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:17 PM Bug #2798 (Resolved): handle_osd_ping assert
Samuel Just
10:40 AM Bug #2798: handle_osd_ping assert
Also,
ubuntu@teuthology: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13039
ubuntu@teuthology...
Tamilarasi muthamizhan
10:38 AM Bug #2798: handle_osd_ping assert
Recent logs: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13020
ubuntu@teuthology:/a/teuthol...
Tamilarasi muthamizhan
01:46 PM rbd Bug #2478 (Can't reproduce): krbd: unmap on 3.4.0: scheduling while atomic...
Sage Weil
01:32 PM Bug #2022: osd: misdirected request
Writing down things discovered due to conversation on irc:
The log message happens with v0.48argonaut, it seems 41...
Anonymous
12:55 PM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
Sage Weil
10:17 AM CephFS Bug #2187: pjd chown/00.t failed test 97
... Sage Weil
04:28 AM Revision 3a377c44 (ceph): doc: Overhauled Swift API documentation.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:05 AM Revision 5c5ca4b8 (ceph): move cfuse + dbench from marginal to regression
Fixed #1737, yay! Sage Weil
02:20 AM Revision aecf0031 (ceph): Merge branch 'next'
Sage Weil
02:19 AM Revision d78235be (ceph): client: fix readdir locking
Several of the readdir-related methods were not taking client_lock.
Fixes: #1737
Backport: argonaut
Signed-off-by: S...
Sage Weil
02:18 AM Revision 82a575c9 (ceph): client: fix leak of client_lock when not initialized
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

07/17/2012

11:58 PM Revision 58e81c82 (ceph): OSD: handle_osd_ping: use service->get_osdmap()
This way, we avoid grabbing the map_lock. Furthermore,
get curmap at the beginning of the method to ensure that
we s...
Samuel Just
11:58 PM Revision 90ddc5ae (ceph): OSD: use service.get_osdmap() in heartbeat(), don't grab map_lock
service.get_osdmap() gives us sufficiently consistent
access to the map state.
Signed-off-by: Samuel Just <sam.just@ink...
Samuel Just
11:51 PM Revision 32892c1e (ceph): doc/dev/osd_internals: add newlines before numbered lists
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:02 PM Revision fe4c658b (ceph): librados: simplify locking slightly
No reason to hold mylock_all here.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:42 PM Bug #2067 (Resolved): librados: we leak CephContext from rados_create()
Sage Weil
09:42 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
Sage Weil
07:40 PM Revision 199397dc (ceph): osd: default 'osd_preserve_trimmed_log = false'
This option makes the osd skip zeroing old trimmed regions of the log. The
data is never read, since the xattrs indi...
Sage Weil
07:21 PM CephFS Bug #1737 (Resolved): ceph-fuse crash in xlist::remove
commit:d78235be1b5c20440f3a55abb3f4b980aa3f3596 Sage Weil
07:11 PM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
Sage Weil
08:57 AM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
this is 100% reproducible on my notebook, but not on my other boxes. debugging it there. :(
Sage Weil
07:11 PM Bug #2796: osd: watch state not reestablished when registration op resent
Sage Weil
12:45 PM Bug #2796 (Fix Under Review): osd: watch state not reestablished when registration op resent
Sage Weil
09:04 AM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
if the client doesn't get the watch ack and resends, the osd will ignore it as a dup op, and the watch session state ... Sage Weil
05:00 PM Revision 12dc0ad1 (ceph): ceph: archive mon data to a .tgz
Saves bandwidth, time, and space. Sage Weil
04:54 PM Revision 24df8b1d (ceph): doc/dev: add osd_internals to toc
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:31 PM Revision 5a27f071 (ceph): doc/internals/osd_internals: fix indentation errors
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:23 PM Bug #2798 (Resolved): handle_osd_ping assert
ceph version 0.48argonaut-404-gabe05a3 (commit:abe05a3fbbb120d8d354623258d9104584db66f7)
1: (OSDMap::get_cluster_in...
Samuel Just
03:36 PM Revision 36d0a355 (ceph): log: simplify log logic a bit
Whether an entry is eligible to log/dump is independent of the channel it
is sent to. Some channels impose additiona...
Sage Weil
03:36 PM Revision 6490c84f (ceph): doc: discuss choice of pg_num
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:36 PM devops Feature #2782: chef: easy scripts for running in vercoi vms
Anonymous
02:20 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Done as of barclamp-ceph.git commit 36842bd3e3db9fea65e2921bb7b57e9088964621. Anonymous
02:03 PM rgw Feature #2797 (Resolved): rgw: support multi-objects delete
Support a multi-objects delete S3 api call. Yehuda Sadeh
10:39 AM Subtask #2758 (Fix Under Review): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyVal...
From Task #2756:
> * We changed KeyValueDB:
> > * Added a Whole-space iterator interface
> > * Implemented the p...
Joao Eduardo Luis
10:32 AM Subtask #2756 (Fix Under Review): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
10:32 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Just a couple of notes in this regard:
* We changed KeyValueDB:
> * Added a Whole-space iterator interface
> * I...
Joao Eduardo Luis
08:56 AM RADOS Subtask #2793 (Resolved): osd: require tunable feature if current osdmap uses non-default tunables
Sage Weil
08:56 AM RADOS Subtask #2792 (Resolved): mon: require tunable feature bit if current osdmap uses non-default tun...
Sage Weil
08:52 AM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
Sage Weil
08:48 AM Tasks #2789: pkgconfig files for libraries
There is a librados-config in librados-dev. Does that mean we just need a librbd-config too? Sage Weil
06:12 AM Tasks #2789 (New): pkgconfig files for libraries
From the libvirt mailing list I got the message that they'd like to see pkg-config for at least librbd and librados.
...
Wido den Hollander
08:29 AM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
... Sage Weil
12:36 AM Revision abe05a3f (ceph): Merge branch 'next'
Josh Durgin
12:35 AM Revision 2869039b (ceph): Robustify ceph-rbdnamer and adapt udev rules
Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.
On ou...
Pascal de Bruijn | Unilogic Networks B.V
12:34 AM Revision 96587f39 (ceph): Robustify ceph-rbdnamer and adapt udev rules
Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.
On ou...
Pascal de Bruijn | Unilogic Networks B.V

07/16/2012

11:30 PM Revision b0465496 (ceph): doc/radosgw/config.rst: mended small typo
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
11:15 PM Revision 2a8c4db7 (ceph): Merge branch 'wip-mon-mkfs'
Reviewed-by: Tommi Virtanen <tv@inktank.com> Sage Weil
11:14 PM Revision 54be9d09 (ceph): mon: stop doing rm -rf on mon mkfs
Simply verify that the directory exists, or if it doesn't, create it.
Do nothing about its content.
Signed-off-by: S...
Sage Weil
11:14 PM Revision 6b1835a9 (ceph): vstart.sh: blow away mon directory on creation/start
Now that ceph-mon doesn't blow away the mon data content, we need to.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:14 PM Revision 4e66a3b9 (ceph): mkcephfs: error out if mon data directory is not empty
The ceph-mon --mkfs function no longer wipes out the directory; it is in
fact mostly a no-op that just verifies the d...
Sage Weil
11:14 PM Revision 4eec4fc5 (ceph): mkcephfs: nicer empty directory check
From TV.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:14 PM Revision 426384f6 (ceph): log: apply log_level to stderr/syslog logic
In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below th...
Sage Weil
11:13 PM Revision f9c1a6fb (ceph): Merge branch 'next'
Sage Weil
11:02 PM Revision 52f96b9f (ceph): log: apply log_level to stderr/syslog logic
In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below th...
Sage Weil
10:53 PM Revision de524abd (ceph): log: dump logging levels in crash dump
So you know what you are/are not seeing.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:53 PM Revision d3c76f75 (ceph): Merge branch 'next'
Sage Weil
10:45 PM rbd Feature #2788 (Resolved): rbd: add -l to rbd ls, show snapshots
Add -l to rbd ls to show snapshots, perhaps size of parent Dan Mick
10:43 PM Revision 3821f6c4 (ceph): PG: grab reference to pg in C_OSD_AppliedRecoveredObject
Otherwise, accessing the pg via _applied_recovered_object
isn't safe. Using intrusive_ptr clarifies the reference
ow...
Samuel Just
10:37 PM Revision 8dafcc5c (ceph): log: fix event gather condition
We should gather an event if it is below the log or gather threshold.
Previously we were only gathering if we were g...
Sage Weil
10:36 PM Revision 64f74500 (ceph): log: fix event gather condition
We should gather an event if it is below the log or gather threshold.
Previously we were only gathering if we were g...
Sage Weil
09:58 PM rbd Feature #2717: cls_rbd: add methods for maintaining mapping from parent to child images
Dan Mick
09:58 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
Dan Mick
09:57 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
From the design doc:
In addition to knowing which parent a given image has, we want to be able to tell if a protec...
Dan Mick
09:18 PM Revision 4afa8925 (ceph): PG: use stats from primary after rewinding divergent entries
If the osd receiving the info has divergent entries, it will
also have a "divergent" stat structure.
Probably fixes ...
Samuel Just
09:18 PM Revision d4410e4a (ceph): PG::RecoveryState::Stray::react(LogEvt&): set dirty_info/log
We adjust the info and the log, so we must set dirty_info and
dirty_log to force writes.
Signed-off-by: Samuel Just ...
Samuel Just
09:18 PM Revision 5f602366 (ceph): Merge remote-tracking branch 'upstream/next'
Samuel Just
09:11 PM Revision ec5cd6de (ceph): PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub
We need to reset the last_pg_scrub data in the osd since we
are replacing the info.
Probably fixes #2453
In cases l...
Samuel Just
09:07 PM Revision c7fb964c (ceph): PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub
We need to reset the last_pg_scrub data in the osd since we
are replacing the info.
Probably fixes #2453
In cases l...
Samuel Just
06:34 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Currently have fsid generated via uuidgen, monitor is generated via the python script which is residing in barclamp-c... Tyler Brekke
06:33 PM Revision 248cfadd (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill....
Samuel Just
06:28 PM rbd Feature #2561 (Fix Under Review): rbd: copyup command
Implemented at the moment as "flatten". Dan Mick
06:26 PM rbd Feature #2561 (In Progress): rbd: copyup command
Dan Mick
06:27 PM rbd Feature #2559: cls_rbd: copyup method
Dan Mick
06:26 PM rbd Subtask #2604 (In Progress): librbd layering: read path
Josh Durgin
06:26 PM rbd Subtask #2605 (In Progress): librbd layering: guard writes
Josh Durgin
06:11 PM Revision 5d82a770 (ceph): doc/dev/osd_internals: OSD overview, pg removal, map/message handling
This is a start on some osd internals documentation for new
developers.
Signed-off-by: Samuel Just <sam.just@inktank...
Samuel Just
05:59 PM Revision 470796b5 (ceph): CompatSet: users pass bit indices rather than masks
CompatSet users number the Feature objects rather than
providing masks. Thus, we should do
mask |= (1 << f.id) rath...
Samuel Just
05:59 PM Revision 90381dc9 (ceph): OSD: set superblock compat_features on boot and mkfs
Previously, we did not actually persist the osd compatibility
mask. Without persisting the current compat mask, a pr...
Samuel Just
05:59 PM Revision 12d70738 (ceph): PG: use write_info to set snap_collections in make_snap_collections
At one point, snap_collections were written to a pg collection
attribute. Subsequently, they were moved to the bigin...
Samuel Just
05:59 PM Revision 1b8819bb (ceph): PG: Place info in biginfo object
The purged_snaps set can grow without bound as snaps are
created and removed. Because the filestore doesn't
provide ...
Samuel Just
05:58 PM Revision bcb1073f (ceph): mon/MonitorStore: always O_TRUNC when writing states
It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
Sage Weil
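A minimal sketch of why a leftover, larger `.new` file is a problem when it is reopened without `O_TRUNC`. This is plain Python on a temporary file, not the MonitorStore code; the file name `state.new` and the helper names are hypothetical.

```python
import os
import tempfile


def write_state(path, data, truncate):
    """Write data at offset 0, optionally truncating any existing file."""
    flags = os.O_WRONLY | os.O_CREAT
    if truncate:
        flags |= os.O_TRUNC
    fd = os.open(path, flags, 0o644)
    try:
        os.write(fd, data)
    finally:
        os.close(fd)


def read_all(path):
    with open(path, "rb") as f:
        return f.read()


def demo():
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "state.new")
        # Simulate a stale, larger file left over from an earlier attempt.
        write_state(path, b"old-and-longer-value", truncate=True)
        # Without O_TRUNC the tail of the old contents survives.
        write_state(path, b"new", truncate=False)
        stale = read_all(path)
        # With O_TRUNC only the new contents remain.
        write_state(path, b"new", truncate=True)
        clean = read_all(path)
    return stale, clean


stale, clean = demo()
print(stale, clean)
```

The shorter write only overwrites the first bytes, so the stale tail stays in the file unless it is truncated on open.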
05:57 PM Revision 41a57077 (ceph): osd: based misdirected op role calc on acting set
We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin...
Sage Weil
05:57 PM Revision b7814dbe (ceph): osd: based misdirected op role calc on acting set
We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin...
Sage Weil
05:57 PM Revision 14d2efc4 (ceph): mon/MonitorStore: always O_TRUNC when writing states
It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
Sage Weil
05:53 PM Revision ff0f4742 (ceph): set machine description to ctx.archive when auto-locking machines for a...
Sage Weil
05:48 PM Revision e429da34 (ceph): Merge remote-tracking branch 'gh/bugfix-2022'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
05:35 PM Revision 811665a8 (ceph): move cfuse + ffsb from marginal to regression
This has had no failures. Sage Weil
04:41 PM Revision 88e2ad35 (ceph): move cfuse + fsx back into regression suite
No failures in marginal. The objectcacher fixes that came out of the
rbd_fsx stuff probably fixed the original problem?
Sage Weil
04:12 PM Revision 47b38dd0 (ceph): Merge remote-tracking branch 'gh/bugfix-2779'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
04:07 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
Sage Weil
03:04 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Samuel Just
03:04 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Probably fixed in c7fb964c077d369943bd5c066c5f99da6bd5f37c.
Backported to stable: ec5cd6def9817039704b6cc010f2797a70...
Samuel Just
11:17 AM Bug #2761: osd: failed to recover before timeout expired
Logs: ubuntu@teuthology:/a/teuthology-2012-07-14_00:00:03-regression-next-testing-basic/11248
ubuntu@teuthology:/...
Tamilarasi muthamizhan
11:01 AM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
Samuel Just
11:01 AM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
11:00 AM Bug #2787 (Resolved): RBD test copy fails when thrashing
Logs: ubuntu@teuthology:/a/teuthology-2012-07-15_00:00:04-regression-next-testing-basic/11629
Attached below an ex...
Tamilarasi muthamizhan
10:55 AM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Samuel Just
10:48 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
Sage Weil
10:47 AM Bug #2022 (Resolved): osd: misdirected request
YAY Sage Weil
10:47 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Sage Weil
05:03 AM Revision f94c7646 (ceph): mon: remove osds from [near]full sets when their stats are removed from...
Greg points out that we could have a situation like:
- mon recovers..
- goes through osdmaps, notes an osd was rem...
Sage Weil
04:38 AM Revision fe576818 (ceph): mon/MonitorStore: always O_TRUNC when writing states
It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
Sage Weil

07/15/2012

11:31 PM Revision bf9a85ad (ceph): filestore: dump open fds when we hit EMFILE
Use a helper to dump /proc/self/fd when we hit EMFILE in the filestore.
Ideally, we should trigger this in other appr...
Sage Weil
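The idea of dumping `/proc/self/fd` on `EMFILE` can be sketched as follows. This is a Linux-oriented Python illustration, not the filestore helper itself; `dump_open_fds` and `open_or_dump` are hypothetical names, and the sketch simply returns an empty result where `/proc` is unavailable.

```python
import errno
import os


def dump_open_fds(proc_dir="/proc/self/fd"):
    """Return {fd: target} for this process, or {} if proc_dir is unreadable."""
    result = {}
    try:
        names = os.listdir(proc_dir)
    except OSError:
        return result
    for name in names:
        try:
            result[int(name)] = os.readlink(os.path.join(proc_dir, name))
        except (OSError, ValueError):
            # The fd may already be closed (e.g. the one used for listing).
            continue
    return result


def open_or_dump(path, flags=os.O_RDONLY):
    """Open path; on EMFILE, print the open fds before re-raising."""
    try:
        return os.open(path, flags)
    except OSError as e:
        if e.errno == errno.EMFILE:
            for fd, target in sorted(dump_open_fds().items()):
                print(f"fd {fd} -> {target}")
        raise


fd = open_or_dump(os.devnull)
os.close(fd)
```

Printing the targets at the moment of failure shows which files or sockets are consuming the descriptor limit.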
09:40 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Sage Weil
09:40 PM Bug #2593 (Resolved): logmonitor: decode failure
aie. commit:fe57681892b0bf3155ddeaeaa79ddbf997e27af9 Sage Weil
08:43 PM Feature #2330 (Resolved): dump open files, sockets when we run out of fds
commit:bf9a85ade6f7b06ed28ff7930b9e33a41acf0e34 Sage Weil
12:39 AM Revision a278ea13 (ceph): osdmap: drop useless and unused get_pg_role() method
Users probably want get_pg_acting_rank(). If they don't, they can probably
have the mapping and can calculate the ra...
Sage Weil
12:39 AM Revision 6faeedac (ceph): osd: simplify helper usage for misdirected ops
Make the helper exclusively for the PG != NULL cases, and open-code the
one PG == NULL caller. This is simpler, and ...
Sage Weil
12:39 AM Revision 38962abd (ceph): osd: based misdirected op role calc on acting set
We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin...
Sage Weil
12:39 AM Revision ed4f80f9 (ceph): vstart: use absolute path for keyring
Stores absolute path to the generated keyring so that tests running in
other directories (e.g. src/java/test) can sim...
Noah Watkins

07/14/2012

08:51 PM Revision 042edcbe (ceph): schedule/suite: schedule job, suite N times
Sage Weil
05:39 PM Bug #2022 (Fix Under Review): osd: misdirected request
FINALLY tracked this one down. Tests now pass consistently. Branch: bugfix-2022 Sage Weil
05:29 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
Need to be able to remove objects/buckets. Bucket removal can optionally remove all objects. Another option is to set... Yehuda Sadeh

07/13/2012

11:09 PM Revision 117b2868 (ceph): OSD: add config options to fake missed pings
In order to test monitor and osd failure detection and false
positive correction, this patch adds the following optio...
Samuel Just
10:14 PM Revision ce20e020 (ceph): crushtool: allow information generated during testing to be dumped
to a set of CSV files for off-line analysis.
Signed-off-by: caleb miles <caleb.miles@inktank.com>
caleb miles
09:16 PM Revision 8a89d40e (ceph): doc: remove last reference to ceph-cookbooks.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:08 PM Revision 20119567 (ceph): doc: cookbooks issue resolved, so changed 'ceph-cookbooks' back to 'ceph.'
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:57 PM Revision f37214e1 (ceph): kernel: fix kernel installation when kdb: is specified
This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all...
Sage Weil
08:35 PM Revision b3d077c6 (ceph): qa: download tests from specified branch
These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
08:35 PM Revision 5a5597f6 (ceph): qa: download tests from specified branch
These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:36 PM Revision f5d1a328 (ceph): schedule_suite.sh: use workunits from ceph commit
Use the workunits from the same ceph branch we are testing. Sage Weil
07:18 PM Revision 53600798 (ceph): OSD: send_still_alive when we get a reply if we reported failure
When we get a ping reply, remove the peer from the failure_queue
and send a still alive message if the peer is in the...
Samuel Just
06:30 PM Revision b0985e44 (ceph): ceph: add default btrfs mkfs options
Sage Weil
06:30 PM Revision 014fb97f (ceph): ceph: cleanup/simplify mount/mkfs options
Sage Weil
06:13 PM Revision a50bf672 (ceph): workunit: allow branch/sha1/tag to be specified
Pull the workunit(s) from the branch/tag/sha1 specified in the config. Sage Weil
06:13 PM Revision d0c9e49b (ceph): workunit: allow overrides
Pull top-level overrides into our config. This lets you do:
overrides:
  workunit:
    branch: foo
tasks:
...
- wor...
Sage Weil
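One way to picture "pulling top-level overrides into our config" is a recursive dictionary merge. The sketch below is an assumption about the general technique, not the teuthology implementation; `deep_merge`, `workunit_config`, and the `clients` key in the example are hypothetical.

```python
import copy


def deep_merge(base, overrides):
    """Return a copy of base with overrides applied, recursing into dicts."""
    merged = copy.deepcopy(base)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


def workunit_config(job, task_config):
    """Apply job['overrides']['workunit'], if present, to the task config."""
    overrides = job.get("overrides", {}).get("workunit", {})
    return deep_merge(task_config, overrides)


job = {"overrides": {"workunit": {"branch": "foo"}}}
task = {"clients": {"all": ["example.sh"]}}  # hypothetical task config
print(workunit_config(job, task))
```

Merging into a copy leaves the original task config untouched, so the same job description can be reused across runs.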
05:19 PM Revision 5924f8e4 (ceph): PG: merge_log always use stats from authoritative replica
If the osd receiving the log has divergent entries, it will
also have a "divergent" stat structure. In general, it s...
Samuel Just
05:01 PM Revision 58126b01 (ceph): workunit: pass branch/sha1 to test
Some tests download things from the ceph repo. Let them know which
version to use through the CEPH_REF environment va...
Josh Durgin
04:58 PM Bug #2748: include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
https://github.com/athanatos/ceph/commits/wip_biginfo Samuel Just
04:57 PM Subtask #825 (Resolved): osd: remove pg map updating from handle_osd_map
Samuel Just
04:57 PM Bug #2461 (Resolved): DBObjectMap is incompatible with collection_rename
Samuel Just
04:42 PM Revision 3dd65a89 (ceph): qa: download tests from specified branch
These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
03:57 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
for the record, this is being done in this sprint for two reasons:
(a) we are under some pressure to give up the old...
Anonymous
09:39 AM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
A few possible kinks:
* The fabfile currently pulls the magic ssh keys for rsync from cephbooter. we need a new ...
Sage Weil
09:34 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
Anonymous
02:28 PM Revision ce7e0be1 (ceph): mon: use single helper for [near]full sets
Use a single helper to add/remove osds from the [near]full sets. This
keeps the logic in a single place, and simplif...
Sage Weil
02:28 PM Revision 30b3dd1d (ceph): mon: purge removed osds from [near]full sets
The [near]full sets are volatile state. Remove removed (or created)
osds from the set when we process a map.
Fixes:...
Sage Weil
12:40 PM devops Feature #2785 (In Progress): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:40 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
Anonymous
12:38 PM devops Feature #2662 (In Progress): crowbar: Make barclamp-ceph set mon initial members, monitor-secret,...
Anonymous
12:37 PM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Tyler says the google doc is good to go. Anonymous
12:32 PM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
11:08 AM Bug #2784 (Resolved): osd hit suicide timeout
Log: ubuntu@teuthology:/a/teuthology-2012-07-12_19:00:15-regression-master-testing-gcov/10615
ubuntu@teuthology:/a...
Tamilarasi muthamizhan
10:22 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
5924f8e4a8c29e6de326a9e8576c30109cdc0e07 Samuel Just
09:50 AM devops Bug #2783 (Won't Fix): Chef-client fails on oneiric cluster
While verifying the web docs for Chef deployment of ceph, chef-client fails with the following error on an oneiric cl... Ken Franklin
09:45 AM Bug #2779 (Fix Under Review): mon: [near]full status doesn't get purged when osds are removed
tag! Sage Weil
07:12 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
Date: Fri, 13 Jul 2012 12:17:47 +0400
From: Andrey Korolyov <andrey@xdel.ru>
To: ceph-devel@vger.kernel.org
Subjec...
Sage Weil
09:43 AM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
Anonymous
09:42 AM devops Feature #2781 (Closed): crowbar: easy scripts for running in vercoi vms
Anonymous
09:33 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Works well enough for first iteration. Anonymous
09:32 AM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:12 AM rbd Bug #2777: qemu: report discard support
It works for me, I test ext4 with discard and ext4 with fstrim. Martin Mailand
01:02 AM Revision 1741cb6c (ceph): Added functionality to get mkfs and mount options for file systems
from the config file, if present. Otherwise, default options are used.
The default value for inode size is changed to...
tamil

07/12/2012

11:52 PM Revision bcfa573f (ceph): ReplicatedPG: don't mark repop done until apply completes
Consider the following sequence:
1. issue, apply repop
2. replicas and primary commit
Here, repop->waitfor_(ack|dis...
Samuel Just
11:36 PM Revision 353d9ccf (ceph): fixed typo
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> tamil
11:14 PM Revision 10ec5926 (ceph): test_librbd: fix warnings
test/test_librbd.cc: In member function ‘virtual void LibRBD_TestClone_Test::TestBody()’:
warning: test/test_librbd.c...
Sage Weil
11:05 PM Revision f6a16f74 (ceph): fix wrongly marked down whitelist
This used to have '...or wrong addr' but it doesn't any more. Sage Weil
09:06 PM Revision 5450567a (ceph): ReplicatedPG,PG: dump recovery/backfill state on pg query
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:11 PM Revision b133c490 (ceph): Merge remote-tracking branch 'gh/wip-2101'
Sage Weil
06:46 PM Revision dfe29aff (ceph): doc: reverted file and role names.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:27 PM Revision 508bf3fb (ceph): rbd: enable layering when using the new format
We'll add options for different features later.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
05:47 PM Revision f8478d4c (ceph): upstart: Make ceph-osd always set the crush location.
This used to be conditional on config having osd_crush_location set,
but with that, minimal configuration left the OS...
Tommi Virtanen
05:25 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
wip_2769 should fix it. Samuel Just
05:00 PM Revision d1054df6 (ceph): doc: perf counters
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:00 PM Revision 5ceb7c73 (ceph): doc: fix config metavariables discussion
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:55 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
bcfa573f5f615f3403ff71da0212cd1cee7e7d9c
The core provided pretty good support for this sequence of events, markin...
Samuel Just
04:42 PM Revision e855cb24 (ceph): rgw: don't override subuser perm mask if perm not specified
Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.
Sig...
Yehuda Sadeh
04:41 PM Revision f33c0bee (ceph): rgw: don't override subuser perm mask if perm not specified
Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.
Sig...
Yehuda Sadeh
04:40 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
Sage Weil
04:00 PM Revision 09c60b43 (ceph): doc: added :: to code example.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:55 PM Revision ad8beeb4 (ceph): doc: minor edits.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:47 PM Revision 63a17998 (ceph): doc: cookbook name change broke some things in doc. Fixed.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:33 PM Revision 51148b81 (ceph): radosgw-admin: use --bucket instead of old --bucket-id
The --bucket-id support was removed. Sage Weil
02:31 PM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
*next is enough to figure out whether we ended on a boundary. Samuel Just
02:07 PM Feature #2680 (Resolved): osd: report backfill progress via query
5450567a6769b64ace385140ffb645d9d805e354 Samuel Just
01:58 PM Revision 65c43e34 (ceph): debian: fix ceph-fs-common-dbg depends
Signed-off-by: James Page <james.page@ubuntu.com> James Page
01:58 PM Revision d6c766ea (ceph): debian: fix ceph-fs-common-dbg depends
Signed-off-by: James Page <james.page@ubuntu.com> James Page
12:22 PM rbd Bug #2777: qemu: report discard support
It looks like this is handled by what qemu calls a 'device property', so you have to use a command line like:
<pre...
Josh Durgin
08:14 AM rbd Bug #2777: qemu: report discard support
Yeah, QUEUE_FLAG_DISCARD is as far as I looked yesterday too. The question now is how to set that within qemu. Josh Durgin
11:23 AM CephFS Bug #2778 (Can't reproduce): uclient: failed asserts if mounting with a non-existent directory
From Noah.
If Ceph is mounted with a non-existent directory, the mount call returns a success, but on unmount does t...
Greg Farnum
11:23 AM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-12_05:00:04-regression-stable-master-basic/9637
ubuntu@teutho...
Tamilarasi muthamizhan
09:58 AM RADOS Feature #2101 (Resolved): crushtool: check for weight overflow on reweight
Sage Weil
03:06 AM Revision cc8df29e (ceph): rados tool: bulk objects removal
Issue #2776. Allow the removal of multiple objects in a single
rados tool command:
# rados -p pool rm obj1 [obj2 [...
Yehuda Sadeh
02:59 AM Revision 762a5b63 (ceph): Merge remote-tracking branch 'gh/wip-cct'
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
01:56 AM Revision f20b6022 (ceph): Merge branch 'next'
Conflicts:
src/rados.cc
Sage Weil
01:54 AM Revision 99a048d8 (ceph): rados: more usage cleanup
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:53 AM Revision 0081c8e4 (ceph): rados: usage message
Bad linebreaks, wrapping, stringification, missing doc for bench args
Signed-off-by: Dan Mick <dan.mick@inktank....
Dan Mick
12:35 AM Revision 0782db36 (ceph): doc: changed role file names as part of update to roles.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:35 AM Revision e5997f4e (ceph): doc: added DHO config.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:11 AM Revision 173d592a (ceph): rados tool: remove -t param option for target pool
Bug #2772. This fixes an issue that was introduced when we
added the 'rados cp' command. The -t param was already use...
Yehuda Sadeh

07/11/2012

11:36 PM Revision 31c8dcc1 (ceph): crush: sum and check quantized weights for bucket
Sum the quantized weights for each bucket, and check that for overflow.
This could change the results of a compile m...
Sage Weil
11:18 PM rbd Bug #2777: qemu: report discard support
I don't know what I'm talking about, but stumbling around in the kernel seems to point at QUEUE_FLAG_DISCARD as the f... Dan Mick
02:19 PM rbd Bug #2777 (Resolved): qemu: report discard support
todin reported problems enabling discard inside qemu.
I'm not sure if this is rbd-specific, but there seem to be s...
Josh Durgin
11:14 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9430
ubuntu@teutho...
Tamilarasi muthamizhan
09:01 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
... Sage Weil
11:10 PM rbd Bug #2715: krbd: spinlock wrong CPU
latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9371
12-07-11T19:36...
Tamilarasi muthamizhan
11:03 PM Revision 675a1b7b (ceph): crush: Set maximum device/bucket weights.
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
11:03 PM Revision c9fc5a24 (ceph): crush: prevent integer overflow on reweight
Disallow setting OSD weights to a value over 10,000 and cap bucket weight
at 10,000,000 in a CRUSH map. Addresses iss...
caleb miles
10:32 PM Revision d29ec1e2 (ceph): rados: usage message
Bad linebreaks, wrapping, stringification, missing doc for bench args
Signed-off-by: Dan Mick <dan.mick@inktank....
Dan Mick
09:23 PM Revision 9b289486 (ceph): nuke: honor 'check-locks: ...' field in targets file
If you are nuking a yaml file with check-locks: false, don't check locks. Sage Weil
09:14 PM Revision 3abc4128 (ceph): internal: archive mon data dirs
These can be useful for debugging, and are usually pretty small.
Fixes: #2714
Sage Weil
09:10 PM Bug #2772 (Resolved): rados bench -t broken
Sage Weil
10:16 AM Bug #2772: rados bench -t broken
we can't change bench flags.
just don't make a short one-char flag for the target... use --target or whatever instead
Sage Weil
10:12 AM Bug #2772: rados bench -t broken
FWIW, I hacked up a patch last night (changed -t to -n to avoid conflicts, and made some changes to the usage message... Dan Mick
09:10 PM Revision cff2cfa2 (ceph): internal: move pulling archive w/ tar to helper
Sage Weil
06:52 PM Revision 95e8d87b (ceph): rados tool: remove -t param option for target pool
Bug #2772. This fixes an issue that was introduced when we
added the 'rados cp' command. The -t param was already use...
Yehuda Sadeh
05:59 PM Revision 986c0ef9 (ceph): rbd: test with layering enabled
RBD_FEATURES=0 hits a bug that's fixed in wip-rbd-parent.
Once that's merged, we can add RBD_FEATURES=0 tests back in.
Josh Durgin
04:19 PM Revision 5b107783 (ceph): Makefile: don't install crush headers
This is leftover from when we built a libcrush.so. We can re-add when we
start doing that again.
Reported-by: Laszl...
Sage Weil
04:19 PM Revision 2c001b28 (ceph): Makefile: don't install crush headers
This is leftover from when we built a libcrush.so. We can re-add when we
start doing that again.
Reported-by: Laszl...
Sage Weil
04:04 PM Revision 22d0648d (ceph): librados: simplify cct refcounting
get() in ctor, put() in dtor.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:58 PM Revision c5bcb04b (ceph): lockdep: stop lockdep when its cct goes away
When a cct is destroyed, tell lockdep so that it can shut down if it needed
it.
Signed-off-by: Sage Weil <sage@inkta...
Sage Weil
03:27 PM Revision fe6ed3e7 (ceph): ffsb is marginal, remove from smoke suite
Sage Weil
02:23 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
Sage Weil
01:54 PM Bug #2387 (Duplicate): mon: could not get service secret for auth subsystem
Sage Weil
11:31 AM Feature #2776 (Resolved): rados tool: bulk removal of objects
Provide an option to remove multiple objects in a single command. Yehuda Sadeh
10:08 AM Bug #2771: mon|osd: fail to create pgs on startup
i think this is related:... Sage Weil
09:19 AM Feature #2775 (Rejected): rest-bench man page
Sage Weil
09:19 AM devops Documentation #2774 (Resolved): doc: ceph-disk man page
commit:a450cab2b8148cb8a9b043d629feccf89e5aabac Sage Weil
09:11 AM rbd Subtask #2603 (Fix Under Review): librbd layering: open parent on open
In branch wip-rbd-parent Josh Durgin
08:16 AM Feature #2666 (Resolved): rados tool: copy pool
Sage Weil
04:35 AM rbd Feature #2773 (Resolved): libvirt: incorrect snapshot behavior
Since consistent (disk and memory state) snapshots are currently supported only with qcow images, libvirt should:
- all...
Andrey Korolyov
04:27 AM Revision 7adc6c08 (ceph): mon: simplify logmonitor check_subs; less noise
* simple helper to translate name to id
* verify sub type is valid in caller
* assert sub type is valid in method
...
Sage Weil
03:26 AM Revision 19742ec2 (ceph): Revert "smoke: add msgr failures"
This reverts commit 9278e231e64f49c3205c2ded8b1f2d3b27265eac. Sage Weil
02:58 AM Revision 657c9db2 (ceph): move cfuse fsx into marginal suite
This should probably pass, given the testing that ObjectCacher gets these
days with librbd_fsx.
Sage Weil
02:58 AM Revision c3e1ab49 (ceph): remove suites/stress/basic
Sage Weil
02:58 AM Revision 392a6596 (ceph): move some old flaky tasks into marginal suite
These were pulled out of regression a while ago. Put them into the
marginal suite where they will be regularly run a...
Sage Weil
02:04 AM Revision a542d89e (ceph): mds: fix race in connection accept; fix con replacement
We solve two problems with this patch. The first is that the messenger
will now reuse an existing session's Connecti...
Sage Weil
01:21 AM Revision fa96e19f (ceph): Merge branch 'stable' into next
Sage Weil
01:21 AM Revision 0f917c2f (ceph): osd: guard class call decoding
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:21 AM Revision 0ff6c979 (ceph): test_stress_watch: just one librados instance
This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets...
Sage Weil
01:19 AM Revision ee1c029d (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill....
Samuel Just
01:18 AM Revision d3c97dae (ceph): librados: take lock when signaling notify cond
When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the po...
Sage Weil
01:17 AM Revision ec490d87 (ceph): client: fix locking for SafeCond users
Need to wait on flock, not client_lock.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

07/10/2012

11:13 PM Revision 38e2de3d (ceph): doc: No ssh -t -t, forcing a pty allocation there makes it hang.
Earlier, this was a single -t, and that is overridden by the fact that
stdin is not a tty, so that did nothing.
Sign...
Tommi Virtanen
11:03 PM Revision 79e3416c (ceph): doc: removed the ceph directory per tommi's update to the chef-cookbooks.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:23 PM Revision 5c84f013 (ceph): doc: Adding apt update message. VM users didn't get the package otherwise.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:13 PM Bug #2772: rados bench -t broken
actually the -t param is for 'rados cp' which is for an object, not for cppool. Yehuda Sadeh
10:02 PM Bug #2772: rados bench -t broken
let's just make it --target-pool or similar. this is an uncommon enough command that it doesn't really need a single... Sage Weil
10:00 PM Bug #2772 (Resolved): rados bench -t broken
The -t parameter has been assigned to rados cppool (target pool), but apparently it was already being used for rados... Yehuda Sadeh
09:56 PM Feature #2714 (In Progress): teuthology: store monitor data in archives
Sage Weil
09:25 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
... Sage Weil
09:20 PM Revision 83339a0c (ceph): Merge branch 'wip-rbd-clone-dmick' into master
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Dan Mick
09:03 PM Revision fe5c0cd9 (ceph): osd: guard class call decoding
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:00 PM Revision 2a6af208 (ceph): rbd: update manpage for clone command
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:59 PM Revision e3531497 (ceph): rbd: update cli test reference files
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:59 PM Revision 64949d42 (ceph): librbd, rbd, rbd.py: Add parent info reporting
split out new parent info into separate retrieval methods;
structure packing on rbd_image_info_t was becoming a probl...
Dan Mick
08:59 PM Revision 6ad59610 (ceph): rbd, librbd: add tests for cloning
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:59 PM Revision 7b0c71cc (ceph): librados: pool_get_name handles "not found" wrong
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:59 PM Revision 96a517b4 (ceph): librados: Add mapping from pool id to pool name and ioctx to rados client
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
08:59 PM Revision 8cc43427 (ceph): librbd: open_image snapshot handling
Allow opening with no snap, but check for error for nonexistent snap
Backport: argonaut
Signed-off-by: Dan Mick <dan...
Dan Mick
08:59 PM Revision a94fc8c8 (ceph): rbd, librbd, rbd.py: cloning (copy-on-write child image of snapshot)
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:59 PM Revision 68bad03b (ceph): msgr: queue accept event when pipe is accepted
Queue an event when an incoming connection is accepted.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:59 PM Revision fab6e824 (ceph): msg/DispatchQueue: queue and deliver accept events
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:59 PM Revision 95f85df7 (ceph): librados: add new constructor to form a Rados object from IoCtx
This creates a separate reference to an existing connection, for
use when a client holding IoCtx needs to consult ano...
Dan Mick
08:30 PM Revision d4ef004e (ceph): dispatcher: new 'accept' event type
Create a new event type when we successfully accept a connection. This is
distinct from the authorizor verification,...
Sage Weil
08:30 PM Revision 912123f2 (ceph): msgr: move Pipe, DispatchQueue into separate files
These don't need to be subclasses of SimpleMessenger. Separate!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:30 PM Revision 34908140 (ceph): msgr: uninline ctor and dtor
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:30 PM Revision bffd46c5 (ceph): msgr: remove obsolete dead path
This hasn't triggered in years.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:30 PM Revision bafcbdeb (ceph): msgr: remove unnecessary accept check
We don't need to worry about racing with shutdown here; the cleanup
procedure will stop the accepter thread before cl...
Sage Weil
08:30 PM Revision 9e291bae (ceph): msgr: simplify checks for queueing connection events
Looking through git history it is not clear exactly how these checks
came to be. They seem to have grown during the ...
Sage Weil
08:30 PM Revision e84565d9 (ceph): msgr: move dispatch thread into DispatchQueue
The DispatchQueue class now completely owns message delivery. This is
cleaner and lets us drop the redundant destina...
Sage Weil
08:30 PM Revision 1a4a1330 (ceph): msgr: drop unnecessary (un)locking on queuing connection events
This used to be necessary because the pipe_lock was used when queueing
the pipe in the dispatch queue. Now that is h...
Sage Weil
08:28 PM Revision 2d071b9e (ceph): msgr: use explicit Connection for messages sent to ourself
Move to an explicit Connection for messages sent to ourselves, instead of
using the one on the local_pipe (which we'l...
Sage Weil
08:28 PM Revision fa4c1920 (ceph): msgr: use local IncomingQueue instead of Pipe
Simpler, cleaner. No need for the rest of the Pipe crap. We just need to
queue messages for ourselves.
Signed-off-...
Sage Weil
08:28 PM Revision c7caf91b (ceph): msgr: simplify IncomingQueue creation, pointers
* create it via DispatchQueue
* keep pointer to parent DispatchQueue
* drop now-useless contextual arguments to mos...
Sage Weil
08:25 PM Revision 35b13266 (ceph): msgr: take over existing Connection on Pipe replacement
If a new pipe/socket is taking over an existing session, it should also
take over the Connection* associated with the...
Sage Weil
07:22 PM rbd Feature #2770 (Resolved): krbd: define tasks to add osd_client compound class op support
Sage Weil
06:29 PM Revision 80111cce (ceph): test_stress_watch: just one librados instance
This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets...
Sage Weil
06:04 PM Revision 782a17da (ceph): doc: added cookbook path instruction.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:38 PM Bug #2387 (Can't reproduce): mon: could not get service secret for auth subsystem
Sage Weil
04:26 PM Bug #2600 (Resolved): osd: crazy long watch timeout?
Sage Weil
09:28 AM Bug #2600 (In Progress): osd: crazy long watch timeout?
Sage Weil
03:14 PM Revision 8869621f (ceph): doc: Added some pre-clarification for gdisk. Added DHO OSD hardware con...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:52 PM Bug #2765: Extremely Long Object Latencies
I noticed one 15 second lag on an OSD caused by the filestore requiring 15 seconds to drain open ops before a sync. Samuel Just
02:52 PM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
Merged in commit:96ca508dcd227db11597cd4963208872af30fcc4 Josh Durgin
02:24 PM rbd Feature #2585 (Resolved): rbd: clone command
commit:83339a0cbb925022c44507ab0ea3da7b399f511e
Dan Mick
11:06 AM Bug #2767 (Resolved): rbd workunit failing
The real problem is that the workunit task only grabs things from master, since github only seems to create tarballs ... Josh Durgin
09:24 AM Bug #2767 (Resolved): rbd workunit failing
... Sage Weil
10:47 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
Log location: ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8335
ubuntu@teuth...
Tamilarasi muthamizhan
10:33 AM Feature #2768 (Resolved): teuthology: make workunit task work on different branch/sha1 etc
This involves making it download scripts individually instead of from a single tarball, since github only creates tar... Josh Durgin
10:14 AM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8328 Sage Weil
10:13 AM Bug #2453 (In Progress): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
ubuntu@teuthology:/a/teuthology-2012-07-10_05:00:02-regression-stable-master-basic/8546 Sage Weil
09:06 AM Subtask #2757: mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transactions
Pending testing Joao Eduardo Luis
04:36 AM Revision fbe5577f (ceph): CephContext: don't leak admin socket
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:32 AM Revision 74e62051 (ceph): CephContext: drop obsolete ceph_destroy_context
Drop this useless helper and call cct->put() directly. The comment that
this can't be used after global_init is no l...
Sage Weil
04:27 AM Revision 58f46bb9 (ceph): test_stress_watch: just one librados instance
This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets...
Sage Weil
04:27 AM Revision 6aa9f496 (ceph): CephContext: ref count
These get shared via the librados API.
Fixes: #845
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:27 AM Revision 074df64d (ceph): librados: use CephContext ref counts
Take ownership of the passed cct. Drop it when we destroy the
RadosClient.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
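The two CephContext entries above describe reference counting for an object that is shared through an API. A minimal sketch of that get/put pattern follows, assuming that each holder takes its own reference and that the last put() triggers teardown. `RefCounted` and `Client` are hypothetical names for illustration only; the actual Ceph code is C++ and is not reproduced here.

```python
import threading


class RefCounted:
    """Hypothetical stand-in for a shared context object; not Ceph's CephContext."""

    def __init__(self):
        self._lock = threading.Lock()
        self._refs = 1          # the creator holds the first reference
        self.destroyed = False

    def get(self):
        """Take an additional reference and return self for chaining."""
        with self._lock:
            assert self._refs > 0, "get() on a destroyed object"
            self._refs += 1
        return self

    def put(self):
        """Drop one reference; tear down when the last one goes away."""
        with self._lock:
            assert self._refs > 0, "put() without a matching reference"
            self._refs -= 1
            last = self._refs == 0
        if last:
            self.destroyed = True   # real code would release resources here


class Client:
    """Hypothetical consumer that holds its own reference for its lifetime."""

    def __init__(self, ctx):
        self.ctx = ctx.get()

    def shutdown(self):
        self.ctx.put()
        self.ctx = None
```

With this shape, the creator can drop its reference as soon as the client exists, and the context survives until the client shuts down.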
01:01 AM Revision 645ac87b (ceph): ReplicatedPG: fix replay op ordering
After a client reconnect, the client replays outstanding ops. The
OSD then immediately responds with success if the ...
Samuel Just
01:01 AM Revision 67a5e75e (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill....
Samuel Just
12:27 AM Revision 67361c10 (ceph): librbd: return an error when removing a non-existent image
Try treating the image as new format if it's not in the old-style
directory, which is the last step in old-style remo...
Josh Durgin

07/09/2012

09:59 PM Bug #2067: librados: we leak CephContext from rados_create()
librados does not destroy the CephContext created in rados_create() in rados_destroy() (or the c++ equivalents). Thi... Sage Weil
09:59 PM Bug #2766 (Resolved): cephcontext: needs refcount
CephContext needs to be refcounted, so that it can be properly shared. Sage Weil
09:37 PM Bug #2600: osd: crazy long watch timeout?
Oh.... Sage Weil
08:58 PM Revision ec87a1ea (ceph): cls_rbd: add logging for failed set_parent steps
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
06:43 PM Revision 96ca508d (ceph): Merge remote-tracking branch 'gh/wip-rbd-id'
Sage Weil
06:41 PM Bug #2765 (Closed): Extremely Long Object Latencies
Saw this during testing on one of the aging clusters.
example object tid: 312322
Full logs available here:
...
Mark Nelson
06:06 PM Revision b23e4e58 (ceph): doc: Removed legacy paths and keyname settings from examples.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:03 PM Bug #2508 (Resolved): osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobje...
645ac87b84ae348483c7d52b1968af5f44378dea Samuel Just
06:02 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
67a5e75ed6a9b3689a984d6f7eb6a18e1242125e Samuel Just
04:12 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
Remove the loud warnings? Like, the mismatch warnings? Greg Farnum
04:08 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
Ok, so this is actually caused by not updating the backfill peer's partial stats while recovering the backfill portio... Samuel Just
03:10 PM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
Merged as 90c42837696d252be40348fdd4499e15aa835aeb. Anonymous
01:56 PM Linux kernel client Bug #2764 (Closed): xfstest hang; osd socket closed messages
... Sage Weil
01:51 PM Cleanup #2763 (Resolved): move rbd locking infrastructure to a separate objclass
Locks are generic and can serve other uses (e.g., rgw). We should separate them from the rbd code. Yehuda Sadeh
12:52 PM Bug #2762 (Resolved): mon crash ceph::__ceph_assert_fail (assertion=0x63d150 "begin->last_committ...
#0 0x00007fb9b32b5b7b in raise (sig=<optimized out>) at ../nptl/sysdeps/unix/sysv/linux/pt-raise.c:42
#1 0x0000000...
Samuel Just
12:07 PM Bug #2761 (Resolved): osd: failed to recover before timeout expired
Logs: ubuntu@teuthology:/a/teuthology-2012-07-09_05:00:08-regression-stable-master-basic/8039
Attaching below an e...
Tamilarasi muthamizhan
11:54 AM Bug #2022: osd: misdirected request
Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
Attaching the...
Tamilarasi muthamizhan
11:50 AM Bug #2022: osd: misdirected request
Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7824
Attaching the...
Tamilarasi muthamizhan
11:45 AM Bug #2022: osd: misdirected request
latest log: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
and
ubuntu@t...
Tamilarasi muthamizhan
11:33 AM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
... Anonymous
11:21 AM Linux kernel client Bug #2759 (Resolved): libceph: crush tree algorithm is not understood
After changing the crush algorithm for a bucket to tree, the kernel client cannot parse the OSDMap:... Josh Durgin
10:31 AM RADOS Feature #2101: crushtool: check for weight overflow on reweight
caleb miles
09:38 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
Extend the in-memory mock-up of KeyValueDB to support the safe iterator, and create an iterator to iterate over all t... Joao Eduardo Luis
09:16 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
Joao Eduardo Luis
09:05 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
The current iterator implementation iterates directly over the database state, even if there are concurrent updates.
...
Joao Eduardo Luis
09:15 AM Subtask #2757 (Resolved): mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transac...
Using the LevelDBStore's safe iterator, obtain the existing key/values and pack them into transactions.
These tran...
Joao Eduardo Luis
08:57 AM Feature #2755 (Resolved): ceph-conftool: optionally return the default for a config option if no ...
Presently, the only way to figure out what the default for a specific configuration option is, is to consult the sour... Florian Haas
03:33 AM Revision b387077b (ceph): debian: include librados-config in librados-dev
Reported-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

07/08/2012

09:39 PM Revision 8e5fe62a (ceph): doc: remove reference to 'ceph stop' command
It doesn't exist anymore.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

07/07/2012

06:19 AM Linux kernel client Bug #2754 (Resolved): cephfs show_location produces kernel "divide error: 0000 [#1]" when run aga...
Originally reported in http://marc.info/?l=ceph-devel&m=134151028212170&w=2:
Really easy to reproduce on my 3.2.0 ...
Florian Haas
06:16 AM CephFS Bug #2753 (Resolved): Writes to mounted Ceph FS fail silently if client has no write capability o...
Originally reported in http://marc.info/?l=ceph-devel&m=134151023912148&w=2:
How to reproduce (this is on a 3.2.0 ...
Florian Haas
06:12 AM Bug #2752: Setting large maxosd kills all mons
Original ML response from Greg (http://marc.info/?l=ceph-devel&m=134151077312444&w=2):
Ungh. Can you file a bug repo...
Florian Haas
06:10 AM Bug #2752 (Resolved): Setting large maxosd kills all mons
Originally reported in http://marc.info/?l=ceph-devel&m=134151018912133&w=2:
When setting a large maxosd, all mons...
Florian Haas
03:15 AM Revision 9ea22133 (ceph): use sudo to kill teuthology proc
Sage Weil
03:01 AM Revision dddf783f (ceph): Merge branch 'wip-cond'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
12:04 AM Revision 98a21cc8 (ceph): move qemu_iozone test to marginal suite
Sage Weil

07/06/2012

11:45 PM Revision 03c2dc24 (ceph): lockdep: increase max locks
Hit this limit with the rados api tests.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:45 PM Revision b554d112 (ceph): config: add unlocked version of get_my_sections; use it internally
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:45 PM Revision 01da287b (ceph): config: fix lock recursion in get_val_from_conf_file()
Introduce a private, already-locked version.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
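The config entries above fix lock recursion by adding a private variant that assumes the lock is already held. A minimal sketch of that locked/unlocked split follows, assuming a non-recursive mutex. The `Config` class and its method names are hypothetical and do not mirror Ceph's C++ config code.

```python
import threading


class Config:
    """Hypothetical config holder; names are illustrative, not Ceph's API."""

    def __init__(self, values):
        self._lock = threading.Lock()   # non-recursive: re-acquiring would deadlock
        self._values = dict(values)

    def get_val(self, key):
        """Public entry point: takes the lock, then delegates."""
        with self._lock:
            return self._get_val(key)

    def _get_val(self, key):
        """Already-locked variant: the caller must hold self._lock."""
        assert self._lock.locked(), "caller must hold the lock"
        return self._values.get(key)

    def get_many(self, keys):
        """Holds the lock once and uses the private variant for each key."""
        with self._lock:
            return [self._get_val(k) for k in keys]
```

Internal callers that already hold the lock use the underscore-prefixed method, so the lock is never taken twice on the same call path.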
11:45 PM Revision c73c64a0 (ceph): config: fix recursive lock in parse_config_files()
The _impl() helper is only called from parse_config_files(); don't retake
the lock.
Signed-off-by: Sage Weil <sage@i...
Sage Weil
11:44 PM Revision b33553aa (ceph): rgw: handle response-* params
Handle response-* params that set response header field values.
Fixes #2734, #2735.
Backport: argonaut
Signed-off-by...
Yehuda Sadeh
11:43 PM Revision 97c1562d (ceph): rgw: handle response-* params
Handle response-* params that set response header field values.
Fixes #2734, #2735.
Backport: argonaut
Signed-off-by...
Yehuda Sadeh
11:43 PM Revision 6646e891 (ceph): rgw: initialize fields of RGWObjEnt
This fixes various valgrind warnings triggered by the s3test
test_object_create_unreadable.
Signed-off-by: Sage Weil...
Sage Weil
10:17 PM Revision 74f68750 (ceph): osd: add missing formatter close_section() to scrub status
Also add braces to make the open/close matchups easier to see. Broken
by f36617392710f9b3538bfd59d45fd72265993d57.
...
Sage Weil
08:45 PM Revision 020b2996 (ceph): pg: report scrub status
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
08:45 PM Revision db6d83b3 (ceph): pg: track who we are waiting for maps from
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
08:45 PM Revision e1d4855f (ceph): pg: reduce scrub write lock window
Wait for all replicas to construct the base scrub map before finalizing
the scrub and locking out writes.
Signed-off...
Mike Ryan
08:03 PM Feature #2743 (Resolved): new cond assertions, associated cleanups
Sage Weil
09:41 AM Feature #2743 (Resolved): new cond assertions, associated cleanups
Sage Weil
07:21 PM Revision 72bf66f6 (ceph): doc: added some discussion to libvirt.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:14 PM Revision 1316f868 (ceph): cond: cleanup
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:14 PM Revision 7c919e9b (ceph): workqueue: kick -> wake or _wake, depending on locking
Break kick() into wake() and _wake() methods, depending on whether the
lock is already held. (The rename ensures tha...
Sage Weil
07:14 PM Revision b5693a6e (ceph): librados: take lock when signaling notify cond
When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the po...
Sage Weil
07:14 PM Revision 9c6fdd5a (ceph): librados: drop unused local variables
This is unused boilerplate cruft.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:14 PM Revision da7a3284 (ceph): cond: drop unused Wait variant
This was used for debugging forever ago.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:07 PM Revision b486f2ff (ceph): client: fix locking for SafeCond users
Need to wait on flock, not client_lock.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:07 PM Revision 8de0c227 (ceph): cond: assert that we are holding the same mutex as the waiter
Try to verify that we are holding the same mutex that the waiter is
waiting on. Specifically:
* only wait on a sin...
Sage Weil
06:29 PM Revision 57463455 (ceph): Merge branch 'master' of github.com:ceph/ceph
John Wilkins
06:29 PM Revision 540c2c3a (ceph): doc: Minor cleanup on deploy with Chef.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:18 PM Revision 3df51040 (ceph): rgw: don't store bucket info indexed by bucket_id
Issue #2701. This info wasn't really used anywhere and we weren't
removing it. It was also sharing the same pool name...
Yehuda Sadeh
05:17 PM Revision 27409aa1 (ceph): rgw: don't store bucket info indexed by bucket_id
Issue #2701. This info wasn't really used anywhere and we weren't
removing it. It was also sharing the same pool name...
Yehuda Sadeh
05:16 PM Revision 84ba6bf6 (ceph): Merge branch 'stable' into next
Yehuda Sadeh
05:15 PM Revision 960c2124 (ceph): rados tool: copy object
New rados command: rados cp <src-obj> [dest-obj]
Requires specifying source pool. Target pool and locator can be spe...
Yehuda Sadeh
05:15 PM Revision 16ea64fb (ceph): rados tool: copy entire pool
A new rados tool command that copies an entire pool
into another existing pool.
Signed-off-by: Yehuda Sadeh <yehuda@...
Yehuda Sadeh
05:15 PM Revision d7510066 (ceph): rados tool: copy object in chunks
Instead of reading the entire object and then writing it,
we read it in chunks.
Signed-off-by: Yehuda Sadeh <yehuda@...
Yehuda Sadeh
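The entry above replaces a read-everything-then-write copy with a chunked one. A minimal sketch of that idea follows, using generic file-like objects rather than the rados tool's own I/O calls. The function name and the default chunk size are assumptions made for illustration.

```python
import io


def copy_in_chunks(src, dst, chunk_size=4 * 1024 * 1024):
    """Copy src to dst one chunk at a time; return the number of bytes copied."""
    total = 0
    while True:
        chunk = src.read(chunk_size)
        if not chunk:            # empty bytes means end of input
            break
        dst.write(chunk)
        total += len(chunk)
    return total
```

Peak memory use is bounded by `chunk_size` instead of by the size of the object being copied.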
05:15 PM Revision 9814374a (ceph): test_rados_tool.sh: test copy pool
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:12 PM Revision d59b2db4 (ceph): Merge remote-tracking branch 'origin/stable' into next
Yehuda Sadeh
05:05 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
Sage Weil
03:23 PM Feature #2702: gitbuilder: sync each build as it completes
Sage Weil
09:43 AM Feature #2702 (In Progress): gitbuilder: sync each build as it completes
Sage Weil
05:02 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
Yehuda Sadeh
04:35 PM rgw Feature #2735: rgw: support response-* request params in GET obj
Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a. Yehuda Sadeh
05:02 PM Revision ed3bd211 (ceph): increase thrashosds timeout
Samuel Just
04:48 PM Feature #2680: osd: report backfill progress via query
not yet tested https://github.com/athanatos/ceph/tree/wip_recovery_query Samuel Just
04:45 PM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
in progress already https://github.com/athanatos/ceph/tree/wip_biginfo
blocked by 2748
Samuel Just
04:35 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
lavor: basic
owner: scheduled_sage@metropolis
success: false
ubuntu@teuthology:/a/sage-2012-07-06_11:03:15-regress...
Samuel Just
04:35 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a. Yehuda Sadeh
10:58 AM rgw Bug #2734: rgw: fails to authenticate preauthenticated requests with sub resource
Fails only with specific sub resources (e.g., response-*) that were not hard coded. Yehuda Sadeh
04:34 PM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
roles:
- - mon.a
- osd.0
- osd.1
- osd.2
- - mds.a
- osd.3
- osd.4
- osd.5
- - client.0
targets...
Samuel Just
04:32 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
Saw on master
osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ceph version 0.48argona...
Samuel Just
04:23 PM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
Samuel Just
03:51 PM Bug #2600: osd: crazy long watch timeout?
another instance:... Sage Weil
03:47 PM Revision 23d31d3e (ceph): ceph.spec.in: add ceph-disk-{activate,prepare}
Reported-by: Jimmy Tang <jtang@tchpc.tcd.ie>
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:47 PM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
... Sage Weil
01:23 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
Fixed, commit:3df51040b19b9c110e08f42befa0cae18cbb984c. Yehuda Sadeh
11:34 AM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
Thanks, updated the doc to increase the VM size. Tyler Brekke
10:24 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
Currently, Paxos' own trimming goes through the Paxos proposal mechanism itself.
However, services do not do it...
Joao Eduardo Luis
09:29 AM Subtask #2737 (Resolved): mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
This includes Paxos' own trimming, so it is done at the same time on all the quorum members. Furthermore, it allows the ... Joao Eduardo Luis
10:18 AM Subtask #2738: mon: Single-Paxos: Sync: Add snapshot support to the monitor store
Implemented, although it has not been tested yet. Joao Eduardo Luis
09:31 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
Having snapshot support, and being able to iterate over it instead of iterating directly over the store's keys, allow... Joao Eduardo Luis
10:09 AM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
latest logs:
/a/teuthology-2012-07-06_00:00:03-regression-next-testing-basic/6655
Tamilarasi muthamizhan
10:01 AM Subtask #2746 (Rejected): mon: Single-Paxos: Sync: Test message passing
Make sure the message passing is working before starting to work on the store synchronization.
From the monitor's ...
Joao Eduardo Luis
09:58 AM Subtask #2745 (Closed): mon: Single-Paxos: Sync: Add new message support to the Monitor class
Three different "roles" on a monitor cluster regarding synchronization:
* Leader - responsible for disabling the P...
Joao Eduardo Luis
09:53 AM Subtask #2744 (Resolved): mon: Single-Paxos: Sync: Create new Message type
Create a new message type MMonSync, to dissociate it from the MMonProbe messages, and to allow us to use fields speci... Joao Eduardo Luis
09:43 AM RADOS Feature #2706 (In Progress): crush: update kernel code to decode tunables
Sage Weil
09:40 AM Feature #2742 (Resolved): qa: ms socket inject failures in regression suite
Sage Weil
09:40 AM Subtask #2741 (Resolved): mon: Single-Paxos: Sync: Assess requirements for QA tests
TBD Joao Eduardo Luis
09:37 AM CephFS Bug #2740 (Resolved): mds: crash in Objecter when shutting down too early
Jimmy Tang reported this on the mailing list.... Greg Farnum
09:36 AM Subtask #2739 (Resolved): mon: Single-Paxos: Sync: Synchronize the store of a drifted monitor
Synchronize the store of a monitor (mon.X) that drifted away from the system (say, it failed for a considerable amoun... Joao Eduardo Luis
09:22 AM Subtask #2736 (Resolved): mon: Single-Paxos: Sync: Implement message passing
See subtasks. Joao Eduardo Luis
04:28 AM Revision 668ce00a (ceph): osd: make on_removal() pure virtual
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:26 AM Revision 3d00130b (ceph): osd: fix PG dtor compile error
We need at least one non-pure virtual method to tell gcc where the
vtable goes. The destructor wins!
libosd.a(libo...
Sage Weil
01:37 AM Bug #2267: Ceph client crashed after shutting down one mds and osd
To which version do you plan to add this patch? Maciej Galkiewicz
12:20 AM Revision 6e3fb20d (ceph): Merge remote-tracking branch 'gh/wip_osd_threading'
Sage Weil
12:18 AM Revision bdf09f20 (ceph): OSD,ReplicatedPG::snap_trimmer: pg->put() in process, not snap_trimmer()
This clarifies responsibility for the reference.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:18 AM Revision 0475ee45 (ceph): OSD,PG::replica_scrub: move msg->put() into queue process
This clarifies the ownership of the reference.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:18 AM Revision fe14c181 (ceph): OSD::PeeringWQ::_dequeue(PG*) drop pg refs
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:18 AM Revision 816d4247 (ceph): ReplicatedPG.cc: C_OSD_CommittedPushedObject move pg->put() to finish
This should clarify the ownership of the pg ref.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:18 AM Revision 8e93e8b0 (ceph): OSD: ensure that OpSequencer lives through on_commit callback
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:18 AM Revision 69174198 (ceph): OSDMonitor: disable cluster snapshot
The map handling changes broke cluster snapshot support.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:18 AM Revision 09af670b (ceph): PG,ReplicatedPG: on_removal must handle repop and watcher state
on_removal is now in ReplicatedPG in order to handle watcher state
and repop state. Additionally, workqueue dequeues a...
Samuel Just
12:18 AM Revision cab7b75d (ceph): OSD: drop pg refcounts in OpWQ::_dequeue(PG*)
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:18 AM Revision 868168a5 (ceph): OSD: clean up recovery_wq queueing and ref counting
Previously, we tended to explicitly remove the pg from the queue using
remove_myself on the xlist::item. This cause...
Samuel Just

07/05/2012

10:30 PM Feature #2692 (Resolved): stable testing debian repos
Sage Weil
10:29 PM Revision c70392a8 (ceph): doc: minor typo
Signed-off-by: Ross Turk <ross@inktank.com> Ross Turk
10:24 PM Revision 4d7bb075 (ceph): doc: update copyright notice in footer
Signed-off-by: Ross Turk <ross@inktank.com> Ross Turk
10:16 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
bad con->private = NULL in monc __close_session Sage Weil
09:38 PM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
easy to reproduce with... Sage Weil
08:44 AM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
I just reproduced this several times on master using the socket failure injection. This is probably socket errors br... Sage Weil
09:37 PM Bug #2022: osd: misdirected request
ugh, wrong bug.. ignore last two messages! Sage Weil
05:19 PM Bug #2022: osd: misdirected request
this is easily reproduced with... Sage Weil
05:18 PM Bug #2022 (In Progress): osd: misdirected request
... Sage Weil
09:01 PM Revision 57bc8da9 (ceph): doc: minor updates to the reStructuredText file.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:00 PM Revision 0659f7c5 (ceph): doc: minor cleanup.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:47 PM Revision 1c9e1c61 (ceph): doc: Publishing as described. Still requires some verification and QA.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:43 PM Revision e5fb4991 (ceph): run: make -a short for --archive
Sage Weil
06:18 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
The size of a VM is adjustable:... Greg Farnum
05:49 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
Tested on vercoi/downburst
https://docs.google.com/a/inktank.com/document/d/1C1QLD54sbpRJ7o6M9ML2TrexXC5JIMecHJxOM...
Tyler Brekke
05:21 PM Cleanup #2041 (Resolved): osd: move peering into worker threads
Sage Weil
05:15 PM Revision 87d1cdb5 (ceph): OSD: _remove_pg not ruin iterator consistency
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 7b5d8e8c (ceph): ReplicatedPG: on_activate for a peer might happen before flush
We don't ensure for a peer that the flush completes before activation,
merely that we don't serve any ops until flush...
Samuel Just
05:15 PM Revision f0b2310f (ceph): ReplicatedPG: RepModify track epoch_started and bail on interval change
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 193f18f2 (ceph): FileStore: delete source collection if not replaying collection_rename
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 99c23b69 (ceph): OSD: split notify|info|query messages for old clients
Old clients do not expect mixed epoch compound messages. Thus, we
send each sub-message independently.
Signed-off-b...
Samuel Just
05:15 PM Revision 4ce17cca (ceph): test/encoding/types.h: disable pg_query_t encoding test
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 9fc5db8c (ceph): ReplicatedPG::op_applied: update last_update_applied iff !aborted
scrub state and last_update_applied will have been reset during
the interval change.
Signed-off-by: Samuel Just <sam...
Samuel Just
05:15 PM Revision 7f2354c7 (ceph): doc/scripts/gen_state_diagram.py: make parser a bit more forgiving
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 31db8ed0 (ceph): OSD::activate_map: don't publish map until pgs in deleted pools have be...
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 7e26d6df (ceph): PG: C_PG_MarkUnfoundLost put pg in finish
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 5dc45f77 (ceph): PG: PG now store its own PGPool
Otherwise, we need to synchronize access to the shared PGPool objects.
The wasted memory is probably preferable to syn...
Samuel Just
05:15 PM Revision 5c0e8b46 (ceph): OSD: bail out of do_recovery if no longer primary and active
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 3ca6359c (ceph): common/WorkQueue.h: add BatchWorkQueue
Rather than dispatching one item at a time to process, etc,
BatchWorkQueue dispatches up to a configurable number of
...
Samuel Just
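The BatchWorkQueue entry above describes handing the processing callback several queued items per call instead of one. A minimal single-threaded sketch of that dispatch rule follows. `BatchQueue`, `queue`, and `run_once` are hypothetical names, and the worker threads and locking of the real C++ class are deliberately left out.

```python
from collections import deque


class BatchQueue:
    """Hypothetical single-threaded model of batch dispatch; not Ceph's BatchWorkQueue."""

    def __init__(self, process, batch_size):
        if batch_size < 1:
            raise ValueError("batch_size must be at least 1")
        self._items = deque()
        self._process = process      # callback that receives a list of items
        self._batch_size = batch_size

    def queue(self, item):
        """Append one item to the pending queue."""
        self._items.append(item)

    def run_once(self):
        """Dispatch up to batch_size queued items in one call; return how many."""
        batch = []
        while self._items and len(batch) < self._batch_size:
            batch.append(self._items.popleft())
        if batch:
            self._process(batch)
        return len(batch)
```

Batching lets the callback amortize per-call work, such as combining outgoing messages, across every item in the batch.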
05:15 PM Revision d8a68e76 (ceph): OSD: do_(notifies|infos|queries) must now be passed a map
This removes the need to call them from within the osd lock.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:15 PM Revision c1f2a802 (ceph): OSD: peering_wq is now a BatchWorkQueue
process_peering_events now handles multiple pgs at once to better
batch up notifies, etc.
Signed-off-by: Samuel Just ...
Samuel Just
05:15 PM Revision 0ee3d87f (ceph): OSD: process_peering_event check for new map on each pg
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 9b182d20 (ceph): OSD: maybe_update_heartbeat_peers, don't print pg
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision c6db1b2e (ceph): PG: delay ops in do_request, not queue_op
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 1b558fba (ceph): OSD: write_info/log during process_peering_events, do_recovery
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 392df3b7 (ceph): Revert "osd: reset last_peering_interval on replica activate"
This reverts commit 17114f266a336b6edd7e98975d494fdd487eec20. Samuel Just
05:15 PM Revision f9282e6c (ceph): Revert "osd: check against last_peering_reset in _activate_committed"
This reverts commit 86aa07d7a91ac23074e76551c3a6db3a5736cffa. Samuel Just
05:15 PM Revision 442b5583 (ceph): PG: pass activate epoch with Activate event
This allows us to pass into activate() the epoch in which the
message triggering activation occurred, allowing us to mark
the...
Samuel Just
05:15 PM Revision 311a061e (ceph): OSD: move watch into OSDService
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision b200710b (ceph): OSD,PG: clean up pg removal
PG opsequencers will be used for removing a pg. If the pg is recreated
before the removal is complete, we need the n...
Samuel Just
05:15 PM Revision e8d09d02 (ceph): PG,OSD: prevent pg from completing peering until deletion is complete
hobject_t must now be globally unique in the filestore. Thus, if we
start creating objects in a pg before the remova...
Samuel Just
05:15 PM Revision 35949c54 (ceph): ReplicatedPG: explicitly block on not active for certain ops
Ops and some subops need to wait for active to ensure correct ordering
with respect to peering operations.
Signed-of...
Samuel Just
05:15 PM Revision 2552a7f4 (ceph): OSD,PG: _share_map_outgoing must not require osd_lock
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 4fec85f2 (ceph): OSD: activate_map() in handle_osd_map only when active
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 040a22b6 (ceph): OSD: map_cache should contain const OSDMap
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 3dcce50e (ceph): OSD: only do_(notify|info|query) for up osd
pg may have an older map and attempt to notify|info|query on a down
osd.
Signed-off-by: Samuel Just <sam.just@inktan...
Samuel Just
05:15 PM Revision a5bf3d71 (ceph): OSD: lock recovery_wq before debug output on finish_recovery_op
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 8079a489 (ceph): OSD,PG: clean up _get_or_create_pg and set interval based on msg
Previously, we set last_peering_reset based on the epoch in which the pg
is created. We now pass the map from the qu...
Samuel Just
05:15 PM Revision 7c1dc90a (ceph): OSD: don't advance_pg() if pg is up-to-date
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision 73f5ce94 (ceph): OSD,PG: issue pg removals in line, remove remove_list
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision c7581b69 (ceph): CrushWrapper: rmaps don't need to be mutable
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:15 PM Revision a67a874b (ceph): CrushWrapper: add locking around crush_do_rule
crush_do_rule uses a cache on the bucket objects.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:15 PM Revision b242c565 (ceph): OSD: on pg_removal, project_pg_history to get current interval
First, we don't really want to remove the pg if we can use it. Second,
there might be messages in the pg peering que...
Samuel Just
05:14 PM Revision f6fc1b30 (ceph): OSD: remove superfluous pg get/put around enqueue_op
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision b6bf5736 (ceph): ReplicatedPG: do not eval_repop if aborted
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 91355fb7 (ceph): ReplicatedPG: change ReplicatedPG debug output to match PG
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 3d32712e (ceph): OSD: don't assume pending pg removals have flushed
_create_lock_pg might encounter a preexisting pg collection simply
because the removal transaction had not yet comple...
Samuel Just
05:14 PM Revision 90e3727e (ceph): OSD: do not lock osd during dequeue_op
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 10ec5bc5 (ceph): OSD: push_waiters is no longer used
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 14381b38 (ceph): OSD: get map read lock during queue_want_up_thru
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 492e6351 (ceph): OSD: do not drop osd_lock in handle_osd_map
PGs have their map updates done in a different thread. Thus, we no
longer need to grab the pg locks. activate_map n...
Samuel Just
05:14 PM Revision f68f52ce (ceph): OSD: queue_want_up_thru in process_peering_event
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 502c0335 (ceph): PG, OSD: info_map shouldn't contain the MOSDPGInfo*
Rather, we will just pass the same type as the notifies.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:14 PM Revision b80b5005 (ceph): OSD,PG: Move pg accessible methods, objects to OSDService
In order to clarify data structure locking, PGs will now access
OSDService rather than the OSD directly. Over time, m...
Samuel Just
05:14 PM Revision 33b985dc (ceph): OSD,PG: added helper methods for creating and dispatching RecoveryCtxs
This is simpler than having to update all of the RecoveryCtx users
whenever we change the types in RecoveryCtx.
Sign...
Samuel Just
05:14 PM Revision 9a4a9b9a (ceph): PG: flush ops by the end of peering without osr.flush
Rather than explicitly flushing the filestore, send a noop through the
filestore at the beginning of peering and, at...
Samuel Just
05:14 PM Revision ddef446d (ceph): OSD,PG: Move Op,SubOp queueing into PG
PG now handles delaying/discarding messages since pg map epoch may not
be the same as the OSD map.
Signed-off-by: Sa...
Samuel Just
05:14 PM Revision bbd8af02 (ceph): osd/: Make pg osdmap be independent of osd, other pg maps
This will allow handle_osd_map to not stop other work queues.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:14 PM Revision 12e22b3d (ceph): OSD,PG: handle pg map advance in process_peering_event
The pg map will now be advanced in process_peering_event (in advance_pg)
to allow handle_osd_map to not grab pg locks...
Samuel Just
05:14 PM Revision c7e34951 (ceph): PG: include info and query by value in peering events
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 65abfc07 (ceph): OSD: initialize pgs in get_or_create_pg via handle_create
Previously, pgs were initialized via Info/Log/etc. Since the event
which triggered the pg creation may now be queued...
Samuel Just
05:14 PM Revision b081bd4c (ceph): PG: CephPeeringEvents can now be descriptively printed
The CephPeeringEvt constructor is now templated to allow
storing a description string for debugging.
Signed-off-by: ...
Samuel Just
05:14 PM Revision e10e81c0 (ceph): PG: compound messages must carry epoch_sent for each part
Query and Notify messages include logical messages from multiple
pgs. Each logical message (pg_query_t and pg_notify...
Samuel Just
05:14 PM Revision fff0874c (ceph): PG: use osd->requeue_ops for ops, pg->queue_for_peering to requeue pg
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 64b71c69 (ceph): OSD: check for deleting in process_peering_event
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision b5078bb2 (ceph): PG,OSD: add OSD::queue_for_op, use in PG::queue_op
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision b0cb96bf (ceph): PG,OSD: fix op checking in pg, take_waiters during ActMap
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:14 PM Revision 18fec69a (ceph): OSD,PG::scrub() move pg->put() into queue process
This clarifies ownership of the pg reference.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:14 PM Revision 1ffd190e (ceph): PG: CephPeeringEvt
CephPeeringEvt is now the supertype for all peering state machine
events. This will allow us to generalize checking ...
Samuel Just
05:14 PM Revision 3691e3ca (ceph): PG: Move handle_* methods to PG
PG now calls handle_event in RecoveryState.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:14 PM Revision 7c414c5d (ceph): OSD: Remove handle_pg_missing, MOSDPGMissing no longer used
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
05:14 PM Revision d0158230 (ceph): OSD,PG: push message checking to pg
old_peering_evt now checks CephPeeringEvts generically in
PG::handle_peering_event().
Signed-off-by: Samuel Just <sa...
Samuel Just
05:14 PM Revision 6922f9cf (ceph): osd/: move history update from handle_pg_query into pg
Previously, replica history was updated in OSD::handle_pg_query.
Updating the history is now handled in the pg state ...
Samuel Just
05:14 PM Revision 36d1381a (ceph): PG: use intrusive_ptr in CephPeeringEvt
Properly disposing of the event_base member of CephPeeringEvt
requires use of intrusive_ptr.
Signed-off-by: Samuel J...
Samuel Just
05:14 PM Revision 33bcbb33 (ceph): PG: process peering events in a queue
Peering events are now queued via queue_peering_event in the
peering_queue.
Signed-off-by: Samuel Just <sam.just@ink...
Samuel Just
05:08 PM Linux kernel client Bug #2523 (Resolved): xfs: xfs_iolock_reclaimable
patches from hch fix this up. added them to testing branch. Sage Weil
04:18 PM rbd Bug #2488 (Can't reproduce): rbd: EIO from rbd map
Sage Weil
04:18 PM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
Sage Weil
04:18 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
Sage Weil
04:13 PM Messengers Bug #2569 (Resolved): msgr: connect_rank crash
Sage Weil
04:12 PM devops Feature #2698: crowbar: Guide for using "front" network
I verified that this causes no trouble before crowbar is installed and that eth1 does come up automatically after reb... JuanJose Galvez
03:11 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
Yehuda Sadeh
03:10 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
Yehuda Sadeh
02:32 PM Revision ea11c7f9 (ceph): Allow URL-safe base64 cephx keys to be decoded.
In these cases + and / are replaced by - and _ to prevent problems when using
the base64 strings in URLs.
Signed-off...
Wido den Hollander
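The substitution described in ea11c7f9 can be illustrated with a short Python sketch. It is not the Ceph C++ change itself: the helper name `decode_key` is hypothetical, and the padding repair is an extra assumption of this sketch rather than something the commit message states.

```python
import base64


def decode_key(text: str) -> bytes:
    """Decode a base64 string in either the standard or the URL-safe alphabet.

    Hypothetical helper for illustration only.
    """
    # Map the URL-safe characters back to the standard alphabet.
    standard = text.replace("-", "+").replace("_", "/")
    # Assumption of this sketch: restore any stripped '=' padding so the
    # length is a multiple of 4 before decoding.
    standard += "=" * (-len(standard) % 4)
    return base64.b64decode(standard)


raw = bytes([0xFB, 0xFF, 0xFE, 0x01])
std = base64.b64encode(raw).decode()          # uses '+' and '/'
url = base64.urlsafe_b64encode(raw).decode()  # uses '-' and '_'
assert decode_key(std) == decode_key(url) == raw
```

Both spellings of the same key decode to identical bytes, which is the property the commit is after.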
02:11 PM Feature #2733 (Resolved): osd: fadvise replica data don't use
This should help quite a bit with cache pressure. Samuel Just
01:56 PM Feature #2485 (Resolved): crush: validate hash function
Sage Weil
01:29 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
We can also keep data about the last N slow requests on the OSD to allow easier debugging. Samuel Just
01:16 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
syntax should be something along the lines of:
ceph tell osd <num> query_ops
Result should be a json formatted ...
Samuel Just
01:20 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
The apt-get update command I added said "apt-key update", which isn't the same at all. Fixed in
commit:6d351b0b2...
Dan Mick
01:18 PM Feature #2732 (Resolved): Idempotency tester
Add proper support for:
* omap
* xattr
* collection_rename
Samuel Just
01:17 PM Cleanup #2731 (Closed): ceph tell osd <num> should be ceph osd <subcommand> to match ceph pg <sub...
Samuel Just
11:41 AM Feature #2727: filestore: add split
On second thought, that would be hard to encode into a transaction, it'll probably just take a
[(hash_prefix, bits...
Samuel Just
11:06 AM Feature #2727 (Resolved): filestore: add split
Add split functionality to filestore. The existing collection structure should be already in place.
Interface coul...
Samuel Just
11:20 AM Documentation #2729 (Resolved): doc: OSD internals
Broad overview of OSD internals:
* Threading/wq model
* Message handling
* Message ordering (wait for map, pg wait...
Samuel Just
11:18 AM Feature #2728 (Resolved): OSD: handle split
OSD message handling needs to handle messages for pgs between the map epoch creating them and the actual split (shoul... Samuel Just
11:17 AM rbd Documentation #2670 (In Progress): Docs shouldn't direct users to echo to /sys/bus/rbd for normal...
Sage Weil
11:06 AM Bug #2725 (Duplicate): filestore: add split
Samuel Just
11:04 AM Bug #2725 (Duplicate): filestore: add split
Add split functionality to filestore. The existing collection structure should be already in place.
Interface cou...
Samuel Just
11:05 AM rbd Feature #2609 (In Progress): librbd: new image name -> image head indirection
Sage Weil
11:05 AM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
Sage Weil
11:04 AM rbd Feature #2726 (Resolved): krbd: clean up bio_pair leak/whatever
Sage Weil
11:03 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
I have restarted ceph mon with invoke-rc.d ceph restart mon and now it works! However I am 100% sure that I did it af... Maciej Galkiewicz
10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
you can just restart, or to verify in the future, 'ceph --admin-daemon /var/run/ceph/ceph-mon.a.asok version' (adjust... Sage Weil
10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
Oh! The monitor is still running the old version. Sage Weil
10:57 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
... Maciej Galkiewicz
10:55 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
Weird! Two things:
- does 'ceph status' work? (this is identical to -s now)
- can you try with commit:b53cdb9...
Sage Weil
10:45 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
# ceph -v
ceph version 0.48argonaut (commit:c2b20ca74249892c8e5e40c12aa14446a2bf2030)
Maciej Galkiewicz
10:44 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
can you include 'ceph -v' output? this works for me...... Sage Weil
10:05 AM Bug #2721 (Resolved): Ceph status does not work in 0.48 even if it is still documented
... Maciej Galkiewicz
10:22 AM rbd Feature #2724 (Resolved): rbd: add protect/unprotect command
See http://ceph.com/docs/master/dev/rbd-layering/#protection Josh Durgin
10:12 AM rbd Feature #2723 (Resolved): librbd: protect/unprotect as appropriate during cloning
See http://ceph.com/docs/master/dev/rbd-layering/#protection Josh Durgin
10:10 AM rbd Feature #2722 (Resolved): cls_rbd: add class methods to get/set protected status
The possible settings are: protected, unprotected, and protecting. See http://ceph.com/docs/master/dev/rbd-layering/ ... Josh Durgin
10:02 AM rbd Feature #2720 (Resolved): rbd: add children command
This should list all direct children of a given snapshot. Josh Durgin
10:01 AM rbd Feature #2719 (Resolved): librbd: provide functions for listing parents and their children
Wrap the class methods for listing parents and children of parents. Josh Durgin
09:57 AM rbd Feature #2718 (Resolved): librbd: map parent -> child in a per-pool rbd_children object when cloning
Josh Durgin
09:55 AM rbd Feature #2717 (Resolved): cls_rbd: add methods for maintaining mapping from parent to child images
From http://ceph.com/docs/master/dev/rbd-layering/#cls-rbd:... Josh Durgin

07/04/2012

10:23 PM Linux kernel client Bug #2716: crash when cluster goes down and new one comes up
... Sage Weil
10:21 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
- vstart cluster
- mount uml
- do some stuff (dbench, control-c, sync)
- stop vstart cluster
- (waited a while)
...
Sage Weil
09:47 PM Revision 6dbf53e2 (ceph): nuke: nuke based on archive path
Use path/config.yaml for targets, path/pid for pid to kill, and
path/owner for job owner.
Sage Weil
09:47 PM Revision 132dc006 (ceph): nuke: be more careful about kill; simplify
If the archive dir is specified, make sure we are killing the right
process.
Also drop the kill_process helper; it's...
Sage Weil
09:47 PM Revision 2499bd3f (ceph): watch-suite: stupid script to watch teuth run progress
Sage Weil
09:29 PM Revision 45fcca1f (ceph): valgrind: add strptime suppressions
Precise's strptime triggers valgrind false positives.
Use ship_utilities to push the valgrind.supp file over, which ...
Sage Weil
09:24 PM rbd Bug #2715 (Resolved): krbd: spinlock wrong CPU
... Sage Weil
08:59 PM Revision 7fa85790 (ceph): osd: add missing formatter close_section() to scrub status
Also add braces to make the open/close matchups easier to see. Broken
by f36617392710f9b3538bfd59d45fd72265993d57.
...
Sage Weil
07:47 PM Revision 12a1f623 (ceph): move other ffsb workloads to marginal suite
Sage Weil
04:30 PM Revision c0b01cda (ceph): Merge branch 'stable'
Conflicts:
src/test/cli/radosgw-admin/help.t
Sage Weil
04:21 PM Revision f67fe4e3 (ceph): librados: Bump the version to 0.48
Signed-off-by: Wido den Hollander <wido@widodh.nl>
Signed-off-by: Sage Weil <sage@inktank.com>
Wido den Hollander
03:34 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
Sage suggested we might want to have teuthology archive the monitor's data directory, since that information is often... Greg Farnum
02:52 PM Bug #2593: logmonitor: decode failure
nope.
i wonder if we should make teuthology archive the mon data directory... that info is often useful, even for ...
Sage Weil
02:49 PM Bug #2675 (Resolved): osd: segfault during log trim
Sage Weil
02:32 PM Revision bcfcf8ef (ceph): librados: add assert_version as an operation on an ObjectOperation
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
02:31 PM Bug #2712 (Resolved): osd: small write performance regression from 0.47.*
Several reports on the ML about small writes going slower than before, generating 'slow request' warnings where previ... Sage Weil
02:29 PM Revision 39eaa230 (ceph): ReplicatedPG: do not set reply version to last_update
The version should be oi.user_version as set above.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
02:27 PM Revision e6e36c0a (ceph): rgw: initialize fields of RGWObjEnt
This fixes various valgrind warnings triggered by the s3test
test_object_create_unreadable.
Signed-off-by: Sage Weil...
Sage Weil
02:02 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
i saw this on at least one other node as well. why? (this manifests itself as a 100-continue s3test failure in qa.... Sage Weil
01:45 PM Linux kernel client Bug #2710 (Resolved): freeze from dbench in uml
Sage Weil
01:16 PM Linux kernel client Bug #2710: freeze from dbench in uml
this was a problem with 3df5c88d0d469c71c43f2379f6a375fc2d314eb7, fixed and re-pushed testing.
the fix f28dfe91f5f...
Sage Weil
11:32 AM Linux kernel client Bug #2710 (Resolved): freeze from dbench in uml
Current testing and about 3 seconds of dbench 2 locks up Sage Weil
08:19 AM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
Sage Weil
07:22 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
... Sage Weil
12:39 AM Revision fb9d39d5 (ceph): move locktest to marginal suite
This fails 1 in 10 times or something like that. Sage Weil

07/03/2012

11:49 PM Revision f6cdd852 (ceph): Merge remote-tracking branch 'gh/wip-crush'
Sage Weil
11:24 PM Revision 35b9ec88 (ceph): rgw-admin: use correct modifier with strptime
Bug #2658: used %I (12h) instead of %H (24h)
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
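The difference between the two modifiers in 35b9ec88 is easy to reproduce. The sketch below uses Python's `datetime.strptime` rather than the C `strptime` that radosgw-admin calls, and the timestamp is made up, but the `%I` and `%H` directives carry the same meaning in both.

```python
from datetime import datetime

stamp = "2012-07-03 14:30:00"  # made-up timestamp with an hour > 12

# %H is the 24-hour clock (00-23), so an afternoon hour parses.
parsed = datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S")

# %I is the 12-hour clock (01-12), so hour 14 does not match the format.
try:
    datetime.strptime(stamp, "%Y-%m-%d %I:%M:%S")
    twelve_hour_ok = True
except ValueError:
    twelve_hour_ok = False
```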
11:24 PM Revision da251fe8 (ceph): rgw: send both swift x-storage-token and x-auth-token
older clients need x-storage-token, newer x-auth-token
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
11:23 PM Revision 4c19ecb9 (ceph): rgw: radosgw-admin date params now also accept time
The date format now is "YYYY-MM-DD[ hh:mm:ss]". Got rid of
the --time param for the old ops log stuff.
Signed-off-by...
Yehuda Sadeh
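A parser for the "YYYY-MM-DD[ hh:mm:ss]" format from 4c19ecb9 might look roughly like this in Python. The function name `parse_date_param` is hypothetical, and treating a date-only value as midnight is an assumption of the sketch, not something the commit message confirms.

```python
from datetime import datetime


def parse_date_param(value: str) -> datetime:
    """Parse 'YYYY-MM-DD' with an optional ' hh:mm:ss' suffix.

    Hypothetical helper; a date-only value is assumed to mean 00:00:00.
    """
    # Try the longer format first, then fall back to the date-only form.
    for fmt in ("%Y-%m-%d %H:%M:%S", "%Y-%m-%d"):
        try:
            return datetime.strptime(value, fmt)
        except ValueError:
            continue
    raise ValueError(f"expected YYYY-MM-DD[ hh:mm:ss], got {value!r}")
```

Trying the formats in order keeps the optional part optional without any manual string splitting.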
11:10 PM Revision 6958aeb8 (ceph): rgw-admin: fix usage help
s/show/trim
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
11:04 PM Revision e07b7113 (ceph): Added a debug message
The debug message is to print the string that should be JSON.
This is to track a nightly run failure.
Signed-off-by:...
tamil
10:36 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
Firstly, well done guys on achieving this version milestone. I successfully upgraded to the 0.48 format uneventfully ... Paul Pettigrew
10:25 PM Revision ad97415e (ceph): ceph-disk-prepare: Partition and format OSD data disks automatically.
Uses gdisk, as it seems to be the only tool that can automate GPT uuid
changes. Needs to run as root.
Adds Recommend...
Tommi Virtanen
09:20 PM Revision a1696fe0 (ceph): doc: removed /srv/osd.$id.journal from ceph.conf example.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:16 PM Revision 8f646478 (ceph): CrushTester.cc: remove BOOST dependencies.
remove calls to BOOST libraries for computing Chi-squared statistics and
producing discrete random variables with a g...
caleb miles
09:14 PM Revision 7e23aad6 (ceph): doc: Updates to 5-minute quick start.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:07 PM Revision 83c043f8 (ceph): radosgw-admin: fix cli test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:04 PM Revision ec0588e7 (ceph): Merge branch 'wip-config'
Sage Weil
08:04 PM Revision c667f5d7 (ceph): lockdep: increase max locks
Hit this limit with the rados api tests.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:49 PM Revision b99d11c4 (ceph): schedule_suite: use the sha1, not branch name
Keep the entire suite run on the same commit. We were resolving the sha1,
but not using it.
Sage Weil
07:23 PM Revision f3c24517 (ceph): nuke - optionally kill the process hung
Added a function kill_process to kill the process hung in the nightly runs.
It takes in pid as an optional argument.
...
tamil
07:07 PM Revision 12ba580f (ceph): config: add unlocked version of get_my_sections; use it internally
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:51 PM Revision 56741581 (ceph): ceph: fix cli help test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:48 PM Revision c11ca33b (ceph): Merge branch 'master' of github.com:ceph/ceph
John Wilkins
06:48 PM Revision fc18cca0 (ceph): doc: Clean up of 5-minute quick start.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:30 PM Revision 151bf0ee (ceph): ReplicatedPG: remove faulty scrub assert in sub_op_modify_applied
This assert assumed that all ops submitted before MOSDRepScrub was
submitted were processed by the time that MOSDRepS...
Samuel Just
06:30 PM Revision 8ade74b2 (ceph): ReplicatedPG: remove faulty scrub assert in sub_op_modify_applied
This assert assumed that all ops submitted before MOSDRepScrub was
submitted were processed by the time that MOSDRepS...
Samuel Just
06:21 PM Revision 09dd8cad (ceph): doc: Updating Getting Started with 5-minute quick start.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:20 PM Revision 32833e88 (ceph): ceph: better usage
Signed-off-by: Kyle Bader <kyle.bader@dreamhost.com> Kyle Bader
06:18 PM Revision da24208d (ceph): Merge branch 'master' of github.com:ceph/ceph
John Wilkins
06:17 PM Revision 8c95b5c8 (ceph): doc: restructuring quick start section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:10 PM Revision 4bd7c05b (ceph): IoCtxImpl: pass objver pointer to aio_operate_read
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:45 PM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
Sage Weil
04:44 PM RADOS Feature #2705 (Resolved): crush: graceful transition to new default tunables
Sage Weil
04:44 PM RADOS Bug #2214 (Resolved): crush: pgs only mapped to 2 devices with replication level 3
Sage Weil
04:44 PM RADOS Bug #2047 (Resolved): crush: with a rack->host->device hierarchy, several down devices are likely...
Sage Weil
04:43 PM RADOS Bug #187 (Rejected): crush: high variance, latency for straw buckets
Sage Weil
04:43 PM RADOS Feature #2422 (Resolved): crush: test that mapping result is uncorrelated
Sage Weil
04:39 PM rgw Bug #2106: failed s3tests.functional.test_s3.test_100_continue
recent logs from the nightly run: /a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5054
Tamilarasi muthamizhan
04:34 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
Tamilarasi muthamizhan wrote:
> latest logs:
> /a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5019...
Tamilarasi muthamizhan
04:33 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
latest logs:
/a/teuthology-2012-07-03_00:00:09-regression-next-testing-basic/5019
config.yaml:
++++++++++++
k...
Tamilarasi muthamizhan
04:24 PM Revision 4e774fbc (ceph): ceph-disk-prepare: Take fsid from config file.
Closes: #2546.
Signed-off-by: Tommi Virtanen <tv@inktank.com>
Tommi Virtanen
04:21 PM Revision cd0314d1 (ceph): config: remove bad argparse_flag argument in parse_option()
This is wrong, and thankfully valgrind picks it up.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:20 PM Revision 67455c21 (ceph): debian: strip new ceph-mds package
Reported-by: Amon Ott <a.ott@m-privacy.de>
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:53 PM devops Feature #2704 (Closed): sepia: Use ``names`` as resolver on plana, burnupi, vercoi
Anonymous
03:46 PM Revision 6bb19e74 (ceph): doc: Cleaned up rbd snapshots.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:45 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
Sage Weil
03:27 PM devops Feature #2549: ceph-disk-prepare: take fstype, mkfs and mount options from ceph.conf
As of commit ad97415ef72b55934adfa5024fd9af8fd1f0f82d, this now needs mount options too. Anonymous
03:26 PM devops Feature #2547 (Resolved): ceph-disk-prepare: handle partitioning and mkfs
commit ad97415ef72b55934adfa5024fd9af8fd1f0f82d
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-07-03 15:24:26...
Anonymous
03:20 PM Revision ad61bbd1 (ceph): config: fix lock recursion in get_val_from_conf_file()
Introduce a private, already-locked version.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
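The "private, already-locked version" pattern from ad61bbd1 can be sketched in Python as follows. The class and method names are hypothetical stand-ins, not the Ceph config API; the point is only that a public method takes the lock once and internal callers use a variant that assumes the lock is already held.

```python
import threading


class Config:
    """Toy stand-in for a config object guarded by a non-recursive lock."""

    def __init__(self):
        # threading.Lock is non-recursive: taking it twice in one thread blocks.
        self._lock = threading.Lock()
        self._values = {"debug": "1"}

    def _get_val_locked(self, key):
        # Private variant: the caller must already hold self._lock.
        return self._values.get(key)

    def get_val(self, key):
        # Public variant: takes the lock, then delegates.
        with self._lock:
            return self._get_val_locked(key)

    def get_many(self, keys):
        # Holds the lock once and calls the private variant, so the same
        # thread never tries to take the lock a second time.
        with self._lock:
            return [self._get_val_locked(k) for k in keys]
```

Had `get_many` called `get_val` instead, the second acquisition would block forever, which is the kind of recursion the commit title describes.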
03:15 PM Revision 7009c32e (ceph): config: fix recursive lock in parse_config_files()
The _impl() helper is only called from parse_config_files(); don't retake
the lock.
Signed-off-by: Sage Weil <sage@i...
Sage Weil
02:24 PM rbd Bug #2457 (Resolved): libvirt: migration fails with rbd in 0.9.11 and 0.9.12
Fixed by upstream libvirt commit 78290b1641e95304c862062ee0aca95395c5926c. Josh Durgin
02:08 PM rbd Bug #2457: libvirt: migration fails with rbd in 0.9.11 and 0.9.12
Fixed in 0.9.12-3 (Debian naming) and later. Recent reports on the list say the same, so the issue can safely be closed. Andrey Korolyov
02:17 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
Yehuda Sadeh
02:15 PM rbd Bug #2700 (Resolved): blkdeviotune method at libvirt doesn't work on RBD volumes
Since qemu implemented its own i/o limiting mechanism rather than cgroups, all block backends may be controlled over ... Andrey Korolyov
01:46 PM Revision b53cdb97 (ceph): config: remove bad argparse_flag argument in parse_option()
This is wrong, and thankfully valgrind picks it up.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:17 PM Messengers Bug #2569: msgr: connect_rank crash
i've merged fix for this into master, commit:204bc594be1a6046d1b362693d086b49294c2a27 (with possible side-effects fro... Sage Weil
12:16 PM Bug #2682 (Resolved): config lockdep error (recursive lock?) in LibRadosAio.SimpleWritePP
Sage Weil
10:48 AM devops Feature #2699 (Rejected): crowbar: change barclamp-glance to use rbd
Anonymous
10:38 AM devops Feature #2698: crowbar: Guide for using "front" network
We need an easy way to drop a "dhclient eth1" upstart job into a crowbar server installation. Just a sudo tee /etc/in... Anonymous
10:28 AM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
Anonymous
10:26 AM devops Feature #2697 (In Progress): crowbar: ISO generation, reproducible in a cloud image vm
Anonymous
10:16 AM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
Anonymous
10:12 AM devops Feature #2696 (Rejected): chef: Automated QA
Use downburst vms on vercoi to automatically bring up ceph clusters, do basic RADOS/RBD functionality testing, tear d... Anonymous
10:11 AM devops Feature #2695 (Closed): crowbar: Automated QA
Use downburst vms on vercoi to automatically bring up ceph clusters, do basic RADOS/RBD functionality testing and Ope... Anonymous
10:10 AM rgw Bug #2642 (Resolved): rgw: show/trim usage using also time (not just date)
Done, commit:80a939a99db64f7802a4a3c1320316c91720f5d9 Yehuda Sadeh
10:08 AM rgw Bug #2658 (Resolved): rgw-admin: usage show fails when specifying hour > 12
Fixed, commit:c5d19b6df0bcb238e5e68732b4d252b06f2d9e56. Yehuda Sadeh
10:05 AM devops Feature #2584 (Resolved): sepia: provide networking, DHCP for dynamic virtual machines
Anonymous
10:05 AM devops Feature #2584: sepia: provide networking, DHCP for dynamic virtual machines
Split the DNS part to #2694, this is already providing value to users. Anonymous
09:59 AM devops Feature #2584: sepia: provide networking, DHCP for dynamic virtual machines
Status update: missing DNS updates, all the strictly required components are there; vms attached to the front network... Anonymous
10:04 AM devops Feature #2553: crowbar: open question: What's the correct way to add RBD support to the Nova barc...
(Wrong ticket, ignore) Anonymous
10:04 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
Anonymous
09:24 AM devops Feature #2546 (Resolved): ceph-disk-prepare: take fsid from ceph.conf (support --cluster=name)
commit 4e774fbcb38fd6883232b72352512a5f8e4a66e8
Author: Tommi Virtanen <tv@inktank.com>
Date: 2012-07-03 09:22:28...
Anonymous
08:04 AM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
... Sage Weil
04:24 AM Revision d347cc89 (ceph): Merge tag 'v0.48argonaut'
v0.48argonaut Sage Weil
04:08 AM Revision 14ec7782 (ceph): client: improve dump_cache output
Hunting #1737.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:20 AM Revision 8b462019 (ceph): doc: release notes for 0.48
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:18 AM Revision 562cad66 (ceph): doc: 'Configuring a Storage Cluster' -> 'Configuration'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:54 AM Revision 030478e2 (ceph): Merge branch 'wip-msgr'
Sage Weil
12:54 AM Revision b5287c19 (ceph): lockdep: enable in common_init
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:54 AM Revision 204bc594 (ceph): msgr: do not try to reconnect con with CLOSED pipe
If we have a con with a closed pipe, drop the message. For lossless
sessions, the state will be STANDBY if we should...
Sage Weil
12:54 AM Revision 2429556a (ceph): msgr: fix pipe replacement assert
We may replace an existing pipe in the STANDBY state if the previous
attempt failed during accept() (see previous pat...
Sage Weil
12:54 AM Revision 0dbc5416 (ceph): msgr: make D_CONNECT constant non-zero, fix ms_handle_connect() callback
A while ago we inadvertently broke ms_handle_connect() callbacks because
of a check for m being non-zero in the dispa...
Sage Weil
12:54 AM Revision 03445290 (ceph): msgr: move incoming queue to separate class
This extricates the incoming queue and its funky relationship with
DispatchQueue from Pipe and moves it into Incoming...
Sage Weil
12:54 AM Revision 1f3a722e (ceph): msgr: move dispatch_entry into DispatchQueue class
A bit cleaner.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:54 AM Revision b7007a15 (ceph): msgr: preserve incoming message queue when replacing pipes
If we replace an existing pipe with a new one, move the incoming queue
of messages that have not yet been dispatched ...
Sage Weil
12:54 AM Revision 5dfd2a51 (ceph): msgr: choose incoming connection if ours is STANDBY
If the connect_seq matches, but our existing connection is in STANDBY, take
the incoming one. Otherwise, the other e...
Sage Weil
12:54 AM Revision f7d4e397 (ceph): msgr: restart_queue when replacing existing pipe and taking over the queue
The queue may have been previously stopped (by discard_queue()), and needs
to be restarted.
Fixes consistent failure...
Sage Weil
12:53 AM Revision e6ad6d25 (ceph): msgr: move to STANDBY if we replace during accept and then fail
If we replace an existing pipe during accept() and then fail, move to
STANDBY so that our connection state (connect_s...
Sage Weil

07/02/2012

11:06 PM Revision b5098b38 (ceph): ceph.spec.in: Change license of base package to GPL and use SPDX format
LGPLv2 in spec file is not correct, because some of the included
packages/binaries are GPLv2. For example:
src/moun...
Holger Macht
11:06 PM Revision c2b20ca7 (ceph): v0.48argonaut
Sage Weil
11:05 PM Revision a1fe5892 (ceph): mon: initialize quorum_features
This could cause us to incorrectly encode new features into the monstore
that an old mon won't understand.
This is o...
Sage Weil
09:25 PM Feature #2692 (Resolved): stable testing debian repos
Sage Weil
09:18 PM Revision f2d37c52 (ceph): librbd: handle NULL old_format pointer in detect_format()'s logging
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
09:18 PM Revision 6f096b6c (ceph): librbd: use new class methods to access rbd_directory
Since we only use the new interface with new format images,
old clients won't be able to list them.
Signed-off-by: J...
Josh Durgin
09:18 PM Revision 0ad14c95 (ceph): qa: add rbd command rename tests
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:18 PM Revision 288bde3f (ceph): qa: restructure rbd cli tests
Move tests into functions and remove all images at the end of each.
Signed-off-by: Josh Durgin <josh.durgin@inktank....
Josh Durgin
09:18 PM Revision aa3255b8 (ceph): qa: add tests for rbd ls with old and new formats
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:18 PM Revision 1c096a80 (ceph): qa: run rbd tests against old and new formats
Now it's not the caller's responsibility to specify the format,
and we can eliminate a job from the qa suite.
Signed...
Josh Durgin
09:18 PM Revision d04b2fb7 (ceph): cls_rbd_client: check return code of operate
This lets us get a more useful error, like -ENOENT, instead
of a failure to decode the empty output buffer.
Signed-o...
Josh Durgin
09:18 PM Revision d82a5026 (ceph): qa: add rbd remove tests
These check that removing an image still works if an rbd rm
command was interrupted partway through.
Signed-off-by: ...
Josh Durgin
09:08 PM Revision 9278e231 (ceph): smoke: add msgr failures
Sage Weil
08:05 PM Revision 2628530e (ceph): doc: fixed --cap error and a few additional bits of cleanup.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:26 PM Revision b9414b6c (ceph): fewer hosts for mon tests
Sage Weil
06:49 PM rbd Bug #2689 (In Progress): qemu iozone test hangs
Josh Durgin
02:51 PM rbd Bug #2689 (Resolved): qemu iozone test hangs
... Sage Weil
05:07 PM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
took down osd.2 and osd.3 with same crash. coredumps are on the hosts.. Sage Weil
05:06 PM Bug #2691 (Won't Fix): osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
... Sage Weil
04:51 PM Revision 2472034c (ceph): OSD::do_command: unlock pg only if we had it
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:49 PM Revision 841451f2 (ceph): MOSDSubOp: set hobject_incorrect_pool in decode_payload
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:40 PM Bug #2690 (Won't Fix): mon: persist quorum features
currently the non-leaders do not know the quorum features, and encode everything with a minimal (0) feature set.
...
Sage Weil
03:55 PM Revision b91becad (ceph): doc: Added sudo and cleaner instruction for unmap.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:44 PM Revision 38aa344d (ceph): ceph: fix valgrind error check
grep all the logs, not the dir... doh! Sage Weil
03:35 PM Revision f16a9c46 (ceph): doc: updated usage on RBD kernel object commands.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:07 PM Revision 82abe00e (ceph): Merge pull request #15 from fghaas/wip-rgw-doc
radosgw documentation improvements John Wilkins
03:03 PM Revision e1a385bb (ceph): Merge pull request #17 from fghaas/rgw-manpage
doc: drop mention of MDS capabilities from radosgw man page Sage Weil
02:54 PM Revision e1860134 (ceph): doc: drop mention of MDS capabilities from radosgw man page
Signed-off-by: Florian Haas <florian@hastexo.com> Florian Haas
02:26 PM Linux kernel client Bug #2688 (Duplicate): lockup on ffsb + thrashing
... Sage Weil
02:10 PM Revision deceb709 (ceph): filestore: initialize m_filestore_do_dump
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:54 PM Bug #2687: FileStore crashes when "osd_journal_size" is larger than the filesystem
for files, i think the right approach is to fallocate(), which will reserve the space. we shouldn't have to look at ... Sage Weil
12:47 PM Bug #2687 (Resolved): FileStore crashes when "osd_journal_size" is larger than the filesystem
See: http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7282
If a user (on tmpfs, in this case) specifies...
Greg Farnum
12:49 PM Bug #2476: osd: watch timeout depends on operations to an object
fix qa/workunits/rbd/copy.sh when this is fixed !!! Sage Weil
12:36 PM rbd Feature #2556: rbd tool: break image locks
The current progress is in wip-rbd-locking. Still needs tests and docs, plus a small cleanup as noted on github. Josh Durgin
12:32 PM rbd Feature #2686 (Resolved): rbd: let users specify a usage for shared locks
If existing lockers have the same usage, the lock succeeds. Otherwise, it fails. This could let you use locks with e.... Josh Durgin
11:28 AM rbd Feature #2685 (Rejected): Support QEMU migration with caching enabled
This is a libvirt problem, it's not related to qemu at all. I already looked into and tested whether qemu was doing f... Josh Durgin
11:21 AM rbd Feature #2685 (Rejected): Support QEMU migration with caching enabled
See http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/7524
Apparently newer versions of QEMU refuse to...
Greg Farnum
09:44 AM Documentation #2684 (Won't Fix): doc: ceph and all daemons take --show-config
Quoting Sage:
For future reference, you can get a dump of all these values with
ceph-osd -i 123 --show-...
Anonymous
09:30 AM Bug #2593: logmonitor: decode failure
Do we know if the log in question actually existed on disk or not? Greg Farnum
07:28 AM Bug #2593: logmonitor: decode failure
saw this again on next:... Sage Weil
07:37 AM Bug #2683: ceph-fuse: crash during fsstress
... Sage Weil
07:31 AM Bug #2022 (Need More Info): osd: misdirected request
apparently there is a different cause for this:... Sage Weil
05:57 AM Subtask #2621 (In Progress): mon: Single-Paxos: synchronize the MonitorDBStore of oblivious monitor
Joao Eduardo Luis

07/01/2012

09:46 PM Feature #2651: mon: race calling tick() when doing slurping
making this a cleanup so that it stops confusing me :) Sage Weil
09:27 PM Revision 96ccb060 (ceph): add rbd_xfstests to kernel suite
Sage Weil
08:57 PM Bug #2683 (Can't reproduce): ceph-fuse: crash during fsstress
... Sage Weil
07:48 PM Bug #2682 (Resolved): config lockdep error (recursive lock?) in LibRadosAio.SimpleWritePP
... Sage Weil
03:06 PM CephFS Bug #2681: client: got push without mds session
this was with 'ms inject socket failure = 200' Sage Weil
03:06 PM CephFS Bug #2681 (Resolved): client: got push without mds session
... Sage Weil
02:41 PM Bug #2599 (Can't reproduce): osd: crash in ReplicatedPG::C_OSD_OndiskWriteUnlock::finish
chalking this up to the bugs in next a couple weeks back Sage Weil
09:22 AM Feature #2680 (Resolved): osd: report backfill progress via query
... Sage Weil
07:09 AM CephFS Bug #2679 (Can't reproduce): POSIX file lock not released on process termination
I obtained a POSIX file lock with the following code:
> --- snip ---
>
> ...
> std::string x = "/tmp/ceph_mount...
Daniel Godas-Lopez
04:45 AM Revision 31ad2639 (ceph): doc: add design doc for rbd layering
I'm still not sure about the names for the command line
operations, but they can be changed later if better ones
come...
Josh Durgin

06/30/2012

10:52 PM rbd Documentation #2670: Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
Sage Weil
10:51 PM rbd Feature #2279 (Resolved): rbd: trivial layering design doc
Sage Weil
05:31 PM Revision 398a229c (ceph): filestore: set min flush size
If a write is smaller than some threshold, do not bother to flush it; let
the fs do that (efficiently, we hope) at co...
Sage Weil
11:34 AM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3450 Tamilarasi muthamizhan
02:56 AM Revision 0810ab6d (ceph): osdmap: check new pool name on rename
Ensure the new pool name doesn't already exist, both in the current and
project map.
Signed-off-by: Sage Weil <sage@...
Sage Weil
02:54 AM Revision 5a935509 (ceph): osd: handle pool name changes properly
* Remove the old name from the name->id map.
Fixes: #2676
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

06/29/2012

11:38 PM Revision d15385b0 (ceph): CrushTester: add Monte Carlo generator to more accurately simulate a
CRUSH mapping using a RNG
Signed-off-by: caleb miles <caleb.miles@inktank.com>
caleb miles
11:37 PM Revision 5de9af55 (ceph): crush: style changes
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
10:49 PM Revision cc4955a5 (ceph): PG: reset_recovery_pointers in activate if missing is empty
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:25 PM Revision 657e86c5 (ceph): vstart.sh: don't clobber keyring if not -n
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Dan Mick
09:51 PM Revision a8d7fd95 (ceph): mon: 'osd pool rename <oldname> <newname>'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:44 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3441... Tamilarasi muthamizhan
03:39 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3435 Sage Weil
03:37 PM Bug #2675: osd: segfault during log trim
and ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-28_19:00:12-regression-master-testing-gcov/3437 Sage Weil
03:33 PM Bug #2675: osd: segfault during log trim
also:... Sage Weil
03:30 PM Bug #2675 (Resolved): osd: segfault during log trim
... Sage Weil
09:22 PM Revision 02f1b0a6 (ceph): doc: document new 'osd crush move ...' command
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:16 PM Revision f5e3a675 (ceph): mon: fix 'osd crush move <item> <loc ...>'
- fix arg count check
- avoid updating crush map if it is already correct
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:02 PM Feature #2471 (Resolved): osd: add prefix match to OSDCaps
Sage Weil
09:00 PM Feature #2678 (Rejected): osd, objecter: redirect misdirected requests
Generic mechanism to refer the client to the correct osd when they misdirect their requests. This will allow the clu... Sage Weil
08:59 PM Bug #2676 (Resolved): mon: cannot create pool with old renamed name
commit:5a9355091296121823156de7d3160de45328a0cc Sage Weil
04:46 PM Bug #2676 (Resolved): mon: cannot create pool with old renamed name
Renaming a pool and then trying to create a new pool with the old name fails. Yehuda Sadeh
07:36 PM Revision e89dec78 (ceph): Merge branch 'master' of github.com:ceph/teuthology
Mark Nelson
07:36 PM Revision 2e5853f4 (ceph): Now using daemon-helper
Signed-off-by: Mark Nelson <nhm@clusterfaq.org> Mark Nelson
07:34 PM Revision 8d53965a (ceph): Merge branch 'wip-crushwrapper2'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
07:33 PM Revision 5e454bbd (ceph): mon: add 'osd crush move <name> <loc ...>' command
Move an existing bucket to a new position in the hierarchy.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:32 PM Revision a2d0cff1 (ceph): crush: add move_bucket() method
Move an existing bucket to a new location in the hierarchy.
Signed-off-by: Caleb Miles <caselim@gmail.com>
caleb miles
07:32 PM Revision d22529d1 (ceph): crush: get_loc -> get_immediate_parent
Signed-off-by: Caleb Miles <caselim@gmail.com> caleb miles
07:27 PM rbd Bug #2677 (Resolved): librbd: create does not clean up well
A create that fails part way through does not remove objects it created or undo modifications it does, for example ad... Josh Durgin
07:23 PM rbd Feature #2279 (Fix Under Review): rbd: trivial layering design doc
See wip-rbd-layering-doc Josh Durgin
06:08 PM Revision d7c18c13 (ceph): PG::merge_old_entry: handle clone prior_version case
A clone can have a prior_version after log_tail and still not have
a corresponding log entry since the prior_version ...
Samuel Just
06:06 PM Revision b907c881 (ceph): ReplicatedPG: adjust log.complete_to based on rmissing
log.complete_to can actually move backwards due to objects
missing because of divergent entries.
Signed-off-by: Samu...
Samuel Just
06:05 PM Revision 143afcf9 (ceph): ReplicatedPG: clear peer_(missing|log)_requested in check_recovery_sources
Otherwise, we might fail to re-request the information from a
previously down osd.
Signed-off-by: Samuel Just <sam.j...
Samuel Just
06:02 PM Revision 3321700a (ceph): qemu_iozone: use a larger image
The default is not large enough. Josh Durgin
05:41 PM Revision 090e5109 (ceph): PG: check_recovery_sources on each map
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:12 PM Revision 74b1468f (ceph): kernel suite
Sage Weil
03:26 PM Messengers Bug #2569: msgr: connect_rank crash
fix for this is in wip-msgr, still testing Sage Weil
02:16 PM RADOS Feature #2541 (Resolved): crush: move command to adjust non-leaf node position
Sage Weil
12:54 PM Feature #2575 (Resolved): perf: 0.48 numbers
Mark Nelson
12:53 PM Feature #2582 (Resolved): set up chart.io + mysql (or equivalent) infrastructure for tracking perf
Mark Nelson
12:51 PM Feature #2577 (Resolved): teuthology: blktrace task
Mark Nelson
12:29 PM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
Tried this on master. Although at first I triggered something else, the bottom line is that this works, and the monit... Joao Eduardo Luis
12:14 PM Subtask #2674 (Rejected): mon: Single-Paxos: mon commits suicide after remove&add
Yep. Makes sense. I was afraid this was caused by my changes.
Rejecting it then.
Joao Eduardo Luis
11:30 AM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
Yeah.. basically we're changing the mon's ip by removing and re-adding it, and the mon isn't smart enough to realize ... Sage Weil
11:12 AM Subtask #2674: mon: Single-Paxos: mon commits suicide after remove&add
I believe this is intended behavior, note the last line:... Greg Farnum
03:07 AM Subtask #2674 (Rejected): mon: Single-Paxos: mon commits suicide after remove&add
Pre-conditions:
3 mons: a=127.0.0.1:6789 ; b=127.0.0.1:6790 ; c=127.0.0.1:6791
* remove 'c' with ./ceph mon rem...
Joao Eduardo Luis
11:09 AM Bug #2646: mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
commit:840ae244499496d543d634713bdee7c7884ce527
The tick happened at the same time as slurping, which meant the di...
Greg Farnum
10:54 AM Bug #2646 (Resolved): mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
Sage Weil
10:53 AM Bug #2264 (Can't reproduce): mon: failed assert in bump_epoch
Sage Weil
06:19 AM Bug #2618: error: unable to open OSD superblock
Thanks, but that didn't help.
I did notice that drives get mounted a little weird.
Don't know if that's a problem...
John S
05:50 AM Revision c3a02ea0 (ceph): doc: radosgw: further simplify Apache rewrite rule
Sufficient to just pass the query string, drop params.
Signed-off-by: Florian Haas <florian@hastexo.com>
Florian Haas
05:25 AM Revision fb4ce538 (ceph): Merge pull request #16 from fghaas/doc-osd
doc: document osd tell bench Sage Weil

06/28/2012

10:06 PM Bug #2664 (Resolved): osd: extra attr _path, extra attr snapset from scrub
Sage Weil
07:34 PM Revision 76c657e3 (ceph): cls_rbd: add methods for dealing with rbd_directory objects
These hide the structure of the directory from clients, and provide
mappings from id to name and name to id. These ma...
Josh Durgin
07:33 PM Revision 5ae6e71f (ceph): cls_rbd: add {get,set}_id methods
These will be used on a new rbd id object to provide a level of
indirection so that header objects don't need to move...
Josh Durgin
07:33 PM Revision a1d477b9 (ceph): librbd: add indirection between name and header object
This adds a new rbd_id.$name object that contains the id of an image.
Rename the header in new-format images to rbd_...
Josh Durgin
07:33 PM Revision 41107078 (ceph): cls_rbd: use already defined local var instead of constant
This occurrence was meant to be replaced in the original commit.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:33 PM Revision de62c4c4 (ceph): objclass: add create method
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
07:07 PM Revision 0a6ee694 (ceph): doc: document osd tell bench
Signed-off-by: Florian Haas <florian@hastexo.com> Florian Haas
06:41 PM Revision 147114da (ceph): doc: minor edits.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:28 PM Revision 15ebf202 (ceph): rest-bench: mark request as complete later
We marked a request as complete in the callback, however
it might be that we're still inside S3_runall_request_contex...
Yehuda Sadeh
06:27 PM Revision 5c23d357 (ceph): buffer.h: fix operator> and operator>= increment error
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:14 PM Revision f81feade (ceph): schedule_suite.sh: drop -x
Sage Weil
06:14 PM Revision c6b2e93d (ceph): add cleanup-user.sh script
big hammer, use with care Sage Weil
04:53 PM Revision 335b918d (ceph): DBObjectMap: clones must inherit spos from parent
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:53 PM Revision cc1da958 (ceph): filestore: sync object_map object in lfn_remove when nlink > 1
In the following sequence:
1) create (a, 1)
2) setattr (a, 1)
3) link (a, 1), (b, 1)
4) remove (a, 1)
If we play 1-...
Samuel Just
04:47 PM Revision 8c453cce (ceph): cleaned up commented code
Signed-off-by: Mark Nelson <nhm@clusterfaq.org> Mark Nelson
11:29 AM Bug #2673 (Resolved): ReplicatedPG::prepare_transaction: don't crash on empty ops
Samuel Just
11:26 AM Cleanup #2672 (Rejected): PG::find_best_info cleanup
see 253033cd720db86e7c8372fd4184de7d4c43bce2 Samuel Just
11:26 AM Cleanup #2671 (Resolved): buffer.h: do efficient buffer comparisons
Samuel Just
10:15 AM rbd Documentation #2670 (Resolved): Docs shouldn't direct users to echo to /sys/bus/rbd for normal use
A naive user looking for "rbd map" will instead find this:
http://ceph.com/docs/master/rbd/rados-rbd-cmds/
with...
Anonymous
10:04 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Lots of work on the messenger client, but still not completely
clear this particular bug is fixed. There are a few ...
Alex Elder
09:42 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
I suppose this really ought to get fixed at some point.
For now, it looks like Sage has implemented a workaround
th...
Alex Elder
09:41 AM rbd Bug #1070: krbd: ^C doesn't work
No progress on this. None expected unless it gets
reprioritized and planned.
Alex Elder
09:40 AM Linux kernel client Feature #1699: debug symbols in autobuilt (sepia) kernels
No progress on this. I have a vague memory that someone
else might have looked at this problem a while back (Dan?)....
Alex Elder
09:39 AM Feature #2127: Save kernel core dumps on all of our test machines
My work on this was pretty much complete a few months ago.
It included a shell script that leverages Ubuntu kdump
...
Alex Elder
09:32 AM Linux kernel client Bug #2261 (Can't reproduce): paging error in libceph after crashed osd comes back online
the osd_client refcounting bug fix may explain this one, too... commit:0d47766f14211a73eaf54cab234db134ece79f49
an...
Sage Weil
09:16 AM Linux kernel client Bug #2261: paging error in libceph after crashed osd comes back online
No progress on this.
There has been a lot of work on the messenger code since this bug was
reported. One change ...
Alex Elder
09:31 AM Linux kernel client Cleanup #2130: ceph: xattr: complete cleanups following review
No progress on this, but I still have the patches. I'll
try to sneak them in as I'm working on RBD. I believe
the...
Alex Elder
09:29 AM Linux kernel client Cleanup #2131: ceph: xattr: use the generic kernel xattr code
No progress on this. It should be put on our roadmap as a task
to complete, maybe within the next 6 months.
Alex Elder
09:12 AM Bug #2267 (Closed): Ceph client crashed after shutting down one mds and osd
A recent fix supplied by Zheng Yan of Intel seems to have fixed
this problem, so I'm closing this bug.
rbd: C...
Alex Elder
09:05 AM rbd Feature #2326 (In Progress): krbd: use new class interfaces, new image format
I've finally begun work on this, following some in-person discussion
with Josh, Dan, and Sage this week.
I will u...
Alex Elder
09:00 AM Linux kernel client Feature #2374: ceph-client: start laying the groundwork for Linux tracepoints
No progress on this yet.
However, I got this e-mail from Jim Schutt shortly after creating
this bug, and just wan...
Alex Elder
08:44 AM Bug #2386: xfstests: failed #34
I've been trying to find out whether this is still a problem or
if it was transient. But teuthology has had a strin...
Alex Elder
07:46 AM Revision 218dd5ad (ceph): radosgw Apache configuration: simplify rewrite rule
Use a simpler RewriteRule as suggested here:
http://marc.info/?l=ceph-devel&m=133951986510085&w=2
Florian Haas
07:43 AM Revision 5a06af78 (ceph): Fix example radosgw Apache configuration
Follow RFC2606, use example.com everywhere. Florian Haas
07:41 AM Linux kernel client Bug #2424 (Resolved): ceph-client: messenger: badness in prepare_write_connect()
This bug was fixed in May, by a small series of changes that
culminated in this one:
commit 3da54776e2c0385c3...
Alex Elder
07:37 AM Linux kernel client Cleanup #2432: ceph-client: messenger: refactor to simplify state model
I had worked out on paper some notes about a longer-term state/event
model that could be used for the client messeng...
Alex Elder
07:33 AM Linux kernel client Cleanup #2432: ceph-client: messenger: refactor to simplify state model
I worked on doing this for a good month but the job really isn't
complete. Nevertheless I think there was some prog...
Alex Elder
07:23 AM Linux kernel client Cleanup #2438: ceph-client: use BUG_ON() for null auth_client->ops pointers
Touching all my bugs today. This one's a good idea but
very low priority.
Alex Elder
07:20 AM rbd Bug #2608: rbd: hung xfstest 270
Just to summarize what I just added...
There are some recent XFS problems that might explain this,
irrespective o...
Alex Elder
07:16 AM rbd Bug #2608: rbd: hung xfstest 270
I looked at this on Tuesday, and sent a note to Sage that should
have instead been put here. Here it is.
I w...
Alex Elder
07:18 AM Revision f969b59a (ceph): cls_rbd: fix signed-ness warning
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
07:18 AM Revision 6a5c1555 (ceph): objclass: remove unused variable from cls_cxx_map_get_vals
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
04:54 AM Feature #2668 (Resolved): Build linux-tools-common package for perf
It'd be really nice if we built linux-tools-common with our gitbuilder kernels so we can install perf on our test box... Mark Nelson
03:25 AM Revision 89492329 (ceph): debian: build with libnss instead of crypto++
Signed-off-by: Laszlo Boszormenyi (GCS) <gcs@debian.hu> Laszlo Boszormenyi (GCS)
03:25 AM Revision 0d9b558f (ceph): debian: arch linux-any
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:25 AM Revision 915f3210 (ceph): debian: move mount.ceph and cephfs into ceph-fs-common
Based on patches from Laszlo Boszormenyi (GCS) <gcs@debian.hu>.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:25 AM Revision 9d6013e0 (ceph): debian: move metadata server into ceph-mds
Also adjust the recommends and depends, so that libcephfs1 and ceph-fuse
hang off of ceph-mds instead of ceph.
Signe...
Sage Weil
12:49 AM Revision 9d7f0480 (ceph): doc/config-cluster/authentication: keyring default locations, simplify ...
- keyrings have new default locations that everyone should use.
- the user key setup is vastly simplified if you use ...
Sage Weil
12:38 AM Revision 1a43c344 (ceph): Added blktrace task
Signed-off-by: Mark Nelson <nhm@clusterfaq.org> Mark Nelson

06/27/2012

11:28 PM Revision f3661739 (ceph): pg: report scrub status
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
11:28 PM Revision c94583e5 (ceph): pg: track who we are waiting for maps from
Signed-off-by: Mike Ryan <mike.ryan@inktank.com> Mike Ryan
11:28 PM Revision ecd7ffe7 (ceph): pg: reduce scrub write lock window
Wait for all replicas to construct the base scrub map before finalizing
the scrub and locking out writes.
Signed-off...
Mike Ryan
11:20 PM Revision 16d55651 (ceph): mon: MonmapMonitor: Use default port when the one specified on 'add' is zero
Fixes a bug triggered by using the ceph tool to 'mon add' with a port set
to zero. We now default to the monitor's de...
Joao Eduardo Luis
10:59 PM Revision 04e1c301 (ceph): rest-bench: mark request as complete later
We marked a request as complete in the callback, however
it might be that we're still inside S3_runall_request_contex...
Yehuda Sadeh
08:48 PM Revision c5d19b6d (ceph): rgw-admin: use correct modifier with strptime
Bug #2658: used %I (12h) instead of %H (24h)
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
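The difference between the two modifiers can be illustrated with a short sketch. This uses Python's datetime.strptime rather than the C strptime() call in rgw-admin, and the timestamp and helper name are hypothetical; it only shows why a 12-hour modifier rejects hours above 12.

```python
from datetime import datetime


def parse_hour(text, fmt):
    """Return the parsed hour, or None if the text does not match fmt."""
    try:
        return datetime.strptime(text, fmt).hour
    except ValueError:
        return None


# Hypothetical timestamp with an hour greater than 12.
stamp = "2012-06-26 17:00:00"

# %I is the 12-hour clock modifier and only accepts 01-12.
hour_12h = parse_hour(stamp, "%Y-%m-%d %I:%M:%S")

# %H is the 24-hour clock modifier and accepts 00-23.
hour_24h = parse_hour(stamp, "%Y-%m-%d %H:%M:%S")
```

With %I the parse fails for any hour above 12, which matches the symptom reported in Bug #2658.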
06:10 PM Bug #2618: error: unable to open OSD superblock
I noticed an issue in your ceph.conf - you have keyring = /etc/ceph/keyring.admin in the global section, and the osd ... Josh Durgin
05:19 PM rbd Bug #2667 (Won't Fix): librbd: create_snap on a closed image segfaults
I wrote silly code, and in reordering it, managed to attempt rbd_snap_create() on an
image that I had rbd_close()d. ...
Dan Mick
05:13 PM Feature #2651: mon: race calling tick() when doing slurping
oops, stronger fix, yes! Sage Weil
05:13 PM Feature #2651 (Resolved): mon: race calling tick() when doing slurping
Sage Weil
05:01 PM Feature #2661 (Resolved): mon: do not allow monitors to be added to the map with port 0
Merged into dho and next. Thanks Joao! Greg Farnum
11:25 AM Feature #2661 (Resolved): mon: do not allow monitors to be added to the map with port 0
Last week, somebody used the "ceph mon add" command without specifying a port, and it defaulted to port 0. This cause... Greg Farnum
04:48 PM Feature #2666 (Resolved): rados tool: copy pool
A new operation to copy the entire content of a pool into a different pool. For each object we'd copy the locator, da... Yehuda Sadeh
04:04 PM rgw Bug #2665 (Resolved): rest-bench hangs periodically
rest-bench seems to hang periodically, with the following spit out to the console on a regular basis:
plana83: 2012-06...
Mark Nelson
04:04 PM Bug #2656 (Rejected): rados-bench hangs periodically
Mark Nelson
04:03 PM Bug #2656: rados-bench hangs periodically
gah,
this is what I get for submitting bugs at the end of the day. You are correct, rest-bench.
Mark Nelson
03:29 PM devops Feature #2587 (Resolved): sepia: isolated networking on vercoi (manual, a handful)
Anonymous
03:28 PM devops Feature #2587: sepia: isolated networking on vercoi (manual, a handful)
Confirmed: isolated0..isolated9 work even if Crowbar wants to put VLANs in them. They pass between vercoi as packets ... Anonymous
02:17 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
More on where that snippet should live:
- for standalone chef deployment, we want the admin to run something similar,...
Anonymous
02:14 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
This python snippet creates ceph keys in the right format (for now). Where it should live is still an open question.
...
Anonymous
01:38 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
Without this, multi-mon bring-up is racy.
At proposal save time, the barclamp should inspect the roles, and assign...
Anonymous
02:12 PM Bug #2664: osd: extra attr _path, extra attr snapset from scrub
full logs at metropolis:~sage/bug-2664 Sage Weil
02:11 PM Bug #2664 (Resolved): osd: extra attr _path, extra attr snapset from scrub
... Sage Weil
02:10 PM Revision 17f433aa (ceph): OSD: disconnect_session_watches: handle race with watch disconnect
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Tested-by: Stefan Priebe ...
Samuel Just
01:43 PM devops Feature #2663 (Closed): crowbar: UI for setting generic ceph.conf values
This needs to be some sort of an extensible list of key: value pairs.
Do we need to support sections too? Probably...
Anonymous
01:17 PM devops Feature #2589 (Resolved): crowbar: Update barclamp-ceph for Essex, new ceph-cookbooks
Tyler reported success as of b2c5d3307eef0ca44fd4b001136e9af043b322bd. Anonymous
01:16 PM devops Feature #2588: downburst: multiple, configurable networks to libvirt
For historical value: https://github.com/ceph/downburst/commit/de494eeefad0f0c72916d5dab8ba015b441a94f0 Anonymous
11:30 AM devops Feature #2588 (Resolved): downburst: multiple, configurable networks to libvirt
Anonymous
11:26 AM Linux kernel client Bug #2590: possible irq lock inversion dependency with con->mutex and osdc->request_mutex
Recent log location: /a/teuthology-2012-06-27_00:00:07-regression-next-testing-basic/3076
2012-06-27T01:25:05.11...
Tamilarasi muthamizhan
10:17 AM rbd Feature #2660 (New): qa: test resizing an rbd image while a vm has it open
Make sure the resize is visible to the guest. This works with the virtio driver after doing e.g. 'echo 1 | sudo tee /... Josh Durgin
10:02 AM Subtask #2659 (Can't reproduce): mon: Single-Paxos: ceph tool -w subscriptions not being updated
how to reproduce:... Joao Eduardo Luis

06/26/2012

07:30 PM Revision e34dfc3b (ceph): doc: removing old 'todo' configuration reference.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:24 PM Revision efb74a90 (ceph): doc: converted daemon references from nasty tables to lines.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:16 PM rgw Bug #2658 (Resolved): rgw-admin: usage show fails when specifying hour > 12
Using the wrong modifier for parsing it. Yehuda Sadeh
05:11 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
possibly fixed by commit:0d8970fc813b33e7c6ba2484fbc43cce947d3f4d Sage Weil
04:31 PM CephFS Bug #2657 (Resolved): kclient: direct io write larger than 8MiB fails
Writes larger than 8MiB get EFAULT, e.g.:... Josh Durgin
03:28 PM Revision 4142ac44 (ceph): doc: a few minor tweaks to radosgw to cleanup navigation.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:21 PM Revision 0be26111 (ceph): doc: Published RADOS GW docs and install as is for now. Needs more work.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:13 PM Bug #2656: rados-bench hangs periodically
rados-bench or rest-bench? Yehuda Sadeh
01:27 PM Bug #2656 (Rejected): rados-bench hangs periodically
rados-bench seems to hang periodically, with the following spit out to the console on a regular basis:
plana83: 2012-0...
Mark Nelson
01:45 PM Bug #2563 (Can't reproduce): leveldb corruption
It looks like one of the leveldb store files was corrupted, possibly by the filesystem. It may be possible to recove... Samuel Just
09:36 AM Bug #2655 (Resolved): scrub slows writes more than it should
Samuel Just
09:34 AM Subtask #2616 (Closed): mon: Single-Paxos: AuthMonitor: key_server has no entries
Joao Eduardo Luis
09:34 AM Subtask #2616 (Resolved): mon: Single-Paxos: AuthMonitor: key_server has no entries
Joao Eduardo Luis
09:33 AM Subtask #2620 (Closed): mon: Single-Paxos: MDSMonitor: MMDSBeacon from entity with insufficient p...
Note: turns out this was the same bug as #2643
Had to do with the AuthMonitor losing some information when reading versi...
Joao Eduardo Luis
09:32 AM Subtask #2643 (Closed): mon: Single-Paxos: mds: Strange message behavior on peon
Had to do with the AuthMonitor losing some information when reading versions from the store.
This is fixed.
Joao Eduardo Luis
09:01 AM Linux kernel client Bug #2523: xfs: xfs_iolock_reclaimable
... Sage Weil
07:05 AM Revision 93425734 (ceph): CrushTester.cc: removed last broken bits of obsolete method
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
07:02 AM Revision 6d551c17 (ceph): CrushTester.cc: removed obsolete method for collapsing a CRUSH map
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
06:58 AM Revision df7729e5 (ceph): CrushTester: eliminated compiler warning
Signed-off-by: caleb miles <caselim@gmail.com> caleb miles
06:39 AM Revision 88e31546 (ceph): radosgw-admin: improve man page
* remove "OpenStack user" information (deprecated, should no
longer be used. No reason to keep mentioning it)
* fix...
Florian Haas
06:37 AM Revision 63b562f9 (ceph): doc: explain how to configure Ceph for radosgw
* explain creating auth creds for radosgw
* explain Apache config for radosgw
* explain starting daemons for radosgw
...
Florian Haas
06:15 AM rbd Bug #2654 (Won't Fix): Stale rbd volume cannot be unmapped
/dev/rbd0 exists in system but /dev/rbd/winnie-test/postgresql not... Maciej Galkiewicz
06:14 AM Revision d82357df (ceph): Merge branch 'master' of github.com:ceph/ceph
caleb miles
04:21 AM Revision 1db84ddd (ceph): include ceph task in librbd collection
Sage Weil
03:45 AM Revision 07029a4f (ceph): rgw: send both swift x-storage-token and x-auth-token
older clients need x-storage-token, newer x-auth-token
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
03:45 AM Revision 80a939a9 (ceph): rgw: radosgw-admin date params now also accept time
The date format now is "YYYY-MM-DD[ hh:mm:ss]". Got rid of
the --time param for the old ops log stuff.
Signed-off-by...
Yehuda Sadeh
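A date parameter of the form "YYYY-MM-DD[ hh:mm:ss]" could be handled roughly as follows. This is a minimal Python sketch, not the radosgw-admin implementation; the function name parse_date_param is hypothetical, and it assumes a missing time means midnight.

```python
from datetime import datetime


def parse_date_param(text):
    """Parse 'YYYY-MM-DD' with an optional ' hh:mm:ss' suffix.

    Assumption: when the time part is omitted, midnight is used.
    """
    # Try the longer format first so a full timestamp is not rejected.
    for fmt in ("%Y-%m-%d %H:%M:%S", "%Y-%m-%d"):
        try:
            return datetime.strptime(text, fmt)
        except ValueError:
            continue
    raise ValueError(f"expected YYYY-MM-DD[ hh:mm:ss], got {text!r}")


date_only = parse_date_param("2012-06-26")
date_time = parse_date_param("2012-06-26 17:16:00")
```

Trying the formats in order keeps the date-only form valid while accepting the optional time.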
03:45 AM Revision c87c83f3 (ceph): rgw-admin: fix usage help
s/show/trim
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh

06/25/2012

10:30 PM Revision aa89e6ab (ceph): move kclient_workunit_suites_ffsb to marginal suite
until #1947 is fixed Sage Weil
10:20 PM Revision cc380dee (ceph): ignore DEADLOCK line inside lockdep splat
Sage Weil
10:14 PM Revision 77fcf064 (ceph): upstart: fix regex
Signed-off-by: Tommi Virtanen <tv@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Tommi Virtanen
10:01 PM rbd Bug #2608: rbd: hung xfstest 270
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-06-23_00:00:02-regression-next-testing-basic/1471
m...
Sage Weil
09:56 PM Bug #2536 (Need More Info): librados crashed while getting stat of an object
Sage Weil
09:56 PM Bug #2536: librados crashed while getting stat of an object
Have you seen this problem since then? It looks like it could be due to racing with rados startup or shutdown... Sage Weil
09:45 PM Revision 0cd21ea1 (ceph): Merge branch 'next'
Sage Weil
09:45 PM Revision 840ae244 (ceph): mon: don't tick the PaxosServices if we are currently slurping.
They aren't prepared to deal with the on-disk state being inconsistent.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:44 PM Revision ef6beec9 (ceph): objecter: do not feed session to op_submit()
The linger_send() method was doing this, but it is problematic because the
new Op doesn't get its pgid or acting vect...
Sage Weil
09:41 PM Bug #2346 (Resolved): xfs filesystem on top of rbd volume corrupts
No news is good news! Sage Weil
09:40 PM Bug #2602 (Resolved): osd: push failed because local copy is X
Sage Weil
05:09 PM Messengers Bug #2569: msgr: connect_rank crash
All three mon nodes and a client node on the second aging cluster died over the weekend (kernel and all). Looks like ... Mark Nelson
10:25 AM Messengers Bug #2569: msgr: connect_rank crash
Saw the following while debugging my aging test scripts. Seems to have happened when the mon was started. No core d... Mark Nelson
03:33 PM Bug #2649: osd: log bound mismatch
... Sage Weil
03:31 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
moved test to marginal suite; move back to regression when this is resolved! Sage Weil
03:31 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
ubuntu@teuthology:/a/teuthology-2012-06-24_00:00:07-regression-next-testing-basic$ ... Sage Weil
03:28 PM Bug #2593: logmonitor: decode failure
I wonder if this is also due to tick() colliding with slurping — the first one definitely could be (not sure about th... Greg Farnum
03:27 PM Bug #2653 (Resolved): Web docs point to obsolete "fusermount" page
The page http://ceph.com/docs/master/man/8/mount.ceph/ has a link at the bottom that points to "fusermount" descript... Ken Franklin
03:21 PM Bug #2618: error: unable to open OSD superblock
attaching my ceph.conf.
Can't get to IRC from work - I'll try in the evenings.
thanks
John S
02:54 PM rgw Bug #2652 (Resolved): Segmentation fault in rest-bench
This happened while running rest-bench during aging tests on the burnupi cluster.
--
plana83: *** Caught signal...
Mark Nelson
02:48 PM Bug #2022 (Resolved): osd: misdirected request
Sage Weil
02:40 PM Feature #2651 (Rejected): mon: race calling tick() when doing slurping
Right now the monitor calls tick() on all the PaxosService implementations when it's doing slurping. This introduces ... Greg Farnum
09:19 AM rgw Bug #2650 (Resolved): rgw: swift key creation overrides subuser access mask
# radosgw-admin subuser create --uid=johndoe --subuser=johndoe:swift
--access=full
{ "user_id": "johndoe",
"rados...
Yehuda Sadeh

06/24/2012

08:33 PM Revision 4e45d60f (ceph): ObjectStore::Transaction: initialize pool_override in all constructors
use_pool_override and pool_override weren't initialized in these two
constructors.
Signed-off-by: Samuel Just <sam.j...
Samuel Just

06/23/2012

05:34 PM Revision ff67210e (ceph): objecter: do not feed session to op_submit()
The linger_send() method was doing this, but it is problematic because the
new Op doesn't get its pgid or acting vect...
Sage Weil
04:56 PM Bug #2649 (Resolved): osd: log bound mismatch
... Sage Weil

06/22/2012

07:14 PM Bug #2648 (Resolved): removing a monitor from the map while it's running causes a crash
... Greg Farnum
05:27 PM Bug #2647 (Can't reproduce): osd: old request, waiting for subops
primary:... Yehuda Sadeh
11:43 AM Bug #2618: error: unable to open OSD superblock
John, can we see your ceph.conf file? If you have time, try chatting in #ceph on irc.oftc.net as well; perhaps we ca... Dan Mick
11:30 AM Bug #2646 (Resolved): mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
... Yehuda Sadeh
08:17 AM Subtask #2645 (Rejected): mon: Single-Paxos: Could not decrypt ticket info (immediately after run...
There was a lingering monitor still running, from a previous install.
Apparently, holding the wrong keys will lead...
Joao Eduardo Luis
08:09 AM Subtask #2645 (Rejected): mon: Single-Paxos: Could not decrypt ticket info (immediately after run...
... Joao Eduardo Luis
02:23 AM Revision 48e8e0a7 (ceph): Add script to create a vm image with extra packages
Josh Durgin
01:44 AM Revision 38f6a78c (ceph): Add a task to run a test against rbd inside of qemu.
For now this task does not set up networking for the vm,
and simply runs an executable downloaded from a specified url...
Josh Durgin
01:18 AM Revision 94a6ab8f (ceph): Add some tests inside qemu for the librbd suite
Josh Durgin
01:16 AM Revision a92306a4 (ceph): Move librbd tests to rbd suite
This lets us generate jobs with different caching settings instead of
hardcoding them.
Josh Durgin
12:24 AM Bug #2602: osd: push failed because local copy is X
Hi Sage,
just updated to your wip_rolling_upgrade branch.
FileStore update worked ( 100GB => 30 minutes on XFS ) ...
Simon Frerichs
12:10 AM Revision c399d903 (ceph): mon: conditionally encode PGMap[::Incremental] with quorum features
This allows a mon cluster to transition to the new encoding during a
rolling upgrade.
Signed-off-by: Sage Weil <sage...
Sage Weil
12:10 AM Revision de5b3236 (ceph): mon: conditionally encode mon features for remote mon
The only time we encode these is when forwarding messages. Encoding using
the destination's feature set.
Signed-off...
Sage Weil
12:10 AM Revision 77d836c5 (ceph): mon: encoding new monmap using quorum feature set
It is probably unlikely that someone will expand the mon cluster with a
mixed feature set, but we know the quorum fea...
Sage Weil
12:10 AM Revision 80649d08 (ceph): mon: note that monmap may be reencoded later
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:10 AM Revision 9fcc3dee (ceph): osd_types.cc: remove hobject_t decode asserts
These asserts were useful for ensuring that pool is passed
in in the correct places, but they prevent the encoder
tes...
Samuel Just
12:10 AM Revision 2fe98163 (ceph): OSD,PG,ObjectStore: handle messages with old hobject_t encoding
Messages that embed an hobject_t need to have the pool field fixed
on messages from old peers.
Signed-off-by: Samuel...
Samuel Just
12:10 AM Revision 2355b233 (ceph): mon: conditionally encode old monmap when peer lacks feature
This allows a rolling upgrade from 0.47.2 to 0.48.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:10 AM Revision 06288a9d (ceph): mon: track intersection of quorum member features
When we form a quorum, also note the intersection of the quorum members'
feature bits. This will inform decisions ab...
Sage Weil
12:10 AM Revision 0aaf7334 (ceph): mon: conditionally encode auth incremental with quorum feature bits
If the quorum does not yet all have the MONENC feature, stick to the old
encoding.
It might be more polite to requir...
Sage Weil
 
