Activity
From 07/05/2012 to 08/03/2012
08/03/2012
- 06:31 PM Bug #2909 (Resolved): the rados_trunc function did not get implemented in rados.py Python
- Thanks! Added in commit:43291951fad241a6d3f8b8daa37d3665c9d842d6, with a simple test and spacing normalized to the re...
- 03:58 PM Bug #2909: the rados_trunc function did not get implemented in rados.py Python
- Yes you may.
- 03:35 PM Bug #2909: the rados_trunc function did not get implemented in rados.py Python
- That looks good to me. Can I add your signed-off-by to the patch?
- 01:53 PM Bug #2909 (Resolved): the rados_trunc function did not get implemented in rados.py Python
- This code seems to work in the Ioctx class:
    def trunc(self, key, size):
        self.require_ioctx_open()
        ...
- 02:23 PM devops Feature #2910: crowbar: Use JBOD mode for ceph-osd
- There's a map in the deployer object from role name to BIOS and RAID configuration to set on the node. We can add cep...
- 02:22 PM devops Feature #2910 (Closed): crowbar: Use JBOD mode for ceph-osd
- 11:38 AM Bug #2908 (Resolved): ceph osd crush remove <name>
- (11:34:50 AM) Kyle Bader: so it looks like ceph -h is missing crush rm
(11:34:54 AM) Kyle Bader: could we add
(11:...
- 07:56 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
- This task was superseded by task #2756, which provides a much broader implementation, directly using the available ...
- 07:45 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
- 07:44 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
- 07:44 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
- 07:43 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
08/02/2012
- 11:03 PM Revision 3671a37e (ceph): doc: Explicitly state mount.ceph two first arguments are fixed.
- I've hurt myself too many times with this.
Signed-off-by: Tommi Virtanen <tv@inktank.com>
- 11:03 PM Revision 17ad4e18 (ceph): doc: mount.ceph is also needed for cephx.
- Signed-off-by: Tommi Virtanen <tv@inktank.com>
- 10:56 PM Revision c961a20e (ceph): doc: Fix toctree structure for man obsync(1).
- Signed-off-by: Tommi Virtanen <tv@inktank.com>
- 08:31 PM Revision 5db3a9e7 (ceph): rgw_admin.cc: Disallow addition of S3 keys with subuser creation
- Fixes: #1855
It is no longer possible to create a subuser and new S3 key associated
with that user through the radosg...
- 08:02 PM Revision 4e40a784 (ceph): ceph-authtool: Fix usage, it's --print-key not --print.
- Signed-off-by: Tommi Virtanen <tv@inktank.com>
- 06:14 PM Revision 8665bdc1 (ceph): task: scrub OSDs periodically
- Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
- 06:14 PM Revision 3b85b231 (ceph): task: verify scrub detects files whose contents changed
- Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
- 06:00 PM Revision 25de5e59 (ceph): Revert "osd: peering: detect when log source osd goes down"
- This reverts commit 0d3d75ec0d17e2f6bc5d42058f51eb2ff19ffb71.
Accidental double push.
- 04:51 PM Bug #2907: rados benchmarking tool which does not always do creates
- Why not record the raw data and let other tools produce percentiles and other statistics?
- 04:44 PM Bug #2907 (Resolved): rados benchmarking tool which does not always do creates
- Features:
Pluggable distribution for choosing objects (zipfian?, random?, sequential?)
configurable numbe...
- 04:09 PM Bug #2904 (Resolved): ceph-authtool: Adds keys on typos, expected error message
- ...
- 03:50 PM CephFS Feature #2903 (Resolved): ceph-fuse: Support -o noallow_other
- Currently, ceph-fuse hardcodes the -o allow_other option to FUSE_ARGS_INIT.
https://github.com/ceph/ceph/blob/5db3...
- 03:35 PM Revision 93cbab73 (ceph): doc: Simplify submodules explanation.
- ``git clone --recursive`` does ``init`` & ``update`` for us. Also
avoids incorrect language; there never were submodu...
- 03:35 PM Revision 48de9b5b (ceph): doc: Say what to do if submodules are out of date.
- Signed-off-by: Tommi Virtanen <tv@inktank.com>
- 03:35 PM Revision 203dffad (ceph): doc: cd to repository before running git status.
- Signed-off-by: Tommi Virtanen <tv@inktank.com>
- 03:33 PM Revision 6af560de (ceph): doc: Correct Git URL for clone
- Using git@github.com would mean SSHing in, and your average open
source consumer probably won't have a Github account...
- 01:35 PM rgw Bug #2841 (Fix Under Review): rgw: fix usage trim
- 01:31 PM rgw Bug #1855 (Resolved): Creation of a subuser that appears to own an s3 key is possible, and removi...
- Commit 5db3a9e71c6b757660d0702efada40af6be63eb8 pushed. We disallow creating s3 key when subuser is created in order ...
- 01:27 PM devops Feature #2398: chef: external osd journal support
- Shuffling old notes here:
see if "osd journal" was overridden in $cluster.conf; if yes, do not attempt discovery
...
- 12:59 PM rgw Feature #2869 (Fix Under Review): rgw: expand date format support
- 12:59 PM rgw Bug #2877 (Fix Under Review): rgw: ETag parsing in complete multipart upload should xml decode ETag
- 12:59 PM rgw Bug #2878 (Fix Under Review): rgw: chunked encoding for POST requests (e.g., complete multipart u...
- 12:59 PM rgw Bug #2879 (Fix Under Review): rgw: xml parser doesn't work correctly with escape sequences
- 11:50 AM Bug #2902 (Resolved): common lib tries to open literal ~/.ceph/ceph.conf
- ...
- 11:48 AM Bug #2901 (Resolved): librados-config should not read ceph.conf
- ...
- 11:38 AM Bug #2900 (Resolved): ceph fuse crashed
- Logs: ubuntu@teuthology: /a/teuthology-2012-07-27_19:00:07-regression-master-testing-gcov/1581
Core file: /a/teuthol...
- 11:09 AM Bug #2897 (Resolved): ceph fuse error segfault
- ...
- 11:03 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
- 10:52 AM Bug #2823: osd: out of order ACKs
- Log location: ubuntu@teuthology:/a/teuthology-2012-08-01_19:00:04-regression-master-testing-gcov/4196
ubuntu@teuth...
- 10:50 AM Bug #2823: osd: out of order ACKs
- (10:46:42 AM) tamil.muthamizhan@newdream.net: 4196: (1138s) collection:rados-thrash clusters:6-osd-3-machine.yaml fs:...
- 10:45 AM Bug #2823 (New): osd: out of order ACKs
- 10:10 AM Bug #2887: pjd open/08.t failed test 2
- recent logs: ubuntu@teuthology:/a/teuthology-2012-08-01_19:00:04-regression-master-testing-gcov/4126
- 10:00 AM Bug #2896 (Won't Fix): ceph pg dump has empty hb_out field
- I was looking at "ceph pg dump" output today on a patched argonaut build and saw that while all the osd stat outputs ...
- 12:17 AM Revision 0d3d75ec (ceph): osd: peering: detect when log source osd goes down
- The Peering state has a generic check based on the prior set osds that
will restart peering if one of them goes down ...
08/01/2012
- 11:34 PM Revision 4bd466d6 (ceph): osd: peering: detect when log source osd goes down
- The Peering state has a generic check based on the prior set osds that
will restart peering if one of them goes down ...
- 06:31 PM Bug #2895: cli: non-existent command returns confusing error message
- ...and ceph osd map rbd/rbd_info returns 'unknown command map', which is just wrong;
the problem is the argument nee...
- 05:33 PM Bug #2895 (Resolved): cli: non-existent command returns confusing error message
- 'ceph osd crush get' returns 'unknown command crush', instead of the full command.
http://www.spinics.net/lists/ce...
- 05:28 PM Feature #2894 (Resolved): cli: help command for ceph subsystems
- To make commands and their usage discoverable and easy to look up, each subsystem could provide a help command
that ...
- 04:55 PM Revision ca2c3813 (ceph): osd: peering: detect when log source osd goes down
- The Peering state has a generic check based on the prior set osds that
will restart peering if one of them goes down ...
- 04:50 PM Bug #2887: pjd open/08.t failed test 2
- Also, ubuntu@teuthology:/a/teuthology-2012-07-31_19:00:04-regression-master-testing-gcov/3654
- 04:47 PM Bug #2887: pjd open/08.t failed test 2
- Also, ubuntu@teuthology:/a/teuthology-2012-08-01_00:01:38-regression-next-testing-basic/3784
- 04:43 PM Bug #2887: pjd open/08.t failed test 2
- Recent logs: ubuntu@teuthology:/a/teuthology-2012-08-01_02:00:04-regression-testing-testing-basic/3909
- 01:51 PM Bug #2887 (Resolved): pjd open/08.t failed test 2
- pjd open/08.t failed test 2 on both ceph-fuse and kclient.
Logs:- ubuntu@teuthology:/a/teuthology-2012-07-31_02:00...
- 04:34 PM devops Feature #2893 (Closed): crowbar: Nested virtualization for running OpenStack in vercoi vm
- 04:30 PM Bug #2892 (Resolved): ceph health detail kills monitor
- Executed the following:
ubuntu@burnupi30:~$ sudo ceph health detail
Wait awhile and ceph will start to output t...
- 03:10 PM Bug #2891 (Can't reproduce): heap profiler hangs when trying to start it up on the mon
- We tried to turn heap profiling on the mon (congress), however the last thing we see in the logs is the message that ...
- 02:24 PM Bug #2890 (Resolved): monitor: "recognize" heap commands
- The monitor accepts the standard heap profiler commands, but it tells the user it doesn't due to not setting return c...
- 01:56 PM devops Feature #2889 (Closed): crowbar: script for easily packaging ceph barclamp
- 01:53 PM devops Feature #2888 (Closed): crowbar: Make VM disk IO cache writes for performance
- 01:38 PM devops Documentation #2886 (Rejected): doc: crush location tricks, ceph.conf, automatic host=
- - how it autoupdates on osd startup
- how hosts won't migrate from one container to another automatically
- 01:37 PM devops Feature #2885 (Resolved): doc: mon initial members requirements, functioning, admin steps to take
- 01:36 PM devops Feature #2884 (Rejected): doc: osd hotplugging
- 01:34 PM devops Feature #2883 (Rejected): chef: union lists and maps in env vs node ceph.conf json
- As an admin, I want to specify in environment
"osd crush location": {
"datacenter": "westcoast",
}
an...
- 01:33 PM devops Documentation #2882 (Rejected): doc: chef environment ceph.conf content tricks
- 01:32 PM devops Feature #2881 (Rejected): doc: chef cookbook better README, internal structure, assumptions
- 01:29 PM devops Feature #2704 (Closed): sepia: Use ``names`` as resolver on plana, burnupi, vercoi
- dnscache01 and dnscache02 are happily serving anything that uses DHCP to get its configuration.
There may be stati...
- 01:20 PM devops Feature #2880 (Rejected): chef: use get-or-create instead of get-or-create-key
- ceph.git commit 4551808fa00b812fee6e0c196fd333eca0b06de9 adds "ceph auth get-or-create". Switch to using it in ceph-c...
- 01:10 PM rgw Bug #2877: rgw: ETag parsing in complete multipart upload should xml decode ETag
- There are two different issues here. The first one is that we don't remove the quotes when comparing the etags. The s...
- 12:55 PM rgw Bug #2879 (Resolved): rgw: xml parser doesn't work correctly with escape sequences
- e.g., when providing data with "&quot;", the entity is getting clobbered.
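A minimal Python sketch of the two rgw issues above (#2877 and #2879): XML entities in an ETag are decoded first, and the surrounding quotes are dropped before comparing. This is not the rgw C++ code; `normalize_etag` and `etags_match` are hypothetical names used only for illustration.

```python
from xml.sax.saxutils import unescape

# Hypothetical helper (not part of rgw): decode XML entities in an ETag
# value, then strip the surrounding double quotes.
def normalize_etag(raw: str) -> str:
    # unescape() handles &amp;, &lt; and &gt; by default; the quote
    # entities have to be passed in explicitly.
    decoded = unescape(raw.strip(), {"&quot;": '"', "&apos;": "'"})
    return decoded.strip('"')

# Hypothetical comparison: both sides are normalized before the check,
# so quoted, entity-escaped and bare forms compare equal.
def etags_match(client_etag: str, stored_etag: str) -> bool:
    return normalize_etag(client_etag) == normalize_etag(stored_etag)
```

With this normalization, an ETag sent as `&quot;abc&quot;`, as `"abc"`, or as bare `abc` compares equal to a stored `abc`.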
- 03:37 AM Revision c2ff66eb (ceph): crank up pjd debugging
07/31/2012
- 10:49 PM Revision bb6e0d0e (ceph): wireshark: update patch
- Update to latest source tree (svn 43768).
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
- 09:44 PM Bug #2873 (Fix Under Review): Stack trace thrown when using obsync
- 06:18 PM Bug #2873: Stack trace thrown when using obsync
- Figured out what the problem is, it appears that on L111, it should go from being...
- 11:27 AM Bug #2873 (Resolved): Stack trace thrown when using obsync
- ...
- 09:26 PM Revision deec81b4 (ceph): ReplicatedPG: clear waiting_for_ack when we send the commit
- Otherwise, we might send the ack anyway later, after a subsequent
commit is sent resulting in an out-of-order op.
Th...
- 08:51 PM Revision e0e72d78 (ceph): Merge remote-tracking branch 'upstream/wip-leveldb-iterators'
- 06:55 PM Revision 87defa88 (ceph): rbd: fix off-by-one error in key name
- Fixes: #2846
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
- 06:54 PM Revision 37d5b462 (ceph): secret: return error on empty secret
- Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
- 06:48 PM Revision f22b95da (ceph): rbd: fix off-by-one error in key name
- Fixes: #2846
Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
- 06:48 PM Revision e775ce52 (ceph): secret: return error on empty secret
- Signed-off-by: Sylvain Munaut <tnt@246tNt.com>
- 04:55 PM Revision cd0b5278 (ceph): separate regression suite into topical categories rados, rbd, fs
- 03:38 PM RADOS Bug #2874: apparent CRUSH mapping failure
- check if setting the tunables all to 0 makes it go away
- 11:40 AM RADOS Bug #2874 (Resolved): apparent CRUSH mapping failure
- While doing crowbar tests, I created a 3-OSD cluster (on separate VMs) that ended up with 6 degraded PGs....
- 03:36 PM rgw Bug #2504 (In Progress): rgw: use multiple notifications objects
- 03:35 PM rgw Bug #2878 (Resolved): rgw: chunked encoding for POST requests (e.g., complete multipart uploads)
- We shouldn't require length passed for these requests.
- 03:28 PM rgw Bug #2877 (Resolved): rgw: ETag parsing in complete multipart upload should xml decode ETag
- Should be able to accept both:...
- 03:08 PM Bug #2876 (Resolved): mon: pg stuck peering (for example) broken?
- ...
- 02:01 PM Bug #2875 (Resolved): osd: pg stuck in GetLog
- we weren't checking if newest_update_osd went down (it could be outside the prior set)
- 12:43 PM Linux kernel client Bug #2573 (Resolved): libceph: many "socket closed" messages
- I was seeing this too, but with the latest code and all (knock wood) the races closed I'm not anymore. Going to opti...
- 11:49 AM Bug #2846 (Resolved): Malformed keyring file causes kernel null pointer deref on "rbd map"
- userland fixes applied to stable, next.
thanks!
- 11:42 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
- kernel patch is in testing branch.
- 06:23 AM Subtask #2805 (Fix Under Review): mon: Single-Paxos: Sync: Create a test unit to verify the corre...
- 06:22 AM Subtask #2805: mon: Single-Paxos: Sync: Create a test unit to verify the correctness of the whole...
- Currently available tests:
* Removing keys:
> * Using both the whole-space iterator and the whole-space snapshot ...
07/30/2012
- 08:43 PM Revision cda5e8e0 (ceph): PG,ReplicatedPG: clarify scrub state clearing
- scrub_clear_state takes care of clearing the SCRUB and REPAIR
flags. Thus, PG::scrub() needn't clear them again sinc...
- 08:39 PM Revision 6d464a21 (ceph): PG::mark_clean(): queue_snap_trim if snap_trimq is not empty
- Currently, we won't queue for snap trim until the next map
update.
Noticed while reviewing another patch, this would...
- 08:38 PM Revision 1041b92c (ceph): ReplicatedPG::snap_trimmer: requeue if scrub_block_writes
- Otherwise, we do not continue snap_trimming once scrub is
complete.
Noticed while reviewing another patch. This woul...
- 06:46 PM Linux kernel client Bug #2868: kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
- hoping this was the messenger locking stuff, let's see if it pops up again
- 06:45 PM rbd Bug #2715 (Resolved): krbd: spinlock wrong CPU
- 06:45 PM Linux kernel client Bug #2867 (Resolved): kclient: crash from ffsb in con_work -> kernel_sendmsg
- 06:45 PM Linux kernel client Bug #2392: First read of symlink after ceph filesystem mounted gives error
- 06:19 PM Revision 9071764a (ceph): qa: remove rbd image when done
- Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
- 06:19 PM Revision 4111501a (ceph): qa: add script to run rbd/librbd tests
- Run on old and new style images, with different features. This is
intended to ease development, as opposed to being p...
- 06:19 PM Revision 902bbd79 (ceph): cls_rbd: don't duplicate encoding
- Just skip the parts before encoding instead of returning early.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
- 06:19 PM Revision 72dc665b (ceph): librbd: make ImageCtx methods take snap_id parameters
- This makes it easier to use without racing with snap_set.
Requests in the cache, for example, store their snap_id
and...
- 06:19 PM Revision bd119908 (ceph): librbd: add method to get the size of an image
- To go with the other individual properties instead of stat,
add a method that only gets the size.
Signed-off-by: Jos...
- 06:19 PM Revision ea75d1ff (ceph): librbd: check for new snapshots in snap_set
- This allows callers to create a snapshot and then read from
it immediately. No one uses this other than the tests,
fo...
- 06:19 PM Revision ff2a96c7 (ceph): rbd: fix copy-pasted snap remove error message
- Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
- 06:19 PM Revision c6bc3e10 (ceph): librbd: refactor synchronous I/O
- Write in terms of the asynchronous functions, so all the logic
is not duplicated. Now there's only a single point whe...
- 06:19 PM Revision d3b5ee75 (ceph): librbd: split into multiple files
- This was long overdue. There's still a bunch of stuff in internal.cc
that should be moved elsewhere, but this is a go...
- 06:17 PM Revision bbc2a4e1 (ceph): librbd: round #blocks in image_info() for edge cases
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 06:17 PM Revision 237f2409 (ceph): rbd: flatten command (copyup an entire image)
- Fixes: #2561
Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 06:17 PM Revision 26ed3841 (ceph): librbd, rbd.py, test_rbd.py: flatten methods
- librbd:
- copyup_block for individual blocks (for read/write path)
- flatten for iterating across whole image (for rb...
- 05:49 PM Revision 48038002 (ceph): Merge branch 'wip-osd'
- Reviewed-by: Samuel Just <sam.just@inktank.com>
- 05:49 PM Revision 9e5d4e61 (ceph): osd: initialize send_notify on pg load
- When the PG is loaded, we need to set send_notify if we are not the
primary. Otherwise, if the PG does not go throug...
- 05:49 PM Revision f9ff8dd3 (ceph): osd: replace STRAY bit with bool
- We were setting a bit in pg->state that is private to the non-primary
PG. The other bits get shared with the mon etc...
- 05:47 PM Revision fb1d5495 (ceph): os: KeyValueDB: re-implement (prefix) iter in terms of whole-space iter
- In-a-nutshell-version: Create a whole-space iterator interface, and
implement the already existing, prefix-based iter...
- 05:47 PM Revision 9dd8a333 (ceph): os: KeyValueDB: implement snapshot iterators
- Create a set of functions, to be implemented by derivative classes of
KeyValueDB, responsible for returning an iterat...
- 05:47 PM Revision 8e404772 (ceph): test: test_keyvaluedb_iterators: Test KeyValueDB implementations iterators
- This set of tests focuses on testing the expected behavior of LevelDBStore's
and KeyValueDBMemory's iterators.
We test...
- 04:52 PM rbd Bug #2872 (Resolved): RBD resize command allows image size -1
- Ceph Version : 0.48
Resize rbd image to size -1 allows rbd image to be resized to 15 Exabytes, which is incorrect....
- 03:52 PM rbd Bug #2871 (Resolved): rbd export command hangs when trying to export an image of size 0 to a loca...
- Ceph Version: 0.48
Steps followed:
1. create a rbd image of size 1000 mb in rbd pool
2. resize the rbd image t...
- 10:52 AM Bug #2866 (Resolved): osd: pg stuck with unfound
- commit:9e5d4e61a73343397e67e918e87f1e6dcb8ec72d and commit:7b9d37c662313929b52011ddae47cc8abab99095
- 10:51 AM Bug #2860 (Resolved): osd: stuck waiting for pg acting set to change
- commit:bae837010b6b486011b06dd97664fb54c3f3ff44 and commit:96feca450c5505a06868bc012fe998a03371b77f
- 09:14 AM Bug #2819: krbd: lockup on large writes, msgr fault injection
- i'm unable to reproduce this on a real kernel.. it only happens on uml.
here is a full backtrace:...
- 08:01 AM Bug #2638 (Resolved): mon: make pool ops idempotent
- 08:01 AM Bug #2830 (Duplicate): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
- 02:27 AM Revision 4a0a7e2f (ceph): auth: (ordered) list of auth methods, not a set
- This lets us express a preference.
Signed-off-by: Sage Weil <sage@newdream.net>
- 02:27 AM Revision ec6ecc19 (ceph): auth: AuthSupported -> AuthMethodList
- Signed-off-by: Sage Weil <sage@inktank.com>
- 02:27 AM Revision effdec9e (ceph): auth: introduce cluster, service, and client auth settings
- Signed-off-by: Sage Weil <sage@inktank.com>
07/29/2012
- 09:31 PM Linux kernel client Bug #2688 (Duplicate): lockup on ffsb + thrashing
- 09:31 PM Linux kernel client Bug #2260 (Resolved): libceph: null pointer dereference at try_write+0x638+0xfb0
- this is either #2867, or a similar issue that is since resolved.
- 09:28 PM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
- 09:24 PM Linux kernel client Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg
- *sigh of relief*
- 08:22 PM Linux kernel client Bug #2867: kclient: crash from ffsb in con_work -> kernel_sendmsg
- This appears to be a regression, so it is effectively blocking sending the pull request to Linus.
- 07:40 PM Revision e4e239e2 (ceph): kernel: push a local .deb instead of using gitbuilder
- This lets you specify a path to an existing kernel deb
to be pushed and installed on the remote node.
Limitations:
...
- 07:15 PM Revision 1c93d5ab (ceph): syslog check: fix false-positive BUG matches in random strings
07/28/2012
- 08:53 PM Revision 9d43c8a4 (ceph): test: workloadgen: Don't linearly iterate over a map to obtain a collec...
- We were iterating over the collections map a certain amount of times, in
order to obtain the collection in that posit...
- 05:54 PM Revision 85187edd (ceph): add osd-recovery-incomplete
- 05:53 PM Revision a0847694 (ceph): osd_recovery: also test unfound discovery
- This tests for bug #2866.
- 05:52 PM Feature #2280 (Resolved): improve gitbuilder infrastructure
- 05:50 PM RADOS Subtask #2792 (Fix Under Review): mon: require tunable feature bit if current osdmap uses non-def...
- 05:23 PM Revision 731d5209 (ceph): ceph_manager: count 'incomplete' as 'down'
- 05:23 PM Revision a9f2bf62 (ceph): ceph_manager: wait_for_active
- 05:23 PM Revision 8dd09cb2 (ceph): osd_recovery: test incomplete pg recovery
- 4-osd test to reproduce #2860 and confirm the fix.
- 05:05 PM Revision 7b9d37c6 (ceph): osd: set STRAY on pg load when non-primary
- The STRAY bit indicates that we should announce ourselves to the primary,
but it is only set in start_peering_interval...
- 04:04 PM Revision bae83701 (ceph): osd: peering: make Incomplete a Peering substate
- This allows us to still catch changes in the prior set that would affect
our conclusions (that we are incomplete) and...
- 04:04 PM Revision d1602ee2 (ceph): osd: peering: move to Incomplete when.. incomplete
- PG::choose_acting() may return false and *not* request an acting set change
if it can't find any suitable peers with ...
- 03:49 PM rgw Feature #2869 (Resolved): rgw: expand date format support
- should be able to parse the following:
Sat, 28 Jul 2012 20:35:55 UTC
Which uses UTC instead of GMT.
- 03:30 PM Feature #2477 (Fix Under Review): rados bench cleanup
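The date-format request in rgw Feature #2869 above can be illustrated with a small Python sketch that accepts either "GMT" or "UTC" as the trailing zone name. This is an illustration only, not the rgw parser; `parse_http_date` and `ZONE_NAMES` are hypothetical names, and the `%a`/`%b` directives assume the default C locale.

```python
from datetime import datetime, timezone

# Zone names treated as equivalent in this sketch (an assumption for
# illustration; the real rgw change may accept more forms).
ZONE_NAMES = ("GMT", "UTC")

def parse_http_date(value: str) -> datetime:
    """Parse e.g. 'Sat, 28 Jul 2012 20:35:55 UTC' into an aware datetime."""
    body, _, zone = value.strip().rpartition(" ")
    if zone not in ZONE_NAMES:
        raise ValueError("unsupported zone name: %r" % zone)
    # The zone suffix is handled by hand so that GMT and UTC are parsed
    # identically, independent of platform %Z behaviour.
    parsed = datetime.strptime(body, "%a, %d %b %Y %H:%M:%S")
    return parsed.replace(tzinfo=timezone.utc)
```

Both spellings of the example timestamp then parse to the same instant.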
- 03:30 PM Feature #1783 (Fix Under Review): osd: scrub incrementally across hash range using MOSDPGScan
- 02:39 PM Revision d6126940 (ceph): config: send warnings to a ostream* argument
- We shouldn't always send these to stderr. (Among other things, the
warning: prefix breaks the gitbuilder error detec...
- 02:21 PM Revision 1fc19df8 (ceph): Merge remote-tracking branch 'gh/wip-msgr-masterbits'
- Reviewed-by: Greg Farnum <greg@inktank.com>
- 07:37 AM Linux kernel client Bug #2868 (Resolved): kclient: crash in __kick_osd_requests -> __reset_osd -> __remove_osd
- ...
07/27/2012
- 11:03 PM Revision 96feca45 (ceph): osd: peering: make Incomplete a Peering substate
- This allows us to still catch changes in the prior set that would affect
our conclusions (that we are incomplete) and...
- 10:39 PM Revision a71e442f (ceph): osd: peering: move to Incomplete when.. incomplete
- PG::choose_acting() may return false and *not* request an acting set change
if it can't find any suitable peers with ...
- 09:28 PM Revision de4474ac (ceph): vstart.sh: apply extra conf after the defaults
- This lets you do e.g., -o 'debug ms = 100' and it will apply after
the default logging levels.
Signed-off-by: Sage ...
- 09:00 PM Revision 623026d9 (ceph): Merge remote-tracking branch 'gh/stable' into stable-next
- 08:42 PM Revision 0b8b58ff (ceph): fix adminsocket test
- 05:52 PM Linux kernel client Bug #2867 (Resolved): kclient: crash from ffsb in con_work -> kernel_sendmsg
- ...
- 05:45 PM Revision bbc49179 (ceph): msg/Pipe: if we send a wait, make sure we follow through
- Mark our outgoing connection attempt if we send a WAIT in accept(). This
ensures we don't go to standby or closed in...
- 05:44 PM Revision 6c01d46e (ceph): client: handle fault during session teardown
- We may have a sequence like:
- client does REQUEST_CLOSE
- mds sends reply
- connection faults, client does get r...
- 05:44 PM Revision a879425b (ceph): msg/Pipe: make STANDBY behavior optional
- In particular, lossless_peers should use STANDBY, but lossless_clients
should reconnect immediately since they are al...
- 05:43 PM Revision f69d025b (ceph): conf: make dup lines override previous value
- If you put
[some section]
foo = 1
...
foo = 2
in a .conf file, make the second key override the first.
Generate...
- 05:43 PM Revision 4788567e (ceph): qa: simple test for pool create/delete commands
- Signed-off-by: Sage Weil <sage@inktank.com>
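The duplicate-key behavior described in revision f69d025b above (a later `foo = 2` overriding an earlier `foo = 1` in the same section) can be sketched in a few lines of Python. This is a toy parser for illustration, not Ceph's config code; `parse_conf` is a hypothetical name, and the truncated part of that commit message is not modeled.

```python
# Toy .conf parser (hypothetical, not Ceph's implementation): a later
# duplicate key in a section overrides the earlier value.
def parse_conf(text: str) -> dict:
    sections = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith(("#", ";")):
            continue  # skip blank lines and comments
        if line.startswith("[") and line.endswith("]"):
            # Re-opening a section keeps the keys already seen in it.
            current = sections.setdefault(line[1:-1].strip(), {})
            continue
        if current is None or "=" not in line:
            continue  # ignore anything this sketch does not understand
        key, _, value = line.partition("=")
        current[key.strip()] = value.strip()  # last assignment wins
    return sections
```

Parsing a section that sets `foo = 1` and later `foo = 2` yields `foo` equal to `"2"`.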
- 05:43 PM Revision 6f7837a9 (ceph): mon: make 'osd rm ...' idempotent
- Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 640e5fde (ceph): qa: simple tests for 'ceph osd create|rm' commands
- Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 3715d205 (ceph): mon: check for invalid pool snap creates in preprocess_op, too
- This avoids waiting for a paxos commit just to return an error.
Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 507f99e9 (ceph): librados: make snap create/destroy handle client-side errors
- Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 53aa9593 (ceph): objecter: return ENOENT/EEXIST on pool snap delete/create
- Do these checks on the client to mask monitor idempotency from the user.
Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 5f3ef77d (ceph): mon: make pool snap creation ops idempotent
- Return 0 if the snap already exists, or is already deleted.
Also, avoid updating the pg_pool if we are just waiting ...
- 05:43 PM Revision 4dfc14c4 (ceph): mon: remove superfluous "can't delete except on master" comments
- That's what 'return false' means for preprocess_*().
Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 6ec9555b (ceph): cond: reorder asserts
- Make the more specific checks assert before the less specific ones, so we
are more likely to crash with useful inform...
- 05:43 PM Revision c2e1c629 (ceph): mutex: assert we are unlocked by the same thread that locked
- This only works for non-recursive locks. (Which is probably all of them?)
Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 46e819ed (ceph): objecter: fix mon command resends
- The monitor session is lossy. Send these when the op is initiated, or
when we reconnect. The timeout/cutoff was pre...
- 05:43 PM Revision 358d6b61 (ceph): librados: make pool create/destroy handle client-side errors
- Add tests!
Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision 597f14ab (ceph): objecter: make pool create/delete return EEXIST/ENOENT
- Do these checks on the client side to mask monitor idempotency from
the user.
Signed-off-by: Sage Weil <sage@inktank...
- 05:43 PM Revision 5503376f (ceph): mon: make pool removal idempotent
- Return success if pool does not exist. Part of #2638.
Signed-off-by: Sage Weil <sage@inktank.com>
- 05:43 PM Revision a01e22d2 (ceph): mon: make pool creation idempotent
- Return success if the pool already exists. Part of #2638.
Also, fix this so we wait until a creating pool is create...
- 05:18 PM Bug #2866 (Fix Under Review): osd: pg stuck with unfound
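The pool-op revisions above (a01e22d2, 5503376f, 597f14ab, part of #2638) split the work in two: the monitor treats create and remove as idempotent, while the client checks first and returns EEXIST or ENOENT so the user still sees the familiar errors. A rough Python sketch of that split follows; `MonitorSketch` and `ClientSketch` are hypothetical stand-ins, and reading `mon.pools` directly is an assumption standing in for the client's own view of the map.

```python
import errno

class MonitorSketch:
    """Hypothetical monitor: pool create and remove are idempotent."""
    def __init__(self):
        self.pools = set()

    def create_pool(self, name: str) -> int:
        self.pools.add(name)      # already present: still success
        return 0

    def remove_pool(self, name: str) -> int:
        self.pools.discard(name)  # already gone: still success
        return 0

class ClientSketch:
    """Hypothetical client: masks monitor idempotency from the user."""
    def __init__(self, mon: MonitorSketch):
        self.mon = mon

    def create_pool(self, name: str) -> int:
        # Assumption: the client consults its view of the pool list.
        if name in self.mon.pools:
            return -errno.EEXIST
        return self.mon.create_pool(name)

    def delete_pool(self, name: str) -> int:
        if name not in self.mon.pools:
            return -errno.ENOENT
        return self.mon.remove_pool(name)
```

A resent create or remove is therefore harmless at the monitor, while a caller repeating the operation through the client still gets the usual error code.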
- 04:29 PM Bug #2866 (Resolved): osd: pg stuck with unfound
- on congress, observed pg stuck with unfound objects. kicking peering (marking primary down) resolved it.
in testi...
- 05:15 PM Bug #2860 (Fix Under Review): osd: stuck waiting for pg acting set to change
- 03:17 PM Bug #2860: osd: stuck waiting for pg acting set to change
- i can reproduce this with:...
- 12:39 PM Bug #2860 (Resolved): osd: stuck waiting for pg acting set to change
- ...
- 03:26 PM rbd Bug #2865 (Resolved): rbd import fails for directory but creates rbd image
- Ceph Version: 0.48
Created a local directory t_dir.
when tried to import directory t_dir to rbd/rbd_image, it rep...
- 02:36 PM rgw Bug #2864 (Won't Fix): rados leaves behind references to old buckets
- As this behavior can only be reproduced through deleting objects directly through rados, and not radosgw-admin or API...
- 02:09 PM rgw Bug #2864 (Won't Fix): rados leaves behind references to old buckets
- It is possible to create an inconsistent state by following this procedure:
1. create a bucket through an API call...
- 01:58 PM Bug #2824 (Resolved): ceph-fuse; hang mounting with ms failures
- 01:46 PM CephFS Bug #2863 (Resolved): client: does not tolerate traceless replies from mds
- In at least one case (_create's _mknod) we do not tolerate a (write) reply from the mds with no trace. This happens ...
- 01:21 PM rbd Bug #2862 (Resolved): CLI: rbd create command throws inappropriate error messages
- Ceph Version: 0.48
When tried a few negative test cases using "rbd create command", found that the command display...
- 12:57 PM rbd Bug #2861 (Won't Fix): CLI: rbd create command requires validation for image-name
- Ceph version: 0.48
When trying to create a rbd image, the image name seems to accept empty string and special char...
- 11:45 AM Bug #2462: osd/PG.cc: 402: FAILED assert(log.head >= olog.tail && olog.head >= log.tail)
- just saw this on congress during a huge crush restructure:...
- 11:31 AM rgw Tasks #2859 (New): Make add subuser in radosgw-admin idempotent
- Currently, attempting to create a subuser that already exists returns an error; it has been suggested that this behav...
- 11:25 AM Bug #2858: mon: osd id parsing returns 0 when passed 'osd.1234'
- Not sure exactly what scenario you're looking at here or what the bug is, but there are lots of places in the monitor...
- 11:08 AM Bug #2858 (Resolved): mon: osd id parsing returns 0 when passed 'osd.1234'
- 10:54 AM Bug #2752: Setting large maxosd kills all mons
- Thanks Yehuda!
- 10:51 AM Feature #2857 (Resolved): compile non-production builds with -fno-omit-frame-pointer
- This will let us get much more useful profiling data out of various tools with relatively minimal CPU overhead.
- 08:17 AM Bug #2856 (Resolved): osd: bound size of transactions trimming old osdmaps
- The monitor can arbitrarily advance its oldest map. The osd should avoid sending down an arbitrarily large transacti...
- 04:55 AM Revision 9553c6ed (ceph): osd: fixing sharing of past_intervals on backfill restart
- We need to share past_intervals whenever we instantiate the PG on a peer.
In the PG activation case, this is based on...
- 04:55 AM Revision 29aa1cf4 (ceph): filestore: check for EIO in read path
- Check for EIO in read methods and helpers. Try to do checks in low-level
methods (e.g., lfn_*()) to avoid duplicatio...
- 04:29 AM Revision 0891948e (ceph): filestore: add 'filestore fail eio' option, default true
- By default we will assert/fail/crash on EIO from the underlying fs. We
already do this in the write path, but not th...
07/26/2012
- 11:35 PM Revision 9db78090 (ceph): osd: fixing sharing of past_intervals on backfill restart
- We need to share past_intervals whenever we instantiate the PG on a peer.
In the PG activation case, this is based on...
- 11:29 PM Revision aef10e72 (ceph): librbd: fix id initialization in new format
- 48bd839b1e25b063c675416a8f6233463f1af115 should have included this.
I misread it due to the use of bid instead of id ... - 10:36 PM Bug #2830 (Need More Info): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
- this may duplicate #2843.. sadly didn't take note of the osd id :(
- 10:34 PM Bug #2837 (Resolved): osd: past_interval calculation inefficient
- 10:34 PM Bug #2849 (Resolved): osd: past_intervals not shared on backfill restart
- 04:25 PM Bug #2849 (Resolved): osd: past_intervals not shared on backfill restart
- peer info value is clobbered by backfill block prior to the dne() check in PG::activate()
this explains a lot! - 10:29 PM Revision 17bb78a2 (ceph): librbd: fix id initialization in new format
- 48bd839b1e25b063c675416a8f6233463f1af115 should have included this.
I misread it due to the use of bid instead of id ... - 10:26 PM Revision 5601ae27 (ceph): mon: set a configurable max osd cap
- Don't allow setting a higher osd num through the
ceph control util.
Fixes: #2752
Signed-off-by: Yehuda Sadeh <yehuda... - 10:04 PM Revision 87b6e804 (ceph): Merge remote-tracking branch 'gh/wip-rbd-bid' into stable-next
- 10:04 PM Revision 81d72e5d (ceph): mon: make 'ceph osd rm ...' wipe out all state bits, not just EXISTS
- This ensures that when a new osd reclaims that id it behaves as if it were
really new.
Backport: argonaut
Signed-off... - 10:03 PM Revision ad9c37f2 (ceph): test_stress_watch: just one librados instance
- This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets... - 10:03 PM Revision c60afe18 (ceph): Merge commit '35b13266923f8095650f45562d66372e618c8824' into stable-next
- First batch of msgr fixes.
- 10:03 PM Revision d2d40dc3 (ceph): client: fix locking for SafeCond users
- Need to wait on flock, not client_lock.
Signed-off-by: Sage Weil <sage@inktank.com> - 10:03 PM Revision 6ed01df4 (ceph): workqueue: kick -> wake or _wake, depending on locking
- Break kick() into wake() and _wake() methods, depending on whether the
lock is already held. (The rename ensures tha... - 10:03 PM Revision 7fbbe465 (ceph): librados: take lock when signaling notify cond
- When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the po... - 10:03 PM Revision 4d7d3e27 (ceph): osd: guard class call decoding
- Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com> - 10:03 PM Revision 682609a9 (ceph): objecter: always resend linger registrations
- If a linger op (watch) is sent to the OSD and updates the object, and then
the client loses the reply, it will resend... - 10:03 PM Revision 88330505 (ceph): ReplicatedPG: fix replay op ordering
- After a client reconnect, the client replays outstanding ops. The
OSD then immediately responds with success if the ... - 10:02 PM Revision c963a21a (ceph): filestore: check for EIO in read path
- Check for EIO in read methods and helpers. Try to do checks in low-level
methods (e.g., lfn_*()) to avoid duplicatio... - 10:01 PM Revision 6bd89aeb (ceph): filestore: add 'filestore fail eio' option, default true
- By default we will assert/fail/crash on EIO from the underlying fs. We
already do this in the write path, but not th... - 08:48 PM Revision 0d6ce424 (ceph): Fixed the code to pass 'yes' during mkfs
- Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
- 06:08 PM rbd Subtask #2855 (Closed): krbd: copy-up on write to clone
- 06:07 PM rbd Subtask #2854 (Closed): krbd: write path
- verify the target object exists in write requests. if we fail with ENOENT, trigger a copy-up.
- 06:07 PM rbd Tasks #2853 (Resolved): krbd: read path
- 06:06 PM rbd Subtask #2852 (Closed): krbd: open parent on open
- 06:05 PM rbd Feature #2851 (Duplicate): krbd: RBD layering support
- Kernel client should support all the layering functionality of the usermode client.
- 06:00 PM rbd Feature #2850 (Duplicate): libceph: support multi-operation transactions
- 03:23 PM Bug #2752 (Resolved): Setting large maxosd kills all mons
- Fixed, commit:5601ae27d6daf167dd83b3fc91b7b9591ca0cea6.
- 12:28 PM Bug #2848 (Won't Fix): OSDMap: pool_id is 64-bit, but pool_max is 32-bit
- A large number of pools will overflow pool_max before using the full range of pool ids.
- 12:26 PM Linux kernel client Cleanup #2847 (Resolved): libceph: osdmap definition is out of date
- In particular, pool_id is an int instead of a 64-bit integer. There are probably other important differences as well.
- 10:53 AM rbd Feature #2562 (In Progress): librbd: open parent images, read path, write path
- 10:46 AM rbd Feature #2726 (In Progress): krbd: clean up bio_pair leak/whatever
- Guangliang Zhao sent a patch to fix that; however, I had some concerns about it, and I'm waiting for him to respond t...
- 08:26 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
- Ok, I finally know the failing path.
So when you call add_key with an invalid payload, it will be parsed by ceph_k... - 08:07 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
- Damnit ... first it didn't take the formatting and second I pasted the wrong code :p...
- 08:05 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
- wrt the kernel crash, here's a minimal test case that will crash any machine that has the rbd module loaded (works as user...
- 04:28 AM Bug #2846: Malformed keyring file causes kernel null pointer deref on "rbd map"
- I was pointing to a keyring file directly that happened to start with an empty line. So in rbd.cc, the function read_...
- 03:53 AM Bug #2846 (Resolved): Malformed keyring file causes kernel null pointer deref on "rbd map"
- Reported by Sylvain Munaut ("tnt" on OFTC):
(12:30:27) tnt: Is mounting a RBD on a machine that has an OSD suppose... - 02:32 AM Bug #2845 (Resolved): mkcephfs hasn't learned about new default keyring locations in argonaut
- In 0.48, when running @mkcephfs@ in a @cephx@ authentication enabled cluster, the per-daemon keys for MDSs and OSDs a...
- 01:13 AM Revision e9b5a289 (ceph): config: fix 'config set' admin socket command
- Fixes: #2832
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
07/25/2012
- 11:38 PM Revision 2b75ddef (ceph): Added '-y' option for mkfs.ext4
- Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
- 11:35 PM Revision 1a6cd965 (ceph): osd: break potentially large transaction into pieces
- We do a similar trick elsewhere. Control this via a tunable. Eventually
we'll control the others (in a non-stable b... - 10:57 PM Revision bcb9ab8b (ceph): doc: updates to fix problem with ceph-cookbooks appearing in chef-server.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 09:54 PM Bug #2843 (Can't reproduce): filestore: replay failure on xfs
- congress osd.328 crashed with...
- 09:53 PM Revision 15e16229 (ceph): osd: only commit past intervals at end of parallel build
- We don't check for gaps in the past intervals, so we should only commit
this when we are completely done. Otherwise ... - 08:28 PM Revision 9767146f (ceph): osd: generate past intervals in parallel on boot
- Even though we aggressively share past_intervals with notifies etc, it is
still possible for an osd to get buried beh... - 08:28 PM Revision d45929f4 (ceph): osd: move calculation of past_interval range into helper
- PG::generate_past_intervals() first calculates the range over which it
needs to generate past intervals. Do this in ... - 08:27 PM Revision 18d5fc41 (ceph): osd: fix map epoch boot condition
- We only want to join the cluster if we can catch up to the latest
osdmap with a small number of maps, in this case a ... - 05:58 PM Revision fca65ff5 (ceph): osd: move calculation of past_interval range into helper
- PG::generate_past_intervals() first calculates the range over which it
needs to generate past intervals. Do this in ... - 05:58 PM Revision 16302ace (ceph): osd: generate past intervals in parallel on boot
- Even though we aggressively share past_intervals with notifies etc, it is
still possible for an osd to get buried beh... - 05:58 PM Revision 5979351e (ceph): osd: fix map epoch boot condition
- We only want to join the cluster if we can catch up to the latest
osdmap with a small number of maps, in this case a ... - 05:55 PM Bug #2842: mon: health detail lists pgs multiple times
- This and #2827 may be related?
- 05:52 PM Bug #2842 (Won't Fix): mon: health detail lists pgs multiple times
- ...
- 05:16 PM rgw Bug #1855: Creation of a subuser that appears to own an s3 key is possible, and removing the subu...
- This bug can be reproduced by using the following options
./radosgw-admin -c {'ceph.conf'} --rgw-socket-path=/tmp... - 05:16 PM rgw Bug #2841 (Resolved): rgw: fix usage trim
- looking at the code, it seems that we don't encode the user in usage-trim (and also encode the wrong structure).
- 04:52 PM CephFS Bug #2187: pjd chown/00.t failed test 97
- 2012-07-23T19:16:10.185 INFO:teuthology.task.workunit.client.0.out:not ok 43
2012-07-23T19:16:10.186 INFO:teuthology... - 04:51 PM CephFS Bug #2187: pjd chown/00.t failed test 97
- Latest log: ubuntu@teuthology:/a/teuthology-2012-07-23_19:00:03-regression-master-testing-gcov/16530
- 04:30 PM Feature #2840 (Resolved): mon: $mon_data/cluster_fsid file
- maybe written/verified by mkfs!
- 04:23 PM rgw Feature #2839 (Resolved): rgw: garbage collection
- Provide a garbage collection mechanism, along the lines of what was described in a post to the mailing list.
- 04:20 PM rgw Bug #2652: Segmentation fault in rest-bench
- is it still happening?
- 04:20 PM rgw Bug #2665: rest-bench hangs periodically
- is that still happening?
- 04:10 PM devops Feature #2574 (Resolved): crowbar: use data disks automatically, journal inside data directory
- There were bugs and the history was wrecked by github pull requests again, so I redid some commits, but this function...
- 03:45 PM rgw Feature #2039 (Rejected): rgw: keep more than one bucket marker object
- That's not the case anymore. We use the unique client id and a running counter instead.
- 02:08 PM Bug #2838 (Resolved): mon: json version of 'osd tree'
- 01:52 PM Bug #2824: ceph-fuse; hang mounting with ms failures
- 01:52 PM Bug #2835 (Resolved): osd: do not send alive/upthru until booted
- 01:52 PM Bug #2836 (Resolved): osd: boot condition check incorrect
- 10:52 AM Bug #2836 (Resolved): osd: boot condition check incorrect
- commit:5979351ef3d3d03bced9286f79cbc22524c4a8de
- 11:04 AM Bug #2837 (Resolved): osd: past_interval calculation inefficient
- It is still possible for osds to get pgs without past intervals and need to recalculate them, and that calculation ca...
- 04:36 AM Revision 61ff894c (ceph): admin-socket: test generic admin socket commands
- 03:54 AM Revision 11b275a0 (ceph): osd: avoid misc work before we're active
- If we're booting, we shouldn't scrub, or send reports to the monitor,
or send heartbeats, or any of that.
Signed-off... - 03:51 AM Revision 278b5f58 (ceph): mon: ignore pgtemp messages from down osds
- Signed-off-by: Sage Weil <sage@inktank.com>
- 03:51 AM Revision 08e2ecac (ceph): mon: ignore osd_alive messages from down osds
- Signed-off-by: Sage Weil <sage@inktank.com>
- 03:18 AM Revision 8c7186d0 (ceph): mon: ignore pgtemp messages from down osds
- Signed-off-by: Sage Weil <sage@inktank.com>
- 03:16 AM Revision b17f5467 (ceph): mon: ignore osd_alive messages from down osds
- Signed-off-by: Sage Weil <sage@inktank.com>
- 12:23 AM Revision 404a7f52 (ceph): admin_socket: json output, always
- If the perfcounters stuff were refactored to use the Formatter, we could
put the JSONFormatter in the admin_socket co... - 12:23 AM Revision 0133392b (ceph): admin_socket: dump config in json; add test
- Signed-off-by: Sage Weil <sage@inktank.com>
- 12:22 AM Revision 8c3b4907 (ceph): Merge branch 'next'
07/24/2012
- 10:26 PM Revision 9bc86171 (ceph): admin_socket: make test optional
- If it's not there, we just verify the output is valid json.
- 08:58 PM Bug #2835 (Resolved): osd: do not send alive/upthru until booted
- 08:53 PM Revision 0ef8cd3c (ceph): config: fix 'config set' admin socket command
- Fixes: #2832
Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com> - 06:49 PM Revision 186a595c (ceph): Merge branch 'next'
- 06:02 PM Revision f565ace6 (ceph): osd: fix pg log zeroing
- Zero the right number of bytes. Fixes a bug where we clobber legit log
data. Fortunately this is only triggered wit... - 05:10 PM Revision 3e886799 (ceph): Merge branch 'wip-2763'
- 05:09 PM Revision 52f51a24 (ceph): wireshark/ceph/packet-ceph.c: fix eol
- Removing extra char from dos eol format.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> - 05:09 PM Revision d67ad0db (ceph): Wireshark dissector updated, work with the current development tree of ...
- 04:02 PM Bug #2833: LibRBD.TestClone failure with ms failure injection
- I'm pretty sure this is due to the op being done twice due to replay. That error (-EBADF) means the id was already ad...
- 03:49 PM Bug #2833 (Resolved): LibRBD.TestClone failure with ms failure injection
- ...
- 03:54 PM Bug #2834 (Resolved): osd/ReplicatedPG.cc: 3577: FAILED assert(waiting_for_ack.begin()->first == ...
- ...
- 03:10 PM devops Feature #2574: crowbar: use data disks automatically, journal inside data directory
- The most recent pull request for the cookbook has been tested by Tyler and myself. I've set up the following situation...
- 02:57 PM Bug #2832: [argonaut] config set foo bar broken?
- What was the resolution?
- 02:33 PM Bug #2832 (Resolved): [argonaut] config set foo bar broken?
- 01:32 PM Bug #2832 (Resolved): [argonaut] config set foo bar broken?
- ...
- 01:08 PM Bug #2831 (Duplicate): librados: LibRadosWatchNotify.WatchNotifyTest segfault
- Never mind, I cherry-picked a bad version of the linger op fix for #2796
- 12:48 PM Bug #2831 (Duplicate): librados: LibRadosWatchNotify.WatchNotifyTest segfault
- On the stable-next branch, commit:7e9336e7cb36c6640fd5c8509c9c0ffd87b15d7a,
I hit this sometime last week or the w... - 11:59 AM Bug #2824 (In Progress): ceph-fuse; hang mounting with ms failures
- 10:58 AM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
- commit:f565ace62a1235dbe6a4ee9db5c5233cea395446
- 08:44 AM Bug #2799: osd: pg log trimming zeroing broken
- this was a bug in pg log trimming/zeroing. thankfully it was only enabled in master! will backport the fix all over...
- 08:25 AM Bug #2799: osd: pg log trimming zeroing broken
- ...
- 10:14 AM Subtask #2805: mon: Single-Paxos: Sync: Create a test unit to verify the correctness of the whole...
- Current available test units:
* Removing keys:
> * Using the whole-space iterator, both in LevelDBStore and KeyVa... - 09:19 AM Bug #2830 (Duplicate): [argonaut] osd/OSD.cc: 3906: FAILED assert(_get_map_bl(epoch, bl))
- 0> 2012-07-24 14:38:09.918736 7fe53cede700 -1 osd/OSD.cc: In function 'OSDMapRef OSD::get_map(epoch_t)' thread 7...
- 01:30 AM Revision a16d9c64 (ceph): os: KeyValueDB: allow finer-grained control of transaction operations
- This patch introduces the possibility of using single key/value
modification operations into the transaction interfac... - 01:30 AM Revision a3d57a6e (ceph): os: KeyValueDB: Add virtual raw_key() function to return (prefix,key) pair
- If we were to use solely the key() function, whenever we had a key with,
say, prefix 'Foo' and key 'Bar', the key() f... - 12:39 AM Revision 6c0fa509 (ceph): doc: update information about stable vs development releases
- Signed-off-by: Sage Weil <sage@inktank.com>
- 12:16 AM Revision 48bd839b (ceph): librbd: replace assign_bid with client id and random number
- The assign_bid method has issues with replay because it is a write
that also returns data. This means that the replay...
07/23/2012
- 11:51 PM Revision 67832c34 (ceph): osd: fix ACK ordering on resent ops
- The wait_for_ondisk handling fixed COMMIT ordering, but the ACKs need to
go back in the same order too. For example:... - 11:01 PM Revision 96dbc412 (ceph): rados::cls::lock: move api types into namespace
- By popular demand, moved public api into namespace. This
required some changes to ceph_dencoder to get some template
... - 09:09 PM Revision 7dfdf4f8 (ceph): librbd: replace assign_bid with client id and random number
- The assign_bid method has issues with replay because it is a write
that also returns data. This means that the replay... - 07:51 PM Revision dc2d6711 (ceph): librados: add new constructor to form a Rados object from IoCtx
- This creates a separate reference to an existing connection, for
use when a client holding IoCtx needs to consult ano... - 07:43 PM Revision d9bfe954 (ceph): Merge tag 'v0.49'
- v0.49
- 06:28 PM Revision ca6265d0 (ceph): v0.49
- 05:47 PM Revision c8f13119 (ceph): mon: make 'ceph osd rm ...' wipe out all state bits, not just EXISTS
- This ensures that when a new osd reclaims that id it behaves as if it were
really new.
Backport: argonaut
Signed-off... - 05:17 PM Bug #2822 (Resolved): rbd: ASSIGN_BID both mutates object and returns data
- commit:48bd839b1e25b063c675416a8f6233463f1af115 on master
commit:7dfdf4f8de16155edd434534e161e06ba7c79d7d for argonaut - 05:03 PM Feature #2829 (Resolved): report on cluster size/status (for service billing purposes)
- The plan is to base support contract pricing on capacity,
which means we need a quick, easy (and moderately trusted)... - 04:59 PM Bug #2828 (Resolved): osd: assign_bid was allowed to mutate and return data
- The osd shouldn't have allowed this. Figure out why, and then see if there is a graceful way to fix it without break...
- 04:56 PM Revision f42e1875 (ceph): cls_rbd, cls_rbd_client, test_cls_rbd: copyup method
- Fixes: #2559
Signed-off-by: Dan Mick <dan.mick@inktank.com> - 04:55 PM Revision 9b9efe7e (ceph): cls_rbd_client: fix locking function indentation
- Greg's editor was set to spaces only.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 04:55 PM Revision d5e45418 (ceph): cls_rbd: return negative pool id if parent does not exist
- This lets us use get_parent in a multi-op transaction without getting
-ENOENT for the entire transaction.
Signed-off... - 04:55 PM Revision 56bc3699 (ceph): cls_rbd: make get_parent return valid data when layering is disabled
- This means clients can treat an error in their multi-object
transaction as a failure for all of them. This makes the ... - 04:55 PM Revision 02aba7bc (ceph): librbd: rename variables for more consistency
- Rename: poolname -> pool_name
snapname -> snap_name
snapid -> snap_id
Signed-off-by: Josh Durgin <... - 04:55 PM Revision 4af1f981 (ceph): librbd: move watch/unwatch to ImageCtx methods
- This fixes close_image() on a partially initialized ImageCtx, and
paves the way for opening parent images without wat... - 04:55 PM Revision 233d44a1 (ceph): librbd: allow an image to be opened without watching
- Watching the header of a parent image could produce unreasonable
delays. If hundreds of child images watch the same p... - 04:55 PM Revision df71c2d3 (ceph): librbd: store parent info in snapshot metadata
- If an image is open to a snapshot, return the features or overlap from
that snapshot.
Signed-off-by: Josh Durgin <jo... - 04:55 PM Revision 8014d932 (ceph): librbd: allow opening an image by id
- If we have an ImageCtx with an id set, don't bother setting the name.
Just go directly to the header object in ImageC... - 04:55 PM Revision b82f5958 (ceph): librbd: move all side-effects to snap_{un}set()
- All callers need to do this anyway.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 04:55 PM Revision c18c4b86 (ceph): librbd: open parent image when opening a child
- Extract a helper out of get_parent_info. The parent may become unset
while the child is open, so detect changes in it... - 04:55 PM Revision 127ff618 (ceph): librbd: drop unnecessary std:: and struct prefixes
- Most of this file does not use them.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 04:52 PM Bug #2823 (Resolved): osd: out of order ACKs
- commit:67832c34a228c5cff1941eab3225969b5e66e44b
- 08:47 AM Bug #2823: osd: out of order ACKs
- specifically, this is out of order ACKs when ops are resent. trivially reproducible with the thrasher + radosmodel t...
- 04:21 PM Revision 5fcb22f0 (ceph): mkcephfs: add sync between btrfs scan and mount
- This appears to fix problems with mount failing for at least one user.
Reported-by: Paul Pettigrew <Paul.Pettigrew@m... - 02:44 PM Bug #2827 (Rejected): mon: ceph health string doesn't match "ceph -s" output
- I've seen several ceph -s reports lately where the first line on the PGs doesn't match the output of the PG line. My ...
- 11:11 AM devops Feature #2553 (Closed): crowbar: open question: What's the correct way to add RBD support to the ...
- This ticket was an arbitrary reminder to send upstream an email with questions, and we have answers & sort of underst...
- 11:04 AM Feature #2826 (Resolved): osd: backfill target reservations
- When osds are marked out, the recovery ops per primary throttling works fine, because the backfill targets tend to be...
- 10:45 AM CephFS Bug #2187: pjd chown/00.t failed test 97
- also,
ubuntu@teuthology: /a/teuthology-2012-07-23_05:00:03-regression-stable-master-basic/16425
from teuthology... - 10:42 AM CephFS Bug #2187: pjd chown/00.t failed test 97
- Recent logs: ubuntu@teuthology: /a/teuthology-2012-07-23_05:00:03-regression-stable-master-basic/16416
From teutho... - 10:14 AM Bug #2787: RBD test copy fails when thrashing
- Recent logs: ubuntu@teuthology:/a/teuthology-2012-07-23_00:00:08-regression-next-testing-basic/16393
ubuntu@teutholo... - 08:59 AM CephFS Bug #2825: File lock doesn't work properly
- Here is the code included in *pre* ;)...
- 08:56 AM CephFS Bug #2825 (Resolved): File lock doesn't work properly
- Hi,
I've been testing the locking feature on cephfs and it seems that the locking doesn't work as expected when locki... - 05:03 AM Revision d0698f9d (ceph): schedule_suite.sh: put coverage option in ceph section
- 03:50 AM Revision deb7a54a (ceph): marginal kclient+ffsb: enable mds logging to catch badness
- See #1947
- 03:47 AM Revision 32353f13 (ceph): move misc, blogbench back into active kernel suite
- these were removed from regression ages ago, and only recently put back in
marginal. they seem fine. - 03:47 AM Revision 000fec3b (ceph): move all kernel tests to kernel suite; symlink collections from regression
- Make regression a union of other topical suites.
07/22/2012
- 10:46 PM Bug #2824 (Resolved): ceph-fuse; hang mounting with ms failures
- ...
- 10:36 PM Bug #2823 (Duplicate): osd: out of order ACKs
- ...
- 10:34 PM Bug #2822: rbd: ASSIGN_BID both mutates object and returns data
- ubuntu@teuthology:/a/sage-2012-07-22_21:26:14-regression-wip-msgr-testing-basic/16193 too
- 10:33 PM Bug #2822 (Resolved): rbd: ASSIGN_BID both mutates object and returns data
- It's not okay to mutate data *and* return data, because when resending the operation (e.g., due to a connection fault...
- 09:40 PM Bug #2502 (Resolved): osd: do not mark self back up if internal heartbeats are failing
- 09:34 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
- 09:32 PM Bug #2799: osd: pg log trimming zeroing broken
- 09:29 PM CephFS Bug #1677: mds interval_set.h: 385: FAILED assert(p->first <= start)
- 03:59 AM Revision 17e4f751 (ceph): this fails reliably
- 03:18 AM Revision f70b8250 (ceph): ceph: fix mkfs/mount option defaults
- Later code expects a list, not None.
07/21/2012
- 04:15 PM Revision 2d7e2cbf (ceph): crush: fix name map encoding
- We screwed up and encoded using the name 'int' type instead of int32_t.
That means people have systems encoding this ... - 03:50 PM Revision b497bdac (ceph): osd/OpTracker: fix use-after-free
- And formatting.
Signed-off-by: Sage Weil <sage@inktank.com> - 02:12 PM CephFS Bug #2187: pjd chown/00.t failed test 97
- Latest logs: ubuntu@teuthology:/a/teuthology-2012-07-21_05:00:03-regression-stable-master-basic/15237
ubuntu@teuth... - 09:17 AM Bug #2638 (Fix Under Review): mon: make pool ops idempotent
- 09:17 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
- commit:b497bdacf5d5732285d280772ab9ce6ae88f59e8
- 08:15 AM Bug #2821 (Resolved): osd: segfault in OpHistory::cleanup(utime_t)
- ...
- 01:36 AM Revision 7cf1f1fb (ceph): msg/Pipe: go to STANDBY on lossless accept fault
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:36 AM Revision 2e67b7a3 (ceph): msgr: do not reopen failed lossy Connections
- There was a race where:
- sending stuff to a lossy Connection
- it fails, and queues itself for reap, queues a RES... - 01:36 AM Revision adce6df2 (ceph): msgr: simplify submit_message()
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:36 AM Revision 04fde518 (ceph): msgr: drop CLOSED checks during queueing
- AFAICS these checks are pointless. There should be no harm in queueing
messages on a closed connection; they'll get ... - 01:36 AM Revision 60eb36ef (ceph): msgr: close get_connection() race
- This could null deref if the Pipe is registered but failed.
We need to loop here because the Pipe vs Connection stuf... - 01:36 AM Revision ea7511b8 (ceph): msg/Pipe: fix leak of Connection in ctor
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:36 AM Revision 9348bb38 (ceph): osd: reopen heartbeat connections when they fail
- If we have an active peer whose Connection fails, open a new one. This
is necessary now that a lossy client connecti... - 01:36 AM Revision ef3fd1c3 (ceph): msg/Pipe: go to standby on lossless server connection faults
- Go directly to the STANDBY state, and print a more accurate message.
Otherwise, we do the same check in writer() and ... - 01:36 AM Revision 8966f71a (ceph): msg/IncomingQueue: make the pipe parent informational only
- Use this pointer only for debug output prefix; do not dereference, as we
may live beyond the original parent.
Signed... - 01:36 AM Revision 525830cd (ceph): msgr: do not discard_queue in Pipe reaper
- The IncomingQueue can live beyond the Pipe. In particular, there is no
reason not to deliver messages we've received... - 01:36 AM Revision 7b398a5d (ceph): msgr: indicate whether clients are lossy
- We need to know whether the client is lossy before we connect to the peer
in order to know whether to deliver a RESET... - 01:36 AM Revision 9d94ed1c (ceph): test_stress_watch: verify that the watch operation isn't slow
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:36 AM Revision 472d14f7 (ceph): msg/DispatchQueue: fix locking in dispatch thread
- The locking was awkward with locally delivered messages.. we dropped dq
lock, inq lock, re-took dq lock, etc. We wo... - 01:36 AM Revision cef85105 (ceph): msg/Connection: add failed flag for lossy Connections
- If a lossy Connection fails and we disconnect the Pipe, set a failed flag.
Signed-off-by: Sage Weil <sage@inktank.com> - 01:36 AM Revision 541694f7 (ceph): msg/Pipe: disconnect Pipe from lossy Connection immediately on failure
- When we have a lossy connection failure, immediately disconnect the Pipe
and set the Connection failed flag. There i... - 01:36 AM Revision 9a4e7027 (ceph): msg/Pipe: unregister pipe immediately on fault; fix mark_down
- This fixes a problem where:
- pipe faults, con->pipe is cleared
- ms_handle_reset tries to mark_down, but it doesn... - 01:36 AM Revision 35b7bca3 (ceph): msg: kill tcp.{cc,h}
- Move the remaining comparator into msg_types.h and kill this off.
Signed-off-by: Sage Weil <sage@inktank.com> - 01:36 AM Revision 5a62dfef (ceph): msg/DispatchQueue: hold lock in IncomingQueue::discard_queue()
- This prevents races with the dispatch thread, among other things.
Signed-off-by: Sage Weil <sage@inktank.com> - 01:36 AM Revision 999c506d (ceph): msg/DispatchQueue: give IncomingQueue ref to queue
- We want to be able to queue an event (e.g., RESET) and deliver it even
after the Pipe is destroyed.
Signed-off-by: S... - 01:22 AM Revision 5ecc5bce (ceph): msg/DispatchQueue: cleanup debug prefix
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:22 AM Revision 89b07f47 (ceph): msg/Pipe: move tcp_* functions into Pipe class
- This lets us print nice debug prefixes. It also calls BS on the
Pipe vs tcp.cc separation.
Signed-off-by: Sage Weil... - 01:22 AM Revision d034e46d (ceph): msgr: move Accepter into separate .cc
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:22 AM Revision 3e98617c (ceph): msg/Pipe: get_state_name()
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:22 AM Revision f78a4010 (ceph): msgr: rework accept() connect_seq/race handling
- We change a couple of key things here:
* If there is a matching connect_seq and the existing connection is in OPEN ... - 12:36 AM Revision ec4ce8bf (ceph): regression: do some tests on ext4
- 12:20 AM Revision a6735ab0 (ceph): OpRequest,OSD: track recent slow ops
- This should be helpful while investigating slow performance.
OpRequests now track events with timestamp in addition
...
07/20/2012
- 11:09 PM Bug #2820: osd: crash in handle_osd_ping
- also ubuntu@teuthology:/a/sage-2012-07-20_18:17:10-regression-wip-msgr-cleanup-testing-basic/14885
- chef: null
-... - 11:08 PM Bug #2820 (Resolved): osd: crash in handle_osd_ping
- ...
- 10:45 PM RADOS Subtask #2793 (Fix Under Review): osd: require tunable feature if current osdmap uses non-default...
- 09:32 PM Revision d624f343 (ceph): Merge branch 'next'
- 08:59 PM Revision 9e207aa8 (ceph): test/store_test.cc: verify collection_list_partial results are sorted
- Synthetic test now also varies snapshots and uses a small variety of
hashes.
Signed-off-by: Samuel Just <sam.just@in... - 08:41 PM Revision 49877cde (ceph): cls_lock: cls_lock_id_t -> cls_lock_locker_id_t
- Renamed type to make more sense.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> - 08:28 PM Revision 315bbea5 (ceph): cls_lock: document lock properties
- Added some comments about different lock properties.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> - 08:16 PM Revision 056d42cf (ceph): cls_log: update a comment
- Was missing output param description.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> - 08:14 PM Revision da770146 (ceph): move cfuse+dbench back to regression for verify, too
- 08:11 PM Revision 2c7d7821 (ceph): rados: lock info keeps expiration, not duration
- We pass duration in the request, but internally we keep
the expiration.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.... - 08:00 PM Revision d16844c8 (ceph): rados tool: add advisory lock control commands
- Can now lock, break lock, list locks and show lock
info.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> - 07:59 PM Revision 2f8de894 (ceph): cls_lock: objclass for advisory locking
- Providing an objclass to create and manipulate advisory
locking. Also providing a client api to control it. A lock
ma... - 07:55 PM Revision 9c5c3edf (ceph): objclass: add api calls to get/set xattrs
- added the following functions:
cls_cxx_getxattr
cls_cxx_getxattrs
cls_cxx_setxattr
Signed-off-by: Yehuda Sadeh... - 07:29 PM Revision adc9b91f (ceph): os/HashIndex: use set<pair<string, hobject_t>> rather than multimap
- Multimap does not make any guarantees about ordering of different
values with the same key. list_by_hash, however, a... - 05:22 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
- 04:30 PM Bug #2819 (Won't Fix): krbd: lockup on large writes, msgr fault injection
- krbd + osd socket failure injection + iozone -> crash in uml, without any useful debugging. uml itself either locks ...
- 04:07 PM rbd Feature #2559 (Fix Under Review): cls_rbd: copyup method
- 04:07 PM rbd Feature #2607 (Fix Under Review): librbd: copyup helper
- 03:45 PM Bug #2638 (In Progress): mon: make pool ops idempotent
- 03:40 PM Documentation #2729 (Resolved): doc: OSD internals
- 03:25 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
- Responding to Sage's possible kinks:
1. the cephbooter issue only comes up with new installs, existing vms with th... - 03:19 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
- I moved http://gitbuilder-precise-i386.front.sepia.ceph.com/ and wrote a doc on how to move whole images: https://doc...
- 02:37 PM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
- 09:57 AM RADOS Subtask #2791 (In Progress): crush: feature bit for crush tunables
- 02:20 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
- 09:59 AM Messengers Bug #2802 (In Progress): msgr: mds session hangs on direct_io test
- just a misleading message; cleaned up. the direct_io test (which leaves the mds connection mostly idle) still fails ...
- 01:50 PM Feature #2812 (Resolved): automated CentOS testing
- (1) testing on anything other than Precise means we need the ability to re-image test machines or to run teuthology o...
- 01:46 PM devops Feature #2808: crowbar: upgrade to fred3 (get bind bug fix)
- Test that the bind problem has been fixed; nodes should be able to resolve other node names to IP addresses without ...
- 01:45 PM devops Feature #2808 (Rejected): crowbar: upgrade to fred3 (get bind bug fix)
- 01:41 PM Feature #2807 (Resolved): RPM git-builder
- We have customers who want to run Ceph on RHEL.
A lot of people run RHEL (or CentOS) and we should be building
(an... - 01:26 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
- The doc is fine as it is; it seems the new upstream makes this a lot nicer.
- 10:46 AM devops Feature #2785: crowbar: custom ISO builder that includes barclamp-ceph
- Crowbar cleaned up the build process so I was able to go through and build an ISO with our barclamp rather easily. Th...
- 01:19 PM devops Feature #2574 (In Progress): crowbar: use data disks automatically, journal inside data directory
- 11:17 AM rbd Bug #2535 (Resolved): rbd: random data corruption in vm
- Thanks for reminding us Florian. There are no open questions that I know of, the only problem with this was fiemap, s...
- 11:11 AM rbd Bug #2535: rbd: random data corruption in vm
- Just so this doesn't fall off the table, can this be considered resolved with the changed @filestore fiemap@ default? ...
- 10:06 AM Bug #2761: osd: failed to recover before timeout expired
- latest logs: ubuntu@teuthology:/a/teuthology-2012-07-20_05:00:03-regression-stable-master-basic/14803
from teuthol... - 10:05 AM rbd Bug #2715: krbd: spinlock wrong CPU
- hoping this was caused by the mutex-less con_open, or something similar. will keep this open for a few more days to ...
- 09:29 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
- Trimming through Paxos on the services is the way to go, no doubt. However, it's a whole world of pain when it comes ...
- 08:33 AM Bug #2799: osd: pg log trimming zeroing broken
- again today: ubuntu@teuthology:/a/teuthology-2012-07-19_19:00:08-regression-master-testing-gcov/14585
- 07:13 AM Subtask #2805 (Resolved): mon: Single-Paxos: Sync: Create a test unit to verify the correctness o...
- We have added support for a whole-space iterator and a snapshot iterator (task #2758) on KeyValueDB and LevelDB, and ...
- 05:14 AM Revision 0b84384f (ceph): mon: shut up about sessionless MPGStats messages
- If the mon gets a reset on the client connection, it clears the session
on the connection. This is perfectly normal ... - 04:27 AM Revision 6580450f (ceph): osd: clean up boot method names
- Prefix subsequent steps with _. Better names.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Yehuda Sadeh... - 04:27 AM Revision 369fbf61 (ceph): osd: defer boot if heartbeatmap indicates we are unhealthy
- If the OSD is bogged down or unresponsive, we should not try to join
the cluster. This was observed on congress (slo... - 03:22 AM Revision d76df212 (ceph): Merge branch 'next'
- Conflicts:
src/include/ceph_features.h - 12:13 AM Revision 7f58b9be (ceph): mon: track pg creations by osd
- Track the pending pg creations by osd, and use a helper to send out those
messages.
Signed-off-by: Sage Weil <sage@in... - 12:13 AM Revision dec93692 (ceph): osd/mon: subscribe (onetime) to pg creations on connect
- Ask the monitor for pending pg creations each time we connect.
Normally, this is a freebie check. If there are pend...
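The advisory-lock entry earlier in this day's list (revision 2c7d7821, "lock info keeps expiration, not duration") can be illustrated with a minimal Python sketch. This is not the cls_lock code: `LockInfo`, `make_lock_info`, and the convention that a duration of 0 means "never expires" are all assumptions made for illustration.

```python
import time


class LockInfo:
    """Hypothetical lock record that stores an absolute expiration time."""

    def __init__(self, expiration):
        # Absolute time in seconds; 0 is assumed here to mean "never expires".
        self.expiration = expiration

    def expired(self, now):
        return self.expiration != 0 and now >= self.expiration


def make_lock_info(duration, now=None):
    """Convert the duration carried in a request into a stored expiration."""
    if now is None:
        now = time.time()
    expiration = now + duration if duration > 0 else 0
    return LockInfo(expiration)
```

Storing the expiration means a later check only compares against the current time and does not need to know when the lock was taken.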
07/19/2012
- 11:48 PM Revision 42de6873 (ceph): rbd: fix usage for snap commands
- Snap commands take '--snap <snapname> <imagename>'.
Signed-off-by: Sage Weil <sage@inktank.com> - 11:45 PM Revision 4c6c927b (ceph): Revert "rbd: fix usage for snap commands"
- This reverts commit 42de6873f9ca33fc20e70176d9a422635a6f0152.
Actually, these are fine! Dan made them all kinds of ... - 10:51 PM Revision e1c98e7d (ceph): tasks: add multibench task for testing pool creation
- Also adds support for specifying a pool for radosbench
to create and then cleanup instead of "data".
Signed-off-by: ... - 10:06 PM rgw Feature #2804 (Rejected): rgw: disallow running multiple gateways on the same fastcgi socket
- We saw an incident where multiple gateways were running on the same machine, both pointing to the same apache <-> rad...
- 09:35 PM rbd Feature #2788 (Fix Under Review): rbd: add -l to rbd ls, show snapshots
- wip-rbd-ls-R
- 03:55 PM rbd Feature #2788 (In Progress): rbd: add -l to rbd ls, show snapshots
- 09:13 PM Bug #2502 (Fix Under Review): osd: do not mark self back up if internal heartbeats are failing
- wip-osd-hb
- 09:02 PM Bug #2638: mon: make pool ops idempotent
- this needs to be fixed in order to get the msgr failure injection stuff in the qa suite
- 09:01 PM Bug #2771: mon|osd: fail to create pgs on startup
- commit:ca9f71300471e1947be63e5dfad1a4800c1ffc9c and a few commits that follow
- 09:00 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
- 06:29 PM Revision 58cd27fd (ceph): doc: add missing dependencies to README
- Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
- 05:52 PM Bug #2803 (Resolved): filer: probe crash
- ...
- 05:50 PM Messengers Bug #2802 (Resolved): msgr: mds session hangs on direct_io test
- ...
- 05:21 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
- still working on part that dumps slowest ops over past N seconds
- 04:49 PM Linux kernel client Bug #2801 (Resolved): msgr crash in ceph_msg_new
- ...
- 04:15 PM Linux kernel client Feature #1699 (Resolved): debug symbols in autobuilt (sepia) kernels
- the symbols are there. the trick is for objdump to find them in the original path. do this:
sudo mkdir -p /srv/a... - 04:13 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
- ...
- 03:18 PM Linux kernel client Bug #2790: libceph: crash in read_partial_message_section on ffsb
- ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2012-07-19_08:03:32-marginal-master-testing-basic/14125
- 03:55 PM rbd Subtask #2606 (Fix Under Review): librbd layering: copyup on missing child object
- 03:54 PM rbd Subtask #2606 (In Progress): librbd layering: copyup on missing child object
- 02:45 PM Bug #2800: osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first == repop->v)
- Might be a result of reordered or dropped messages.
- 02:31 PM Bug #2800 (Resolved): osd/ReplicatedPG.cc: 3537: FAILED assert(waiting_for_ondisk.begin()->first ...
- ceph-sha1: 508bf3fb96929dc6e91ee83a915a7f30df23eeeb
description: collection:thrash clusters:6-osd-3-machine.yaml fs:... - 02:26 PM Bug #2799 (Resolved): osd: pg log trimming zeroing broken
- ...
- 12:31 PM devops Feature #2780 (In Progress): gitbuilder: move to vercoi, redo deployment if feasible
- 12:30 PM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
- Good enough for now.
- 05:09 AM Tasks #2789: pkgconfig files for libraries
- No, I'm talking about: http://en.wikipedia.org/wiki/Pkg-config
I added a patch which adds these files, but I'm not... - 02:50 AM Revision c9967120 (ceph): add CRUSH_TUNABLES feature bit
- Signed-off-by: Sage Weil <sage@inktank.com>
- 02:49 AM Revision 6f381aff (ceph): add CRUSH_TUNABLES feature bit
- Signed-off-by: Sage Weil <sage@inktank.com>
07/18/2012
- 10:37 PM Revision e3349a2a (ceph): OSD::handle_osd_map: don't lock pgs while advancing maps
- We no longer do anything with the pgs here. PG map
advancing is now handled in OSD::advance_pg asynchronously.
Signe... - 09:26 PM Revision ca9f7130 (ceph): OSD: actually send queries during handle_pg_create
- During the osd threading refactor, we lost the do_queries
call in favor of dispatch_context. However, this did not
i... - 09:26 PM Revision 8f5562ff (ceph): OSD: write_if_dirty during get_or_create_pg after handle_create
- In the case that the pg is newly created, we will activate during
that call, so the info and log will be dirty.
Sign... - 09:26 PM Revision c8ee3016 (ceph): osd: add osd_debug_drop_pg_create_{probability,duration} options
- This will let us exercise more of the pg creation code.
Signed-off-by: Sage Weil <sage@inktank.com> - 08:44 PM Revision c49daeca (ceph): clock: print skew with ntp servers to log to help debug time issues
- 07:59 PM Revision 0b579546 (ceph): ObjectCacher: fix cache_bytes_hit accounting
- Misses are not hits!
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 07:58 PM Revision 0d0b4689 (ceph): Merge branch 'next'
- 07:55 PM Revision 5dd68b95 (ceph): objecter: always resend linger registrations
- If a linger op (watch) is sent to the OSD and updates the object, and then
the client loses the reply, it will resend... - 06:04 PM Revision 55847fc2 (ceph): nuke: log what pid we are killing when we kill it
- 05:52 PM Revision 7586cde9 (ceph): qa/workunits/suites/pjd.sh: bash -x
- This will let us see what test is failing, exactly, and what its inputs
were. Hoping to help find #2187.
Signed-off... - 05:44 PM Revision 76efd977 (ceph): OSD: publish_map in init to initialize OSDService map
- Other areas rely on OSDService::get_map() to function, possibly before
activate_map is first called. In particular, ... - 05:25 PM Revision 675d6302 (ceph): ObjectCacher: fix cache_bytes_hit accounting
- Misses are not hits!
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 02:35 PM Revision 4e1d973e (ceph): doc: Fixed heading text.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 02:35 PM Revision ebc57736 (ceph): doc: favicon.ico should be new Ceph icon.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 02:17 PM Bug #2798 (Resolved): handle_osd_ping assert
- 10:40 AM Bug #2798: handle_osd_ping assert
- Also,
ubuntu@teuthology: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13039
ubuntu@teuthology... - 10:38 AM Bug #2798: handle_osd_ping assert
- Recent logs: /a/teuthology-2012-07-17_19:00:06-regression-master-testing-gcov/13020
ubuntu@teuthology:/a/teuthol... - 01:46 PM rbd Bug #2478 (Can't reproduce): krbd: unmap on 3.4.0: scheduling while atomic...
- 01:32 PM Bug #2022: osd: misdirectect request
- Writing down things discovered due to conversation on irc:
The log message happens with v0.48argonaut, it seems 41... - 12:55 PM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
- 10:17 AM CephFS Bug #2187: pjd chown/00.t failed test 97
- ...
- 04:28 AM Revision 3a377c44 (ceph): doc: Overhauled Swift API documentation.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 03:05 AM Revision 5c5ca4b8 (ceph): move cfuse + dbench from marginal to regression
- Fixed #1737, yay!
- 02:20 AM Revision aecf0031 (ceph): Merge branch 'next'
- 02:19 AM Revision d78235be (ceph): client: fix readdir locking
- Several of the readdir-related methods were not taking client_lock.
Fixes: #1737
Backport: argonaut
Signed-off-by: S... - 02:18 AM Revision 82a575c9 (ceph): client: fix leak of client_lock when not initialized
- Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com>
07/17/2012
- 11:58 PM Revision 58e81c82 (ceph): OSD: handle_osd_ping: use service->get_osdmap()
- This way, we avoid grabbing the map_lock. Furthermore,
get curmap at the beginning of the method to ensure that
we s... - 11:58 PM Revision 90ddc5ae (ceph): OSD: use service.get_osdmap() in heartbeat(), don't grab map_lock
- service.get_osdmap() gives us sufficiently consistent
access to the map state.
Signed-off-by: Samuel Just <sam.just@ink... - 11:51 PM Revision 32892c1e (ceph): doc/dev/osd_internals: add newlines before numbered lists
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 11:02 PM Revision fe4c658b (ceph): librados: simplify locking slightly
- No reason to hold mylock_all here.
Signed-off-by: Sage Weil <sage@inktank.com> - 09:42 PM Bug #2067 (Resolved): librados: we leak CephContext from rados_create()
- 09:42 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
- 07:40 PM Revision 199397dc (ceph): osd: default 'osd_preserve_trimmed_log = false'
- This option makes the osd skip zeroing old trimmed regions of the log. The
data is never read, since the xattrs indi... - 07:21 PM CephFS Bug #1737 (Resolved): ceph-fuse crash in xlist::remove
- commit:d78235be1b5c20440f3a55abb3f4b980aa3f3596
- 07:11 PM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
- 08:57 AM Bug #2794 (Resolved): test_stress_watch: deadlocks with ms failure injection
- this is 100% reproducible on my notebook, but not on my other boxes. debugging it there. :(
- 07:11 PM Bug #2796: osd: watch state not reestablished when registration op resent
- 12:45 PM Bug #2796 (Fix Under Review): osd: watch state not reestablished when registration op resent
- 09:04 AM Bug #2796 (Resolved): osd: watch state not reestablished when registration op resent
- if the client doesn't get the watch ack and resends, the osd will ignore it as a dup op, and the watch session state ...
- 05:00 PM Revision 12dc0ad1 (ceph): ceph: archive mon data to a .tgz
- Saves bandwidth, time, and space.
- 04:54 PM Revision 24df8b1d (ceph): doc/dev: add osd_internals to toc
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 04:31 PM Revision 5a27f071 (ceph): doc/internals/osd_internals: fix indentation errors
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 04:23 PM Bug #2798 (Resolved): handle_osd_ping assert
- ceph version 0.48argonaut-404-gabe05a3 (commit:abe05a3fbbb120d8d354623258d9104584db66f7)
1: (OSDMap::get_cluster_in... - 03:36 PM Revision 36d0a355 (ceph): log: simplify log logic a bit
- Whether an entry is eligible to log/dump is independent of the channel it
is sent to. Some channels impose additiona... - 03:36 PM Revision 6490c84f (ceph): doc: discuss choice of pg_num
- Signed-off-by: Sage Weil <sage@inktank.com>
- 02:36 PM devops Feature #2782: chef: easy scripts for running in vercoi vms
- 02:20 PM devops Feature #2662 (Resolved): crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
- Done as of barclamp-ceph.git commit 36842bd3e3db9fea65e2921bb7b57e9088964621.
- 02:03 PM rgw Feature #2797 (Resolved): rgw: support multi-objects delete
- Support a multi-objects delete S3 api call.
- 10:39 AM Subtask #2758 (Fix Under Review): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyVal...
- From Task #2756:
> * We changed KeyValueDB:
> > * Added a Whole-space iterator interface
> > * Implemented the p... - 10:32 AM Subtask #2756 (Fix Under Review): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
- 10:32 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
- Just a couple of notes in this regard:
* We changed KeyValueDB:
> * Added a Whole-space iterator interface
> * I... - 08:56 AM RADOS Subtask #2793 (Resolved): osd: require tunable feature if current osdmap uses non-default tunables
- 08:56 AM RADOS Subtask #2792 (Resolved): mon: require tunable feature bit if current osdmap uses non-default tun...
- 08:52 AM RADOS Subtask #2791 (Resolved): crush: feature bit for crush tunables
- 08:48 AM Tasks #2789: pkgconfig files for libraries
- There is a librados-config in librados-dev. Does that mean we just need a librbd-config too?
- 06:12 AM Tasks #2789 (New): pkgconfig files for libraries
- From the libvirt mailing list I got the message that they'd like to see pkg-config for at least librbd and librados.
... - 08:29 AM Linux kernel client Bug #2790 (Duplicate): libceph: crash in read_partial_message_section on ffsb
- ...
- 12:36 AM Revision abe05a3f (ceph): Merge branch 'next'
- 12:35 AM Revision 2869039b (ceph): Robustify ceph-rbdnamer and adapt udev rules
- Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.
On ou... - 12:34 AM Revision 96587f39 (ceph): Robustify ceph-rbdnamer and adapt udev rules
- Below is a patch which makes the ceph-rbdnamer script more robust and
fixes a problem with the rbd udev rules.
On ou...
07/16/2012
- 11:30 PM Revision b0465496 (ceph): doc/radosgw/config.rst: mended small typo
- Signed-off-by: caleb miles <caleb.miles@inktank.com>
- 11:15 PM Revision 2a8c4db7 (ceph): Merge branch 'wip-mon-mkfs'
- Reviewed-by: Tommi Virtanen <tv@inktank.com>
- 11:14 PM Revision 54be9d09 (ceph): mon: stop doing rm -rf on mon mkfs
- Simply verify that the directory exists, or if it doesn't, create it.
Do nothing about its content.
Signed-off-by: S... - 11:14 PM Revision 6b1835a9 (ceph): vstart.sh: blow away mon directory on creation/start
- Now that ceph-mon doesn't blow away the mon data content, we need to.
Signed-off-by: Sage Weil <sage@inktank.com> - 11:14 PM Revision 4e66a3b9 (ceph): mkcephfs: error out if mon data directory is not empty
- The ceph-mon --mkfs function no longer wipes out the directory; it is in
fact mostly a no-op that just verifies the d... - 11:14 PM Revision 4eec4fc5 (ceph): mkcephfs: nicer empty directory check
- From TV.
Signed-off-by: Sage Weil <sage@inktank.com> - 11:14 PM Revision 426384f6 (ceph): log: apply log_level to stderr/syslog logic
- In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below th... - 11:13 PM Revision f9c1a6fb (ceph): Merge branch 'next'
- 11:02 PM Revision 52f96b9f (ceph): log: apply log_level to stderr/syslog logic
- In non-crash situations, we want to make sure the message is both below the
syslog/stderr threshold and also below th... - 10:53 PM Revision de524abd (ceph): log: dump logging levels in crash dump
- So you know what you are/are not seeing.
Signed-off-by: Sage Weil <sage@inktank.com> - 10:53 PM Revision d3c76f75 (ceph): Merge branch 'next'
- 10:45 PM rbd Feature #2788 (Resolved): rbd: add -l to rbd ls, show snapshots
- Add -l to rbd ls to show snapshots, perhaps size of parent
- 10:43 PM Revision 3821f6c4 (ceph): PG: grab reference to pg in C_OSD_AppliedRecoveredObject
- Otherwise, accessing the pg via _applied_recovered_object
isn't safe. Using intrusive_ptr clarifies the reference
ow... - 10:37 PM Revision 8dafcc5c (ceph): log: fix event gather condition
- We should gather an event if it is below the log or gather threshold.
Previously we were only gathering if we were g... - 10:36 PM Revision 64f74500 (ceph): log: fix event gather condition
- We should gather an event if it is below the log or gather threshold.
Previously we were only gathering if we were g... - 09:58 PM rbd Feature #2717: cls_rbd: add methods for maintaining mapping from parent to child images
- 09:58 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
- 09:57 PM rbd Feature #2718: librbd: map parent -> child in a per-pool rbd_children object when cloning
- From the design doc:
In addition to knowing which parent a given image has, we want to be able to tell if a protec... - 09:18 PM Revision 4afa8925 (ceph): PG: use stats from primary after rewinding divergent entries
- If the osd receiving the info has divergent entries, it will
also have a "divergent" stat structure.
Probably fixes ... - 09:18 PM Revision d4410e4a (ceph): PG::RecoveryState::Stray::react(LogEvt&): set dirty_info/log
- We adjust the info and the log, so we must set dirty_info and
dirty_log to force writes.
Signed-off-by: Samuel Just ... - 09:18 PM Revision 5f602366 (ceph): Merge remote-tracking branch 'upstream/next'
- 09:11 PM Revision ec5cd6de (ceph): PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub
- We need to reset the last_pg_scrub data in the osd since we
are replacing the info.
Probably fixes #2453
In cases l... - 09:07 PM Revision c7fb964c (ceph): PG::RecoveryState::Stray::react(LogEvt&): reset last_pg_scrub
- We need to reset the last_pg_scrub data in the osd since we
are replacing the info.
Probably fixes #2453
In cases l... - 06:34 PM devops Feature #2662: crowbar: Make barclamp-ceph set mon initial members, monitor-secret, fsid
- Currently have fsid generated via uuidgen, monitor is generated via the python script which is residing in barclamp-c...
- 06:33 PM Revision 248cfadd (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
- pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill.... - 06:28 PM rbd Feature #2561 (Fix Under Review): rbd: copyup command
- Implemented at the moment as "flatten".
- 06:26 PM rbd Feature #2561 (In Progress): rbd: copyup command
- 06:27 PM rbd Feature #2559: cls_rbd: copyup method
- 06:26 PM rbd Subtask #2604 (In Progress): librbd layering: read path
- 06:26 PM rbd Subtask #2605 (In Progress): librbd layering: guard writes
- 06:11 PM Revision 5d82a770 (ceph): doc/dev/osd_internals: OSD overview, pg removal, map/message handling
- This is a start on some osd internals documentation for new
developers.
Signed-off-by: Samuel Just <sam.just@inktank... - 05:59 PM Revision 470796b5 (ceph): CompatSet: users pass bit indices rather than masks
- CompatSet users number the Feature objects rather than
providing masks. Thus, we should do
mask |= (1 << f.id) rath... - 05:59 PM Revision 90381dc9 (ceph): OSD: set superblock compat_features on boot and mkfs
- Previously, we did not actually persist the osd compatibility
mask. Without persisting the current compat mask, a pr... - 05:59 PM Revision 12d70738 (ceph): PG: use write_info to set snap_collections in make_snap_collections
- At one point, snap_collections were written to a pg collection
attribute. Subsequently, they were moved to the bigin... - 05:59 PM Revision 1b8819bb (ceph): PG: Place info in biginfo object
- The purged_snaps set can grow without bound as snaps are
created and removed. Because the filestore doesn't
provide ... - 05:58 PM Revision bcb1073f (ceph): mon/MonitorStore: always O_TRUNC when writing states
- It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p... - 05:57 PM Revision 41a57077 (ceph): osd: based misdirected op role calc on acting set
- We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin... - 05:57 PM Revision b7814dbe (ceph): osd: based misdirected op role calc on acting set
- We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin... - 05:57 PM Revision 14d2efc4 (ceph): mon/MonitorStore: always O_TRUNC when writing states
- It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p... - 05:53 PM Revision ff0f4742 (ceph): set machine description to ctx.archive when auto-locking machines for a...
- 05:48 PM Revision e429da34 (ceph): Merge remote-tracking branch 'gh/bugfix-2022'
- Reviewed-by: Samuel Just <sam.just@inktank.com>
- 05:35 PM Revision 811665a8 (ceph): move cfuse + ffsb from marginal to regression
- This has had no failures.
- 04:41 PM Revision 88e2ad35 (ceph): move cfuse + fsx back into regression suite
- No failures in marginal. The objectcacher fixes that came out of the
rbd_fsx stuff probably fixed the original problem? - 04:12 PM Revision 47b38dd0 (ceph): Merge remote-tracking branch 'gh/bugfix-2779'
- Reviewed-by: Greg Farnum <greg@inktank.com>
- 04:07 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
- 03:04 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
- 03:04 PM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
- Probably fixed in c7fb964c077d369943bd5c066c5f99da6bd5f37c.
Backported to stable: ec5cd6def9817039704b6cc010f2797a70... - 11:17 AM Bug #2761: osd: failed to recover before timeout expired
- Logs: ubuntu@teuthology: /a/teuthology-2012-07-14_00:00:03-regression-next-testing-basic/11248
ubuntu@teuthology:/... - 11:01 AM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
- 11:01 AM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
- 11:00 AM Bug #2787 (Resolved): RBD test copy fails when thrashing
- Logs: ubuntu@teuthology:/a/teuthology-2012-07-15_00:00:04-regression-next-testing-basic/11629
Attached below an ex... - 10:55 AM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
- 10:48 AM Linux kernel client Bug #2708 (Can't reproduce): spinlock lockup in queue_con, queue_work
- 10:47 AM Bug #2022 (Resolved): osd: misdirectect request
- YAY
- 10:47 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
- 05:03 AM Revision f94c7646 (ceph): mon: remove osds from [near]full sets when their stats are removed from...
- Greg points out that we could have a situation like:
- mon recovers..
- goes through osdmaps, notes an osd was rem... - 04:38 AM Revision fe576818 (ceph): mon/MonitorStore: always O_TRUNC when writing states
- It is possible for a .new file to already exist, potentially with a
larger size. This would happen if:
- we were p...
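The MonitorStore entries above describe a leftover `.new` file that may be larger than the state being written. A minimal Python sketch of why opening with `O_TRUNC` matters in that situation follows; the helper names and the file name `state.new` are hypothetical, and this is not the MonitorStore implementation.

```python
import os
import tempfile


def write_state(path, data, truncate=True):
    """Write data to path; with truncate=True any older, longer content is discarded."""
    flags = os.O_WRONLY | os.O_CREAT
    if truncate:
        flags |= os.O_TRUNC
    fd = os.open(path, flags, 0o644)
    try:
        os.write(fd, data)
        os.fsync(fd)
    finally:
        os.close(fd)


def read_all(path):
    with open(path, "rb") as f:
        return f.read()


def demo():
    """Return the file content after a write without and with O_TRUNC."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "state.new")
        # Simulate a stale, longer file left behind by an earlier attempt.
        write_state(path, b"old-and-longer-content")
        # Without O_TRUNC only the first bytes are overwritten.
        write_state(path, b"new", truncate=False)
        stale = read_all(path)
        # With O_TRUNC the old tail is dropped before writing.
        write_state(path, b"new", truncate=True)
        clean = read_all(path)
        return stale, clean
```

Without truncation the shorter write leaves the tail of the old file in place, so a reader would see the new state followed by stale bytes.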
07/15/2012
- 11:31 PM Revision bf9a85ad (ceph): filestore: dump open fds when we hit EMFILE
- Use a helper to dump /proc/self/fd when we hit EMFILE in the filestore.
Ideally, we should trigger this in other appr... - 09:40 PM Bug #2707 (Can't reproduce): mkcephfs failing on v0.48 "argonaut"
- 09:40 PM Bug #2593 (Resolved): logmonitor: decode failure
- aie. commit:fe57681892b0bf3155ddeaeaa79ddbf997e27af9
- 08:43 PM Feature #2330 (Resolved): dump open files, sockets when we run out of fds
- commit:bf9a85ade6f7b06ed28ff7930b9e33a41acf0e34
- 12:39 AM Revision a278ea13 (ceph): osdmap: drop useless and unused get_pg_role() method
- Users probably want get_pg_acting_rank(). If they don't, they can probably
have the mapping and can calculate the ra... - 12:39 AM Revision 6faeedac (ceph): osd: simplify helper usage for misdirected ops
- Make the helper exclusively for the PG != NULL cases, and open-code the
one PG == NULL caller. This is simpler, and ... - 12:39 AM Revision 38962abd (ceph): osd: based misdirected op role calc on acting set
- We want to look at the acting set here, nothing else. This was causing us
to erroneously queue ops for later (wastin... - 12:39 AM Revision ed4f80f9 (ceph): vstart: use absolute path for keyring
- Stores absolute path to the generated keyring so that tests running in
other directories (e.g. src/java/test) can sim...
07/14/2012
- 08:51 PM Revision 042edcbe (ceph): schedule/suite: schedule job, suite N times
- 05:39 PM Bug #2022 (Fix Under Review): osd: misdirectect request
- FINALLY tracked this one down. tests consistently now pass. bugfix-2022
- 05:29 PM rgw Feature #2786 (Resolved): radosgw-admin: ability to remove objects/buckets
- Need to be able to remove objects/buckets. Bucket removal can optionally remove all objects. Another option is to set...
07/13/2012
- 11:09 PM Revision 117b2868 (ceph): OSD: add config options to fake missed pings
- In order to test monitor and osd failure detection and false
positive correction, this patch adds the following optio... - 10:14 PM Revision ce20e020 (ceph): crushtool: allow information generated during testing to be dumped
- to a set of CSV files for off-line analysis.
Signed-off-by: caleb miles <caleb.miles@inktank.com> - 09:16 PM Revision 8a89d40e (ceph): doc: remove last reference to ceph-cookbooks.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 09:08 PM Revision 20119567 (ceph): doc: cookbooks issue resolved, so changed 'ceph-cookbooks' back to 'ceph.'
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 08:57 PM Revision f37214e1 (ceph): kernel: fix kernel installation when kdb: is specified
- This normalize check would only trigger if a *single* key was specified.
Change it so that it triggers as long as all... - 08:35 PM Revision b3d077c6 (ceph): qa: download tests from specified branch
- These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 08:35 PM Revision 5a5597f6 (ceph): qa: download tests from specified branch
- These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 07:36 PM Revision f5d1a328 (ceph): schedule_suite.sh: use workunits from ceph commit
- Use the workunits from the same ceph branch we are testing.
- 07:18 PM Revision 53600798 (ceph): OSD: send_still_alive when we get a reply if we reported failure
- When we get a ping reply, remove the peer from the failure_queue
and send a still alive message if the peer is in the... - 06:30 PM Revision b0985e44 (ceph): ceph: add default btrfs mkfs options
- 06:30 PM Revision 014fb97f (ceph): ceph: cleanup/simplify mount/mkfs options
- 06:13 PM Revision a50bf672 (ceph): workunit: allow branch/sha1/tag to be specified
- Pull the workunit(s) from the branch/tag/sha1 specified in the config.
- 06:13 PM Revision d0c9e49b (ceph): workunit: allow overrides
- Pull top-level overrides into our config. This lets you do:
overrides:
workunit:
branch: foo
tasks:
...
- wor... - 05:19 PM Revision 5924f8e4 (ceph): PG: merge_log always use stats from authoritative replica
- If the osd receiving the log has divergent entries, it will
also have a "divergent" stat structure. In general, it s... - 05:01 PM Revision 58126b01 (ceph): workunit: pass branch/sha1 to test
- Some tests download things from the ceph repo. Let them know which
version to use through the CEPH_REF environment va... - 04:58 PM Bug #2748: include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
- https://github.com/athanatos/ceph/commits/wip_biginfo
- 04:57 PM Subtask #825 (Resolved): osd: remove pg map updating from handle_osd_map
- 04:57 PM Bug #2461 (Resolved): DBObjectMap is incompatible with collection_rename
- 04:42 PM Revision 3dd65a89 (ceph): qa: download tests from specified branch
- These python tests aren't installed, so they need to be downloaded
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 03:57 PM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
- for the record, this is being done in this sprint for two reasons:
(a) we are under some pressure to give up the old... - 09:39 AM devops Feature #2780: gitbuilder: move to vercoi, redo deployment if feasible
- A few possible kinks:
* The fabfile currently pulls the magic ssh keys for rsync from cephbooter. we need a new ... - 09:34 AM devops Feature #2780 (Closed): gitbuilder: move to vercoi, redo deployment if feasible
- 02:28 PM Revision ce7e0be1 (ceph): mon: use single helper for [near]full sets
- Use a single helper to add/remove osds from the [near]full sets. This
keeps the logic in a single place, and simplif... - 02:28 PM Revision 30b3dd1d (ceph): mon: purge removed osds from [near]full sets
- The [near]full sets are volatile state. Remove removed (or created)
osds from the set when we process a map.
Fixes:... - 12:40 PM devops Feature #2785 (In Progress): crowbar: custom ISO builder that includes barclamp-ceph
- 12:40 PM devops Feature #2785 (Resolved): crowbar: custom ISO builder that includes barclamp-ceph
- 12:38 PM devops Feature #2662 (In Progress): crowbar: Make barclamp-ceph set mon initial members, monitor-secret,...
- 12:37 PM devops Feature #2697 (Resolved): crowbar: ISO generation, reproducible in a cloud image vm
- Tyler says the google doc is good to go.
- 12:32 PM Feature #2709 (Resolved): teuthology: create xfs fs with large inodes
- 11:08 AM Bug #2784 (Resolved): osd hit suicide timeout
- Log: ubuntu@teuthology:/a/teuthology-2012-07-12_19:00:15-regression-master-testing-gcov/10615
ubuntu@teuthology:/a... - 10:22 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
- 5924f8e4a8c29e6de326a9e8576c30109cdc0e07
- 09:50 AM devops Bug #2783 (Won't Fix): Chef-client fails on oneiric cluster
- While verifying the web docs for Chef deployment of ceph, chef-client fails with the following error on an oneiric cl...
- 09:45 AM Bug #2779 (Fix Under Review): mon: [near]full status doesn't get purged when osds are removed
- tag!
- 07:12 AM Bug #2779 (Resolved): mon: [near]full status doesn't get purged when osds are removed
- Date: Fri, 13 Jul 2012 12:17:47 +0400
From: Andrey Korolyov <andrey@xdel.ru>
To: ceph-devel@vger.kernel.org
Subjec... - 09:43 AM devops Feature #2782 (Resolved): chef: easy scripts for running in vercoi vms
- 09:42 AM devops Feature #2781 (Closed): crowbar: easy scripts for running in vercoi vms
- 09:33 AM devops Feature #2694 (Closed): sepia: provide DNS for dynamic vms
- Works well enough for first iteration.
- 09:32 AM RADOS Feature #2706 (Resolved): crush: update kernel code to decode tunables
- 04:12 AM rbd Bug #2777: qemu: report discard support
- It works for me; I tested ext4 with discard and ext4 with fstrim.
- 01:02 AM Revision 1741cb6c (ceph): Added functionality to get mkfs and mount options for file systems
- from the config file, if present. Otherwise, default options are used.
The default value for inode size is changed to...
07/12/2012
- 11:52 PM Revision bcfa573f (ceph): ReplicatedPG: don't mark repop done until apply completes
- Consider the following sequence:
1. issue, apply repop
2. replicas and primary commit
Here, repop->waitfor_(ack|dis... - 11:36 PM Revision 353d9ccf (ceph): fixed typo
- Signed-off-by: tamil <tamil.muthamizhan@inktank.com>
- 11:14 PM Revision 10ec5926 (ceph): test_librbd: fix warnings
- test/test_librbd.cc: In member function ‘virtual void LibRBD_TestClone_Test::TestBody()’:
warning: test/test_librbd.c... - 11:05 PM Revision f6a16f74 (ceph): fix wrongly marked down whitelist
- This used to have '...or wrong addr' but it doesn't any more.
- 09:06 PM Revision 5450567a (ceph): ReplicatedPG,PG: dump recovery/backfill state on pg query
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 08:11 PM Revision b133c490 (ceph): Merge remote-tracking branch 'gh/wip-2101'
- 06:46 PM Revision dfe29aff (ceph): doc: reverted file and role names.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 06:27 PM Revision 508bf3fb (ceph): rbd: enable layering when using the new format
- We'll add options for different features later.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> - 05:47 PM Revision f8478d4c (ceph): upstart: Make ceph-osd always set the crush location.
- This used to be conditional on config having osd_crush_location set,
but with that, minimal configuration left the OS... - 05:25 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
- wip_2769 should fix it.
- 05:00 PM Revision d1054df6 (ceph): doc: perf counters
- Signed-off-by: Sage Weil <sage@inktank.com>
- 05:00 PM Revision 5ceb7c73 (ceph): doc: fix config metavariables discussion
- Signed-off-by: Sage Weil <sage@inktank.com>
- 04:55 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
- bcfa573f5f615f3403ff71da0212cd1cee7e7d9c
The core provided pretty good support for this sequence of events, markin... - 04:42 PM Revision e855cb24 (ceph): rgw: don't override subuser perm mask if perm not specified
- Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.
Sig... - 04:41 PM Revision f33c0bee (ceph): rgw: don't override subuser perm mask if perm not specified
- Bug #2650. We were overriding subuser perm mask whenever subuser
was modified, even if perm mask was not passed.
Sig... - 04:40 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
- 04:00 PM Revision 09c60b43 (ceph): doc: added :: to code example.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 03:55 PM Revision ad8beeb4 (ceph): doc: minor edits.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 03:47 PM Revision 63a17998 (ceph): doc: cookbook name change broke some things in doc. Fixed.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 03:33 PM Revision 51148b81 (ceph): radosgw-admin: use --bucket instead of old --bucket-id
- The --bucket-id support was removed.
- 02:31 PM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
- *next is enough to figure out whether we ended on a boundary.
- 02:07 PM Feature #2680 (Resolved): osd: report backfill progress via query
- 5450567a6769b64ace385140ffb645d9d805e354
- 01:58 PM Revision 65c43e34 (ceph): debian: fix ceph-fs-common-dbg depends
- Signed-off-by: James Page <james.page@ubuntu.com>
- 01:58 PM Revision d6c766ea (ceph): debian: fix ceph-fs-common-dbg depends
- Signed-off-by: James Page <james.page@ubuntu.com>
- 12:22 PM rbd Bug #2777: qemu: report discard support
- It looks like this is handled by what qemu calls a 'device property', so you have to use a command line like:
<pre... - 08:14 AM rbd Bug #2777: qemu: report discard support
- Yeah, QUEUE_FLAG_DISCARD is as far as I looked yesterday too. The question now is how to set that within qemu.
- 11:23 AM CephFS Bug #2778 (Can't reproduce): uclient: failed asserts if mounting with a non-existent directory
- From Noah.
If Ceph is mounted with a non-existent directory, the mount call returns a success, but on unmount does t... - 11:23 AM Bug #2453: osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
- latest logs: ubuntu@teuthology:/a/teuthology-2012-07-12_05:00:04-regression-stable-master-basic/9637
ubuntu@teutho... - 09:58 AM RADOS Feature #2101 (Resolved): crushtool: check for weight overflow on reweight
- 03:06 AM Revision cc8df29e (ceph): rados tool: bulk objects removal
- Issue #2776. Allow the removal of multiple objects in a single
rados tool command:
# rados -p pool rm obj1 [obj2 [... - 02:59 AM Revision 762a5b63 (ceph): Merge remote-tracking branch 'gh/wip-cct'
- Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com> - 01:56 AM Revision f20b6022 (ceph): Merge branch 'next'
- Conflicts:
src/rados.cc - 01:54 AM Revision 99a048d8 (ceph): rados: more usage cleanup
- Signed-off-by: Sage Weil <sage@inktank.com>
- 01:53 AM Revision 0081c8e4 (ceph): rados: usage message
- Bad linebreaks, wrapping, stringification, missing doc for bench args
Signed-off-by: Dan Mick <dan.mick@inktank.... - 12:35 AM Revision 0782db36 (ceph): doc: changed role file names as part of update to roles.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 12:35 AM Revision e5997f4e (ceph): doc: added DHO config.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 12:11 AM Revision 173d592a (ceph): rados tool: remove -t param option for target pool
- Bug #2772. This fixes an issue that was introduced when we
added the 'rados cp' command. The -t param was already use...
07/11/2012
- 11:36 PM Revision 31c8dcc1 (ceph): crush: sum and check quantized weights for bucket
- Sum the quantized weights for each bucket, and check that for overflow.
This could change the results of a compile m... - 11:18 PM rbd Bug #2777: qemu: report discard support
- I don't know what I'm talking about, but stumbling around in the kernel seems to point at QUEUE_FLAG_DISCARD as the f...
- 02:19 PM rbd Bug #2777 (Resolved): qemu: report discard support
- todin reported problems enabling discard inside qemu.
I'm not sure if this is rbd-specific, but there seem to be s... - 11:14 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
- latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9430
ubuntu@teutho... - 09:01 PM Bug #2769: 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 bytes in clu...
- ...
- 11:10 PM rbd Bug #2715: krbd: spinlock wrong CPU
- latest logs: ubuntu@teuthology:/a/teuthology-2012-07-11_19:00:11-regression-master-testing-gcov/9371
12-07-11T19:36... - 11:03 PM Revision 675a1b7b (ceph): crush: Set maximum device/bucket weights.
- Signed-off-by: caleb miles <caleb.miles@inktank.com>
- 11:03 PM Revision c9fc5a24 (ceph): crush: prevent integer overflow on reweight
- Disallow setting OSD weights to a value over 10,000 and cap bucket weight
at 10,000,000 in a CRUSH map. Addresses iss... - 10:32 PM Revision d29ec1e2 (ceph): rados: usage message
- Bad linebreaks, wrapping, stringification, missing doc for bench args
Signed-off-by: Dan Mick <dan.mick@inktank.... - 09:23 PM Revision 9b289486 (ceph): nuke: honor 'check-locks: ...' field in targets file
- If you are nuking a yaml file with check-locks: false, don't check locks.
- 09:14 PM Revision 3abc4128 (ceph): internal: archive mon data dirs
- These can be useful for debugging, and are usually pretty small.
Fixes: #2714 - 09:10 PM Bug #2772 (Resolved): rados bench -t broken
- 10:16 AM Bug #2772: rados bench -t broken
- we can't change bench flags.
just don't make a short one-char flag for the target... use --target or whatever instead - 10:12 AM Bug #2772: rados bench -t broken
- FWIW, I hacked up a patch last night (changed -t to -n to avoid conflicts, and made some changes to the usage message...
- 09:10 PM Revision cff2cfa2 (ceph): internal: move pulling archive w/ tar to helper
- 06:52 PM Revision 95e8d87b (ceph): rados tool: remove -t param option for target pool
- Bug #2772. This fixes an issue that was introduced when we
added the 'rados cp' command. The -t param was already use... - 05:59 PM Revision 986c0ef9 (ceph): rbd: test with layering enabled
- RBD_FEATURES=0 hits a bug that's fixed in wip-rbd-parent.
Once that's merged, we can add RBD_FEATURES=0 tests back in. - 04:19 PM Revision 5b107783 (ceph): Makefile: don't install crush headers
- This is leftover from when we built a libcrush.so. We can re-add when we
start doing that again.
Reported-by: Laszl... - 04:19 PM Revision 2c001b28 (ceph): Makefile: don't install crush headers
- This is leftover from when we built a libcrush.so. We can re-add when we
start doing that again.
Reported-by: Laszl... - 04:04 PM Revision 22d0648d (ceph): librados: simplify cct refcounting
- get() in ctor, put() in dtor.
Signed-off-by: Sage Weil <sage@inktank.com> - 03:58 PM Revision c5bcb04b (ceph): lockdep: stop lockdep when its cct goes away
- When a cct is destroyed, tell lockdep so that it can shut down if it needed
it.
Signed-off-by: Sage Weil <sage@inkta... - 03:27 PM Revision fe6ed3e7 (ceph): ffsb is marginal, remove from smoke suite
- 02:23 PM Feature #2714 (Resolved): teuthology: store monitor data in archives
- 01:54 PM Bug #2387 (Duplicate): mon: could not get service secret for auth subsystem
- 11:31 AM Feature #2776 (Resolved): rados tool: bulk removal of objects
- Provide an option to remove multiple objects in a single command.
- 10:08 AM Bug #2771: mon|osd: fail to create pgs on startup
- i think this is related:...
- 09:19 AM Feature #2775 (Rejected): rest-bench man page
- 09:19 AM devops Documentation #2774 (Resolved): doc: ceph-disk man page
- commit:a450cab2b8148cb8a9b043d629feccf89e5aabac
- 09:11 AM rbd Subtask #2603 (Fix Under Review): librbd layering: open parent on open
- In branch wip-rbd-parent
- 08:16 AM Feature #2666 (Resolved): rados tool: copy pool
- 04:35 AM rbd Feature #2773 (Resolved): libvirt: incorrect snapshot behavior
- Since consistent (disk and memory state) snapshots are currently supported only with qcow images, libvirt should:
- all... - 04:27 AM Revision 7adc6c08 (ceph): mon: simplify logmonitor check_subs; less noise
- * simple helper to translate name to id
* verify sub type is valid in caller
* assert sub type is valid in method
... - 03:26 AM Revision 19742ec2 (ceph): Revert "smoke: add msgr failures"
- This reverts commit 9278e231e64f49c3205c2ded8b1f2d3b27265eac.
- 02:58 AM Revision 657c9db2 (ceph): move cfuse fsx into marginal suite
- This should probably pass, given the testing that ObjectCacher gets these
days with librbd_fsx. - 02:58 AM Revision c3e1ab49 (ceph): remove suites/stress/basic
- 02:58 AM Revision 392a6596 (ceph): move some old flaky tasks into marginal suite
- These were pulled out of regression a while ago. Put them into the
marginal suite where they will be regularly run a... - 02:04 AM Revision a542d89e (ceph): mds: fix race in connection accept; fix con replacement
- We solve two problems with this patch. The first is that the messenger
will now reuse an existing session's Connecti... - 01:21 AM Revision fa96e19f (ceph): Merge branch 'stable' into next
- 01:21 AM Revision 0f917c2f (ceph): osd: guard class call decoding
- Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com> - 01:21 AM Revision 0ff6c979 (ceph): test_stress_watch: just one librados instance
- This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets... - 01:19 AM Revision ee1c029d (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
- pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill.... - 01:18 AM Revision d3c97dae (ceph): librados: take lock when signaling notify cond
- When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the po... - 01:17 AM Revision ec490d87 (ceph): client: fix locking for SafeCond users
- Need to wait on flock, not client_lock.
Signed-off-by: Sage Weil <sage@inktank.com>
07/10/2012
- 11:13 PM Revision 38e2de3d (ceph): doc: No ssh -t -t, forcing a pty allocation there makes it hang.
- Earlier, this was a single -t, and that is overridden by the fact that
stdin is not a tty, so that did nothing.
Sign... - 11:03 PM Revision 79e3416c (ceph): doc: removed the ceph directory per tommi's update to the chef-cookbooks.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 10:23 PM Revision 5c84f013 (ceph): doc: Adding apt update message. VM users didn't get the package otherwise.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 10:13 PM Bug #2772: rados bench -t broken
- actually the -t param is for 'rados cp' which is for an object, not for cppool.
- 10:02 PM Bug #2772: rados bench -t broken
- let's just make it --target-pool or similar. this is an uncommon enough command that it doesn't really need a single...
- 10:00 PM Bug #2772 (Resolved): rados bench -t broken
- The -t parameter has been assigned to rados cppool (target pool), but apparently it was already being used for rados...
- 09:56 PM Feature #2714 (In Progress): teuthology: store monitor data in archives
- 09:25 PM Bug #2771 (Resolved): mon|osd: fail to create pgs on startup
- ...
- 09:20 PM Revision 83339a0c (ceph): Merge branch 'wip-rbd-clone-dmick' into master
- Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
- 09:03 PM Revision fe5c0cd9 (ceph): osd: guard class call decoding
- Backport: argonaut
Signed-off-by: Sage Weil <sage@inktank.com> - 09:00 PM Revision 2a6af208 (ceph): rbd: update manpage for clone command
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 08:59 PM Revision e3531497 (ceph): rbd: update cli test reference files
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 08:59 PM Revision 64949d42 (ceph): librbd, rbd, rbd.py: Add parent info reporting
- split out new parent info into separate retrieval methods;
structure packing on rbd_image_info_t was becoming a probl... - 08:59 PM Revision 6ad59610 (ceph): rbd, librbd: add tests for cloning
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 08:59 PM Revision 7b0c71cc (ceph): librados: pool_get_name handles "not found" wrong
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 08:59 PM Revision 96a517b4 (ceph): librados: Add mapping from pool id to pool name and ioctx to rados client
- Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
- 08:59 PM Revision 8cc43427 (ceph): librbd: open_image snapshot handling
- Allow opening with no snap, but check for error for nonexistent snap
Backport: argonaut
Signed-off-by: Dan Mick <dan... - 08:59 PM Revision a94fc8c8 (ceph): rbd, librbd, rbd.py: cloning (copy-on-write child image of snapshot)
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 08:59 PM Revision 68bad03b (ceph): msgr: queue accept event when pipe is accepted
- Queue an event when an incoming connection is accepted.
Signed-off-by: Sage Weil <sage@inktank.com> - 08:59 PM Revision fab6e824 (ceph): msg/DispatchQueue: queue and deliver accept events
- Signed-off-by: Sage Weil <sage@inktank.com>
- 08:59 PM Revision 95f85df7 (ceph): librados: add new constructor to form a Rados object from IoCtx
- This creates a separate reference to an existing connection, for
use when a client holding IoCtx needs to consult ano... - 08:30 PM Revision d4ef004e (ceph): dispatcher: new 'accept' event type
- Create a new event type when we successfully accept a connection. This is
distinct from the authorizor verification,... - 08:30 PM Revision 912123f2 (ceph): msgr: move Pipe, DispatchQueue into separate files
- These don't need to be subclasses of SimpleMessenger. Separate!
Signed-off-by: Sage Weil <sage@inktank.com> - 08:30 PM Revision 34908140 (ceph): msgr: uninline ctor and dtor
- Signed-off-by: Sage Weil <sage@inktank.com>
- 08:30 PM Revision bffd46c5 (ceph): msgr: remove obsolete dead path
- This hasn't triggered in years.
Signed-off-by: Sage Weil <sage@inktank.com> - 08:30 PM Revision bafcbdeb (ceph): msgr: remove unnecessary accept check
- We don't need to worry about racing with shutdown here; the cleanup
procedure will stop the accepter thread before cl... - 08:30 PM Revision 9e291bae (ceph): msgr: simplify checks for queueing connection events
- Looking through git history it is not clear exactly how these checks
came to be. They seem to have grown during the ... - 08:30 PM Revision e84565d9 (ceph): msgr: move dispatch thread into DispatchQueue
- The DispatchQueue class now completely owns message delivery. This is
cleaner and lets us drop the redundant destina... - 08:30 PM Revision 1a4a1330 (ceph): msgr: drop unnecessary (un)locking on queuing connection events
- This used to be necessary because the pipe_lock was used when queueing
the pipe in the dispatch queue. Now that is h... - 08:28 PM Revision 2d071b9e (ceph): msgr: use explicit Connection for messages sent to ourself
- Move to an explicit Connection for messages sent to ourselves, instead of
using the one on the local_pipe (which we'l... - 08:28 PM Revision fa4c1920 (ceph): msgr: use local IncomingQueue instead of Pipe
- Simpler, cleaner. No need for the rest of the Pipe crap. We just need to
queue messages for ourselves.
Signed-off-... - 08:28 PM Revision c7caf91b (ceph): msgr: simplify IncomingQueue creation, pointers
- * create it via DispatchQueue
* keep pointer to parent DispatchQueue
* drop now-useless contextual arguments to mos... - 08:25 PM Revision 35b13266 (ceph): msgr: take over existing Connection on Pipe replacement
- If a new pipe/socket is taking over an existing session, it should also
take over the Connection* associated with the... - 07:22 PM rbd Feature #2770 (Resolved): krbd: define tasks to add osd_client compound class op support
- 06:29 PM Revision 80111cce (ceph): test_stress_watch: just one librados instance
- This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets... - 06:04 PM Revision 782a17da (ceph): doc: added cookbook path instruction.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 04:38 PM Bug #2387 (Can't reproduce): mon: could not get service secret for auth subsystem
- 04:26 PM Bug #2600 (Resolved): osd: crazy long watch timeout?
- 09:28 AM Bug #2600 (In Progress): osd: crazy long watch timeout?
- 03:14 PM Revision 8869621f (ceph): doc: Added some pre-clarification for gdisk. Added DHO OSD hardware con...
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 02:52 PM Bug #2765: Extremely Long Object Latencies
- I noticed one 15 second lag on an OSD caused by the filestore requiring 15 seconds to drain open ops before a sync.
- 02:52 PM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
- Merged in commit:96ca508dcd227db11597cd4963208872af30fcc4
- 02:24 PM rbd Feature #2585 (Resolved): rbd: clone command
- commit:83339a0cbb925022c44507ab0ea3da7b399f511e
- 11:06 AM Bug #2767 (Resolved): rbd workunit failing
- The real problem is that the workunit task only grabs things from master, since github only seems to create tarballs ...
- 09:24 AM Bug #2767 (Resolved): rbd workunit failing
- ...
- 10:47 AM Bug #2769 (Resolved): 0.f scrub stat mismatch, got 29/30 objects, 6/7 clones, 69817305/70217362 b...
- Log location: ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8335
ubuntu@teuth... - 10:33 AM Feature #2768 (Resolved): teuthology: make workunit task work on different branch/sha1 etc
- This involves making it download scripts individually instead of from a single tarball, since github only creates tar...
- 10:14 AM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
- ubuntu@teuthology:/a/teuthology-2012-07-09_19:00:03-regression-master-testing-gcov/8328
- 10:13 AM Bug #2453 (In Progress): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
- ubuntu@teuthology:/a/teuthology-2012-07-10_05:00:02-regression-stable-master-basic/8546
- 09:06 AM Subtask #2757: mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transactions
- Pending testing
- 04:36 AM Revision fbe5577f (ceph): CephContext: don't leak admin socket
- Signed-off-by: Sage Weil <sage@inktank.com>
- 04:32 AM Revision 74e62051 (ceph): CephContext: drop obsolete ceph_destroy_context
- Drop this useless helper and call cct->put() directly. The comment that
this can't be used after global_init is no l... - 04:27 AM Revision 58f46bb9 (ceph): test_stress_watch: just one librados instance
- This was creating a new cluster connection/session per iteration, and
along with it a few service threads and sockets... - 04:27 AM Revision 6aa9f496 (ceph): CephContext: ref count
- These get shared via the librados API.
Fixes: #845
Signed-off-by: Sage Weil <sage@inktank.com> - 04:27 AM Revision 074df64d (ceph): librados: use CephContext ref counts
- Take ownership of the passed cct. Drop it when we destroy the
RadosClient.
Signed-off-by: Sage Weil <sage@inktank.com> - 01:01 AM Revision 645ac87b (ceph): ReplicatedPG: fix replay op ordering
- After a client reconnect, the client replays outstanding ops. The
OSD then immediately responds with success if the ... - 01:01 AM Revision 67a5e75e (ceph): ReplicatedPG: don't warn if backfill peer stats don't match
- pinfo.stats might be wrong if we did log-based recovery on the
backfilled portion in addition to continuing backfill.... - 12:27 AM Revision 67361c10 (ceph): librbd: return an error when removing a non-existent image
- Try treating the image as new format if it's not in the old-style
directory, which is the last step in old-style remo...
07/09/2012
- 09:59 PM Bug #2067: librados: we leak CephContext from rados_create()
- librados does not destroy the CephContext created in rados_create() in rados_destroy() (or the c++ equivalents). Thi...
- 09:59 PM Bug #2766 (Resolved): cephcontext: needs refcount
- CephContext needs to be refcounted, so that it can be properly shared.
- 09:37 PM Bug #2600: osd: crazy long watch timeout?
- Oh....
- 08:58 PM Revision ec87a1ea (ceph): cls_rbd: add logging for failed set_parent steps
- Signed-off-by: Dan Mick <dan.mick@inktank.com>
- 06:43 PM Revision 96ca508d (ceph): Merge remote-tracking branch 'gh/wip-rbd-id'
- 06:41 PM Bug #2765 (Closed): Extremely Long Object Latencies
- Saw this during testing on one of the aging clusters.
example object tid: 312322
Full logs available here:
... - 06:06 PM Revision b23e4e58 (ceph): doc: Removed legacy paths and keyname settings from examples.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 06:03 PM Bug #2508 (Resolved): osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobje...
- 645ac87b84ae348483c7d52b1968af5f44378dea
- 06:02 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
- 67a5e75ed6a9b3689a984d6f7eb6a18e1242125e
- 04:12 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
- Remove the loud warnings? Like, the mismatch warnings?
- 04:08 PM Bug #2750: 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes." in cluster
- Ok, so this is actually caused by not updating the backfill peer's partial stats while recovering the backfill portio...
- 03:10 PM devops Feature #2698 (Closed): crowbar: Guide for using "front" network
- Merged as 90c42837696d252be40348fdd4499e15aa835aeb.
- 01:56 PM Linux kernel client Bug #2764 (Closed): xfstest hang; osd socket closed messages
- ...
- 01:51 PM Cleanup #2763 (Resolved): move rbd locking infrastructure to a separate objclass
- Locks are generic and can serve other uses (e.g., rgw). We should separate them from rbd code.
- 12:52 PM Bug #2762 (Resolved): mon crash ceph::__ceph_assert_fail (assertion=0x63d150 "begin->last_committ...
- #0 0x00007fb9b32b5b7b in raise (sig=<optimized out>) at ../nptl/sysdeps/unix/sysv/linux/pt-raise.c:42
#1 0x0000000... - 12:07 PM Bug #2761 (Resolved): osd: failed to recover before timeout expired
- Logs: ubuntu@teuthology:/a/teuthology-2012-07-09_05:00:08-regression-stable-master-basic/8039
Attaching below an e... - 11:54 AM Bug #2022: osd: misdirectect request
- Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
Attaching the... - 11:50 AM Bug #2022: osd: misdirectect request
- Log location: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7824
Attaching the... - 11:45 AM Bug #2022: osd: misdirectect request
- latest log: ubuntu@teuthology:/a/teuthology-2012-07-08_21:46:11-regression-next-testing-basic/7815
and
ubuntu@t... - 11:33 AM Feature #2760 (Rejected): make FileStore::collection_list_partial end on a boundary
- ...
- 11:21 AM Linux kernel client Bug #2759 (Resolved): libceph: crush tree algorithm is not understood
- After changing the crush algorithm for a bucket to tree, the kernel client cannot parse the OSDMap:...
- 10:31 AM RADOS Feature #2101: crushtool: check for weight overflow on reweight
- 09:38 AM Subtask #2758 (Resolved): mon: Single-Paxos: Sync: Extend the in-memory mock-up of KeyValueDB to ...
- Extend the in-memory mock-up of KeyValueDB to support the safe iterator, and create an iterator to iterate over all t...
- 09:16 AM Subtask #2756: mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
- 09:05 AM Subtask #2756 (Resolved): mon: Single-Paxos: LevelDBStore: Make iterator thread-safe
- The current iterator implementation iterates directly over the database state, even if there are concurrent updates.
... - 09:15 AM Subtask #2757 (Resolved): mon: Single-Paxos: Sync: pack chunks of the MonitorDBStore into transac...
- Using the LevelDBStore's safe iterator, obtain the existing key/values and pack them into transactions.
These tran... - 08:57 AM Feature #2755 (Resolved): ceph-conftool: optionally return the default for a config option if no ...
- Presently, the only way to figure out what the default for a specific configuration option is, is to consult the sour...
- 03:33 AM Revision b387077b (ceph): debian: include librados-config in librados-dev
- Reported-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@inktank.com>
07/08/2012
- 09:39 PM Revision 8e5fe62a (ceph): doc: remove reference to 'ceph stop' command
- It doesn't exist anymore.
Signed-off-by: Sage Weil <sage@inktank.com>
07/07/2012
- 06:19 AM Linux kernel client Bug #2754 (Resolved): cephfs show_location produces kernel "divide error: 0000 [#1]" when run aga...
- Originally reported in http://marc.info/?l=ceph-devel&m=134151028212170&w=2:
Really easy to reproduce on my 3.2.0 ... - 06:16 AM CephFS Bug #2753 (Resolved): Writes to mounted Ceph FS fail silently if client has no write capability o...
- Originally reported in http://marc.info/?l=ceph-devel&m=134151023912148&w=2:
How to reproduce (this is on a 3.2.0 ... - 06:12 AM Bug #2752: Setting large maxosd kills all mons
- Original ML response from Greg (http://marc.info/?l=ceph-devel&m=134151077312444&w=2):
Ungh. Can you file a bug repo... - 06:10 AM Bug #2752 (Resolved): Setting large maxosd kills all mons
- Originally reported in http://marc.info/?l=ceph-devel&m=134151018912133&w=2:
When setting a large maxosd, all mons... - 03:15 AM Revision 9ea22133 (ceph): use sudo to kill teuthology proc
- 03:01 AM Revision dddf783f (ceph): Merge branch 'wip-cond'
- Reviewed-by: Greg Farnum <greg@inktank.com>
- 12:04 AM Revision 98a21cc8 (ceph): move qemu_iozone test to marginal suite
07/06/2012
- 11:45 PM Revision 03c2dc24 (ceph): lockdep: increase max locks
- Hit this limit with the rados api tests.
Signed-off-by: Sage Weil <sage@inktank.com> - 11:45 PM Revision b554d112 (ceph): config: add unlocked version of get_my_sections; use it internally
- Signed-off-by: Sage Weil <sage@inktank.com>
- 11:45 PM Revision 01da287b (ceph): config: fix lock recursion in get_val_from_conf_file()
- Introduce a private, already-locked version.
Signed-off-by: Sage Weil <sage@inktank.com> - 11:45 PM Revision c73c64a0 (ceph): config: fix recursive lock in parse_config_files()
- The _impl() helper is only called from parse_config_files(); don't retake
the lock.
Signed-off-by: Sage Weil <sage@i... - 11:44 PM Revision b33553aa (ceph): rgw: handle response-* params
- Handle response-* params that set response header field values.
Fixes #2734, #2735.
Backport: argonaut
Signed-off-by... - 11:43 PM Revision 97c1562d (ceph): rgw: handle response-* params
- Handle response-* params that set response header field values.
Fixes #2734, #2735.
Backport: argonaut
Signed-off-by... - 11:43 PM Revision 6646e891 (ceph): rgw: initialize fields of RGWObjEnt
- This fixes various valgrind warnings triggered by the s3test
test_object_create_unreadable.
Signed-off-by: Sage Weil... - 10:17 PM Revision 74f68750 (ceph): osd: add missing formatter close_section() to scrub status
- Also add braces to make the open/close matchups easier to see. Broken
by f36617392710f9b3538bfd59d45fd72265993d57.
... - 08:45 PM Revision 020b2996 (ceph): pg: report scrub status
- Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
- 08:45 PM Revision db6d83b3 (ceph): pg: track who we are waiting for maps from
- Signed-off-by: Mike Ryan <mike.ryan@inktank.com>
- 08:45 PM Revision e1d4855f (ceph): pg: reduce scrub write lock window
- Wait for all replicas to construct the base scrub map before finalizing
the scrub and locking out writes.
Signed-off... - 08:03 PM Feature #2743 (Resolved): new cond assertions, associated cleanups
- 09:41 AM Feature #2743 (Resolved): new cond assertions, associated cleanups
- 07:21 PM Revision 72bf66f6 (ceph): doc: added some discussion to libvirt.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 07:14 PM Revision 1316f868 (ceph): cond: cleanup
- Signed-off-by: Sage Weil <sage@inktank.com>
- 07:14 PM Revision 7c919e9b (ceph): workqueue: kick -> wake or _wake, depending on locking
- Break kick() into wake() and _wake() methods, depending on whether the
lock is already held. (The rename ensures tha... - 07:14 PM Revision b5693a6e (ceph): librados: take lock when signaling notify cond
- When we are signaling the cond to indicate that a notify is complete,
take the appropriate lock. This removes the po... - 07:14 PM Revision 9c6fdd5a (ceph): librados: drop unused local variables
- This is unused boilerplate cruft.
Signed-off-by: Sage Weil <sage@inktank.com> - 07:14 PM Revision da7a3284 (ceph): cond: drop unused Wait variant
- This was used for debugging forever ago.
Signed-off-by: Sage Weil <sage@inktank.com> - 07:07 PM Revision b486f2ff (ceph): client: fix locking for SafeCond users
- Need to wait on flock, not client_lock.
Signed-off-by: Sage Weil <sage@inktank.com> - 07:07 PM Revision 8de0c227 (ceph): cond: assert that we are holding the same mutex as the waiter
- Try to verify that we are holding the same mutex that the waiter is
waiting on. Specifically:
* only wait on a sin... - 06:29 PM Revision 57463455 (ceph): Merge branch 'master' of github.com:ceph/ceph
- 06:29 PM Revision 540c2c3a (ceph): doc: Minor cleanup on deploy with Chef.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 05:18 PM Revision 3df51040 (ceph): rgw: don't store bucket info indexed by bucket_id
- Issue #2701. This info wasn't really used anywhere and we weren't
removing it. It was also sharing the same pool name... - 05:17 PM Revision 27409aa1 (ceph): rgw: don't store bucket info indexed by bucket_id
- Issue #2701. This info wasn't really used anywhere and we weren't
removing it. It was also sharing the same pool name... - 05:16 PM Revision 84ba6bf6 (ceph): Merge branch 'stable' into next
- 05:15 PM Revision 960c2124 (ceph): rados tool: copy object
- New rados command: rados cp <src-obj> [dest-obj]
Requires specifying source pool. Target pool and locator can be spe... - 05:15 PM Revision 16ea64fb (ceph): rados tool: copy entire pool
- A new rados tool command that copies an entire pool
into another existing pool.
Signed-off-by: Yehuda Sadeh <yehuda@... - 05:15 PM Revision d7510066 (ceph): rados tool: copy object in chunks
- Instead of reading the entire object and then writing it,
we read it in chunks.
Signed-off-by: Yehuda Sadeh <yehuda@... - 05:15 PM Revision 9814374a (ceph): test_rados_tool.sh: test copy pool
- Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
- 05:12 PM Revision d59b2db4 (ceph): Merge remote-tracking branch 'origin/stable' into next
- 05:05 PM Feature #2702 (Resolved): gitbuilder: sync each build as it completes
- 03:23 PM Feature #2702: gitbuilder: sync each build as it completes
- 09:43 AM Feature #2702 (In Progress): gitbuilder: sync each build as it completes
- 05:02 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
- 04:35 PM rgw Feature #2735: rgw: support response-* request params in GET obj
- Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a.
- 05:02 PM Revision ed3bd211 (ceph): increase thrashosds timeout
- 04:48 PM Feature #2680: osd: report backfill progress via query
- not yet tested https://github.com/athanatos/ceph/tree/wip_recovery_query
- 04:45 PM Bug #2751 (Resolved): info can grow without bound but is kept on collection attribute
- in progress already https://github.com/athanatos/ceph/tree/wip_biginfo
blocked by 2748 - 04:35 PM Bug #2750 (Won't Fix): 1.14 scrub stat mismatch, got 4/4 objects, 0/0 clones, 8138/7467 bytes."...
- lavor: basic
owner: scheduled_sage@metropolis
success: false
ubuntu@teuthology:/a/sage-2012-07-06_11:03:15-regress... - 04:35 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
- Fixed, commit:b33553aae63f70ccba8e3d377ad3068c6144c99a.
- 10:58 AM rgw Bug #2734: rgw: fails to authenticate preauthenticated requests with sub resource
- Fails only with specific sub resources (e.g., response-*) that were not hard coded.
- 04:34 PM Bug #2749: osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
- roles:
- - mon.a
- osd.0
- osd.1
- osd.2
- - mds.a
- osd.3
- osd.4
- osd.5
- - client.0
targets... - 04:32 PM Bug #2749 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
- Saw on master
osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
ceph version 0.48argona... - 04:23 PM Bug #2748 (Resolved): include/CompatSet.h, osd/osd_types.h inconsistent use of mask argument
- 03:51 PM Bug #2600: osd: crazy long watch timeout?
- another instance:...
- 03:47 PM Revision 23d31d3e (ceph): ceph.spec.in: add ceph-disk-{activate,prepare}
- Reported-by: Jimmy Tang <jtang@tchpc.tcd.ie>
Signed-off-by: Sage Weil <sage@inktank.com> - 03:47 PM Bug #2747 (Resolved): osd/ReplicatedPG.cc: 3436: FAILED assert(last_update_applied < repop->v)
- ...
- 01:23 PM rgw Bug #2701 (Resolved): rgw: don't keep bucket info indexed by bucket_id
- Fixed, commit:3df51040b19b9c110e08f42befa0cae18cbb984c.
- 11:34 AM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
- Thanks, updated the doc to increase the VM size.
- 10:24 AM Subtask #2737: mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
- Currently, Paxos' own trimming goes through the Paxos proposal mechanism itself.
However, services do not do it... - 09:29 AM Subtask #2737 (Resolved): mon: Single-Paxos: Sync: Force trimming to be proposed through Paxos
- This includes Paxos' own trimming, so it is done at the same time in all the quorum members. Furthermore, it allows the ...
- 10:18 AM Subtask #2738: mon: Single-Paxos: Sync: Add snapshot support to the monitor store
- Implemented, although it has not been tested yet.
- 09:31 AM Subtask #2738 (Rejected): mon: Single-Paxos: Sync: Add snapshot support to the monitor store
- Having snapshot support, and being able to iterate over it instead of iterating directly over the store's keys, allow...
- 10:09 AM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
- latest logs:
/a/teuthology-2012-07-06_00:00:03-regression-next-testing-basic/6655 - 10:01 AM Subtask #2746 (Rejected): mon: Single-Paxos: Sync: Test message passing
- Make sure the message passing is working before starting to work on the store synchronization.
From the monitor's ... - 09:58 AM Subtask #2745 (Closed): mon: Single-Paxos: Sync: Add new message support to the Monitor class
- Three different "roles" on a monitor cluster regarding synchronization:
* Leader - responsible for disabling the P... - 09:53 AM Subtask #2744 (Resolved): mon: Single-Paxos: Sync: Create new Message type
- Create a new message type MMonSync, to dissociate it from the MMonProbe messages, and to allow us to use fields speci...
- 09:43 AM RADOS Feature #2706 (In Progress): crush: update kernel code to decode tunables
- 09:40 AM Feature #2742 (Resolved): qa: ms socket inject failures in regression suite
- 09:40 AM Subtask #2741 (Resolved): mon: Single-Paxos: Sync: Assess requirements for QA tests
- TBD
- 09:37 AM CephFS Bug #2740 (Resolved): mds: crash in Objecter when shutting down too early
- Jimmy Tang reported this on the mailing list....
- 09:36 AM Subtask #2739 (Resolved): mon: Single-Paxos: Sync: Synchronize the store of a drifted monitor
- Synchronize the store of a monitor (mon.X) that drifted away from the system (say, it failed for a considerable amoun...
- 09:22 AM Subtask #2736 (Resolved): mon: Single-Paxos: Sync: Implement message passing
- See subtasks.
- 04:28 AM Revision 668ce00a (ceph): osd: make on_removal() pure virtual
- Signed-off-by: Sage Weil <sage@inktank.com>
- 04:26 AM Revision 3d00130b (ceph): osd: fix PG dtor compile error
- We need at least one non-pure virtual method to tell gcc where the
vtable goes. The destructor wins!
libosd.a(libo... - 01:37 AM Bug #2267: Ceph client crashed after shutting down one mds and osd
- To which version do you plan to add this patch?
- 12:20 AM Revision 6e3fb20d (ceph): Merge remote-tracking branch 'gh/wip_osd_threading'
- 12:18 AM Revision bdf09f20 (ceph): OSD,ReplicatedPG::snap_trimmer: pg->put() in process, not snap_trimmer()
- This clarifies responsibility for the reference.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 12:18 AM Revision 0475ee45 (ceph): OSD,PG:;replica_scrub: move msg->put() into queue process
- This clarifies the ownership of the reference.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 12:18 AM Revision fe14c181 (ceph): OSD::PeeringWQ::_dequeue(PG*) drop pg refs
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 12:18 AM Revision 816d4247 (ceph): ReplicatedPG.cc: C_OSD_CommittedPushedObject move pg->put() to finish
- This should clarify the ownership of the pg ref.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 12:18 AM Revision 8e93e8b0 (ceph): OSD: ensure that OpSequencer lives through on_commit callback
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 12:18 AM Revision 69174198 (ceph): OSDMonitor: disable cluster snapshot
- The map handling changes broke cluster snapshot support.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 12:18 AM Revision 09af670b (ceph): PG,ReplicatedPG: on_removal must handle repop and watcher state
- on_removal is now in ReplicatedPG in order to handle watcher state
and repop state. Additionally, workqueue dequeues a... - 12:18 AM Revision cab7b75d (ceph): OSD: drop pg refcounts in OpWQ::_dequeue(PG*)
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 12:18 AM Revision 868168a5 (ceph): OSD: clean up revcovery_wq queueing and ref counting
- Previously, we tended to explicitly remove the pg from the queue using
remove_myself on the xlist::item. This cause...
07/05/2012
- 10:30 PM Feature #2692 (Resolved): stable testing debian repos
- 10:29 PM Revision c70392a8 (ceph): doc: minor typo
- Signed-off-by: Ross Turk <ross@inktank.com>
- 10:24 PM Revision 4d7bb075 (ceph): doc: update copyright notice in footer
- Signed-off-by: Ross Turk <ross@inktank.com>
- 10:16 PM Linux kernel client Bug #2716 (Resolved): crash when cluster goes down and new one comes up
- bad con->private = NULL in monc __close_session
- 09:38 PM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
- easy to reproduce with...
- 08:44 AM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
- I just reproduced this several times on master using the socket failure injection. This is probably socket errors br...
- 09:37 PM Bug #2022: osd: misdirectect request
- ugh, wrong bug.. ignore last two messages!
- 05:19 PM Bug #2022: osd: misdirectect request
- this is easily reproduced with...
- 05:18 PM Bug #2022 (In Progress): osd: misdirectect request
- ...
- 09:01 PM Revision 57bc8da9 (ceph): doc: minor updates to the restrucuredText file.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 09:00 PM Revision 0659f7c5 (ceph): doc: minor cleanup.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 08:47 PM Revision 1c9e1c61 (ceph): doc: Publishing as described. Still requires some verification and QA.
- Signed-off-by: John Wilkins <john.wilkins@inktank.com>
- 08:43 PM Revision e5fb4991 (ceph): run: make -a short for --archive
- 06:18 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
- The size of a VM is adjustable:...
- 05:49 PM devops Feature #2697: crowbar: ISO generation, reproducible in a cloud image vm
- Tested on vercoi/downburst
https://docs.google.com/a/inktank.com/document/d/1C1QLD54sbpRJ7o6M9ML2TrexXC5JIMecHJxOM... - 05:21 PM Cleanup #2041 (Resolved): osd: move peering into worker threads
- 05:15 PM Revision 87d1cdb5 (ceph): OSD: _remove_pg not ruin iterator consistency
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 7b5d8e8c (ceph): ReplicatedPG: on_activate for a peer might happen before flush
- We don't ensure for a peer that the flush completes before activation,
merely that we don't serve any ops until flush... - 05:15 PM Revision f0b2310f (ceph): ReplicatedPG: RepModify track epoch_started and bail on interval change
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 193f18f2 (ceph): FileStore: delete source collection if not replaying collection_rename
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 99c23b69 (ceph): OSD: split notify|info|query messages for old clients
- Old clients do not expect mixed epoch compound messages. Thus, we
send each sub-message independently.
Signed-off-b... - 05:15 PM Revision 4ce17cca (ceph): test/encoding/types.h: disable pg_query_t encoding test
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 9fc5db8c (ceph): ReplicatedPG::op_applied: update last_update_applied iff !aborted
- scrub state and last_update_applied will have been reset during
the interval change.
Signed-off-by: Samuel Just <sam... - 05:15 PM Revision 7f2354c7 (ceph): doc/scripts/gen_state_diagram.py: make parser a bit more forgiving
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 31db8ed0 (ceph): OSD::activate_map: don't publish map until pgs in deleted pools have be...
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 7e26d6df (ceph): PG: C_PG_MarkUnfoundLost put pg in finish
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 5dc45f77 (ceph): PG: PG now store its own PGPool
- Otherwise, we need to synchronize access to the shared PGPool objects.
The wasted memory is probably preferable to syn... - 05:15 PM Revision 5c0e8b46 (ceph): OSD: bail out of do_recovery if no longer primary and active
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 3ca6359c (ceph): common/WorkQueue.h: add BatchWorkQueue
- Rather than dispatching one item at a time to process, etc,
BatchWorkQueue dispatches up to a configurable number of
... - 05:15 PM Revision d8a68e76 (ceph): OSD: do_(notifies|infos|queries) must now be passed a map
- This removes the need to call them from within the osd lock.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 05:15 PM Revision c1f2a802 (ceph): OSD: peering_wq is now a BatchWorkQueue
- process_peering_events now handles multiple pgs at once to better
batch up notifies, etc.
Signed-off-by: Samuel Just ... - 05:15 PM Revision 0ee3d87f (ceph): OSD: process_peering_event check for new map on each pg
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 9b182d20 (ceph): OSD: maybe_update_heartbeat_peers, don't print pg
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision c6db1b2e (ceph): PG: delay ops in do_request, not queue_op
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 1b558fba (ceph): OSD: write_info/log during process_peering_events, do_recovery
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 392df3b7 (ceph): Revert "osd: reset last_peering_interval on replica activate"
- This reverts commit 17114f266a336b6edd7e98975d494fdd487eec20.
- 05:15 PM Revision f9282e6c (ceph): Revert "osd: check against last_peering_reset in _activate_committed"
- This reverts commit 86aa07d7a91ac23074e76551c3a6db3a5736cffa.
- 05:15 PM Revision 442b5583 (ceph): PG: pass activate epoch with Activate event
- This allows us to pass into activate() in which epoch the
message triggering activation occurred, allowing us to mark
the... - 05:15 PM Revision 311a061e (ceph): OSD: move watch into OSDService
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision b200710b (ceph): OSD,PG: clean up pg removal
- PG opsequencers will be used for removing a pg. If the pg is recreated
before the removal is complete, we need the n... - 05:15 PM Revision e8d09d02 (ceph): PG,OSD: prevent pg from completing peering until deletion is complete
- hobject_t must now be globally unique in the filestore. Thus, if we
start creating objects in a pg before the remova... - 05:15 PM Revision 35949c54 (ceph): ReplicatedPG: explicitely block on not active for certain ops
- Ops and some subops need to wait for active to ensure correct ordering
with respect to peering operations.
Signed-of... - 05:15 PM Revision 2552a7f4 (ceph): OSD,PG: _share_map_outgoing must not require osd_lock
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 4fec85f2 (ceph): OSD: activate_map() in handle_osd_map only when active
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 040a22b6 (ceph): OSD: map_cache should contain const OSDMap
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 3dcce50e (ceph): OSD: only do_(notify|info|query) for up osd
- pg may have an older map and attempt to notify|info|query on a down
osd.
Signed-off-by: Samuel Just <sam.just@inktan... - 05:15 PM Revision a5bf3d71 (ceph): OSD: lock recovery_wq before debug output on finish_recovery_op
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 8079a489 (ceph): OSD,PG: clean up _get_or_create_pg and set interval based on msg
- Previously, we set last_peering_reset based on the epoch in which the pg
is created. We now pass the map from the qu... - 05:15 PM Revision 7c1dc90a (ceph): OSD: don't advance_pg() if pg is up-to-date
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision 73f5ce94 (ceph): OSD,PG: issue pg removals in line, remove remove_list
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision c7581b69 (ceph): CrushWrapper: rmaps don't need to be mutable
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:15 PM Revision a67a874b (ceph): CrushWrapper: add locking around crush_do_rule
- crush_do_rule uses a cache on the bucket objects.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 05:15 PM Revision b242c565 (ceph): OSD: on pg_removal, project_pg_history to get current interval
- First, we don't really want to remove the pg if we can use it. Second,
there might be messages in the pg peering que... - 05:14 PM Revision f6fc1b30 (ceph): OSD: remove superfluous pg get/put around enqueue_op
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision b6bf5736 (ceph): ReplicatedPG: do not eval_repop if aborted
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 91355fb7 (ceph): ReplicatedPG: change ReplicatedPG debug output to match PG
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 3d32712e (ceph): OSD: don't assume pending pg removals have flushed
- _create_lock_pg might encounter a preexisting pg collection simply
because the removal transaction had not yet comple... - 05:14 PM Revision 90e3727e (ceph): OSD: do not lock osd during dequeue_op
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 10ec5bc5 (ceph): OSD: push_waiters is no longer used
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 14381b38 (ceph): OSD: get map read lock during queue_want_up_thru
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 492e6351 (ceph): OSD: do not drop osd_lock in handle_osd_map
- PGs have their map updates done in a different thread. Thus, we no
longer need to grab the pg locks. activate_map n... - 05:14 PM Revision f68f52ce (ceph): OSD: queue_want_up_thru in process_peering_event
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 502c0335 (ceph): PG, OSD: info_map shouldn't contain the MOSDPGInfo*
- Rather, we will just pass the same type as the notifies.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 05:14 PM Revision b80b5005 (ceph): OSD,PG: Move pg accesible methods, objects to OSDService
- In order to clarify data structure locking, PGs will now access
OSDService rather than the OSD directly. Over time, m... - 05:14 PM Revision 33b985dc (ceph): OSD,PG: added helper methods for creating and dispatching RecoveryCtxs
- This is simpler than having to update all of the RecoveryCtx users
whenever we change the types in RecoveryCtx.
Sign... - 05:14 PM Revision 9a4a9b9a (ceph): PG: flush ops by the end of peering without osr.flush
- Rather than explicitly flushing the filestore, send a noop through the
filestore at the beginning of peering and, at... - 05:14 PM Revision ddef446d (ceph): OSD,PG: Move Op,SubOp queueing into PG
- PG now handles delaying/discarding messages since pg map epoch may not
be the same as the OSD map.
Signed-off-by: Sa... - 05:14 PM Revision bbd8af02 (ceph): osd/: Make pg osdmap be independent of osd, other pg maps
- This will allow handle_osd_map to not stop other work queues.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 05:14 PM Revision 12e22b3d (ceph): OSD,PG: handle pg map advance in process_peering_event
- The pg map will now be advanced in process_peering_event (in advance_pg)
to allow handle_osd_map to not grab pg locks... - 05:14 PM Revision c7e34951 (ceph): PG: include info and query by value in peering events
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 65abfc07 (ceph): OSD: initialize pgs in get_or_create_pg via handle_create
- Previously, pgs were initialized via Info/Log/etc. Since the event
which triggered the pg creation may now be queued... - 05:14 PM Revision b081bd4c (ceph): PG: CephPeeringEvents can now be descriptively printed
- The CephPeeringEvt constructor is now templated to allow
storing a description string for debugging.
Signed-off-by: ... - 05:14 PM Revision e10e81c0 (ceph): PG: compound messages must carry epoch_sent for each part
- Query and Notify messages include logical messages from multiple
pgs. Each logical message (pg_query_t and pg_notify... - 05:14 PM Revision fff0874c (ceph): PG: use osd->requeue_ops for ops, pg->queue_for_peering to requeue pg
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 64b71c69 (ceph): OSD: check for deleting in process_peering_event
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision b5078bb2 (ceph): PG,OSD: add OSD::queue_for_op, use in PG::queue_op
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision b0cb96bf (ceph): PG,OSD: fix op checking in pg, take_waiters during ActMap
- Signed-off-by: Samuel Just <sam.just@inktank.com>
- 05:14 PM Revision 18fec69a (ceph): OSD,PG::scrub() move pg->put() into queue process
- This clarifies ownership of the pg reference.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 05:14 PM Revision 1ffd190e (ceph): PG: CephPeeringEvt
- CephPeeringEvt is now the supertype for all peering state machine
events. This will allow us to generalize checking ... - 05:14 PM Revision 3691e3ca (ceph): PG: Move handle_* methods to PG
- PG now calls handle_event in RecoveryState.
Signed-off-by: Samuel Just <sam.just@inktank.com> - 05:14 PM Revision 7c414c5d (ceph): OSD: Remove handle_pg_missing, MOSDPGMissing no longer used
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 05:14 PM Revision d0158230 (ceph): OSD,PG: push message checking to pg
- old_peering_evt now checks CephPeeringEvts generically in
PG::handle_peering_event().
Signed-off-by: Samuel Just <sa... - 05:14 PM Revision 6922f9cf (ceph): osd/: move history update from handle_pg_query into pg
- Previously, replica history was updated in OSD::handle_pg_query.
Updating the history is now handled in the pg state ... - 05:14 PM Revision 36d1381a (ceph): PG: use intrusive_ptr in CephPeeringEvt
- Properly disposing of the event_base member of CephPeeringEvt
requires use of intrusive_ptr.
Signed-off-by: Samuel J... - 05:14 PM Revision 33bcbb33 (ceph): PG: process peering events in a queue
- Peering events are now queued via queue_peering_event in the
peering_queue.
Signed-off-by: Samuel Just <sam.just@ink... - 05:08 PM Linux kernel client Bug #2523 (Resolved): xfs: xfs_iolock_reclaimable
- patches from hch fix this up. added them to testing branch.
- 04:18 PM rbd Bug #2488 (Can't reproduce): rbd: EIO from rbd map
- 04:18 PM Bug #2693 (Resolved): osd/ReplicatedPG.cc: 4293: FAILED assert(info.last_update <= active_rep_scr...
- 04:18 PM Bug #2453 (Resolved): osd/OSD.h: 840: FAILED assert(last_scrub_pg.count(p))
- 04:13 PM Messengers Bug #2569 (Resolved): msgr: connect_rank crash
- 04:12 PM devops Feature #2698: crowbar: Guide for using "front" network
- I verified that this causes no trouble before crowbar is installed and that eth1 does come up automatically after reb...
- 03:11 PM rgw Feature #2735 (Resolved): rgw: support response-* request params in GET obj
- 03:10 PM rgw Bug #2734 (Resolved): rgw: fails to authenticate preauthenticated requests with sub resource
- 02:32 PM Revision ea11c7f9 (ceph): Allow URL-safe base64 cephx keys to be decoded.
- In these cases + and / are replaced by - and _ to prevent problems when using
the base64 strings in URLs.
Signed-off... - 02:11 PM Feature #2733 (Resolved): osd: fadvise replica data don't use
- This should help quite a bit with cache pressure.
- 01:56 PM Feature #2485 (Resolved): crush: validate hash function
- 01:29 PM Feature #2730: [Also for argonaut] OSD dump detailed info about in progress ops
- We can also keep data about the last N slow requests on the OSD to allow easier debugging.
- 01:16 PM Feature #2730 (Resolved): [Also for argonaut] OSD dump detailed info about in progress ops
- syntax should be something along the lines of:
ceph tell osd <num> query_ops
Result should be a json formatted ... - 01:20 PM Bug #2711 (Resolved): chef does not install apache2/fastcgi on plana23
- The apt-get update command I added said "apt-key update", which isn't the same at all. Fixed in
commit:6d351b0b2... - 01:18 PM Feature #2732 (Resolved): Idempotency tester
- Add proper support for:
* omap
* xattr
* collection_rename - 01:17 PM Cleanup #2731 (Closed): ceph tell osd <num> should be ceph osd <subcommand> to match ceph pg <sub...
- 11:41 AM Feature #2727: filestore: add split
- On second thought, that would be hard to encode into a transaction, it'll probably just take a
[(hash_prefix, bits... - 11:06 AM Feature #2727 (Resolved): filestore: add split
- Add split functionality to filestore. The existing collection structure should be already in place.
Interface coul... - 11:20 AM Documentation #2729 (Resolved): doc: OSD internals
- Broad overview of OSD internals:
* Threading/wq model
* Message handling
* Message ordering (wait for map, pg wait... - 11:18 AM Feature #2728 (Resolved): OSD: handle split
- OSD message handling needs to handle messages for pgs between the map epoch creating them and the actual split (shoul...
- 11:17 AM rbd Documentation #2670 (In Progress): Docs shouldn't direct users to echo to /sys/bus/rbd for normal...
- 11:06 AM Bug #2725 (Duplicate): filestore: add split
- 11:04 AM Bug #2725 (Duplicate): filestore: add split
- Add split functionality to filestore. The existing collection structure should be already in place.
Interface cou... - 11:05 AM rbd Feature #2609 (In Progress): librbd: new image name -> image head indirection
- 11:05 AM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
- 11:04 AM rbd Feature #2726 (Resolved): krbd: clean up bio_pair leak/whatever
- 11:03 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- I have restarted ceph mon with invoke-rc.d ceph restart mon and now it works! However I am 100% sure that I did it af...
- 10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- you can just restart, or to verify in the future, 'ceph --admin-daemon /var/run/ceph/ceph-mon.a.asok version' (adjust...
- 10:59 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- Oh! The monitor is still running the old version.
- 10:57 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- ...
- 10:55 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- Weird! Two things:
- does 'ceph status' work? (this is identical to -s now)
- can you try with commit:b53cdb9... - 10:45 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- # ceph -v
ceph version 0.48argonaut (commit:c2b20ca74249892c8e5e40c12aa14446a2bf2030) - 10:44 AM Bug #2721: Ceph status does not work in 0.48 even if it is still documented
- can you include 'ceph -v' output? this works for me......
- 10:05 AM Bug #2721 (Resolved): Ceph status does not work in 0.48 even if it is still documented
- ...
- 10:22 AM rbd Feature #2724 (Resolved): rbd: add protect/unprotect command
- See http://ceph.com/docs/master/dev/rbd-layering/#protection
- 10:12 AM rbd Feature #2723 (Resolved): librbd: protect/unprotect as appropiate during cloning
- See http://ceph.com/docs/master/dev/rbd-layering/#protection
- 10:10 AM rbd Feature #2722 (Resolved): cls_rbd: add class methods to get/set protected status
- The possible settings are: protected, unprotected, and protecting. See http://ceph.com/docs/master/dev/rbd-layering/ ...
- 10:02 AM rbd Feature #2720 (Resolved): rbd: add children command
- This should list all direct children of a given snapshot.
- 10:01 AM rbd Feature #2719 (Resolved): librbd: provide functions for listing parents and their children
- Wrap the class methods for listing parents and children of parents.
- 09:57 AM rbd Feature #2718 (Resolved): librbd: map parent -> child in a per-pool rbd_children object when cloning
- 09:55 AM rbd Feature #2717 (Resolved): cls_rbd: add methods for maintaining mapping from parent to child images
- From http://ceph.com/docs/master/dev/rbd-layering/#cls-rbd:...