Project

General

Profile

Activity

From 05/24/2012 to 06/22/2012

06/22/2012

07:14 PM Bug #2648 (Resolved): removing a monitor from the map while it's running causes a crash
... Greg Farnum
05:27 PM Bug #2647 (Can't reproduce): osd: old request, waiting for subops
primary:... Yehuda Sadeh
11:43 AM Bug #2618: error: unable to open OSD superblock
John, can we see your ceph.conf file? If you have time, try chatting in #ceph on irc.oftc.net as well; perhaps we ca... Dan Mick
11:30 AM Bug #2646 (Resolved): mon:update_from_paxos: error parsing incremental update: buffer::end_of_buffer
... Yehuda Sadeh
08:17 AM Subtask #2645 (Rejected): mon: Single-Paxos: Could not decrypt ticket info (immediately after run...
There was a lingering monitor still running, from a previous install.
Apparently, holding the wrong keys will lead...
Joao Eduardo Luis
08:09 AM Subtask #2645 (Rejected): mon: Single-Paxos: Could not decrypt ticket info (immediately after run...
... Joao Eduardo Luis
02:23 AM Revision 48e8e0a7 (ceph): Add script to create a vm image with extra packages
Josh Durgin
01:44 AM Revision 38f6a78c (ceph): Add a task to run a test against rbd inside of qemu.
For now this task does not setup networking for the vm,
and simply runs an executable downloaded from a specified url...
Josh Durgin
01:18 AM Revision 94a6ab8f (ceph): Add some tests inside qemu for the librbd suite
Josh Durgin
01:16 AM Revision a92306a4 (ceph): Move librbd tests to rbd suite
This lets us generate jobs with different caching settings instead of
hardcoding them.
Josh Durgin
12:24 AM Bug #2602: osd: push failed because local copy is X
Hi Sage,
just updated to your wip_rolling_upgrade branch.
FileStore update worked ( 100GB => 30 minutes on XFS ) ...
Simon Frerichs
12:10 AM Revision c399d903 (ceph): mon: conditionally encode PGMap[::Incremental] with quorum features
This allows a mon cluster to transition to the new encoding during a
rolling upgrade.
Signed-off-by: Sage Weil <sage...
Sage Weil
12:10 AM Revision de5b3236 (ceph): mon: conditionally encode mon features for remote mon
The only time we encode these is when forwarding messages. Encoding using
the destination's feature set.
Signed-off...
Sage Weil
12:10 AM Revision 77d836c5 (ceph): mon: encoding new monmap using quorum feature set
It is probably unlikely that someone will expand the mon cluster with a
mixed feature set, but we know the quorum fea...
Sage Weil
12:10 AM Revision 80649d08 (ceph): mon: note that monmap may be reencoded later
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:10 AM Revision 9fcc3dee (ceph): osd_types.cc: remove hobject_t decode asserts
These asserts were useful for ensuring that pool is passed
in in the correct places, but they prevent the encoder
tes...
Samuel Just
12:10 AM Revision 2fe98163 (ceph): OSD,PG,ObjectStore: handle messages with old hobject_t encoding
Messages that embed an hobject_t need to have the pool field fixed
on messages from old peers.
Signed-off-by: Samuel...
Samuel Just
12:10 AM Revision 2355b233 (ceph): mon: conditionally encode old monmap when peer lacks feature
This allows a rolling upgrade from 0.47.2 to 0.48.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:10 AM Revision 06288a9d (ceph): mon: track intersection of quorum member features
When we form a quorum, also note the intersection of the quorum members'
feature bits. This will inform decisions ab...
Sage Weil
12:10 AM Revision 0aaf7334 (ceph): mon: conditionally encode auth incremental with quorum feature bits
If the quorum does not yet all have the MONENC feature, stick to the old
encoding.
It might be more polite to requir...
Sage Weil

06/21/2012

09:33 PM Revision 03597ca6 (ceph): Check for machine args based on local, not ctx.machines
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:29 PM Revision ddf7e836 (ceph): doc: ceph osd crush add is now ceph osd crush set
Mailing list thread: http://www.spinics.net/lists/ceph-devel/msg06199.html
Signed-off-by: Travis Rhoden <trhoden@gma...
Travis Rhoden
08:20 PM Revision 7773a93e (ceph): whitelist current lockdep warnings in syslog
These are causing too much noise in the qa runs to leave, and #2617 is
sufficiently non-trivial to do this in the int...
Sage Weil
07:43 PM Revision 448f5b02 (ceph): logrotate: reload all upstart instances
upstart doesn't let you wildcard all instances of a given job, so we
slog through initctl list output, and reload any...
Sage Weil
06:55 PM rbd Feature #2566 (Duplicate): teuthology: task to run rbd workunits in a vm
Same as #1713. Josh Durgin
06:53 PM rbd Feature #1713 (Resolved): teuthology: qemu tasks, tests
Basic teuthology task is done in 38f6a78c71910a39b7f1890316c0a134ced8b0ec. Making a gitbuilder for qemu seems less im... Josh Durgin
06:52 PM rbd Feature #2644 (Rejected): qa: gitbuilder for qemu
This should build qemu with rbd support for regression testing new versions of qemu. Josh Durgin
06:49 PM rbd Feature #2567 (Resolved): qa: add qemu+rbd jobs to qa suite
Added in 94a6ab8ff3637f68c03261cf845b402d6bfa8e76 Josh Durgin
06:35 PM Revision 58db045a (ceph): docs: clarify example in radosgw-admin
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:17 PM Revision 990febd9 (ceph): mon: make pool deletion dne error code consistently ENOENT
This matches the check in _prepare_remove_pool() now.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:17 PM Revision 7044192d (ceph): filestore: remove btrfs start/end transaction ioctl code
This was an ill-conceived approach to getting atomic transactions out of
btrfs. It doesn't offer rollback, which mea...
Sage Weil
05:47 PM Revision 62a4ad66 (ceph): docs: update usage logging info
added some text under radosgw, examples under radosgw-admin
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
04:30 PM Subtask #2643: mon: Single-Paxos: mds: Strange message behavior on peon
This is what can be seen on the Leader:... Joao Eduardo Luis
04:24 PM Subtask #2643 (Closed): mon: Single-Paxos: mds: Strange message behavior on peon
Just for future reference.
When checking how things were going with the monitors, we noticed that the following sn...
Joao Eduardo Luis
03:37 PM Subtask #2633: mon: Single-Paxos: ceph tool unable to connect to monitor
Has something changed in the last five hours that you think fixed this? Greg Farnum
03:28 PM Subtask #2633 (Closed): mon: Single-Paxos: ceph tool unable to connect to monitor
It appears to be fixed.
The ceph tool is able to obtain the status from the monitors.
The 'watch' command doesn...
Joao Eduardo Luis
10:11 AM Subtask #2633 (Closed): mon: Single-Paxos: ceph tool unable to connect to monitor
This is what usually happens on the monitor side. Every now and then, the ceph tool is able to connect, but we haven'... Joao Eduardo Luis
03:20 PM Revision a85a15fe (ceph): Merge remote-tracking branch 'gh/stable' into next
Sage Weil
01:54 PM rgw Bug #2642 (Resolved): rgw: show/trim usage using also time (not just date)
Yehuda Sadeh
01:42 PM Feature #2577 (In Progress): teuthology: blktrace task
Sage Weil
01:41 PM Feature #2581 (Resolved): perf: investigate 0.47.2 precise vs 0.46 oneiric discrepancy
Sage Weil
01:40 PM Feature #2576 (Resolved): perf: 0.48 on long-term clusters
Sage Weil
01:17 PM Linux kernel client Bug #2302 (Can't reproduce): xfs: warning at mutex_remove_waiter
Sage Weil
12:38 PM Bug #2550 (Resolved): logrotate: SIGHUP upstart jobs too, not just sysvinit
Sage Weil
12:06 PM rbd Feature #2641 (Duplicate): qa: regression tests for rbd openstack volume driver
This should include:
* booting a vm from an rbd device
* attaching/detaching an rbd device to a running guest
* ad...
Josh Durgin
11:30 AM rbd Feature #2640 (Duplicate): qa: regression tests for rbd glance backend
This should run against development versions of openstack to verify that the glance backend continues to work. Namely... Josh Durgin
11:26 AM Bug #2042 (Duplicate): mon: crash in LogMonitor::update_from_paxos
Indeed! Sage Weil
11:21 AM Bug #2042: mon: crash in LogMonitor::update_from_paxos
Hrm, I think that this is duplicated by #2593? Greg Farnum
11:16 AM Bug #2042 (Can't reproduce): mon: crash in LogMonitor::update_from_paxos
Sage Weil
11:13 AM Cleanup #2623 (Resolved): filestore btrfs trans should be removed
Sage Weil
11:07 AM Feature #1494 (Resolved): openstack: vm can boot off rbd
This has been possible for a long time. Josh Durgin
11:03 AM Bug #2638 (Resolved): mon: make pool ops idempotent
for example, deleting a pool fails with ENOENT (or ENODATA :/) if the pool doesn't exist, but if we lose our mon sess... Sage Weil
11:02 AM rbd Feature #2637 (New): teuthology: task for running a vm using libvirt
This should have similar semantics to the qemu task that runs qemu directly, but configure and run the vm via libvirt... Josh Durgin
10:59 AM rbd Feature #2636 (New): qa: regression tests for qemu monitor commands
Test attach/detach of rbd devices and snapshot operations executed directly by the qemu monitor. This is probably eas... Josh Durgin
10:40 AM Bug #2602 (Need More Info): osd: push failed because local copy is X
Hi Simon-
This looks like something that could be caused by the broken rolling osd upgrade support in the branch y...
Sage Weil
10:14 AM rgw Feature #2635 (New): benchmark for measuring rgw metadata operations
We need to come up with a benchmark that will measure the following operations:
* Service:
1. List buckets
*...
Yehuda Sadeh
10:11 AM rbd Feature #2634 (Resolved): teuthology: add networking to qemu task
Let the guest speak to the outside world so test scripts can e.g. check out git repos and download test programs to c... Josh Durgin
06:32 AM Bug #2618: error: unable to open OSD superblock
I manually created the directory.
Then I ran the mkcephfs command.
The directory has some files in it (journal, mag...
John S

06/20/2012

10:10 PM Revision 1e539daf (ceph): doc: Normalized shell script syntax. Added generic cookbook path.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:39 PM Feature #2631 (Resolved): mon: kill rm -rf --mkfs behavior
Sage Weil
09:29 PM Revision 7d38758a (ceph): doc: Changed libvirt-dev to libvirt-bin, and cleaned up ./autogen.sh
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:02 PM Bug #2593: logmonitor: decode failure
... Sage Weil
07:55 PM Revision 3e32dd08 (ceph): doc: Typo.
Signed-off-by: Tommi Virtanen <tv@inktank.com> Tommi Virtanen
06:55 PM rbd Feature #2630 (Resolved): teuthology: add task to run qemu-iotests against rbd
qemu-iotests are included in upstream qemu.git. They exercise qemu's block layer to test correctness. They use existi... Josh Durgin
06:44 PM rbd Feature #2629 (New): qa: test performance during live migration
This could be done after #2628 by running iozone during the migration, parsing its output, and checking that throughp... Josh Durgin
06:41 PM rbd Feature #2628 (New): qa: test live migration with qemu
Run something like fsstress in the vm during the migration, and verify that it completes successfully. To do this we'... Josh Durgin
06:37 PM rbd Feature #2627 (New): qa: regression tests for libvirt rbd storage pool
Libvirt storage pools allow you to create, delete, and list volumes. Wido wrote a backend that uses librbd to do this... Josh Durgin
06:35 PM Revision c8e1ec6a (ceph): record owner at start of run
So that we can clean up easily even when we don't finish and there is no
summary.yaml.
Sage Weil
06:30 PM rbd Feature #2626 (New): qa: regression tests for basic rbd libvirt integration (disks)
Test using rbd disks with vms through libvirt.
This includes:
* booting a vm backed only by rbd
* attaching rb...
Josh Durgin
06:23 PM Revision 845e6c28 (ceph): move cfuse + dbench task that triggers #1737 to marginal suite
Sage Weil
06:15 PM rbd Feature #2625 (Rejected): qa: gitbuilder for libvirt
Create a gitbuilder for libvirt packages so we can regression test rbd against upstream releases. Base this on the ub... Josh Durgin
05:57 PM Revision c467d9d1 (ceph): v0.47.3
Sage Weil
05:36 PM Bug #2600: osd: crazy long watch timeout?
In another recurrence, there are no objecter requests:... Josh Durgin
05:13 PM Revision 218b6924 (ceph): teuthology-ls: tolerate non-existent 'success' key in summary file
Josh Durgin
05:02 PM Revision 17dcf605 (ceph): filestore: disable 'filestore fiemap' by default
We've seen this failing on both btrfs (Guido) and XFS (Oliver). This works
around #2535.
Signed-off-by: Sage Weil <...
Sage Weil
04:34 PM Bug #2524 (Won't Fix): librados crashed while connecting to cluster
Sage Weil
04:34 PM Bug #2456 (Resolved): librbd: failed LibRBD.TestIOToSnapshot
Haven't seen this in a while. Maybe some of the race cleanups fixed it... Sage Weil
04:32 PM Documentation #2624: OpenStack creation instructions should recommend non-default number of pg's ...
It'll have to be ceph osd pool create <pool> <num_pgs> until #2519 is done. Josh Durgin
04:25 PM Documentation #2624 (Resolved): OpenStack creation instructions should recommend non-default numb...
http://ceph.com/docs/master/rbd/rbd-openstack/ recommends
sudo rados mkpool nova
This should probably be
su...
Dan Mick
03:46 PM Cleanup #2623 (Resolved): filestore btrfs trans should be removed
On Wed, 20 Jun 2012, Stefan Priebe - Profihost AG wrote:
> Hello list,
>
> i've looked at the wiki (http://ceph.co...
Dan Mick
03:01 PM Subtask #2622 (Resolved): mon: Single-Paxos: convert existing, old MonitorStore to a brand new Mo...
The new monitor design does not support the old MonitorStore, nor does it store the versions and their values in the ... Joao Eduardo Luis
02:58 PM Subtask #2621 (Resolved): mon: Single-Paxos: synchronize the MonitorDBStore of oblivious monitor
*Objective:* synchronize monitor stores over the network whenever a given monitor mon.X falls too far behind.
*Sol...
Joao Eduardo Luis
02:50 PM Subtask #2615 (Closed): mon: Single-Paxos: MDSMap::get_health() asserting
Joao Eduardo Luis
02:49 PM Subtask #2615: mon: Single-Paxos: MDSMap::get_health() asserting
This issue stopped popping up after we changed the criteria to propose queued proposals and restarted testing with a ... Joao Eduardo Luis
03:59 AM Subtask #2615 (Closed): mon: Single-Paxos: MDSMap::get_health() asserting
MDSMap infos, dumped on MDSMap::get_health() just before the assert is triggered:... Joao Eduardo Luis
02:47 PM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
Appears to be fixed.
The ceph tool is able to connect to the cluster and obtain status information.
However, th...
Joao Eduardo Luis
11:01 AM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
Although this appears to be fixed, we still are unable to authenticate clients.
My current suspicion is that we ar...
Joao Eduardo Luis
09:00 AM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
We were encoding an empty "full version" of the key server during AuthMonitor::encode_pending(), along side with the ... Joao Eduardo Luis
08:36 AM Subtask #2616: mon: Single-Paxos: AuthMonitor: key_server has no entries
The problem appears to affect all mon clients, and it may be the reason why our OSDs do not work as well.
Log snip...
Joao Eduardo Luis
08:00 AM Subtask #2616 (Closed): mon: Single-Paxos: AuthMonitor: key_server has no entries
The Monitor's key_server has no entries, even though we made sure to populate mon.X/keyring with every single service... Joao Eduardo Luis
02:45 PM Subtask #2614: Single Paxos instance shared across the existing services
Joao Eduardo Luis
03:48 AM Subtask #2614 (Closed): Single Paxos instance shared across the existing services
One Paxos to propose them all. Joao Eduardo Luis
02:44 PM Subtask #2620 (Closed): mon: Single-Paxos: MDSMonitor: MMDSBeacon from entity with insufficient p...
... Joao Eduardo Luis
02:06 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
Please mention https://bugs.launchpad.net/upstart/+bug/1012938 in the "sucks" comment, so someone can some day nicely... Anonymous
01:47 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
repushed upstart-vs-logrotate branch Sage Weil
12:25 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
yeah, that'll work. only solves the logrotate case, but that's fine by me. Sage Weil
11:39 AM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
That killall thing is hideous, and I'm utterly unconvinced having even more upstart jobs for Ceph is helpful in any w... Anonymous
12:30 PM Feature #2619 (Resolved): filejournal: instrument with perfcounters
Sage Weil
12:09 PM Bug #2618: error: unable to open OSD superblock
Hi John,
Did you create the /data/ceph/osd0 directory? mkcephfs doesn't do it for you because of the potential for...
Josh Durgin
11:31 AM Bug #2618 (Can't reproduce): error: unable to open OSD superblock
I am new at this.
I installed ceph.
When I do a service ceph start, mon.0 and mds.(machine name) seem ok.
When it ...
John S
11:13 AM Bug #2022: osd: misdirectect request
here is the smoking gun. note that teh pgid goes to 0.0 when linger tid 1 is resending the watch op 4:... Sage Weil
03:45 AM Subtask #2613: Sandbox PaxosServices accesses to the store
I messed up the formatting and don't seem to be able to edit it. So here goes a decent version of it.... Joao Eduardo Luis
03:41 AM Subtask #2613 (Resolved): Sandbox PaxosServices accesses to the store
Each service used to have direct access to the MonitorStore, and they could mess around wherever they wanted, allowin... Joao Eduardo Luis
03:25 AM Subtask #2612 (Resolved): Monitor key/value store
Create a key/value store, with transaction support, to be used on the monitor subsystem.
Its interface should refl...
Joao Eduardo Luis
03:21 AM Feature #2611 (Resolved): mon: Single-Paxos
The ceph-mon is (roughly) composed by a Monitor class, responsible for all things monitor-ish, and several monitor se... Joao Eduardo Luis
12:34 AM Revision 88c7629e (ceph): OSD: clear_temp: split delete into many transactions
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
12:29 AM Revision 233bc8dd (ceph): schedule_suite: enable kdb
Among other things, you can attach to the console after the fact and type
'dmesg' to see wtf happened.
Sage Weil
12:24 AM Revision 286e6397 (ceph): kernel: enable/disable kdb
This hard-codes ttyS1, which is what we use on sepia. Sage Weil

06/19/2012

10:19 PM Revision 6948af02 (ceph): Merge branch 'wip-clsrbd'
Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Sage Weil
10:17 PM Revision b84e1ed5 (ceph): doc: document usage log
Modify radosgw, radosgw-admin man page
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
10:08 PM Revision 986d0e11 (ceph): cls_rbd: clean up doxygen, whitespace a bit
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:08 PM Revision 156879f4 (ceph): cls_rbd: fix uninitialized var in cls_rbd_parent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:08 PM Revision 25311e96 (ceph): cls_rbd: note overlap units
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:08 PM Revision 3948e069 (ceph): cls_rbd: handle parent overlap
Two things:
* when we set a parent, the overlap is the min of our size and the
parent's size.
* when we resize,...
Sage Weil
10:08 PM Revision 87cadffd (ceph): cls_rbd: rename parent size -> overlap
The use of 'size' to describe the amount that we overlap with the parent
was confusing; overlap is more intuitive (to...
Sage Weil
10:08 PM Revision 4e73ca45 (ceph): cls_rbd: include dumpers, test instances for our types
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:08 PM Revision 2f75b466 (ceph): cls_rbd: implement get_parent, set_parent, remove_parent
Implement methods to get/set/remove the parent pointer on child images.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:08 PM Revision f87c441f (ceph): cls_rbd: check for LAYERING feature for parent methods
You need to set the feature on the image before you can use these methods.
We *could* allow get_parent, but that just...
Sage Weil
10:08 PM Revision e8b36ed9 (ceph): cls_rbd: make snapshots inherit the head's parent
When you take a snapshot, the snap gets the current parent of the head, if
any.
Signed-off-by: Sage Weil <sage@inkta...
Sage Weil
09:35 PM Revision c7f91e6c (ceph): cls_rbd_client: use snapid_t type
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:30 PM Revision ab42b8dd (ceph): add usage log tests to radosgw-admin tasks
tests 'usage show' and 'usage trim'
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
09:29 PM Revision c70c09f6 (ceph): Merge remote-tracking branch 'gh/next'
Conflicts:
src/osd/PG.cc
Sage Weil
08:26 PM Revision 145d1c14 (ceph): rgw: set s->header_ended before flushing formatter
otherwise we don't account the formatter in s->bytes_sent.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:26 PM Revision 8a4e2a11 (ceph): rgw: log user and not bucket owner for service operations
For operations that are done on the service (e.g., list buckets)
we need to log the user that did the operation, and ...
Yehuda Sadeh
08:25 PM Revision 282e2260 (ceph): rgw: initalize s->enable_usage_log
Missing initialization, we ended up not logging every operation.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:04 PM rbd Feature #2556: rbd tool: break image locks
Argh. I don't seem to be getting my email notifications from you and Josh on Github, and I don't know why. Greg Farnum
06:57 PM rbd Feature #2556: rbd tool: break image locks
https://github.com/ceph/ceph/commit/3c05629691deb800e3c6e62e81f444a748e8857c#src-rbd-cc-P108
just making sure i un...
Sage Weil
06:48 PM rbd Feature #2556: rbd tool: break image locks
Your commits look good to me (sorry I missed the cli tests; I need to get into the habit of running those), but I don... Greg Farnum
05:46 PM rbd Feature #2556: rbd tool: break image locks
rebase, fixed up ENOENT vs ENOEXEC behavior. one clarification about the purpose/scope of 'rbd lock', but otherwise ... Sage Weil
03:13 PM rbd Feature #2556 (Fix Under Review): rbd tool: break image locks
wip-rbd-locking has this now, but it also merges in wip-clsrbd for an unrelated change, so you might want to wait to ... Greg Farnum
05:49 PM Revision 4ec96336 (ceph): PG: improve find_best_info
07f853db3982e68b952a337cf91cbf7ec0709de9 is actually too conservative,
it suffices to find any info with a last_updat...
Samuel Just
05:15 PM Revision 992faa42 (ceph): doc: Added a snapshot page with outline. Needs additional info.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:12 PM Revision f3f144ad (ceph): osd: use derr (instead of cerr) for convertfs
This will appear in the log *and* stderr (if we're running in the
foreground).
Signed-off-by: Sage Weil <sage@inktan...
Sage Weil
05:11 PM Revision 74658dfa (ceph): osd: close stderr on daemonize
This spams stderr in an ugly way. Users should look at the logs.
In particular, filestore upgrades spam the console...
Sage Weil
05:10 PM Revision 2be02918 (ceph): doc: Added prerequisite of running Ceph cluster.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:06 PM Bug #2610 (Resolved): osd: pg stuck at scrubbing
Happened on congress, pg was stuck at scrubbing state for two and a half days.... Yehuda Sadeh
04:39 PM Revision 5435a4de (ceph): osdcap: make grammar build on spirit 2.2/boost 1.42
This is a stupid workaround, but it makes the compilation error go away.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:20 PM rbd Feature #2558 (Resolved): cls_rbd: child/parent methods
Sage Weil
04:05 PM devops Feature #2584 (In Progress): sepia: provide networking, DHCP for dynamic virtual machines
Sage Weil
04:04 PM Feature #2576 (In Progress): perf: 0.48 on long-term clusters
Sage Weil
04:04 PM Feature #2575 (In Progress): perf: 0.48 numbers
Sage Weil
03:52 PM rbd Feature #2609 (Resolved): librbd: new image name -> image head indirection
To prevent rename from disrupting clients with images open,
* put header in rbd_head.$id
* put $id in rbd_id.$nam...
Sage Weil
02:32 PM rgw Feature #2516 (Resolved): rgw: new bandwidth-only per-user log
Sage Weil
02:28 PM rbd Bug #2608 (Closed): rbd: hung xfstest 270
Logs are available in ubuntu@teuthology:/a/teuthology-2012-06-19_00:00:09-regression-next-testing-basic/1792
2012-...
Tamilarasi muthamizhan
01:25 PM Bug #2022: osd: misdirectect request
latest run log: ubuntu@teuthology:/a/teuthology-2012-06-18_19:00:05-regression-master-testing-gcov/1586 Tamilarasi muthamizhan
12:54 PM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
ubuntu@teuthology:/a/teuthology-2012-06-18_19:00:05-regression-master-testing-gcov/1579 Tamilarasi muthamizhan
11:31 AM Messengers Bug #1985: msgr: creating new Pipe for pre-existing connection leaks Pipe if they don't replace
I've still got this sitting around in my workspace. Since we seem to have pushed back a messenger re-do, perhaps we s... Greg Farnum
09:57 AM rbd Feature #2607 (Resolved): librbd: copyup helper
copyup helper to perform a copyup from parent to child. will be used by both the rbd command-line copyup command, an... Sage Weil
09:57 AM rbd Subtask #2606 (Resolved): librbd layering: copyup on missing child object
Sage Weil
09:57 AM rbd Subtask #2605 (Resolved): librbd layering: guard writes
Sage Weil
09:56 AM rbd Subtask #2604 (Resolved): librbd layering: read path
Sage Weil
09:56 AM rbd Subtask #2603 (Resolved): librbd layering: open parent on open
Sage Weil

06/18/2012

10:07 PM Bug #2550 (Fix Under Review): logrotate: SIGHUP upstart jobs too, not just sysvinit
Sigh. See branch upstart-vs-logrotate. Sage Weil
09:33 PM Revision 0d8970fc (ceph): PG: reg_last_pg_scrub on pg resurrection
This may solve the unreg_last_pg_scrub assert.
see #2453.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
09:33 PM Revision b0e66b70 (ceph): ceph_osd: move auto-upgrade to after fork
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
09:30 PM Revision 6feeff95 (ceph): librbd: simplify refresh sequence checks
- incremental seq each request
- set last to the last completed seq
- drop the now-redundant bool
Signed-off-by: Sag...
Sage Weil
09:29 PM Revision 6c88ff59 (ceph): Merge branch 'master' of github.com:ceph/ceph
John Wilkins
09:29 PM Revision b101f4ce (ceph): doc: first cut of OpenStack/RBD w/out cephx. Authentication coming soon.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:07 PM Revision 37e56e01 (ceph): filestore: make disk format upgrade warning less scary, more informative
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Sam Just <sam.just@inktank.com>
Sage Weil
09:02 PM Revision 030a2e3b (ceph): mon: include quorum in ceph status
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:00 PM Revision 2fc2cf03 (ceph): mon: gracefully handle slow 'ceph -w' clients
If we are sending log updates to a client (ceph -w), and they are far
enough behind to drop behind first_committed, i...
Sage Weil
08:57 PM rbd Feature #2556: rbd tool: break image locks
Greg Farnum wrote:
> Team RBD needs more to do! Pulling this forward. :)
Go team! :)
Sage Weil
06:26 PM rbd Feature #2556 (In Progress): rbd tool: break image locks
Team RBD needs more to do! Pulling this forward. :) Greg Farnum
08:22 PM Revision 8b96b9c7 (ceph): doc: Added libvirt installation.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:18 PM Revision 51fcef24 (ceph): PG: best_info must have a last_epoch_started as high as any other info
We disregard incomplete infos during find_best_info, but we can't an
info with a last_epoch_started less that of the ...
Samuel Just
05:56 PM rbd Feature #2585 (In Progress): rbd: clone command
Dan Mick
05:34 PM rbd Feature #2585: rbd: clone command
Dan Mick
05:36 PM Revision 2b92c73b (ceph): doc: btrfs and parted chef recipes are not needed.
They were removed in ceph-cookbooks.git commit
a77b418b95df8f2b6f56d9a90e47dc62100cdf4d.
Signed-off-by: Tommi Virtan...
Tommi Virtanen
05:35 PM rbd Feature #2559: cls_rbd: copyup method
Dan Mick
01:50 PM rbd Feature #2601: rbd: Show image size with an "ls"
We've also heard from others that having a better estimate of rbd usage and expected usage would be good; taking into... Dan Mick
06:09 AM rbd Feature #2601 (Resolved): rbd: Show image size with an "ls"
On the mailinglist the request came if the "rbd" tool could be modified to not only show image names when doing an ls... Wido den Hollander
01:34 PM rgw Bug #2542 (Resolved): rgw: support S3 update of metadata
Yehuda Sadeh
01:32 PM rgw Bug #2542: rgw: support S3 update of metadata
Resolved, commit:343cc792e847ca8901f6c08e41799a2fbbd2ca92 Yehuda Sadeh
11:04 AM Bug #2602: osd: push failed because local copy is X
Updated another osd to 'next' and same errors happened.
I've attached the log with debug osd = 20 set.
Simon Frerichs
08:46 AM Bug #2602: osd: push failed because local copy is X
Is this reproducible with 'debug osd = 20'? Sage Weil
08:44 AM Bug #2602 (Resolved): osd: push failed because local copy is X
Hi,
filestore updated completed.
When i start the "updated" OSD the whole cluster starts lagging.
Is the next br...
Sage Weil
08:45 AM Bug #2598: filestore: error during upgrade
Simon Frerichs wrote:
> Hi,
>
> filestore updated completed.
> When i start the "updated" OSD the whole cluster ...
Sage Weil
08:42 AM Bug #2598 (Resolved): filestore: error during upgrade
THanks! Sage Weil
01:29 AM Bug #2598: filestore: error during upgrade
Hi,
filestore updated completed.
When i start the "updated" OSD the whole cluster starts lagging.
Is the next br...
Simon Frerichs
12:56 AM Bug #2598: filestore: error during upgrade
Thanks.
The bug seems to be fixed.
Simon Frerichs
08:43 AM Bug #2595: filestore: error creating filestore during mkcephfs
2012-06-18 17:42:16.232924 7f54292fb780 -1 filestore(/srv/osd.20) could not find 23c2fcde/osd_superblock/0//-1 in ind... Stefan Priebe
08:29 AM Bug #2599: osd: crash in ReplicatedPG::C_OSD_OndiskWriteUnlock::finish
commit:5efaa8d7799347dfae38333b1fd6e1a87dc76b28 Sage Weil
07:25 AM CephFS Bug #2596: mds: spinning on restart
gdb is not helpful here, process seems to be spinning in syscall:
(gdb) thread apply all bt
Thread 1 (process 148...
Amon Ott

06/17/2012

10:40 PM Bug #2600: osd: crazy long watch timeout?
Possibly related to #2476 Josh Durgin
09:37 PM Bug #2600 (Resolved): osd: crazy long watch timeout?
... Sage Weil
09:34 PM CephFS Bug #1737: ceph-fuse crash in xlist::remove
see ubuntu@teuthology:/a/teuthology-2012-06-17_19:00:03-regression-master-testing-gcov/1303 for a failure with logs! Sage Weil
09:00 PM Revision 10c57b5a (ceph): Merge branch 'next'
Sage Weil
08:21 PM Revision 95ac5bae (ceph): debian: fix python-ceph depends
Signed-off-by: Laszlo Boszormenyi (GCS) <gcs@debian.hu> Laszlo Boszormenyi (GCS)
08:21 PM Revision d7fe0e3f (ceph): debian: update homepage url
Signed-off-by: Laszlo Boszormenyi (GCS) <gcs@debian.hu> Laszlo Boszormenyi (GCS)
08:20 PM Revision 82cb3d61 (ceph): filestore: fix 'omap' collection skipping
The if/else if/... structure was skipping this test if the file system
didn't support d_type.
Fixes: #2598
Signed-of...
Sage Weil
07:16 PM Revision 372fbe0e (ceph): sync clock at start of every run
Sage Weil
07:00 PM Revision 343cc792 (ceph): run-cli-test: use new pip incantation
http://www.pip-installer.org/en/latest/news.html#id1
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:15 PM Revision e3b6957d (ceph): run-cli-test: use new pip incantation
http://www.pip-installer.org/en/latest/news.html#id1
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:07 PM Revision 3d4ba431 (ceph): cls_rbd: do not pass snapid_t to vargs
On squeeze,
warning: cls_rbd.cc:534: cannot pass objects of non-POD type ‘struct snapid_t’ through ‘...’; call will ...
Sage Weil
03:58 PM Revision a4589c6a (ceph): don't dup ceph task for new fsx jobs
Sage Weil
02:33 PM RADOS Feature #2422 (In Progress): crush: test that mapping result is uncorrelated
Sage Weil
02:32 PM Bug #2598: filestore: error during upgrade
Ah... should have tested on another filesystem. Samuel Just
02:21 PM Bug #2598: filestore: error during upgrade
Oh, der.. pretty sure commit:82cb3d61ff4f200e0a9040e6381a9eed32db9de1 fixes this. Sage Weil
02:29 PM Bug #2022: osd: misdirectect request
Last two failures were the rados api tests:... Sage Weil
06:50 AM CephFS Bug #2385: max mds = 2, mds hang and crash
... Yavuz Selim Komur
06:45 AM CephFS Bug #2385: max mds = 2, mds hang and crash
... Yavuz Selim Komur
03:14 AM Revision f7ee34b5 (ceph): tolerate 250ms clock drift
Sage Weil
03:09 AM Revision 5efaa8d7 (ceph): msg: fix buffer overflow in ipv6 addr parsing
Noticed because of failing i386 unit tests for long addrs; x86_64 passed
fine. Sigh. FTR, the failing address was
...
Sage Weil

06/16/2012

10:05 PM Revision 9929ceb4 (ceph): include suite in archive dir
Sage Weil
09:50 PM Revision d9e902f9 (ceph): ceph.newdream.net -> ceph.com
wiki url -> docs url Sage Weil
08:59 PM Revision abd7d183 (ceph): whitelist 'slow request' in qa runs
Sage Weil
02:42 PM Revision 150e1fc5 (ceph): Merge branch 'next'
Conflicts:
src/cls_rbd.cc
Sage Weil
02:34 PM Revision f8a196fb (ceph): cls_rbd: drop useless snapshot metadata helpers
Now that cls_rbd_snap is encodable, we don't need these helpers; get_key()
will suffice.
Signed-off-by: Sage Weil <s...
Sage Weil
02:33 PM Revision b08d7ba9 (ceph): cls_rbd: use encode macros for on-disk snap metadata
This will let us version this encoding later when we add new information
and features, like a per-snap parent.
Signe...
Sage Weil
11:34 AM Bug #2598: filestore: error during upgrade
That's odd, it's updating the omap directory as a collection. list_collections should not have returned omap as a co... Samuel Just
08:04 AM Bug #2598 (Resolved): filestore: error during upgrade
from ML:... Sage Weil
08:25 AM Bug #2462 (Resolved): osd/PG.cc: 402: FAILED assert(log.head >= olog.tail && olog.head >= log.tail)
I'm going to optimistically call this resolved. If we see this crash again, though, we'll need to reopen, and hopefu... Sage Weil
08:24 AM rbd Bug #2535: rbd: random data corruption in vm
We've disabled fiemap, which appears to be the culprit. Josh is still tracking down which kernel releases are affect... Sage Weil
08:21 AM Bug #2599 (Can't reproduce): osd: crash in ReplicatedPG::C_OSD_OndiskWriteUnlock::finish
from ml:... Sage Weil
07:59 AM Bug #2595 (Resolved): filestore: error creating filestore during mkcephfs
Sage Weil
07:59 AM Bug #2595: filestore: error creating filestore during mkcephfs
commit:1e899d08e61bbba0af6f3600b6bc9a5fc9e5c2e9 Sage Weil
06:40 AM Bug #2595: filestore: error creating filestore during mkcephfs
Yes Stefan Priebe
12:39 AM Revision dac9f223 (ceph): Merge branch 'wip-rbd-locking'
Conflicts:
src/cls_rbd.cc
Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Greg Farnum
12:30 AM Revision 07f853db (ceph): PG: best_info must have a last_epoch_started as high as any other info
We disregard incomplete infos during find_best_info, but we can't an
info with a last_epoch_started less that of the ...
Samuel Just
12:27 AM Revision c88a4550 (ceph): cls_rbd: add locking functions.
Users can now place shared or exclusive locks, can remove their own
locks, can break the locks of others, and can lis...
Greg Farnum
12:27 AM Revision 71e1b1fd (ceph): cls_rbd: add failure logging to all callers of read_key.
Previously, read_key was printing out error messages when it failed.
But because some callers are fine with it failin...
Greg Farnum
12:27 AM Revision 71c7ed3f (ceph): cls_rbd_client: expose locking operations
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:27 AM Revision 48d2376e (ceph): test: add rbd image locking tests to test_cls_rbd
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:27 AM Revision 668aa786 (ceph): librbd: include the image lockers in the ImageCtx mutable metadata
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:27 AM Revision b018befd (ceph): librbd: add cooperative image locking to the C and C++ interfaces
While we're at it, increment the version number.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
12:27 AM Revision 5b1b02b6 (ceph): librbd: remove the set-r-and-return idiom from some functions.
Josh hates this, and I have to admit it seems silly.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum

06/15/2012

09:48 PM Revision 1e899d08 (ceph): filejournal: make less noise about open failures
The callers report errors and pass up errors, so do not spam stderr with
this. Fixes the confusion that sparked #259...
Sage Weil
08:44 PM Revision b415fd21 (ceph): rgw: obj copy respects -metadata-directive
Fixes #2542. The old behavior just merged src object attrs
and provided attributes. The new (and correct) behavior lo...
Yehuda Sadeh
08:06 PM Revision 2dd17988 (ceph): librbd: only clear the needs_refresh flag on successful updates.
To support this, add a refresh_seq counter which is incremented every
time we get a notify. This way we can handle th...
Greg Farnum
06:59 PM Revision 0c40b24c (ceph): Run fsx on rbd with thrashing
Josh Durgin
06:55 PM Revision 50e01c18 (ceph): Increase number of ops done by fsx against rbd.
Especially in the no-cache case, this should detect more races. The
fiemap problem is detectable on plana after ~5000...
Josh Durgin
06:54 PM Revision b5de8398 (ceph): osd: optional verify that sparse_read holes are zero-filled
This should help us track down/verify #2535. It seems to happen on several
different systems, but we haven't figured...
Sage Weil
06:54 PM Revision 1156f943 (ceph): buffer: add list and ptr is_zero() method
Simple helper to check if a buffer is all zeros.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:10 PM Revision 6fd91381 (ceph): Merge remote-tracking branch 'gh/wip-osdcap'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
05:58 PM rbd Feature #1480 (Resolved): librbd: image locking
Okay, discussed and merged in commit:dac9f223598c5f67b228403e514f202280d56488 Greg Farnum
05:49 PM rbd Feature #1480: librbd: image locking
And after thorough review from Josh, this should be ready for merge (commit:5b1b02b60a253092700f364dca77bb6b1065e3e0)... Greg Farnum
05:10 PM Revision 614d5a7b (ceph): LFNIndex.cc: escape null in append_[un]escaped
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:00 PM Revision 9d4ce4bc (ceph): filestore: disable 'filestore fiemap' by default
We've seen this failing on both btrfs (Guido) and XFS (Oliver). This works
around #2535.
Signed-off-by: Sage Weil <...
Sage Weil
02:40 PM rgw Bug #1643 (Rejected): radosgw-admin log show should accept --time
Yehuda Sadeh
02:03 PM Bug #2595: filestore: error creating filestore during mkcephfs
Oh, it looks like it's just noise from checking the journal. The mkcephfs succeeded, right? Sage Weil
01:57 PM Bug #2595: filestore: error creating filestore during mkcephfs

> Can you reproduce with 'debug filestore = 20' and attach the log to this
> bug?
Log:...
Stefan Priebe
10:32 AM Bug #2595: filestore: error creating filestore during mkcephfs
FYI, I saw this once when I was working on the OSD hotplug code paths. Mine might have been caused by a missing "osd ... Anonymous
09:29 AM Bug #2595 (Resolved): filestore: error creating filestore during mkcephfs
from ML:... Sage Weil
11:48 AM rbd Bug #2597 (Resolved): Import of image from file appears to succeed, but image not present in the ...
I have been testing with storing an image file, a basic QCOW2 image of latest Ubuntu distro on a pool, which is used ... Sam Zaydel
10:44 AM rbd Feature #2558: cls_rbd: child/parent methods
wip-clsrbd Sage Weil
10:44 AM rbd Feature #2558 (Fix Under Review): cls_rbd: child/parent methods
Sage Weil
09:44 AM CephFS Bug #2596 (Can't reproduce): mds: spinning on restart
from ML:... Sage Weil

06/14/2012

11:47 PM Revision dbbbe21f (ceph): osdcap: 'allow rwx pool foo' or 'allow pool foo rwx'
Make the match vs spec order not matter.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:38 PM Revision 1bd5d240 (ceph): nightly push to local repo
caleb miles
11:36 PM Revision 66df6be8 (ceph): ceph-authtool: update docs to reflect no more osdcap deny
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:35 PM Revision f9161688 (ceph): osdcap: make caps additive
Make 'allow pool foo r, allow pool foo w, allow x' sufficient when you
require rwx for pool foo.
Signed-off-by: Sage...
Sage Weil
10:17 PM Revision 01f48039 (ceph): Merge branch 'wip-radosgw-upstart'
Sage Weil
10:09 PM Revision b44092f5 (ceph): radosgw: stop startup timer on failed start
This fixes crashes like
-1> 2012-06-14 15:04:31.733009 7f544e18c780 -1 Couldn't init storage provider (RADOS)
...
Sage Weil
10:04 PM Revision 6f30f1fc (ceph): upstart: make id charset include - _ and .
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:03 PM Revision 7bfcc7c4 (ceph): radosgw: takes --id, but not -i
The -i short versio doesn't work on 'client' code that tend to use it for
input files.
Signed-off-by: Sage Weil <sag...
Sage Weil
09:19 PM Revision 92589f88 (ceph): objclass: allow class methods to retrieve their triggering entity_inst
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:18 PM Revision 7515b05e (ceph): doc: added qemu-img documentation for rbd.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:12 PM Revision 5db4509b (ceph): rgw: limit number of buckets per user
Adding a configurable max_buckets per user. Bucket creation
verifies that max_buckets has not reached.
Backport: dho...
Yehuda Sadeh
09:06 PM Revision 9aeac5de (ceph): add radosgw-admin test to regression suite
We wrote this test ages ago, but forgot to add it! Fixed up a few things
that have changed since then.
Sage Weil
09:04 PM Revision 57da2794 (ceph): schedule_suite.sh: add flavors, check/fix sha1s, optional templates
This should be everything we need to use this for the nightlies, with the
exception of updating the git trees, which ...
Sage Weil
09:04 PM Revision 83f8f3d1 (ceph): radosgw-admin: fix swift subuser/key tests
Need to do 'subuser (add|rm)', not 'key (add|rm)'. Sage Weil
09:04 PM Revision 474f8da4 (ceph): radosgw-admin: remove buckets before user
Otherwise user delete will fail. Sage Weil
09:04 PM Revision 697c3b94 (ceph): radosgw-admin: test max buckets limit
Sage Weil
09:04 PM Revision 3bd387f9 (ceph): radosgw-admin: fix for non-numeric bucket ids
Sage Weil
09:02 PM Linux kernel client Bug #2389 (Duplicate): rbd: hung xfstest 67
Sage Weil
09:01 PM Linux kernel client Bug #2359 (Can't reproduce): xfstest 62 failing
haven't seen this in a while Sage Weil
07:59 PM Revision 41488c02 (ceph): Merge branch 'master' of github.com:ceph/ceph into wip-2543
caleb miles
05:55 PM Feature #2571 (Resolved): sepia: enable virtualization
Dan Mick
11:34 AM Feature #2571 (In Progress): sepia: enable virtualization
BIOS settings changed on all plana; one reboot test shows good results. One can tell if
virtualization is enabled w...
Dan Mick
04:46 PM Revision 0adb33d8 (ceph): doc: Added steps for OpenStack install with DevStack
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:35 PM Revision 446e5d8e (ceph): doc: fixed bash syntax error.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:12 PM Bug #2593 (Resolved): logmonitor: decode failure
Saw this while trying to reproduce #2569. Sadly teuthology cleaned everything up before I could get to the data.
<pr...
Greg Farnum
04:07 PM Revision 97c9f014 (ceph): qa: disable xfstest 68 for now
Stop the qa noise we fix #2410. Looks like a freeze/thaw thing.
Maybe Jan's new freeze/thaw code will address this?...
Sage Weil
04:01 PM Revision 7f4d1710 (ceph): qa: disable xfstest 219 for now
The cause of 219 failing is non-obvious. Disable it for now. :(
Avoids #2522.
Signed-off-by: Sage Weil <sage@inkt...
Sage Weil
03:24 PM Feature #2581 (In Progress): perf: investigate 0.47.2 precise vs 0.46 oneiric discrepancy
Sage Weil
03:13 PM devops Feature #2415 (Resolved): upstart: support radosgw
Sage Weil
03:06 PM rbd Bug #2534: librbd: make sure watch is established on same header version as initial read was
Okay, this is blocked by #2592. Greg Farnum
03:06 PM Bug #2563: leveldb corruption
It's triggerable without ceph, I've filed a bug below with leveldb and I'm continuing to look into it.
http://code...
Samuel Just
03:05 PM Bug #2592: osd and all clients: watch version parameter is ignored
Alternatively, maybe the OSD should just enforce the version with those checks when setting a watch? It looks to me a... Greg Farnum
03:01 PM Bug #2592 (Resolved): osd and all clients: watch version parameter is ignored
Watch operations have a version parameter that is supposed act like an assert_version op. This could easily be done i... Josh Durgin
02:38 PM Feature #2471 (In Progress): osd: add prefix match to OSDCaps
you can have this one too, given your wip-osdcap branch. Greg Farnum
02:37 PM rbd Feature #1480 (Fix Under Review): librbd: image locking
wip-rbd-locking Greg Farnum
02:09 PM rgw Feature #2517 (Resolved): rgw: limit number of buckets per user (configurable per user)
added teuth tests, in master, backported to dho Sage Weil
02:04 PM rgw Bug #2591 (Resolved): misc rgw s3tests failures
Should be ok for now. I've set boto to 2.4.1, we can change that later once upstream fixes its issues. Yehuda Sadeh
10:15 AM rgw Bug #2591: misc rgw s3tests failures
boto 2.5.0 issue. For some reason it doesn't set the error.reason on 400 responses. Yehuda Sadeh
07:57 AM rgw Bug #2591 (Resolved): misc rgw s3tests failures
2012-06-13T12:51:42.657 INFO:teuthology.orchestra.run.err:s3tests.functional.test_headers.test_bucket_create_bad_auth... Sage Weil
12:59 PM rbd Bug #2535: rbd: random data corruption in vm
Sage Weil wrote:
> Just a bit of context: rbd without caching does a 'sparse-read' operation, which uses FIEMAP to d...
Sage Weil
12:52 PM rbd Bug #2535: rbd: random data corruption in vm
Just a bit of context: rbd without caching does a 'sparse-read' operation, which uses FIEMAP to determine which parts... Sage Weil
12:50 PM rbd Bug #2535: rbd: random data corruption in vm
Let's try a different tack: I pushed a osd-verify-sparse-read-holes branch to ceph.git (based on 0.47.2) that reads ... Sage Weil
09:09 AM rbd Bug #2535: rbd: random data corruption in vm
Status update:
I tried modifying the iotester so that it would work directly on the block device, in the hopes I c...
Guido Winkelmann
10:14 AM Feature #2472: osd: add opaque 'class <name> <foo>' cap that class can interpret/enforce
wip-osdcap is doing this way better than I was, although I'm happy to take it back to do the OSD changes if need be. Greg Farnum
09:09 AM rbd Bug #2410: hung xfstest #68
disabled 68 in qa for the time being. Sage Weil
09:03 AM rbd Bug #2522: xfstest #219
Sigh.. took a quick look and it's non-obvious why the repquota output doesn't match. Disabling this for now, but lea... Sage Weil
12:15 AM Revision 051e1686 (ceph): osdcap: move semantics of cap check into OSDCap class
The class needs to understand the semantics of possible actions in enough
detail to apply the capabilities it describ...
Sage Weil

06/13/2012

11:30 PM Revision 3dbad22c (ceph): Makefile: fix leveldb includes for system library case
Use the installed headers, not ours.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:30 PM Revision 529c4844 (ceph): Makefile: fix leveldb dep for system library case
We conditionally add this below only if using the bundled version.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:23 PM Revision e48dab43 (ceph): doc: Multi-monitor support means ceph::single_mon was renamed.
This changed in ceph-cookbooks.git commit
8e56551b11fe28cc4f29f4fcdcf6c38516bdc833.
Signed-off-by: Tommi Virtanen <t...
Tommi Virtanen
08:57 PM Linux kernel client Bug #2590 (New): possible irq lock inversion dependency with con->mutex and osdc->request_mutex
i thought this was #147, but on closer inspection it's something else;... Sage Weil
08:55 PM Revision 762f6d95 (ceph): osdcap: whitespace to make grammar more readable
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:42 PM Revision 1cfd65b1 (ceph): osdcap: use static const instead of #define
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:42 PM Revision bf72a00e (ceph): osdcap: drop unused 'auid' arg from get_cap(), is_match()
The intent was to do a cap like 'allow pools-that-i-own ...', but in
practice you just do 'allow pool <myuid> ...'.. ...
Sage Weil
07:17 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
Filed upstream: https://bugs.launchpad.net/upstart/+bug/1012938 Anonymous
06:24 PM Revision 67710a65 (ceph): rbd: fix usage test
Fixes: #2347
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:17 PM rgw Feature #2516: rgw: new bandwidth-only per-user log
I think the last thing we need here is to add it to the radosgw-admin test so that we don't break these commands in t... Sage Weil
12:26 PM rgw Feature #2516 (In Progress): rgw: new bandwidth-only per-user log
Sage Weil
06:05 PM Revision 66553d25 (ceph): Makefile: link gtest statically
The problem:
- the unittests link against gtest, and gtest is not installed. that's
normally fine, but...
- rb...
Sage Weil
05:04 PM rgw Feature #2473: rgw: revisit operation logging
Not the top priority, but we can have an async flush, similar to the one we have for the usage logging. Yehuda Sadeh
05:03 PM Revision 395c4468 (ceph): Merge branch 'master' of github.com:ceph/ceph into wip-2543
caleb miles
04:22 PM Revision 282277de (ceph): radosgw: upstart support
Like the other upstart configs, these assume the default value for
'rgw data'. Same pattern as ceph-mon and ceph-mds...
Sage Weil
04:22 PM Revision d1fc562d (ceph): debian: install radosgw upstart configs, daemon dir
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:20 PM Linux kernel client Bug #2573: libceph: many "socket closed" messages
In that case, if you want to run this with the osd messenger debug at 5 and can gather logs next time I'll be happy t... Greg Farnum
02:35 PM Linux kernel client Bug #2573: libceph: many "socket closed" messages
The test takes on the order of a minute to complete one pass
of test 049. During that time I typically see 10-20 so...
Alex Elder
10:46 AM Linux kernel client Bug #2573: libceph: many "socket closed" messages
The sockets have a default timeout of 15 minutes, after which they will close — the idea being that if the socket is ... Greg Farnum
10:40 AM Linux kernel client Bug #2573 (Resolved): libceph: many "socket closed" messages
While trying to reproduce a null pointer problem in the client
messenger code I was running xfstests #049 over RBD d...
Alex Elder
02:46 PM Revision c4a607aa (ceph): CrushTester.cc: eliminated old compiler warning of uninitialized pointer.
Signed-off-by: caleb miles <caselim@gmail.com> caleb miles
02:31 PM Feature #988 (Duplicate): librbd: trivial layering
replaced by other tasks Sage Weil
02:31 PM Feature #988 (Rejected): librbd: trivial layering
Sage Weil
02:31 PM devops Feature #2589 (Resolved): crowbar: Update barclamp-ceph for Essex, new ceph-cookbooks
Anonymous
02:30 PM devops Feature #2588 (Resolved): downburst: multiple, configurable networks to libvirt
Right now, it hardcodes that a vm only has the "default" network. Make that configurable. Anonymous
02:29 PM devops Feature #2587 (Resolved): sepia: isolated networking on vercoi (manual, a handful)
One-time switch & linux configuration for a handful of VLANs, manually allocated to people who want to run Crowbar. Anonymous
02:28 PM rbd Feature #2586 (Rejected): rbd: check/take locks on --lock
if you pass --lock to rbd, take an exclusive lock, do whatever, unlock Sage Weil
02:20 PM rbd Feature #2585 (Resolved): rbd: clone command
A command for the rbd tool to create a child image from a parent. Example:
rbd clone --parent pool/image@snap pool...
Sage Weil
01:56 PM rbd Feature #2467 (Rejected): qemu: implement bdrv_invalidate_cache
I've tested migration with caching, and read the code, and it looks like this is unnecessary. qemu is doing a flush b... Josh Durgin
01:47 PM devops Feature #2584 (Resolved): sepia: provide networking, DHCP for dynamic virtual machines
downburst can provision them really nicely, but right now only static networking works. To fix that, we need DNS to w... Anonymous
01:40 PM devops Feature #2583 (Resolved): crowbar: change barclamp-nova to use rbd
The nova proposal needs to point to a ceph proposal. Look at how nova&glance use mysql.
barclamp-chef should inclu...
Anonymous
01:25 PM Feature #1964 (Rejected): ferro: Create a cloud-init OVF config that reimages a machine
Dell's vMedia functionality is awfully buggy, aborting this plan (for now?). Anonymous
01:25 PM Feature #1965 (Rejected): ferro: Machine management state machine (fake actions)
Dell's vMedia functionality is awfully buggy, aborting this plan (for now?). Anonymous
01:24 PM Feature #1966 (Rejected): ferro: Connect actions to state machine
Anonymous
01:20 PM Feature #1966: ferro: Connect actions to state machine
Dell's vMedia functionality is awfully buggy, aborting this plan (for now?). Anonymous
01:21 PM Feature #1967 (Rejected): ferro: Single API endpoint that delegates to machine managers
Anonymous
01:20 PM Feature #1967: ferro: Single API endpoint that delegates to machine managers
Dell's vMedia functionality is awfully buggy, aborting this plan (for now?). Anonymous
01:20 PM Feature #1968 (Rejected): ferro: Batch resource allocation (not fair, no quotas yet)
Dell's vMedia functionality is awfully buggy, aborting this plan (for now?). Anonymous
01:20 PM rbd Bug #2522: xfstest #219
The problem here appears to be that the output of the repquota
command is not what's expected. I think the group qu...
Alex Elder
01:17 PM Feature #1962 (Rejected): ferro: Trigger vMedia boot via IPMI/DRAC
Dell's vMedia functionality is awfully buggy, aborting this plan (for now?). Anonymous
01:16 PM Feature #1961 (Rejected): ferro: Python wrapper for vmcli (using gevent)
Dell's vMedia functionality is awfully buggy, aborting this plan. Anonymous
01:12 PM Feature #1963 (Closed): ferro: OVF Environment creation as a library
downburst actually ended up containing this logic, not OVF but still cloud-init. Anonymous
01:04 PM rgw Feature #2517 (Fix Under Review): rgw: limit number of buckets per user (configurable per user)
Sage Weil
01:03 PM Feature #2582 (Resolved): set up chart.io + mysql (or equivalent) infrastructure for tracking perf
Sage Weil
12:44 PM Linux kernel client Bug #2287 (Resolved): rbd: crashes with 10Gbit network and fio
This looks like the bio->iter problem, which is now fixed by commit:43643528cce60ca184fe8197efa8e8da7c89a037 in ceph-... Sage Weil
12:38 PM Feature #2581 (Resolved): perf: investigate 0.47.2 precise vs 0.46 oneiric discrepancy
Sage Weil
12:37 PM Feature #2580 (Resolved): perf: investigate poor performance at 10 osds per node
Sage Weil
12:32 PM Feature #2578 (New): rados ager
aging function that is invoked (probably) similarly to rados bench, ideally using the same bencher abstraction so tha... Sage Weil
12:30 PM Feature #2577 (Resolved): teuthology: blktrace task
* run blktrace on the osds' disks.
* put results in the archive dir
* maybe an optional start delay, duration, ...
Sage Weil
12:30 PM Feature #2576 (Resolved): perf: 0.48 on long-term clusters
Sage Weil
12:29 PM Feature #2575 (Resolved): perf: 0.48 numbers
populate the spreadsheet with values from 0.48 Sage Weil
11:43 AM Messengers Bug #2569 (Need More Info): msgr: connect_rank crash
I'm attempting to reproduce this, but what's available right now is just the teuthology log — it didn't pull off any ... Greg Farnum
09:57 AM Messengers Bug #2569 (Resolved): msgr: connect_rank crash
... Sage Weil
11:31 AM devops Feature #2574 (Resolved): crowbar: use data disks automatically, journal inside data directory
Crowbar sets node['crowbar']['disks'] to an array of disks. First one is used for the OS, and disk['usage'] is set to... Anonymous
11:20 AM rbd Cleanup #2347 (Resolved): The rbd help text is misleading on required arguments
commit:67710a65c7cd1173c73c40241572d615dd7da1f3 Sage Weil
11:06 AM devops Feature #2415 (Fix Under Review): upstart: support radosgw
Sage Weil
11:02 AM Cleanup #2331 (Resolved): Makefile.am:182: `lib/libgtest.a' is not a standard libtool library name
commit:66553d25f09f0d0cea735a862a228060b72c0ce6 Sage Weil
10:30 AM rbd Bug #2572 (Resolved): krbd: writeback errors?
While trying to reproduce a null pointer messenger problem,
I kept hitting messages like this after some (fairly ran...
Alex Elder
10:29 AM Feature #2571 (Resolved): sepia: enable virtualization
Sage Weil
10:27 AM rbd Bug #2535: rbd: random data corruption in vm
Sage Weil wrote:
> Guido Winkelmann wrote:
> > Sage Weil wrote:
> > > Are there multiple partitions or is LVM on t...
Guido Winkelmann
10:03 AM Linux kernel client Bug #2389: rbd: hung xfstest 67
ubuntu@teuthology:/a/nightly_coverage_2012-06-13-a/7559 Sage Weil
09:55 AM Linux kernel client Bug #2389: rbd: hung xfstest 67
ubuntu@teuthology:/a/master-2012-06-12_16:17:15/7465 Sage Weil
10:02 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
ubuntu@teuthology:/a/nightly_coverage_2012-06-13-a/7579
ubuntu@teuthology:/a/nightly_coverage_2012-06-13-a/7587
<...
Sage Weil
09:59 AM CephFS Bug #1947: mds: SIGBUS during _mark_dirty
ubuntu@teuthology:/a/nightly_coverage_2012-06-13-a/7526 Sage Weil
09:23 AM rbd Feature #2568 (Resolved): qa: run xfstests on qemu+rbd
This will build on #2566:
* stage xfstests on vdb, like a regular workunit, and:
* map additional rbd images to r...
Sage Weil
09:21 AM rbd Feature #2567 (Resolved): qa: add qemu+rbd jobs to qa suite
Add a bunch of workunits to the qa suite that will run on top of rbd inside a vm. Sage Weil
09:20 AM rbd Feature #2566 (Duplicate): teuthology: task to run rbd workunits in a vm
teuthology task that will:
* download workunit vm
* create and format rbd image
* mount, stage a workunit in rbd...
Sage Weil
02:16 AM Revision c06e4075 (ceph): osd: reimplement OSDCap struct
The old OSDCaps had a few undesireable features:
- deny
- weird 'default permissions' semantics when auid matches ...
Sage Weil

06/12/2012

08:58 PM Revision 6e551f73 (ceph): init-ceph: make 'status' report running version
Try to report the running version with 'status'.
Backport: dho
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:48 PM Feature #2564 (Resolved): teuthology: install kernels from local dir
Sage Weil
08:01 PM Revision f822c025 (ceph): PG: track purged pgs during active
See bug #2462.
The following sequence could cause a log assuming a non-empty pg
to an empty replica:
1. primary sen...
Samuel Just
07:48 PM Revision e9390bb4 (ceph): Merge branch 'wip-keyrings'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
06:50 PM Revision 4ec4b477 (ceph): ceph_argparse: make entityname parsing error more helpful
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:50 PM Revision 8b5336a1 (ceph): osd: simplify session peer type check
Check the authenticated peer name directly; don't store a copy in the
OSDCaps struct. Also set the entity_name membe...
Sage Weil
06:50 PM Revision 72edd353 (ceph): osdcaps: drop dead peer_type code
Not sure what this wasn intended for.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:50 PM Revision 500c72e8 (ceph): config: make meta expansion in set_val optional
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:50 PM Revision d422bf15 (ceph): config: expand any config variable in config
This is currently broken wrt 'foo' vs 'foo_bar', but otherwise works.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:50 PM Revision ce79f9e6 (ceph): config: improve variable substitution
- allow ${foo_bar}
- prevent $foogarbage from substitution variable 'foo'
- use std::string throughout
- improve test...
Sage Weil
06:50 PM Revision eeeb5993 (ceph): mds: add mds_data config
This is where the keyring will live. Eventually more may move in too.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:50 PM Revision e958efa2 (ceph): new default keyring location for daemons
$osd_data/keyring
$mds_data/keyring
Fixes: #2498
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:50 PM Revision ca2010ee (ceph): vstart.sh: set data dirs instead of keyring locations
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:50 PM Revision 6f9ecef7 (ceph): rgw: default keyring to $rgw_data/keyring
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:50 PM Revision 4cc122cc (ceph): entity_name: is_*() methods
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:29 PM Revision b88a5e53 (ceph): crush: fix retry histogram array overflow due to indexing error
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
06:28 PM Revision 37dea2d8 (ceph): cls_rgw: use updated internal omap api
These were changed by 23ebc093a64a9c9812124ba6b87e301f95a53a5c.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:27 PM Revision 6fd80d3e (ceph): cls_rgw: fix CLS_LOG macro usage
Now takes a level.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:19 PM Revision 42539daf (ceph): Merge branch 'wip-2543' of github.com:ceph/ceph into wip-2543
caleb miles
06:15 PM Revision 1fdd411a (ceph): crush: fix retry histogram array overflow due to indexing error
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
05:54 PM Revision ddc9b279 (ceph): mapper.c: fixed array indexing error.
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
05:10 PM Revision c31b4e31 (ceph): remove ceph-kdump-copy
Moved to its own repo.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:08 PM Revision d64600df (ceph): debian: limit build archs
We depend on leveldb, which only builds on
amd64 armel armhf i386 ia64 mipsel
Reported-by: Laszlo Boszormenyi (GCS...
Sage Weil
02:58 PM Bug #2462 (Need More Info): osd/PG.cc: 402: FAILED assert(log.head >= olog.tail && olog.head >= l...
f822c0257e4c7fad181332cd149205ad15a8b9db
See the commit description. Unfortunately, I don't really have evidence ...
Samuel Just
02:55 PM Bug #2563 (Resolved): leveldb corruption
This was also mentioned once in the mailing list.
ceph version 0.47.2 (commit:8bf9fde89bd6ebc4b0645b2fe02dadb1c17a...
Samuel Just
02:40 PM rbd Feature #2561: rbd: copyup command
What? How does a class function of any kind provide atomicity in cross-OSD data copies? Greg Farnum
02:37 PM rbd Feature #2561: rbd: copyup command
'rbd copyup pool/image' command to copy any missing objects up from the parent. simple O(n) operation that leverages... Sage Weil
02:11 PM rbd Feature #2561 (Resolved): rbd: copyup command
'rbd copyup pool/image' command to copy any missing objects up from the parent. simple O(n) operation that leverages ... Sage Weil
02:39 PM rbd Feature #2562 (Resolved): librbd: open parent images, read path, write path
- when we open an image, open the parent image too.
- make reads fall through to parent
- guard writes beyond paren...
Sage Weil
02:05 PM rbd Feature #2560 (Resolved): rbd: safe parent deletion
- maintain map of parent/child pairs in each child pool... Sage Weil
02:04 PM rbd Feature #2531: rbd: fencing broken clients
As I see it, we have two options that we need to choose between.
1) We can add fencing to librbd and let anybody do ...
Greg Farnum
01:58 PM Bug #2550: logrotate: SIGHUP upstart jobs too, not just sysvinit
The instance jobs make this a bit trickier. Either process "initctl list" output or copy the logic that walks the /va... Anonymous
01:04 PM Bug #2550 (Resolved): logrotate: SIGHUP upstart jobs too, not just sysvinit
Anonymous
01:55 PM rbd Feature #2559 (Resolved): cls_rbd: copyup method
- client provides object content
- if object exists, fail with EEXIST (or 0, or something)
- if object does not exi...
Sage Weil
01:54 PM rbd Feature #2558 (Resolved): cls_rbd: child/parent methods
On the new image header:
- set_parent(poolid, image (maybe id, maybe name), snapid)
On the per-pool child list:
...
Sage Weil
01:52 PM rbd Feature #2557 (Rejected): QEMU support for image locking
We should convert QEMU to make use of rbd cooperative locking, once it's done (#1480).
And any other appropriate c...
Greg Farnum
01:50 PM rbd Feature #2556 (Resolved): rbd tool: break image locks
Once #1480 is done, expose lock breaking via the rbd tool. Greg Farnum
01:47 PM devops Feature #2555 (Rejected): chef: SECURITY: Re-evaluate where configuration & key handoff gets stored
The current setting seems to mean root on all chef nodes (even ones not running Ceph), and all knife users, have full... Anonymous
01:44 PM devops Feature #2554 (Rejected): chef: open question: How do we discover what disks we should use as Cep...
For Crowbar, see #2574.
- This is somewhat a dangerous operation, run accidentally it will clobber a lot of data. ...
Anonymous
01:43 PM devops Feature #2553 (Closed): crowbar: open question: What's the correct way to add RBD support to the ...
We'll need to get set --volume-driver etc in nova.conf,
glance-api.conf, etc. So I guess we need to (temporarily) fo...
Anonymous
01:36 PM devops Feature #2415 (In Progress): upstart: support radosgw
Sage Weil
01:21 PM devops Feature #2552 (Rejected): chef: admin tool to generate config in json (uuid, secret)
The environment needs things like... Anonymous
01:12 PM Bug #2551 (Rejected): leveldb broke "make distcheck"
... Anonymous
01:03 PM devops Feature #2549 (Resolved): ceph-disk-prepare: take fstype, mkfs and mount options from ceph.conf
See #2548 for similar need. Anonymous
01:02 PM devops Feature #2548 (Resolved): ceph-disk-activate: take mount options from ceph.conf
Anonymous
01:02 PM devops Feature #2547 (Resolved): ceph-disk-prepare: handle partitioning and mkfs
spawn gdisk in a subprocess.
How much protection do admins need to avoid ceph-disk-prepare /dev/sda mistakes?
Anonymous
01:00 PM devops Feature #2546 (Resolved): ceph-disk-prepare: take fsid from ceph.conf (support --cluster=name)
Anonymous
12:49 PM devops Feature #2498 (Resolved): standardize keyring locations for daemons
Sage Weil
10:56 AM Bug #2545 (Resolved): init-ceph: stops if one instance fails to start
Sage Weil
10:52 AM Bug #2543 (Resolved): crush: invalid pointer when outputting local retry histogram for large rang...
caleb miles
10:10 AM rbd Bug #2535: rbd: random data corruption in vm
Guido Winkelmann wrote:
> Sage Weil wrote:
> > Are there multiple partitions or is LVM on the disk, or is the file ...
Sage Weil
10:07 AM rbd Bug #2535: rbd: random data corruption in vm
Sage Weil wrote:
> Are there multiple partitions or is LVM on the disk, or is the file system on the raw device?
...
Guido Winkelmann
09:29 AM rbd Bug #2535: rbd: random data corruption in vm
Are there multiple partitions or is LVM on the disk, or is the file system on the raw device? Sage Weil
05:32 AM rbd Bug #2535: rbd: random data corruption in vm
Am Montag, 11. Juni 2012, 09:30:42 schrieb Sage Weil:
> If you can reproduce it with 'debug filestore = 20' too, tha...
Guido Winkelmann
05:29 AM rbd Bug #2535: rbd: random data corruption in vm
The bug also does not seem to have any effect with the setting "filestore fiemap = false" in ceph.conf. Guido Winkelmann
02:27 AM Bug #2544 (Closed): Help text for "usage show" identical to "usage trim"
cerr << " usage show show usage (by user, date range)\n";
cerr << " usage trim ...
Dan Mick
01:36 AM Revision 6a3ec401 (ceph): Merge remote-tracking branch 'gh/wip-2516-2'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil

06/11/2012

11:39 PM Revision a3f86b80 (ceph): rgw: usage statistics also count num of ops/successful_ops
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
10:48 PM Revision d023cf5c (ceph): debian: move ./configure to correct rules section
Signed-off-by: Laszlo Boszormenyi (GCS) <gcs@debian.hu> Laszlo Boszormenyi (GCS)
10:48 PM Revision d63aca33 (ceph): debian: remove dup python build-dep
Reported-by: Laszlo Boszormenyi (GCS) <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:27 PM Revision af16f459 (ceph): upstart: Read crush location and weight from ceph.conf.
This introduces two new config variables, osd_crush_location
and osd_crush_weight. Not currently included in config_o...
Tommi Virtanen
10:22 PM Revision 8eb5c9b8 (ceph): mon: allow keys to be cleared in ceph osd crush set ... loc key/values
If we encounter a 'key=' with no value, clear any previous value for that
key, so that
'a=foo b=bar a='
is equival...
Sage Weil
10:13 PM Revision 6d8d0592 (ceph): rgw: fix cli test
Get it up to date with latest cli changes.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
10:09 PM Revision 9a1ea4ba (ceph): rgw: usage log cleanups
Also, recalc_round_timestamp should be done under lock.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
09:59 PM Revision 5959798b (ceph): Merge pull request #13 from fghaas/rgw-manpage
doc: update examples in radosgw-admin man page Yehuda Sadeh
09:49 PM Revision 753207c3 (ceph): doc: update examples in radosgw-admin man page
* "user gen" is gone and is now "user create"
* --uid option is mandatory for "user create"
Mailing list thread: htt...
Florian Haas
09:25 PM Revision d2b6e49c (ceph): rgw: rgw_rados related fixes for usage logging
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
09:17 PM Bug #2543 (Resolved): crush: invalid pointer when outputting local retry histogram for large rang...
buggered the memory when we are generating the histogram for a large range of x. caleb miles
09:00 PM Revision 046395cb (ceph): rgw/rgw_cls_api.h cleanup
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
08:25 PM Revision ea2f9552 (ceph): rgw: new config options
New config options for usage logging:
- rgw_enable_usage_log: enable usage logging
- rgw_usage_log_flush_threshold ...
Yehuda Sadeh
08:25 PM Revision baa3aff4 (ceph): rgw: access methods for new usage ops
Wrappers for new rados class methods, and new RGWRados
methods to handle usage functionality.
Signed-off-by: Yehuda ...
Yehuda Sadeh
08:25 PM Revision 744a1b31 (ceph): rgw: usage logger
Accumulate usage info and flush it periodically.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:25 PM Revision 9a56a5b8 (ceph): rgw_admin: usage admin commands
- radosgw-admin usage show
- radosgw-admin usage trim
Both commands can get --start-date, --end-date, --uid
Signed...
Yehuda Sadeh
08:25 PM Revision 9a70ec94 (ceph): rgw: new class methods for handling usage information
The new methods are:
- user_usage_log_add: add new usage information
- user_usage_log_read: get usage information
...
Yehuda Sadeh
07:56 PM Revision d265bb69 (ceph): rgw: don't fail initialization if socket path returns ENXIO
ENXIO is expected when trying to read the unix domain socket
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:31 PM Revision 5792f137 (ceph): workunit: grab 'all' config from the right variable
Josh Durgin
06:57 PM rgw Bug #2542 (Resolved): rgw: support S3 update of metadata
S3 metadata update is being done by copying of an object to itself with new metadata info. Yehuda Sadeh
05:36 PM Revision 78c35a44 (ceph): Merge remote-tracking branch 'gh/wip-crush-tunables'
Sage Weil
04:58 PM Revision 98516834 (ceph): cls_rbd: add get_all_features method
This is useful for reporting which features an osd supports, and for
testing rados_exec. Update the rados api tests t...
Josh Durgin
04:55 PM Revision ee7a027b (ceph): mon: fix pg state logging
PGMap->num_pg_by_state is a PG state to number of PG in the state
mapping. PGMonitor::update_logger wrongly interpret...
Yan, Zheng
04:22 PM Feature #1772 (Resolved): rbd: define new on-disk header format
Sage Weil
11:31 AM Feature #1772 (In Progress): rbd: define new on-disk header format
Sage Weil
03:17 PM Bug #2540 (Resolved): "ceph osd crush set" should treat "foo=" as if foo wasn't mentioned on the ...
Sage Weil
03:12 PM Bug #2540 (In Progress): "ceph osd crush set" should treat "foo=" as if foo wasn't mentioned on t...
Sage Weil
03:09 PM Bug #2540 (Resolved): "ceph osd crush set" should treat "foo=" as if foo wasn't mentioned on the ...
The current behavior, using an empty string as the name, is quite confusing.
Instead of an error message, a better...
Anonymous
03:13 PM RADOS Feature #2541 (Resolved): crush: move command to adjust non-leaf node position
the add or update function is intentionally limited to leaves. allow the hierarchy to be adjusted using a different ... Sage Weil
03:08 PM Feature #2510 (Resolved): update on-disk hobject_t encoding to include pool and namespace fields
Sage Weil
02:13 PM Feature #2539 (Duplicate): ceph should issue timeout message when it can't connect to mon
I forgot to start the ceph service before issuing ceph -s to check its status. The tool happily
waited forever to c...
Dan Mick
11:31 AM Feature #2496 (Resolved): reinstall pudgy
Sage Weil
10:32 AM RADOS Feature #2521 (Resolved): crush: control bucket vs device mark-down probabilities independently
Sage Weil
09:50 AM Linux kernel client Bug #2392: First read of symlink after ceph filesystem mounted gives error
This is going to be easy to fix once the atomic_open stuff is merged. Real Soon Now. Sage Weil
09:40 AM Linux kernel client Bug #2537 (Won't Fix): bad header for RHEL6-like kernels
That backports tree is very old and not maintained. Assuming you do get it working, you'll have 1-2 year old code. ... Sage Weil
05:07 AM Linux kernel client Bug #2537: bad header for RHEL6-like kernels
Sorry,
I forgot to mention that it implies caps.c and super.h files.
For detecting that kernel is RHEL it is mayb...
Yannick Perret
04:28 AM Linux kernel client Bug #2537 (Won't Fix): bad header for RHEL6-like kernels
Hello,
I tried to compile the kernel module (kclient-0.20) and get a problem with ceph_write_inode:
it is declared ...
Yannick Perret
09:33 AM Feature #1773 (Resolved): rbd: class interface for header interaction
Sage Weil
05:37 AM Revision 5012b73a (ceph): Add test for cls_rbd
Josh Durgin
05:01 AM Revision 5cd33cdd (ceph): workunits/rbd: add workunit for running cls_rbd tests
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
04:59 AM Revision 031d42ad (ceph): workunits/rbd: disable remove_with_watcher test
This will fail until #2533 is fixed.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
04:45 AM Revision 68f14b40 (ceph): Test old and new rbd formats
Josh Durgin
04:26 AM Revision 04ef5dcc (ceph): Update for new workunit task syntax
Josh Durgin
04:01 AM Bug #2536 (Can't reproduce): librados crashed while getting stat of an object
librados crashed while getting stat of an object:... Xiaopong Tran
01:43 AM Revision 8af8d0e2 (ceph): workunit: allow setting environment variables
This is useful for e.g. running the same tests against rbd in new and
old formats.
Josh Durgin

06/10/2012

09:58 PM Feature #1400 (Resolved): throw exceptions on unknown encoding
Sage Weil
09:46 PM Feature #2088: msgr: refactor 2 threads to one
Sage Weil
09:46 PM Feature #2149: osd: use omap for snap collections
Sage Weil
09:22 PM Feature #1772: rbd: define new on-disk header format
Sage Weil
09:12 PM Revision 720aa466 (ceph): Merge remote-tracking branch 'gh/wip-rbd-format'
Sage Weil
05:47 PM Feature #1773: rbd: class interface for header interaction
Sage Weil
05:47 PM Feature #1773 (Fix Under Review): rbd: class interface for header interaction
Sage Weil
05:41 PM Linux kernel client Bug #2389: rbd: hung xfstest 67
nightly_coverage_2012-06-10-a 6787 Sage Weil
11:05 AM CephFS Bug #2444: null pointer deference in ceph_d_prune inside kvm
hi,
same bug here on native x86 and amd64 machines.
It affects debian wheezy and ubuntu 12.04 LTS.
I did not check...
Christian Krafft
07:14 AM Revision 3d225461 (ceph): librbd: remove unnecessary notify from add_snap()
The only caller, snapshot_add(), already does a notify when add_snap()
succeeds.
Signed-off-by: Josh Durgin <josh.du...
Josh Durgin
07:14 AM Revision 44e5f3dc (ceph): librbd: ignore RBD_MAX_BLOCK_NAME_SIZE when generating object ids
The actual data object ids don't need to be artificially restricted in
length. RBD_MAX_BLOCK_NAME_SIZE just limits t...
Josh Durgin
12:24 AM Revision 5b40b4b9 (ceph): librbd: add create2 to create an image with the new format
This will fail if features are requested that the client or server
does not support. Currently there are no features ...
Josh Durgin
12:24 AM Revision 857bbccc (ceph): rbd: update for the new format
No features exist right now, so there are no extra options for them.
The old format is still used by default, and sin...
Josh Durgin
12:24 AM Revision 074ffc59 (ceph): rbd.py: add support for creating images in the new format
The new arguments are optional, so they are backwards compatible.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
12:24 AM Revision 123250b4 (ceph): test_librbd, test_rbd.py: optionally use the new format
Read the desired features from an environment variable, RBD_FEATURES.
Signed-off-by: Josh Durgin <josh.durgin@inktan...
Josh Durgin
12:24 AM Revision a5026303 (ceph): rados: add commands to interact with object maps
The input values are stored as-is, and any values read are dumped in
hex. Rename listomap to listomapkeys to distingu...
Josh Durgin
12:24 AM Revision 3d7925b1 (ceph): workunits/rbd: allow creating images in different formats
This will allow adding more arguments for testing combinations
of features in the future.
Signed-off-by: Josh Durgin...
Josh Durgin

06/09/2012

09:45 PM Revision b25f2770 (ceph): doc: Segregated set up git from clone.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:06 PM rbd Bug #2535: rbd: random data corruption in vm
The information that *should* let us fully diagnose:
* set
debug osd = 20
debug filestore = 20
debug ms = ...
Sage Weil
08:04 PM rbd Bug #2535 (Resolved): rbd: random data corruption in vm
From ML:... Sage Weil
04:27 PM CephFS Bug #1947 (Need More Info): mds: SIGBUS during _mark_dirty
It looks liek this one still lives on:... Sage Weil
05:30 AM Revision c5042fec (ceph): CrushTester: some whitespace
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:30 AM Revision f2fa31b3 (ceph): CrushTester: simplify, clean up mark down
- put it in a separate function
- operate on temporary weight vector, not user-modified input
- guard the whole thing...
Sage Weil
05:30 AM Revision 24364579 (ceph): crushtool: drop useless clitest
This is an ancient test for an old 'bug' in functionality we're removing.
Also, it is sensitive to tester output, whi...
Sage Weil
05:30 AM Revision 9b0195bf (ceph): CrushTester: randomize mark_down behavior
When marking a batch of devices down, select which buckets to visit sort
of randomly.
Signed-off-by: caleb miles <ca...
caleb miles
05:10 AM Revision 83377755 (ceph): debian: ceph depends on python
Due to the new ceph-disk-* scripts. Newer dists are smart enough to figure
this out, but maverick is not.
Signed-of...
Sage Weil
04:37 AM Revision c526bdb5 (ceph): CrushTester: drop range marking
This is subsumed by mark-down-ratio 1 with mark-down-bucket-ratio N.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:28 AM Revision 2deac081 (ceph): CrushTester: mark_down_ratio and mark_down_bucket_ratio
Move functionality to allow user to control bucket vs device mark-down
probabilities independently.
Signed-off-by: c...
caleb miles
04:28 AM Revision 72580769 (ceph): CrushTester: some cleanup
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
04:28 AM Revision 4921bbb9 (ceph): CrushTester: remove dead code
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
12:22 AM Revision 4551808f (ceph): mon: add 'auth get-or-create' variant that outputs a keyring
This is more convenient when you're creating daemons, since you want to
write out a keyring file for them. This lets...
Sage Weil

06/08/2012

11:38 PM Revision bfbed508 (ceph): librbd: use ImageCtx members instead of the old header in resize()
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
11:38 PM Revision 7376a773 (ceph): librbd: Update ImageCtx for new format
Detect the format when an image is opened by the presence of the
original format header object. Use member variables ...
Josh Durgin
11:38 PM Revision 4eb2138e (ceph): librbd: update ictx_refresh to work with both formats
It now sets the member variables of ImageCtx so other functions
don't have to use the on-disk header. If the features...
Josh Durgin
11:38 PM Revision 00f4e841 (ceph): librbd: check that the current snapid for a snap name matches
Checking that it exists doesn't prevent you from having the snapshot
change out from under you in the following situa...
Josh Durgin
11:38 PM Revision 8e51e060 (ceph): librbd: remove on-disk header argument from helper functions
Make most of them take the parameters they actually use.
trim_image() now takes an ImageCtx, which means remove() mus...
Josh Durgin
11:38 PM Revision f1d69639 (ceph): librbd: use cls_client functions for calling class methods
Use the old or new methods make resize, snapshot add and snapsnhot
remove work with both old and new formats.
Signed...
Josh Durgin
11:38 PM Revision ef38394f (ceph): librbd: make rename work with any header format
Instead of interpreting the header, just copy all the data and
omap values from the original header to the newly name...
Josh Durgin
11:38 PM Revision 127b4259 (ceph): librbd: rename md_oid parameters to header_oid
This is more consistent with the rest of the code now,
and is a bit more clear.
Signed-off-by: Josh Durgin <josh.dur...
Josh Durgin
11:38 PM Revision 5bc34f4a (ceph): librbd: validate order before creating an image
The value must be passed, and it shouldn't be below 4k
(enforced by the command line tool already) or above the
range...
Josh Durgin
11:38 PM Revision dcc7c963 (ceph): cls_rbd: add methods for interacting with the new header format
Add the client side (cls_rbd_client) for testing as well.
librbd will use the functions in cls_rbd_client to interact...
Josh Durgin
11:14 PM Bug #2524: librados crashed while connecting to cluster
Thanks for the update. Yes, we do have different models, including a pool of set number of rados_t instances, etc. Bu... Xiaopong Tran
10:37 PM Bug #2524: librados crashed while connecting to cluster
Xiaopong Tran wrote:
> This is on my system:
> [...]
>
> Does it create a thread to every configured osd or only one...
Sage Weil
09:27 PM Bug #2524: librados crashed while connecting to cluster
I bumped up the threads-max to:... Xiaopong Tran
07:40 PM Bug #2524: librados crashed while connecting to cluster
This is on my system:... Xiaopong Tran
07:17 AM Bug #2524: librados crashed while connecting to cluster
Sage Weil wrote:
> can you cat /proc/sys/kernel/threads-max ? on my system it's only 127837.
Yeah, for each libr...
Sage Weil
07:09 AM Bug #2524: librados crashed while connecting to cluster
can you cat /proc/sys/kernel/threads-max ? on my system it's only 127837. Sage Weil
03:17 AM Bug #2524: librados crashed while connecting to cluster
Ah, formatting... sorry... Xiaopong Tran
03:15 AM Bug #2524: librados crashed while connecting to cluster
Alright, more information. I was thinking, maybe it was the max number of open files, or the stack size is too low, s... Xiaopong Tran
11:04 PM Feature #2496 (In Progress): reinstall pudgy
Sage Weil
09:35 PM Revision 8c08482c (ceph): regression: fix new rados, rbd test yamls
Don't start cluster twice! Sage Weil
09:31 PM Revision 60c684a4 (ceph): objclass: add indent settings header
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision e0fda59d (ceph): objclass: pass strings as const references
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision ba56d812 (ceph): cls_rgw: add indent settings header
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 5750e53e (ceph): test_librbd: don't include a .cc file
Add it as a source like the rest of the gtests.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:31 PM Revision fe46a749 (ceph): class_debug: add indent settings header
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 715c1f13 (ceph): objclass: allow classes to log at different levels
Add a logging level parameter to CLS_LOG, and a CLS_ERR that's
analagous to derr.
For now cls_rgw is all logging at ...
Josh Durgin
09:31 PM Revision 23ebc093 (ceph): objclass: rename omap methods to be consistent with other interfaces
The osd ops and the librados methods all distinguish between keys
and values. It's useful to be able to read a bunch ...
Josh Durgin
09:31 PM Revision 49059b23 (ceph): objclass: add methods to get keys without values, and set multiple values
These will be useful in the new rbd header format. This brings the
API into parity with the available OSD ops for oma...
Josh Durgin
09:31 PM Revision 8c637f56 (ceph): vstart: debug osd classes when debugging is on
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 13aa578d (ceph): DBObjectMap: remove extra semicolon
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 70686c59 (ceph): librbd: remove useless ENOMEM checks
There will be an exception if memory can't be allocated.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:31 PM Revision 92325d0f (ceph): cls_rbd: remove unused test_exec and snap_revert methods
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision fefedc1b (ceph): cls_rbd: add indent settings header
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:03 PM Feature #2337 (Resolved): rgw and rados performance numbers
Sage Weil
10:14 AM Feature #2337: rgw and rados performance numbers
Actually, the specific sprint test is here:
https://docs.google.com/a/inktank.com/spreadsheet/ccc?key=0AnmmfpoQ1_9...
Mark Nelson
09:53 AM Feature #2337: rgw and rados performance numbers
Results are being posted here:
https://docs.google.com/a/inktank.com/folder/d/0B3mmfpoQ1_94amRLQW5YT3l3OG8/edit
Mark Nelson
07:48 PM Revision b2793c42 (ceph): mon: require force argument for 'mds newfs ...'
This is dangerous, make sure they know what they're doing. I bet <someone>
$1 that Greg asked me to do this original...
Sage Weil
06:55 PM Revision 6df344c7 (ceph): run rados, rbd api tests under thrashing
Sage Weil
06:34 PM Revision 3eca3604 (ceph): Cleaned some lingering references to "verbose"
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
06:18 PM Revision 3876dbfd (ceph): Merge branch 'wip-crush-tunables'
Reviewed-by: Sam Just <sam.just@inktank.com> Sage Weil
06:17 PM Revision c0a02a4a (ceph): crushtool: refine tunables warning message
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:02 PM Revision 978d5419 (ceph): re-include assert after boost::pool
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:58 PM Revision c04de2b3 (ceph): crush: generate histogram of choose tries
Optionally populate a histogram of choose descent attempts.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:58 PM Revision dba70eed (ceph): CrushTester: dump histogram of choose attempts
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:58 PM Revision eaad9845 (ceph): CrushTester: clean up output interface
Multiple accessors. Init in ctor. Avoid temp vars in crushtool.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:58 PM Revision cfbe34d5 (ceph): crushtool: --show-* instead of --output-*
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:58 PM Revision 1a2959ed (ceph): crushtool: add --enable-unsafe-tunables option
This is required to adjust tunables.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:58 PM Revision 740aa3df (ceph): crush: compile/decompile tunables
Lines like
tunable <name> <value>
will adjust tunables. Right now only three are defined.
--enable-unsafe-tunabl...
Sage Weil
05:58 PM Revision c4336a3a (ceph): crushtool: arguments to adjust tunables
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:58 PM Revision 48737114 (ceph): CrushTester: optionally output bad mappings
Optionally dump bad inputs to stdout.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:58 PM Revision 4abb53d4 (ceph): crush: fix leaf recursion if we already collided
This just saves us some cycles, but does not effect placement results at
all.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
04:42 PM rbd Bug #2534 (Resolved): librbd: make sure watch is established on same header version as initial re...
Right now there's a race where it doesn't. Greg Farnum
11:16 AM Bug #2533 (Duplicate): osd: watchers tracked by entity_name_t, not by cookie
In the object info, watchers are tracked in a map<entity_name_t, watch_info_t>, but if there are multiple watchers fr... Josh Durgin
10:43 AM Feature #1711 (Resolved): chef: multiple monitor support
Works as of ceph-cookbook.git commit b5cc21bf5b9c3f59474a7dfe38e04ee01b584fa3 and ceph.git commit 7332e9c717fb627d51e... Anonymous
10:12 AM rbd Feature #2531: rbd: fencing broken clients
I talked to Sam about the combination of blacklisting, bad client writes, and changing primaries that we discussed an... Greg Farnum
10:11 AM Linux kernel client Feature #26 (Rejected): statlite
Sage Weil
10:09 AM Linux kernel client Cleanup #2093 (Resolved): ceph-client: messenger: the "to" parameter to read_partial() needs to go
Sage Weil
10:08 AM Linux kernel client Bug #2395 (Resolved): kernel crash after unmap a rdb device while the cluster is down
I'm going go assume this is running the older code and close it. If not, let us know! Sage Weil
10:06 AM rbd Bug #2478 (New): krbd: unmap on 3.4.0: scheduling while atomic...
Sage Weil
10:04 AM Linux kernel client Feature #949 (Rejected): rbd: async writes, flush/barrier
Sage Weil
10:04 AM Linux kernel client Bug #2243 (Resolved): btrfs: warning in orphan_commit_root
Sage Weil
09:51 AM rbd Bug #2532: rbd command allows passing in -K </path/to/secret>, but long version of (--secret) doe...
That's probably best. It is always easier though when all subcommands under the main command, rbd in this case used o... Sam Zaydel
09:00 AM rbd Bug #2532: rbd command allows passing in -K </path/to/secret>, but long version of (--secret) doe...
Oh, i see.
I think the right fix is to make '--secret' and synonym for '--keyfile', and fix up rbd to use the conf...
Sage Weil
08:20 AM rbd Bug #2532: rbd command allows passing in -K </path/to/secret>, but long version of (--secret) doe...
When I try to use --keyfile=<file> with map, it seemingly fails, but using --secret=<file> succeeds. ... Sam Zaydel
08:13 AM rbd Bug #2532: rbd command allows passing in -K </path/to/secret>, but long version of (--secret) doe...
This is part of the rbd cmd helper message. It seems that for the map command one uses --secret.... Sam Zaydel
07:00 AM rbd Bug #2532: rbd command allows passing in -K </path/to/secret>, but long version of (--secret) doe...
the option is --keyfile <file>... where did you see --secret <file> documented? Sage Weil
05:49 AM rbd Bug #2532 (Resolved): rbd command allows passing in -K </path/to/secret>, but long version of (--...
While rolling back a snapshot I succeed when I pass in `-K with location of key file, but it looks like I fail when I... Sam Zaydel

06/07/2012

10:57 PM Revision 8b796974 (ceph): crush: make magic numbers tunable
We have three magic numbers in crush_choose that are now tunable. The
first two control the local retry behavior, in...
Sage Weil
09:38 PM rbd Feature #2531 (Resolved): rbd: fencing broken clients
Sage Weil
08:57 PM Revision 7332e9c7 (ceph): mon: use mode 0600 throughout
Fixes: #2526
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:50 PM Revision 4fa665c1 (ceph): --summary: add total counts, also note free machines
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Dan Mick
07:22 PM Revision 80939977 (ceph): Merge remote-tracking branch 'gh/mon-auth'
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
06:45 PM Bug #2524: librados crashed while connecting to cluster
objdump on the NIF shared library. Xiaopong Tran
06:29 PM Bug #2524: librados crashed while connecting to cluster
This is weird, if the problem is caused by resource exhaustion. I run this app on a machine with i7 CPU (with 8 cores... Xiaopong Tran
09:24 AM Bug #2524: librados crashed while connecting to cluster
This assert means that either a malloc or a call to pthread_create failed. It's probably resource exhaustion of some ... Greg Farnum
04:23 AM Bug #2524 (Won't Fix): librados crashed while connecting to cluster
Librados crahsed while connecting to the cluster.
Here is some log information. Unfortunately, I don't have more i...
Xiaopong Tran
06:35 PM Revision 07169d2e (ceph): doc: Added mount cephfs with fstab.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:21 PM Revision 7d1b32a0 (ceph): osd: include past_intervals in pg query results
This will help us figure out *why* nodes are in the prior set.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-...
Sage Weil
06:21 PM Revision 36a39795 (ceph): OSD: _have_pg should return NULL if pg is not in map
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:21 PM Revision ff470104 (ceph): Merge remote-tracking branch 'gh/wip-assert2'
"So be it"
Reviewed-by: Sam Just <sam.just@dreamhost.com>
Sage Weil
05:19 PM Revision fae1d47a (ceph): deliberately break encoding macros when wrong assert is present
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:18 PM Revision 3571f43c (ceph): misc assert #include cleanup, hackery
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:18 PM Revision a1ae8b6a (ceph): reinclude assert.h after json_spirit
json_spirit clobbers it!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:08 PM Revision e87a66b7 (ceph): doc: Incorporated Sam's comments.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:41 PM Revision 36658552 (ceph): Merge remote-tracking branch 'gh/wip-assert'
Reviewed-by: Sam Just <sam.just@inktank.com> Sage Weil
04:25 PM rbd Documentation #2530 (Closed): Doc: rbd manpage doesn't mention watch; usage does, and it works
Dan Mick
04:20 PM Tasks #2529 (Resolved): debian: Merge packaging changes from Ubuntu 12.04
The package in ubuntu is split to ceph-fs-common (mount helpers), ceph-mds (not in main), etc. Merge what makes sense. Anonymous
03:10 PM rbd Bug #2528 (Resolved): Mounted RBD image appears to go read-only after a snapshot is created
I have been able to repeat this a number of times. Essentially, I create a small rbd device, using the map command in... Sam Zaydel
02:38 PM Revision 3939839b (ceph): doc: Typo fix.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:54 PM Bug #2526 (Resolved): ceph-mon $mon_data_dir/keyring is world readable
gah... commit:7332e9c717fb627d51efcaa3f31473a2c129e876 Sage Weil
01:25 PM Bug #2526 (Resolved): ceph-mon $mon_data_dir/keyring is world readable
Keys to the kingdom, for anyone to grab. ceph-mon --mkfs creates this file, it should enforce the access mode.
ubu...
Anonymous
01:52 PM rgw Bug #2527 (Resolved): RGW may return 409 Conflict when deleting a bucket
If a bucket delete call occurs immediately after running a delete operation on the final remaining object in that buc... Jeremy Hanmer
12:53 PM Bug #2525 (Resolved): librados: some functions are not thread-safe
Some functions are accessing the osdmap without any locks. There are probably other cases like this. Find and fix all... Josh Durgin
03:29 AM Revision 44374bc4 (ceph): new variable lock hid lock() function
Dan Mick
02:19 AM Revision 22863c30 (ceph): mon: set policy for client, mds before throttler
Otherwise we fail the assert in Messenger::set_policy_throttler()!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
02:10 AM Revision 71c2877c (ceph): DBObjectMap: fix some warnings
os/DBObjectMap.cc:197: warning: suggest a space before ';' or explicit braces around empty body in 'for' statement
S...
Sage Weil
01:54 AM Revision 9313cdea (ceph): teuthology-lock: add --summary and --brief options
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Dan Mick

06/06/2012

11:57 PM Revision 0f6d90c7 (ceph): make everyone use our assert #include and macro
...as detected by the previous patch.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:07 PM Revision a30601a1 (ceph): assert: detect when /usr/include/assert.h clobbers us
The normal assert.h is very rude in that it clobbers any existing assert
define and replaces it with its own. An sad...
Sage Weil
11:01 PM Revision 2be716cf (ceph): Merge remote-tracking branch 'gh/wip-msgr-interface'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
11:00 PM Revision 9ec28433 (ceph): pull s3-tests.git using git, not http
Sage Weil
11:00 PM Revision 7d5c7ee8 (ceph): pull s3-tests.git using git, not http
Sage Weil
10:30 PM Revision 165fbd13 (ceph): keyserver: also authenticate against mon keyring
If we don't have a secret, also check in the extra_secrets keyring.
This means we can also authenticate as any users...
Sage Weil
10:26 PM Revision 5d520f17 (ceph): keyring: implement get_caps()
Simple accessor, mirrors KeyServerData.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:26 PM Revision ca433f28 (ceph): mon: share mon keyring with KeyServer
This will let us authenticate against items in the mon keyring, like the
mon. key itself.
Signed-off-by: Sage Weil <...
Sage Weil
09:11 PM Revision 35a61af1 (ceph): Merge remote-tracking branch 'gh/wip_hobject_wpool'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
09:09 PM Revision 47b202ec (ceph): mon: put cluster log at /var/log/ceph/$cluster.log and/or send to syslog
Also, stop breaking it down by event severity on disk. If you want that,
use syslog.
Fixes: #2497
Backport: dho
Sig...
Sage Weil
09:07 PM Feature #1422 (Resolved): libvirt: rbd storage pool
Sage Weil
09:06 PM Feature #2486 (Resolved): crush: evaluate local retry behavior
Sage Weil
09:06 PM Feature #2493 (Resolved): teuthology-lock --status
Sage Weil
09:05 PM devops Feature #2498 (Fix Under Review): standardize keyring locations for daemons
Sage Weil
08:32 PM Revision 7523ff3e (ceph): ceph: simplify 'cluster' mon log handling
It's not a special file in the mon_data directory anymore, but intead
something in archive that will get slurped up n...
Sage Weil
06:29 PM Revision 37705edb (ceph): Merge branch 'wip-crushtool'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
06:28 PM Revision 78b0bea1 (ceph): monclient: be paranoid/defensive about send_log vs log_client==NULL
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:07 PM Revision 58b02f93 (ceph): crushtool: fix cli tests given new less-chatty output, help
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:07 PM Revision af4d8db5 (ceph): crushtool: allow user to select output reporting in blocks
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
05:45 PM Revision 87327c2d (ceph): doc: Added mount cephfs and included it in quick start.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:57 PM Messengers Cleanup #2150 (Resolved): repair the Simple/Messenger interface
Sage Weil
02:06 PM Feature #2497 (Resolved): mon: new cluster logging strategy
commit:47b202ecfdc00996b085a0c0d557564fbaa8bdfe Sage Weil
12:28 PM Feature #2497 (Fix Under Review): mon: new cluster logging strategy
Sage Weil
12:28 PM Feature #2497: mon: new cluster logging strategy
see wip-2497 Sage Weil
01:27 PM Linux kernel client Bug #2523 (Resolved): xfs: xfs_iolock_reclaimable
... Sage Weil
01:22 PM rbd Bug #2522: xfstest #219
ubuntu@teuthology:/a/nightly_coverage_2012-06-05-b Sage Weil
01:21 PM rbd Bug #2522 (Closed): xfstest #219
... Sage Weil
11:30 AM Bug #2518 (Resolved): mon: limit size of paxos log event
Sage Weil
11:29 AM RADOS Feature #2521: crush: control bucket vs device mark-down probabilities independently
Sage Weil
11:27 AM RADOS Feature #2521 (Resolved): crush: control bucket vs device mark-down probabilities independently
--mark-down-ratio -- probability that a device (in eligible bucket) will be marked down
--mark-down-bucket...
Sage Weil
11:27 AM RADOS Feature #2421 (Resolved): crush: quantitatively validate mapping quality
Sage Weil
09:16 AM Bug #2520 (Duplicate): iozone random read/write with 4k block size hangs
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel/6777/focus=6856
User reports iozone random read/write (...
Anonymous
04:27 AM Revision 0167fd0b (ceph): mon: include all types of stuck pgs in health detail
We were just including the last one, which isn't as helpful.
Backport: dho
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:27 AM Revision d4c30da7 (ceph): mon: include pg acting in health detail
Backport: dho
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:27 AM Revision 6c1558ec (ceph): logclient: fix warning
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:20 AM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
Hi Josh,
i've increased osd_min_pg_log_entries to 5000. Let's see if it fixes the problem.
Simon
Simon Frerichs
03:16 AM Revision 37bf2a14 (ceph): test/cli/ceph-authtool: keyring.bin -> keyring
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:16 AM Revision 5b443ebd (ceph): doc: keyring.bin -> keyring everwhere
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:41 AM Revision 120ce3f8 (ceph): Pass up unmodified exceptions from connection.connect()
This allows useful errors to be reported from things like
mismatched hostkeys, etc.
Dan Mick
01:40 AM Revision fac88a40 (ceph): More shortnames fixes:
- Allow shortnames in teuthology-updatekeys as well
- Use list comprehensions instead of map()
Dan Mick
12:47 AM Revision f918049b (ceph): test/: Made omap_bench compatible with teuthology
added --name parsing, made histogram better, made rados_id
configurable, changed object names to use configurable pre...
Eleanor Cawthon
12:09 AM Revision 02a9a013 (ceph): doc: Added the root discussion to deploy with mkcephfs.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:08 AM Revision 993caf81 (ceph): doc: Added chmod for keyring, and moved client.admin user higher.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins

06/05/2012

11:39 PM Revision 85fee040 (ceph): logclient: fix crashes, fix which entries are sent
I was seeing crashes when the monitor tried to send log entries.
* Send log entries that haven't already been sent.
...
Sage Weil
11:39 PM Revision b6059ccc (ceph): monclient: send more log entries when first set is acked
Immediately send more log messages if we had more when the first set was
sent. Otherwise, wait until the next tick t...
Sage Weil
11:39 PM Revision 235f9274 (ceph): logclient: not a dispatcher
Let MonClient and Monitor handle delivery of messages. This puts them in
control and lets them trigger sending of mo...
Sage Weil
11:38 PM Revision dfd9f5f5 (ceph): logclient: limit messages per MLog message
This will avoid sending huge chunks of entries to the monitor and making
its life difficult.
Signed-off-by: Sage Wei...
Sage Weil
11:38 PM Revision 540865f4 (ceph): mon: limit size of each logm paxos event
Limit the number of log events we cram into a single paxos event.
Fixes: #2518
Signed-off-by: Sage Weil <sage@inktan...
Sage Weil
11:33 PM Revision ee945128 (ceph): config_opts: filestore_update_to defaults to 1000
This way, filestores will be auto-upgraded by default.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:09 PM Revision 01ad679a (ceph): DBObjectMap: update check() for new format
Signed-off-by: Samuel Just <sam.just@dreamhost.com> Samuel Just
11:09 PM Revision 19c667fb (ceph): DBObjectMap::init: initialize seq and v to correct values
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision 6ec74acd (ceph): ReplicatedPG: remove_object_with_snap_hardlinks before creating temp obj
hobject_ts must be unique in the filestore. Thus, when we create the
new temp object, the old one must have been del...
Samuel Just
11:09 PM Revision 7c8aac7b (ceph): ReplicatedPG: adjust missing at push_start
When we start recieving an object, we remove the old copy. This will
prevent the primary from using that old copy af...
Samuel Just
11:09 PM Revision 92807c21 (ceph): OSD: exit(0) once filestore is converted
Also, do not upgrade filestore automatically
Signed-off-by: Samuel Just <sam.just@dreamhost.com>
Samuel Just
11:09 PM Revision df7c4072 (ceph): FileStore: skip omap during list_collections
Signed-off-by: Samuel Just <sam.just@dreamhost.com> Samuel Just
11:09 PM Revision ebfde20a (ceph): DBObjectMap: update header comments for new structure
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision 875bec29 (ceph): DBObjectMap,FileStore: Remove IndexedPath parameters from ObjectMap
IndexedPath parameters are no longer needed for getting the object
collections or for supporting the TMAP implementat...
Samuel Just
11:09 PM Revision 56cf461e (ceph): OSD,FileStore: clean up filestore convsersion
Previously, we messed with the filestore_update_collections config
option to enable upgrades in the filestore. We no...
Samuel Just
11:09 PM Revision c64fc291 (ceph): test_object_map: remove DBObjectMapv0
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision a2eedec5 (ceph): osd_types,PG: handle new hobject format in object_info,pg_log
There are also legacy hobject encodings in the pg log and in object_info
attributes on objects.
Signed-off-by: Samue...
Samuel Just
11:09 PM Revision 5250e247 (ceph): test_filestore_idempotent_sequence: add omap
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision ffc5f7ef (ceph): ReplicatedPG: push_start, don't insert empty extent into data_subset
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision 50331ab1 (ceph): FileStore,DBObjectMap: add SequencerPosition argument to ObjectMap
Previously, sequences like:
1. touch (c1, a)
2. link (c1, c2, a)
3. rm (c1, a)
4. setattr (c2, a)
5. clone (c2, a, b...
Samuel Just
11:09 PM Revision d8f8b1fa (ceph): OSD: do not convert an entire collection in one transaction
Previously, we atomically moved the collection out of the way, created a
new collection, moved the contents of the ol...
Samuel Just
11:09 PM Revision d5ab8779 (ceph): src/: Add namespace and pool fields to hobject_t
From this point, hobjects in the ObjectStore will be globally unique. This
will allow us to avoid including the coll...
Samuel Just
11:09 PM Revision 927458cf (ceph): test/ObjectMap: Copy current DBObjectMap implementation
This implementation will be used to test the upgrade process.
Signed-off-by: Samuel Just <sam.just@dreamhost.com>
Samuel Just
11:09 PM Revision dab238b0 (ceph): os/: update CollectionIndex filename encodings
filename encodings now include namespace and pool.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:09 PM Revision 7a3a2d24 (ceph): DBObjectMap: add parse method for old encoding
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision 9ceed879 (ceph): DBObjectMap: version bump for new format
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:09 PM Revision 761140a0 (ceph): FileStore,DBObjectMap: remove ObjectMap link method
hobject_t's are now globally unique in filestore. Essentially, there is
a 1-to-1 mapping from inodes to hobject_t's....
Samuel Just
11:09 PM Revision e083e0e9 (ceph): DBObjectMap: restructure for unique hobject_t's
Previously, the ObjectStore operated in terms of (coll_t,hobject_t)
tupples. Now that hobject_t's are globally uniqu...
Samuel Just
11:09 PM Revision b6a99321 (ceph): DBObjectMap: Implement upgrade from previous format
Also includes tests in test_object_map.cc
Signed-off-by: Samuel Just <sam.just@dreamhost.com>
Samuel Just
10:30 PM Revision 23c72930 (ceph): task/: Added object map benchmarking test
Signed-off-by: Eleanor Cawthon <eleanor.cawthon@inktank.com> Eleanor Cawthon
06:57 PM Revision ec689e3e (ceph): rgw: replace 'should_log' with 'enable_ops_log'
code cleanup, make it more clear about log intention.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:55 PM Revision 8e41ac15 (ceph): mon: clear osd_stat on osd creation/destruction
Reported-by: Vladimir Bashkirtsev <vladimir@bashkirtsev.com>
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:09 PM Revision fb6ea82f (ceph): doc: Added S3 examples to the toctree.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:55 PM Revision 62fe9f42 (ceph): doc: adding code samples for S3 API usage (thanks, DH!)
Signed-off-by: Ross Turk <ross@inktank.com> Emily Popper
05:40 PM Revision 644a615d (ceph): Makefile.am: explicitly mention that -Wl,--as-needed is location-sensit...
Tommi Virtanen
03:15 PM Revision e83a84c0 (ceph): doc: Added ${lsb_release -sc} based on Sam's feedback.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:36 PM Feature #2519 (Resolved): rados: allow setting pg_num and pgp_num when creating a pool
Right now rados mkpool creates a pool with 8 pgs, which is almost always too few. 'ceph osd pool create' accepts pg_n... Josh Durgin
01:04 PM Bug #2518: mon: limit size of paxos log event
Sage Weil
01:03 PM Bug #2518 (Resolved): mon: limit size of paxos log event
dho was having trouble with a 400MB paxos event/record. make LogMonitor limit an individual paxos event to something... Sage Weil
11:42 AM rgw Feature #2517 (Resolved): rgw: limit number of buckets per user (configurable per user)
Yehuda Sadeh
11:37 AM rgw Feature #2516 (Resolved): rgw: new bandwidth-only per-user log
- orthogonal to operations logs
- only aggregate user bandwidth usage (read, write) per date
- rgw sends a perio...
Yehuda Sadeh
11:02 AM Bug #2508: osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobject_t, loff_...
Hi Simon,
If this is at all reproducible, could you try setting osd_min_pg_log_entries higher on all your osds, sa...
Josh Durgin
07:47 AM Bug #2508 (Resolved): osdc/ObjectCacher.cc:761: void ObjectCacher::bh_write_commit(int64_t, sobje...
Hi,
we've random KVM VPS crashes with the following error:...
Simon Frerichs
10:32 AM Feature #2510: update on-disk hobject_t encoding to include pool and namespace fields
Samuel Just
10:15 AM Feature #2510 (Resolved): update on-disk hobject_t encoding to include pool and namespace fields
This will allow hobject_t's to be globally unique in the filestore. That is, there will be a 1-to-1 inode to hobject... Samuel Just
10:31 AM Subtask #2515: allow collection upgrade to use more than one transaction
Samuel Just
10:31 AM Subtask #2515 (Resolved): allow collection upgrade to use more than one transaction
Samuel Just
10:31 AM Subtask #2514: Implement DBObjectMap upgrade from old version
Samuel Just
10:30 AM Subtask #2514 (Resolved): Implement DBObjectMap upgrade from old version
Samuel Just
10:31 AM Subtask #2513: Update DBObjectMap implementation to ignore collection
Samuel Just
10:30 AM Subtask #2513 (Resolved): Update DBObjectMap implementation to ignore collection
This allows us to remove the (coll_t,hobject_t)->seq mapping and directly store the leaf nodes keyed by hobject_t. Samuel Just
10:31 AM Subtask #2512: implement upgrade process for collections
Samuel Just
10:29 AM Subtask #2512 (Resolved): implement upgrade process for collections
also upgrade object_info and pg log encodings Samuel Just
10:31 AM Subtask #2511: Change hobject_t encoding
Samuel Just
10:16 AM Subtask #2511 (Resolved): Change hobject_t encoding
Samuel Just
10:17 AM CephFS Bug #733: cmds crash: mds/LogEvent.cc:88: FAILED assert(p.end())
ok here is a logfile with the following config:
[mds]
debug = 20
debug ms = 1
debug md...
Eric Dold
10:08 AM Subtask #2402 (In Progress): audit calls into osd from pg for locking correctness
Samuel Just
10:07 AM Subtask #2509 (Resolved): create OSDService to limit pg/osd interface
Samuel Just
10:06 AM Subtask #2430: simplify pg removal
Samuel Just
10:06 AM Subtask #2403: remove osd pointer from PG
Samuel Just
10:06 AM Subtask #2333: create queueing for peering messages
Samuel Just
10:06 AM Subtask #825: osd: remove pg map updating from handle_osd_map
Samuel Just
10:06 AM Subtask #2332: move pg queueing into pgs
Samuel Just
10:06 AM Subtask #2282: Handle map updates on a per-pg basis
Samuel Just
09:56 AM rbd Feature #1480: librbd: image locking
lock(entity)
unlock(entity)
new code should lock before open, unlock on close.
the rbd map tool have 'lock lis...
Sage Weil
12:50 AM Revision 044697d1 (ceph): Allow short names to teuthology-lock (e.g. "plana14")
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sam Just <sam.just@inktank.com>
Dan Mick
12:28 AM CephFS Bug #1047: mds: crash on anchor table query
No, I am not sure about that. Only saw the same assert message and a similar trace, so I assumed it to be the same bug. Amon Ott

06/04/2012

11:15 PM Revision fc1f9e0b (ceph): rgw: shutdown init_timer
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
11:12 PM Revision 5087997a (ceph): rgw: try to create fcgi socket through open() first
FCGX_OpenSocket might just exit() without any warning if it fails
to create the socket.
Signed-off-by: Yehuda Sadeh ...
Yehuda Sadeh
04:09 PM rgw Bug #2503 (Resolved): rgw: ungraceful failure when cannot create unix domain socket
Fixed, commit:5087997a1c90ecd1244dc1047a17858607c940f9. Yehuda Sadeh
03:09 PM rgw Bug #2503: rgw: ungraceful failure when cannot create unix domain socket
No, another problem. This refers to the 'rgw socket path' that is being used for fastcgi. Yehuda Sadeh
06:26 AM rgw Bug #2503: rgw: ungraceful failure when cannot create unix domain socket
There was a stupid error in master for a few days that was making noise about the admin socket.. is that what this wa... Sage Weil
03:56 PM Bug #2507 (Resolved): auth: "ceph auth get-or-create-key" argument validation is lacking
This should probably have errored out:
ubuntu@inst01:~$ sudo ceph auth get-or-create-key client.foo borkbork
AQBW...
Anonymous
01:08 PM CephFS Bug #1047: mds: crash on anchor table query
Amon, are you sure you're hitting exactly this bug with your users? This particular one requires hard links to be in ... Greg Farnum
01:04 PM CephFS Bug #733: cmds crash: mds/LogEvent.cc:88: FAILED assert(p.end())
Aww, the actual debug line that's interesting here is generic_dout().
Can you do it again, this time adding "debug =...
Greg Farnum
10:05 AM Messengers Cleanup #2150: repair the Simple/Messenger interface
I scheduled another test run but I don't anticipate any problems — this should be reviewed for merge! Greg Farnum
09:23 AM CephFS Bug #2494: mds: Cannot remove directory despite it being empty.
Note that this was triggered frequently by backuppc runs:
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel...
Anonymous
09:23 AM Linux kernel client Bug #2506: ceph: ceph_add_cap: couldn't find snap realm NNN
Note that this was triggered frequently by backuppc runs:
http://thread.gmane.org/gmane.comp.file-systems.ceph.devel...
Anonymous
06:33 AM Bug #2487 (Resolved): rgw: (re)creating a suspended bucket succeeds
Sage Weil
06:29 AM Bug #2491 (Resolved): watch/notify: racing notify and unwatch
Sage Weil
04:31 AM Revision 000f9d30 (ceph): msg: make clear_pipe work only on a given Pipe, rather than the current...
This way old Pipes that have been replaced can't clear the new Pipe
out of a Connection's link.
We might attempt to i...
Greg Farnum
01:35 AM Bug #2346: xfs filesystem on top of rbd volume corrupts
I am not 100% sure but it looks like kernel 3.2.17-1 fixed the problem. Let's wait 4 weeks to make sure of it. Maciej Galkiewicz

06/03/2012

11:12 PM Revision dde85a20 (ceph): CrushTester: allow build without boost stuff for chi^2 testing
With limited functionality.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:54 PM Revision 11b24f7c (ceph): crush: simiulate using rng; use chi-squared to measure result
Signed-off-by: Caleb Miles <caleb.miles@inktank.com> Caleb Miles
10:25 PM Revision aeb3a8f4 (ceph): crush: check_item_present
True if id is present in the map.
Signed-off-by: Caleb Miles <caleb.miles@inktank.com>
Caleb Miles
10:21 PM Revision 4061ae23 (ceph): ceph_argparse: with_float
Signed-off-by: Caleb Miles <caleb.miles@inktank.com> Caleb Miles
08:51 PM Revision a1c5ef04 (ceph): admin_socket: only init if path is defined
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:41 PM Linux kernel client Bug #2506 (Resolved): ceph: ceph_add_cap: couldn't find snap realm NNN
From Martin Wilderoth on ML:... Sage Weil
05:39 PM CephFS Bug #2494: mds: Cannot remove directory despite it being empty.
Sage Weil

06/02/2012

10:19 PM Revision 3764ca61 (ceph): upstart: simplify start; allow group stop via an abstract job
Use a 'ceph-mds' or 'ceph-mon' event to start instances instead of
explicitly calling start. This avoids the ugly is...
Sage Weil
09:49 PM Revision ac28c260 (ceph): Merge remote branch 'gh/chef-3'
Sage Weil
09:29 PM Revision c48de3ad (ceph): paxos: warn on extreme clock skew
This would have helped us diagnose #2480.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:03 PM Revision 576dff89 (ceph): omapbench: fix warning
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:18 PM Revision ab5139b3 (ceph): omapbench: fix misc warnings
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:10 PM Feature #2003 (Rejected): limit XFS extent fragmentation for rbd
Sage Weil
07:09 PM Cleanup #2435 (Resolved): Remove binary keyring support
Sage Weil
07:09 PM Feature #2450 (Resolved): dho git commit emails
Sage Weil
06:35 PM Feature #2505 (Resolved): Automatically set filestore xattr use omap
It would be really nice if the filestore could automatically figure out if it should set "xattr use omap" depending o... Mark Nelson
03:45 PM Feature #2414 (Resolved): upstart: support mds
updated, tested, merged Sage Weil
03:26 PM Bug #2480 (Rejected): mon: election won't complete
Sage Weil
03:20 PM Bug #2480: mon: election won't complete
this was just a skewed clock. making the log warnings louder. Sage Weil

06/01/2012

11:13 PM Revision 6f3e7a84 (ceph): ReplicatedPG: fix pgls listing, add max listing size
Previously, a client requesting a large pgls could tie up the
osd for an unacceptable amount of time. Also, it's pos...
Samuel Just
11:11 PM Revision d8e07a34 (ceph): objecter: fix pgls
First problem: if the osd returns more entries than we ask for, max_entries
was going negative, and we were requestin...
Sage Weil
08:57 PM Revision 6eb99d48 (ceph): Merge remote-tracking branch 'gh/wip-admin'
Sage Weil
08:54 PM Revision b9ef3149 (ceph): mon: fix slurp latest race
It is possible for the latest version to get out in front of the
last_committed version:
a- start slurping
a- slur...
Sage Weil
08:46 PM Revision f871d837 (ceph): Makefile: include ceph-mds upstart bits in dist tarball
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:02 PM Revision 7a4e9234 (ceph): test/: Added object map benchmarking tool
omap_bench writes configurable objectmaps to a configurable number
of objects and generates latency statistics.
Sign...
Eleanor Cawthon
07:53 PM Revision e0f19525 (ceph): doc: fix autobuild debian source line
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:43 PM rgw Bug #2504 (Resolved): rgw: use multiple notifications objects
In order to spread the load. Yehuda Sadeh
07:35 PM rgw Bug #2503 (Resolved): rgw: ungraceful failure when cannot create unix domain socket
At least need to add a log message before crashing. Yehuda Sadeh
04:44 PM Revision 97d43964 (ceph): mon: throttle client msgr memory
Limit the amount of memory that can be consumed by client messages, similar
to the OSD. Do not limit inter-mon messa...
Sage Weil
04:42 PM Bug #2502 (Resolved): osd: do not mark self back up if internal heartbeats are failing
saw this on congress. blocked up peering on teh cluster bc the osd was up but not doing anything useful due to clogg... Sage Weil
04:38 PM Bug #2501 (Won't Fix): acting.size() < nreplicas shouldn't prevent trimming old maps
Currently, we won't mark a pg clean if we have too few replicas. This is correct, but it means we won't trim maps wh... Samuel Just
04:38 PM Bug #2500 (Resolved): osd: unprotected ::decodes in ReplicatedPG::do_osd_ops
Decoding of data supplied by a client should not be able to crash the OSD. These should each be in a try/catch block. Josh Durgin
04:30 PM Revision 52b28ed6 (ceph): Merge remote-tracking branch 'origin/wip-2491'
Yehuda Sadeh
03:35 PM rgw Feature #2499 (Resolved): rgw: ability to delete users without first emptying and deleting all bu...
Currently there is no way to simply delete a user without first going through and emptying then deleting its buckets.... Matthew Wodrich
01:51 PM Bug #2379 (Resolved): Mon crash after start
commit:b9ef3149f8639e1f3cdbbb81eecb05e42be00711 Sage Weil
10:42 AM devops Feature #2498 (Resolved): standardize keyring locations for daemons
Currently keyring is controlled by g_conf->keyring, with a default search path that makes sense for clients but not m... Sage Weil
10:40 AM Feature #2497 (Resolved): mon: new cluster logging strategy
* putting the log files in $mon_data is probably a bad idea
* they don't get rotated
* they aren't in a sufficientl...
Sage Weil
10:39 AM Feature #2496 (Resolved): reinstall pudgy
Sage Weil
09:41 AM Feature #2495 (Resolved): mon: throttle incoming non-monitor msgr traffic
commit:97d4396415a95f62f6deccda62d105d8d04717a4 Sage Weil
09:02 AM Feature #2495 (Resolved): mon: throttle incoming non-monitor msgr traffic
Sage Weil
04:39 AM Revision d3f855ec (ceph): fix up dist var
This lets you override the default (now precise) in the ceph config yaml,
e.g.
- ceph:
dist: oneiric
branch:...
Sage Weil
04:21 AM Bug #2267: Ceph client crashed after shutting down one mds and osd
Alex Elder wrote:
> It looks to me that in order for msg->bio_iter to be null, the
> value of msg->bio itself must ...
Karol Jurak
02:05 AM Revision 7e3d90ac (ceph): Objecter: tone down linger op messages on tick
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:09 AM Revision af4fe154 (ceph): Change hardcoded oneiric to precise
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
12:04 AM Revision 4a7683cb (ceph): test_stress_watch: exercise watch/unwatch/close from second client
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh

05/31/2012

11:46 PM Revision 95e0a880 (ceph): qa: stress_watch.sh workunit runs test_stress_watch
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:44 PM Revision 95ecf40e (ceph): add rados_stress_watch to regression
Sage Weil
11:42 PM Revision a4c90b72 (ceph): Merge remote-tracking branch 'gh/wip-mon-doc'
Sage Weil
11:01 PM Bug #2379 (Fix Under Review): Mon crash after start
see wip-2379 Sage Weil
10:46 PM Bug #2379: Mon crash after start
triggered this on congress.
looks like latest is a 150 epochs out ahead of last_committed and the actual states:
<pr...
Sage Weil
10:35 PM Revision fb7ce59b (ceph): doc: Added preliminary rbd and ko info.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:20 PM Revision 10b0db3b (ceph): osd: Adding const to methods in OSDMap
Signed-off-by: Adam Crume <adamcrume@gmail.com>
Signed-off-by: Sage Weil <sage@inktank.com>
Adam Crume
06:38 PM Revision 8609caff (ceph): vstart.sh: initialize keyring even if not using cephx
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:37 PM Revision e929d54d (ceph): Merge branch 'master' of ssh://github.com/ceph/ceph
Yehuda Sadeh
02:40 PM CephFS Bug #2494 (Resolved): mds: Cannot remove directory despite it being empty.
Running ceph 0.47.1 on top of XFS I've got at least two directories which used to contain files but are now empty and... Shawn Bohrer
01:47 PM Feature #2493 (Resolved): teuthology-lock --status
something similar to... Sage Weil
11:23 AM Feature #2413 (Resolved): qa: Test co-existence of sysvinit and upstart, 3: upstart controlled
Sage Weil
09:39 AM Feature #2337: rgw and rados performance numbers
Created a google doc with some testing plans. Check your mail! Mark Nelson
09:36 AM CephFS Feature #2492 (New): ceph-fuse: improve ceph-fuse error reporting (especially missing cephx)
If a client-side ceph.conf does not have "auth supported = cephx", but the server has it, ceph-fuse fails to mount, a... Anonymous
05:22 AM Revision fa2f1dea (ceph): admin_socket: fix unit test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:22 AM Revision ea1c0698 (ceph): mon: fix admin socket lock dependency
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:22 AM Revision 594f2bbd (ceph): perf_counters: use bufferlist instead of vector<char>
bufferlist allocates page-sized chunks and avoids realloc, which'll be
more efficient than vector resize doubling jus...
Sage Weil
05:16 AM Revision 51de7c1a (ceph): osd: ack pending notify when unwatching
If an unwatch crosses paths with a notify, we currently force the notifier
to keep waiting. Instead, implicitly ack ...
Sage Weil
04:58 AM Revision 5adc85bd (ceph): admin_socket: pass args separately
This avoids making the callback parse off the command portion on their own.
It also lets them assert that the command...
Sage Weil
03:38 AM Revision e30e937c (ceph): admin_socket: register command prefixes
Allow any command prefix to be registered, provided it is separated by a
space. Previously, we always matched agains...
Sage Weil
03:38 AM Revision ca8f3ba8 (ceph): admin_socket: streamline command set
log flush
log dump
log reopen
config show
config set <var> <val>
perf dump
perf schema
Signed-off-by: Sage Wei...
Sage Weil
03:38 AM Revision 895e24d1 (ceph): admin_socket: streamline 'version' commands
version -- ceph version
git_version -- git sha1
0 -- proto version (for backward compat)
Signed-off-...
Sage Weil
03:38 AM Revision 640eec9c (ceph): Merge remote branch 'gh/wip-admin'
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com> Sage Weil

05/30/2012

11:48 PM Revision c80bd9da (ceph): librbd: Simplify timing init
Remove possibility of set_start_time before set_ictx error
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
11:22 PM Revision d8df1e99 (ceph): rgw: return default policy if policy attr is broken
In case the policy attr on object/bucket is broken
return a default policy, in which the owner is set
as the bucket o...
Yehuda Sadeh
11:22 PM Revision 767664dc (ceph): rgw: put_bucket_info does not override attrs
This fixes #2487. When writing bucket info we just
wrote the object content, and were overriding any
attrs that objec...
Yehuda Sadeh
11:18 PM Revision 04e66b42 (ceph): librbd: Add latency (elapsed-time) stats for rbd operations
Fixes: #2408
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: Jo...
Dan Mick
10:35 PM Revision a24145fc (ceph): log: add missing .cc file
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:28 PM Feature #2477: rados bench cleanup
I think we need both:
* clean up objects after each run
* search+clean up objects from previous (maybe aborted) r...
Sage Weil
10:20 PM Bug #2491 (Resolved): watch/notify: racing notify and unwatch
the following scenario (needs to be verified that it actually happens):
1. client 1 registers watch
2. client 2 reg...
Yehuda Sadeh
10:18 PM Feature #1240 (Resolved): teuthology: fail/clean up when daemons crash
Sage Weil
10:17 PM CephFS Cleanup #814 (In Progress): hadoop: refactor hadoop shim in terms of java libceph bindings
Sage Weil
10:16 PM Feature #2431 (Duplicate): teuthology: qemu + rbd testing
Sage Weil
10:15 PM Revision 52187c3e (ceph): log: uninline SubsystemMap mutators
This lets you do something like
p g_ceph_context->_conf->subsys.set_log_level(ceph_subsys_mon, 20)
from gdb.
Sign...
Sage Weil
10:15 PM Feature #2428 (Fix Under Review): auth: revise auth config params
Sage Weil
10:15 PM Feature #2479 (Resolved): osd: runtime warning/message about syncfs variant in use
Sage Weil
10:06 PM Revision a6f8fe35 (ceph): filestore: fix my broken ifdef flow
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:05 PM rgw Feature #2490 (New): rgw-admin: only register watch when needed
Certain radosgw-admin operations don't need to use the cache notifications (e.g., log manipulation). We shouldn't reg... Yehuda Sadeh
09:54 PM Bug #2346 (Need More Info): xfs filesystem on top of rbd volume corrupts
Has this issue been diagnosed? Sage Weil
09:41 PM Revision 188d1718 (ceph): doc: ceph.conf cleanup for cephx authentication.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:40 PM Revision 695f92d8 (ceph): doc: fix broken link.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:33 PM Feature #2463 (Resolved): adminsocket: 'show_config' command
Sage Weil
09:29 PM Revision d05d98ce (ceph): doc: Added -a to service ceph stop.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:29 PM Revision cb5c6b5d (ceph): doc: Consolidated mkcephfs deployment files.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:28 PM Revision dd1d3880 (ceph): osd: fix null dereference
The pg arg is optional!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:48 PM Revision a3f4d595 (ceph): doc: Added pools and authentication.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:53 PM Revision a167213a (ceph): mon: fix reweight_by_utilization waiting
Propose if we changed something, but not if we don't.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:53 PM Revision 519fadbc (ceph): osd: fix rewewight_by_utilization
Update the incremental, not the in-memory OSDMap!
Fixes: #2454
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:14 PM Revision 07498d66 (ceph): filestore: log about syncfs(2) etc support
Fixes: #2479
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:02 PM Revision 2b856ace (ceph): Merge remote-tracking branch 'gh/stable'
Sage Weil
05:14 PM Revision 6dd93b9a (ceph): doc: Updated authentication with header canonicalization.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:05 PM Revision e9588f0c (ceph): Merge remote-tracking branch 'gh/wip-2349'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
04:55 PM Feature #2408: librbd: track latency with perfcounters
Actually really
commit:c80bd9daea0c436fcf0af33fea9fc6b961279756
Dan Mick
04:32 PM Feature #2408 (Resolved): librbd: track latency with perfcounters
commit:04e66b420174d49e0f773cab1f6ef213dadc3397 Dan Mick
04:30 PM Revision 26ecf608 (ceph): workloadgen: Fix the statistics output and a bug triggered by coll dest...
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
04:13 PM Revision f685f4d8 (ceph): filestore: initialize dummy xattr buffer
Shut up valgrind.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:51 PM Bug #2409 (Resolved): osd: pgs stuck in active
Sage Weil
02:28 PM rbd Bug #2488 (Can't reproduce): rbd: EIO from rbd map
... Sage Weil
01:54 PM Bug #2487 (Resolved): rgw: (re)creating a suspended bucket succeeds
When trying to create a bucket that already exists and is suspended, operation succeeds. Then the bucket appears in t... Yehuda Sadeh
11:51 AM Bug #2454 (Resolved): "rbd info xyz" hanging forever sometimes
reweight-by-utilization was broken (modified in-memory OSDMap instead of updating pending incremental). fixed by com... Sage Weil
11:17 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Sage Weil wrote:
> for f in `seq 3 10`; do ceph osd reweight $f 1 ; done
Thank you very much.
I'm unable to prod...
Simon Frerichs
11:13 AM Bug #2454: "rbd info xyz" hanging forever sometimes
That's too bad.. i'll futz with it some here and see if i can reproduce.
You can fix your cluster with:
for f i...
Sage Weil
11:03 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Sage Weil wrote:
> Ok, the problem is that the weights are different on the osd's and mon's map. I'm not sure how t...
Simon Frerichs
10:55 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Ok, the problem is that the weights are different on the osd's and mon's map. I'm not sure how that could have happe... Sage Weil
10:16 AM Bug #2454: "rbd info xyz" hanging forever sometimes
progress:... Sage Weil
10:12 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Sage Weil wrote:
> Aha, ok I think we're on the right track. osd.2's map for 6924 says [3,15], but the client sent ...
Simon Frerichs
10:08 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Aha, ok I think we're on the right track. osd.2's map for 6924 says [3,15], but the client sent it to [2,15].
Can...
Sage Weil
10:03 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Sage Weil wrote:
> Just realized those were osd.3 messages mixed into the log, that's why it was confusing. However...
Simon Frerichs
09:56 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Just realized those were osd.3 messages mixed into the log, that's why it was confusing. However, osd.3 seems to thi... Sage Weil
09:37 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Sage Weil wrote:
>
> Is it possible to attach the whole osd2 log? Or a section that includes an osd map update. ...
Simon Frerichs
09:14 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Simon Frerichs wrote:
> Simon Frerichs wrote:
> > I've reattached the osd log because the first version hat some ol...
Sage Weil
08:22 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Simon Frerichs wrote:
> I've reattached the osd log because the first version hat some old data in it.
hum, lost ...
Simon Frerichs
08:16 AM Bug #2454: "rbd info xyz" hanging forever sometimes
I've reattached the osd log because the first version hat some old data in it. Simon Frerichs
08:07 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Here we go.
I've started the hanging rbd info process at 2012-05-30 16:46:35.281623
Simon Frerichs
07:32 AM Bug #2454: "rbd info xyz" hanging forever sometimes
Ok, the puzzling bit is that it isn't able to find the pg in question. It appears a bit hgher up in th elog:
May ...
Sage Weil
10:15 AM Feature #2486 (Resolved): crush: evaluate local retry behavior
Sage Weil
10:14 AM Feature #2485 (Resolved): crush: validate hash function
Sage Weil
10:04 AM Feature #2349 (Resolved): rados bench: Extra statistics
Sage Weil
09:28 AM Bug #2484 (Resolved): ceph.spec does not list libuuid as build time dependency
already fixed by commit:e868b44b3959a71c731f4ec9ff9773dead6dfcb5 Sage Weil
08:33 AM Bug #2484 (Resolved): ceph.spec does not list libuuid as build time dependency
See http://marc.info/?l=ceph-devel&m=133795298501327&w=2
I assume it's easy to add this to the .spec (or maybe it ...
Greg Farnum
08:50 AM Bug #2481: ceph tell has almost no error reporting
Greg Farnum wrote:
> This is actually a lot harder than you'd think — "ceph tell" works by sending a message to the ...
Sage Weil
08:14 AM Bug #2481: ceph tell has almost no error reporting
This is actually a lot harder than you'd think — "ceph tell" works by sending a message to the monitors, which is the... Greg Farnum
08:35 AM rbd Bug #2478: krbd: unmap on 3.4.0: scheduling while atomic...
with patch above, this trace appears after ~15 sec from unmap operation
[ 2453.203365] ------------[ cut here ]---...
Andrey Korolyov
08:28 AM CephFS Bug #2187: pjd chown/00.t failed test 97
I haven't gone through the annoyance of counting it out, but it looks like the initially-reported test (112) is in th... Greg Farnum
07:23 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
I've been working for a week or two on refactoring the messaging
code in hopes to make it be based on a much cleaner...
Alex Elder
07:13 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Sorry I haven't updated this for some time.
In response to Yehuda: yes, I noticed that, and my instinct when
fir...
Alex Elder
07:10 AM Bug #2267 (In Progress): Ceph client crashed after shutting down one mds and osd
> The variable which holds the NULL pointer is msg->bio_iter in]
> write_partial_msg_pages in net/ceph/messenger.c
...
Alex Elder
03:41 AM Revision 5842dcda (ceph): osd: fix misdirected op check
The sense of this check was wrong from the get-go, back in a5d99add. Fix
it. This may be silencing a lot of misdire...
Sage Weil
01:34 AM Revision c5015ef3 (ceph): mon: add doxygen-format comments to the PaxosService class.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 28a64fdb (ceph): mon: add doxygen-format comments to the Monitor class.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 1816856e (ceph): mon: Do some aesthetic reformatting on the Paxos class.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 8c95e8bc (ceph): mon: add doxygen-format comments to the Elector class.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 15a30d92 (ceph): mon: Do some aesthetic reformatting on the Elector class.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 05469b2b (ceph): mon: add some info on can_mark_out function
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 504c6ce9 (ceph): doc: Add Paxos call chain to doc/mon/
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
01:34 AM Revision 7064a9c0 (ceph): mon: add doxygen-format comments to the Paxos class.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
12:09 AM CephFS Bug #1047: mds: crash on anchor table query
We are auto creating up to 200 preconfigured home directories on all four nodes, each home dir consists of ca. 400 di... Amon Ott

05/29/2012

11:52 PM Revision 92fec423 (ceph): test_filestore_workloadgen: write zeros, not random bytes
The random byte generation is sufficiently slow that it skews the results.
Zeros are good enough!
Signed-off-by: Sag...
Sage Weil
11:52 PM Revision b46b815a (ceph): test_filestore_workloadgen: less noise
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:31 PM Revision 4e2846b4 (ceph): config: add 'set_config <var> <val>' admin socket command
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:31 PM Revision 29e6f8c5 (ceph): log: add log flush, dump_recent, reopen admin socket commands
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:31 PM Revision bb0c7c6a (ceph): admin_socket: fix unittest
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:57 PM Bug #2454: "rbd info xyz" hanging forever sometimes
Found a copy of the old osdmap at one monitor. I've attached it.
Simon Frerichs
09:40 PM Bug #2454: "rbd info xyz" hanging forever sometimes
Well, i see one (unrelated) oddity in this code, but it doesn't explain how the non-existent message is coming up. (... Sage Weil
11:23 AM Bug #2454: "rbd info xyz" hanging forever sometimes
if you need more information, feel free to ping me. Simon Frerichs
09:07 PM Bug #2409: osd: pgs stuck in active
oh, bother. this is wip-osd-peering not getting backported to dho branch. commit:17114f266a336b6edd7e98975d494fdd48... Sage Weil
12:41 PM Bug #2409: osd: pgs stuck in active
saw this on burnupi. cranked up logging. hopefully we'll catch it again... Sage Weil
08:33 PM Revision 6849f82e (ceph): rados.cc: added listomap and getomap commands
Signed-off-by: Eleanor Cawthon <eleanor.cawthon@inktank.com> Eleanor Cawthon
07:07 PM Revision ab8f6b1b (ceph): admin_socket: initialize explicitly on startup; disallow changes
There is an annoying dependency between the config lock and the admin
socket lock due to the fact that we initialize ...
Sage Weil
07:07 PM Revision a3f81376 (ceph): config: add 'show_config' admin socket command
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:23 PM Revision f953c4c0 (ceph): librados.h: update rados_object_list_next documentation for key/locator
*entry and *key don't need to be freed, and are only valid until the
next call to rados_objects_list_*.
Signed-off-b...
Samuel Just
05:10 PM Feature #2349 (Fix Under Review): rados bench: Extra statistics
Sage Weil
05:09 PM Revision 1277a465 (ceph): cephcontext: generalize admin socket hook
We'll process non-perfcounter commands too.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:20 PM rgw Documentation #2483 (Resolved): doc: radosgw api diffs to swift
Sage Weil
03:19 PM rgw Documentation #1813 (Resolved): doc: document radosgw api diffs with s3
Sage Weil
03:08 PM rgw Bug #2482 (Rejected): rgw: duplicate content-length results in 400
Not sure if it's from apache of haproxy. While this is not really a bug, we may want to change this behavior. Yehuda Sadeh
02:20 PM Bug #2481 (Won't Fix): ceph tell has almost no error reporting
fumbling around as a noob, I tried all of:
ubuntu@burnupi50:/etc/ceph$ ceph osd tell 1 injectargs 'debug osd = 0'
...
Dan Mick
01:43 PM Documentation #155 (In Progress): document ceph auth
John Wilkins
01:41 PM Documentation #2474 (Resolved): re-document using autobuilt branches
Needed to restore from git, update the hyperlinks, and add it into the package section. Verified with Sage. John Wilkins
01:30 PM Feature #2412 (Resolved): qa: Test co-existence of sysvinit and upstart, 2: sysvinit controlled, ...
Changing state to reflect that the case has been tested and passed. Ken Franklin
01:29 PM Feature #2411 (Resolved): qa: Test co-existence of sysvinit and upstart, 1: sysvinit controlled, ...
Updating status to reflect that the fix has been tested and passed. "Resolved" is the closest option for this. Ken Franklin
12:41 PM Bug #2464 (Resolved): osdmap: assert in get_inst()
Sage Weil
09:13 AM Bug #2480: mon: election won't complete
probable regression from mon bootstrap changes. Sage Weil
09:11 AM Bug #2480 (Rejected): mon: election won't complete
... Sage Weil
08:52 AM CephFS Bug #2187: pjd chown/00.t failed test 97
ceph-fuse *and* kclient affected.. mds?... Sage Weil
08:47 AM Feature #2479 (Resolved): osd: runtime warning/message about syncfs variant in use
Sage Weil
04:02 AM Bug #2267: Ceph client crashed after shutting down one mds and osd
The variable which holds the NULL pointer is msg->bio_iter in write_partial_msg_pages in net/ceph/messenger.c. I conf... Karol Jurak
12:47 AM CephFS Bug #1047: mds: crash on anchor table query
This bug seems to be reproduced with the deadlock described on the mailing list as
"OSD deadlock with cephfs client ...
Amon Ott

05/26/2012

01:30 PM rbd Bug #2478 (In Progress): krbd: unmap on 3.4.0: scheduling while atomic...
commit:cd9d9f5df6098c50726200d4185e9e8da32785b3 already fixes this; we just neglected to send it upstream for 3.4. I... Sage Weil
06:05 AM rbd Bug #2478 (Can't reproduce): krbd: unmap on 3.4.0: scheduling while atomic...
On 3.4.0 on the 'rbd unmap' call kernel rbd driver produces following trace(same for vanilla):
[ 8660.786269] BUG...
Andrey Korolyov
09:10 AM CephFS Bug #2385: max mds = 2, mds hang and crash
net.core.wmem_max = 536870912
net.core.rmem_max = 536870912
net.core.wmem_default = 262144
net.core.rmem_default =...
Yavuz Selim Komur

05/25/2012

11:15 PM Revision 73e29b65 (ceph): doc: needed carriage returns between headings, code tag and syntax.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
11:03 PM Revision d33f4fb6 (ceph): doc: removed redundant install.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
11:03 PM Revision 68874afa (ceph): doc: Filled in descriptions for object operations.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:29 PM Revision e868b44b (ceph): ceph.spec.in: buildrequires libuuid-devel
Reported-by: Guido Winkelmann <guido-ceph@thisisnotatest.de>
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:28 PM Revision 5eda9b70 (ceph): ceph.spec.in: drop unnecessary rmdirs
This hunk was inadvertantly included in a330c64.
Reported-by: Guido Winkelmann <guido-ceph@thisisnotatest.de>
Signed...
Sage Weil
02:27 PM Feature #2412: qa: Test co-existence of sysvinit and upstart, 2: sysvinit controlled, using /var/...
sudo service ceph start -pass
sudo /etc/init.d/ceph start - pass
reboot. - pass
sudo initctl list|grep ceph" has o...
Ken Franklin
02:24 PM Feature #2411: qa: Test co-existence of sysvinit and upstart, 1: sysvinit controlled, outside of ...

Tested
sudo service ceph start - pass
sudo /etc/init.d/ceph start - pass
reboot. - pass
sudo initctllist|gre...
Ken Franklin
09:54 AM Feature #2472 (In Progress): osd: add opaque 'class <name> <foo>' cap that class can interpret/en...
Greg Farnum
09:54 AM Feature #2471 (Fix Under Review): osd: add prefix match to OSDCaps
"allow rwx pool=rbd prefix rbd_data; allow x pool=rbd prefix rbd_header"
"allow rwx pool=rbd; deny rw pool=rbd prefi...
Greg Farnum
04:05 AM CephFS Bug #1047: mds: crash on anchor table query
Just hit this bug again with 0.47.2 on ext4, so it is still open and it is still an issue. Amon Ott

05/24/2012

11:54 PM Revision 6601ef95 (ceph): doc: Restored install for dev/unstable packages.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:01 PM Revision ba540623 (ceph): filestore: make SNAP_DESTROY ioctl check more informative
Tell user about user_subvol_rm_allowed mount option.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:53 PM Revision 07c9eda2 (ceph): filestore: fix btrfs subvol ioctl tests
Create a temp subvol that *we own* to test snaps against. Otherwise, we
won't be able to remove them if we weren't t...
Sage Weil
09:07 PM Revision bea1e031 (ceph): filestore: verify that op_seq is nonzero
If we get zero from commit_op_seq something is very wrong, and we should
not continue. Broken mkfs or something dele...
Sage Weil
09:05 PM Revision 18b0d657 (ceph): vstart.sh: wipe out btrfs subvols in osd dirs
For testing on btrfs.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:05 PM Revision 0f38d756 (ceph): filestore: fix initial btrfs snap creation (on mkfs)
Create the snap_1 snap of current *after* we write the op_seq. Otherwise
we will mistakingly believe the snapshot is...
Sage Weil
08:07 PM Revision 3ae84eec (ceph): rgw: replace dump_format() with dump_int()
Cleanup, fixes issue #2469.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
04:47 PM Revision ca79f45a (ceph): Merge branch 'stable'
Josh Durgin
04:45 PM Revision 9a9418eb (ceph): librados, ObjectCacher: include limits.h for INT_MAX
This was required to build on CentOS 6.2. I'm not sure why it worked
without it on Debian.
Signed-off-by: Josh Durgi...
Josh Durgin
04:37 PM Revision ea6c546e (ceph): Added python-dev to list of required packages.
Sam Zaydel
01:55 PM rgw Cleanup #2469 (Resolved): rgw: replace Formatter->dump_format(..., "%d", ...) with Formatter->dum...
Fixed, commit:3ae84eec2dbcb83a31924cc348e9b91157ecb3b1. Yehuda Sadeh
11:02 AM Bug #2475: rbd.py can leave Image object in inconsistent state on failure to construct
In the interactive interpreter, "the next statement" is where the interpreter knows the prior scope is closed, and de... Dan Mick
10:35 AM Bug #2475 (Resolved): rbd.py can leave Image object in inconsistent state on failure to construct
commit:8c1c882999b8eef2dbf4fca066a5978e547db83e
Dan Mick
10:54 AM Feature #2477 (Resolved): rados bench cleanup
Right now rados bench write leaves a lot of data just lying around the cluster, without a good way to clean itself up... Greg Farnum
09:16 AM Bug #2476 (Resolved): osd: watch timeout depends on operations to an object
The watch timeout is an in-memory thing that's local to the primary. If the primary changes, the timer for ending the... Josh Durgin
09:16 AM rgw Bug #2465 (Resolved): rgw: bad marker output when listing a bucket
Fixed, commit:9094dce4cc232fc70fbc6d4789a79dd6af6ff4b6. Yehuda Sadeh
05:43 AM CephFS Bug #733: cmds crash: mds/LogEvent.cc:88: FAILED assert(p.end())
here you go. a log with ms = 1 and mds = 20.
dirs and files are replaced with 'o's.
Eric Dold
02:35 AM Revision 8c1c8829 (ceph): rbd.py: Fix seemingly-random segfaults
Avoid resetting Image.closed until image is successfully opened.
Fixes: #2475
Signed-off-by: Dan Mick <dan.mick@inkt...
Dan Mick
02:33 AM Revision 2f963fa7 (ceph): librados: Remove errant sha1
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Dan Mick
02:33 AM Revision 6e46de37 (ceph): rbd.py doc: Fix markup on :class: references
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Dan Mick
 

Also available in: Atom