Project

General

Profile

Activity

From 02/21/2012 to 03/21/2012

03/21/2012

11:41 PM Revision 2e21adf2 (ceph): Objecter: resend linger_ops on any change
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
11:35 PM Revision b47454b6 (ceph): ObjectStore: add COLLECTION_MOVE to dump
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
11:35 PM Revision 23313ee6 (ceph): FileStore: whitelist COLLECTION_MOVE on replay
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
11:35 PM Revision ec52eeb2 (ceph): FileStore: remove src on EEXIST during collection_move replay
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
11:35 PM Revision 52aff487 (ceph): ObjectStore: Add collection_move to generate_instances
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:01 PM Revision 3caa4319 (ceph): ceph: define and use a shell_scripts Makefile variable
Define a variable "shell_scripts" in the Makefile.in, and use it
along with some pattern rules to avoid some duplicat...
Alex Elder
09:01 PM Revision 1b2a0669 (ceph): ceph-kdump-copy: add tools for saving kdumps
This puts in place an init script and a command it runs to save a
kernel core dump to a remote server when a panic or...
Alex Elder
08:41 PM Bug #2199: mon: get_bl osdmap_full/9583 No such file or directory
kept logs for the failing monitor under /var/log/ceph/2199 Yehuda Sadeh
08:26 PM Bug #2199 (Resolved): mon: get_bl osdmap_full/9583 No such file or directory
Happened on congress (afair, off 0.41). One monitor is out for more than a month. Following network outage, both moni... Yehuda Sadeh
07:00 PM Revision 6f0f250b (ceph): suite: add missing print statement
Josh Durgin
06:58 PM Revision 8a9a5670 (ceph): suite: fix print statement when summary doesn't exist
Josh Durgin
04:59 PM Feature #2198 (New): add an option to force a down osd to be marked immediately out
Samuel Just
02:25 PM rgw Bug #2197 (Resolved): rgw: need to throttle incoming requests
In case we can't handle requests, we'd end up accepting requests indefinitely thus we consume fds endlessly. This wil... Yehuda Sadeh
01:30 PM Revision d0e8f148 (ceph): doc: update list of debian dists
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:28 PM Revision a608a8fe (ceph): Merge branch 'stable'
Sage Weil
12:52 PM Bug #2196 (Resolved): `rados bench` will write test objects with a constant oid, under-reporting ...
(As discussed on @#ceph@, 2012/03/21 -- with thanks to @joshd@)
The command @rados bench@ generates a sequence of ...
David McBride
08:21 AM Bug #2178: rbd: corruption of first block
The next object is whatever the MBR points to. You can find the object name from the sector offset that gdisk gives y... Josh Durgin
02:55 AM Bug #2178: rbd: corruption of first block
Hi Josh,
thanks for taking the time to investigate this... And yes, many others show the same behaviour. Is "the n...
Oliver Francke
06:50 AM Feature #2127: Save kernel core dumps on all of our test machines
I seem to remember seeing a reference to 'mkcrashrd', a mkinitrd type script that generates the initrd image the cras... Sage Weil
02:00 AM Revision 91c08f6e (ceph): Add watch op to rados.py
Signed-off-by: Samuel Just <sam.just@dreamhost.com> Samuel Just
12:51 AM Revision 72361784 (ceph): Objecter: resend linger_ops on any change
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
12:51 AM Revision 3019d460 (ceph): TestRados: Add watch
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
12:20 AM Revision 2998368a (ceph): rgw: remove unused definition
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:20 AM Revision 4760536f (ceph): rgw: keep pool placement info also in cacheable location
Mirror the pools placement info, so that we can cache it.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
12:00 AM Revision f1563a66 (ceph): Revert "Objecter: add op->resend_on_any_change"
This reverts commit c53194d75390dd6d5aa4a9a33f741cbd106e3338.
recalc_linger_op_target is used for linger_ops
Signed...
Samuel Just

03/20/2012

11:11 PM Revision 2daff0e9 (ceph): ReplicatedPG: osd_max_notify_timeout -> osd_default_notify_timeout
This setting should not override user specified timeout.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
11:11 PM Revision c53194d7 (ceph): Objecter: add op->resend_on_any_change
lingers must be resent even if the primary does not change.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
11:11 PM Revision fc7a1bda (ceph): ReplicatedPG: return -EBUSY on delete for objects with watchers
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
10:15 PM Revision 6a5cbec3 (ceph): rgw: replace bucket_id generation
bucket_id is now string: <global instance id>.<num> where
num is increasing monotonically within the current rgw
inst...
Yehuda Sadeh
09:07 PM Feature #2127 (In Progress): Save kernel core dumps on all of our test machines
I finally have crash dumps getting packaged and sent over to a
remote machine reliably. The problem is that it does...
Alex Elder
06:59 PM Bug #2178: rbd: corruption of first block
I looked at the block you attached, and compared it to the first 4MiB of my desktop's hard drive. It looks like it co... Josh Durgin
03:58 AM Bug #2178: rbd: corruption of first block
Hi *,
any update on this topic? Cause we are working for hours and days with three people to rescue as many images...
Oliver Francke
06:41 PM Revision cdd5298d (ceph): v0.44
Sage Weil
05:59 PM Revision e42fbb70 (ceph): rgw: process default alt args before processing conf file
this fixes #2189
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:52 PM Revision e0b8f7a0 (ceph): rgw: process default alt args before processing conf file
this fixes #2189
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:37 PM Revision 51a07339 (ceph): rgw: incrase socket backlog
20 is too small
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:25 PM Revision 5b331987 (ceph): rgw: fix internal cache api
This fixes issue #2190
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:14 PM rgw Bug #2193 (Resolved): rgw: .pools.avail is not cached
Fixed, commit:4760536fe573c702bac8fb1d51213d76059e32dc.
We now mirror the info in the object. Still keeping the om...
Yehuda Sadeh
09:28 AM rgw Bug #2193 (Resolved): rgw: .pools.avail is not cached
Probably due to recent omap changes, we don't cache omap operations. Either we cache it, or just keep available pools... Yehuda Sadeh
03:10 PM rgw Feature #2194 (Resolved): rgw: replace bucket-marker-ver with better, fast, more scalable solution
Fixed, commit:6a5cbec38b761d524e699e2a7410a340d093ccca. Yehuda Sadeh
09:32 AM rgw Feature #2194 (Resolved): rgw: replace bucket-marker-ver with better, fast, more scalable solution
We use this object in order to create unique prefix for bucket objects (we do it at bucket creation). Instead of this... Yehuda Sadeh
02:50 PM Revision 815fc3e2 (ceph): suite: failed runs might not have durations
This was one cause of emails not being sent - stale /tmp/cephtest dirs
fail without recording a duration.
Josh Durgin
10:47 AM rgw Bug #2189 (Resolved): rgw: can't change debug level through ceph.conf
Fixed, commit:e0b8f7a0331b0ceee54a911bb9231cb168eb2d0f. Yehuda Sadeh
10:28 AM rgw Bug #2190 (Resolved): rgw: cache disabled
Fixed, commit:5b3319870ea9d6c715c671e006e3a772008e3e78. Yehuda Sadeh
09:43 AM CephFS Feature #2195 (Resolved): Allow removal of last MDS if there's no filesystem
Right now you can't remove the last MDS from your cluster, which means that if you aren't using it and it's off you w... Greg Farnum
05:43 AM Bug #2192 (Won't Fix): ceph-mon hangs consuming 100% CPU
I have a test setup of two nodes each running 0.43 mds, mon and osd. I mount ceph kernel filesystem at /srv/ceph on b... Vladimir Kulev

03/19/2012

11:36 PM Cleanup #2191 (Resolved): reexamine simple_spinlock
We've got a homebrewed spinlock implementation in src/common/simple_spin.h/cc. It was written so we could use dout in... Greg Farnum
11:10 PM Revision f923b840 (ceph): OSD: do not hold obc lock in disconnect_session_watches
ObjectContext::lock is used only for implementing read_lock and
write_lock. PG::lock is used to protect the ObjectCo...
Samuel Just
09:16 PM Revision a65d4136 (ceph): suite, coverage: use absolute dirs for isdir checks
This fixes the results to wait for all jobs to complete again. Josh Durgin
06:57 PM Revision bdb72c28 (ceph): filestore_idempotent: get coverage and coredumps
Josh Durgin
06:31 PM Revision 6c8db1a8 (ceph): suite: more results logging
Josh Durgin
05:34 PM rgw Bug #2190 (Resolved): rgw: cache disabled
in master branch only, due to internal api change. Yehuda Sadeh
05:33 PM rgw Bug #2189 (Resolved): rgw: can't change debug level through ceph.conf
Yehuda Sadeh
05:12 PM Bug #2188 (Resolved): mon: mds rm should be harder to break things with
If you run ceph mds rm 0 on a healthy cluster, it breaks the Monitor's world. I'm uncomfortable with the command exis... Greg Farnum
04:04 PM Bug #2183 (Resolved): osd: lockdep cycle with obc lock and watch_lock
pushed to master f923b840edec79df5791a7fb7fdec8b0b40f25f1 Samuel Just
03:33 PM Bug #2183: osd: lockdep cycle with obc lock and watch_lock
I believe it's inappropriate to hold obc->lock there anyway, pg lock serves that purpose. Samuel Just
11:07 AM Bug #2185: osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_range()
In the wip-rbd-bid branch that I pushed last week I added an option to the rbd tool to create images using existing d... Yehuda Sadeh
11:01 AM Bug #2185: osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_range()
should be pretty easy to rebuild the xattr, removing the object would corrupt the rbd image Samuel Just

03/18/2012

10:36 PM Bug #2173: MDS crash when start with end of buffer
I have managed to start mds server after resetting the journal. So I can get my data back.
Thanks very much to all o...
soft crack
06:56 PM Revision 7173a8af (ceph): ceph.conf: no comment
Sage Weil
06:06 PM Revision 7de798f6 (ceph): ceph.conf: set 'filestore xattr use omap = true'
Sage Weil
05:50 PM Revision 7d2e1056 (ceph): fix teuthology-ls isdir check
Sage Weil
05:48 PM Revision 94f0ba1e (ceph): run valgrind with cwd set to /tmp/cephtest/archive/coredump
This lets us capture the vgcore.* files, which always go to valgrind's
cwd.
Fixes: #1953
Sage Weil
04:09 PM Revision fd851304 (ceph): ReplicatedPG: there should be no object_contexts during on_activate
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:08 PM Revision 6c17a7b3 (ceph): Merge branch 'next'
Sage Weil
04:08 PM Revision 77c08f86 (ceph): osd: fix object_info.size mismatch file due to truncate_seq on new object
If the first write that creates an object includes a truncate_seq and
truncate_size, we were taking the truncte patch...
Sage Weil
01:46 PM CephFS Bug #2187 (Can't reproduce): pjd chown/00.t failed test 97
on both ceph-fuse and kclient, nightly_coverage_2012-03-17-a,
> 1727 FAIL scheduled_teuthology@teuthology collection...
Sage Weil
01:43 PM CephFS Bug #2159 (Resolved): ceph-fuse: big_writes option not recognized
Sage Weil
12:09 PM Bug #2080 (Resolved): osd: scrub on disk size does not match object info size
Sage Weil
12:09 PM Bug #1953: teuthology: core files aren't archived when using valgrind
Sage Weil
12:07 PM Bug #2164: osd: scrub missing _, snapset attrs
this was non-btrfs, right after the new idempotent replay stuff was fixed. Sage Weil
10:50 AM Bug #2186 (Can't reproduce): osd: shutdown race
... Sage Weil
10:07 AM Bug #2180 (Resolved): osd/ReplicatedPG.cc: 3381: FAILED assert(obc->watchers.size() == 0)
Sage Weil

03/16/2012

11:59 PM Revision 619fe730 (ceph): .gitignore: xattr_bench
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:06 PM Revision 3a6c085e (ceph): heartbeatmap: use utimes(2) instead of futimens(2)
For poor users with ancient glibc. We don't much care about rename races
here anyway.
Signed-off-by: Sage Weil <sag...
Sage Weil
09:36 PM Revision 63ec06b3 (ceph): osd: remove special handline for head recovery from clone
This breaks because:
- we don't have the head or current snapset
- get_object_context() creates a new snapset, whi...
Sage Weil
08:49 PM Revision d8bcc1b3 (ceph): config: fix recursive locking of md_config_t::lock
Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil
08:30 PM Revision 58c5d5a0 (ceph): osd: ReplicatedPG::create_object_context()
New helper that creates a new object context.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:30 PM Revision d4addf57 (ceph): osd: re-use create_object_context() in get_object_context()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:30 PM Revision 15d85af4 (ceph): osd: explicitly create new object,snap contexts on push
We specifically want to use this during recovery to avoid loading the obc
or ssc for a previous version of the object...
Sage Weil
08:28 PM Revision 01924a22 (ceph): disable rbd thrash workload, #2174
Sage Weil
08:04 PM Revision 96780bd1 (ceph): osd: create_snapset_context()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:15 PM Revision 872bdd0d (ceph): osd: ensure we don't clobber other *contexts when registering new ones
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:52 PM Revision 9791035d (ceph): Merge branch 'wip_omap_xattrs'
Samuel Just
06:44 PM Revision 07b97fe7 (ceph): suite: log results and coverage generation
Need to figure out where and when results emails are failing. Josh Durgin
06:40 PM Revision 2a593dda (ceph): RadosModel: test xattrs with omap
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:40 PM Revision a49a1972 (ceph): ReplicatedPG,FileStore: clone should copy xattrs as well
_make_clone (called from make_writeable) and _rollback_to included
attr reads from head or a clone. In that case, an...
Samuel Just
06:40 PM Revision 14506dc6 (ceph): FileStore: add support for omap xattrs
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:31 PM Revision a5f143d2 (ceph): Merge branch 'wip-msgr4'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Sage Weil
06:29 PM Revision 983fd190 (ceph): ObjectMap: add interface for storing xattrs
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:29 PM Revision d8325e50 (ceph): DBObjectMap: implement xattr interface
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:29 PM Revision fdb92748 (ceph): test_object_map: update unit test for xattr
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:29 PM Revision 8fc43179 (ceph): config_opts.h: opts for omap_xattrs
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:29 PM Revision ecd875fe (ceph): tests/: Added xattr bench
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:29 PM Revision b09fb15d (ceph): ObjectMap: use Index object for locking rather than path object
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:28 PM Revision 9fd4a12a (ceph): DBObjectMap: add support for storing xattrs
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:18 PM Bug #2185: osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_range()
strace indicated we had a missing xattr on
2268 stat("/data/osd0/current/164.2_head/rb.0.0.000000000000__head_DA6...
Sage Weil
06:02 PM Bug #2185: osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_range()
... Sage Weil
03:33 PM Bug #2185: osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_range()
Here output from osd.3 after recent crash:
root@fcmsnode3:/data/osd3/current# find 0.0_head
0.0_head
0.0_head/10...
Oliver Francke
03:22 PM Bug #2185 (Won't Fix): osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_ran...
... Sage Weil
06:01 PM Bug #2173: MDS crash when start with end of buffer
Talked more on irc, soft crack is trying to reset his journal since it looks like at least all his metadata objects a... Josh Durgin
04:43 PM Bug #2173: MDS crash when start with end of buffer
osd map file for 'ceph osd getmap 3212 -o /tmp/osdmap' soft crack
01:31 PM Bug #2173: MDS crash when start with end of buffer
Greg: look at the osd dump above: all pools are rep size 3. Josh Durgin
01:13 PM Bug #2173: MDS crash when start with end of buffer
Did all the pools get set to 3x replication, or are the confused PGs all part of the metadata pool? Greg Farnum
12:26 PM Bug #2173: MDS crash when start with end of buffer
Could you attach the output of 'ceph osd dump 3212' and the binary version of that osdmap (ceph osd getmap 3212 -o /t... Josh Durgin
09:56 AM Bug #2173: MDS crash when start with end of buffer
Unfortunately we can see that this assert too is caused by ENOENT on an object that really ought to be there, which m... Greg Farnum
08:19 AM Bug #2173: MDS crash when start with end of buffer
Can you post an mds log with debug mds = 20 leading up to that last crash?
Resetting the journal is not something ...
Sage Weil
08:15 AM Bug #2173: MDS crash when start with end of buffer
I managed to insert a empty sessionmap. The server continue starting.
And I get an assert error:...
soft crack
05:39 PM Revision 0904c7b7 (ceph): configure: fix warnings
Finally!
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:04 PM Revision f2e6b8d7 (ceph): ReplicatedPG: populate_object_context during handle_pull_response
A cached objectcontext should always have its watchers populated.
Signed-off-by: Samuel Just <samuel.just@dreamhost....
Samuel Just
04:43 PM Revision 4cfc34f8 (ceph): leveldb: .gitignore TAGS
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:40 PM Revision 5db6902b (ceph): leveldb: un-revert
Accidentally reverted by c2af646b38995ba005140e748a21baba4263e53f.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:33 PM Bug #2080: osd: scrub on disk size does not match object info size
wip-2080 Sage Weil
01:33 PM Bug #2184 (Resolved): audit calls to populate_obc_watchers and add watch/notify to RadosModel
Samuel Just
01:32 PM Feature #2125 (Resolved): osd: put large xattrs in leveldb
Samuel Just
01:20 PM Bug #2183: osd: lockdep cycle with obc lock and watch_lock
crashed it with this mutl... Sage Weil
01:18 PM Bug #2183 (Resolved): osd: lockdep cycle with obc lock and watch_lock
... Sage Weil
12:04 PM Bug #2180: osd/ReplicatedPG.cc: 3381: FAILED assert(obc->watchers.size() == 0)
Hi Sage,
here the according log after upgrading and starting 0.43-1...
Hope it helps,
Oliver.
Oliver Francke
08:56 AM Bug #2180 (Resolved): osd/ReplicatedPG.cc: 3381: FAILED assert(obc->watchers.size() == 0)
... Sage Weil
10:58 AM Bug #2182 (Resolved): audit osd reads for reads from potentially unstable objects
In particular, there are places we read object_info and snapset outside of the get_object_context and get_snapset_con... Samuel Just
10:50 AM Bug #2181 (Won't Fix): 4051: FAILED assert(!missing.is_missing(soid)) in ceph version 0.43-244-g9...
v0.43 and this commit from master aren't compatible; the final v0.44 will have a protocol rev to prevent this problem. Sage Weil
10:21 AM Bug #2181 (Won't Fix): 4051: FAILED assert(!missing.is_missing(soid)) in ceph version 0.43-244-g9...
Hi Sage,
here u r. This was the version, which failed, too, after all others didn't help either... Similar with al...
Oliver Francke
09:13 AM Bug #2132: FAILED assert(!missing.is_missing(soid))
Oliver Francke wrote:
> Well,
>
> its tagged as resolved, but today another node died...:
>
> osd/ReplicatedPG...
Sage Weil
05:36 AM Bug #2132: FAILED assert(!missing.is_missing(soid))
Well,
its tagged as resolved, but today another node died...:
osd/ReplicatedPG.cc: In function 'void Replicated...
Oliver Francke
04:19 AM Bug #2178: rbd: corruption of first block
Here is one of many, where the header is missing:
--- 8-< ---
fcms@fcmsnode3:~$ rbd ls 1320396354
vm-451-disk-1....
Oliver Francke
12:34 AM Revision 8fbd087d (ceph): results: make sure email is sent before anything else fails
Josh Durgin

03/15/2012

06:08 PM Bug #2173: MDS crash when start with end of buffer
Sorry for mistake.
ceph osd dump -o -:
2012-03-16 09:10:04.887611 mon <- [osd,dump]
2012-03-16 09:10:04.888161...
soft crack
06:01 PM Bug #2173: MDS crash when start with end of buffer
ceph -s:... soft crack
10:43 AM Bug #2173: MDS crash when start with end of buffer
Well that's exciting; this means it's an OSD bug.
The meaning of that output is that of your 209 PGs, 185 are happy;...
Greg Farnum
05:35 PM Revision 89ccd95a (ceph): osd: maybe clear DEGRADED on recovery completion
We set degraded if we don't have enough "active" replicas, which excludes
the backfill target. We need to recheck th...
Sage Weil
05:32 PM Revision b4572351 (ceph): Revert "disable rbd thrash workload, #2174"
This reverts commit 1bec416c7c7ff8a6462d94baaba8e7da73e88ab4.
Fixed with #2174
Sage Weil
12:58 PM rgw Feature #1941 (Rejected): rgw: revisit bucket removal
Sage Weil
12:57 PM rgw Feature #785 (Rejected): rgw: fix filesystem backend
Sage Weil
10:29 AM Bug #2160 (Resolved): active+recovering+degraded+backfill becomes active+clean+degraded when reco...
Sage Weil
09:49 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
The test that reproduced the problem has now run once to completion
without hitting it. Therefore it's ready to shi...
Alex Elder
08:35 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
... Alex Elder
07:57 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
Thanks Alex. I remember thinking it fixed a race initially, but then going back later and being unable to find the ra... Josh Durgin
07:43 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
That's excellent Josh. I'll use it, it's basically what I was
thinking of doing anyway, now I'll just use yours. D...
Alex Elder
07:38 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
That analysis of the race looks correct to me. The first unapplied patch in wip-rbd would have fixed this (9a3e22a0ce... Josh Durgin
07:14 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
... Alex Elder
07:12 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
I think I can explain this:
[ 265.117432] INFO: trying to register non-static key.
[ 265.149933] the code is ...
Alex Elder
12:16 AM Revision 826d30f1 (ceph): rgw: remove extra layer of RGWAccess
Not needed, now that we got rid of RGWFS
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh

03/14/2012

11:33 PM Revision 80e2a5e8 (ceph): msgr: switch all users over to abstract interface
This will let us transparently swap implementations out.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:29 PM Revision 1e1453c1 (ceph): msgr: introduce static Messenger::create() function
Create a new messenger, with whatever implementation is appropriate.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:29 PM Revision d26feffd (ceph): msgr: promote more methods to abstract Messenger interface
This will be everything that people actually use.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:01 PM Revision c2af646b (ceph): rgw: put_obj() uses bufferlist instead of extra alloc/copy
makes it cleaner.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
11:01 PM Revision 2b3bfd0c (ceph): rgw: remove fs backend
was broken anyway
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
10:51 PM Revision 1bec416c (ceph): disable rbd thrash workload, #2174
Sage Weil
08:53 PM Linux kernel client Bug #2174: rbd: iozone thrashing failure
I tried reproducing the problem, and although I'm not sure I know
how to recognize it my test did end in failure.
...
Alex Elder
09:54 AM Linux kernel client Bug #2174 (Can't reproduce): rbd: iozone thrashing failure
consistently failing
- ceph:
log-whitelist:
- wrongly marked me down or wrong addr
- objects unfo...
Sage Weil
08:32 PM Revision e14d428c (ceph): Merge branch 'master' of github.com:ceph/teuthology
Mark Nelson
08:32 PM Revision 2b879905 (ceph): Merge branch 'master' of github.com:ceph/teuthology
Mark Nelson
08:01 PM Revision a81b23e2 (ceph): Merge branch 'next'
Sage Weil
07:59 PM Revision bec47b57 (ceph): introduce CEPH_FEATURE_OMAP
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:55 PM Revision 8c96fd26 (ceph): leveldb: new .gitignore entry
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:14 PM Revision 20d11714 (ceph): osd: rev cluster internal protocol
This covers:
- the push/pull changes in 0.43 (which we forgot to protect against; see
#2132)
- the new omap stuff ...
Sage Weil
06:23 PM Bug #2173: MDS crash when start with end of buffer
Thanks for your responses.
I created this ceph file system with 1 mon, 1 osd, 1 mds. It works perferctly, and I wr...
soft crack
04:11 PM Bug #2173: MDS crash when start with end of buffer
Huh. Is this a new filesystem? Have you had any problems with the RADOS cluster (the OSDs)?
What's happening now i...
Greg Farnum
04:16 AM Bug #2173: MDS crash when start with end of buffer
I also tried: 'ceph-mds -i 1 -d --reset-journal 0'.
It just freeze.
soft crack
04:14 AM Bug #2173 (Resolved): MDS crash when start with end of buffer
My system is ubuntu 11.10 64bit. Mds just crashes when startup.
I noticed the message: 'No such file or directory'...
soft crack
04:36 PM Revision a0bcab5a (ceph): ceph-fuse: make big_writes optional via 'fuse big writes'
Fixes: #2159
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:35 PM rgw Bug #2001 (Resolved): radosgw memory leak
At this point I can't see any other leak (I already fixed one). Doesn't mean that there isn't another one, but I'm re... Yehuda Sadeh
04:08 PM CephFS Bug #2179 (Resolved): mds: don't crash on nonexistent SessionMap
Inspired by #2173. When the MDS tries to load the SessionMap it unconditionally decodes it, which causes a crash if t... Greg Farnum
03:46 PM Feature #2127: Save kernel core dumps on all of our test machines
http://linux.die.net/man/8/netdump
this mechanism looks simpler?
Sage Weil
02:55 PM Feature #2127: Save kernel core dumps on all of our test machines
Wed Mar 14 11:14:50 CDT 2012
OK, I got kernel core dumps and crash working in Ubuntu 11.10.
A lot of what I use...
Alex Elder
02:54 PM Feature #2127: Save kernel core dumps on all of our test machines
Oh, I forgot to mention I also wrote a little program that extracts
identifying information from a dump file that "k...
Alex Elder
02:48 PM Feature #2127: Save kernel core dumps on all of our test machines
I have been able to generate a core dump on an Ubuntu system.
I have transferred the result using scp to another hos...
Alex Elder
02:16 PM Bug #2178 (Resolved): rbd: corruption of first block
Sage Weil
01:02 PM Bug #2132 (Resolved): FAILED assert(!missing.is_missing(soid))
Sage Weil
12:01 PM Bug #2132: FAILED assert(!missing.is_missing(soid))
Aha, that explains it... the 0.42.2 and 0.43 interaction looks like the culprit here. We should have made them expli... Sage Weil
11:37 AM Bug #2132: FAILED assert(!missing.is_missing(soid))
All cephfs workload. It could be a versioning issue, I don't have the syslogs anymore that would show when I updated ... Matthew Roy
11:19 AM Bug #2132 (Need More Info): FAILED assert(!missing.is_missing(soid))
Matthew Roy: What was the nature of the workload? rbd? ceph fs? Samuel Just
11:06 AM Bug #2132: FAILED assert(!missing.is_missing(soid))
Josh Durgin wrote:
> stxShadow saw this as well.
It looks like in stxshadow's case, it was a version mismatch (cr...
Sage Weil
11:23 AM CephFS Cleanup #2177 (Resolved): mds: play nicely with omap
Convert the MDS to use OMAP properly.
There is at least one specific thing: right now it has optimizations for whe...
Greg Farnum
10:46 AM Bug #2176 (Resolved): dependencies not checked by autoconf
I recently resurrected a build of the user-mode and kernel clients on CentOS and found that I was missing a few packa... Anonymous
10:09 AM rgw Feature #2171: rgw: asynchronously calculate md5
Actually, I think it'll be easier doing it the other way around. As we already write the object asynchronously we can... Yehuda Sadeh
10:08 AM Documentation #2175 (Resolved): doc: fix doc build errors
e.g., http://ceph.newdream.net/gitbuilder-doc/log.cgi?log=a0bcab5a583e6c1fd87430252590ec902d1b6b98
It would be gre...
Sage Weil
09:56 AM Bug #2022: osd: misdirectect request
Just saw this with a different workload:... Sage Weil
09:51 AM CephFS Bug #2071: kclient: pjd mkfifo failures
hit this again:... Sage Weil
09:49 AM rgw Cleanup #2166 (Resolved): rgw: make sure librgw doesn't link against libfcgi
Fixed, commit:e19417ef55c713e60c61edd0de7c2228953407a1. Yehuda Sadeh
09:48 AM rgw Bug #2170 (Resolved): librgw references g_ceph_context
Fixed, commit:5912312c14a6214f4318fd7bfb6fd08714458b6f. Yehuda Sadeh
12:21 AM Revision 5912312c (ceph): rgw: remove some more globals from librgw
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:04 AM Revision 213a3f5e (ceph): rgw: fix identation
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:04 AM Revision d90298de (ceph): ceph-dencoder: don't use rgw types if configured without rgw
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh

03/13/2012

11:40 PM rgw Feature #2172 (Resolved): rgw: get chunks asynchronously
Chunks are read synchronously. We need to have a window of chunks that are read asynchronously (as with PUT). Yehuda Sadeh
11:38 PM rgw Feature #2171 (Rejected): rgw: asynchronously calculate md5
When doing a PUT we calculate the md5 of the content (used later for the etag) synchronously. We need to be able to c... Yehuda Sadeh
11:23 PM Revision a9d18975 (ceph): Merge branch 'master' of github.com:ceph/ceph
Johnw
11:22 PM Revision 60524aba (ceph): Added documentation for building the ceph documentation.
Johnw
09:59 PM Revision b9097619 (ceph): rgw: get rid of references to g_ceph_context where required
trickling down ceph context.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
08:48 PM Revision e6969258 (ceph): global: drop yellow warning on startup
Fixes: #2143
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:48 PM Revision e455d388 (ceph): doc: update project status/stability blurb
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Reviewed-by: Mark Kampe <mark.kampe@dreamhost.com>
Sage Weil
07:55 PM Revision e5934f10 (ceph): qa: kclient/file_layout.sh: ...
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:46 PM Revision 0a2068fc (ceph): Merge branch 'librados-cleanup'
Reviewed-by: Sage Weil <sage.weil@dreamhost.com> Josh Durgin
06:46 PM Revision 8f278647 (ceph): librados: split into separate files and remove unnecessary headers
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:46 PM Revision 5f92f338 (ceph): librados: move methods that require an IoCtx to IoCtxImpl
RadosClient still does a few different things, but at least it
no longer does all the work of an IoCtx.
Signed-off-b...
Josh Durgin
06:46 PM Revision db126279 (ceph): ObjectCacher: remove unused and crufty atomic sync operations
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:46 PM Revision 095c3a0e (ceph): OSDMap: make get_pools() const
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:46 PM Revision 16f99606 (ceph): osd_types: use uint64_t for ObjectExtent offsets and lengths
This is just client in-memory state, and allows us to address objects >4GiB,
to match the existing librados/Objecter ...
Josh Durgin
05:49 PM Revision b90354db (ceph): thrash: put client on separate machine from osds
This allows us to run kenrel clients (kclient, rbd) against the thrashing
cluster.
Sage Weil
05:09 PM Revision 5c9acbd8 (ceph): gitbuilder: put flavor last
in case we refine the field later Sage Weil
05:02 PM Revision 1a01ccaa (ceph): Pull from new gitbuilder.ceph.com locations.
Simplifies the flavor stuff into a tuple of
<package,type,flavor,dist,arch>
where package is ceph, kenrel, etc.
typ...
Sage Weil
01:56 PM Bug #2132: FAILED assert(!missing.is_missing(soid))
stxShadow saw this as well. Josh Durgin
01:45 PM Cleanup #2143 (Resolved): Remove ALL "don't use this product" warnings
Sage Weil
01:31 PM Feature #2145 (Resolved): doc gitbuilder
Sage Weil
12:28 PM Linux kernel client Bug #2064: ceph-client: messenger: nocrc flag not implemented correctly
Update: the commit had to be rebased, so it's id is now: 4d3e7aa992
Alex Elder
08:09 AM Linux kernel client Bug #2064: ceph-client: messenger: nocrc flag not implemented correctly
This is fixed by this commit:
086da4c6f8 libceph: fix inverted crc option logic
That is now present in the c...
Alex Elder
12:26 PM Linux kernel client Bug #2157: ceph: xattr: fix nanosecond display on i_rctime
This has been fixed in this commit:
260ac0e65b ceph: fix three bugs, two in ceph_vxattrcb_file_layout()
The comm...
Alex Elder
12:26 PM Linux kernel client Bug #2156: ceph: xattr: fix a possible buffer overrun bug
This has been fixed in this commit:
260ac0e65b ceph: fix three bugs, two in ceph_vxattrcb_file_layout()
The comm...
Alex Elder
12:26 PM Linux kernel client Bug #2155: ceph: xattr: wrong value assumed for "no preferred PG"
This has been fixed in this commit:
260ac0e65b ceph: fix three bugs, two in ceph_vxattrcb_file_layout()
The comm...
Alex Elder
11:01 AM rgw Bug #2170: librgw references g_ceph_context
Ouch. Mostly through dout, but there are other references. Yehuda Sadeh
10:40 AM rgw Bug #2170 (Resolved): librgw references g_ceph_context
2012-03-13T00:48:30.009 INFO:teuthology.task.workunit.client.0.err:OSError: /tmp/cephtest/binary/usr/local/lib/librgw... Sage Weil
09:31 AM rgw Feature #2169 (Resolved): rgw: api to control bucket placement
It'd be nice to be able to control which pool the bucket would be placed in when creating it. Yehuda Sadeh
12:02 AM Revision 98792e93 (ceph): rgw: add more meaningful tests instances of encoded objects
this completes #2140
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh

03/12/2012

11:46 PM Revision dfe50bcb (ceph): build-doc: use alternate virtualenv dir, if specified
The docs gitbuilder will use this to avoid rebuilding the virtualenv on
every build.
Signed-off-by: Sage Weil <sage@...
Sage Weil
10:22 PM Revision 096427d5 (ceph): remove dup ceph tasks from new thrash workloads
Sage Weil
10:12 PM Revision 6869c576 (ceph): test_idempotent: fix global_init call
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:01 PM Revision 401a5716 (ceph): qa: kclient/file_layout.sh poking
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:59 PM Revision 7e0e7ce1 (ceph): rgw: modify bucket instance for encoding test
This makes 'make check' happy, otherwise we need to create
a bucket name that starts with a period. This version is b...
Yehuda Sadeh
09:58 PM Revision 9969bba8 (ceph): FileStore: clarify debug/error output
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:58 PM Revision 6ca070ad (ceph): FileStore: ignore ERANGE and ENOENT on replay
The source object may either not exist or be the wrong size
during replay if the destination object was deleted in a ...
Samuel Just
09:58 PM Revision f20f6ea7 (ceph): test: add test_filestore_idempotent2
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
09:41 PM Revision e19417ef (ceph): Makefile.am, rgw: remove fcgi dependency where not needed
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
09:22 PM Revision 3ff5492c (ceph): rgw: add more meaningful test instances of some encoded objects
still need to add tests for other objects
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
08:22 PM Revision 31578c19 (ceph): rgw: tone down some log messages
dout(0) -> dout(1)
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
08:21 PM Revision 452b1248 (ceph): rgw: provide different default values for 'debug rgw'
Currently rgw and radosgw-admin require different chattiness
defaults.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdre...
Yehuda Sadeh
08:21 PM Revision 2c88f85c (ceph): config: alternative config options for global_init()
We want to be able to provide alternative default config values, than
the ones we set in common/config_opts.h. This c...
Yehuda Sadeh
08:13 PM Revision 3833ada8 (ceph): Made the example better with multiple roles.
Mark Nelson
07:33 PM Revision 0a61ffad (ceph): Added some example yaml files and an example parallel execution task.
Mark Nelson
07:11 PM Bug #2168 (Resolved): qa: plana won't select correct kernel for boot
Just need to run chef once before trying to install another kernel. Josh Durgin
05:38 PM Bug #2168: qa: plana won't select correct kernel for boot
Josh Durgin
04:26 PM Bug #2168 (Resolved): qa: plana won't select correct kernel for boot
the grub.cfg has default set to the kernel we want, but it boots up into a different one.
no console, so hard to d...
Sage Weil
07:08 PM Feature #2052 (In Progress): librbd: caching
Josh Durgin
07:08 PM Feature #2053 (New): librados: caching
Josh Durgin
07:01 PM Revision 968d29d1 (ceph): qa: use recent kernel for kernel_untar_build.sh
Happier on oneiric!
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
06:43 PM Revision 93164393 (ceph): rgw: switch ops log flag to use ceph config
It's turned on by default. So now we're using the
'rgw enable ops log' config param in ceph.conf, instead
of RGW_SHOU...
Yehuda Sadeh
06:22 PM Revision c9898f43 (ceph): filestore: fix op_num offset/labels
Start at 0, not 1.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
06:20 PM Revision 7c753498 (ceph): config: tmap to omap upgrade, true by default
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
05:00 PM Feature #2142 (In Progress): Add support for Ubuntu 12.04 (precise pangolin) in deb repository
Sage Weil
04:58 PM rgw Feature #2140 (Resolved): rgw: use new encode/decode macros
Yehuda Sadeh
03:01 PM rbd Feature #2167 (Duplicate): rbd: report the allocated size/space of a RBD with "rbd info <rbd name>"
With "rbd info <rbd name>" it reports the provisioned space, but not the allocated space (rbds are thin provisioned). Stefan Kleijkers
02:58 PM CephFS Bug #1737: ceph-fuse crash in xlist::remove
ubuntu@teuthology:/a/nightly_coverage_2012-03-12-b/727 Sage Weil
01:19 PM rgw Cleanup #2089 (Resolved): rgw: less dout(0) noise?
Fixed, commit:31578c19ffe9276a3e6495707e72180436ef3272. I've also set different 'debug rgw' default for rgw and rados... Yehuda Sadeh
01:09 PM Bug #2098 (Resolved): xfs/ext4 non-idempotent transaction
Sage Weil
12:18 PM Feature #2145 (In Progress): doc gitbuilder
Sage Weil
11:56 AM rgw Feature #2040 (Resolved): rgw: disable rgw log through ceph.conf
Sage Weil
10:07 AM Feature #2125: osd: put large xattrs in leveldb
Sage Weil
09:39 AM Feature #1956 (Resolved): rgw: revisit atomic GET/PUT
Sage Weil
04:50 AM Revision 2b9e7bc5 (ceph): clusters/fixed-3.yaml: 2 -> 6 osds
plana nodes have 3 scratch disks... use them! Sage Weil
04:32 AM Revision 51d817fe (ceph): Revert "disable s3tests on valgrind/lockdep until #2103 is fixed"
This reverts commit 9f757ca9511374f6565d74263e242c74e39f8a3f. Sage Weil
04:28 AM Revision af445189 (ceph): add rbd, kclient workloads to regression thrash collection
This will get us some kernel osd_client osd restart coverage. Sage Weil
04:16 AM Revision 3bba6b72 (ceph): Makefile: link libfcgi to librgw
Need this to make a linker error go away on my squeeze dev box. We
probably need to make sure librgw doesn't touch f...
Sage Weil
03:36 AM Revision 945218cc (ceph): qa/workunits/kclient/file_layout: escape *
Escape * so that it is expanded as root.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil

03/11/2012

10:24 PM Feature #2020 (Duplicate): collectd: submit plugin upstream
Sage Weil
09:47 PM Bug #2160 (Fix Under Review): active+recovering+degraded+backfill becomes active+clean+degraded w...
See wip-2160 for a fix. Sage Weil
09:41 PM rgw Cleanup #2166 (Resolved): rgw: make sure librgw doesn't link against libfcgi
Currently it does, and seems to need to to build on squeeze. Sage Weil
09:18 PM Bug #2165 (Resolved): osd: recovering ending with missing
... Sage Weil
09:16 PM Bug #2164 (Resolved): osd: scrub missing _, snapset attrs
... Sage Weil
08:03 PM Revision 71e6e62e (ceph): fix typo, ceph-fyuse -> ceph-fuse
Sage Weil
07:31 PM Revision a8497874 (ceph): filejournal: less log noise
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:20 PM Feature #2163: qa: full xfstests on rbd
iirc xfstests normally wants a block device it is free to mount/umount/reformat/etc. we can do this on top of rbd. Sage Weil
03:18 PM Feature #2163 (Resolved): qa: full xfstests on rbd
Sage Weil
02:49 PM Feature #1929 (Resolved): teuthology: log runtime
Sage Weil
04:01 AM Revision b84897e5 (ceph): use dbench workunit, not the autotest one
The autotest one uses an old tarball that doesn't build. Workunit assumes
the dbench package is installed.
Sage Weil
03:15 AM Revision 008cf7fd (ceph): autotest: pull from github.com/ceph/autotest
Sage Weil

03/10/2012

11:34 PM Revision 2124129e (ceph): workunit: include python2.7 path too
Sage Weil
04:42 PM Feature #2114 (Resolved): old sepia setup on new hardware
Sage Weil
04:42 PM Feature #2162 (Resolved): change gcov, notcmalloc gitbuilders to oneiric
Sage Weil
04:55 AM Revision 0851b747 (ceph): filestore: remove unused bool idempotent
This was from the old broken mechanism.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
01:07 AM Revision 6980ed98 (ceph): filestore: fix arguments
From a change that was rebased out; missed this caller.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
12:42 AM Revision ea34aa59 (ceph): Merge remote branch 'gh/wip-2098'
Reviewed-by: Samuel Just <samuel.just@dreamhost.com> Sage Weil
12:34 AM Revision 9b2a5c5d (ceph): filestore: sync object_map on _set_replay_guard()
We need to sync the object_map too. We can _almost_ check to see if there
are keys for the object and only do it the...
Sage Weil
12:32 AM Revision 51372a76 (ceph): filestore: guard replay of collection_add
- set guard on apply
- check guard on replay
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:32 AM Revision 6613b415 (ceph): filestore: guard collection_remove replay
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:32 AM Revision 13afce83 (ceph): filestore: remove old post-idempotent transaction trigger_commit
The old strategy was to initiate a commit after any non-idempotent
transaction. This only worked if the transaction ...
Sage Weil
12:32 AM Revision c9550c59 (ceph): omap: add sync method to ObjectMap
Signed-off-by: Samuel Just <rexludorum@gmail.com> Sage Weil
12:32 AM Revision 808812a2 (ceph): filestore: use proper type for readdir_r tmp
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:32 AM Revision a6077f6c (ceph): filestore: pass trans_num into _do_transaction
This gives us the <op_seq, trans_num, op_num> triple to identify every
constituent operation.
Signed-off-by: Sage We...
Sage Weil
12:32 AM Revision babb1e90 (ceph): add SequencerPosition type
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:32 AM Revision e81bc971 (ceph): filestore: fgetxattr helpers/wrappers
Also, do the getxattr using fgetxattr, to avoid duplicating code. This is
slightly slower probably because we open a...
Sage Weil
12:32 AM Revision 3b47c6ba (ceph): filestore: maintain SequencerPosition during _do_transaction
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:32 AM Revision f8a047f2 (ceph): filestore: implement _set_replay_guard, _check_replay_guard
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:32 AM Revision e99940d1 (ceph): filestore: guard clone replay
- set guard xattr on clone, clone_range
- check before applying/replaying
Signed-off-by: Sage Weil <sage.weil@dreamh...
Sage Weil
12:32 AM Revision 63d2c93d (ceph): filestore: fix collection_rename error code
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:32 AM Revision dd284c4f (ceph): filestore: guard collection_rename replay
- check guard on replay
- set guard on apply
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
12:32 AM Revision 4ea6e0d2 (ceph): filestore: guard replay of basic collection ops
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil

03/09/2012

10:24 PM Revision 11844978 (ceph): levedb: fix commit
This got reverted back to the old commit, somehow.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
10:13 PM Revision 184b1802 (ceph): Merge branch 'master' of github.com:ceph/ceph
Sage Weil
10:12 PM Revision 7f9cfdd9 (ceph): RadosModel: fix omap_clear case in RemoveAttrsOp
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
10:03 PM Revision fce180df (ceph): Merge branch 'wip-rgw-encode'
Conflicts:
src/rgw/rgw_cls_api.h
Reviewed-by: Sage Weil <sage@newdream.net>
Sage Weil
10:01 PM Revision b66cdea6 (ceph): Makefile.am: update link dependencies for some unit tests
Yehuda Sadeh
10:01 PM Revision b4f05788 (ceph): test/encoding/import.sh: fix target directory
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
10:01 PM Revision b71d814c (ceph): ceph-object-corpus: added 0.43
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
10:01 PM Revision 799e6f97 (ceph): rgw: fix rgw_cls_list_ret ctor
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:01 PM Revision 09949b3d (ceph): rgw: various encoding related fixes
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
09:34 PM Revision e43546de (ceph): osd: fix watch_lock vs map_lock ordering
watch_lock is inside map_lock (and pg->lock), which means we need to
drop it to take pg->lock here. That means verif...
Sage Weil
08:26 PM Revision 80d92528 (ceph): osd: update_heartbeat_peers as needed
Before, we were being very careful about updating the heartbeat peers if
new PGs were created or when certain types o...
Sage Weil
05:05 PM Feature #2162 (Resolved): change gcov, notcmalloc gitbuilders to oneiric
Sage Weil
04:39 PM Bug #2098: xfs/ext4 non-idempotent transaction
merged. passing initial tests. will hammer on it more with sam's new thing. Sage Weil
03:31 PM Bug #2161 (Resolved): nonlinear scaling for PGMap::pg_stat encode
... Sage Weil
02:03 PM rgw Feature #2140: rgw: use new encode/decode macros
we should add some semi-meaningful test instances for these objects, otherwise all looks good Sage Weil
01:34 PM Bug #2103 (Resolved): osd: lockdep error on watch_lock
Sage Weil
11:07 AM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
Hrm... well, if that's the case then we should merge the wip-2116 as is, then... putting that on my list
Sage Weil
10:58 AM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
I'm not willing to say it's completely gone, but it is starting to seem like it.
In the past few days the cluster ...
Wido den Hollander
09:04 AM Bug #2160 (Resolved): active+recovering+degraded+backfill becomes active+clean+degraded when reco...
In a configuration with 3 replicas of each PG (I haven't tested with others), when one of the disks is replaces, some... Alexandre Oliva
08:16 AM CephFS Bug #2159 (Resolved): ceph-fuse: big_writes option not recognized
... Sage Weil
08:06 AM Revision 619244d8 (ceph): rgw: implement dump() for encoders
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:58 AM Revision 2277fb45 (ceph): rgw: add stubs for dencoder test
still need to add some content to the dump methods
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
12:15 AM Revision 71323ffe (ceph): Merge remote branch 'origin/wip-2139'
Conflicts:
src/cls_rgw.cc
src/rgw/rgw_rados.cc
src/rgw/rgw_rados.h
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@drea...
Yehuda Sadeh
12:14 AM Revision ddc1ab0c (ceph): rados.py: include setattr and rmattr
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just

03/08/2012

11:54 PM Revision 64e3ffb2 (ceph): Merge branch 'master' of ssh://github.com/ceph/ceph
Yehuda Sadeh
11:18 PM Revision ac28b603 (ceph): ceph: document the way files are laid out
This adds a document that I wrote about how Ceph client file data
is striped across Ceph objects to the repository. ...
Alex Elder
11:09 PM Revision 1ed1d466 (ceph): librados: fix unit test for omap_get_vals_by_key rename
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:06 PM Revision ce43bd53 (ceph): osd: format time nicely in ops_in_flight output
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:06 PM Revision 934ee487 (ceph): librados: fix map -> std::map in header, string -> std::string
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:02 PM Revision fafbddd9 (ceph): doc: example of diagnosing radosgw hang
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:55 PM Revision 99cb9945 (ceph): doc: explain how unfound objects happen
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:55 PM Revision 47ce7ade (ceph): doc: make osd failure example include >3 osds
More realistic.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:46 PM Revision 80864a12 (ceph): testrados: fix omap_get_vals_by_keys call
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:30 PM Revision e7e3baeb (ceph): osd: add zero_to field to PG::OndiskLog; track zeroed region of pg log
Track which region of the log has been zeroed on disk. This may be
different from tail if 'osd preserved trimmed log...
Sage Weil
10:30 PM Revision ed6cc47b (ceph): filestore: use FL_ALLOC_PUNCH_HOLE to zero, when available
First try the FL_ALLOC_PUNCH_HOLE fallocate() flag. If we get EOPNOTSUPP,
fall back to writing zeros.
Check for fal...
Sage Weil
10:17 PM Bug #2103 (Fix Under Review): osd: lockdep error on watch_lock
Sage Weil
04:47 PM Bug #2103 (In Progress): osd: lockdep error on watch_lock
Sage Weil
10:16 PM Revision 1468d951 (ceph): osd: fix op_wq vs pg->lock ordering
map_lock
-> pg->lock
-> op_wq
Fixes: #2153
Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Samuel Just...
Sage Weil
04:07 PM rgw Bug #2139 (Resolved): rgw: no cleanup of index update in case of cancelled operations
Fixed, merged at commit: 71323ffeda665defdd133d9fd3c40e08952082e4. Yehuda Sadeh
02:57 PM Feature #2158 (Duplicate): cephtool: helpful error/timeout when no monitor quorum
Sage Weil
02:44 PM Feature #975 (Resolved): osd: use fallocate(FL_PUNCH_HOLE) for zero
commit:e7e3baebd6d5d9e3dfbddc77a2312e9f8e4e9da8 Sage Weil
02:34 PM Feature #2088: msgr: refactor 2 threads to one
Maybe it's a good time to switch from poll to epoll (where available)? Yehuda Sadeh
02:30 PM Linux kernel client Bug #2157 (Resolved): ceph: xattr: fix nanosecond display on i_rctime
In ceph_vxattrcb_dir_rctime() the nanoseconds part of the time
displayed does not include leading 0's, which leads t...
Alex Elder
02:25 PM Bug #2153 (Resolved): osd: locking cycle on pg::_lock. op_tp::lock, OSD::map_lock
commit:1468d95101adfad44247016a1399aab6b86708d2 Sage Weil
09:27 AM Bug #2153 (Resolved): osd: locking cycle on pg::_lock. op_tp::lock, OSD::map_lock
... Sage Weil
02:10 PM Linux kernel client Bug #2156 (Resolved): ceph: xattr: fix a possible buffer overrun bug
In ceph_vxattrcb_file_layout(), if an inode has a preferred PG its
value is added to the formatted output buffer. I...
Alex Elder
01:56 PM rgw Cleanup #2089: rgw: less dout(0) noise?
sounds good to me! Sage Weil
01:40 PM rgw Cleanup #2089: rgw: less dout(0) noise?
debug rgw = 0 removes almost everything except two:
rgw_main.cc: dout(0) << "====== starting new request req=" <<...
Dan Mick
10:05 AM rgw Cleanup #2089: rgw: less dout(0) noise?
Actually, those are dout(10). The problem is that the default rgw log level is 20. Yehuda Sadeh
01:41 PM Linux kernel client Bug #2155 (Resolved): ceph: xattr: wrong value assumed for "no preferred PG"
In ceph_vxattrcb_file_layout() the layout for a file is printed.
The preferred placement group is extracted from t...
Alex Elder
01:14 PM Bug #2154 (Resolved): rados: bench seq should not segfault when blocksize doesn't match write blo...
I misunderstood the problem to begin with. The issue is when read size does not match the prior write size (I hadn't... Dan Mick
09:50 AM Feature #2149: osd: use omap for snap collections
Sage Weil
09:40 AM rgw Feature #1739 (Resolved): rgw: multipart upload should use manifest object
Sage Weil
08:56 AM Feature #1895 (Rejected): osd: detect duplicate requests by tracking per-client last_acked_tid in...
Sage Weil
08:54 AM Feature #1802: qa: test to exercise divergent osd logs
the backfill.py exercises a divergent backfill target. we just need to do the same on a non-backfill target. Sage Weil
07:46 AM rgw Feature #2066 (Resolved): rgw: make list_objects efficient
Done, merge in with rgw-omap changes at commit:89648a0d1abe06832f57db2ff8be85749f667094 Yehuda Sadeh
07:37 AM rgw Feature #1715 (Rejected): rgw: use RENAME osd operation to avoid slow CLONE operations
With new atomic scheme we're not using clone anymore (unless it's an old object for backward compatibility). Yehuda Sadeh
07:34 AM rgw Feature #2040: rgw: disable rgw log through ceph.conf
Yeah, this is the rgw user operations log, not the debug log. Yehuda Sadeh
07:25 AM rgw Feature #1847 (Resolved): rgw: revisit the way we store large objects
Revisited, solution was merged in commit:5f8bbf1ff2d547023c15628dd0e3a5711e584c2b (rgw-new-atomic). Yehuda Sadeh
07:23 AM rgw Feature #1872 (Resolved): rgw: only use shadow objects for large objects
This has long been done.
Pushing acls check to cls_rgw can be done later on, but that's a different issue.
Yehuda Sadeh
07:21 AM rgw Bug #2072 (Resolved): rgw: owner cannot change acl if it doesn't have bucket read permission
Fixed, merged in with swift-acls. Yehuda Sadeh
07:19 AM rgw Feature #2140 (In Progress): rgw: use new encode/decode macros
Yehuda Sadeh
06:58 AM Revision c0aa3319 (ceph): Merge branch 'master' of ssh://skinny/home/yehudasa/ceph
Yehuda Sadeh
06:53 AM Revision 5f8bbf1f (ceph): Merge branch 'wip-rgw-new-atomic'
Yehuda Sadeh
06:52 AM Revision 465a88ea (ceph): rgw: append the currect bucket marker when removing bucket
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
06:39 AM Revision 8ce3d229 (ceph): Merge branch 'master' of ssh://skinny/home/yehudasa/ceph
Yehuda Sadeh
06:35 AM Revision 89648a0d (ceph): Merge branch 'wip-rgw-omap'
Yehuda Sadeh
06:25 AM Revision 72d678b8 (ceph): cls_rgw: fix rgw_bucket_init_index
was failing to error in case header already existed
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
06:19 AM Revision eaafc578 (ceph): rgw: remove extra unused params from omap_get()
and also rename it to omap_get_all()
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
06:18 AM Revision 765962dc (ceph): rgw: add cls_cxx_map_clear
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
05:59 AM Revision afe2598a (ceph): leveldb: drop compaction unit test
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
01:20 AM Revision 4d6578ca (ceph): ReplicatedPG,librados: add filter_prefix to omap_get_vals
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
01:10 AM Revision e0744390 (ceph): rgw: use prefix filter for bucket listing
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
01:03 AM Revision 4c908442 (ceph): objclass, cls_rgw: add prefix to omap_get_vals()
Yehuda Sadeh
01:02 AM Revision a6b524f6 (ceph): librados: add higher level call for omap_get_keys() with prefix
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:46 AM Revision 7a72bf01 (ceph): Merge remote-tracking branch 'origin/wip_prefix' into wip-rgw-omap
Yehuda Sadeh
12:32 AM Bug #2146: Ceph -s crash and reports old ceph version
I have updated ceph package only. The binary used for getting statistics is in ceph-common. After upgrade it work fin... Maciej Galkiewicz
12:29 AM Revision ffcf62f6 (ceph): rgw: switch to new encoders/decoders
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:28 AM Revision ffae4dfe (ceph): enconding: add legacy decoder for 32 bit versions
In these cases if the version is smaller than the compat ver
we skip 3 bytes.
Signed-off-by: Yehuda Sadeh <yehuda@hq...
Yehuda Sadeh
12:05 AM Revision b0c9d58c (ceph): rbd: pass all mon addrs when mapping devices
Previously this repeated the address of the first monitor.
Fixes: #2152
Signed-off-by: Josh Durgin <josh.durgin@drea...
Josh Durgin

03/07/2012

10:07 PM Revision 4d0bcdc8 (ceph): objectstore: fix collection_move() encoding
This was broken in the original f43c3d958fe5c32ae647ffa715390ada51ae2650.
Signed-off-by: Sage Weil <sage.weil@dreamh...
Sage Weil
09:54 PM Revision 9dc7c653 (ceph): msgr: remove SimpleMessenger::get_ms_addr() in favor of Messenger::get_...
And fix the comments on set_ip.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Reviewed-by: Sage Weil <sa...
Greg Farnum
09:54 PM Revision 65d170a8 (ceph): msgr: remove declaration of undefined SimpleMessenger::write_pid_file
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Reviewed-by: Sage Weil <sage.weil@dreamhost.com>
Greg Farnum
09:21 PM Feature #2114 (In Progress): old sepia setup on new hardware
Sage Weil
09:20 PM Feature #2087 (In Progress): lightweight filestore workload generator
Sage Weil
09:20 PM Feature #2088 (In Progress): msgr: refactor 2 threads to one
Sage Weil
09:20 PM Feature #2053 (In Progress): librados: caching
Sage Weil
09:20 PM rgw Feature #2066 (In Progress): rgw: make list_objects efficient
Sage Weil
09:20 PM rgw Feature #1950 (In Progress): rgw: create S3/Swift ACL interoperability suite
Sage Weil
09:20 PM rgw Feature #830 (Resolved): rgw: swift per-object ACLs
Sage Weil
09:19 PM rgw Feature #2140 (Resolved): rgw: use new encode/decode macros
Sage Weil
09:08 PM Revision 0a59d08d (ceph): ReplicatedPG,librados: add filter_prefix to omap_get_vals
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
08:55 PM Revision 31762c00 (ceph): lock: Improved logging when there aren't enough nodes available to lock...
Mark Nelson
08:55 PM Revision 05a07dda (ceph): lock: Added a --locked flag to teuthology-lock.
Can be used to restrict searches based on lock status, e.g.
'teuthology-lock --list -a --locked false --status up' sh...
Mark Nelson
08:53 PM rgw Feature #2040: rgw: disable rgw log through ceph.conf
Dan Mick wrote:
> Yes please! :) rgw debug = 0 doesn't seem to have any effect. (you knew that.)
...but this i...
Dan Mick
08:47 PM rgw Feature #2040: rgw: disable rgw log through ceph.conf
Yes please! :) rgw debug = 0 doesn't seem to have any effect. (you knew that.) Dan Mick
08:34 PM Revision 6bf72019 (ceph): rgw: some minor cleanups
following a review
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
06:45 PM Revision 5f8ffdee (ceph): objclass: fix cls_cxx_map_write_header
Claiming the buffer instead of encoding it.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
Yehuda Sadeh
06:44 PM Revision 799c8aca (ceph): cls_rgw: fix debug message
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
06:32 PM Revision 0b1037da (ceph): Merge remote-tracking branch 'gh/wip-doc'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
04:56 PM Revision 9ba96c01 (ceph): osd: make degraded pgs count missing replicas as degraded objects
If a PG is smaller than it should be, make sure the missing replicas are
included in the degraded object count. This...
Sage Weil
04:14 PM Bug #2152 (Resolved): rbd: first monitor address is passed to kernel n times in rbd map
Sage Weil
02:45 PM Bug #2152 (Resolved): rbd: first monitor address is passed to kernel n times in rbd map
Using rbd map with mon0 (addr 10.10.10.4:6789) down, this happened:... Josh Durgin
02:41 PM Linux kernel client Bug #2081: msgr: spinlock badness?
This:
ubuntu@teuthology:/a/nightly_coverage_2012-02-20-b/12984
does not provide very much useful informatio...
Alex Elder
02:36 PM Feature #2127: Save kernel core dumps on all of our test machines
Researched this a bit this afternoon. I have used LKCD in the past and
find now it's fairly old. Here someone has ...
Alex Elder
02:11 PM Bug #2151 (Resolved): mon: pg creation retry is slow
right now if you are creating a pg and the osd crashes, the monitor will retry every once in a while, but that retry ... Sage Weil
02:00 PM Messengers Cleanup #2150 (Resolved): repair the Simple/Messenger interface
I made a start on this with my interface work, but there are still
1) a lot of public functions that ought to be pri...
Greg Farnum
10:51 AM Feature #2149 (Resolved): osd: use omap for snap collections
Sage Weil
09:55 AM Bug #2148 (Resolved): osd: class error return not propagated to client
Sage Weil
09:49 AM Feature #2147 (Resolved): objclass: add CLS_ERR macro
Otherwise, dumping errors through CLS_LOG hides them if debug objclass is not set. Yehuda Sadeh
09:06 AM Bug #2137 (Resolved): osd: refine degraded % calculation
commit:9ba96c0123c050f7e7c62ea644e240aed0c954fb Sage Weil
08:24 AM Bug #2146: Ceph -s crash and reports old ceph version
I'm curious how you managed to install new packages while keeping around an old binary, though. (Yes, that's definite... Greg Farnum
07:48 AM Bug #2146 (Rejected): Ceph -s crash and reports old ceph version
The problem is just that your ceph binary is out of date. You need to run v0.43 or later, due to the structure encod... Sage Weil
03:56 AM Bug #2146 (Rejected): Ceph -s crash and reports old ceph version
# ceph -s
terminate called after throwing an instance of 'ceph::buffer::malformed_input'
what(): buffer::malform...
Maciej Galkiewicz
05:03 AM Revision 184a9dda (ceph): mon: fix full osd detail
And use a helper to avoid dup code.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:03 AM Revision bd97923c (ceph): doc: fix misc typos, bad phrasing
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:03 AM Revision a300db39 (ceph): mon: assign severity to each health summary/detail item
These can be included in the detail dump in the future.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
01:17 AM Revision f266d5a2 (ceph): objclass, cls_rgw: update to use omap
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
01:05 AM Revision 2822fe50 (ceph): mon: include unfound count in health detail
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision 2b87d4f2 (ceph): osd: list might_have_unfound locations in query result
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision d72b8217 (ceph): doc: document some osd failure recovery scenarios
- simple osd failure
- ceph health [detail]
- peering failure ('down') state
- unfound objects
Signed-off-by: Sage W...
Sage Weil
01:05 AM Revision 2bec51a2 (ceph): doc: describe 'stuck' states we check for
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision 956e2e22 (ceph): mon: list nearfull/full osd detail
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision 75ad8979 (ceph): doc: diagnose full osd cluster
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision 25cceca0 (ceph): doc: slow osd requests
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision ab41d675 (ceph): doc: fix link
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision 0becb843 (ceph): doc: talk about mon failures a bit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:05 AM Revision b0ed78c3 (ceph): doc: 2 words about radosgw failures
- restarting the daemon.
- using the admin socket
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
01:05 AM Revision 8b0bd127 (ceph): mon: refactor health, include optional detail
'ceph health' to get the usual summary, 'ceph health detail' to
additionally get a comprehensive list of problems fou...
Sage Weil
12:05 AM Revision db223363 (ceph): Merge branch 'wip-collmove'
Samuel Just

03/06/2012

11:48 PM Revision 85446298 (ceph): rgw: switch to omap api
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:15 PM Revision 6c0cb55e (ceph): leveldb: remove flawed unit test for now
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
10:53 PM Revision 7012c20b (ceph): librados: rename omap_get_vals_by_key to omap_get_vals_by_keys
merge fail
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
10:35 PM Bug #2137 (Fix Under Review): osd: refine degraded % calculation
wip-2137 Sage Weil
10:18 PM Revision 1fefb146 (ceph): librados: add high level omap calls
also rename get_vals_by_key to get_vals_by_keys
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
10:17 PM Revision f85ab71f (ceph): rgw: read bucket through tmap_get
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
10:17 PM Revision 7fc779a6 (ceph): rgw: fix warning
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
07:46 PM Revision 98f8219d (ceph): Merge branch 'wip_omap'
Reviewed-by: Sage Weil <sage.weil@dreamhost.com> Samuel Just
07:44 PM Revision d8dcb28e (ceph): librados: add tmap_put to ObjectWriteOperation
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision b85f7d7b (ceph): ReplicatedPG: add omap_header to recovery
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision d2bf68d1 (ceph): osdc: Add omap operation stubs to Objecter::ObjectOperation
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision 81c22dfb (ceph): librados: Added omap operations to librados
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision 82287986 (ceph): ReplicatedPG: Add omap ops to ReplicatedPG
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision 2abf3776 (ceph): RadosModel: Add omap operations to RadosModel
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision adace1cf (ceph): ReplicatedPG: transparently upgrade TMAP
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision 4c4fcea3 (ceph): osd: testing for tmap auto upgrade
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:44 PM Revision b6c2e839 (ceph): test_rados_api_aio: add omap
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:03 PM Revision b52d4087 (ceph): Merge branch 'wip-1796'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Sage Weil
07:02 PM Revision 769ef369 (ceph): journaler: add generic write error handler
Specify a generic callback for any write error the journaler encounters.
This is more helpful than passing up write e...
Sage Weil
07:02 PM Revision 195301ef (ceph): mds: respawn when blacklisted
If we are blacklisted by the OSD cluster, it's because we were too slow
and were replaced by another ceph-mds. Respa...
Sage Weil
06:49 PM Revision 50682189 (ceph): Merge remote-tracking branch 'gh/wip-2105'
Reviewed-by: Samuel Just <samuel.just@dreamhost.com> Sage Weil
06:24 PM Revision 86186405 (ceph): .gitignore: src/ocf/rbd
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:23 PM Linux kernel client Bug #2064 (In Progress): ceph-client: messenger: nocrc flag not implemented correctly
I've been unable to test rigorously today so I analyzed the affected code.
If the "nocrc" flag is supplied, the ce...
Alex Elder
03:22 PM Linux kernel client Bug #2064: ceph-client: messenger: nocrc flag not implemented correctly
The problem also exists in write_partial_msg_pages(). In other words, it
looks like neither the reader nor the writ...
Alex Elder
05:46 AM Linux kernel client Bug #2064: ceph-client: messenger: nocrc flag not implemented correctly
This is a trivial fix. The only thing that needs to be done really is
testing the result to make sure things don't ...
Alex Elder
05:34 PM Revision 2a18c3e1 (ceph): nuke: unmount osd data directories
This helps us avoid reboot to clean up osd data directories that are left
mounted.
Sage Weil
05:19 PM Revision e3b4ba99 (ceph): filestore: create snap_0 on mkfs
If we create a new filestore, apply one transaction, and then crash, we
want to make sure roll back to a consistent r...
Sage Weil
05:19 PM Revision a14d44fc (ceph): filestore: drop useless read_op_seq() arg
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:14 PM Revision b78b725d (ceph): Merge pull request #9 from fghaas/ocf-ra
OCF resource agents: add rbd
Reviewed-by: Sage Weil <sage@newdream.net>
Reviewed-by: Tommi Virtanen <tommi.virtanen@...
Sage Weil
04:40 PM Feature #2145 (Resolved): doc gitbuilder
it can automatically rsync to ceph.newdream.net/docs/<branch or tag name> Sage Weil
03:54 PM Bug #2136 (Resolved): leveldb fails unit tests in gitbuilders
Sage Weil
03:45 PM Subtask #1919 (Resolved): implement the key value interface in terms of leveldb
Samuel Just
03:45 PM Subtask #1920 (Resolved): Update OSD to use ObjectStore tmap implentation (object recovery must c...
Samuel Just
03:45 PM Subtask #1917 (Resolved): create interface for backing key-value store and create object map impl...
Samuel Just
03:45 PM Subtask #1918 (Resolved): create mock key-value store and tests for the related object map implen...
Samuel Just
03:45 PM Subtask #1915 (Resolved): Create trivial implementation for the object map interface (using curre...
Samuel Just
03:45 PM Subtask #1916 (Resolved): add tests ObjectStore level tests for tmap operations
Samuel Just
03:45 PM Subtask #1914 (Resolved): Create interface for object map implementation
Samuel Just
03:45 PM Subtask #1913 (Resolved): Add tmap operations to ObjectStore interface
Samuel Just
03:43 PM Feature #1571 (Resolved): osd: non-trivial map object
Samuel Just
03:42 PM Feature #1571 (Closed): osd: non-trivial map object
Samuel Just
02:20 PM CephFS Feature #2144 (Resolved): mon: improve mds health checks
e.g. replay, rejoin, resolve, etc. should be warning, and so forth. Sage Weil
11:27 AM Cleanup #2143: Remove ALL "don't use this product" warnings
should we leave it on ceph-mds startup? Sage Weil
11:26 AM Cleanup #2143 (Fix Under Review): Remove ALL "don't use this product" warnings
wip-2143 Sage Weil
11:21 AM Cleanup #2143 (Resolved): Remove ALL "don't use this product" warnings
We are about to push this product out to Open Stack and Canonical, and will be charging for support.
"Do not use thi...
Anonymous
11:10 AM CephFS Bug #1796 (Resolved): mds: exit cleanly on EBLACKLISTED
Sage Weil
10:57 AM Bug #2105 (Resolved): filestore: mkfs does not create initial snap
Fix is merged.. this'll be in v0.44.
Thanks for the bug report (and detailed log!)
Sage Weil
09:33 AM Bug #2105 (Fix Under Review): filestore: mkfs does not create initial snap
Sage Weil
10:21 AM CephFS Bug #1318: directories disappear across multiple rsyncs
I've just run into this on 0.43 using the ceph kernel module in 3.2.7. My symptoms are that a repeated rsync from a ... Jonathan Dieter
09:34 AM rgw Bug #2139 (Fix Under Review): rgw: no cleanup of index update in case of cancelled operations
Sage Weil
09:26 AM Feature #1655: gitbuilder aggregator page
We've been using this script for the past month or so, and it's been pretty handy. Here's an ajax version of the abov... Jimmy Tang
08:58 AM Revision affda7c0 (ceph): rbd OCF RA: fix whitespace inconsistency
Signed-off-by: Florian Haas <florian@hastexo.com> Florian Haas
08:25 AM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
I ran multiple I/O loads on the cluster last night and performed multiple tests, but this problem didn't come back ye... Wido den Hollander
07:17 AM Feature #2142: Add support for Ubuntu 12.04 (precise pangolin) in deb repository
I'll do this for v0.44. Sage Weil
01:25 AM Feature #2142: Add support for Ubuntu 12.04 (precise pangolin) in deb repository
This add is quite usefull for me, because I use chef (I will try to improve the current cookbooks) and chef use "node... Guilhem Lettron
01:21 AM Feature #2142 (Resolved): Add support for Ubuntu 12.04 (precise pangolin) in deb repository
Precise Pangolin is in beta now, it can't be used with the current repository. Guilhem Lettron
06:48 AM Revision d9d5cf2e (ceph): Merge remote branch 'gh/wip-msgr-interface'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
05:42 AM Revision d36b6a90 (ceph): osd: use new collection_move() operation
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:42 AM Revision 14929e4d (ceph): filestore: implement OP_COLL_MOVE
Equivalent to OP_COLL_ADD, OP_COLL_REMOVE.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:41 AM Revision f43c3d95 (ceph): objectstore: OP_COLL_MOVE
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:41 AM Revision 5137f64f (ceph): objectstore: use enum for OP_*
Enforce no dups.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:15 AM Revision d8740094 (ceph): objectstore: remove _fake_writes, _get_frag_stat
Also only implemented by ebofs.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:12 AM Revision 737132be (ceph): filestore: drop trim_from_cache, is_cached
These were used for read optimizations in ebofs; I don't think they'll
come back.
Signed-off-by: Sage Weil <sage.wei...
Sage Weil
05:09 AM Revision 3cd5e31a (ceph): objectstore: remove cruft
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:09 AM Revision 97993f6c (ceph): filestore: remove collection, attr faking
Useless functionality from the dark ages of development, when xattrs were
scarce.
Signed-off-by: Sage Weil <sage.wei...
Sage Weil
01:28 AM Revision d0c6ad0f (ceph): rgw: cancel bucket update if operation failed
Beforehand the operation was leaking.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
01:27 AM Revision 14857582 (ceph): cls_rgw: add cancel operation
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh

03/05/2012

11:28 PM Revision 0661ea1d (ceph): Merge branch 'master' of ssh://github.com/ceph/ceph
Yehuda Sadeh
11:00 PM Revision 4e9ebd6b (ceph): rgw: make sure correct locator is used
Or more correct: locator is not used where not needed.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
11:00 PM Revision e0650ce3 (ceph): rgw: implement copy using new scheme
for some reason target tail uses locator, this needs to be
fixed.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
11:00 PM Revision 53a2ceb7 (ceph): rgw: don't use locator for multipart uploads
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:00 PM Revision dfcb8412 (ceph): rgw: multipart object working with manifest
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:00 PM Revision 578c0510 (ceph): rgw: manifest object contains source offset info
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:00 PM Revision d409c313 (ceph): rgw: basic functionality of new atomic get/put works
get/put of objects works. Stuff that is known to be broken:
copy object
Also, going through the code, we can probabl...
Yehuda Sadeh
11:00 PM Revision 20244d64 (ceph): rgw: get_obj uses manifest
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:00 PM Revision c076e351 (ceph): rgw: atomic objects hold manifest header
When writing an object we update where all the chunks of this object
reside.
Signed-off-by: Yehuda Sadeh <yehuda@hq....
Yehuda Sadeh
11:00 PM Revision e36940af (ceph): rgw: atomic processor writes to shadow object
And the first chunk is going to the head object in the end
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
10:51 PM Bug #2126 (Duplicate): osd: recover_primary did nothing when num_missing==1
Sage Weil
10:45 PM Bug #2105: filestore: mkfs does not create initial snap
Thanks for the report!
The mkfs doesn't create an initial snap, so if we crash/stop before creating one, our first...
Sage Weil
10:35 PM CephFS Bug #1796 (Fix Under Review): mds: exit cleanly on EBLACKLISTED
Sage Weil
10:35 PM Revision ed0f6053 (ceph): Merge remote branch 'gh/wip-swift-acls'
Lightly-reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
10:21 PM Revision 3e95dfdf (ceph): osd: delay non-replayed ops during replay
If we get new (non-replayed) ops during replay, those need to wait until
after the replayed ops are ordered and appli...
Sage Weil
10:21 PM Revision 702f09ea (ceph): librados: close narrow shutdown race
timer.shutdown() will drop and retake the lock, so set DISCONNECTED first
to avoid a message slipping in and reaching...
Sage Weil
10:21 PM Revision 743da9bd (ceph): osd: don't trust pusher's data_complete
The pusher doesn't know what clone_overlap we'll see, so it has no idea
if we are data_complete from our perspective,...
Sage Weil
10:20 PM Revision e1a9e18b (ceph): osd: warn if recovery still has missing at end
We shouldn't get to this point. If we do, recover_primary didn't do what
it needed to. Dump the remaining missing s...
Sage Weil
10:14 PM Bug #2098 (In Progress): xfs/ext4 non-idempotent transaction
Sage Weil
09:34 PM Revision 14936747 (ceph): Use non-zero exit status if any tests failed
Fixes: #1989 Josh Durgin
08:30 PM Revision c31b8696 (ceph): OCF resource agents: add rbd
Add a resource agent for mapping, unmapping and monitoring RBD devices.
Maps an RBD on start, unmaps it on stop. Che...
Florian Haas
03:53 PM Feature #2141 (Resolved): ceph: 'object map <poolname> <objectname>' or similar
'ceph pg map 1.2' will tell you what osd handles a pg... having the same thing that lets you provide pool and object ... Sage Weil
02:32 PM rgw Feature #2140 (Resolved): rgw: use new encode/decode macros
so we can safely revise encoding for the structures in a forward/backward compatible way Sage Weil
02:18 PM Bug #2135 (Resolved): cephtool: osdc/Objecter.cc: 375: FAILED assert(initialized)
Sage Weil
02:17 PM Bug #2133 (Resolved): osd: recovery_complete
Sage Weil
02:17 PM Bug #2082 (Resolved): osd: broken queuing during replay
Sage Weil
01:23 PM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
I tried to compile the wip-2116 and master branches today, that failed. So I haven't been able to test the new code y... Wido den Hollander
12:39 PM rgw Bug #2120 (Resolved): rgw: atomic write guard doesn't scale well
Yehuda Sadeh
12:37 PM rgw Bug #2120: rgw: atomic write guard doesn't scale well
A fix for that was already pushed (backing off when identifying a race in atomic write scheme). However, a more compl... Yehuda Sadeh
11:43 AM Bug #1900 (Resolved): Fix detection and build issues with libcrypto++
commit:c9e028f4e5287ef6b7b237fca9ee34aa78ae1212 Sage Weil
10:51 AM Linux kernel client Bug #1600 (Can't reproduce): general protection fault during blogbench
Sage Weil
10:51 AM Linux kernel client Bug #1615 (Can't reproduce): null pointer dereference in ceph_msg_new
Sage Weil
10:50 AM Linux kernel client Bug #1597 (Can't reproduce): general protection fault during mount
Sage Weil
10:50 AM Linux kernel client Bug #2069 (Need More Info): client crash during kernel_untar_build rm -r step
Sage Weil
10:48 AM Linux kernel client Bug #1793 (Need More Info): NULL pointer dereference at try_write+0x627/0x1060
Sage Weil
10:45 AM rgw Bug #2063 (Resolved): rgw: access key shouldn't contain chars that need to be url encoded
Sage Weil
10:43 AM Feature #2087: lightweight filestore workload generator
*task Samuel Just
10:42 AM Feature #2087: lightweight filestore workload generator
Mark suggested that this would be a good talk for Joao. Samuel Just
10:42 AM Bug #1989 (Resolved): teuthology: error in ceph.log didn't make teutholgy return error code
Josh Durgin
10:40 AM CephFS Bug #2071 (Need More Info): kclient: pjd mkfifo failures
Sage Weil
10:40 AM rgw Bug #2035 (Resolved): rgw: bucket removal fails
Sage Weil
10:39 AM Bug #2042 (Need More Info): mon: crash in LogMonitor::update_from_paxos
Sage Weil
10:37 AM Feature #2003: limit XFS extent fragmentation for rbd
librbd caching will help a lot! Sage Weil
10:35 AM rgw Bug #1948 (Resolved): rgw: need to read intent log in chunks
Sage Weil
10:33 AM rgw Bug #2027 (Need More Info): rgw -> apache miscommunication
Sage Weil
10:08 AM rgw Bug #1864 (Resolved): rgw: atomic bucket info
Sage Weil
10:07 AM Bug #1873 (Won't Fix): crush_rule type is inconsistent
Sage Weil
10:06 AM CephFS Bug #1811 (Need More Info): 2 pjd chown tests failed on cfuse
Sage Weil
10:02 AM Bug #1627 (Need More Info): ceph-mon memleak if ceph-osd cluster ip is not reachable, but public ...
Sage Weil
10:01 AM CephFS Bug #1047: mds: crash on anchor table query
Sage Weil
09:57 AM CephFS Bug #1682 (Resolved): mds: segfault in CInode::authority
calling this resolved too. Sage Weil
09:51 AM CephFS Bug #1549 (Resolved): mds: zeroed root CDir* vtable in scatter_writebehind_finish
using clean shutdown now, yay Sage Weil
09:50 AM rgw Bug #1822 (Closed): radosgw can be slow to respond to requests
several other bugs are open to track specific problems Sage Weil
09:49 AM rgw Bug #2139 (Resolved): rgw: no cleanup of index update in case of cancelled operations
Yehuda Sadeh
09:33 AM Bug #2128 (Rejected): filestore: check() fails during sync
actually, i hit this running the new code with the filestore transaction leak bug fix reverted (to reproduce the btrf... Sage Weil
07:49 AM Linux kernel client Tasks #2138 (Resolved): rbd: run xfstests on a local XFS filesystem over RBD
This still can't really be done cleanly in a teuthology script, but
I would like to run xfstests on an RBD client sy...
Alex Elder
07:23 AM Bug #2137 (Resolved): osd: refine degraded % calculation
based on the recent pg state updates... Sage Weil

03/04/2012

01:40 PM Bug #2136 (Resolved): leveldb fails unit tests in gitbuilders
always ... Sage Weil
01:36 PM Bug #2082: osd: broken queuing during replay
Hit this again and dug into the logs. It's a bug on the osd... testrados is using the data, which has replay enabled... Sage Weil
05:01 AM Revision 75cbed61 (ceph): DBObjectMap: remove stray ;
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil

03/03/2012

10:45 PM Revision 0272b590 (ceph): LevelDBStore: #include types.h
This fixes some compile errors on one of my boxes (squeeze).
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
09:33 PM Bug #2128: filestore: check() fails during sync
could it be commit:75cbed61e94a7974e40230360c6781d85f47576d ? Sage Weil
09:11 PM Bug #2133: osd: recovery_complete
Sage Weil
02:18 PM Bug #2133 (Resolved): osd: recovery_complete
pull raced with clones, clone_subset changed, it got confused.... Sage Weil
09:10 PM Bug #2135: cephtool: osdc/Objecter.cc: 375: FAILED assert(initialized)
librados shutdown race Sage Weil
07:38 PM Bug #2135 (Resolved): cephtool: osdc/Objecter.cc: 375: FAILED assert(initialized)
... Sage Weil
03:16 PM CephFS Bug #1796: mds: exit cleanly on EBLACKLISTED
people hit this and it's confusing when ceph-mds crashes...
wip-1796
Sage Weil
02:38 PM Feature #2134 (Resolved): qa: smoke suite
pick out some regression tests that run reasonably quickly and have decent coverage. Sage Weil

03/02/2012

10:59 PM Revision 004ec667 (ceph): .gitignore: *.tar.bz2
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:46 PM Revision 6e2a16b8 (ceph): msgr: start re-ordering functions into a better order
This is the start of making the SimpleMessenger interface legible
to users. In addition to moving the configuration a...
Greg Farnum
09:59 PM Bug #2132 (Resolved): FAILED assert(!missing.is_missing(soid))
Possibly a duplicate of Issue #1191 or Issue #339 (both closed with could not reproduce).
Prior to this assert th...
Matthew Roy
09:45 PM Revision 38537ba7 (ceph): Merge branch 'stable'
Sage Weil
09:36 PM Linux kernel client Bug #2099 (Rejected): messenger: unexpected socket state (4)
OK, this is not a bug. I caused it by inserting this WARN_ON() message
in a case statement in ceph_state_change(). ...
Alex Elder
09:29 PM Linux kernel client Cleanup #2131 (New): ceph: xattr: use the generic kernel xattr code
The Linux kernel has a generic set of routines to support
extended attributes. When I posted some recent changes
t...
Alex Elder
09:28 PM Linux kernel client Cleanup #2130: ceph: xattr: complete cleanups following review
Forgot to assign it to myself Alex Elder
09:27 PM Linux kernel client Cleanup #2130 (Rejected): ceph: xattr: complete cleanups following review
As requested by Mark... I have a number of changes to make to
fs/ceph/xattr.c based on my review of that code last ...
Alex Elder
08:32 PM Revision 29be5282 (ceph): librados: remove gratuitous call to add_dispatcher_head.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:32 PM Revision 26e48f42 (ceph): msgr: Require that init functions are called before bind() and start().
Fix up callers to handle these constraints.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
08:32 PM Revision 091b1760 (ceph): msgr: make nonce a required part of the SimpleMessenger constructor.
With that, remove the set_nonce function and the gratuitous passing
of nonce around through layers of functions.
Sig...
Greg Farnum
08:32 PM Revision 38bec5da (ceph): msgr: remove refcounting of Messengers.
This was pretty pointless since each Messenger has a well-defined
exit point and shutdown process.
Signed-off-by: Gr...
Greg Farnum
08:32 PM Revision cd174c5e (ceph): msgr: promote the started bool to Messenger.
Make it a protected member of Messenger instead of a public part of
SimpleMessenger.
Signed-off-by: Greg Farnum <gre...
Greg Farnum
08:12 PM Linux kernel client Bug #2129 (New): ceph: xattr: call __build_xattrs() *before* cap check
While reviewing a change to the xattr code, Sage noticed that some
calls to __build_xattrs() were being made *after*...
Alex Elder
07:20 PM Revision ffa59559 (ceph): msgr: Remove SimpleMessenger::register_entity
This function has been vestigial for a long time. Remove it and move
its remaining functionality into the constructor...
Greg Farnum
07:20 PM Revision ef244773 (ceph): msgr: Remove the SimpleMessenger start/start_with_nonce distinction.
Instead, have a settable nonce value that you can fill in any time
after construction and that it uses during regular...
Greg Farnum
07:20 PM Revision 578bc9c4 (ceph): msgr: Remove the SimpleMessenger::bind() nonce parameter.
Instead, use the just-established nonce value.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
07:20 PM Revision 3bd1d2ae (ceph): msgr: add start() and wait() stubs to the Messenger interface
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
07:00 PM Revision 70360f84 (ceph): github.com/NewDreamNetwork -> github.com/ceph
Sage Weil
06:55 PM Revision dc1abab2 (ceph): github.com/NewDreamNetwork -> github.com/ceph
Sage Weil
06:55 PM Revision 1970713a (ceph): github.com/NewDreamNetwork -> github.com/ceph
Sage Weil
05:50 PM Revision cacf0fde (ceph): filestore: fix rollback safety check
There is a window in the old check between when current/commit_op_seq is
written and the snapshot is taken. If ceph-...
Sage Weil
05:35 PM Revision 098cd921 (ceph): Merge remote branch 'gh/wip_fs_omap'
Reviewed-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:53 PM Revision 9fa8781c (ceph): v0.43
Sage Weil
04:27 PM Bug #2128 (Rejected): filestore: check() fails during sync
... Sage Weil
03:08 PM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
ok, i have a theory what's going on. can you try the new wip-2116, and run with debug ms = 20?
thanks!
Sage Weil
10:07 AM Feature #2127 (New): Save kernel core dumps on all of our test machines
The claim is that there is a netdump module that will UDP-squirt kernel coredumps to a waiting server, which is proba... Anonymous
09:53 AM Bug #2126 (Duplicate): osd: recover_primary did nothing when num_missing==1
... Sage Weil
09:46 AM Bug #2118 (Resolved): osd: flawed commit_op_seq check on startup
Sage Weil
08:43 AM Feature #2125 (Resolved): osd: put large xattrs in leveldb
either when we fear the fs can't handle them, or unconditionally, or something.
Sage Weil
07:33 AM Feature #1422: libvirt: rbd storage pool
Made some more progress on this, code seems to be stable.
Working:
* Single and multiple monitors
* Authenticati...
Wido den Hollander
06:04 AM Revision 9f757ca9 (ceph): disable s3tests on valgrind/lockdep until #2103 is fixed
Sage Weil
01:18 AM Revision 3a835172 (ceph): RadosModel: separate initialization and construction
Several error codes needed to be checked.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Reviewed-by: Samuel...
Josh Durgin
01:17 AM Revision 2b176fbe (ceph): Merge branch 'next'
Josh Durgin
01:16 AM Revision cd313885 (ceph): librados: only shutdown objecter after it's initialized
The objecter is only initialized once the RadosClient state is
CONNECTED from the perspective of a RadosClient::shutd...
Josh Durgin

03/01/2012

10:00 PM Bug #2103: osd: lockdep error on watch_lock
must reenable this in qa suite when it's fixed! Sage Weil
06:11 PM Revision 58a3b7f7 (ceph): Added leveldb submodule
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:11 PM Revision d9b130fa (ceph): Added LevelDBStore
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:11 PM Revision e9dd01f5 (ceph): os/CollectionIndex: Add debugging constructor and Path::coll()
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:11 PM Revision 2ab6f023 (ceph): ObjectStore.h: Initial ObjectStore omap interfaces
ObjectMap.h defines the interface which will be implemented by
leveldb. store_test now tests basic omap operations.
...
Samuel Just
06:11 PM Revision 82199d5d (ceph): os/: DBObjectMap and KeyValueDB interface with tests
DBObjectMap is an implementation of ObjectMap in terms of KeyValueDB.
Signed-off-by: Samuel Just <samuel.just@dreamh...
Samuel Just
06:11 PM Revision 6a624b96 (ceph): test: Add KeyValueDB atomicity checker
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:11 PM Revision 9331e633 (ceph): MOSDSubOp: Add entry for omap recovery
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:11 PM Revision feaf44e7 (ceph): ReplicatedPG: Add omap to recovery
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:11 PM Revision 2c275efb (ceph): Makefile: add headers for distcheck
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
05:18 PM Bug #2122 (Resolved): objecter: Asserts if authorization fails
Fixed by commit:cd313885783a5a69a554139b5b41d21a666c815b Josh Durgin
08:36 AM Bug #2122: objecter: Asserts if authorization fails
Ah, I had a patch to fix this in the wip-testrados branch. I'll rebase and merge that today. The new asserts in the o... Josh Durgin
06:45 AM Bug #2122 (Resolved): objecter: Asserts if authorization fails
While working on the libvirt RBD storage driver I noticed the following crash:... Wido den Hollander
01:46 PM Tasks #2123 (Closed): Ignore this task - I'm checking out the bug report process.
Ken Franklin
09:02 AM Tasks #2123: Ignore this task - I'm checking out the bug report process.
using "Update" option in tracker Ken Franklin
09:00 AM Tasks #2123 (Closed): Ignore this task - I'm checking out the bug report process.
just using the task ticket to walk through the issue lifecycle. Ken Franklin
11:45 AM Bug #2124 (Resolved): crash when malformed auth key is provided
We should guard all calls to decode_base64:... Yehuda Sadeh
09:52 AM Linux kernel client Bug #2099: messenger: unexpected socket state (4)
Saw this a couple of times on a client in a small ceph cluster. It seems to be correlated with dd runs using various... Mark Nelson
08:11 AM Bug #2115 (Rejected): OSD failed to start: Operation not permitted
Sage Weil
02:13 AM Bug #2115: OSD failed to start: Operation not permitted
problem resolved. Thank you very much to your hint! I didn't ever think it is caused by communication.
I created a...
soft crack
04:46 AM Revision cddcc2d2 (ceph): Makefile: make check-local relative to $(srcdir)
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
02:48 AM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
I can almost always reproduce it.
I just upgraded my cluster to:
> ceph version 0.42.2-206-gd77c579 (commit:d77c5...
Wido den Hollander
12:21 AM Revision 749281ed (ceph): Makefile: add json_spirit headers to tarball
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

02/29/2012

11:47 PM Revision a80246c1 (ceph): dump_stuck: note required ceph configuration
Josh Durgin
11:45 PM Revision b2bbede8 (ceph): dump-stuck: set pg stuck threshold to match test
Josh Durgin
10:46 PM Revision 86340655 (ceph): rgw: don't retry certain operations if we raced
The atomic get/put scheme was retrying writes in case where it lost
races (head object was rewritten by another clien...
Yehuda Sadeh
10:46 PM Revision 85d04c6c (ceph): rgw: don't check for ECANCELED in the _impl() functions
We already check it in the outer functions.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
09:22 PM Bug #2022: osd: misdirectect request
... Sage Weil
09:22 PM Revision b1f26440 (ceph): msgr: fix race in learned_addr()
- two connect() threads
- both hit if (need_addr) check
- one takes lock, sets addr, need_addr = false, unlocks
- con...
Sage Weil
09:16 PM Bug #2080: osd: scrub on disk size does not match object info size
hit this again, ... Sage Weil
08:28 PM Revision 8a2b7641 (ceph): msgr: print existing->state before failing assert
May help with #1378.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:07 PM Revision cbb12809 (ceph): Merge remote-tracking branch 'gh/wip-2121'
Reviewed-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Sage Weil
05:46 PM Revision 052d64e1 (ceph): osd: unregister signal handlers on shutdown
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:46 PM Revision db96831b (ceph): mon: unregister signal handlers on shutdown
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:45 PM Revision 8e9bf611 (ceph): mds: unregister SIGHUP too
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:45 PM Revision bb5c7640 (ceph): radosgw: handle SIGHUP
Fixes: #2121
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:23 PM Revision 9c7b63e1 (ceph): init-radosgw: add 'reload' command to send SIGHUP
Fixes: #2121
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:21 PM Revision e8437665 (ceph): osd: fix typo is recovery_state query dump
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:17 PM Revision 0e03e9dd (ceph): osd: add missing space to scrub error
[ERR] 18.5 osd.3: soid 8a5e37ad/rb.0.0.000000002b99/headextra attr _, extra attr snapset
Signed-off-by: Sage Weil <s...
Sage Weil
05:12 PM Revision 2437ce02 (ceph): msgr: discard the local_pipe's queue on shutdown.
To facilitate this, we do two things:
1) actually identify the number of special code values we pass around
2) use th...
Greg Farnum
05:10 PM Revision 7690f0b9 (ceph): osd: remove down OSDs from peer_info on reset
If an OSD goes down, remove it from peer_info. In particular, I saw
2012-02-28 11:04:25.851038 12e53700 osd.5 3602 p...
Sage Weil
02:57 PM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
i'm hoping wip-2116 fixes it... Sage Weil
02:31 PM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
Wido, are you able to reproduce this reliably? I have an idea what the problem is, but have never reproduced this. ... Sage Weil
02:17 PM Bug #2002: osd: racy push/pull for clones
reenabling this in my thrashing tests. if all goes well, i'll reenable in master under the assumption that sam's cle... Sage Weil
02:16 PM Bug #1977 (Can't reproduce): mon: ceph command hang
we can reopen if this ever pops up again Sage Weil
01:59 PM Feature #2111 (In Progress): msgr workloads
What we're looking for here are basic tests like connect, send message, kill connection, send another message; and ve... Greg Farnum
01:30 PM Messengers Bug #1747 (Resolved): msgr: osd connection originates from wrong port
commit:b1f264406f93af35600786f58e75908c393cf2ed Sage Weil
12:21 PM Messengers Bug #1747: msgr: osd connection originates from wrong port
wip-1747 Sage Weil
11:25 AM Messengers Bug #1747: msgr: osd connection originates from wrong port
just hit this again. osd.1:... Sage Weil
12:48 PM rgw Bug #2121 (Resolved): radosgw: reload command for init script
Sage Weil
09:48 AM rgw Bug #2121: radosgw: reload command for init script
Sage Weil
09:25 AM rgw Bug #2121 (Resolved): radosgw: reload command for init script
Sage Weil
12:48 PM Bug #1458 (Resolved): Run ceph suite with valgrind enabled
Sage Weil
11:13 AM Bug #1975: btrfs: EINVAL on snap create
see also this thread: http://marc.info/?t=132768583600004&r=1&w=2 Sage Weil
10:46 AM Bug #1975: btrfs: EINVAL on snap create
the EINVAL seems to have come from... Sage Weil
10:44 AM Bug #1975: btrfs: EINVAL on snap create
somehow we end up here in btrfs:... Sage Weil
10:39 AM Bug #1975: btrfs: EINVAL on snap create
quick brain dump:
- last time this reproduced i narrowed it down to a case where there were racing rmdirs with the...
Sage Weil
10:55 AM Bug #2115: OSD failed to start: Operation not permitted
it looks like you may be having trouble authenticating with the monitor. can you reproduce this with 'debug ms = 1'? ... Sage Weil
10:28 AM Bug #2031 (Can't reproduce): paxos: failed assert (begin->last_committed == last_committed)
Sage Weil
10:09 AM Messengers Bug #2086 (Resolved): msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
merged! Sage Weil
10:06 AM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
Sage suggested I could just add a local dispatch to the shutdown or wait functions to test this properly...I did, and... Greg Farnum
09:18 AM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
Sage Weil
09:27 AM Bug #1873: crush_rule type is inconsistent
It's __s16 or int so that a negative value can mean undefined/not specified. I'm inclined to just leave this as is... Sage Weil
09:18 AM Bug #2119 (Resolved): osd: do_query to !up osd
Sage Weil
01:04 AM Revision fe94c041 (ceph): Merge branch 'next'
Sage Weil

02/28/2012

10:05 PM Revision 23a0c039 (ceph): rgw: check for bucket swift permissions only if failed
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
09:55 PM Revision 85cc96c1 (ceph): dump_stuck: verify that 'ceph health' mentions the right number of inac...
Josh Durgin
09:53 PM Revision b9a675a2 (ceph): mon: report pgs stuck inactive/unclean/stale in health check
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Reviewed-by: Sage Weil <sage.weil@dreamhost.com>
Josh Durgin
09:31 PM Revision e73ab2cc (ceph): Merge branch 'master' into wip-swift-acls
Yehuda Sadeh
09:29 PM Revision bc80ba1f (ceph): rgw: fix swift bucket acl verification
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
08:37 PM Revision cc935180 (ceph): rgw: implement swift public group
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
08:29 PM Revision d10e1f46 (ceph): mon: fix slurp_latest to fill in any missing incrementals
Fixes #1789.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
06:39 PM Bug #2115: OSD failed to start: Operation not permitted
See attachment please soft crack
09:17 AM Bug #2115: OSD failed to start: Operation not permitted
Can you attach the actual log? I want to make sure there is no subtle difference in the output. Thanks! Sage Weil
01:40 AM Bug #2115: OSD failed to start: Operation not permitted
ceph version 0.42.2 (commit:732f3ec94e39d458230b7728b2a936d431e19322) soft crack
01:38 AM Bug #2115 (Rejected): OSD failed to start: Operation not permitted
I'm setting up a new ceph cluster on ubuntu 11.10 with kernel version 3.0.0-16-server x86_64. The osd server failed t... soft crack
05:57 PM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
To be clear, I didn't try and generate the actual failure condition that was causing an assert before — that should b... Greg Farnum
05:55 PM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
wip-2086 should fix this.
Ran a simple test:...
Greg Farnum
05:27 PM Messengers Bug #2086 (In Progress): msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
Greg Farnum
04:51 PM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
Okay, looks like the local_pipe doesn't get its message queue cleared...I'm checking the others and looking at how it... Greg Farnum
05:50 PM Revision 999e2192 (ceph): peer: ignore +scrubbing portion of pg state
It can cause the mon state and osd states to not match. Sage Weil
05:33 PM Revision 7b48cca1 (ceph): test_osd_types: fix unit test for new pg_t::is_split() prototype
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:30 PM Revision fd0712df (ceph): Makefile: drop separate libjson_spirit.la
automake seems to have difficulty with the .la dependency on another .la.
Since libjson_spirit.la is only used by lib...
Sage Weil
05:26 PM Revision edd35c04 (ceph): osd: drop useless ENOMEM check
new throws exception; doesn't return NULL.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:11 PM Revision a7de459f (ceph): ceph-osd: clarify error messages
So we know where the error came from. And use real error codes in init().
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:10 PM Revision 97926e18 (ceph): init: Actually do start the daemons when 'service ceph start <type>' is...
A bug in my previous patch prevented any daemon with auto_start set to false from starting.
This patch allows:
* /et...
Wido den Hollander
04:55 PM rgw Bug #2120: rgw: atomic write guard doesn't scale well
Implementing #1956 would solve this issue, and would make the entire atomic scheme simpler.
Yehuda Sadeh
03:03 PM rgw Bug #2120: rgw: atomic write guard doesn't scale well
This was reported by a user through the ml. We should figure out with that user whether it's a real issue, or a red h... Yehuda Sadeh
02:51 PM rgw Bug #2120: rgw: atomic write guard doesn't scale well
Do we care? You can't do partial updates to objects IIRC, so many writers pretty much has to be wrong somehow or other. Greg Farnum
02:35 PM rgw Bug #2120 (Resolved): rgw: atomic write guard doesn't scale well
shen there is a large number of writers to the same object. Yehuda Sadeh
04:48 PM rgw Bug #2106 (Resolved): failed s3tests.functional.test_s3.test_100_continue
Machines were running wrong apache and fastcgi modules. Yehuda Sadeh
04:23 PM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
This may be a messenger issue, but it's not losing that initial message — notice how osd5 tries to send a ping back t... Greg Farnum
11:26 AM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
the other side of this conversation is... Sage Weil
11:20 AM Bug #2116 (In Progress): Repeated messages of "heartbeat_check: no heartbeat from"
looks like a msgr issue?... Sage Weil
07:35 AM Bug #2116 (Resolved): Repeated messages of "heartbeat_check: no heartbeat from"
As discussed on the ml I gathered some logs.
Today I upgraded my whole cluster to 0.42.2 from 0.41.
Due to the ...
Wido den Hollander
12:54 PM Bug #1789 (Resolved): mon: failed assert(paxosv == pg_map.version)
Pushed to master in commit:d10e1f46df8cc252f2f1d57cf5e577ea38eee1ae Greg Farnum
12:48 PM Bug #1789: mon: failed assert(paxosv == pg_map.version)
Okay, figured it out. Our current slurp code pulls in all the incrementals, then sends off a request for latest_stash... Greg Farnum
12:01 PM Bug #2119 (Resolved): osd: do_query to !up osd
... Sage Weil
11:09 AM Bug #2118: osd: flawed commit_op_seq check on startup
Sage Weil
10:08 AM Bug #2118 (Resolved): osd: flawed commit_op_seq check on startup
the check that current/commit_op_seq == newest snap is flawed because ceph-osd can write a new current/commit_op-seq ... Sage Weil
10:09 AM Bug #2104 (Won't Fix): teuthology: wait_for_clean doesn't wait for last_epoch_started to propagate
Sage Weil
10:09 AM Bug #2107 (Resolved): teuthology: lost_unfound fails pg state assert
Sage Weil
09:41 AM devops Feature #2117 (New): qa: gitbuilder that does ENCODE_DUMP
Sage Weil

02/27/2012

11:41 PM Revision f317028f (ceph): doc: beginnings of documentation of stuck pgs and pg states
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Reviewed-by: Sage Weil <sage@newdream.net>
Sage Weil
11:13 PM Revision 19170241 (ceph): filestore: make less noise on ENOENT
Don't generate high-level log spam on every open error.
Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Sa...
Sage Weil
10:52 PM Revision 722af1a4 (ceph): no peer as part of lost_unfound
Sage Weil
10:49 PM Revision 244b7029 (ceph): pg: use get_cluster_inst instead of get_inst in activate
This was mistakenly broken in 4b3bb5ab37a05fa001d59f24da7d9c30d650321b
Signed-off-by: Greg Farnum <gregory.farnum@dr...
Greg Farnum
10:37 PM Revision f02195b4 (ceph): Merge branch 'wip-split2'
Reviewed-by: Samuel Just <samuel.just@dreamhost.com> Sage Weil
10:35 PM Revision b6a04174 (ceph): osd: pg_t::is_split(): make children out param a pointer, and optional
Also unit test it.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:18 PM Revision 85ed06e9 (ceph): osd: bypass split code
Until it is fully implemented. It's also disabled in the monitor
currently, but just in case it gets into the OSDMap...
Sage Weil
10:16 PM Revision 15d53249 (ceph): osd: fix pg locking flags
Two things we need to handle:
- callers who already hold map_lock (split_pg())
- callers who already hold another ...
Sage Weil
10:04 PM Revision fc7b11a9 (ceph): osd: partially refactor pg split
This partially refactors the OSD split code to do the split synchronously
when processing a new OSDMap. It is incomp...
Sage Weil
07:44 PM Revision 6a081888 (ceph): osd: factor hobject key into child pgid calc during split
When we calculate the object's new pg, take the locator key into
consideration, to avoid a crash like
osd/OSD.cc: In...
Sage Weil
07:44 PM Revision d9cf3322 (ceph): osd: implement pg_t::is_split()
Test to determine if a pg has split between two pool sizes, and if so,
what its children are.
Signed-off-by: Sage We...
Sage Weil
07:39 PM Revision ee4d9909 (ceph): journaler: log on unexpected objecter error
This will help with #2110, #1796, #1640.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:56 PM Revision 91b119a0 (ceph): osd: fix recursive map_lock via check_replay_queue()
Also drop activate_pg() helper while we're at it, so it's clear that we
are the only user.
recursive lock of OSD::ma...
Sage Weil
04:20 PM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
The guards for something like that shouldn't be too complicated to set up...actually, I thought they were at one poin... Greg Farnum
04:19 PM Bug #1789 (In Progress): mon: failed assert(paxosv == pg_map.version)
Iiiinteresting. This assert is the post-update check, after loading and running through all the incrementals. (Meanin... Greg Farnum
01:41 PM Bug #1789: mon: failed assert(paxosv == pg_map.version)
Shouldn't be related — this is a problem with a single monitor daemon and the other is a write problem that an MDS is... Greg Farnum
12:35 PM Bug #1789: mon: failed assert(paxosv == pg_map.version)
Core dump attached. Dumb thought: could this be related to http://tracker.newdream.net/issues/2110, they happened wit... Matthew Roy
10:14 AM Bug #1789: mon: failed assert(paxosv == pg_map.version)
Crash occurred on the third monitor when starting after being down for several hours shortly after cluster creation. ... Matthew Roy
02:07 PM CephFS Bug #2110 (Duplicate): osdc/Journaler.cc: 360: FAILED assert(r >= 0)
#1796 Sage Weil
01:40 PM CephFS Bug #2110: osdc/Journaler.cc: 360: FAILED assert(r >= 0)
can you attach ceph-mds too? or better yet, fire up gdb ceph-mds core and print out the value of r from that frame. ... Sage Weil
12:00 PM CephFS Bug #2110: osdc/Journaler.cc: 360: FAILED assert(r >= 0)
Sage Weil wrote:
> Do you have a core file? I'm curious what the value of 'r' is.
Attached. Probably. (datetime ...
Matthew Roy
11:43 AM CephFS Bug #2110: osdc/Journaler.cc: 360: FAILED assert(r >= 0)
Do you have a core file? I'm curious what the value of 'r' is. Sage Weil
11:40 AM CephFS Bug #2110 (Duplicate): osdc/Journaler.cc: 360: FAILED assert(r >= 0)
Assert in MDS. This cluster was running a CephFS home directory workload with one active MDS and one MDS in standby r... Matthew Roy
01:49 PM Bug #2045 (Need More Info): osd: dout_lock deadlock
Sage Weil
01:33 PM Feature #2114 (Resolved): old sepia setup on new hardware
Sage Weil
01:31 PM Feature #2113 (Resolved): objectcacher perfcounters
Sage Weil
01:18 PM Feature #2112 (Resolved): msgr fault injection
Sage Weil
01:18 PM Feature #2111 (Fix Under Review): msgr workloads
Develop the interfaces which will allow us to break messenger sockets at precisely-defined points.
Allow comparison ...
Sage Weil
11:38 AM Tasks #2109: qa/benchmark: Explore using Filebench for benchmarks / stress testing
Justification and a good intro: http://cuddletech.com/blog/pivot/entry.php?id=949 Anonymous
11:36 AM Tasks #2109 (New): qa/benchmark: Explore using Filebench for benchmarks / stress testing
http://filebench.sourceforge.net/
"Ships with more than 40 pre-defined personalities, including the one that descr...
Anonymous
11:05 AM Feature #2108 (New): track object states to inform error injection/testing
Sage Weil
11:04 AM Feature #1412 (Resolved): qa: spec out messenger testing
we now have a high-level plan on how to attack msgr testing. Sage Weil
10:03 AM Bug #1977: mon: ceph command hang
Pretty sure you pushed changes the day you filed it (note reference in previous message), although I can't find the e... Greg Farnum
09:51 AM rgw Bug #2106: failed s3tests.functional.test_s3.test_100_continue
Strange, I can see the request in the apache logs, but not in the rgw logs.... Yehuda Sadeh
09:12 AM Bug #2107 (Resolved): teuthology: lost_unfound fails pg state assert
ubuntu@teuthology:/a/nightly_coverage_2012-02-27-a/14063... Sage Weil
04:56 AM Revision 402ece5e (ceph): init-ceph: stick with /var/run for the time being
/run isn't present on older systems. Stick with the old location until it
is more pervasive, or we add an autoconf o...
Sage Weil
04:47 AM Revision 41295b58 (ceph): debian: /var/run/ceph -> /run/ceph
/run/ceph should exists for creating UNIX domain sockets
ceph uses UNIX domain sockets for internal communication. Cr...
Laszlo Boszormenyi
04:45 AM Revision 0d8b5756 (ceph): debian: build-{indep,arch}
Signed-off-by: Laszlo Boszormenyi <gcs@debian.hu> Laszlo Boszormenyi
04:45 AM Revision 3ad6ccb4 (ceph): debian: sdparm|hdparm, new standards version
Signed-off-by: Laszlo Boszormenyi <gcs@debian.hu> Laszlo Boszormenyi
01:09 AM Revision 9afafdf1 (ceph): move peer to separate test for now
Sage Weil

02/26/2012

08:56 PM Bug #1977: mon: ceph command hang
Hmm, I wonder if somehow misdiagnosed this, or inadvertantly fixed it: haven't seen this hang in weeks, and it happen... Sage Weil
05:09 PM rgw Bug #2106 (Resolved): failed s3tests.functional.test_s3.test_100_continue
... Sage Weil
05:02 PM Bug #2022: osd: misdirectect request
ubuntu@teuthology:/a/nightly_coverage_2012-02-26-a/13876$ grep WRN ceph.log
2012-02-26 01:18:03.166529 osd.1 10.3.1...
Sage Weil
11:19 AM Bug #2105 (Resolved): filestore: mkfs does not create initial snap
This bug almost the same as this bug:http://tracker.newdream.net/issues/1707
I followed the instruction:http://ceph....
Yunpeng Gao
05:35 AM Revision 6295578f (ceph): lost_unfound: do peer after, until wait_for_clean propagates last_epoch...
The peer task does wait_for_clean, and then lost_unfound immediately marks
something down. But the PGs become clean ...
Sage Weil
05:05 AM Revision 84cd4ed6 (ceph): peer: wait for peering to complete, or block
We need to wait for peering to either complete, or block because it is
waiting for another PG. _Then_ look at all th...
Sage Weil

02/25/2012

09:33 PM Bug #2104 (Won't Fix): teuthology: wait_for_clean doesn't wait for last_epoch_started to propagate
Sage Weil
09:06 PM Bug #2103 (Resolved): osd: lockdep error on watch_lock
... Sage Weil
09:04 PM Bug #2102 (Can't reproduce): osd: pg stuck in backfill
... Sage Weil
05:39 AM Revision d944e7ee (ceph): fix lockdep.yaml conf syntax
Sage Weil
01:01 AM Revision 266902a9 (ceph): rgw: initialize bucket_id in bucket structure
might make valgrind a little bit less noisy.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
12:07 AM Revision 4a5a0911 (ceph): Merge branch 'master' of ssh://github.com/NewDreamNetwork/ceph
Yehuda Sadeh

02/24/2012

11:32 PM Revision f8f6e4d8 (ceph): rgw: _exit(0) on SIGTERM
We need to do something a bit smarter to get coverage information, but this
is a start.
Signed-off-by: Sage Weil <sa...
Sage Weil
11:20 PM Revision 5d5a022c (ceph): run radosgw through valgrind for s3tests
Sage Weil
11:05 PM Revision edbb41e1 (ceph): add peer task
Force a pg to get stuck in 'down' state, verify we can query the peering
state, then start the OSD so it can recover.
Sage Weil
11:04 PM Revision c9c1a4ab (ceph): do peer test along with lost_unfound
Sage Weil
11:01 PM Revision b8739585 (ceph): peer: remove unused variable
Josh Durgin
10:56 PM Revision 62bda127 (ceph): misc: always return a usable result from get_valgrind_args
Josh Durgin
10:56 PM Revision e4801819 (ceph): rgw: simplify valgrind args
Josh Durgin
09:52 PM Revision 708be0a5 (ceph): Merge remote branch 'gh/wip-crush-adjust'
Reviewed-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Sage Weil
09:48 PM Revision b0feba56 (ceph): Merge remote branch 'gh/wip-mds-resetter'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Sage Weil
09:43 PM Revision 5c6e8b37 (ceph): Merge branch 'wip-pg-query'
Reviewed-by: Samuel Just <samuel.just@dreamhost.com> Sage Weil
09:22 PM Revision 008ce6b2 (ceph): Merge branch 'stable'
Sage Weil
09:00 PM Revision 732f3ec9 (ceph): v0.42.2
Sage Weil
09:00 PM Revision 321ba67f (ceph): Merge remote-tracking branch 'gh/stable' into stable
Sage Weil
08:54 PM Revision be761149 (ceph): Merge branch 'stable'
Sage Weil
08:49 PM Revision fc531a91 (ceph): rename valgrind -> verify, add in runs under lockdep
Sage Weil
08:42 PM Revision c43e87d1 (ceph): ceph_manager: list_pg_missing
List missing objects for the given pgid. Sage Weil
08:42 PM Revision 7ac04a42 (ceph): lost_unfound: list missing/unfound for each pg and verify the unfound c...
This also tests the pg list_missing functionality. Sage Weil
08:40 PM Revision d85ed91c (ceph): osd: fix array index
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:40 PM Revision 722e9e59 (ceph): lockdep: don't make noise on startup
Who cares!
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:40 PM Revision fdaed0a7 (ceph): formatter: fix trailing dump_stream()
Flush a previous dump_stream() if it was the last thing prior to a
close_section().
Signed-off-by: Sage Weil <sage.w...
Sage Weil
08:05 PM Revision 7bf64b73 (ceph): rgw: accept dict
e.g.,
tasks:
...
- rgw:
client.0:
client.1:
Sage Weil
08:05 PM Revision e2ea73d1 (ceph): rgw: add valgrind support
tasks:
- ceph:
- rgw:
client.a:
valgrind: [--tool=memcheck]
Sage Weil
08:05 PM Revision 7af6e46c (ceph): ceph: always try to process valgrind logs
Check for errors in valgrind logs even if there is no valgrind option
the ceph task config stanza. Other tasks can r...
Sage Weil
08:05 PM Revision 90fdc840 (ceph): ceph: always create valgrind logs dir
Other tasks use it too. It's more annoying to conditionally create it. Sage Weil
08:05 PM Revision 9ec04722 (ceph): refactor all valgrind users to use a get_valgrind_args() helper
This avoids much annoying, duplicated code. Sage Weil
08:05 PM Revision 3bfb8d69 (ceph): ceph, ceph-fuse: simplify valgrind argument additions
Josh Durgin
08:05 PM Revision c93a08ed (ceph): Whitespace and unnecessary formatting fixes
Josh Durgin
08:04 PM Revision 7ad35ce4 (ceph): osd: include timestamps in state json dumps
Include the time we entered this state in the dump.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:00 PM Revision 185c6b1f (ceph): Merge branch 'wip-2007'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
07:59 PM Revision e22adac2 (ceph): osd: use blocks for readability in list_missing
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:33 PM Revision e22a45a1 (ceph): osd: query recovery state machine
For now, just append this to the end of the pg <pgid> query json dump.
We definitely want to do something smarter her...
Sage Weil
07:33 PM Revision a7c8bfbe (ceph): osd: query Peering substates
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:33 PM Revision 6d90a6dd (ceph): osd: dump recovery_state states in json
Use a formatter. Present a vector of states, inner to outer.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:24 PM Revision d3b203af (ceph): osd: add tunable for number of records in osd command replies
e.g., 'pg <pgid> list_missing [offset]'.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:24 PM Revision 0361a3c4 (ceph): osd: pass in data to do_command
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:24 PM Revision 2677c72f (ceph): add libjson_spirit.la
This is lightweight and relies on boost spirit, which we already use, so
there are no new dependencies.
There were s...
Sage Weil
07:24 PM Revision 6c257c4d (ceph): hobject_t: decode json
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:24 PM Revision 91fbc687 (ceph): osd: 'pg <pgid> list_missing <json hobject_t offset>'
Dump missing objects in json. If more key is non-zero, user should ask for
more by passing the last object as the of...
Sage Weil
07:24 PM Revision c9416e61 (ceph): osd: 'tell osd.N mark_unfound_lost revert' -> 'pg <pgid> mark_unfound_l...
More consistent interface.
Fixes: #2030
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Reviewed-by: Josh Durgin ...
Sage Weil
07:15 PM Revision 64038524 (ceph): lockdep: warn on stderr (via derr), not stdout
Otherwise we screw up ceph-conf output and the like.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:15 PM Revision 804f243b (ceph): do_autogen.sh: -T for --without-tcmalloc
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:30 PM Feature #2054 (Resolved): teuthology: run radosgw through valgrind
ok, this now works with yaml like... Sage Weil
01:52 PM Feature #2006 (Resolved): osd: report what is blocking peering completion
commit:5c6e8b3795d0cf58814619bfc15cb0841e9a4f17 Sage Weil
01:51 PM CephFS Bug #1792 (Can't reproduce): crash in ceph-mds
even if we could, we would never know, since there isn't any distinguishing info here, and the teuth archive is gone. Sage Weil
01:48 PM RADOS Bug #2096 (Resolved): crush: adjust weight broken for tree, list buckets
commit:708be0a5abef63a5da8409ad13719adb7bb744f8 Sage Weil
01:47 PM RADOS Feature #2101 (Resolved): crushtool: check for weight overflow on reweight
Sage Weil
11:56 AM Feature #2007 (Resolved): osd: enumerate unfound, lost objects, possible locations
Sage Weil
09:52 AM Feature #2007: osd: enumerate unfound, lost objects, possible locations
wip-2007 Sage Weil
11:34 AM Feature #2030 (Resolved): osd: clean up mark_unfound api
Sage Weil
10:34 AM Messengers Feature #2100 (Resolved): msgr: Prevent throttled clients from slowing down non-throttled connect...
Right now, it seems a throttled connection will still receive a TCP receive buffer's worth of data, but because the u... Anonymous
09:15 AM Linux kernel client Bug #2099: messenger: unexpected socket state (4)
I don't think any of these other states are necessarily problematic, as long as the socket eventually ends up in CLOS... Sage Weil
08:49 AM Linux kernel client Bug #2099: messenger: unexpected socket state (4)
This may be related to http://tracker.newdream.net/issues/1803 and http://permalink.gmane.org/gmane.comp.file-systems... Josh Durgin
08:33 AM Linux kernel client Bug #2099: messenger: unexpected socket state (4)
Adding that I see more of the same WARNING() messages in the log for
the same state, as well as others for state 5, ...
Alex Elder
08:13 AM Linux kernel client Bug #2099 (Rejected): messenger: unexpected socket state (4)
Running tests defined by the YAML file below. Note that branch
wip-messenger is 107a8aaf21d01ee6cbc7a638faf1328f2bd...
Alex Elder
07:59 AM CephFS Bug #2092: BUG at fs/ceph/caps.c:999
mdsc->mutex protects the globalish mds client state (request/session lists), which is different from session->s_mutex... Sage Weil
06:57 AM CephFS Bug #2092: BUG at fs/ceph/caps.c:999
Just a quick look at this.
Here's the code:
static void __queue_cap_release(struct ceph_mds_session *session,
...
Alex Elder
06:10 AM Bug #2091 (Can't reproduce): corrupt v5 inc osdmap
logs don't go far enough back. :(
moral of the story: next time grab the full mon data dir immediately in case it...
Sage Weil
05:57 AM Linux kernel client Bug #1907 (Resolved): rbd: don't reuse device ids while they're still in use elsewhere
Committed a couple of weeks ago and has seen no bad effect during the
intervening testing. So I'm marking this one ...
Alex Elder
04:22 AM Revision 5efa821c (ceph): rgw: swift read acls allow bucket listing
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
04:11 AM Revision f09fb870 (ceph): rgw: fix swift acl enforcement
we'll also need to make it so that swift read acls allow bucket listing
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdr...
Yehuda Sadeh
04:09 AM Revision d40a9b27 (ceph): lost_unfound: new mark_unfound_lost syntax
Sage Weil
02:58 AM Revision 7c7349ef (ceph): ceph: fix help.t
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:48 AM Revision 730b9ee0 (ceph): ceph-dencoder: man page
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
02:48 AM Revision f6e42a8b (ceph): ceph.spec.in: add ceph-dencoder
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
02:48 AM Revision 0281f1c6 (ceph): debian: add ceph-dencoder
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
02:48 AM Revision c3e1291d (ceph): v0.42.1
Sage Weil
02:13 AM Revision cbf79a97 (ceph): ceph-tool: remove reference to "stop" command
This doesn't exist any more, and I don't think it
ever "cleanly shut down the filesystem" -- certainly not
within my ...
Greg Farnum
02:13 AM Revision 3bad945b (ceph): mds: remove unused MDBalancer dump_pop_map() function.
Commenting it out is not the right answer. ;)
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Reviewed-by:...
Greg Farnum
01:22 AM Revision 4dfec574 (ceph): rgw: enforce swift acls
doesn't work yet, but almost.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
01:07 AM Revision 81a46c46 (ceph): dump_stuck: flush stats before waiting for recovery/clean
Josh Durgin
12:35 AM Revision 159f2b86 (ceph): mds: fix Resetter locking
We need to hold the lock for ms_dispatch, esp calls into objecter. We
should only drop it when blocking; use distinc...
Sage Weil
12:35 AM Revision 065d6dd8 (ceph): mds: clean up useless block
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

02/23/2012

11:34 PM Revision f5bf9d9c (ceph): rgw: s3 only shows s3 acls
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:33 PM Revision c88da93e (ceph): Merge remote branch 'origin/wip-mds-old-inodes'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
11:06 PM Revision 5aa60ce4 (ceph): Merge remote branch 'origin/wip-dencoder'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
11:06 PM Revision db99217b (ceph): Merge remote branch 'origin/wip-1820'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
11:05 PM Revision e8bc42ff (ceph): osd: only set CLEAN when we are not remapped (up == acting)
If we have a temporary mapping for this PG, consider that unclean. This
makes CLEAN and REMAPPED mutually exclusive....
Sage Weil
10:59 PM Revision 4d1d5229 (ceph): rgw: show swift ACLs
Yehuda Sadeh
10:56 PM Revision d8df5655 (ceph): Merge remote-tracking branch 'gh/wip-pg-query'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
10:55 PM Revision ddc99983 (ceph): osd: conditionally encode old pg_pool_t when no CEPH_FEATURE_OSDENC
This fixes OSDMap compatibility between v0.42 and <v0.42.
For MOSDMap, reencode maps if OSDENC feature is missing. ...
Greg Farnum
10:38 PM Revision cd9f7df9 (ceph): Merge remote-tracking branch 'gh/wip-dump-ops-in-flight'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
10:28 PM Revision 079dd6db (ceph): mon: mds "stop" -> "deactivate"
See #1820.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:28 PM Revision a1544c0e (ceph): doc: 'deactivate mds' instead of 'stop mds'
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:28 PM Revision d85e9153 (ceph): mon: use pending_mdsmap for deactivate
We should always look at the proposed map to avoid weird races.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:56 PM Revision 2824c07f (ceph): rgw: can use swift to set bucket permissions
Currently only setting, not reading. Also, at the moment it's
setting the wrong permissions.
Signed-off-by: Yehuda S...
Yehuda Sadeh
08:12 PM Revision 700fe079 (ceph): test: add basic test for the OSD's dump_ops_in_flight adminsocket command
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:12 PM Revision 5944016b (ceph): osd: add "dump_ops_in_flight" to the AdminSocket.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:08 PM Revision 21c43133 (ceph): mon: refuse to stop mds if max_mds will make it rejoin
Otherwise the MDS will leave the cluster and immediately rejoin, which is
useless and confusing to users. See #1820....
Sage Weil
08:07 PM Feature #2030: osd: clean up mark_unfound api
wip-2030
Sage Weil
07:53 PM Revision 7700ea94 (ceph): crushtool: add --reweight-item cli tests
Test list, tree, and straw buckets.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:39 PM Revision 286df2db (ceph): crush: fix weight adjust for list, tree buckets
Fix the typo. Code now matches that for straw buckets.
Reported-by: ZhuRongze <zrz4ceph@gmail.com>
Signed-off-by: S...
Sage Weil
07:16 PM Revision 963dec82 (ceph): Merge branch 'wip-2090'
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> Sage Weil
07:15 PM Revision d1fe2f8f (ceph): mon: deprecate mon 'stop' command
Send SIGTERM.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:15 PM Revision 62a113aa (ceph): mon: unlock mon before msgr shutdown
The ceph_mon.cc main() will delete mon when the msgr dispatch thread
completes. Make sure we unlock before we shut d...
Sage Weil
07:14 PM Revision 962aa3ea (ceph): msgr: join dispatch_thread after it completes
This is just for completeness. No change in behavior, since we don't
get here until the thread has signaled it is do...
Sage Weil
07:04 PM Revision d8192222 (ceph): Merge remote-tracking branch 'gh/wip-stop'
Sage Weil
06:52 PM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
it did. probably a race with another thread in connect() or accept() reregistering a new Pipe.. connect() pbly
Sage Weil
06:47 PM Messengers Bug #2086: msgr: msg/SimpleMessenger.h: 203: FAILED assert(!i->second->is_on_list())
We sure this was run including commit:ebbfdefa120ae93b95780c67027ec9efd4b7b5cd? Greg Farnum
05:51 PM Revision 86a54a6e (ceph): filestore: use IOC_CLONERANGE intead of IOC_CLONE ioctl
This is functionally equivalent, except that valgrind doesn't complain
about a bad pointer passed to an ioctl.
Signe...
Sage Weil
05:43 PM Revision 49588e94 (ceph): osd: drop "stop" command
Send SIGINT.
Fixes: #1820
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:42 PM Revision 560ddf46 (ceph): osd: drop unused "stop" check
This is never reached: both callers handle "stop" explicitly.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:39 PM Revision 64ca584d (ceph): osd: don't complete recovery if unfound
Otherwise we fail the !needs_recovery() assert. Because we aren't
recovered. For example,
2012-02-21 16:16:13.1046...
Sage Weil
04:38 PM Feature #2006 (In Progress): osd: report what is blocking peering completion
wip-pg-query Sage Weil
04:07 PM Bug #2098 (Resolved): xfs/ext4 non-idempotent transaction
Forcing a sync after a non-idempotent transaction is not adequate to ensure correctness during journal replay.
Con...
Samuel Just
03:36 PM Bug #1820 (Resolved): deprecate "ceph stop"
Sage Weil
02:37 PM Bug #1820: deprecate "ceph stop"
ok, tested all this in wip-1820. 'deactivate' already moves the ceph-mds to standby (not exit), all good there.
n...
Sage Weil
11:30 AM Bug #1820: deprecate "ceph stop"
yeah. i think the simplest is to make 'leave' refuse if it's is < max_mds.
and we could drop max mds from the cep...
Sage Weil
11:22 AM Bug #1820: deprecate "ceph stop"
Oh, I've talked of this before. It might be nice to have a "start ceph-mds only to process a leftover journal and han... Anonymous
11:19 AM Bug #1820: deprecate "ceph stop"
Changing docs is easy, and the branches already rip out "documented" commands. Let's just make it make sense.
I wo...
Anonymous
11:04 AM Bug #1820: deprecate "ceph stop"
It can easily go back into standby (via the respawn() -> execve() path) instead of shutting down. Then it's really "... Sage Weil
10:54 AM Bug #1820: deprecate "ceph stop"
On termination the process exits. On receipt of a stop command it exports authority over the filesystem hierarchy to ... Greg Farnum
10:52 AM Bug #1820: deprecate "ceph stop"
Tommi Virtanen wrote:
> Greg, how is "ceph mds stop 0" different from that ceph-mds receiving a local request to ter...
Anonymous
10:51 AM Bug #1820: deprecate "ceph stop"
Greg, how is "ceph mds stop 0" different from that ceph-mds receiving a local request to terminate (e.g. SIGTERM)? Anonymous
10:49 AM Bug #1820: deprecate "ceph stop"
No, the important part is the hierarchy authority export. Then it shuts down; it's not a "go standby". I guess you co... Greg Farnum
10:48 AM Bug #1820: deprecate "ceph stop"
Which makes me think, is the concept of "go standby" of any value, if there's something that'll automatically say the... Anonymous
10:44 AM Bug #1820: deprecate "ceph stop"
It sounds like that does two things: move the MDS from active to standby, and terminate it. And we're removing the "r... Anonymous
10:31 AM Bug #1820: deprecate "ceph stop"
That one is a bit different.. it's instructing ceph-mds to export all of it's metadata to another node and leave the ... Sage Weil
10:11 AM Bug #1820: deprecate "ceph stop"
Yeah. I can't speak for the threading & locking changes, but the command removal is trivial.
That still leaves
...
Anonymous
09:51 AM Bug #1820: deprecate "ceph stop"
wip-stop and wip-2090 Sage Weil
03:35 PM Bug #2095 (Resolved): osd: need feature bit for v0.42 osdmap encoding change
commit:ddc99983228e761f754e0038aecbe341d7e2181f Sage Weil
09:27 AM Bug #2095: osd: need feature bit for v0.42 osdmap encoding change
we had a feature bit already, we just needed to conditonally encodee the old format, and tweak MOSDMap to reencode ma... Sage Weil
03:16 PM Bug #2094 (Resolved): osd: pgs remapped to down+out osd
making remapped and clean mutually exclusive. commit:e8bc42ff435e5648b88b818775d8fa47989af5dc Sage Weil
10:43 AM Bug #2094: osd: pgs remapped to down+out osd
Reproduced again with stats flushing. This seems to happen every time with this configuration (maybe having only 2 os... Josh Durgin
03:14 PM Bug #2091: corrupt v5 inc osdmap
ok.. yeah, it looks like the monitor may have published a bad inc update or something? unclear. i'll check with the... Sage Weil
03:11 PM Bug #2091: corrupt v5 inc osdmap
OK, picking a few things out of the original corruption report.
The basic header stuff is the same as before, as e...
Alex Elder
02:48 PM Feature #2015 (Resolved): osd: dump in-flight ops via admin socket
Sage Weil
02:37 PM CephFS Feature #2097 (Rejected): mds: 'ceph mds activate <gid>'
ability to explicitly instruct a standby mds to join the active cluster. Sage Weil
12:04 PM Messengers Bug #1985 (Won't Fix): msgr: creating new Pipe for pre-existing connection leaks Pipe if they don...
at least until we demonstrate the problem (after the msg leak fix). this will probably be moot after refactoring som... Sage Weil
12:01 PM RADOS Bug #2096: crush: adjust weight broken for tree, list buckets
wip-crush-adjust Sage Weil
10:48 AM RADOS Bug #2096 (Resolved): crush: adjust weight broken for tree, list buckets
... Sage Weil
11:25 AM Bug #2090 (Resolved): mon: assertion failed on shutdown
commit:963dec82880717054c760a745cf93cc7b43112df Sage Weil
09:06 AM Bug #2080 (Resolved): osd: scrub on disk size does not match object info size
Sage Weil
05:24 AM Revision 3628f901 (ceph): mds: make EMetaBlob::fullbit::old_inodes non-ptr
No need to put this separately on the heap, as a static map<> isn't much
more expensive than a pointer. Also, this e...
Sage Weil
05:21 AM Revision 7842bb50 (ceph): mds: Add old_inodes to emetablob
Add information about old inodes to the mds journal.
Signed-off-by: Alexandre Oliva <oliva@lsd.ic.unicamp.br>
Signed...
Alexandre Oliva
05:08 AM Revision 26b56753 (ceph): Fix ceph-mds --journal-reset
Complete configuration initialization for special actions, and
hold Resetter lock while running reset.
Signed-off-by...
Alexandre Oliva

02/22/2012

10:12 PM Linux kernel client Cleanup #2093: ceph-client: messenger: the "to" parameter to read_partial() needs to go
I think it's right as is... all of those read calls are non-blocking. So the first time around in_base_pos is 0 and ... Sage Weil
05:28 PM Linux kernel client Cleanup #2093 (Resolved): ceph-client: messenger: the "to" parameter to read_partial() needs to go
I have been doing some refactoring of the net/ceph/messenger.c. One of
my aims was to understand the how (and why) ...
Alex Elder
09:33 PM Bug #2091: corrupt v5 inc osdmap
the first badness in the log is below. once it missed one incremental, things probably got out of sync and the pg_te... Sage Weil
09:28 PM Bug #2091: corrupt v5 inc osdmap
Oh.. that means the pg_temp mapping was inserted by a previous inc map, probably. we need to find the first instance... Sage Weil
06:23 PM Bug #2091: corrupt v5 inc osdmap
I've manually decoded the entire ceph_osdmap dumped in the log and everything
therein looks fine. (This was overkil...
Alex Elder
01:20 PM Bug #2091: corrupt v5 inc osdmap
I'm starting to look at this in detail but haven't concluded what went wrong yet.
Does it matter whether it was th...
Alex Elder
09:33 AM Bug #2091: corrupt v5 inc osdmap
reencoded to old format (using latest ceph-dencoder) gives us... Sage Weil
09:28 AM Bug #2091 (Can't reproduce): corrupt v5 inc osdmap
... Sage Weil
09:20 PM Bug #2090: mon: assertion failed on shutdown
... Sage Weil
09:20 PM Bug #2090: mon: assertion failed on shutdown
wip-2090 Sage Weil
05:04 AM Bug #2090 (Resolved): mon: assertion failed on shutdown
I was running repeated cycles of the kernel_untar_build.sh workunit
to try to reproduce a problem in the client and ...
Alex Elder
09:17 PM Bug #2095 (Resolved): osd: need feature bit for v0.42 osdmap encoding change
Sage Weil
07:02 PM Bug #2094 (Resolved): osd: pgs remapped to down+out osd
This is why the dump_stuck test fails on master. When one osd is marked out, the pg is remapped incorrectly:... Josh Durgin
10:06 AM Feature #2005 (Resolved): mon: track timestamps on pg states
Sage Weil
10:06 AM Feature #2058 (Resolved): ceph: query pg state
Sage Weil
10:03 AM Feature #2054: teuthology: run radosgw through valgrind
wip-valgrind Sage Weil
09:45 AM CephFS Bug #2092 (Can't reproduce): BUG at fs/ceph/caps.c:999
... Sage Weil
09:36 AM Bug #2022: osd: misdirectect request
hit this again:... Sage Weil
01:11 AM Revision 761ecc69 (ceph): Makefile: include encoding check scripts in dist tarball
This makes 'make distcheck' happy. Well, more happy at least; it's still
cranky but I can't tell why.
Signed-off-by...
Sage Weil
12:21 AM Revision 52a52cf4 (ceph): Add test for 'ceph pg dump_stuck'
Josh Durgin

02/21/2012

11:44 PM Revision a6c7f999 (ceph): ceph-dencoder: man page
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:44 PM Revision cd5a8f7e (ceph): ceph.spec.in: add ceph-dencoder
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:44 PM Revision 7fab4fa0 (ceph): debian: add ceph-dencoder
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:24 PM Revision 8c48a8e0 (ceph): rgw: read correct acls for swift metadata update ops
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:12 PM Revision 995dc1f7 (ceph): Add a task for testing stuck pg visibility.
Josh Durgin
11:12 PM Revision 2a1c74c5 (ceph): Move duration calculation to an internal task
This excludes all generic start up costs, like waiting for locks,
rebooting into a new kernel, etc.
Josh Durgin
11:08 PM Revision e67c0ff0 (ceph): osd: make object_info_t::dump using hobject_t and object_locator_t dum...
Makes the output more readable.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:04 PM Revision eb434a50 (ceph): Add necessary imports for s3 tasks, and keep them alphabetical.
Josh Durgin
11:04 PM Revision 1ac4bb10 (ceph): Add necessary imports for s3 tasks, and keep them alphabetical.
Josh Durgin
10:46 PM Revision f7feded0 (ceph): Merge remote-tracking branch 'gh/wip-dump-stuck-pgs'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
10:44 PM Revision 04c8e01d (ceph): Merge remote-tracking branch 'gh/wip-osd-write'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
10:43 PM Revision 55a60651 (ceph): osdmap: dump embedded crush map in Incremental::dump()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:39 PM Revision 2365c77a (ceph): rgw: maintain separate policies for object and bucket
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
10:39 PM Revision cc78fdaa (ceph): Merge branch 'wip-crush'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
10:37 PM Revision d2335fab (ceph): crush: write CrushWrapper:dump()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:58 PM Revision 174f6b84 (ceph): osd: refuse to return data payload if request wrote anything
Write operations aren't allowed to return a data payload because
we can't do so reliably. If the client has to resend...
Sage Weil
09:58 PM Revision 27c8a3f4 (ceph): test/rados-api/misc: fix LibRadosMisc.Operate1PP test
It's a mutation, so we get a result of 0 (or error).
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
09:51 PM Revision 270bb5cf (ceph): Merge branch 'wip-osdmap'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
09:50 PM Revision 7cafa255 (ceph): osdmap: dump fullmap from dump()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:43 PM Revision 80d86306 (ceph): Merge branch 'wip-1821'
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com> Sage Weil
08:23 PM Revision 11073e50 (ceph): s3roundtrip, s3readwrite: access key uses url safe chars
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
08:18 PM Revision 0e4367aa (ceph): rgw: accepted access key chars should be url safe
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
08:12 PM Revision 6e1b3a56 (ceph): rgw: access key uses url safe chars
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
08:12 PM Revision 92110e5a (ceph): rgw: access key uses url safe chars
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Yehuda Sadeh
06:02 PM Revision df5f5738 (ceph): add valgrind collection to regression suite
Run a smaller set of tests with valgrind on the mon, osd, and mds.
Valgrind is currently ignoring leaks, but this wi...
Sage Weil
05:29 PM Revision 17d38700 (ceph): rgw: don't invalidate cache when adding xattrs
Yehuda Sadeh
04:58 PM rgw Cleanup #2089 (Resolved): rgw: less dout(0) noise?
i think that's hwere this si coming from:... Sage Weil
03:32 PM Feature #1932 (Resolved): mon: before accepting a new crushmap, monitor should validate and test ...
Sage Weil
03:31 PM Feature #2088 (Rejected): msgr: refactor 2 threads to one
Sage Weil
03:30 PM Feature #1412 (New): qa: spec out messenger testing
Sage Weil
03:29 PM Feature #1412: qa: spec out messenger testing
er, wrong bug! Sage Weil
12:22 PM rgw Bug #2083 (Resolved): rgw: test_object_raw_authenticated* fail (on xfs?)
Should be fixed now. Updated relevant teuthology tests to use only url safe chars. Also updated rgw-admin to disallow... Yehuda Sadeh
10:34 AM rgw Bug #2083: rgw: test_object_raw_authenticated* fail (on xfs?)
Not really related to xfs. The problem is that when generating authenticated urls, boto doesn't escape the access key... Yehuda Sadeh
10:55 AM Feature #2087 (Resolved): lightweight filestore workload generator
simple program that uses FileStore and generates something that looks vaguely like what an OSD does. e.g.,
- stre...
Sage Weil
09:13 AM Bug #2084: segfault in tcmalloc
and again (hammer b.yaml). right before the crash sched_scrub() was called...... Sage Weil
04:40 AM Revision cedb3d73 (ceph): ceph: if 'pg <pgid> ..' doesn't parse a pgid, send to mon
E.g., 'pg dump'. Sigh.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:03 AM Revision 9927671b (ceph): Makefile: fix misplaced unit tests
These weren't run on make check because they were defined in the wrong
spot.
Signed-off-by: Sage Weil <sage@newdream...
Sage Weil
12:03 AM Revision 1ff75684 (ceph): hobject_t: remove unused back_up_to_bounding_key()
This was a path not taken in the backfill code.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
 

Also available in: Atom