Project

General

Profile

Activity

From 03/20/2012 to 04/18/2012

04/18/2012

11:37 PM Revision 7fbc0fcc (ceph): msgr: add error code returns to send_keepalive.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
11:27 PM Revision cb43eeee (ceph): msgr: add some validity checks to ~SimpleMessenger()
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
10:54 PM Revision 0c2c18c1 (ceph): msgr: convert global_seq_lock to a spinlock.
Because nobody wants to get put to sleep waiting on a unique integer.
(I would just make this an atomic_t but we need...
Greg Farnum
10:51 PM Bug #2263: obsync: move man page to section 1
Sage Weil
10:45 PM Revision dc4c759c (ceph): msgr: convert get_policy() to return a const reference.
The only user who didn't want a const should have been using
direct access anyway (and now is doing so).
Signed-off-...
Greg Farnum
10:45 PM Revision aab516da (ceph): rgw: allow subuser creation on radosgw-admin user create
This fixes issue #2312: allow the creation of both user
and a subuser in a single radosgw-admin command.
Signed-off-...
Yehuda Sadeh
10:44 PM Revision c9adf07b (ceph): msgr: move SimpleMessenger function declarations into good order
We now have one declaration each of private and protected, and two
declarations of public -- one for the Messenger-re...
Greg Farnum
10:42 PM CephFS Bug #2293 (Resolved): admin sockets don't persist with ceph-fuse
commit:e82c33099a0efda027bc7fa991dcd2073baea539 Sage Weil
10:23 PM Revision dbe779a5 (ceph): osd: dump old ops singly rather than all at once.
Fixes #2269. Convert the OpTracker::check_ops_in_flight interface
to take a vector<string> and create a separate warn...
Sage Weil
10:23 PM Revision b0532aa6 (ceph): osdmap: restructure flow in OSDMap::decode
Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil
10:19 PM Revision 7ae1aefa (ceph): gather logs for cfuse dbench workload, hopefully catch #1737
Sage Weil
10:13 PM Revision bc458bfd (ceph): ceph-object-corpus: prune archive
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:09 PM Revision ebb19e70 (ceph): test/encoding/import.sh: fix
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:53 PM Revision e1622e4f (ceph): rgw: HEAD bucket also dumps bucket meta info
Shows total objects, bytes used, actual bytes used.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
09:53 PM Revision 0769efa5 (ceph): rgw: add x-rgw-* as a possible meta header prefix
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
09:46 PM rgw Bug #2027: rgw -> apache miscommunication
Not completely unlikely. We can set it to "can't reproduce", and reopen if we see it again. Yehuda Sadeh
06:12 PM rgw Bug #2027: rgw -> apache miscommunication
do we think this is fixed now by the rgw throttling? Sage Weil
09:36 PM Revision 2de37e01 (ceph): Merge remote-tracking branch 'gh/wip_journal'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
09:10 PM Revision c3bfe456 (ceph): osdmap: remove bad cct reference
Not in OSDMap
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:28 PM Revision 9ef953b5 (ceph): Merge branch 'wip-2286-ratio-a'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
07:50 PM Revision eea982e5 (ceph): osdmap: workaround old bug that zeroed pool_max
Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>
Sage Weil
06:32 PM Revision ee22c97b (ceph): python-ceph: remove rgw bindings
We shouldn't expose these internal apis.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
06:21 PM Revision 407b2e0b (ceph): whitelist xfs_fsr syslog noise
Ignore lines like
2012-04-17T13:44:11-07:00 plana59 fsr[5454]: DEBUG: fsize=450560 blsz_dio=450560 d_min=512 d_max=2...
Sage Weil
06:10 PM Revision eb8a8fdf (ceph): pgmap: allow Incrementals to specify [near]full_ratios of 0
This commit isn't entirely safe: old monitors used 0 to mean "no change".
We can revert this (and the PGMonitor.cc po...
Greg Farnum
06:05 PM Revision 88525eaa (ceph): pgmon: add "set_full_ratio x" and "set_nearfull_ratio x" commands
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
06:05 PM Revision 882bd5b8 (ceph): pgmon: convert PGMonitor::prepare_command to our standard infrastructure.
So that I can add another command.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
06:05 PM Revision f1152e67 (ceph): mon: only fill in full/nearfull sets if the ratio > 0
This avoids putting all OSDs in both sets when the ratios are 0, as they
are with a fresh cluster and pgmap. This al...
Sage Weil
06:05 PM Revision 1242b754 (ceph): pgmon: remove the PGMonitor update_full_ratios stuff
Making it a config watcher is just a huge mess in terms of consistently
updating it appropriately.
The next commit wi...
Greg Farnum
06:05 PM Revision 841f2885 (ceph): mon: unconditionally encode PGMap full ratios in the Incremental
This properly spreads the real value to peon monitors -- they weren't
seeing the right values at all before.
Initiali...
Greg Farnum
06:00 PM Bug #2310: osd: too many open files
failed to capture a full strace.. try it again (once we find a failing osd on congress) with
strace -e trace=open,...
Sage Weil
09:48 AM Bug #2310 (Resolved): osd: too many open files
... Sage Weil
04:39 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
From teuthology:/a/nightly_coverage_2012-04-18-a/1602/teuthology.log:... Josh Durgin
03:57 PM rgw Bug #2312 (Resolved): rgw: create user and subuser in a single radosgw-admin command
Fixed, commit:aab516da7f89310445be4e4fb61836084d2dac32. Yehuda Sadeh
02:01 PM rgw Bug #2312 (Resolved): rgw: create user and subuser in a single radosgw-admin command
Yehuda Sadeh
03:41 PM Bug #2211 (Resolved): osd: entity_inst_t OSDMap::get_inst(int) const
Sage Weil
03:41 PM Bug #2262 (In Progress): qa: osd-recovery tasks fails on flush_pg_stats
Sage Weil
03:27 PM Feature #2314 (Resolved): remove localized pgs
Sage Weil
02:47 PM rgw Feature #2313: rgw: expose extra bucket info trough S3 api
Ok, let's just send those extra headers anyway. Otherwise we'd have some issue creating the request signature for the... Yehuda Sadeh
02:11 PM rgw Feature #2313 (Resolved): rgw: expose extra bucket info trough S3 api
syntax:
HEAD /<bucket>
X-RGW-Params: extrainfo
extra response headers:
X-RGW-Object-Count: <object count>
X-RG...
Yehuda Sadeh
02:23 PM Feature #2252 (Resolved): rgw long run kernels
Sage Weil
02:22 PM Feature #2250 (Resolved): rgw long run raid config
Sage Weil
02:14 PM Feature #2265 (Rejected): make sure objecter/kclient error out when localized pgs don't exist
Sage Weil
01:58 PM rgw Feature #2308 (Resolved): radosgw-admin: make user create idempotent
done, commit:5a6bbd0c473e15aa7642da367e7936015d19d77a. Yehuda Sadeh
01:46 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
And I gave him a patched monitor so he could set pg_num, which should fix it. Waiting to hear back, and will apply th... Greg Farnum
01:16 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
pushed workaround that will repair osdmaps that saw your corruption, commit:eea982e56739a7a91ca907ccc5c5ec1f78d9460d. Sage Weil
01:30 PM Bug #2311 (Resolved): rbd: delete + create image led to EEXIST
this is 'rbd writeback window' at its best. long live 'rbd cache'! Sage Weil
01:06 PM Bug #2311: rbd: delete + create image led to EEXIST
Congrats for closing the annoying ticket #2178 :-D
Fair enough, to have a new one on this issue, here my last note...
Oliver Francke
12:46 PM Bug #2311: rbd: delete + create image led to EEXIST
Is it possible there is some other user, or the logs are from the wrong cluster?
I see:
- client.13507 deletes 90...
Sage Weil
12:45 PM Bug #2311 (Resolved): rbd: delete + create image led to EEXIST
Here is a sequence copy-n-pasted:
rbd rm data/905-testdisk.rbd
Removing image: 100% complete...done.
rbd create ...
Sage Weil
01:28 PM Bug #2286 (Resolved): mon: different full/near_full values on different monitors
commit:9ef953b5e20c3d232cfe4aa90f26476a2a2f911b Sage Weil
11:18 AM Bug #2286 (Fix Under Review): mon: different full/near_full values on different monitors
Check out wip-2286-ratio-a and see what you think. It fills in the ratios from g_conf on create_initial, only changes... Greg Farnum
12:51 PM Bug #2178: rbd: corruption of first block
Hi Sage,
sorry, was not clear enough. The logfiles provide informations for "907-testdisk.rbd..." not "906..."
Th...
Oliver Francke
12:46 PM Bug #2178 (Resolved): rbd: corruption of first block
moved this new issue to #2311, and resolving this bug. hooray! Sage Weil
12:45 PM Bug #2178: rbd: corruption of first block
Oliver Francke wrote:
> Here is a sequence copy-n-pasted:
>
> rbd rm data/905-testdisk.rbd
> Removing image: 100...
Sage Weil
10:41 AM Bug #2178: rbd: corruption of first block
Oliver Francke wrote:
> Hi Sage,
>
> here my notes, after almost 40 tests no bad things happened, only once a min...
Sage Weil
07:22 AM Bug #2178: rbd: corruption of first block
second logfile here, sorry. Oliver Francke
07:18 AM Bug #2178: rbd: corruption of first block
Here is a sequence copy-n-pasted:
rbd rm data/905-testdisk.rbd
Removing image: 100% complete...done.
rbd create ...
Oliver Francke
05:51 AM Bug #2178: rbd: corruption of first block
Meanwhile continued to test...:
I noticed some negative degredation:
2012-04-18 14:43:37.282634 pg v128104: ...
Oliver Francke
05:36 AM Bug #2178: rbd: corruption of first block
Hi Sage,
here my notes, after almost 40 tests no bad things happened, only once a minor hickup, where the rbd-head...
Oliver Francke
08:22 AM Linux kernel client Bug #2298 (In Progress): rbd: broken encode_op for big-endian hosts?
I sent a note to the various lists Al Viro posted to, to confirm the
bug (wasn't sure whether Sage had or not).
I...
Alex Elder
05:58 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
Interesting... The warning showed up again despite test 232 being
removed from the list. Based on the time stamp o...
Alex Elder
04:38 AM Revision 5a6bbd0c (ceph): rgw-admin: user create is idempotent
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Reviewed-by: Sage Weil <sage@newdream.net>
Yehuda Sadeh
04:38 AM Revision 413d8be9 (ceph): osdmap: dump pool_max
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

04/17/2012

10:33 PM Revision 4627b67f (ceph): msgr: Document remaining SimpleMessenger functions and data members
This doesn't handle the inner classes (Pipe, mostly) yet. It also
raises questions about whether these functions shou...
Greg Farnum
10:01 PM Revision 1f6b7f81 (ceph): msgr: Group the SimpleMessenger functions following the Messenger's pat...
Ordering everything nicely will make for a cleaner interface, make
it easier to see what documentation we need, and m...
Greg Farnum
08:47 PM Bug #2286: mon: different full/near_full values on different monitors
Greg Farnum wrote:
> Hmm. I looked at redoing this and got stuck on the semantics we want. If we're interested in fu...
Sage Weil
04:52 PM Bug #2286 (In Progress): mon: different full/near_full values on different monitors
Hmm. I looked at redoing this and got stuck on the semantics we want. If we're interested in full_ratio == 0 being an... Greg Farnum
11:00 AM Bug #2286: mon: different full/near_full values on different monitors
yeah. actually, i think the check should go in tick() inside the is_leader() block, and not update_from_paxos().
Sage Weil
10:54 AM Bug #2286: mon: different full/near_full values on different monitors
Oh, I see...I wasn't following that need_*_ratio_update stuff properly. And update_full_ratios() will be called on th... Greg Farnum
10:30 AM Bug #2286: mon: different full/near_full values on different monitors
Greg Farnum wrote:
> I'm looking at your patch and it doesn't make a lot of sense to me.
> First off, when do you t...
Sage Weil
09:45 AM Bug #2286: mon: different full/near_full values on different monitors
I'm looking at your patch and it doesn't make a lot of sense to me.
First off, when do you think that peon monitors ...
Greg Farnum
08:43 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
Greg Farnum wrote:
> I'm confused how you're getting that pool_max printout — I don't see it at all when I run that ...
Sage Weil
06:57 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
I'm confused how you're getting that pool_max printout — I don't see it at all when I run that command with a ceph-de... Greg Farnum
04:16 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
at some point the osdmap pool_max got set to -1.
nine:2307 04:15 PM $ ~/src/ceph/src/ceph-dencoder type OSDMap i...
Sage Weil
03:56 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
nine:2307 03:56 PM $ osdmaptool osdmap_full/5754 -p | grep ^pool
pool 0 'data' rep size 2 crush_ruleset 0 object_has...
Sage Weil
03:52 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
It looks to me liek the 'data' pool (0) was deleted, and then a new one (vmimages) was created. but somehow that was... Sage Weil
10:34 AM Bug #2307 (Resolved): OSD & Monitor disagree on the contents of pg_temp
See: http://marc.info/?t=133352732900001&r=1&w=2
It seems that (for example) pg 0.138 is in pg_temp, but the OSD c...
Greg Farnum
08:22 PM Revision 3053e477 (ceph): Merge branch 'wip-2289-2'
Yehuda Sadeh
08:20 PM Revision dcd34fc8 (ceph): FileJournal: let peek_write return a reference
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
08:17 PM Revision 65614847 (ceph): common: add blkdev.{h,cc}
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:12 PM Revision 82d7a681 (ceph): rbd: fix probe of source when block device
Fixes: #2304
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:11 PM Revision 41cb40ad (ceph): filestore: factor block device size probe out into helper
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:56 PM Revision dea85368 (ceph): FileJournal: clarify locking in header
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:53 PM Revision 377f75f9 (ceph): OpRequest: mark_event output is now at debug 5
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:39 PM Revision 98e2369e (ceph): librados: add methods to check and wait for safe/complete -and- callback
Allow user to check for (safe|complete) -and- callback return, and wait
for that condition.
Fixes: #2301
Signed-off-...
Sage Weil
06:17 PM Revision 001aa462 (ceph): msgr: add a little detail to mark_down_on_empty() and mark_disposable()
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:53 PM Revision 2eb2aa5a (ceph): msgr: group functions more appropriately
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:53 PM Revision a422c479 (ceph): msgr: remove out-of-date comments and function
I think that destroy() is left over from a misinformed attempt
to implement refcounting on the Messenger.
Signed-off...
Greg Farnum
05:45 PM Revision 0be9718a (ceph): librbd: 'rbd cache enabled' -> 'rbd cache'
'enabled' is useless verbiage. We should fix the rgw option too,
protably...
Signed-off-by: Sage Weil <sage@newdrea...
Sage Weil
03:59 PM Revision e875b89f (ceph): Add task for running fsx on an rbd image.
Josh Durgin
03:03 PM Feature #2309 (Duplicate): rados namespaces
Sage Weil
01:33 PM Revision 6cbda4c3 (ceph): qa: comment out xfstest 232
Test 232 in the xfstests suite produces an XFS error in the log
when run over an RBD device. This is most likely an ...
Alex Elder
01:23 PM rgw Bug #2289 (Resolved): rgw: listing a bucket hangs after removing inexisting object
Fixes merged into master at commit:3053e4773bae93cfa3158882aa4963803862f9b2. Yehuda Sadeh
01:13 PM Bug #2304 (Resolved): rbd import fails on block device
Sage Weil
11:57 AM CephFS Bug #2299 (Rejected): all MDS commit suicide on startup
Sage Weil
11:54 AM Bug #2219 (Can't reproduce): OSD's commit suicide with 0.44
Let us know if you see this again! Thanks Sage Weil
11:40 AM Bug #2301 (Resolved): librados: LibRadosMisc.AioOperatePP failure
Sage Weil
11:27 AM rgw Feature #2308 (Resolved): radosgw-admin: make user create idempotent
radosgw-admin user create should be idempotent and work similar to user modify. We would need to verify that the same... Yehuda Sadeh
08:11 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
I believe we are seeing the same problem here. I have been able to reproduce it each time I have tried. The hardwar... Nick Bartos
07:14 AM Revision e0ec2865 (ceph): objecter: recalc pgid in case it wasn't a precalculated pgid
We had an issue where we didn't call it since we relied on the
existance of the object name and we were passed an emp...
Yehuda Sadeh
07:12 AM Revision f22da6c0 (ceph): cls_rgw: rgw_dir_suggest_changes move cur_disk_bl inside loop
Otherwise we'd fail in case the cls_cxx_read_key failed, since
we rely on its size.
Signed-off-by: Yehuda Sadeh <yeh...
Yehuda Sadeh
06:36 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
I have updated the run_xfstests.sh script so that it simply no longer
runs test 232. That way we can still benefit ...
Alex Elder
04:36 AM Revision 998498f2 (ceph): Merge branch 'wip-oc-perf'
Sage Weil
04:32 AM Revision 31d0d1d7 (ceph): librados: test get_pool_name
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:32 AM Revision 2dbea128 (ceph): librados: C bindings for get_pool_name()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

04/16/2012

10:02 PM Bug #2178: rbd: corruption of first block
The most recent occurrence has been confirmed to be a replay issue with non-btrfs filesystems. The wip-guard branch ... Sage Weil
09:54 PM Bug #2255 (Resolved): osd: fix object name collisions between pools in temp collection
Sage Weil
09:52 PM Bug #2286: mon: different full/near_full values on different monitors
pushed a patch that confines the logic of when to update this into a single bit of code. look okay?
i think the b...
Sage Weil
12:57 PM Bug #2286: mon: different full/near_full values on different monitors
Sage asked on irc about just setting it up on the initial create_empty. The problem with that the only data which is ... Greg Farnum
11:26 AM Bug #2286 (Fix Under Review): mon: different full/near_full values on different monitors
This got (obviously) broken by commit:b6d1c0c9b7290a237560528b6ff0d6b2b2998ee2, which put in the use of magic numbers... Greg Farnum
09:37 PM Feature #2113 (Resolved): objectcacher perfcounters
Sage Weil
11:24 AM Feature #2113: objectcacher perfcounters
My bad — I'll try and do that today! Greg Farnum
11:13 AM Feature #2113 (Fix Under Review): objectcacher perfcounters
not merged yet! i wanted to get feedback first on my naming kludge... Sage Weil
10:02 AM Feature #2113 (Resolved): objectcacher perfcounters
Sage merged this. Greg Farnum
09:21 PM Revision 9f5b9ada (ceph): Merge remote branch 'upstream/wip-guard'
Samuel Just
09:19 PM Revision 2f5af83f (ceph): Filestore: fix guard errors in COLL_REMOVE and COLL_MOVE
Use object rather than collection check in COLL_REMOVE. Adjust
COLL_MOVE check for -1/1 distinction.
Signed-off-by:...
Samuel Just
07:32 PM Revision f87a6a0f (ceph): cls_rgw: rgw_dir_suggest_changes() don't try to decode cur_disk
don't try to decode cur_disk if bufferlist is zero sized.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
Yehuda Sadeh
07:32 PM Revision 6868ec6a (ceph): rgw: skip empty entries when listing a bucket
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
07:32 PM Revision 73badda6 (ceph): cls_rgw: don't zero out error code before reading value
We were zeroing the return code before reading it.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
04:12 PM Bug #2306: objecter: accessing empty object maps to pool 0
that looks right to me.
and yeah, i don't think object operations should be possible on an empty object name...
Sage Weil
04:03 PM Bug #2306: objecter: accessing empty object maps to pool 0
Yep, that's pretty much exactly what I was thinking.
The only other question is if this fix is the right approach ...
Greg Farnum
04:00 PM Bug #2306: objecter: accessing empty object maps to pool 0
Would something like this work (not tested)?... Yehuda Sadeh
03:52 PM Bug #2306: objecter: accessing empty object maps to pool 0
i prefer an explicit separate field for oid-vs-pg mode so that we can distinguish between pg 0.0 (really) and no pg/n... Sage Weil
03:07 PM Bug #2306: objecter: accessing empty object maps to pool 0
Ah, nope. list_objects is broken. Yehuda Sadeh
03:06 PM Bug #2306: objecter: accessing empty object maps to pool 0
From what I see, the pg ops call pool_op_submit() and not op_submit() so Greg's fix might be ok? Yehuda Sadeh
02:53 PM Bug #2306: objecter: accessing empty object maps to pool 0
Ah, you're right. I missed that function when looking to see who filled in the op->pgid.
In that case we should ma...
Greg Farnum
02:33 PM Bug #2306: objecter: accessing empty object maps to pool 0
i think that if was there for the pg ops (PGLS) where there is no object... the list_objects code is filling in the p... Sage Weil
02:07 PM Bug #2306 (Fix Under Review): objecter: accessing empty object maps to pool 0
Yep, the Objecter doesn't calculate pg placement for objects with a zero-length name. I'm pretty sure the if guard th... Greg Farnum
01:51 PM Bug #2306: objecter: accessing empty object maps to pool 0
Empty object <== object with empty name Yehuda Sadeh
01:51 PM Bug #2306 (Resolved): objecter: accessing empty object maps to pool 0
Even if different pool was specified. Yehuda Sadeh
03:34 PM CephFS Bug #2299: all MDS commit suicide on startup
this issue can be closed, there was an error in the underlying fileystem of osd.0 :) Martin Scheffler
02:59 PM CephFS Bug #2277: qa: flock test broken
I was going to move this over to the kernel client project and then realized I can't — should we close this bug (reje... Greg Farnum
02:46 PM CephFS Bug #2277: qa: flock test broken
... Greg Farnum
02:15 PM Linux kernel client Bug #2298: rbd: broken encode_op for big-endian hosts?
there are some old g5's in the closet here at aon that we can use.
in the past we've found/fixed these issues with...
Sage Weil
01:46 PM Linux kernel client Bug #2298: rbd: broken encode_op for big-endian hosts?
I haven't looked at this in any detail but I presume Al is correct.
We don't have any big endian hardware anywhere, ...
Alex Elder
01:49 PM CephFS Bug #2288: libcephfs: setxattr returns EEXIST following removexattr
More info:
That branch has a patch which adds a call to removexattr before the setxattr. If you run testceph twice i...
Greg Farnum
01:38 PM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
Should have waited. It have reproduced the problem by running test 232.
Alex Elder
01:37 PM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
After a lot of repetitions, I've narrowed it down to test 232 or 234. Alex Elder
10:12 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
I ran subsets of that list at least three times and never
reproduced it. I tried again after a reboot, and again,
...
Alex Elder
06:43 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
Looking at the list of tests that indicate they include quota testing,
the ones that are currently being run by the ...
Alex Elder
06:36 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
I sent a report to the XFS mailing list about the warning. I have to try
to narrow down which test was running when...
Alex Elder
01:18 PM CephFS Bug #2285: libcephfs: failure with empty name components
Yep, it's client-local; there's no request to the MDS for this either.
Guess that means we don't care right now?
Greg Farnum
01:04 PM CephFS Bug #2285 (In Progress): libcephfs: failure with empty name components
Oddly, this looks like it's a race. I can't reproduce it with any client debugging on... Greg Farnum
11:33 AM Feature #2305: Moving rbd images between pools
Not quite; copy works, but slowly (because of course it's duplicating all the data). I don't know if mv/rename could... Dan Mick
11:24 AM Feature #2305 (Rejected): Moving rbd images between pools
We discovered it does work if you keep the image names the same and vary the pool names. :) Greg Farnum
11:01 AM Feature #2305 (Resolved): Moving rbd images between pools
It would be nice to have an option to move rbd's between pools with a syntax like:
rbd mv <first poolname>/<image na...
Stefan Kleijkers
10:02 AM Messengers Cleanup #2150 (In Progress): repair the Simple/Messenger interface
Not really done! ;) Greg Farnum
08:44 AM rbd Feature #2297: ObjectCacher: mark buffers mergeable for ksm
I'm really not sure this is something we want to do, especially unconditionally. Let's wait until we get some idea of... Greg Farnum
07:27 AM Bug #2304 (Resolved): rbd import fails on block device
root@burnupi30:~# rbd import /dev/sda burnupi30.sda
fiemap ioctl() failed
Importing image: 100% complete...done.
...
Sage Weil
03:39 AM Revision 6bede298 (ceph): dump_stuck: whitelist 'wrongly marked me down'
The test marks the osds down.. they may generate this error if they get
that faster than they get the signal via the ...
Sage Weil

04/15/2012

08:30 PM Bug #2303 (Can't reproduce): osd: failed to peer on startup
ubuntu@teuthology:/a/nightly_coverage_2012-04-14-b/994 Sage Weil
08:24 PM Linux kernel client Bug #2302 (Can't reproduce): xfs: warning at mutex_remove_waiter
... Sage Weil
03:33 PM Feature #1044 (In Progress): librbd: discard support
Sage Weil
03:33 PM Feature #2163 (Resolved): qa: full xfstests on rbd
Sage Weil
03:33 PM Subtask #2249 (Resolved): teuthology task (3)
Sage Weil
03:33 PM Feature #2226 (Resolved): osd: better filestore idempotency test
Sage Weil
05:39 AM Linux kernel client Bug #2287: rbd: crashes with 10Gbit network and fio
Here some more info from the crash:
@[58113.180039] libceph: tid 387083 timed out on osd92, will reset osd
[5818...
Danny Kukawka

04/14/2012

10:38 PM Revision 1bc01289 (ceph): Merge branch 'stable'
Sage Weil
09:06 PM Revision 19e673cc (ceph): filestore_idempotent: use new sequence-based tester
random seed, inject at 50-300. Sage Weil
08:53 PM Revision 20ab74ea (ceph): test_librbd: fix write sizes
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:07 PM CephFS Bug #2299: all MDS commit suicide on startup
after i told osd.0 to get lost and reformatted it, the cluster started resyncing.
then (magically) mds.0 started up ...
Martin Scheffler
09:39 AM CephFS Bug #2299 (Rejected): all MDS commit suicide on startup
my setup is: 1 MON, 2 MDS and 4 OSD.
ceph version is commit:1e76a8713feac6883c648512dcdc28c83f7ff69e.
after copyi...
Martin Scheffler
05:26 PM Revision 81c8a7b9 (ceph): test_idempotent_sequence: no config file
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:43 PM Revision a40fe5bd (ceph): run_xfstests.sh: ensure cleanup on errors
Because we exit on any error (due to 'set -e'), the cleanup call was
never getting made in the event of an error. Th...
Alex Elder
04:41 PM Bug #2301: librados: LibRadosMisc.AioOperatePP failure
the problem is that the completion callback is now async, but wait_for_complete() is not.
do we think that is ok?
Sage Weil
02:59 PM Bug #2301 (Resolved): librados: LibRadosMisc.AioOperatePP failure
2012-04-14T00:11:00.763 INFO:teuthology.task.workunit.client.0.out:[ RUN ] LibRadosMisc.AioOperatePP
2012-04-14...
Sage Weil
04:26 PM Revision 3d187db5 (ceph): run_xfstests.sh: pass test result via exit status
It turns out that xfstests *does* exit with non-zero status
when a test fails. Its exit status is the number of test...
Alex Elder
01:58 PM Bug #2300 (Rejected): objecter: not sending stat request
Yehuda Sadeh
01:50 PM Bug #2300: objecter: not sending stat request
Ah, actually we try to access an object with empty oid, which is obviously wrong. Probably due to #2289 issues. Yehuda Sadeh
12:34 PM Bug #2300 (Rejected): objecter: not sending stat request
Happens in rgw (can only see it on congress). Following a rgw.bucket_list call response, we call librados io_ctx->sta... Yehuda Sadeh
12:38 PM rgw Bug #2289: rgw: listing a bucket hangs after removing inexisting object
Pushed several fixes to wip-2289. The scenario was:
creating bucket
trying to remove object that does not exist
...
Yehuda Sadeh
06:53 AM Revision a4dd9204 (ceph): objclass: fix cls_cxx_map_remove_key()
didn't set the correct osd op, didn't do anything.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
Yehuda Sadeh
06:10 AM Revision b352a3e1 (ceph): rgw: call dir_suggest_changes with correct bufferlist
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
05:28 AM Revision 6ba4efcd (ceph): rbd.py: add xfstests functionality
Add tasks for running xfstests over a pair of rbd volumes. The main
one is called xfstests, and it sets up rbd volum...
Sage Weil
05:27 AM Revision 4498825a (ceph): add rbd_xfstests to regression suite
Sage Weil
05:19 AM Revision a31392cf (ceph): objectcacher: name them
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:18 AM Revision 28e61811 (ceph): librados: add IoCtx::get_pool_name() to c++ api
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:51 AM Revision 446b5e24 (ceph): ObjectCacher: move functions out of the header.
These all set off my "big-functions-in-the-header" alarm.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
04:51 AM Revision 6257a94e (ceph): ObjectCacher: add a bunch of perfcounters
Hurray, we can collect cache statistics!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
04:51 AM Revision f355875b (ceph): ObjectCacher: group all the function definitions properly
Some of the ObjectCacher functions were separate, which works poorly
if you're trying to do things like use ldout.
S...
Greg Farnum
04:51 AM Revision d80c126a (ceph): ObjectCacher: use "objectcacher" instead of the CephContext::name for p...
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:51 AM Revision 72a75730 (ceph): objectcacher: perf skeleton
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Conflicts:
src/osdc/ObjectCacher.cc
src/osdc/ObjectCacher.h
Yehuda Sadeh
04:41 AM Revision 2cc854f7 (ceph): msgr: Document all the functions in the Messenger interface.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:40 AM Revision b8207dd2 (ceph): msgr: remove several functions from the interface.
The suicide() function is unused and its current SimpleMessenger
implementation just calls shutdown(). And I don't se...
Greg Farnum
04:02 AM Revision bf8847e7 (ceph): qa: add run_xfsests.sh script
Add a script that runs xfstests over a pair of devices that are
specified using command line arguments. The tests ar...
Alex Elder
04:01 AM Revision ee69f271 (ceph): test/rbd/fsx: fix signed/unsigned warnings
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:52 AM Revision 812c7c7a (ceph): librbd: implement discard
Implement sync and async discard. Embed an ObjectWriteOperation in the
BlockCompletion struct.
The sync version doe...
Sage Weil
03:51 AM Revision 589c6269 (ceph): Merge branch 'wip-librbd-caching'
Reviewed-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:46 AM Revision 5c2d5b18 (ceph): test: copy fsx to rbd directory
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision b26d7024 (ceph): test: port fsx to run on an rbd image instead of a file
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision 78d14da8 (ceph): librbd: flush pending writes when a new snapshot is created
This makes sure the state is as consistent as librbd can make it
before the snapshot is actually created.
Signed-off...
Josh Durgin
03:46 AM Revision 30805bfa (ceph): test_librbd_fsx: clean up properly
Delete and close everything on exit.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
03:46 AM Revision 68e2e004 (ceph): librbd: flush cache before creating a snapshot
This is a temporary workaround until the ObjectCacher
is smarter about snapshots.
Signed-off-by: Josh Durgin <josh.d...
Josh Durgin
03:46 AM Revision d0b1f72c (ceph): ObjectCacher: add some asserts about when the lock is held
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision 7197825d (ceph): Remove unused Completion.h header
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision 1dd9c303 (ceph): ObjectCacher: remove redundant 'public:'
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision 7868dbe0 (ceph): ObjectCacher: remove unused/unimplemented methods
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision ee9987b2 (ceph): librados: debug logging for aio writes
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision 20be92ec (ceph): ObjectCacher: always delete callback in commit_set
The GatherBuilder used later always deletes it. To make
the interface sane, delete it in every return path.
Signed-o...
Josh Durgin
03:46 AM Revision 6cf22655 (ceph): ObjectCacher: remove useless conditional
We assert this condition at the beginning of the function.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
03:46 AM Revision 55b65544 (ceph): test_librbd: flush before opening another handle to the image
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision d9aa81b0 (ceph): finisher: add indent settings header
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision f464e6c3 (ceph): ObjectCacher: include object when outputting a BufferHead
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision 9c8b3f70 (ceph): .gitignore: clean up to not match so many things recursively
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
03:46 AM Revision a0fc3336 (ceph): librados: unconditonally associate IoCtxImpl with AioCompletionImpl
We were only setting the pointer if we were on the aio_write_list. Set
this pointer for any aio.
Signed-off-by: Sag...
Sage Weil
03:46 AM Revision 34a10fae (ceph): librados: do aio callbacks in async thread
Call user completions in an async thread. This allows callers to call back
into librados from the callback, and allo...
Sage Weil
03:46 AM Revision cbbd5f97 (ceph): ObjectCacher: check for read/write errors
The objecter doesn't fix every error for us.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
03:46 AM Revision abbfd937 (ceph): Filer: make file_to_extents static
Now the ObjectCacher has no need for a Filer instance (not that
it had a good reason for one before).
Signed-off-by:...
Josh Durgin
03:46 AM Revision d6dbff01 (ceph): ObjectCacher: remove dependency on Objecter
Abstract out how writeback is done with a WritebackHandler object.
For RBD caching, this will be done by librados, bu...
Josh Durgin
03:46 AM Revision 9546c158 (ceph): librbd: remove writeback window
This is superseded by a full-fledged writeback cache.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
03:46 AM Revision c0073770 (ceph): librbd: allow writeback caching
This uses the existing infrastructure of ObjectCacher for
buffer management and expiry.
Signed-off-by: Josh Durgin <...
Josh Durgin
03:46 AM Revision 579d30b6 (ceph): librbd: check for writes to snapshots
librados does this for us normally, but caching does not check for this.
We might as well check early to avoid schedu...
Josh Durgin
03:46 AM Revision bb35ecf0 (ceph): librbd: fix bytes read accounting in read_iterate
ObjectCacher will never do short reads, and always returns 0.
librados may do short reads at the end of an object.
S...
Josh Durgin
03:41 AM Revision cc1ae672 (ceph): filestore: flush log on failure injection
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:41 AM Revision ad2eba43 (ceph): Merge branch 'wip-2226b'
Conflicts:
src/common/config_opts.h
src/os/FileStore.cc
src/os/FileStore.h
Sage Weil
03:38 AM Revision 4fc48e5b (ceph): run_seed_to_range.sh: run filestore idempotent sequence tests over a range
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision f052c82b (ceph): run_seed_to: assume tool is in path, not cwd
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 2cb0032e (ceph): DeterministicOpSequence: writing to object being cloned in the same tx.
We write to the object being cloned prior to the clone to ensure we are
cloning a valid range of bytes.
The write an...
Joao Eduardo Luis
03:38 AM Revision c7729847 (ceph): test_idempotent_sequence: simple test script
Takes seed and kill point, then verifies the result.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:38 AM Revision 95bb4239 (ceph): filestore: use our assert
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision a8b9b98c (ceph): run_seed_to.sh: set -e
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 56fb027f (ceph): test_idempotent_sequence: make get-last-op return 0 if no txns applied
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 2d24eb33 (ceph): FileStore: don't return ENOENT from object_map getters
ENOENT in those contexts means that the object_map does not know
about the object, not that the object does not exist...
Samuel Just
03:38 AM Revision 21be4ad9 (ceph): test_idempotent_sequence: Add more info on diff's dout's.
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision bb0499a4 (ceph): test_idempotent_sequence: outputting collection being diff'ed.
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision 27cb82e5 (ceph): test_idempotent_sequence: use get_obj_at() for clone source
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 1b451294 (ceph): test_idempotent_sequence: fix/enable do_coll_add
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision a5f533a7 (ceph): test_idempotent_sequence: fix diff return value, mount behavior
No mkfs!
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:38 AM Revision 08cf4d28 (ceph): test_idempotent_sequence: include object name in diff output
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 2d20b4c2 (ceph): test_idempotent_sequence: return true from diff if different
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 5d30581e (ceph): test_idempotent_sequence: add remove op
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision e270e0b5 (ceph): test_idempotent_sequence: diff two stores.
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision 2c940a4f (ceph): test_idempotent_sequence: Remove references to VerifyFileStore
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision 42899110 (ceph): Makefile: remove VerifyFileStore from test_idempotent_sequence build
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision 84175c7d (ceph): VerifyFileStore: delete from tree
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision 794fc0b6 (ceph): test_idempotent_sequence: fix collection_move arg order
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision a8c32057 (ceph): test_idempotent_sequence: whitesepace
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision bc5dffe9 (ceph): test_idempotent_sequence: replace on coll_move
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision c056f603 (ceph): test_idempotent_sequence: simplify clone
- pick existing source
- any target is ok, as long as it != source
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:38 AM Revision 51bc6dfd (ceph): test_idempotent_sequence: make object names overlap
This way when we move objects between collections they will replace
each other in interesting ways.
Signed-off-by: S...
Sage Weil
03:38 AM Revision db3d14b9 (ceph): test_idempotent_sequence: fewer objects per collection by default
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision bec1e50e (ceph): test_idempotent_sequence: remove existing target for collection_add
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 975e59ad (ceph): test_idempotent_sequence: Add commands and lose a couple of optional args.
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:38 AM Revision 5ced86be (ceph): filestore: init filestore_kill_at in ctor
Otherwise we don't get the option for FileStore instances created after
common_init_finish() (which does md_config_t:...
Sage Weil
03:38 AM Revision 9dba3172 (ceph): test_idempotent_sequence: require nonexistent filestore dir
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 795274bc (ceph): test_idempotent_sequence: include tx number in each transaction
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 57866946 (ceph): test_idempotent_sequence: implement get-last-op
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision 6573690a (ceph): VerifyFileStore: Check if two FileStore's match after applying a set of...
With DeterministicOpSequence we are able to reproduce exactly the same
sequence of operations, over and over. However...
Joao Eduardo Luis
03:38 AM Revision dee47bb2 (ceph): test_idempotent_sequence: Generate a reproducible sequence of txs.
With this test we aim at reproducing the same sequence of transactions
as long as we are provided with the same seed ...
Joao Eduardo Luis
03:38 AM Revision 4a6cfcb3 (ceph): deterministicopseq: add collection_rename() support
Joao Eduardo Luis
03:38 AM Revision 01daa88d (ceph): test_idempotent_sequence: no need to reinject value that is already there
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:38 AM Revision a5a83950 (ceph): filestore: name internally
We need to allow the perfcounter name to be controlled so that we can have
two instances of FileStore in the same pro...
Sage Weil
03:38 AM Revision 83227550 (ceph): TestFileStoreState: Represent a FileStore's state to be used by tests.
Instead of having each test creating the same representation of a
FileStore's state, with a map/set of collections an...
Joao Eduardo Luis
03:38 AM Revision 2f02a8a5 (ceph): DeterministicOpSequence: Generate a reproducible sequence of operations.
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
03:33 AM Revision 583c79af (ceph): filestore: fix collection_add argument names
No functional changes, just fixing and clarifying argument names so that it
is less confusing/wrong.
Signed-off-by: ...
Sage Weil
03:33 AM Revision dfcadc94 (ceph): filestore: simple failure injections via --filestore-kill-at <n>
This will make filestore suicide (_exit(1)) on the n'th potential failure
call site. We can potentially fail:
...
Sage Weil
03:33 AM Revision f4001790 (ceph): filestore: two-phase guard
For certain operations (collection_add) we need a two-phase guard, and an
"in-progress" state.
- before exposing an...
Sage Weil
03:33 AM Revision 8d3acb5a (ceph): filestore: replay collection_move using add+remove
This approximates the buggy collection_move. It is still buggy. It is
only there to replay old journals.
Rip out b...
Sage Weil
01:14 AM Revision c6551fb8 (ceph): test_rados.py: add basic tests for the Object class
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com>
Josh Durgin
01:13 AM Revision 76799680 (ceph): rados.py: fix Object.read argument order
57cfd7673da66d191cc397689da35d0a1f3dc61a switched the argument order but didn't change this caller.
Signed-off-by: T...
Tomasz Paskowski

04/13/2012

11:12 PM Revision 5ce8d71f (ceph): FileJournal: allow write_thread and write_finisher to run in parallel
Previously, the write_thread and the write_finisher serialized on
write_lock limiting performance. aio_lock has been...
Samuel Just
11:03 PM Feature #1044 (Resolved): librbd: discard support
Sage Weil
11:03 PM Feature #2163: qa: full xfstests on rbd
Sage Weil
11:02 PM Feature #2052 (Resolved): librbd: caching
Sage Weil
06:37 PM Feature #2052: librbd: caching
This is passing long-running fsx with osd thrashing consistently, and all the other rbd tests. I think the branch (wi... Josh Durgin
10:40 PM Linux kernel client Bug #2298 (Resolved): rbd: broken encode_op for big-endian hosts?
... Sage Weil
10:17 PM Subtask #2249: teuthology task (3)
Sage Weil
09:26 PM Subtask #2237 (Resolved): failure+replay tester (8)
Sage Weil
06:39 PM Bug #2278 (Resolved): librados: python read has arguments swapped
Fixed by 76799680546a79fc73ad7bbc58960a31ae2290ad. Josh Durgin
10:10 AM Bug #2278: librados: python read has arguments swapped
Sage Weil
07:56 AM Bug #2278 (Resolved): librados: python read has arguments swapped
Object.read from rados.py is passing arguments to ioctx.read in a wrong order.
--- rados.py.dist 2012-04-13 16:5...
Tomasz Paskowski
06:38 PM rbd Feature #2297 (New): ObjectCacher: mark buffers mergeable for ksm
This is done with a simple madvise call, but we should test that it works with ksm and verify that all the buffers ar... Josh Durgin
06:29 PM Feature #2296 (Resolved): librbd: allow resizing to arbitrary sizes
Right now resizing to a non-object-size multiple will round down the remainder. With discard support, we support this... Josh Durgin
06:25 PM Feature #2295 (Resolved): make qemu cache=writeback,writethrough option turn on librbd caching
This will enable more familiar use of caching with qemu/rbd, and let people configure it with libvirt's existing xml. Josh Durgin
05:51 PM rbd Feature #2294 (New): librbd: optionally cache entire objects, instead of only requesting the part...
This may save many round trips for small read sizes (common to vms). Josh Durgin
05:49 PM Feature #2113 (Fix Under Review): objectcacher perfcounters
Okay, I checked and these work — if you run ceph-fuse -f and play around you can do a dump_perfcounters and see the v... Greg Farnum
05:46 PM CephFS Bug #2293 (Resolved): admin sockets don't persist with ceph-fuse
It looks like the admin socket is associated with the launching process, rather than the background process that cont... Greg Farnum
05:46 PM rbd Feature #2292 (New): ObjectCacher: support sparse objects
The ObjectCacher doesn't store which objects or parts of objects don't exist. This info could improve read performance. Josh Durgin
05:46 PM Revision dc44950b (ceph): filestore: implement collection_move() as add + remove
This ensures we get add and remove steps with different spos values, which
makes the guard work. The collection_move...
Sage Weil
05:46 PM Bug #2291 (Can't reproduce): objectcacher perfcounters don't work with test_librbd_fsx
The admin socket perfcounters_dump command only outputs objecter data. I'm speculating that it has to do with the obj... Greg Farnum
05:30 PM Feature #2290 (Resolved): ObjectCacher: handle read/write errors
Currently the return value of the underlying read/write calls is ignored (I left TODO notes there). We should figure ... Josh Durgin
05:02 PM rgw Bug #2289 (Resolved): rgw: listing a bucket hangs after removing inexisting object
Yehuda Sadeh
04:59 PM Revision dc10d04e (ceph): OSD: improve information and format of OSDTracker messages
Also, Message now has a timestamp indicating when the message
was fully recieved for use by OSDTracker.
Signed-off-b...
Samuel Just
03:12 PM Subtask #2235 (Resolved): generate deterministic sequence of transactions (5)
Sage Weil
02:01 PM CephFS Bug #2288 (Resolved): libcephfs: setxattr returns EEXIST following removexattr
running cephtest a couple of time (out of wip-testlibcephfs):... Yehuda Sadeh
01:48 PM Linux kernel client Bug #2287 (Resolved): rbd: crashes with 10Gbit network and fio
From http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/5968:... Josh Durgin
01:47 PM Bug #2286 (Resolved): mon: different full/near_full values on different monitors
If you run vstart, you get... Greg Farnum
01:39 PM CephFS Bug #2285 (Resolved): libcephfs: failure with empty name components
the following in client/testceph.cc fails:
// test empty name components
my_fd = ret = ceph_open(cmount, "rea...
Yehuda Sadeh
11:19 AM rgw Feature #2284 (Resolved): rgw: bench based on rados_bench
Yehuda Sadeh
11:17 AM rgw Feature #2171 (Rejected): rgw: asynchronously calculate md5
Sage Weil
11:16 AM Feature #2283: The ceph command should time out
Sage Weil
10:21 AM Feature #2283 (New): The ceph command should time out
When using ceph to query certain parts of the cluster, there should be an option to time out after a certain set numb... Bernard Grymonpon
09:44 AM Subtask #2282 (Resolved): Handle map updates on a per-pg basis
Currently, we advance all pgs to the next map at once. This requires us to flush the filestore queue and basically h... Samuel Just
09:27 AM Feature #2281 (Resolved): build big burnupi cluster for testing
Sage Weil
09:23 AM Feature #2280 (Resolved): improve gitbuilder infrastructure
* do not fill up local disk; sync results out immediately
* resolve branches immediately, not after each full pass
?
Sage Weil
09:20 AM rbd Feature #2279 (Resolved): rbd: trivial layering design doc
- how parent images are marked read-only
- how parent/child relationship is represented
- possibly how this allow...
Sage Weil
09:16 AM Bug #2192 (Need More Info): ceph-mon hangs consuming 100% CPU
Sage Weil
09:14 AM Feature #2246 (Resolved): force10s on sepia
Sage Weil
09:13 AM Messengers Cleanup #2150 (Resolved): repair the Simple/Messenger interface
Sage Weil
09:13 AM Feature #2240 (Resolved): osd: new default locations
Sage Weil
06:16 AM Revision 3d11455b (ceph): osdmaptool: fix clitest for lpg_num 0
Broke this in 11b93d3af2c0f1bca3e48fe773a14f07f479ccc3
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:56 AM Revision 55535d04 (ceph): move tasks:cfuse_workunit_suites_dbench.yaml to stress pending #1737 fix
Sage Weil
05:36 AM Revision 1e76a871 (ceph): Merge remote branch 'gh/wip-defaults'
Sage Weil
04:47 AM Revision 8a6c3c17 (ceph): msgr: dispatcher: Documentation of security functions
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum

04/12/2012

11:24 PM Revision 9b2aff20 (ceph): FileStore: don't return ENOENT from object_map getters
ENOENT in those contexts means that the object_map does not know
about the object, not that the object does not exist...
Samuel Just
11:17 PM Subtask #2237 (In Progress): failure+replay tester (8)
Sage Weil
11:17 PM Subtask #2235: generate deterministic sequence of transactions (5)
Sage Weil
11:15 PM Feature #2240: osd: new default locations
Sage Weil
10:58 PM CephFS Bug #2277 (New): qa: flock test broken
ubuntu@teuthology:/a/nightly_coverage_2012-04-12-b/687
ubuntu@teuthology:/a/nightly_coverage_2012-04-11-b/525
thi...
Sage Weil
10:50 PM Revision 241c5e5e (ceph): msgr: Reorder functions to reduce the number of privacy tags.
No functional changes.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
10:48 PM CephFS Bug #1737: ceph-fuse crash in xlist::remove
ubuntu@teuthology:/a/nightly_coverage_2012-04-12-b/717
- chef: null
- ceph: null
- ceph-fuse: null
- workunit:
...
Sage Weil
10:45 PM CephFS Bug #2187: pjd chown/00.t failed test 97
2012-04-12T13:09:27.496 INFO:teuthology.task.workunit.client.0.out:../pjd-fstest-20080816/tests/chown/00.t (Wstat: ... Sage Weil
10:45 PM Revision 5cf7628f (ceph): msgr: miscellaneous fixes.
Indentation on a public label; various function documentation; remove
an inappropriate definition of a virtual functi...
Greg Farnum
10:35 PM Bug #2276 (Rejected): osd: eat cpu on restart
osd.856 on congress. Sage Weil
10:17 PM Revision 8e5f5625 (ceph): DBObjectMap: check in clone and link that source is not destination
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:35 PM Bug #2275 (Resolved): osd: crash in FileJournal::wrap_read_bl
... Sage Weil
07:16 PM Revision a5730c85 (ceph): Merge branch 'wip-2223'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
05:59 PM Revision 9565a8ba (ceph): ObjectStore: Remove code duplication when dumping transactions.
By using OStreamFormatter, we can have a single function responsible for
dumping a transaction. We keep the same old ...
Joao Eduardo Luis
05:59 PM Revision ddbaf584 (ceph): filestore: remove commented code, fix whitespace
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:59 PM Revision 7223f00a (ceph): filestore: dump transactions sets, include seq and trans_num
Lists of transactions commit as a unit; reflect that in the dump. Also
include seq and trans_num.
Signed-off-by: Sa...
Sage Weil
05:59 PM Revision 816a5128 (ceph): objectstore: tweak dump() a bit
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:59 PM Revision 308c1fc6 (ceph): filestore: use config observer to start/stop dumping/tracing
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:59 PM Revision 6ee0b718 (ceph): filejournal: dump injson
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:59 PM Revision 6a7bbe86 (ceph): filestore: dump transaction in json on error
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:29 PM Documentation #2274 (Closed): Basic Availability Model
(1) Construct a continuous-time markov availability model for a basic cluster (3 mons, 4 osds, 2 copy)
(Petri ne...
Anonymous
04:19 PM Documentation #2273 (Closed): basic reliability models
1. construct a probabilistic model for data loss in 1, 2, and 3 copy systems, assuming independent failures
2. plug ...
Anonymous
04:13 PM RADOS Documentation #2272 (Closed): FAQs: RADOS reliability and availability
I expect others to improve this, but this is just to capture the ideas.
It is probably more of a white paper than an...
Anonymous
04:06 PM Documentation #2271 (Resolved): FAQ: BTRFS vs XFS
I expect others to improve this list, but to start it out ...
what file systems we run on (and test on)
how you...
Anonymous
12:15 PM Feature #2223 (Resolved): Tracing facility on FileStore
Sage Weil
09:05 AM RADOS Feature #2268 (Resolved): crush: update item's position in crush map
via crushtool and 'ceph osd crush ...' Sage Weil
03:55 AM Bug #2267 (Closed): Ceph client crashed after shutting down one mds and osd
Ceph version: 0.44.1-1~bpo70+1
Kernel version: 3.2.12-1
Ceph config:
[global]
auth supported = cephx
keyri...
Maciej Galkiewicz
12:38 AM Revision cfac4a97 (ceph): pgmon: clear out osd reports after the OSD has gone down
Previously we never removed report times from last_osd_report. Do
so, in check_osd_map (which, on the leader, is call...
Greg Farnum

04/11/2012

11:00 PM Revision 119dd5ae (ceph): mkcephfs: update man page
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:00 PM Revision 4a4b7994 (ceph): ceph-authtool: update man page
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:53 PM Revision ab08fb8b (ceph): mkcephfs: note that btrfs (and --mkbtrfs) are optional and experimental
And that --mkbtrfs will be deprecated soon.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
10:53 PM Revision ee39291a (ceph): ceph-authtool: add warning to man page
- data is not encrypted over the wire
- intended for trusted environments
Signed-off-by: Sage Weil <sage.weil@dreamh...
Sage Weil
10:40 PM Revision 11b93d3a (ceph): osd: disable localized pgs by default
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:40 PM Revision 8836b81f (ceph): mon: alloc pgp_num adjustment up and down
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:39 PM Revision 83e1260b (ceph): mon: set pgp_num == pg_num (by default) for new pools
For when pg_num is specified but not pgp_num. Thanks Greg!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
10:39 PM Revision 58671a4c (ceph): mon: command to disable localized pgs for a pool
ceph osd pool disable_lpgs <poolname> --yes-i-really-mean-it
Grr, these should be off by default. We can't adjust t...
Sage Weil
08:35 PM Revision 7fdf25bc (ceph): debian: python-support -> dh_python2
I followed the instructions on
http://wiki.debian.org/Python/TransitionToDHPython2
Signed-off-by: Sage Weil <sage@...
Sage Weil
07:35 PM Revision ed0653b4 (ceph): COPYING: doc/ CC BY-SA
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:33 PM Revision 6e83e119 (ceph): README: update
- refer to COPYING, SubmittingPatches
- a word about dependencies
- building packages
- drop the list of built binari...
Sage Weil
06:42 PM Revision 838a7618 (ceph): ceph-rbdnamer: include in dist tarball and debs/rpms
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:42 PM Revision af502735 (ceph): obsync: include man page in tarball, packages
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:42 PM Revision 9678c097 (ceph): init-radosgw: start in runlevel 4
Fixes lintian error
W: radosgw: init.d-script-missing-start etc/init.d/radosgw 4
Signed-off-by: Sage Weil <sage@new...
Sage Weil
06:42 PM Revision 84efc554 (ceph): debian: drop unnecessary conflicts on librgw
Cut and paste baggage from libcephfs, it looks like.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:18 PM Messengers Cleanup #2150 (In Progress): repair the Simple/Messenger interface
I haven't done it, but I had enough time to glance over it and see at least a couple things that need fixing before t... Greg Farnum
05:49 PM Feature #2113: objectcacher perfcounters
Sage asked me to run it under an rbd mount and look at it. Need to get tests from Josh and then figure out how to do ... Greg Farnum
04:30 PM Feature #2113 (Fix Under Review): objectcacher perfcounters
Compile-tested. Greg Farnum
10:51 AM Feature #2113 (In Progress): objectcacher perfcounters
Yoink. Greg Farnum
05:40 PM Revision 292898a8 (ceph): init-ceph: start at all runlevels
This fixes lintian error:
W: ceph: init.d-script-missing-start etc/init.d/ceph 4
Signed-off-by: Sage Weil <sage@new...
Sage Weil
05:03 PM Revision b1946290 (ceph): Merge branch 'stable'
Sage Weil
04:30 PM Bug #2266 (Resolved): teuthology: nuke after failure is failing
it fails, and then fails to unlock, and eats up machines.
for example, ubuntu@teuthology:/a/nightly_coverage_2012-...
Sage Weil
03:08 PM Feature #2265 (Rejected): make sure objecter/kclient error out when localized pgs don't exist
Sage Weil
11:02 AM Bug #2264 (Can't reproduce): mon: failed assert in bump_epoch
During startup of a teuthology run on commit 1775301bb46379648f3f88914ef56aa1982db020 (before the cluster was healthy... Josh Durgin
10:48 AM Bug #2263 (Resolved): obsync: move man page to section 1
Sage Weil
09:25 AM Bug #2262 (Resolved): qa: osd-recovery tasks fails on flush_pg_stats
consistently Sage Weil
08:09 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Looks like the problem arose while running fsstress on the xfs loop
mount on top of a file on the ext2 filesystem.
...
Alex Elder
07:56 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
FYI, xfstests 49 tests running XFS on a loop device. I have to wait for a
reboot in order to see if I can tell at w...
Alex Elder
07:49 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Looks like xfstests #49 is a reproducer for this problem, at least
after running the tests that lead up to it first ...
Alex Elder
05:47 AM Revision be5b25b6 (ceph): filestore: fix collection_move guard
We had a sequence like:
1- write A block 1
2- write A block 2
3- write A block 3
4- write A block 4
5- move A -...
Sage Weil
05:47 AM Revision 4bd9d1bb (ceph): filestore: fix collection_add guard
If we crash between the link() and setting the guard, we will get
EEXIST. Tolerate that.
Signed-off-by: Sage Weil <...
Sage Weil
05:47 AM Revision df4d7a47 (ceph): filestore: fix collection_rename guard
If we crash between the rename and setting the guard, we can get EEXIST
or ENOTEMPTY on rename. Tolerate that.
Sign...
Sage Weil
05:47 AM Revision 85db25e8 (ceph): filestore: fix fd leak on collection_rename
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:47 AM Revision c3e4c5b7 (ceph): filestore: cleanup: flip sense of replay guard check
The other are all if (_check_replay_guard(..)) do_it;. Make this one
match.
Signed-off-by: Sage Weil <sage.weil@dre...
Sage Weil
05:43 AM Revision 43de5e4f (ceph): FileStore: dumping transactions to a file
Dump each queued transaction to a predefined file, specified with
--filestore-dump-file, in JSON format.
Signed-off...
Joao Eduardo Luis
05:43 AM Revision cd4a760e (ceph): osd: fix heartbeat set_port()
set_port() fails an assert if it isn't an in4 or in6 address, which a
default entity_addr_t is not.
Signed-off-by: S...
Sage Weil
05:29 AM Linux kernel client Bug #2261 (In Progress): paging error in libceph after crashed osd comes back online
Alex Elder
05:22 AM Linux kernel client Bug #2261 (Can't reproduce): paging error in libceph after crashed osd comes back online
... Pim van Riezen
04:43 AM Revision 1775301b (ceph): osd: reenable clone on recovery
This hasn't turned up problems in QA.
Fixes: #2002
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
02:25 AM Bug #2178: rbd: corruption of first block
Well Sage,
I have a torture-test already :-D
OK, so it's independent from yours and that's good. It sounds, we ar...
Oliver Francke

04/10/2012

11:24 PM Feature #2223: Tracing facility on FileStore
did some cleanup, changed the way the output is structured wrt the transaction lists, and tweaked a few other things.... Sage Weil
11:23 PM Revision ddb98f77 (ceph): ceph_manager: don't try to start greenlet twice
spawn already scheduled it. Trying to start it again hits an assert. Josh Durgin
11:11 PM Revision 6fbac10d (ceph): osd: allow users to specify the osd heartbeat server address.
Reported-by: Nick Bartos <nick@pistoncloud.com>
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Reviewed-by...
Greg Farnum
10:23 PM Bug #2002 (Resolved): osd: racy push/pull for clones
Sage Weil
10:19 PM Bug #2161 (Resolved): nonlinear scaling for PGMap::pg_stat encode
commit:bd518e998c0ff12d611db19a8cff6da3622597cb Sage Weil
10:18 PM Bug #1953 (Resolved): teuthology: core files aren't archived when using valgrind
it works! Sage Weil
10:10 PM Bug #2225 (Resolved): gitbuilder.ceph.com returning 503: Service Temporarily Unavailable.
Yehuda found the bad apache option.. override it in the domain_service (maxconnperip=1000 param) Sage Weil
09:56 PM Revision 4f030e1b (ceph): osd_types: fix off by one error in is_temp
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Gregory Farnum <gregory.farnum@dreamhost.com>
Samuel Just
09:49 PM Messengers Cleanup #2150 (Resolved): repair the Simple/Messenger interface
Sage Weil
09:49 PM Feature #1044 (Fix Under Review): librbd: discard support
Sage Weil
09:48 PM Revision 31f16a4c (ceph): rgw: list multipart response fix
LastModified was formatted outside of the Part block.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
09:06 PM Revision 89fecda6 (ceph): Makefile.am: remove some clutter
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
09:04 PM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
I'm going to have to look at this again in the morning, but I think
we're in this block of code:
#ifdef CONFIG_BL...
Alex Elder
08:37 PM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
Here's a disassembled block of the code where the fault occurred.
The address listed corresponds to offset 3468 belo...
Alex Elder
08:10 PM Linux kernel client Bug #2260 (Resolved): libceph: null pointer dereference at try_write+0x638+0xfb0
It's not an exact match but it's close enough that I wanted to reopen
bug 1793 or 1866, but found myself unable to. ...
Alex Elder
08:41 PM Revision 1ac5554d (ceph): kernel: kludge around mysterious 0-byte .git/HEAD files
No idea where these are coming from, but they break nodes with behavior
like
ubuntu@plana08:~$ sudo install -d -m075...
Sage Weil
05:42 PM Revision 0aea1cb1 (ceph): v0.45
Sage Weil
04:17 PM Revision 0d5918f8 (ceph): kernel: reset to remote firmware branch; don't pull
Pull might merge if upstream rebases. Just make our branch match the
remote one.
Sage Weil
04:12 PM Revision 9b755fd6 (ceph): kernel: change git incantation for firmware pull
The 'git pull <uri>' seemed to consistently fail on some nodes. Can't be
sure this was really the problem with them ...
Sage Weil
03:59 PM Revision 22b1f17f (ceph): ls: another newline
Sage Weil
03:57 PM Revision 7757fbb9 (ceph): ls: remote stray newline
Sage Weil
03:27 PM Feature #2246: force10s on sepia
Fabric brought up by Networking group. Interfaces up, configured, and working (nuttcp shows 9.5GB/s or so with
defa...
Dan Mick
01:26 PM Feature #2111: msgr workloads
I think the messenger tester may be at a point where we can call this bug satisfied. Greg Farnum
01:18 PM Bug #2178: rbd: corruption of first block
the good news is i see the problem. the bad news is its the exact bug we thought we fixed. the other good news is w... Sage Weil
07:38 AM Bug #2178: rbd: corruption of first block
Hi Sage,
just in case, the reply from yesterday did not reach you:
--- 8-< ---
Good morning,
it's already...
Oliver Francke
12:27 PM Feature #2258 (Resolved): use external leveldb package
autoconf lets you use the installed library. not doing so by default to avoid the pain of building on older distros. Sage Weil
04:22 AM Revision 965f83d4 (ceph): Merge branch 'next'
Sage Weil
04:20 AM Revision d348e1ab (ceph): configure: --with-system-leveldb
Default to bundled leveldb. Optionally check.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:20 AM Revision 34cc308e (ceph): filestore: fix leveldb includes
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:23 AM Revision 0b2e1cd2 (ceph): cephfs: fix uninit var warning
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil

04/09/2012

11:58 PM Revision 9906d5ed (ceph): Change to local mirror of linux-firmware repo to try to stop failures
Dan Mick
11:17 PM Revision f79b95e5 (ceph): Makefile: add missing .h to tarball
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:56 PM Revision 8d5c87a8 (ceph): rgw: fix object name with slashes when vhost style bucket used
Fixes issue #2259. The problem was that we were initializing the
object name, then in the case of a virtual host buck...
Yehuda Sadeh
09:02 PM Revision 853b0458 (ceph): OSD: use per-pg temp collections, bug #2255
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:08 PM Revision 36d42dea (ceph): buffer: allow advance() to move an iterator backward
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:08 PM Revision bd518e99 (ceph): encoding: fix iterator use for struct_len copy_in
The end() iterator position does not record an offset when the list is
modified.
Signed-off-by: Sage Weil <sage.weil...
Sage Weil
04:30 PM rgw Bug #2259 (Resolved): rgw: object name cut after slash when virtual host style is used
Fixed, commit:8d5c87a86e070b4e95ef0d58a469bdbbef4a826c. Yehuda Sadeh
03:42 PM rgw Bug #2259 (Resolved): rgw: object name cut after slash when virtual host style is used
Yehuda Sadeh
09:32 AM Bug #2178: rbd: corruption of first block
The missing piece of information is mapping the file offset to a block device offset. Can you, inside the VM,... Sage Weil
03:59 AM Revision 7951d7e4 (ceph): Merge remote branch 'gh/stable' into next
Sage Weil
03:58 AM Revision dd8fd168 (ceph): configure: HAVE_FALLOCATE -> CEPH_HAVE_FALLOCATE
/usr/include/linux/fs.h defines this on CentOS 5, even though it does not
in fact compile. This stupid workaround av...
Sage Weil

04/08/2012

09:53 PM Feature #2258 (Resolved): use external leveldb package
- make our configure take/require a --with-system-leveldb or similar to not use the bundled leveldb
- update the deb...
Sage Weil
08:31 AM Bug #2178: rbd: corruption of first block
Hi Sage and *Happy easter*,
yesterday I had some "luck" after 10 tries....
Here is what I have for you:
first ...
Oliver Francke

04/06/2012

09:27 PM Feature #1692 (Duplicate): librbd: Support TRIM (hole punching) (userspace client)
dup of #1044 Sage Weil
09:07 PM Revision 8e1cc8ab (ceph): init-ceph: manage pid_file from init script
With upstart the daemon shouldn't manage the pid file itself. Move this
out of the default config and into the legac...
Sage Weil
08:48 PM Revision 81d2cbeb (ceph): config: move /var/run and /var/log defaults to config_opts.h
This flips the sense of the common_init defaults. Before, the alternate
defaults were filled in if it was a daemon. ...
Sage Weil
08:39 PM Revision dfa043df (ceph): config: {osd,mon}_data default to /var/lib/ceph/$type/$cluster-$id
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:26 PM Revision 2ceda946 (ceph): Merge branch 'stable'
Sage Weil
06:44 PM Revision 7680cdad (ceph): dencoder, rgw: make ceph-dencoder load much faster
by avoiding linking with unneeded shared objects.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:00 PM Revision 98326968 (ceph): encoding: use iterator to copy_in encoded length
This gives us a pointer to the position into the list where the final
length value will be copied. Previously we use...
Sage Weil
03:47 PM rgw Feature #2257 (Rejected): rgw: detect fastcgi module 100-continue support automatically
The current default that is used doesn't work with vanilla fastcgi module. It'd be great if that could be set automat... Yehuda Sadeh
02:46 PM rbd Feature #2256 (Resolved): rbd: parallelize deletions
There are a few places where we delete things one at a time: resizing to a smaller size, deleting all snapshots, and ... Josh Durgin
02:04 PM Feature #2240 (Fix Under Review): osd: new default locations
wip-defaults Sage Weil
12:05 PM Bug #2161: nonlinear scaling for PGMap::pg_stat encode
wip-encoding Sage Weil
09:18 AM Bug #2161: nonlinear scaling for PGMap::pg_stat encode
Ake van der Meer wrote:
> My ceph-osd processes run at 100% CPU for many minutes at a time doing this: http://pasteb...
Sage Weil
08:25 AM Bug #2161: nonlinear scaling for PGMap::pg_stat encode
My ceph-osd processes run at 100% CPU for many minutes at a time doing this: http://pastebin.com/wYnPKWeJ
In src/i...
Ake van der Meer
10:05 AM Feature #2246 (In Progress): force10s on sepia
Ports being mapped yesterday and today in preparation for switch config review. Dan Mick
09:21 AM Bug #2255 (Resolved): osd: fix object name collisions between pools in temp collection
Sage Weil
08:28 AM Feature #2223: Tracing facility on FileStore
Made some changes to the ObjectStore.cc, regarding code duplication of the transaction's dump methods. Feedback would... Joao Eduardo Luis

04/05/2012

09:55 PM Revision 689ac5d7 (ceph): v0.44.2
Sage Weil
09:53 PM Revision e0c4db9e (ceph): FileStore: do not check dbobjectmap without option set
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:25 PM Revision 38e24b1e (ceph): config: include /etc/ceph/$cluster.keyring in keyring search path
mkcephfs and the docs etc still write to /etc/ceph/keyring.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:25 PM Revision 57dff032 (ceph): config: expand metavariables for --show-config, --show-config-value
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:08 PM Revision 90e88a08 (ceph): Merge branch 'wip-cluster'
Reviewed-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> Sage Weil
08:35 PM Revision cfee0333 (ceph): config: parse fsid uuid in config, not ceph_mon
Use the new OPT_UUID type.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:35 PM Revision 2c14c8b2 (ceph): config: add distinct UUID type
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:32 PM Revision 2c0dc47e (ceph): global: add -C or --cluster early args to specify cluster name
This will let you specify which cluster to talk to on the command line
(e.g., 'ceph -C foo ...' or when starting a da...
Sage Weil
08:32 PM Revision 930a669a (ceph): config: add cluster name as metavariable; use for config locations
Add a cluster name (default "ceph") to the config structure, and expand
$cluster in all config values.
Make the defa...
Sage Weil
08:25 PM Revision bda562fb (ceph): config: implement --show-config and --show-config-value <option>
Dump internal config value(s) to stdout and then exit.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:04 PM Revision f18b219a (ceph): test_workload_gen: fix logging
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:01 PM Revision 32b5d0f8 (ceph): config: remove obsolete bdev_* options
These were part of ebofs.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:59 PM Revision 1b769535 (ceph): Merge remote-tracking branch 'gh/wip-log'
Sage Weil
06:43 PM Revision 0e5d087c (ceph): README: update instructions
Needed to add submodule instructions.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
03:49 PM Revision 3d7f1db7 (ceph): Kernel: Pull linux-firmware from git
Signed-off-by: Mark Nelson <nhm@clusterfaq.org> Mark Nelson
02:21 PM Feature #2248 (Resolved): cluster naming
Sage Weil
02:20 PM Subtask #2236 (Resolved): filestore failure injection (3)
wip-filestore-failure
I don't think enumerating/identifying the callers is needed here. For the idempotency teste...
Sage Weil
01:19 PM Feature #2226: osd: better filestore idempotency test
Thought about the a bit more. The filestore failure injection is easiest to implement with an _exit(1) or something,... Sage Weil
01:13 PM Feature #1890 (Resolved): log: async log writeout
Sage Weil
01:13 PM Feature #1889 (Resolved): log: structure log records
Sage Weil
12:30 PM Feature #2254 (Resolved): doc: cephx
pending improved documentation:
* was is, is not protected
* how to convert/upgrade a non-cephx cluster to cephx (e...
Sage Weil
12:22 PM Subtask #2235 (In Progress): generate deterministic sequence of transactions (5)
Joao Eduardo Luis
10:51 AM Bug #2178: rbd: corruption of first block
Ok, my attempts to parse the log to find out of order replies is quickly snowballing. (complexity of dropped replies... Sage Weil
08:21 AM Bug #2178: rbd: corruption of first block
Oliver Francke wrote:
> Uhm...
>
> ... I thought, we were talking about the same issue since the very beginning.....
Sage Weil
01:25 AM Bug #2178: rbd: corruption of first block
Uhm...
... I thought, we were talking about the same issue since the very beginning... corruption of .rbd-blocks.....
Oliver Francke

04/04/2012

11:12 PM Revision 0df6fbd3 (ceph): rados: fix rados import
This fixes issue #2253. Wrong param order to fread().
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
11:11 PM Feature #2248 (Fix Under Review): cluster naming
Sage Weil
11:00 AM Feature #2248: cluster naming
- new ocmmand line arg (-C, --cluster)
- controls default config files
- becomes another subst ($cluster) to be use...
Sage Weil
10:38 AM Feature #2248 (Resolved): cluster naming
Sage Weil
08:56 PM Revision ba0fb3ed (ceph): cleanup-and-unlock.sh: helper to nuke and then unlock a set of nodes
I usually do something like
teuthology-lock --list-targets --owner scheduled_sage@metropolis > /tmp/b
./cleanup-an...
Sage Weil
08:54 PM Revision 3adf2bf9 (ceph): schedule_suite.sh: helper to schedule a suite
There's a bunch of stuff hardcoded in here, similar to the nightly, but
it's a useful starting point.
Sage Weil
04:09 PM Bug #2253 (Resolved): rados import: uploaded objects are empty
Fixed, commit:0df6fbd3a66741ad02c7556b0c4026dc3577d797. Yehuda Sadeh
03:37 PM Bug #2253 (Resolved): rados import: uploaded objects are empty
Yehuda Sadeh
03:33 PM rgw Documentation #1813: doc: document radosgw api diffs with s3
We'd like to have it for the current sprint, or at least no later than the next sprint. 5/1 as an upperbound target d... Yehuda Sadeh
12:45 PM Bug #2233: Throttle when there are lots of large conccurent IOs
Yeah, it's the failing gracefully bit that I'm interested in. :) Mark Nelson
12:38 PM Bug #2233: Throttle when there are lots of large conccurent IOs
Just the rados bench tool itself is allocating 16GB to feed into librados.
Now that you mention it, librados might...
Greg Farnum
12:29 PM Bug #2233: Throttle when there are lots of large conccurent IOs
Aha! The plana nodes appear to only have 8GB of ram and 8GB of swap.
Is the allocation of that memory part of libra...
Mark Nelson
11:20 AM Linux kernel client Bug #2242: rbd: spinlock on wrong cpu
OK, I think this problem arises because of the switch to a spinlock to
protect the client list. Doing so was the ri...
Alex Elder
09:53 AM Linux kernel client Bug #2242 (Resolved): rbd: spinlock on wrong cpu
... Sage Weil
11:19 AM Bug #2178: rbd: corruption of first block
Oliver Francke wrote:
> Hi Sage,
>
> I was talking about the verbose logfiles from monday. TBH, I don't expect Ba...
Sage Weil
10:32 AM Bug #2178: rbd: corruption of first block
Hi Sage,
I was talking about the verbose logfiles from monday. TBH, I don't expect BadThings without "rbd_writebac...
Oliver Francke
09:49 AM Bug #2178: rbd: corruption of first block
Oliver Francke wrote:
> Whew, that was fast,
>
> after second run I had some errors in one file with:
> [osd]
>...
Sage Weil
07:01 AM Bug #2178: rbd: corruption of first block
Whew, that was fast,
after second run I had some errors in one file with:
[osd]
filestore fiemap threshol...
Oliver Francke
05:43 AM Bug #2178: rbd: corruption of first block
Well Sage,
its harder these days to reproduce, cause I think the current version has made "something more stable"(...
Oliver Francke
10:57 AM Feature #2252 (Resolved): rgw long run kernels
Sage Weil
10:54 AM Feature #2251 (Resolved): rgw long run workloads
Sage Weil
10:53 AM Feature #2250 (Resolved): rgw long run raid config
Sage Weil
10:47 AM Subtask #2249 (Resolved): teuthology task (3)
Sage Weil
10:35 AM Feature #2246 (Resolved): force10s on sepia
Sage Weil
10:32 AM Feature #2245 (Resolved): rgw long run ceph install
Sage Weil
10:29 AM Messengers Feature #2244 (New): msgr: performance tester
Sage Weil
09:54 AM Linux kernel client Bug #2243 (Resolved): btrfs: warning in orphan_commit_root
2012-04-04T01:02:59.191518-07:00 plana32 kernel: [ 8815.371555] ------------[ cut here ]------------
2012-04-04T01:0...
Sage Weil
09:45 AM Feature #2241 (Rejected): upstart
Sage Weil
09:45 AM Feature #2240 (Resolved): osd: new default locations
Sage Weil
09:42 AM Subtask #2239 (New): install + configure package everywhere
chef! Sage Weil
09:42 AM Subtask #2238 (Rejected): vm for coredump archive
Sage Weil
09:41 AM Subtask #2237 (Resolved): failure+replay tester (8)
Sage Weil
09:39 AM Subtask #2236 (Resolved): filestore failure injection (3)
add a hook to operations that we want to potentially fail.
need to identify the caller so that the tester can pote...
Sage Weil
09:38 AM Subtask #2235 (Resolved): generate deterministic sequence of transactions (5)
Sage Weil
09:22 AM Bug #2234 (Resolved): Sometimes 'ceph -s' is unable to show pg data and crashes
ceph -s / ceph -w sometimes gives me output as below:... Szymon Szypulski
09:15 AM CephFS Feature #1237: mds caps limit mount to some subdir
Nope — as with all the other MDS stuff, this is currently not a priority. Greg Farnum
07:10 AM CephFS Feature #1237: mds caps limit mount to some subdir
Is there any progress on this issue? Maciej Galkiewicz
04:21 AM Revision 0921c062 (ceph): config: drop loud ERROR prefix
This makes gitbuilder sad.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:03 AM Revision b9185bb2 (ceph): osdmap: allow row, room, datacenter, pool in conf for initial crush map
These work just like host and rack, except that they are optional.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:02 AM Revision 4313a2d8 (ceph): crush: don't warn on skipped types
It's perfectly okay to skip some.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:01 AM Revision 56a6aa7a (ceph): osdmap: set 'default' pool type correctly
Got this wrong in e85961167eb1f37f80f263257799e4e901d17e74
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil

04/03/2012

11:33 PM Revision dd7b84a5 (ceph): ceph-fuse: fix log reopen when -f is specified
Don't restart if it wasn't stopped.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
10:56 PM Revision 1836d467 (ceph): Added assertion to check that targets > roles
Signed-off-by: Mark Nelson <mark.nelson@dreamhost.com> Mark Nelson
10:56 PM Revision 95294027 (ceph): nuke: don't run umount when no xargs args
Gets rid of this noise:
INFO:teuthology.nuke:Unmount any osd data directories...
INFO:teuthology.orchestra.run.err:U...
Sage Weil
10:40 PM Revision e8596116 (ceph): osd: define more crush types
We don't use these by default, but this way they are there should someone
want to use them.
Signed-off-by: Sage Weil...
Sage Weil
10:37 PM Messengers Bug #1674 (Need More Info): daemons crash when sent random data
FWIW I was unable to reproduce this with the current code, with or without cephx enabled. Sage Weil
10:35 PM Revision 2dbdadbe (ceph): test_rewrite_latency: check return value
Fixes warning
warning: test/test_rewrite_latency.cc:27:36: ignoring return value of ‘ssize_t pwrite(int, const void*...
Sage Weil
10:28 PM Revision 493344fd (ceph): Makefile: add mssing header
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:07 PM Bug #1627 (Can't reproduce): ceph-mon memleak if ceph-osd cluster ip is not reachable, but public...
Sage Weil
09:21 PM Revision d57d8af7 (ceph): rgw: throttle at num_threads * 2
If we throttle at num_threads, then nothing gets into the workqueue until
a worker thread is idle, which means you pa...
Sage Weil
08:44 PM Revision 1ef37ab8 (ceph): Merge remote-tracking branch 'gh/msgr-api-changes'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
08:41 PM Revision a31efd9c (ceph): filestore: print Sequencer name in debug output
And clean it up just a bit.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:22 PM Revision 756621d5 (ceph): msgr: clean up Pipe::do_sendmsg.
Document it as with the tcp stuff, remove an if(0)'d debugging block,
and remove the useless "sd" parameter since it'...
Greg Farnum
08:22 PM Revision 9f10a991 (ceph): msgr: write minimal documentation for the tcp functions.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:22 PM Revision e966c39d (ceph): msgr: make a bunch of stuff private.
Why were all these data members public? They're accessed by Pipes
and the Accepter and stuff, so maybe that's why...b...
Greg Farnum
08:22 PM Revision 096971d4 (ceph): msg: update the Dispatcher and Messenger documentation
Clarify what mark_down() and mark_down_on_empty() actually do.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost....
Greg Farnum
08:21 PM Revision 36ec8e93 (ceph): dispatcher: fix documentation for ms_handle_reset
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:21 PM Revision cbe13ab2 (ceph): msgr: rename set_ip() -> set_addr_unknowns()
The generic interface shouldn't reference specifics like that.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost....
Greg Farnum
08:13 PM Revision 607f35e7 (ceph): msgr: Remove _my_name and ms_addr, replace with direct access to my_inst.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:13 PM Revision 77f45667 (ceph): msgr: store the entity_inst_t in the Messenger.
Convert ms_addr and _my_name to be references to their fields in
the entity_inst_t my_inst.
This way we can use const...
Greg Farnum
08:11 PM Revision 6374d064 (ceph): buffer: implement a contents_equal function on bufferlists
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:11 PM Revision 5681461b (ceph): msgr: change the signature of get_myaddr()
Return a const reference to the actual address, instead of copying it.
All current users are happy with this, and I c...
Greg Farnum
08:11 PM Revision 45a76eaf (ceph): msgr: get_connection() is required to establish a connection if none ex...
Making an allowance for lossy server connections is silly. Just don't
ask for the Connection in that case. (There are...
Greg Farnum
08:10 PM Revision e80126ea (ceph): test: fix monmaptool help text
Broken by commit:15f0a3270fdcf09acce554313f2d0c0814a511e4
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
06:32 PM Revision e06436e9 (ceph): cls_rgw: guard decode
thee were few cases where decode wasn't guarded.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
Yehuda Sadeh
06:30 PM Revision ebb487a6 (ceph): cls_rgw: reset return code in some cases
Beforehand the return code was ignored, so fixed the cases
where we erroneously return error instead of success.
Sig...
Yehuda Sadeh
05:12 PM Revision a8938422 (ceph): librados: fix exec test
Return for read operations is now returned correctly.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:52 PM rgw Bug #1681: rgw: user rm with --purge doesn't remove data
Maybe we should disallow removal of user that has data? We can suspend it instead. Yehuda Sadeh
04:06 PM Revision 57f52479 (ceph): doc: disable broken 'doxygenclass' class in librados c++ doc
This is the last remaining gitbuilder error. Add it back when the C++
docs actually build.
Signed-off-by: Sage Weil...
Sage Weil
03:58 PM Revision 9d4fcd08 (ceph): Merge remote-tracking branch 'gh/stable'
Sage Weil
03:57 PM Bug #1921 (Resolved): teuthology: silently continues when len(targets) != len(roles)
Sage Weil
03:44 PM Revision e40cf8ca (ceph): test_workload_gen: fix Sequencer ctor
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
02:43 PM Feature #2226: osd: better filestore idempotency test
Sage Weil
02:32 PM Documentation #2175 (Resolved): doc: fix doc build errors
got this to yellow (only warnnings), yay! Sage Weil
01:39 PM Feature #1890: log: async log writeout
Sage Weil
01:39 PM Feature #1889: log: structure log records
Sage Weil
10:45 AM Feature #2134 (Resolved): qa: smoke suite
Sage Weil
10:31 AM Bug #2178: rbd: corruption of first block
Hi Oliver,
I have two things to try:
- 'rbd writeback window = 0'. I know it's not what you want to run, but t...
Sage Weil
10:29 AM Bug #2233: Throttle when there are lots of large conccurent IOs
That is 16GB of RAM being allocated and used — I don't remember what hardware these are running on and have no idea w... Greg Farnum
09:47 AM Bug #2233 (Won't Fix): Throttle when there are lots of large conccurent IOs
When sending large amounts of data via a single client (ie 256 concurrent 64MB IOs) we can hit a bad_alloc on the cli... Mark Nelson
09:15 AM Cleanup #2191 (Resolved): reexamine simple_spinlock
Sage Weil
08:51 AM Feature #2087 (Resolved): lightweight filestore workload generator
Sage Weil
05:04 AM Revision b5ca2fe0 (ceph): Merge remote-tracking branch 'gh/wip-name-sequencers'
Sage Weil
05:03 AM Revision d70191a8 (ceph): Merge remote-tracking branch 'gh/wip-2087'
Sage Weil

04/02/2012

08:24 PM Revision addc7446 (ceph): rgw: check for subuser existence
This fixes #1856: looking up subuser that doesn't exist returns
user as long as subuser prefix defined existing user....
Yehuda Sadeh
02:30 PM rgw Bug #1853 (Resolved): rgw: qa test to verify bucket recreation does not override bucket
Implemented, commit:1551c5b08714b415c49fc759002b7c6a6d4d611a. Yehuda Sadeh
01:26 PM rgw Bug #1856 (Resolved): It is possible to look up an rgw user by a subuser that does not exist as l...
Fixed, commit:addc744692f60885a747c4531cd12bf19b3a7f2a. Yehuda Sadeh
11:15 AM rgw Feature #2171: rgw: asynchronously calculate md5
Thinking about it some more, it's probably not the best use of time and effort. We initiate the md5 calculation after... Yehuda Sadeh
08:29 AM Bug #2178: rbd: corruption of first block
Hi Sage,
here we go again, with ceph-0.44.1-1-g41f84fa
One bad file with following infos:
20120402 171642.12...
Oliver Francke
12:04 AM Revision e792cd93 (ceph): filestore: fix ZERO fallback write
It helps if we write zeros!
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil

04/01/2012

11:24 PM Revision 8434caf5 (ceph): qa: test_rewrite_latency
Tool to measure latency of overwriting a single block.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:23 PM Bug #2221: Monitor setup bugs
2) ... Greg Farnum
06:35 PM rbd Feature #2232: qemu: resize guest disk when rbd image is resized
I tested this on Friday, and qemu rereads the size (at least when using virtio) when the guest requests it (i.e. echo... Josh Durgin
04:21 PM rbd Feature #2232 (New): qemu: resize guest disk when rbd image is resized
According to Christoph, this is probably just a matter of calling bdrv_truncate() with the new size. If that doesn't... Sage Weil
04:19 PM rbd Feature #2231 (Resolved): librbd: expose header change (resize?) via api
we need a callback or something so that users (qemu) can be informed when the header changes. this will let them, sa... Sage Weil

03/31/2012

03:22 PM Feature #1655: gitbuilder aggregator page
I took some inspiration from the updated aggregator script that is now at http://ceph.newdream.net/gitbuilder.cgi. I'... Jimmy Tang
03:31 AM Revision dbc70b9d (ceph): Merge remote branch 'gh/wip-mon_setup'
Reviewed-by: Sage Weil <sage@newdream.net> Sage Weil
03:18 AM Revision f8a53869 (ceph): osd: fix error code return from class methods
Don't shadow the result at function scope.
Fixes: #2148
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:22 AM Revision 15f0a327 (ceph): monmaptool: make clear you can set the fsid when making a new map.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
12:07 AM Revision 208daeb3 (ceph): ceph_mon: fix fsid parsing.
fsid is a field in the CephContext _conf structure and is parsed by
the standard options parsing library before it ge...
Greg Farnum

03/30/2012

11:15 PM Revision 9a69c3f3 (ceph): ceph.conf: enable 'osd recover clone overlap'
to test the recovery cloning in qa. this was redone, but forgot to enable
it in qa.
Sage Weil
11:14 PM Revision aa31035e (ceph): osd: update_stats() on reads too
Update pg stats on any op completion (read or write), not just writes. Do
the calls with log_op_stats() for consiste...
Sage Weil
11:11 PM Revision 28788654 (ceph): log: dump_recent in fatal signal handler
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:07 PM Revision f27acbc8 (ceph): Merge remote-tracking branch 'gh/wip-log'
Conflicts:
src/common/config_opts.h
Sage Weil
11:00 PM Revision 374bef9c (ceph): Merge remote branch 'gh/wip-osd-hb'
Sage Weil
10:37 PM Revision f7f65ebe (ceph): osd: fix typo in debug message
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:57 PM Revision 75e3b9b3 (ceph): Merge remote branch 'gh/wip-osd-recovery-sources'
Sage Weil
09:23 PM Revision df5860fe (ceph): objectstore: name Sequencers
Assign a (unique) name to each Sequencer. This will aid in debugging, and
can be useful when dumping traces of FileS...
Sage Weil
09:11 PM Cleanup #2230 (Resolved): deprecate 'btrfs devs'
Sage Weil
09:00 PM rgw Feature #2229 (New): rgw: functional tests for rgw class
A series of simple functional tests to verify the rgw class methods behave as they should. Sage Weil
08:58 PM Bug #2148 (Resolved): osd: class error return not propagated to client
commit:f8a53869f6db4c76516ee525f00f87f930920692 Sage Weil
06:57 PM Revision 29c01f25 (ceph): ceph_common.sh: Remove dead code.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
06:27 PM Revision ba6bb4cf (ceph): man: Oops, update ceph-mon(8) for real. Sorry about that.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
06:26 PM Revision 541a543c (ceph): man: Update ceph-mon(8) after reStructuredText syntax fixes.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
06:16 PM Revision 2c542442 (ceph): doc: Remove duplicate anchor from (unused) overview doc.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
06:11 PM Revision 1ec47db1 (ceph): doc: Convert the mailing list mention to not be a section heading.
If toctree is inside a section, the subtree is inside the section too.
We don't want all of dev/* to be under "Mailin...
Tommi Virtanen
06:11 PM Revision b162696b (ceph): doc: Fix reStructuredText syntax errors.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
05:52 PM Revision 2d1a96d3 (ceph): add include/stringify.h
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:27 PM Bug #2221: Monitor setup bugs
(1) is a problem due to options parsing collisions...fixed!
(2) is directly contradicted by my testing...?
(3) I ne...
Greg Farnum
04:59 PM Revision b25817a5 (ceph): FileJournal: check pwrite return value when zeroing journal
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:52 PM Revision 41f84fac (ceph): filestore: set guard on collection_move
During recovery we submit transactions like:
- delete a/foo
- move tmp/foo to a/foo
This prevents the EEXIST chec...
Sage Weil
04:25 PM Bug #2026 (Can't reproduce): osd: ceph::HeartbeatMap::check_touch_file
Sage Weil
04:25 PM Bug #2045 (Can't reproduce): osd: dout_lock deadlock
haven't seen this in a while.
also, this code is about to go away anyway with wip-log.
Sage Weil
04:16 PM Bug #2102 (Can't reproduce): osd: pg stuck in backfill
Sage Weil
04:15 PM Bug #2102 (Duplicate): osd: pg stuck in backfill
Sage Weil
04:14 PM Bug #2002: osd: racy push/pull for clones
i take that back; this wasn't enabled in qa. adding to the teuthology ceph.conf file. Sage Weil
04:12 PM Bug #2002 (Resolved): osd: racy push/pull for clones
haven't seen this in forever; looks fixed. Sage Weil
04:11 PM Bug #2209 (Resolved): osd: read kb stats not tracked?
commit:aa31035e555129e56888320b84f16264f28bd7df Sage Weil
03:59 PM Bug #2116 (Resolved): Repeated messages of "heartbeat_check: no heartbeat from"
fixed by commit:374bef9c97266600b4c6b83100485d7250363213 Sage Weil
03:59 PM Bug #2165 (Resolved): osd: recovering ending with missing
fixed with merge of commit:75e3b9b309e5365975e3e5855c065bd4fe28b64c Sage Weil
03:58 PM Bug #2178: rbd: corruption of first block
Sage Weil
02:51 PM Bug #2178: rbd: corruption of first block
Please build the current git stable branch, which includes 41f84fac1ae4b4c72bf9bfe07614c4066c916fd1. The version sho... Sage Weil
07:35 AM Bug #2178: rbd: corruption of first block
Here the remaining timestamps from the other VM's with bad blocks:
VM-2:
20120330 105139.579830 filling block 171...
Oliver Francke
07:12 AM Bug #2178: rbd: corruption of first block
Hi *,
I needed a couple of runs, but managed now to provide some 81MiB/97MiB osd.X.log-files, where in between sh....
Oliver Francke
03:58 PM Bug #2164 (Resolved): osd: scrub missing _, snapset attrs
commit:41f84fac1ae4b4c72bf9bfe07614c4066c916fd1 Sage Weil
03:49 PM Revision f89f98df (ceph): osd: clear RECOVERING on start_peering_interval
This prevents us from, say, getting into a recovering+stray state.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:45 PM Revision 3cdd8d58 (ceph): osd: more heartbeat debug
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:45 PM Revision e1a58912 (ceph): osd: discard heartbeat_peer in note_down_osd
Discard the heartbeat_peer as soon as we find out, along with queued
failures, or else the heartbeat_check may come a...
Sage Weil
03:45 PM Revision 21e6e2b8 (ceph): osd: ignore peer epoch of 0 on ping reply
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:45 PM Revision efc27f19 (ceph): osd: don't fail new heartbeat peers
last_tx may be 0 because we just added this peer; don't mark them down
yet!
Signed-off-by: Sage Weil <sage.weil@drea...
Sage Weil
03:45 PM Revision 33b9187a (ceph): osd: rename hbin -> hbclient, hbout -> hbserver
This is way less confusing.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:44 PM Revision 4e2f0d14 (ceph): osd: simplify heartbeat logic
Simplify heartbeats to use a simple request/reply model.
- avoid any weirdness with map update timing
- no from/to...
Sage Weil
03:44 PM Revision fe5f0331 (ceph): osd: send pings from hbin
Fixes: #2212
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:32 PM Revision eebc9ec2 (ceph): test: test_workload_gen: Add callback for collection destruction.
When we remove a collection, we must cleanup after the coll_entry_t we
once had on the available collections set. For...
Joao Eduardo Luis
01:53 PM Revision 424b5b07 (ceph): ceph: --concise by default, add --verbose option
It's time.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:50 PM Feature #2227 (Closed): QA: create a test to verify operation with non-default layouts
I submitted a patch that modified ceph_calc_file_object_mapping()
in the ceph client, and when reviewing it Sage poi...
Alex Elder
09:53 AM Feature #2226 (Resolved): osd: better filestore idempotency test
... Sage Weil
03:31 AM Revision 409b648b (ceph): config: drop old debug_* items
...and replace code references with conf->subsys.should_gather().
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
02:16 AM Revision 5d981b15 (ceph): rgw: add unittest just to verify we link
This will flush out references to stuff in libglobal.la, among other
things.
Signed-off-by: Sage Weil <sage.weil@dre...
Sage Weil
02:06 AM Revision 69b01726 (ceph): config: fix librados, libcephfs unit tests
No more g_conf->debug.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
01:08 AM Revision 394d8b1e (ceph): Add test for object source marked down
Samuel Just
01:08 AM Revision b4aa098f (ceph): make Thrasher not inherit from Greenlet
Samuel Just
01:02 AM Revision 1c8ec702 (ceph): PG,ReplicatedPG: update missing_loc_sources with missing_loc
In some cases missing_loc was updated without missing_loc_sources
Signed-off-by: Samuel Just <samuel.just@dreamhost....
Samuel Just
01:02 AM Revision 05ef3ba6 (ceph): ReplicatedPG: fix loop in check_recovery_sources
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
12:35 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
I think I can be optimistic :)... Wido den Hollander

03/29/2012

10:06 PM Bug #2178: rbd: corruption of first block
Okay, I suspect this is actually bug #2164, which was causing the _ xattr to get lost when ceph-osd restarts on non-b... Sage Weil
09:52 PM Bug #2225 (Resolved): gitbuilder.ceph.com returning 503: Service Temporarily Unavailable.
I can't find any 503 in the apache logs on this machine. Could it be on the client side? Sage Weil
09:48 PM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
Well, I fixed one problem, but I can't see how it could have resulted in the log you posted.
Pushed a few more pat...
Sage Weil
11:36 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
I collected logs from 4 OSDs, they can be downloaded at: http://logger.ceph.widodh.nl/ceph/issues/2212/
At 10:13 t...
Wido den Hollander
09:21 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
Der.. do you have a log you can attach/post? Sage Weil
02:59 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
I reverted the extra debugging for the heartbeat stuff, but that didn't seem to consume all the CPU time.
The load...
Wido den Hollander
01:40 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
I just installted the code on my cluster and things do not seem to behave yet.
The cluster is still jumping around...
Wido den Hollander
08:54 PM Linux kernel client Bug #1940 (Resolved): locking cycle in ceph_osdc_start_request
commit:ab434b60ab07f8c44246b6fb0cddee436687a09a Sage Weil
08:15 PM Revision 41a09bea (ceph): Merge remote branch 'upstream/wip_latency'
Samuel Just
07:53 PM Linux kernel client Bug #1793 (Can't reproduce): NULL pointer dereference at try_write+0x627/0x1060
Marking this Can't Reproduce. Will reopen if it shows up again. Alex Elder
03:21 PM Linux kernel client Bug #1793: NULL pointer dereference at try_write+0x627/0x1060
Another 100 iterations of kernel_untar_build.sh using the current
master branch (c666601a935b94cc0f3310339411b6940de...
Alex Elder
07:51 AM Linux kernel client Bug #1793: NULL pointer dereference at try_write+0x627/0x1060
Bugs 1793 and 2081 have a signature of a page fault/bad memory reference
from process_one_work() -> con_work(), and ...
Alex Elder
07:53 PM Linux kernel client Bug #2069 (Can't reproduce): client crash during kernel_untar_build rm -r step
I just finished at least 150 iterations of kernel_untar.sh and never
hit this using the current master branch of cep...
Alex Elder
07:51 PM Linux kernel client Bug #2081 (Can't reproduce): msgr: spinlock badness?
Marking this Can't Reproduce. Will reopen if it happens again. Alex Elder
07:43 PM Linux kernel client Bug #2081: msgr: spinlock badness?
Another 100 iterations of kernel_untar_build.sh using the current
master branch (c666601a935b94cc0f3310339411b6940de...
Alex Elder
07:51 AM Linux kernel client Bug #2081 (Need More Info): msgr: spinlock badness?
Bugs 1793 and 2081 have a signature of a page fault/bad memory reference
from process_one_work() -> con_work(), and ...
Alex Elder
07:50 PM Linux kernel client Bug #2174 (Can't reproduce): rbd: iozone thrashing failure
OK, I'll go ahead and state that I can't reproduce this... Alex Elder
07:46 PM Linux kernel client Bug #2174: rbd: iozone thrashing failure
Status was Verified. Changing it to Need More Info because I can't even
seem to reproduce it at this point. (I sup...
Alex Elder
07:44 PM Linux kernel client Bug #2174: rbd: iozone thrashing failure
Another 12 iterations of suites/iozone.sh using the current
master branch (c666601a935b94cc0f3310339411b6940de751ba)...
Alex Elder
07:59 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
I don't know whether we've adequately captured the signature or symptoms
of this problem. I believe though that it ...
Alex Elder
07:20 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
I have been trying to reproduce this using the latest testing/master/for-linus
branch (they're the same right now) a...
Alex Elder
02:34 PM Revision c39ed568 (ceph): test: test_workload_gen: Fixing a memleak.
Apparently, the FileStore does not cleanup after transactions once they
are applied, which may lead to huge memory le...
Joao Eduardo Luis
09:27 AM Linux kernel client Bug #2224 (Rejected): Oops in __cfh_to_dentry
I setup an HA pair of NFS servers which re-export Ceph to NFS clients.
The HA pair is in active/standby mode, using...
Henry Chang
07:42 AM Feature #2087: lightweight filestore workload generator
Memory leak fixed.
Apparently, the FileStore does not cleanup after transactions once they are applied, which may ...
Joao Eduardo Luis
06:21 AM Feature #2087 (In Progress): lightweight filestore workload generator
Looks like some memory should be leaking bad, such that valgrind hangs on exit.
==19080==
==19080== HEAP SUMMARY...
Joao Eduardo Luis
07:24 AM Linux kernel client Bug #2064 (Resolved): ceph-client: messenger: nocrc flag not implemented correctly
Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved.
Alex Elder
07:12 AM Linux kernel client Bug #2157 (Resolved): ceph: xattr: fix nanosecond display on i_rctime
Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved.
Alex Elder
07:12 AM Linux kernel client Bug #2156 (Resolved): ceph: xattr: fix a possible buffer overrun bug
Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved.
Alex Elder
07:11 AM Linux kernel client Bug #2155 (Resolved): ceph: xattr: wrong value assumed for "no preferred PG"
Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved.
Alex Elder
05:56 AM Feature #2223 (Resolved): Tracing facility on FileStore
Allow a user to specify a file onto which log the transactions that come through OSDs' FileStores.
This should all...
Joao Eduardo Luis
05:47 AM Revision b3069e50 (ceph): ceph_argparse: drop useless declaration from unit test
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:16 AM Revision 4269f8d5 (ceph): ReplicatedPG: ctx might not contain an OpRequest
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
12:16 AM Revision 135a11ba (ceph): FileJournal: optionally zero journal on create
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
12:15 AM Revision 2486c61a (ceph): FileStore: Pass OpRequestRef into filestore in queue_transaction
This allow us to track op progress through the filestore.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
12:15 AM Revision d026cdc7 (ceph): FileJournal: use DSYNC for directio path
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
12:15 AM Revision 533bbf7b (ceph): osd/: OpRequest implements TrackedOp for passing into filestore
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just

03/28/2012

11:12 PM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
Ah, I see the bug now. Pushed a fix to wip-osd-hb, thanks!
Let us know if this behaves for you.. if so I'll pull ...
Sage Weil
04:23 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
It's quite large (222MB), so I uploaded the file, available at: http://logger.ceph.widodh.nl/ceph/osd.1.log_27-03-201... Wido den Hollander
10:51 PM Bug #2165: osd: recovering ending with missing
see wip-osd-recovery-sources Sage Weil
10:46 PM CephFS Bug #1811: 2 pjd chown tests failed on cfuse
... Sage Weil
04:02 PM Revision 4f0d170a (ceph): test: test_workload_gen: Change CLI option and add '--help' usage.
With this commit, we support the following options (and old ones are no
longer available):
--test-num-colls VAL ...
Joao Eduardo Luis
03:34 PM Revision 18d219e5 (ceph): rgw: replace dout with ldout
librgw can't use g_ceph_context
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
03:21 PM Feature #2222: osd: distinguish between 'degraded' and 'misplaced'
We should pick a designator that doesn't make it sound like the objects are lost. Greg Farnum
02:27 PM Feature #2222 (Resolved): osd: distinguish between 'degraded' and 'misplaced'
normal data migration happens with a acting set > the up set, so that we never drop below N replicas, but we still ca... Sage Weil
02:45 PM Feature #2087: lightweight filestore workload generator
Joao Eduardo Luis
02:07 PM Bug #2221 (Resolved): Monitor setup bugs
Carl reported several configuration issues when creating new monitors (based on the instructions at http://ceph.newdr... Greg Farnum
01:59 PM Revision a3bdf055 (ceph): test: test_workload_gen: Default arguments, and minor changes.
Besides adding support for default arguments, passed onto global_init(),
this commit fixes a conflict in Makefile.am,...
Joao Eduardo Luis
01:32 PM Revision 37cdbcd4 (ceph): log: fix up unittest
Fewer entries; compile.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:35 AM rgw Bug #2220 (Resolved): rgw: librgw dep on g_ceph_context
Fixed, commit:18d219e512a8e0f427a2229a71e15869cac3b593. Yehuda Sadeh
07:16 AM rgw Bug #2220 (Resolved): rgw: librgw dep on g_ceph_context
from last night's qa,... Sage Weil
04:37 AM Bug #2219: OSD's commit suicide with 0.44
I accidentally removed the core file(s) :(
Hope this one pops up again so I have a core file.
Wido den Hollander
04:11 AM Linux kernel client Tasks #2138: rbd: run xfstests on a local XFS filesystem over RBD
After setting up two rbd devices and making some fairly simple changes
to xfstests, then setting up appropriate envi...
Alex Elder
04:04 AM Linux kernel client Bug #2155: ceph: xattr: wrong value assumed for "no preferred PG"
This got rebased: 3489b42a72a41d477665ab37f196ae9257180abb
This has been sent as part of a pull request to Linus ...
Alex Elder
04:04 AM Linux kernel client Bug #2156: ceph: xattr: fix a possible buffer overrun bug
This got rebased: 3489b42a72a41d477665ab37f196ae9257180abb
This has been sent as part of a pull request to Linus ...
Alex Elder
04:03 AM Linux kernel client Bug #2157: ceph: xattr: fix nanosecond display on i_rctime
This got rebased: 3489b42a72a41d477665ab37f196ae9257180abb
This has been sent as part of a pull request to Linus ...
Alex Elder
04:01 AM Linux kernel client Bug #2064: ceph-client: messenger: nocrc flag not implemented correctly
It got rebased once more, and this should be the last:
37675b0f42a8f7699c3602350d1c3b2a1698a3d3
This has been s...
Alex Elder
03:52 AM Bug #2178: rbd: corruption of first block
Hi,
I decided to upgrade to "latest-n-greatest" in the test-cluster, to make sure, that if I hit the error again w...
Oliver Francke
02:58 AM Revision 94e3abf8 (ceph): Merge branch 'stable'
Sage Weil
12:22 AM Revision 8948ad01 (ceph): test: test_workload_gen: CodeStyle compliance and cleanup.
This commit aims at the compliance with Ceph's CodeStyle, as well
as cleaning up some lingering unused code.
Also, n...
Joao Eduardo Luis
12:22 AM Revision d172b40c (ceph): test: test_workload_gen: Destroy collections.
Joao Eduardo Luis
12:22 AM Revision 3770096a (ceph): test: test_workload_gen: Mimic an OSD's workload.
In it's current state, the workload generator will queue a lot of
transactions onto the FileStore, and will wait if n...
Joao Eduardo Luis
12:18 AM Revision 749826c2 (ceph): allow use of a separate journal block device
Samuel Just

03/27/2012

11:44 PM Revision ffc468f2 (ceph): osdmap: less noisy about osd additions during buildmap
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
11:44 PM Revision 36c2f27d (ceph): osdmaptool: fix clitest conf filename
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
11:37 PM Revision ca1f79b5 (ceph): dout: no newlines on dout_emergency
Preserve old behavior to avoid breaking all the cli tests.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
11:27 PM Revision d5360968 (ceph): throttle: fix off by one issue
We were blocking only if we exceeded max count, not if
we reached it.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdrea...
Yehuda Sadeh
11:23 PM Revision a52d048a (ceph): rgw: throttle incoming requests
Don't accept more than the number of threads, otherwise if cluster is
backed up for any reason we'd end up exhausting...
Yehuda Sadeh
11:16 PM Revision 30cadf01 (ceph): prebufferedstreambuf: fix typedef
'typename' not allowed here:
./common/PrebufferedStreambuf.h:27: error: using 'typename' outside of template
Signed...
Sage Weil
10:35 PM Revision 93ba4c00 (ceph): Merge branch 'wip-intent-fixes'
Yehuda Sadeh
10:35 PM Revision ca4fab47 (ceph): Merge branch 'master' of ssh://github.com/ceph/ceph
Yehuda Sadeh
10:35 PM Revision 16b60b3e (ceph): rgw: minor style fixes
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
10:12 PM Revision 4d74a7b2 (ceph): osd: fix handling of recovery sources when osds go down
If a source osd goes down, we need to
- reset any pulls (already did that before)
- remove peer from missing_loc s...
Sage Weil
10:03 PM Revision 8fdde24c (ceph): osd: remove down osds from peer_*_requested maps
This will leave less crap around to confuse recovery if a source osd goes
down and then up.
Signed-off-by: Sage Weil...
Sage Weil
10:02 PM Revision 1ee60873 (ceph): osd: maintain missing_loc_sources
This is a superset of all missing_loc values... everywhere we might
pull an object from, or are currently pulling fro...
Sage Weil
09:37 PM Revision 5dbb9715 (ceph): rgw: all intent log operations are now async
That includes removing a directory index object, and the removal of
the actual intent log object.
Signed-off-by: Yeh...
Yehuda Sadeh
09:20 PM Revision 0b1e3ed4 (ceph): osd: increase default heartbeat_interval to 6 seconds
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:12 PM Revision 69844496 (ceph): rgw: remove pool_list(), can't list_objects() on system buckets
pool_list() was broken, replaced now with pool_iterate(). list_objects()
shouldn't be used any more with system bucke...
Yehuda Sadeh
09:04 PM Revision 2e9079cf (ceph): rgw: intent log processing uses new pool_iterate()
intead of pool_list(), which is broken (assuming pgls results are
sorted, which are not).
Signed-off-by: Yehuda Sade...
Yehuda Sadeh
08:57 PM Revision 1814aac1 (ceph): Merge branch 'misc-fixes-for-review'
Greg Farnum
08:57 PM Revision d5c4015d (ceph): uclient: We want to release cache when we lose the CACHE cap, not gain it!
Looks like this was detected as a problem back in
84644dc56183b67050793a1b8da07850508b29d6 but the fix wasn't complet...
Greg Farnum
08:57 PM Revision c3b04644 (ceph): paxos: share_state sends every unknown value, including the stashed one
Sage points out that the stashed object might not be the same as the
one we actually archive. For instance, OSDMonito...
Greg Farnum
08:57 PM Revision 2acf4aea (ceph): mon: Paxos needs to store the latest version permanently on-disk.
Previously it was only storing this m->latest_value in the stash,
which of course got overwritten. And then when some...
Greg Farnum
08:57 PM Revision d0ba27ae (ceph): doc: add a short thing on kernel client troubleshooting.
I just noticed this sitting uncommitted in my tree.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
08:02 PM Revision c89b7f22 (ceph): v0.44.1
Sage Weil
06:35 PM Revision 6044c5b8 (ceph): hadoop: define subsystem, fix logging
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:31 PM CephFS Bug #2218: CephFS "mismatch between child accounted_rstats and my rstats!"
The MDS log is at https://matthew.royhousehold.net/mds.a.log.1.gz (1505MB, md5 197ef232d50d27e2b7c2f62370c9c6b6) Matthew Roy
02:45 PM CephFS Bug #2218 (Need More Info): CephFS "mismatch between child accounted_rstats and my rstats!"
There's not enough info in the attached log to figure out what happened. I can tell you that your home directory beli... Greg Farnum
06:20 PM Revision ce61a83f (ceph): log: throttle message submission, trim recent
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:20 PM Revision fe56818e (ceph): config: configure log thresholds
- max new entries before we wait for flush
- max recent entries to keep around
Signed-off-by: Sage Weil <sage@newdre...
Sage Weil
06:05 PM Revision 339956df (ceph): log: don't spam -1 to syslog; add err_to_syslog for consistency
This matches the stderr settings.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:44 PM Revision 17a95c22 (ceph): log: use PrebufferedStreambuf
It's faster than ostringstream!
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:44 PM Revision bfa2bcd7 (ceph): prebufferedstreambuf: fix get_str()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:43 PM Revision 0e3c0c44 (ceph): bench_log: flush
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:42 PM Revision 3a87e452 (ceph): log/EntryQueue: no implicit trim
dequeue() things explicitly if you want to remove them.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:41 PM Revision f66e0750 (ceph): utime_t: sprintf() method
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:41 PM Revision 6ab85264 (ceph): do_autogen: control optimization level
-O 2 -> -O2
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:41 PM Revision a4509273 (ceph): common: add PrebufferedStreambuf
Simple streambuf that uses a preallocated buffer, and then spills over
into a std::string if necessary.
Signed-off-b...
Sage Weil
05:41 PM Revision 23f0af3c (ceph): test log performance with PreallocatedStreambuf
- faster than ostringstream in optimistic case
- same as ostreamstream + std::string assignment in worst case (use
...
Sage Weil
05:41 PM Revision 8c5046fa (ceph): bench_log: simple util to time how long it takes to log stuff
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:41 PM Revision 362ca19b (ceph): log: move create_entry() into Log interface
This will let us be smarter than putting it on the heap.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:41 PM Revision c7242bfe (ceph): log: flush on_exit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:41 PM Revision abfadb9b (ceph): assert: dump recent log entries on failed assertions
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:41 PM Revision f41887e3 (ceph): log: new logging infrastructure
- explicitly defined subsystems, and ceph_subsys_FOO enums to go with them
- modular log system with Entry object
- s...
Sage Weil
04:26 PM rgw Bug #2197 (Resolved): rgw: need to throttle incoming requests
Fixed, commit:a52d048ac429c3d2b6a9286d96253308f6588762. Yehuda Sadeh
04:10 PM Bug #2178: rbd: corruption of first block
The next step is to reproduce the corruption on the test cluster with logs:
debug osd = 20
debug ms = 1
debug...
Sage Weil
08:37 AM Bug #2178: rbd: corruption of first block
Well,
one more comment:
my guess would be, it has todo something with expansion of the "sparse-file" while writin...
Oliver Francke
05:24 AM Bug #2178: rbd: corruption of first block
Good morning ;)
meanwhile I have not been lazy. I've managed - with current setup in test-cluster - to produce "in...
Oliver Francke
04:07 PM Bug #2164: osd: scrub missing _, snapset attrs
wip-2164
it's a problem with the collection_move guard (or lack thereof)
Sage Weil
03:40 PM rgw Bug #2208 (Resolved): rgw: radosgw-admin temp remove failure
Fixed, merged at commit:93ba4c004a9269148a75b67da2522855cb1842a3. Yehuda Sadeh
02:19 PM Bug #2219 (Need More Info): OSD's commit suicide with 0.44
Can you look at the core file and 'thread apply all bt'? Sage Weil
05:57 AM Bug #2219: OSD's commit suicide with 0.44
... Wido den Hollander
05:03 AM Bug #2219 (Can't reproduce): OSD's commit suicide with 0.44
I noticed this myself today, but on IRC somebody else came along:... Wido den Hollander
02:03 PM Bug #2199 (Resolved): mon: get_bl osdmap_full/9583 No such file or directory
Merged to master in commit:1814aac17593dee0fa4c774d5b462f277f6698da, reviewed by Sage — even though I forgot to add t... Greg Farnum
12:25 PM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
Can you attach the full osd.1 log? Sage Weil
12:36 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
Over night I saw 16 OSD's go down with the same backtrace.
All OSD's were running with debug ms/osd set to 1, this...
Wido den Hollander
09:07 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
I've been off on other things, but this problem apparently recurred
even if the latest check-in (Josh's change) in p...
Alex Elder
08:38 AM CephFS Bug #2217: sync and O_DIRECT writes only write first extent in iov vector
The code should not be written that way.
However I think it doesn't matter at this point, because the only caller
...
Alex Elder

03/26/2012

11:48 PM Revision 974a2013 (ceph): objecter: don't call op_throttle_ops.take(1) unconditionally
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
11:17 PM Revision 679cd1fe (ceph): objecter: add in-flight ops throttling
In addition to ops length, we also want to throttle it by
actual number of ops.
Signed-off-by: Yehuda Sadeh <yehuda@...
Yehuda Sadeh
10:02 PM Revision d6b0cbd4 (ceph): config: use our assert
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:40 PM Revision c3dc6a6e (ceph): msg: assert pipe->msgr == msgr
Fixes: #2216
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:57 PM Revision e30b7710 (ceph): rbd: fix typo in default config
pyflakes would have caught this if 'all' weren't a built-in function Josh Durgin
06:43 PM Revision 483fcf80 (ceph): doc: include crush in toctree
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:30 PM Revision 3bd1f18e (ceph): doc: few notes on manipulating the crush map
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:24 PM CephFS Bug #2218 (Resolved): CephFS "mismatch between child accounted_rstats and my rstats!"
The mismatch is detected at 2012-03-26 18:39:54.306661... Matthew Roy
05:15 PM Revision 6db77158 (ceph): doc/dev/peering.rst: fix typo
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:39 PM Revision 1a0360cb (ceph): osd/: OpRequest is no longer a RefCountedObject, remove puts/gets
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:38 PM Revision ea377a08 (ceph): osd/: Convert OpRequest* to OpRequestRef
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:38 PM Revision 2cb6c7d0 (ceph): OSD: Add typedef for shared_ptr<OpRequest>
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:36 PM Revision 3ed784c9 (ceph): osd/: add mark_event to OpRequest and move tracking into OpTracker
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
03:51 PM Bug #2192: ceph-mon hangs consuming 100% CPU
It was reproduced all the time, for 0.44 also. After I adjusted cluster to have only one monitor problem has gone. (U... Vladimir Kulev
02:44 PM CephFS Bug #2217 (Resolved): sync and O_DIRECT writes only write first extent in iov vector
static ssize_t ceph_aio_write(struct kiocb *iocb, const struct iovec *iov,
unsigned long nr_segs, loff_t po...
Sage Weil
01:34 PM Bug #2199 (Fix Under Review): mon: get_bl osdmap_full/9583 No such file or directory
Re-pushed misc-fixes-for-review. Greg Farnum
09:59 AM Bug #2199 (In Progress): mon: get_bl osdmap_full/9583 No such file or directory
Sage pointed out the stash data structure isn't necessarily the same as the other stored data structures, so this nee... Greg Farnum
12:47 PM Messengers Cleanup #2216 (Resolved): SimpleMessenger should make sure it owns passed-in Connections
Sage Weil
10:50 AM Messengers Cleanup #2216 (Resolved): SimpleMessenger should make sure it owns passed-in Connections
Otherwise we get weird issues like #2212. Greg Farnum
12:38 PM Cleanup #2191: reexamine simple_spinlock
my log branch drops this for the dout logging. the last user is the buffer.h debugging (enabled manually via a macro... Sage Weil
12:06 PM RADOS Bug #2047: crush: with a rack->host->device hierarchy, several down devices are likely to cause b...
fwiw dropping the local search behavior fixes this bad behavior. the question is what probably was the local search ... Sage Weil
11:27 AM RADOS Bug #2047: crush: with a rack->host->device hierarchy, several down devices are likely to cause b...
Sage Weil
11:27 AM Bug #2210 (Duplicate): osd: some PGs remains remapped or degraded
this is actually a crush problem, see #2047. Sage Weil
09:45 AM Bug #2210: osd: some PGs remains remapped or degraded
#2173 has some osd logs and related info for the same problem on a less clean cluster. Thanks for the detailed steps ... Josh Durgin
10:36 AM CephFS Fix #2215 (Resolved): ceph-fuse does not invalidate page cache
Right now the userspace client doesn't invalidate the page cache when it loses the cache capability on an inode. Appa... Greg Farnum
09:58 AM Bug #2212 (Resolved): osd: FAILED assert(msgr->lock.is_locked())
ah, i was using wrong msgr, fixing! Sage Weil
05:50 AM Bug #2212 (Resolved): osd: FAILED assert(msgr->lock.is_locked())
With the new heartbeat code I noticed a couple of OSD's go down with:... Wido den Hollander
09:58 AM RADOS Bug #2214 (Resolved): crush: pgs only mapped to 2 devices with replication level 3
This is from #2173. Note that all 3 osds are up.... Josh Durgin
09:43 AM Bug #2173 (Resolved): MDS crash when start with end of buffer
Josh Durgin
06:04 AM Feature #2213 (Resolved): rbd: shouldn't need config file to get help
I just ran "rbd --help" on a pretty much un-configured machine and got:
global_init: unable to open config file.
...
Alex Elder
05:22 AM Bug #2211 (Resolved): osd: entity_inst_t OSDMap::get_inst(int) const
While trying out the new heartbeat code I encountered this crash:... Wido den Hollander
03:28 AM Revision e478a758 (ceph): vstart: enable omap for xattrs
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil

03/25/2012

08:39 PM Bug #2173: MDS crash when start with end of buffer
Shall we colse this bug, as the mds server was recovered by providing an empty session map and we can not reproduced ... soft crack
08:39 PM Bug #2210 (Duplicate): osd: some PGs remains remapped or degraded
Some PGs remains 'remapped' or 'degraded' status after adding an osd server.
The steps to re-produce the bugs:
1....
soft crack
03:05 PM Revision f4b2097a (ceph): Merge remote branch 'gh/wip-doc-peering'
Sage Weil
02:57 PM Revision d3bcac24 (ceph): Makefile: fix modules that cannot find pk11pub.h when compiling with NS...
Signed-off-by: Jim Schutt <jaschut@sandia.gov>
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Jim Schutt
02:57 PM Revision 3ab28950 (ceph): don't override CFLAGS
leveldb adds -I flags to CFLAGS and CXXFLAGS, but if these macros are
overridden in the make command line, the flags ...
Alexandre Oliva
09:54 AM Feature #2087: lightweight filestore workload generator
Pushed a new commit to [1], making the code compliant with the CodeStyle and with Sage's suggestions on github.
[1...
Joao Eduardo Luis
04:47 AM Revision ef17c8c9 (ceph): add smoke suite
This could probably be collapsed into a bunch of singleton tasks to make
it simpler to track how many actual jobs res...
Sage Weil
04:20 AM Revision b5641ef3 (ceph): rgw: don't #include fcgi from rgw_common.h
ceph-dencoder #includes rgw_common.h, and needs to build even when
--without-radosgw is specified and libfcgi isn't i...
Sage Weil
04:09 AM Revision 1c1192a9 (ceph): backfill: use 'rbd' pool instead of 'data'
(data has a replay interval, which makes writes take longer to resume
after repeering)
Sage Weil
04:09 AM Revision 397e7f2f (ceph): add osd_recovery task to test divergent osd logs
Sage Weil

03/24/2012

11:07 PM Revision 24910c3b (ceph): add osd-recovery test
Sage Weil
11:07 PM Revision 6bf9c957 (ceph): renamed backfill -> osd_backfill
Sage Weil
11:05 PM Revision ca9a5a4a (ceph): rename backfill -> osd_backfill
Sage Weil
10:36 PM Revision 22e80874 (ceph): put filestore xattr option in [global]
...for test_filestore_idempotent's benefit Sage Weil
09:41 PM Feature #2134: qa: smoke suite
Sage Weil
09:04 PM Feature #1802 (Resolved): qa: test to exercise divergent osd logs
Sage Weil
03:10 PM Bug #2192: ceph-mon hangs consuming 100% CPU
Is this reproducible? Are you able to connect to the ceph-mon process with gdb? Sage Weil
03:06 PM Bug #2185 (Won't Fix): osd/ReplicatedPG.cc: 5938: FAILED assert(r >= 0) in ReplicatedPG::scan_ran...
Sage Weil
08:13 AM Feature #2087: lightweight filestore workload generator
Pushed a working version to ceph's git repository, branch wip-2087 [1]. Feedback would be appreciated.
[1] - https...
Joao Eduardo Luis

03/23/2012

08:27 PM Revision 2ec8f27f (ceph): rados_bench: generate_object_name now takes a buffer length
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
05:27 PM Bug #2209 (Resolved): osd: read kb stats not tracked?
Sage Weil
01:21 PM Bug #2196: `rados bench` will write test objects with a constant oid, under-reporting performance.
2ec8f27f58adca40d125051a23547b639ee7d5f6 Samuel Just
01:21 PM Bug #2196 (Resolved): `rados bench` will write test objects with a constant oid, under-reporting ...
Samuel Just
12:53 PM rgw Bug #2208 (Resolved): rgw: radosgw-admin temp remove failure
The radosgw-admin temp remove on congress goes into infinite loop when trying to list the .intent-log pool. Yehuda Sadeh
11:07 AM Bug #2200 (Can't reproduce): mon: not accepting new connections
Yehuda's indicated that this might be tied in to networking issues that were ongoing at the time. Given the symptoms ... Greg Farnum
11:04 AM Bug #2199 (Fix Under Review): mon: get_bl osdmap_full/9583 No such file or directory
I believe this is fixed in misc-fixes-for-review commit:e08b489d094efe384c3db639af0be765665bee23. Sage needs to revie... Greg Farnum

03/22/2012

11:09 PM Bug #2200: mon: not accepting new connections
Okay, that appears to not be it (the connections established and terminated match for clients and are only off by 9 o... Greg Farnum
10:09 PM Bug #2200: mon: not accepting new connections
There's not a lot I can do to diagnose this with just logs; the Monitors don't refuse connections like that on their ... Greg Farnum
09:42 AM Bug #2200 (Can't reproduce): mon: not accepting new connections
Following a networking downtime and monitors restart (as described in #2199), and following a recovery process, all a... Yehuda Sadeh
10:00 PM Bug #2199 (In Progress): mon: get_bl osdmap_full/9583 No such file or directory
Looks like the problem is that the Monitor got elected leader, and while it collected all the state it didn't write i... Greg Farnum
10:00 AM Bug #2199: mon: get_bl osdmap_full/9583 No such file or directory
My guess/hope is that this is one of the issues solved by the monitor slurp and other fixes since 0.41, but I haven't... Greg Farnum
09:41 PM Revision 21a170e8 (ceph): doc: dev/peering.rst edits from Greg
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:12 PM Bug #2207 (Resolved): osd: crash when op length is greater than op input data
This could happen due to a malicious or buggy client. I caused this with an accidentally empty request, with positive... Josh Durgin
05:10 PM CephFS Documentation #2206 (Resolved): Need a control command to gracefully shutdown an active MDS prior...
There is currently no way to gracefully shutdown an active MDS and allow a standby to activate or to transfer the act... Matthew Roy
04:53 PM Bug #2205 (Won't Fix): mkcephfs throws "No such file or directory" errors when the pwd the script...
When executing mkcephfs on a new cluster the script throws the message "bash: line 0: cd: /home/matthew/forCeph: No s... Matthew Roy
03:33 PM Revision 8fa904a6 (ceph): doc: update dev/peering document
- fix discussion of last epoch started
- define terms for current and past intervals
- describe role of pg info
- rem...
Sage Weil
02:55 PM Revision de867632 (ceph): msgr: fix tcp.cc linkage
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
02:53 PM Revision fd9935b7 (ceph): cephtool: don't prefix log items
This just makes it hard to read them.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:46 PM Subtask #2201: Document old design
+1; I have no idea what this bug is for Dan Mick
11:39 AM Subtask #2201: Document old design
Old design of...what?
(I see now that it's connected to the omap stuff, but if you could include a little more con...
Greg Farnum
11:34 AM Subtask #2201 (In Progress): Document old design
Samuel Just
11:33 AM Subtask #2201 (In Progress): Document old design
Samuel Just
01:08 PM Bug #2196 (In Progress): `rados bench` will write test objects with a constant oid, under-reporti...
Samuel Just
11:34 AM Subtask #2204 (Rejected): implement upgrade from old design to new design
Samuel Just
11:33 AM Subtask #2203 (In Progress): implement new design
Samuel Just
11:33 AM Subtask #2202 (Rejected): Document new design
Samuel Just
11:33 AM Feature #2149 (In Progress): osd: use omap for snap collections
Samuel Just
11:17 AM Feature #2198: add an option to force a down osd to be marked immediately out
Hmm, yeah, I forgot about that.
Somebody was asking about it; I'm not sure if they cared exactly but I'm sure there ...
Greg Farnum
11:08 AM Feature #2198: add an option to force a down osd to be marked immediately out
Not really, a write will still go to N-1 replicas until the new one is backfilled up through the object's position.
...
Sage Weil
11:00 AM Feature #2198: add an option to force a down osd to be marked immediately out
It guarantees that you always have the set number of copies on-disk when you get a commit, instead of probably having... Greg Farnum
10:47 AM Feature #2198: add an option to force a down osd to be marked immediately out
What's the motivation for doing that? Is it any better than setting the out interval to be something very short? Sage Weil
09:14 AM Bug #2116: Repeated messages of "heartbeat_check: no heartbeat from"
see new wip-osd-hb branch
Sage Weil

03/21/2012

11:41 PM Revision 2e21adf2 (ceph): Objecter: resend linger_ops on any change
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
11:35 PM Revision b47454b6 (ceph): ObjectStore: add COLLECTION_MOVE to dump
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
11:35 PM Revision 23313ee6 (ceph): FileStore: whitelist COLLECTION_MOVE on replay
Signed-off-by: Samuel Just <rexludorum@gmail.com> Samuel Just
11:35 PM Revision ec52eeb2 (ceph): FileStore: remove src on EEXIST during collection_move replay
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
11:35 PM Revision 52aff487 (ceph): ObjectStore: Add collection_move to generate_instances
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:01 PM Revision 3caa4319 (ceph): ceph: define and use a shell_scripts Makefile variable
Define a variable "shell_scripts" in the Makefile.in, and use it
along with some pattern rules to avoid some duplicat...
Alex Elder
09:01 PM Revision 1b2a0669 (ceph): ceph-kdump-copy: add tools for saving kdumps
This puts in place an init script and a command it runs to save a
kernel core dump to a remote server when a panic or...
Alex Elder
08:41 PM Bug #2199: mon: get_bl osdmap_full/9583 No such file or directory
kept logs for the failing monitor under /var/log/ceph/2199 Yehuda Sadeh
08:26 PM Bug #2199 (Resolved): mon: get_bl osdmap_full/9583 No such file or directory
Happened on congress (afair, off 0.41). One monitor is out for more than a month. Following network outage, both moni... Yehuda Sadeh
07:00 PM Revision 6f0f250b (ceph): suite: add missing print statement
Josh Durgin
06:58 PM Revision 8a9a5670 (ceph): suite: fix print statement when summary doesn't exist
Josh Durgin
04:59 PM Feature #2198 (New): add an option to force a down osd to be marked immediately out
Samuel Just
02:25 PM rgw Bug #2197 (Resolved): rgw: need to throttle incoming requests
In case we can't handle requests, we'd end up accepting requests indefinitely thus we consume fds endlessly. This wil... Yehuda Sadeh
01:30 PM Revision d0e8f148 (ceph): doc: update list of debian dists
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:28 PM Revision a608a8fe (ceph): Merge branch 'stable'
Sage Weil
12:52 PM Bug #2196 (Resolved): `rados bench` will write test objects with a constant oid, under-reporting ...
(As discussed on @#ceph@, 2012/03/21 -- with thanks to @joshd@)
The command @rados bench@ generates a sequence of ...
David McBride
08:21 AM Bug #2178: rbd: corruption of first block
The next object is whatever the MBR points to. You can find the object name from the sector offset that gdisk gives y... Josh Durgin
02:55 AM Bug #2178: rbd: corruption of first block
Hi Josh,
thanks for taking the time to investigate this... And yes, many others show the same behaviour. Is "the n...
Oliver Francke
06:50 AM Feature #2127: Save kernel core dumps on all of our test machines
I seem to remember seeing a reference to 'mkcrashrd', a mkinitrd type script that generates the initrd image the cras... Sage Weil
02:00 AM Revision 91c08f6e (ceph): Add watch op to rados.py
Signed-off-by: Samuel Just <sam.just@dreamhost.com> Samuel Just
12:51 AM Revision 72361784 (ceph): Objecter: resend linger_ops on any change
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
12:51 AM Revision 3019d460 (ceph): TestRados: Add watch
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Josh Durgin <josh.durgin@dreamhost.com>
Samuel Just
12:20 AM Revision 2998368a (ceph): rgw: remove unused definition
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> Yehuda Sadeh
12:20 AM Revision 4760536f (ceph): rgw: keep pool placement info also in cacheable location
Mirror the pools placement info, so that we can cache it.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
12:00 AM Revision f1563a66 (ceph): Revert "Objecter: add op->resend_on_any_change"
This reverts commit c53194d75390dd6d5aa4a9a33f741cbd106e3338.
recalc_linger_op_target is used for linger_ops
Signed...
Samuel Just

03/20/2012

11:11 PM Revision 2daff0e9 (ceph): ReplicatedPG: osd_max_notify_timeout -> osd_default_notify_timeout
This setting should not override user specified timeout.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
11:11 PM Revision c53194d7 (ceph): Objecter: add op->resend_on_any_change
lingers must be resent even if the primary does not change.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
11:11 PM Revision fc7a1bda (ceph): ReplicatedPG: return -EBUSY on delete for objects with watchers
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
10:15 PM Revision 6a5cbec3 (ceph): rgw: replace bucket_id generation
bucket_id is now string: <global instance id>.<num> where
num is increasing monotonically within the current rgw
inst...
Yehuda Sadeh
09:07 PM Feature #2127 (In Progress): Save kernel core dumps on all of our test machines
I finally have crash dumps getting packaged and sent over to a
remote machine reliably. The problem is that it does...
Alex Elder
06:59 PM Bug #2178: rbd: corruption of first block
I looked at the block you attached, and compared it to the first 4MiB of my desktop's hard drive. It looks like it co... Josh Durgin
03:58 AM Bug #2178: rbd: corruption of first block
Hi *,
any update on this topic? Cause we are working for hours and days with three people to rescue as many images...
Oliver Francke
06:41 PM Revision cdd5298d (ceph): v0.44
Sage Weil
05:59 PM Revision e42fbb70 (ceph): rgw: process default alt args before processing conf file
this fixes #2189
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:52 PM Revision e0b8f7a0 (ceph): rgw: process default alt args before processing conf file
this fixes #2189
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:37 PM Revision 51a07339 (ceph): rgw: incrase socket backlog
20 is too small
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:25 PM Revision 5b331987 (ceph): rgw: fix internal cache api
This fixes issue #2190
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Yehuda Sadeh
05:14 PM rgw Bug #2193 (Resolved): rgw: .pools.avail is not cached
Fixed, commit:4760536fe573c702bac8fb1d51213d76059e32dc.
We now mirror the info in the object. Still keeping the om...
Yehuda Sadeh
09:28 AM rgw Bug #2193 (Resolved): rgw: .pools.avail is not cached
Probably due to recent omap changes, we don't cache omap operations. Either we cache it, or just keep available pools... Yehuda Sadeh
03:10 PM rgw Feature #2194 (Resolved): rgw: replace bucket-marker-ver with better, fast, more scalable solution
Fixed, commit:6a5cbec38b761d524e699e2a7410a340d093ccca. Yehuda Sadeh
09:32 AM rgw Feature #2194 (Resolved): rgw: replace bucket-marker-ver with better, fast, more scalable solution
We use this object in order to create unique prefix for bucket objects (we do it at bucket creation). Instead of this... Yehuda Sadeh
02:50 PM Revision 815fc3e2 (ceph): suite: failed runs might not have durations
This was one cause of emails not being sent - stale /tmp/cephtest dirs
fail without recording a duration.
Josh Durgin
10:47 AM rgw Bug #2189 (Resolved): rgw: can't change debug level through ceph.conf
Fixed, commit:e0b8f7a0331b0ceee54a911bb9231cb168eb2d0f. Yehuda Sadeh
10:28 AM rgw Bug #2190 (Resolved): rgw: cache disabled
Fixed, commit:5b3319870ea9d6c715c671e006e3a772008e3e78. Yehuda Sadeh
09:43 AM CephFS Feature #2195 (Resolved): Allow removal of last MDS if there's no filesystem
Right now you can't remove the last MDS from your cluster, which means that if you aren't using it and it's off you w... Greg Farnum
05:43 AM Bug #2192 (Won't Fix): ceph-mon hangs consuming 100% CPU
I have a test setup of two nodes each running 0.43 mds, mon and osd. I mount ceph kernel filesystem at /srv/ceph on b... Vladimir Kulev
 

Also available in: Atom