Activity
From 03/25/2012 to 04/23/2012
04/23/2012
- 09:48 PM Revision 198544ad (ceph): mds: remove preferred from ceph_file_layout
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision b3cdc21a (ceph): cephfs: remove preferred osd setting
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision f01b6dd5 (ceph): test_ioctls: remove preferred osd
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision a5143444 (ceph): mon: ignore/remove localized pgs
- This will trigger on the next OSDMap update.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 09:48 PM Revision 771fd05b (ceph): mon: set auid for mon-created pools to 0
- Signed-off-by: Sage Weil <sage@newdream.net>
- 09:48 PM Revision 637de4d7 (ceph): mon: load CompatSet features on startup
- Signed-off-by: Sage Weil <sage@newdream.net>
- 09:48 PM Revision 19ba3475 (ceph): osdmaptool: fix clitests for lack of localized pgs
- Signed-off-by: Sage Weil <sage@newdream.net>
- 09:48 PM Revision 94adf5d9 (ceph): osd: remove localized pgs from pg_pool_t
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision 0138a764 (ceph): osd: ignore localized pgs
- - do not load them on startup
- ignore any we hear about over the wire
Signed-off-by: Sage Weil <sage.weil@dreamhost... - 09:48 PM Revision b8f4acfb (ceph): osd: remove preferred from object_locator_t
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision f164b877 (ceph): osdmap: do not forcefeed preferred osd to crush
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision 6d1344c3 (ceph): libcephfs: disable ceph_set_default_preferred_pg
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:48 PM Revision 21ef9795 (ceph): client: rip out preferred_pg thing
- This wasn't even named properly. Blech.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 08:58 PM Revision 07776136 (ceph): Merge remote-tracking branch 'gh/wip-discard'
- 06:23 PM Bug #2338: mon: adding new monitors simultaneously can allow a new mon to become leader
- Looks like the original monitor doesn't believe in the existence of the new monitors; I'll need to check out why.
<p... - 05:46 PM Bug #2338: mon: adding new monitors simultaneously can allow a new mon to become leader
- ...
- 05:42 PM Bug #2338 (Rejected): mon: adding new monitors simultaneously can allow a new mon to become leader
- When you add two new monitors (out of 3 total) to a cluster you can end up with one of the new monitors being the lea...
- 05:26 PM Bug #2286: mon: different full/near_full values on different monitors
- Hi Guys,
After upgrading the patched-kernel btrfs test cluster from 0.45-1 to 0.45-281-g0777613, the full_ratio an... - 04:38 PM Feature #2337 (Resolved): rgw and rados performance numbers
- 04:30 PM Revision 43d1a920 (ceph): run_seed_to.sh: remove stray arg
- This crept in in commit d1740bd586db80068fc0292223cf21911de66428.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 04:21 PM Revision a11b69fd (ceph): nuke: ignore ntpdate errors
- We keep seeing a race between ntpd startup and our stop + ntpdate + start
sequence. Ignore errors here. - 04:16 PM Feature #2336 (Resolved): qemu: wire up discard
- 04:10 PM Feature #2335 (Resolved): librbd: write-thru cache mode
- 04:02 PM rgw Documentation #1813 (Fix Under Review): doc: document radosgw api diffs with s3
- 03:51 PM Feature #2334 (Resolved): mon: set max mark-out or mark-down
- 02:34 PM Feature #1044 (Resolved): librbd: discard support
- 02:34 PM Feature #2296 (Resolved): librbd: allow resizing to arbitrary sizes
- 02:29 PM Feature #1451 (Resolved): librbd: instrument via perfcounter
- 02:22 PM Feature #1888 (Rejected): log: per-thread ring buffer
- 02:20 PM Subtask #2333 (Resolved): create queueing for peering messages
- Currently, the osd dispatch calls directly into the PG peering state machine. Instead, we need to queue the events g...
- 02:10 PM Subtask #2332 (Resolved): move pg queueing into pgs
- Currently, the osd reaches into the pg to manipulate the pg queue during message receipt and during handle_osd_map. ...
- 02:07 PM Cleanup #2041 (In Progress): osd: move peering into worker threads
- 12:19 PM Revision 0112e749 (ceph): run_seed_to.sh: rework the script, make it more flexible and broaden th...
- Allow for '-h' and other options such as disabling the journal sync tests,
defining it is to be run on a btrfs FS, en... - 11:31 AM Cleanup #2331 (Resolved): Makefile.am:182: `lib/libgtest.a' is not a standard libtool library name
- Warning is still happening, despite git clean -fdx, git submodule freshening of various sorts, etc.
This should prob... - 11:27 AM Bug #2276 (Rejected): osd: eat cpu on restart
- it's up now.. i think i just didn't wait long enough.
- 11:26 AM Bug #2266 (Resolved): teuthology: nuke after failure is failing
- ignore errors caused by ntpdate vs ntpd race
- 11:25 AM Bug #2322 (Need More Info): osd/ReplicatedPG.cc: 3832: FAILED assert(!object_contexts.size())
- also going to wait until the threading refactor is complete before diving into this further.
at this point asserti... - 11:24 AM Feature #2330 (Resolved): dump open files, sockets when we run out of fds
- 11:23 AM Bug #2310 (Resolved): osd: too many open files
- this is just sockets and hitting the flusher limit. we're both increasing 'max open files' and switching to vm limit...
- 11:04 AM Bug #2329 (Resolved): fix detection of C++11 atomic header
- the C++11 atomic header is now <atomic> (I've checked gcc 4.6 and 4.7) and not <cstdatomic>
- 10:23 AM rgw Bug #1681 (Resolved): rgw: user rm with --purge doesn't remove data
- 09:56 AM Feature #2251 (Resolved): rgw long run workloads
04/22/2012
- 09:23 PM Revision e9ecd1b3 (ceph): perfcounters: tolerate multiple loggers with the same name
- Make them unique by appending -<ptr>, so that the json we dump will remain
valid.
We may also want to allow people t... - 04:01 AM Revision be438d6e (ceph): Merge branch 'master' into wip-discard
- 03:28 AM Revision 97f507ff (ceph): Makefile: disable format-security warning
- The prt() varargs function generates this warning
test/rbd/fsx.c: In function ‘prt’:
warning: test/rbd/fsx.c:203:2: ...
04/21/2012
- 09:31 PM Revision c8377e46 (ceph): filestore: verify that fiemap works
- Check for a bug present in older versions of ext4. If present, disable
FIEMAP. See #2328.
Signed-off-by: Sage Weil... - 08:36 PM Revision 6cf87673 (ceph): filestore_idempotent: url has changed
- 08:32 PM Revision 7471a9b1 (ceph): rados: fix error printout for mapext
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 07:50 PM Revision 07ddff42 (ceph): librbd: instrument with perfcounters
- Track IO operations on a per-image basis.
Implements: #1451
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 03:16 PM Feature #1451 (Fix Under Review): librbd: instrument via perfcounter
- see commit:07ddff427145e109eb820b6ed0ddb6cca74b65b6
- 03:15 PM Bug #2328 (Resolved): osd: mapext/fiemap doesn't work for small extents
- see commit:c8377e466caace018eea06c1739265111ce72c48 for a kludge that detects the bug and disabled fiemap.
- 02:35 PM Bug #2328: osd: mapext/fiemap doesn't work for small extents
- this works on a newer kernel (3.2.0-2-amd64).
should we check kernel versions in filestore and magically disable f... - 02:20 PM Bug #2328 (Resolved): osd: mapext/fiemap doesn't work for small extents
- If you query the mapping for an extent that inside a larger allocated extent, the fiemap ioctl won't tell you:
<pr... - 11:15 AM CephFS Bug #2218: CephFS "mismatch between child accounted_rstats and my rstats!"
- Logs from a clean cluster at http://matthew.royhousehold.net/cephLogs/cephLogs.mds.tar (4382MB md5 aaf9364c7e35bc6b5d...
- 12:13 AM Revision fb9fdf45 (ceph): librbd: fix ictx_check pointer weirdness by using std::string
- I was seeing failures of LibRBD.TestIOToSnapshot where we would fail to
refresh after rollback, even though the snap ... - 12:01 AM Revision 888a082f (ceph): FileJournal: don't wait flusher until completions are queued
- Fixes: #2324
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
04/20/2012
- 11:56 PM Revision d1740bd5 (ceph): filestore: fix collection_add journal replay problem
- In collection_add we have a two-phase guard set on the linked object via
the old name. During replay, we might see t... - 11:56 PM Revision 92b299af (ceph): FileStoreDiff: flip sense of diff*() methods around
- true means diff, false means same.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 11:55 PM Revision ca369c98 (ceph): test_idempotent_sequence: Use FileStoreDiff class instead.
- Use FileStoreDiff instead of having the diff code embedded in the test,
allowing for more tests and people to use the... - 11:51 PM Revision 5466ebc7 (ceph): test_idempotent_sequence: Output missing options on "usage".
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 11:51 PM Revision 47461291 (ceph): FileStoreDiff: check if two FileStores match.
- This code should be on a stand-alone class, instead of being embedded on
a single test, in case someone or something ... - 11:51 PM Revision b94d6a6c (ceph): librbd: fix zeroing of trailing bits on short reads that span objects
- handle_sparse_read() was taking buf_ofs and buf_len, but buf_len was being
interpreted as the total size of the buffe... - 11:51 PM Revision 81af4e86 (ceph): librbd: make discard invalidate the range in cache
- Fed this to test_librbd_fsx and it was happy.
Signed-off-by: Sage Weil <sage@newdream.net> - 11:51 PM Revision 5b603437 (ceph): librbd: fix debug output
- objects is misleading here, these are byte offsets
Signed-off-by: Sage Weil <sage@newdream.net> - 11:51 PM Revision 72323965 (ceph): objectcacher: implement Object::discard()
- Discard a range of bytes from an object.
Signed-off-by: Sage Weil <sage@newdream.net> - 11:51 PM Revision 165038d5 (ceph): objectcacher: rename truncate_set -> discard_set, and use discard
- Do not assume the object extents are at the trailing edge of objects.
Instead, discard arbitrary extents. Fix caller... - 11:51 PM Revision 4ddbbf54 (ceph): librbd: allow image resize to non-block boundaries
- The caller is still invalidating the entire cache, so we don't need to
deal with discard at this level. That might b... - 11:51 PM Revision e861e20a (ceph): revert to xfstests' fsx, which has discard support
- 11:51 PM Revision dd78c3df (ceph): test_librbd_fsx: port newer xfsprogs version
- Builds and runs... for a few ops at least.
Signed-off-by: Sage Weil <sage@newdream.net> - 11:51 PM Revision 379c5e72 (ceph): librbd: fix debug output for image resize
- Print old -> new, not new -> old.
Signed-off-by: Sage Weil <sage@newdream.net> - 11:50 PM Revision a8aaa93b (ceph): admin_socket: add a separate debug subsys/level
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 11:36 PM Revision f7699cc9 (ceph): log: prefix dump with line numbers
- This makes it easier to interpret the dump, and makes it obvious what is
dump (and potentially a dup of something tha... - 10:35 PM Revision a68b3069 (ceph): test_filestore_workloadgen: track pg log size in memory
- Avoid a stat!
Signed-off-by: Sage Weil <sage@newdream.net> - 10:30 PM Revision 6b28f55c (ceph): test_filestore_workloadgen: whitespace
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:25 PM Revision 2834f601 (ceph): organize test_filestore_idempotent*
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:23 PM Revision 8bb58bc2 (ceph): remove hopelessly flawed test_filestore_idempotent
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:22 PM Revision 94ebe8d9 (ceph): test_store -> test_filestore
- and organize source
Signed-off-by: Sage Weil <sage@newdream.net> - 10:20 PM Revision 77c645a1 (ceph): test_wrkldgen -> test_filestore_workloadgen
- Also put the code in src/test/filestore/.
Signed-off-by: Sage Weil <sage@newdream.net> - 10:13 PM Revision 27efa055 (ceph): msgr: move locking out of submit_message to its callers
- Since they're doing stuff like looking through rank_pipe, they need
to be holding the lock too.
Signed-off-by: Greg ... - 10:12 PM Revision 5a8019ec (ceph): msgr: remove useless block addition.
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 09:18 PM Feature #2327 (Resolved): mon: use external keyring for inter-mon auth
- Currently the mon. key is part of the internal mon auth database. It can't be modified without a running mon cluster...
- 09:17 PM rbd Feature #2326 (Resolved): krbd: use new class interfaces, new image format
- Update rbd_types.h to match the userspace version, and add support for opening new-format images while keeping suppor...
- 09:04 PM Feature #2325 (Resolved): setup new email/etc
- 08:32 PM Linux kernel client Tasks #2138 (Resolved): rbd: run xfstests on a local XFS filesystem over RBD
- I have two files that implement automated testing using
xfstests over rbd devices.
One is now in the ceph git tre... - 08:28 PM Linux kernel client Bug #2287: rbd: crashes with 10Gbit network and fio
- A kernel dump would likely help, but there's no guarantee because
of the delayed execution of the operation. It wou... - 07:38 PM Linux kernel client Bug #2287: rbd: crashes with 10Gbit network and fio
- Would a kernel core dump help here?
- 07:37 PM Linux kernel client Bug #2287: rbd: crashes with 10Gbit network and fio
- This is one of a family of bugs we've been trying to understand.
Here is another one:
http://tracker.newdream.n... - 08:28 PM Revision f00dab6f (ceph): ceph-authtool: rename cli test dir
- Signed-off-by: Sage Weil <sage@newdream.net>
- 08:28 PM Revision 613d8179 (ceph): ceph-conf: rename cli test dir
- Signed-off-by: Sage Weil <sage@newdream.net>
- 08:22 PM Revision 0616c5b2 (ceph): Merge branch 'stable'
- Conflicts:
src/osd/OpRequest.cc - 08:21 PM Revision ac874a62 (ceph): OpRequest: mark_event output is now at debug 5
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 08:00 PM Linux kernel client Bug #2243: btrfs: warning in orphan_commit_root
- I mentioned this somewhat informally to Chris Mason last week. I
provided him the message, and he said:
Well, ... - 07:39 PM Feature #2251 (In Progress): rgw long run workloads
- 07:37 PM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- Another report, likely related:
http://tracker.newdream.net/issues/2287
I don't understand it well enough yet... - 07:37 PM rgw Documentation #1813 (In Progress): doc: document radosgw api diffs with s3
- 07:35 PM Subtask #825 (In Progress): osd: remove pg map updating from handle_osd_map
- 07:35 PM Feature #2314 (Fix Under Review): remove localized pgs
- wip-lpg
Did a basic test of a cluster with localized pgs and upgraded to this, no problems. A reasonably thorough... - 07:12 PM Linux kernel client Bug #2298 (Resolved): rbd: broken encode_op for big-endian hosts?
- This has been fixed. I have been testing it in a private branch
and will shortly be updating the ceph-client testin... - 07:10 PM Linux kernel client Bug #2242 (Resolved): rbd: spinlock on wrong cpu
- This was fixed a couple of weeks ago, and the result has been committed
both to the testing and master branches of t... - 06:32 PM Revision c9d21856 (ceph): hammer.sh: -a to archive each run
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:00 PM Bug #2324 (Resolved): osd: assert("q.empty()") failed in OpSequencer destructor
- sam fixed this in commit:888a082f23974b1f7a63f302e29a326182e7dc41
- 03:56 PM Bug #2324 (Resolved): osd: assert("q.empty()") failed in OpSequencer destructor
- This is consistently reproducible with 2 osds started by vstart on vit, but only happens intermittently with 1 osd.
... - 04:32 PM Feature #2245 (Resolved): rgw long run ceph install
- 03:09 PM Bug #2079 (Duplicate): rbd: creating a snapshot with the same name doesn't return an error
- i think this was caused by the rados class return values. in any case, it works correctly now.
- 03:07 PM Bug #2084 (Can't reproduce): segfault in tcmalloc
- 12:16 PM Feature #1618 (Resolved): libvirt: make sure migration works
- we demoed this
- 12:05 PM Feature #2323 (Resolved): osd: limit 'old request' messages generated
- If there are hundreds of old requests queued, let's say that, instead of generated gigabytes of logs.
Maybe a simp... - 10:44 AM Bug #2322 (Resolved): osd/ReplicatedPG.cc: 3832: FAILED assert(!object_contexts.size())
- ...
- 05:24 AM Revision f2d4574b (ceph): config: dump subsys log levels with --show-config
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:29 AM Revision 607a8a3d (ceph): osd: do not create localized pgs
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
04/19/2012
- 11:43 PM Revision 0df33e43 (ceph): Merge branch 'wip-2313'
- 11:25 PM Revision 8335eb53 (ceph): rgw: can't remove user that owns data
- This is a workaround for #1681. We don't allow the removal of
user that still has data.
Signed-off-by: Yehuda Sadeh ... - 11:11 PM Revision 9df7f6f6 (ceph): msgr: merge the send_message and lazy_send_message implementations.
- They only varied by the presence of "lazy" (in output) and a flag.
Signed-off-by: Greg Farnum <gregory.farnum@dreamh... - 10:58 PM Revision a394fb2f (ceph): msgr: do not establish a new Pipe for any server
- Servers do not establish Connections, and if there's a Connection
without a Pipe it's because the Pipe was deliberate... - 10:44 PM Revision 42a49cf1 (ceph): msgr: Create a proper Connection* lazy_send_message()
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 10:44 PM Revision 526cd9d0 (ceph): msgr: merge submit_message() functions
- Our consistency guarantees on address- and Connection-based send_message()
calls have diverged since they have differ... - 10:22 PM Revision 56fc92c0 (ceph): FileStore: don't read reference to writeq head after pop_write
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 09:49 PM Bug #2291 (Can't reproduce): objectcacher perfcounters don't work with test_librbd_fsx
- This works fine for me. Maybe it was fixed when the objectcacher naming thing was changed?...
- 09:17 PM Messengers Cleanup #2150: repair the Simple/Messenger interface
- Looks good to me, provided it makes it through the regression suite without problems!
- 07:11 PM Messengers Cleanup #2150: repair the Simple/Messenger interface
- wip-msgr-interface
- 09:08 PM Feature #2321 (Resolved): osd: investigate memory consumption from peering backlog
- 09:07 PM Feature #2320 (Duplicate): mon: detect and throttle osd flapping
- 09:07 PM Feature #2319 (Resolved): mon: block osd mark-down
- 09:07 PM Feature #2318 (Resolved): mon: block osd boot
- 09:06 PM Feature #2317 (Resolved): mon: pause/unpause auto-mark-out
- 08:33 PM Revision ff0fe372 (ceph): add rbd_fsx_[no]cache jobs to regression suite
- 08:32 PM Revision e3af0877 (ceph): rbd_fsx: show progress
- The updated fsx takes this arg.
Signed-off-by: Sage Weil <sage@newdream.net> - 08:21 PM Revision 758fa9ba (ceph): rgw: bucket HEAD operation should return 200, not 204
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 07:47 PM Revision 9cb796a0 (ceph): obsync: section 8 -> 1
- Fixes: #2263
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 07:44 PM Revision 6a58314d (ceph): fix misc checks that wait for N osds to be up
- These all cut&pasted broken code, blah!
- 06:16 PM Revision fcc44812 (ceph): msgr: minimally document the SimpleMessenger inner classes
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 06:16 PM Revision db6d467b (ceph): msgr: fix some spacing issues in Pipe
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 06:16 PM Revision c470e1a0 (ceph): msgr: start moving functions around in Pipe
- 06:16 PM Revision 5c21f7ea (ceph): msgr: remove the Pipe::disposable flag.
- It has no function as it stands. It was intended for new connections
to avoid being marked lossy, but that can't happ... - 06:16 PM Revision 146e1a46 (ceph): msgr: make mark_down_on_empty() behave more consistently.
- Previously you could get ms_handle_reset() notifications if you
used mark_down_on_empty(), but only if the Pipe's que... - 06:16 PM Revision 501553e5 (ceph): msgr: group private data members by type
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 06:16 PM Revision d3e40e00 (ceph): msgr: reorder the SimpleMessenger data members a little bit
- They make moderately more sense to me in this order.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 04:37 PM rgw Feature #2313 (Resolved): rgw: expose extra bucket info trough S3 api
- Already pushed a test to the s3-tests functional.
- 04:33 PM rgw Feature #2313 (In Progress): rgw: expose extra bucket info trough S3 api
- wip-2313 looks sane.. let's add a test and merge for 0.46.
- 04:25 PM rgw Bug #1681 (In Progress): rgw: user rm with --purge doesn't remove data
- merged the radosgw-admin change to make it fail. let's add a test for it and then close this bug.
- 04:23 PM Revision 09100b6b (ceph): rgw: HEAD bucket, don't dump rounded size
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 04:19 PM Revision 9c2251f6 (ceph): osd: fix vector array out of bounds
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:17 PM Feature #2296 (Fix Under Review): librbd: allow resizing to arbitrary sizes
- see wip-discard
- 02:13 PM Feature #2296: librbd: allow resizing to arbitrary sizes
- more importantly, we need to either error out with EINVAL if it's not a block multiple, or do it... currently we sile...
- 02:37 PM Bug #2307 (Resolved): OSD & Monitor disagree on the contents of pg_temp
- Just changing the pg_num and pgp_num did fix it up, so with the osdmap workaround we should be all good now.
- 02:10 PM Feature #1044 (Fix Under Review): librbd: discard support
- 12:47 PM Bug #2263 (Resolved): obsync: move man page to section 1
- 12:45 PM Bug #2311 (Need More Info): rbd: delete + create image led to EEXIST
- 07:20 AM Bug #2311 (In Progress): rbd: delete + create image led to EEXIST
- Can you generate a log? Ideally 'debug ms = 1'?
Also, attach the output of 'ceph --show-config'?
Thanks! - 02:43 AM Bug #2311: rbd: delete + create image led to EEXIST
- Hi Sage,
uhm, not solved yet as per ceph version 0.45-207-g3053e47 (commit:3053e4773bae93cfa3158882aa4963803862f9b... - 12:44 PM Bug #2262 (Resolved): qa: osd-recovery tasks fails on flush_pg_stats
- fixed by teuthology commit:6a58314d4627d106c5fd6df186e191c19a01f64b
- 10:47 AM Bug #2192: ceph-mon hangs consuming 100% CPU
- It was some 3.0.0 Ubuntu kernel, backed by btrfs.
- 10:06 AM Bug #2192: ceph-mon hangs consuming 100% CPU
- I missed this when it came in, and I don't know where the 100% CPU usage is coming from, but the hung filesystem soun...
- 04:58 AM Revision e82c3309 (ceph): ceph-fuse: only call common_init_finish in child; fix admin socket
- If we do it in the parent and child, we break the admin socket (among
other things).
Fixes: #2293
Signed-off-by: Sag... - 04:27 AM Revision f1de0d6c (ceph): qa: drop rgw.py from test_python.sh workunit
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:27 AM Revision d3a57a21 (ceph): ceph.spec.in: drop rgw.py
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:26 AM Revision 0995d084 (ceph): Makefile: drop rgw.py
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:19 AM Bug #2316 (Resolved): rbd: restart of OSD leeds to stale qem-VM's with "ceph version 0.45-207-g30...
- Hi,
in my current test-setup all four VM's are started with rbd_cache parm. After all VM's are started and began t... - 12:14 AM Revision 4390c08a (ceph): mon: fix full/nearfull ratio initialization
- create_pending() is called before create_initial(), so just put these in
the first incremental.
Signed-off-by: Sage ...
04/18/2012
- 11:37 PM Revision 7fbc0fcc (ceph): msgr: add error code returns to send_keepalive.
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 11:27 PM Revision cb43eeee (ceph): msgr: add some validity checks to ~SimpleMessenger()
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 10:54 PM Revision 0c2c18c1 (ceph): msgr: convert global_seq_lock to a spinlock.
- Because nobody wants to get put to sleep waiting on a unique integer.
(I would just make this an atomic_t but we need... - 10:51 PM Bug #2263: obsync: move man page to section 1
- 10:45 PM Revision dc4c759c (ceph): msgr: convert get_policy() to return a const reference.
- The only user who didn't want a const should have been using
direct access anyway (and now is doing so).
Signed-off-... - 10:45 PM Revision aab516da (ceph): rgw: allow subuser creation on radosgw-admin user create
- This fixes issue #2312: allow the creation of both user
and a subuser in a single radosgw-admin command.
Signed-off-... - 10:44 PM Revision c9adf07b (ceph): msgr: move SimpleMessenger function declarations into good order
- We now have one declaration each of private and protected, and two
declarations of public -- one for the Messenger-re... - 10:42 PM CephFS Bug #2293 (Resolved): admin sockets don't persist with ceph-fuse
- commit:e82c33099a0efda027bc7fa991dcd2073baea539
- 10:23 PM Revision dbe779a5 (ceph): osd: dump old ops singly rather than all at once.
- Fixes #2269. Convert the OpTracker::check_ops_in_flight interface
to take a vector<string> and create a separate warn... - 10:23 PM Revision b0532aa6 (ceph): osdmap: restructure flow in OSDMap::decode
- Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> - 10:19 PM Revision 7ae1aefa (ceph): gather logs for cfuse dbench workload, hopefully catch #1737
- 10:13 PM Revision bc458bfd (ceph): ceph-object-corpus: prune archive
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:09 PM Revision ebb19e70 (ceph): test/encoding/import.sh: fix
- Signed-off-by: Sage Weil <sage@newdream.net>
- 09:53 PM Revision e1622e4f (ceph): rgw: HEAD bucket also dumps bucket meta info
- Shows total objects, bytes used, actual bytes used.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 09:53 PM Revision 0769efa5 (ceph): rgw: add x-rgw-* as a possible meta header prefix
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 09:46 PM rgw Bug #2027: rgw -> apache miscommunication
- Not completely unlikely. We can set it to "can't reproduce", and reopen if we see it again.
- 06:12 PM rgw Bug #2027: rgw -> apache miscommunication
- do we think this is fixed now by the rgw throttling?
- 09:36 PM Revision 2de37e01 (ceph): Merge remote-tracking branch 'gh/wip_journal'
- Reviewed-by: Sage Weil <sage@newdream.net>
- 09:10 PM Revision c3bfe456 (ceph): osdmap: remove bad cct reference
- Not in OSDMap
Signed-off-by: Sage Weil <sage@newdream.net> - 08:28 PM Revision 9ef953b5 (ceph): Merge branch 'wip-2286-ratio-a'
- Reviewed-by: Sage Weil <sage@newdream.net>
- 07:50 PM Revision eea982e5 (ceph): osdmap: workaround old bug that zeroed pool_max
- Signed-off-by: Sage Weil <sage@newdream.net>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> - 06:32 PM Revision ee22c97b (ceph): python-ceph: remove rgw bindings
- We shouldn't expose these internal apis.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 06:21 PM Revision 407b2e0b (ceph): whitelist xfs_fsr syslog noise
- Ignore lines like
2012-04-17T13:44:11-07:00 plana59 fsr[5454]: DEBUG: fsize=450560 blsz_dio=450560 d_min=512 d_max=2... - 06:10 PM Revision eb8a8fdf (ceph): pgmap: allow Incrementals to specify [near]full_ratios of 0
- This commit isn't entirely safe: old monitors used 0 to mean "no change".
We can revert this (and the PGMonitor.cc po... - 06:05 PM Revision 88525eaa (ceph): pgmon: add "set_full_ratio x" and "set_nearfull_ratio x" commands
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 06:05 PM Revision 882bd5b8 (ceph): pgmon: convert PGMonitor::prepare_command to our standard infrastructure.
- So that I can add another command.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 06:05 PM Revision f1152e67 (ceph): mon: only fill in full/nearfull sets if the ratio > 0
- This avoids putting all OSDs in both sets when the ratios are 0, as they
are with a fresh cluster and pgmap. This al... - 06:05 PM Revision 1242b754 (ceph): pgmon: remove the PGMonitor update_full_ratios stuff
- Making it a config watcher is just a huge mess in terms of consistently
updating it appropriately.
The next commit wi... - 06:05 PM Revision 841f2885 (ceph): mon: unconditionally encode PGMap full ratios in the Incremental
- This properly spreads the real value to peon monitors -- they weren't
seeing the right values at all before.
Initiali... - 06:00 PM Bug #2310: osd: too many open files
- failed to capture a full strace.. try it again (once we find a failing osd on congress) with
strace -e trace=open,... - 09:48 AM Bug #2310 (Resolved): osd: too many open files
- ...
- 04:39 PM Bug #2315 (Resolved): unrecognized admin socket command 'objecter_requests'
- From teuthology:/a/nightly_coverage_2012-04-18-a/1602/teuthology.log:...
- 03:57 PM rgw Bug #2312 (Resolved): rgw: create user and subuser in a single radosgw-admin command
- Fixed, commit:aab516da7f89310445be4e4fb61836084d2dac32.
- 02:01 PM rgw Bug #2312 (Resolved): rgw: create user and subuser in a single radosgw-admin command
- 03:41 PM Bug #2211 (Resolved): osd: entity_inst_t OSDMap::get_inst(int) const
- 03:41 PM Bug #2262 (In Progress): qa: osd-recovery tasks fails on flush_pg_stats
- 03:27 PM Feature #2314 (Resolved): remove localized pgs
- 02:47 PM rgw Feature #2313: rgw: expose extra bucket info trough S3 api
- Ok, let's just send those extra headers anyway. Otherwise we'd have some issue creating the request signature for the...
- 02:11 PM rgw Feature #2313 (Resolved): rgw: expose extra bucket info trough S3 api
- syntax:
HEAD /<bucket>
X-RGW-Params: extrainfo
extra response headers:
X-RGW-Object-Count: <object count>
X-RG... - 02:23 PM Feature #2252 (Resolved): rgw long run kernels
- 02:22 PM Feature #2250 (Resolved): rgw long run raid config
- 02:14 PM Feature #2265 (Rejected): make sure objecter/kclient error out when localized pgs don't exist
- 01:58 PM rgw Feature #2308 (Resolved): radosgw-admin: make user create idempotent
- done, commit:5a6bbd0c473e15aa7642da367e7936015d19d77a.
- 01:46 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- And I gave him a patched monitor so he could set pg_num, which should fix it. Waiting to hear back, and will apply th...
- 01:16 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- pushed workaround that will repair osdmaps that saw your corruption, commit:eea982e56739a7a91ca907ccc5c5ec1f78d9460d.
- 01:30 PM Bug #2311 (Resolved): rbd: delete + create image led to EEXIST
- this is 'rbd writeback window' at its best. long live 'rbd cache'!
- 01:06 PM Bug #2311: rbd: delete + create image led to EEXIST
- Congrats for closing the annoying ticket #2178 :-D
Fair enough, to have a new one on this issue, here my last note... - 12:46 PM Bug #2311: rbd: delete + create image led to EEXIST
- Is it possible there is some other user, or the logs are from the wrong cluster?
I see:
- client.13507 deletes 90... - 12:45 PM Bug #2311 (Resolved): rbd: delete + create image led to EEXIST
- Here is a sequence copy-n-pasted:
rbd rm data/905-testdisk.rbd
Removing image: 100% complete...done.
rbd create ... - 01:28 PM Bug #2286 (Resolved): mon: different full/near_full values on different monitors
- commit:9ef953b5e20c3d232cfe4aa90f26476a2a2f911b
- 11:18 AM Bug #2286 (Fix Under Review): mon: different full/near_full values on different monitors
- Check out wip-2286-ratio-a and see what you think. It fills in the ratios from g_conf on create_initial, only changes...
- 12:51 PM Bug #2178: rbd: corruption of first block
- Hi Sage,
sorry, was not clear enough. The logfiles provide informations for "907-testdisk.rbd..." not "906..."
Th... - 12:46 PM Bug #2178 (Resolved): rbd: corruption of first block
- moved this new issue to #2311, and resolving this bug. hooray!
- 12:45 PM Bug #2178: rbd: corruption of first block
- Oliver Francke wrote:
> Here is a sequence copy-n-pasted:
>
> rbd rm data/905-testdisk.rbd
> Removing image: 100... - 10:41 AM Bug #2178: rbd: corruption of first block
- Oliver Francke wrote:
> Hi Sage,
>
> here my notes, after almost 40 tests no bad things happened, only once a min... - 07:22 AM Bug #2178: rbd: corruption of first block
- second logfile here, sorry.
- 07:18 AM Bug #2178: rbd: corruption of first block
- Here is a sequence copy-n-pasted:
rbd rm data/905-testdisk.rbd
Removing image: 100% complete...done.
rbd create ... - 05:51 AM Bug #2178: rbd: corruption of first block
- Meanwhile continued to test...:
I noticed some negative degredation:
2012-04-18 14:43:37.282634 pg v128104: ... - 05:36 AM Bug #2178: rbd: corruption of first block
- Hi Sage,
here my notes, after almost 40 tests no bad things happened, only once a minor hickup, where the rbd-head... - 08:22 AM Linux kernel client Bug #2298 (In Progress): rbd: broken encode_op for big-endian hosts?
- I sent a note to the various lists Al Viro posted to, to confirm the
bug (wasn't sure whether Sage had or not).
I... - 05:58 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- Interesting... The warning showed up again despite test 232 being
removed from the list. Based on the time stamp o... - 04:38 AM Revision 5a6bbd0c (ceph): rgw-admin: user create is idempotent
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Reviewed-by: Sage Weil <sage@newdream.net> - 04:38 AM Revision 413d8be9 (ceph): osdmap: dump pool_max
- Signed-off-by: Sage Weil <sage@newdream.net>
04/17/2012
- 10:33 PM Revision 4627b67f (ceph): msgr: Document remaining SimpleMessenger functions and data members
- This doesn't handle the inner classes (Pipe, mostly) yet. It also
raises questions about whether these functions shou... - 10:01 PM Revision 1f6b7f81 (ceph): msgr: Group the SimpleMessenger functions following the Messenger's pat...
- Ordering everything nicely will make for a cleaner interface, make
it easier to see what documentation we need, and m... - 08:47 PM Bug #2286: mon: different full/near_full values on different monitors
- Greg Farnum wrote:
> Hmm. I looked at redoing this and got stuck on the semantics we want. If we're interested in fu... - 04:52 PM Bug #2286 (In Progress): mon: different full/near_full values on different monitors
- Hmm. I looked at redoing this and got stuck on the semantics we want. If we're interested in full_ratio == 0 being an...
- 11:00 AM Bug #2286: mon: different full/near_full values on different monitors
- yeah. actually, i think the check should go in tick() inside the is_leader() block, and not update_from_paxos().
- 10:54 AM Bug #2286: mon: different full/near_full values on different monitors
- Oh, I see...I wasn't following that need_*_ratio_update stuff properly. And update_full_ratios() will be called on th...
- 10:30 AM Bug #2286: mon: different full/near_full values on different monitors
- Greg Farnum wrote:
> I'm looking at your patch and it doesn't make a lot of sense to me.
> First off, when do you t... - 09:45 AM Bug #2286: mon: different full/near_full values on different monitors
- I'm looking at your patch and it doesn't make a lot of sense to me.
First off, when do you think that peon monitors ... - 08:43 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- Greg Farnum wrote:
> I'm confused how you're getting that pool_max printout — I don't see it at all when I run that ... - 06:57 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- I'm confused how you're getting that pool_max printout — I don't see it at all when I run that command with a ceph-de...
- 04:16 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- at some point the osdmap pool_max got set to -1.
nine:2307 04:15 PM $ ~/src/ceph/src/ceph-dencoder type OSDMap i... - 03:56 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- nine:2307 03:56 PM $ osdmaptool osdmap_full/5754 -p | grep ^pool
pool 0 'data' rep size 2 crush_ruleset 0 object_has... - 03:52 PM Bug #2307: OSD & Monitor disagree on the contents of pg_temp
- It looks to me liek the 'data' pool (0) was deleted, and then a new one (vmimages) was created. but somehow that was...
- 10:34 AM Bug #2307 (Resolved): OSD & Monitor disagree on the contents of pg_temp
- See: http://marc.info/?t=133352732900001&r=1&w=2
It seems that (for example) pg 0.138 is in pg_temp, but the OSD c... - 08:22 PM Revision 3053e477 (ceph): Merge branch 'wip-2289-2'
- 08:20 PM Revision dcd34fc8 (ceph): FileJournal: let peek_write return a reference
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 08:17 PM Revision 65614847 (ceph): common: add blkdev.{h,cc}
- Signed-off-by: Sage Weil <sage@newdream.net>
- 08:12 PM Revision 82d7a681 (ceph): rbd: fix probe of source when block device
- Fixes: #2304
Signed-off-by: Sage Weil <sage@newdream.net> - 08:11 PM Revision 41cb40ad (ceph): filestore: factor block device size probe out into helper
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:56 PM Revision dea85368 (ceph): FileJournal: clarify locking in header
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 07:53 PM Revision 377f75f9 (ceph): OpRequest: mark_event output is now at debug 5
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 06:39 PM Revision 98e2369e (ceph): librados: add methods to check and wait for safe/complete -and- callback
- Allow user to check for (safe|complete) -and- callback return, and wait
for that condition.
Fixes: #2301
Signed-off-... - 06:17 PM Revision 001aa462 (ceph): msgr: add a little detail to mark_down_on_empty() and mark_disposable()
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 05:53 PM Revision 2eb2aa5a (ceph): msgr: group functions more appropriately
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 05:53 PM Revision a422c479 (ceph): msgr: remove out-of-date comments and function
- I think that destroy() is left over from a misinformed attempt
to implement refcounting on the Messenger.
Signed-off... - 05:45 PM Revision 0be9718a (ceph): librbd: 'rbd cache enabled' -> 'rbd cache'
- 'enabled' is useless verbiage. We should fix the rgw option too,
protably...
Signed-off-by: Sage Weil <sage@newdrea... - 03:59 PM Revision e875b89f (ceph): Add task for running fsx on an rbd image.
- 03:03 PM Feature #2309 (Duplicate): rados namespaces
- 01:33 PM Revision 6cbda4c3 (ceph): qa: comment out xfstest 232
- Test 232 in the xfstests suite produces an XFS error in the log
when run over an RBD device. This is most likely an ... - 01:23 PM rgw Bug #2289 (Resolved): rgw: listing a bucket hangs after removing inexisting object
- Fixes merged into master at commit:3053e4773bae93cfa3158882aa4963803862f9b2.
- 01:13 PM Bug #2304 (Resolved): rbd import fails on block device
- 11:57 AM CephFS Bug #2299 (Rejected): all MDS commit suicide on startup
- 11:54 AM Bug #2219 (Can't reproduce): OSD's commit suicide with 0.44
- Let us know if you see this again! Thanks
- 11:40 AM Bug #2301 (Resolved): librados: LibRadosMisc.AioOperatePP failure
- 11:27 AM rgw Feature #2308 (Resolved): radosgw-admin: make user create idempotent
- radosgw-admin user create should be idempotent and work similar to user modify. We would need to verify that the same...
- 08:11 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- I believe we are seeing the same problem here. I have been able to reproduce it each time I have tried. The hardwar...
- 07:14 AM Revision e0ec2865 (ceph): objecter: recalc pgid in case it wasn't a precalculated pgid
- We had an issue where we didn't call it since we relied on the
existance of the object name and we were passed an emp... - 07:12 AM Revision f22da6c0 (ceph): cls_rgw: rgw_dir_suggest_changes move cur_disk_bl inside loop
- Otherwise we'd fail in case the cls_cxx_read_key failed, since
we rely on its size.
Signed-off-by: Yehuda Sadeh <yeh... - 06:36 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- I have updated the run_xfstests.sh script so that it simply no longer
runs test 232. That way we can still benefit ... - 04:36 AM Revision 998498f2 (ceph): Merge branch 'wip-oc-perf'
- 04:32 AM Revision 31d0d1d7 (ceph): librados: test get_pool_name
- Signed-off-by: Sage Weil <sage@newdream.net>
- 04:32 AM Revision 2dbea128 (ceph): librados: C bindings for get_pool_name()
- Signed-off-by: Sage Weil <sage@newdream.net>
04/16/2012
- 10:02 PM Bug #2178: rbd: corruption of first block
- The most recent occurrence has been confirmed to be a replay issue with non-btrfs filesystems. The wip-guard branch ...
- 09:54 PM Bug #2255 (Resolved): osd: fix object name collisions between pools in temp collection
- 09:52 PM Bug #2286: mon: different full/near_full values on different monitors
- pushed a patch that confines the logic of when to update this into a single bit of code. look okay?
i think the b... - 12:57 PM Bug #2286: mon: different full/near_full values on different monitors
- Sage asked on irc about just setting it up on the initial create_empty. The problem with that the only data which is ...
- 11:26 AM Bug #2286 (Fix Under Review): mon: different full/near_full values on different monitors
- This got (obviously) broken by commit:b6d1c0c9b7290a237560528b6ff0d6b2b2998ee2, which put in the use of magic numbers...
- 09:37 PM Feature #2113 (Resolved): objectcacher perfcounters
- 11:24 AM Feature #2113: objectcacher perfcounters
- My bad — I'll try and do that today!
- 11:13 AM Feature #2113 (Fix Under Review): objectcacher perfcounters
- not merged yet! i wanted to get feedback first on my naming kludge...
- 10:02 AM Feature #2113 (Resolved): objectcacher perfcounters
- Sage merged this.
- 09:21 PM Revision 9f5b9ada (ceph): Merge remote branch 'upstream/wip-guard'
- 09:19 PM Revision 2f5af83f (ceph): Filestore: fix guard errors in COLL_REMOVE and COLL_MOVE
- Use object rather than collection check in COLL_REMOVE. Adjust
COLL_MOVE check for -1/1 distinction.
Signed-off-by:... - 07:32 PM Revision f87a6a0f (ceph): cls_rgw: rgw_dir_suggest_changes() don't try to decode cur_disk
- don't try to decode cur_disk if bufferlist is zero sized.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> - 07:32 PM Revision 6868ec6a (ceph): rgw: skip empty entries when listing a bucket
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 07:32 PM Revision 73badda6 (ceph): cls_rgw: don't zero out error code before reading value
- We were zeroing the return code before reading it.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 04:12 PM Bug #2306: objecter: accessing empty object maps to pool 0
- that looks right to me.
and yeah, i don't think object operations should be possible on an empty object name... - 04:03 PM Bug #2306: objecter: accessing empty object maps to pool 0
- Yep, that's pretty much exactly what I was thinking.
The only other question is if this fix is the right approach ... - 04:00 PM Bug #2306: objecter: accessing empty object maps to pool 0
- Would something like this work (not tested)?...
- 03:52 PM Bug #2306: objecter: accessing empty object maps to pool 0
- i prefer an explicit separate field for oid-vs-pg mode so that we can distinguish between pg 0.0 (really) and no pg/n...
- 03:07 PM Bug #2306: objecter: accessing empty object maps to pool 0
- Ah, nope. list_objects is broken.
- 03:06 PM Bug #2306: objecter: accessing empty object maps to pool 0
- From what I see, the pg ops call pool_op_submit() and not op_submit() so Greg's fix might be ok?
- 02:53 PM Bug #2306: objecter: accessing empty object maps to pool 0
- Ah, you're right. I missed that function when looking to see who filled in the op->pgid.
In that case we should ma... - 02:33 PM Bug #2306: objecter: accessing empty object maps to pool 0
- i think that if was there for the pg ops (PGLS) where there is no object... the list_objects code is filling in the p...
- 02:07 PM Bug #2306 (Fix Under Review): objecter: accessing empty object maps to pool 0
- Yep, the Objecter doesn't calculate pg placement for objects with a zero-length name. I'm pretty sure the if guard th...
- 01:51 PM Bug #2306: objecter: accessing empty object maps to pool 0
- Empty object <== object with empty name
- 01:51 PM Bug #2306 (Resolved): objecter: accessing empty object maps to pool 0
- Even if different pool was specified.
- 03:34 PM CephFS Bug #2299: all MDS commit suicide on startup
- this issue can be closed, there was an error in the underlying fileystem of osd.0 :)
- 02:59 PM CephFS Bug #2277: qa: flock test broken
- I was going to move this over to the kernel client project and then realized I can't — should we close this bug (reje...
- 02:46 PM CephFS Bug #2277: qa: flock test broken
- ...
- 02:15 PM Linux kernel client Bug #2298: rbd: broken encode_op for big-endian hosts?
- there are some old g5's in the closet here at aon that we can use.
in the past we've found/fixed these issues with... - 01:46 PM Linux kernel client Bug #2298: rbd: broken encode_op for big-endian hosts?
- I haven't looked at this in any detail but I presume Al is correct.
We don't have any big endian hardware anywhere, ... - 01:49 PM CephFS Bug #2288: libcephfs: setxattr returns EEXIST following removexattr
- More info:
That branch has a patch which adds a call to removexattr before the setxattr. If you run testceph twice i... - 01:38 PM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- Should have waited. It have reproduced the problem by running test 232.
- 01:37 PM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- After a lot of repetitions, I've narrowed it down to test 232 or 234.
- 10:12 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- I ran subsets of that list at least three times and never
reproduced it. I tried again after a reboot, and again,
... - 06:43 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- Looking at the list of tests that indicate they include quota testing,
the ones that are currently being run by the ... - 06:36 AM Linux kernel client Bug #2302: xfs: warning at mutex_remove_waiter
- I sent a report to the XFS mailing list about the warning. I have to try
to narrow down which test was running when... - 01:18 PM CephFS Bug #2285: libcephfs: failure with empty name components
- Yep, it's client-local; there's no request to the MDS for this either.
Guess that means we don't care right now? - 01:04 PM CephFS Bug #2285 (In Progress): libcephfs: failure with empty name components
- Oddly, this looks like it's a race. I can't reproduce it with any client debugging on...
- 11:33 AM Feature #2305: Moving rbd images between pools
- Not quite; copy works, but slowly (because of course it's duplicating all the data). I don't know if mv/rename could...
- 11:24 AM Feature #2305 (Rejected): Moving rbd images between pools
- We discovered it does work if you keep the image names the same and vary the pool names. :)
- 11:01 AM Feature #2305 (Resolved): Moving rbd images between pools
- It would be nice to have an option to move rbd's between pools with a syntax like:
rbd mv <first poolname>/<image na... - 10:02 AM Messengers Cleanup #2150 (In Progress): repair the Simple/Messenger interface
- Not really done! ;)
- 08:44 AM rbd Feature #2297: ObjectCacher: mark buffers mergeable for ksm
- I'm really not sure this is something we want to do, especially unconditionally. Let's wait until we get some idea of...
- 07:27 AM Bug #2304 (Resolved): rbd import fails on block device
- root@burnupi30:~# rbd import /dev/sda burnupi30.sda
fiemap ioctl() failed
Importing image: 100% complete...done.
... - 03:39 AM Revision 6bede298 (ceph): dump_stuck: whitelist 'wrongly marked me down'
- The test marks the osds down.. they may generate this error if they get
that faster than they get the signal via the ...
04/15/2012
- 08:30 PM Bug #2303 (Can't reproduce): osd: failed to peer on startup
- ubuntu@teuthology:/a/nightly_coverage_2012-04-14-b/994
- 08:24 PM Linux kernel client Bug #2302 (Can't reproduce): xfs: warning at mutex_remove_waiter
- ...
- 03:33 PM Feature #1044 (In Progress): librbd: discard support
- 03:33 PM Feature #2163 (Resolved): qa: full xfstests on rbd
- 03:33 PM Subtask #2249 (Resolved): teuthology task (3)
- 03:33 PM Feature #2226 (Resolved): osd: better filestore idempotency test
- 05:39 AM Linux kernel client Bug #2287: rbd: crashes with 10Gbit network and fio
- Here some more info from the crash:
@[58113.180039] libceph: tid 387083 timed out on osd92, will reset osd
[5818...
04/14/2012
- 10:38 PM Revision 1bc01289 (ceph): Merge branch 'stable'
- 09:06 PM Revision 19e673cc (ceph): filestore_idempotent: use new sequence-based tester
- random seed, inject at 50-300.
- 08:53 PM Revision 20ab74ea (ceph): test_librbd: fix write sizes
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 08:07 PM CephFS Bug #2299: all MDS commit suicide on startup
- after i told osd.0 to get lost and reformatted it, the cluster started resyncing.
then (magically) mds.0 started up ... - 09:39 AM CephFS Bug #2299 (Rejected): all MDS commit suicide on startup
- my setup is: 1 MON, 2 MDS and 4 OSD.
ceph version is commit:1e76a8713feac6883c648512dcdc28c83f7ff69e.
after copyi... - 05:26 PM Revision 81c8a7b9 (ceph): test_idempotent_sequence: no config file
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:43 PM Revision a40fe5bd (ceph): run_xfstests.sh: ensure cleanup on errors
- Because we exit on any error (due to 'set -e'), the cleanup call was
never getting made in the event of an error. Th... - 04:41 PM Bug #2301: librados: LibRadosMisc.AioOperatePP failure
- the problem is that the completion callback is now async, but wait_for_complete() is not.
do we think that is ok?
- 02:59 PM Bug #2301 (Resolved): librados: LibRadosMisc.AioOperatePP failure
- 2012-04-14T00:11:00.763 INFO:teuthology.task.workunit.client.0.out:[ RUN ] LibRadosMisc.AioOperatePP
2012-04-14... - 04:26 PM Revision 3d187db5 (ceph): run_xfstests.sh: pass test result via exit status
- It turns out that xfstests *does* exit with non-zero status
when a test fails. Its exit status is the number of test... - 01:58 PM Bug #2300 (Rejected): objecter: not sending stat request
- 01:50 PM Bug #2300: objecter: not sending stat request
- Ah, actually we try to access an object with empty oid, which is obviously wrong. Probably due to #2289 issues.
- 12:34 PM Bug #2300 (Rejected): objecter: not sending stat request
- Happens in rgw (can only see it on congress). Following a rgw.bucket_list call response, we call librados io_ctx->sta...
- 12:38 PM rgw Bug #2289: rgw: listing a bucket hangs after removing inexisting object
- Pushed several fixes to wip-2289. The scenario was:
creating bucket
trying to remove object that does not exist
... - 06:53 AM Revision a4dd9204 (ceph): objclass: fix cls_cxx_map_remove_key()
- didn't set the correct osd op, didn't do anything.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> - 06:10 AM Revision b352a3e1 (ceph): rgw: call dir_suggest_changes with correct bufferlist
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 05:28 AM Revision 6ba4efcd (ceph): rbd.py: add xfstests functionality
- Add tasks for running xfstests over a pair of rbd volumes. The main
one is called xfstests, and it sets up rbd volum... - 05:27 AM Revision 4498825a (ceph): add rbd_xfstests to regression suite
- 05:19 AM Revision a31392cf (ceph): objectcacher: name them
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:18 AM Revision 28e61811 (ceph): librados: add IoCtx::get_pool_name() to c++ api
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:51 AM Revision 446b5e24 (ceph): ObjectCacher: move functions out of the header.
- These all set off my "big-functions-in-the-header" alarm.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 04:51 AM Revision 6257a94e (ceph): ObjectCacher: add a bunch of perfcounters
- Hurray, we can collect cache statistics!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 04:51 AM Revision f355875b (ceph): ObjectCacher: group all the function definitions properly
- Some of the ObjectCacher functions were separate, which works poorly
if you're trying to do things like use ldout.
S... - 04:51 AM Revision d80c126a (ceph): ObjectCacher: use "objectcacher" instead of the CephContext::name for p...
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 04:51 AM Revision 72a75730 (ceph): objectcacher: perf skeleton
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
Conflicts:
src/osdc/ObjectCacher.cc
src/osdc/ObjectCacher.h - 04:41 AM Revision 2cc854f7 (ceph): msgr: Document all the functions in the Messenger interface.
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 04:40 AM Revision b8207dd2 (ceph): msgr: remove several functions from the interface.
- The suicide() function is unused and its current SimpleMessenger
implementation just calls shutdown(). And I don't se... - 04:02 AM Revision bf8847e7 (ceph): qa: add run_xfsests.sh script
- Add a script that runs xfstests over a pair of devices that are
specified using command line arguments. The tests ar... - 04:01 AM Revision ee69f271 (ceph): test/rbd/fsx: fix signed/unsigned warnings
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:52 AM Revision 812c7c7a (ceph): librbd: implement discard
- Implement sync and async discard. Embed an ObjectWriteOperation in the
BlockCompletion struct.
The sync version doe... - 03:51 AM Revision 589c6269 (ceph): Merge branch 'wip-librbd-caching'
- Reviewed-by: Sage Weil <sage.weil@dreamhost.com>
- 03:46 AM Revision 5c2d5b18 (ceph): test: copy fsx to rbd directory
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision b26d7024 (ceph): test: port fsx to run on an rbd image instead of a file
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision 78d14da8 (ceph): librbd: flush pending writes when a new snapshot is created
- This makes sure the state is as consistent as librbd can make it
before the snapshot is actually created.
Signed-off... - 03:46 AM Revision 30805bfa (ceph): test_librbd_fsx: clean up properly
- Delete and close everything on exit.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> - 03:46 AM Revision 68e2e004 (ceph): librbd: flush cache before creating a snapshot
- This is a temporary workaround until the ObjectCacher
is smarter about snapshots.
Signed-off-by: Josh Durgin <josh.d... - 03:46 AM Revision d0b1f72c (ceph): ObjectCacher: add some asserts about when the lock is held
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision 7197825d (ceph): Remove unused Completion.h header
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision 1dd9c303 (ceph): ObjectCacher: remove redundant 'public:'
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision 7868dbe0 (ceph): ObjectCacher: remove unused/unimplemented methods
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision ee9987b2 (ceph): librados: debug logging for aio writes
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision 20be92ec (ceph): ObjectCacher: always delete callback in commit_set
- The GatherBuilder used later always deletes it. To make
the interface sane, delete it in every return path.
Signed-o... - 03:46 AM Revision 6cf22655 (ceph): ObjectCacher: remove useless conditional
- We assert this condition at the beginning of the function.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> - 03:46 AM Revision 55b65544 (ceph): test_librbd: flush before opening another handle to the image
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision d9aa81b0 (ceph): finisher: add indent settings header
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision f464e6c3 (ceph): ObjectCacher: include object when outputting a BufferHead
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision 9c8b3f70 (ceph): .gitignore: clean up to not match so many things recursively
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
- 03:46 AM Revision a0fc3336 (ceph): librados: unconditonally associate IoCtxImpl with AioCompletionImpl
- We were only setting the pointer if we were on the aio_write_list. Set
this pointer for any aio.
Signed-off-by: Sag... - 03:46 AM Revision 34a10fae (ceph): librados: do aio callbacks in async thread
- Call user completions in an async thread. This allows callers to call back
into librados from the callback, and allo... - 03:46 AM Revision cbbd5f97 (ceph): ObjectCacher: check for read/write errors
- The objecter doesn't fix every error for us.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> - 03:46 AM Revision abbfd937 (ceph): Filer: make file_to_extents static
- Now the ObjectCacher has no need for a Filer instance (not that
it had a good reason for one before).
Signed-off-by:... - 03:46 AM Revision d6dbff01 (ceph): ObjectCacher: remove dependency on Objecter
- Abstract out how writeback is done with a WritebackHandler object.
For RBD caching, this will be done by librados, bu... - 03:46 AM Revision 9546c158 (ceph): librbd: remove writeback window
- This is superseded by a full-fledged writeback cache.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> - 03:46 AM Revision c0073770 (ceph): librbd: allow writeback caching
- This uses the existing infrastructure of ObjectCacher for
buffer management and expiry.
Signed-off-by: Josh Durgin <... - 03:46 AM Revision 579d30b6 (ceph): librbd: check for writes to snapshots
- librados does this for us normally, but caching does not check for this.
We might as well check early to avoid schedu... - 03:46 AM Revision bb35ecf0 (ceph): librbd: fix bytes read accounting in read_iterate
- ObjectCacher will never do short reads, and always returns 0.
librados may do short reads at the end of an object.
S... - 03:41 AM Revision cc1ae672 (ceph): filestore: flush log on failure injection
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:41 AM Revision ad2eba43 (ceph): Merge branch 'wip-2226b'
- Conflicts:
src/common/config_opts.h
src/os/FileStore.cc
src/os/FileStore.h - 03:38 AM Revision 4fc48e5b (ceph): run_seed_to_range.sh: run filestore idempotent sequence tests over a range
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision f052c82b (ceph): run_seed_to: assume tool is in path, not cwd
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 2cb0032e (ceph): DeterministicOpSequence: writing to object being cloned in the same tx.
- We write to the object being cloned prior to the clone to ensure we are
cloning a valid range of bytes.
The write an... - 03:38 AM Revision c7729847 (ceph): test_idempotent_sequence: simple test script
- Takes seed and kill point, then verifies the result.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 03:38 AM Revision 95bb4239 (ceph): filestore: use our assert
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision a8b9b98c (ceph): run_seed_to.sh: set -e
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 56fb027f (ceph): test_idempotent_sequence: make get-last-op return 0 if no txns applied
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 2d24eb33 (ceph): FileStore: don't return ENOENT from object_map getters
- ENOENT in those contexts means that the object_map does not know
about the object, not that the object does not exist... - 03:38 AM Revision 21be4ad9 (ceph): test_idempotent_sequence: Add more info on diff's dout's.
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision bb0499a4 (ceph): test_idempotent_sequence: outputting collection being diff'ed.
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision 27cb82e5 (ceph): test_idempotent_sequence: use get_obj_at() for clone source
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 1b451294 (ceph): test_idempotent_sequence: fix/enable do_coll_add
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision a5f533a7 (ceph): test_idempotent_sequence: fix diff return value, mount behavior
- No mkfs!
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 03:38 AM Revision 08cf4d28 (ceph): test_idempotent_sequence: include object name in diff output
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 2d20b4c2 (ceph): test_idempotent_sequence: return true from diff if different
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 5d30581e (ceph): test_idempotent_sequence: add remove op
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision e270e0b5 (ceph): test_idempotent_sequence: diff two stores.
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision 2c940a4f (ceph): test_idempotent_sequence: Remove references to VerifyFileStore
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision 42899110 (ceph): Makefile: remove VerifyFileStore from test_idempotent_sequence build
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision 84175c7d (ceph): VerifyFileStore: delete from tree
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision 794fc0b6 (ceph): test_idempotent_sequence: fix collection_move arg order
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision a8c32057 (ceph): test_idempotent_sequence: whitesepace
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision bc5dffe9 (ceph): test_idempotent_sequence: replace on coll_move
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision c056f603 (ceph): test_idempotent_sequence: simplify clone
- - pick existing source
- any target is ok, as long as it != source
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 03:38 AM Revision 51bc6dfd (ceph): test_idempotent_sequence: make object names overlap
- This way when we move objects between collections they will replace
each other in interesting ways.
Signed-off-by: S... - 03:38 AM Revision db3d14b9 (ceph): test_idempotent_sequence: fewer objects per collection by default
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision bec1e50e (ceph): test_idempotent_sequence: remove existing target for collection_add
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 975e59ad (ceph): test_idempotent_sequence: Add commands and lose a couple of optional args.
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:38 AM Revision 5ced86be (ceph): filestore: init filestore_kill_at in ctor
- Otherwise we don't get the option for FileStore instances created after
common_init_finish() (which does md_config_t:... - 03:38 AM Revision 9dba3172 (ceph): test_idempotent_sequence: require nonexistent filestore dir
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 795274bc (ceph): test_idempotent_sequence: include tx number in each transaction
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 57866946 (ceph): test_idempotent_sequence: implement get-last-op
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision 6573690a (ceph): VerifyFileStore: Check if two FileStore's match after applying a set of...
- With DeterministicOpSequence we are able to reproduce exactly the same
sequence of operations, over and over. However... - 03:38 AM Revision dee47bb2 (ceph): test_idempotent_sequence: Generate a reproducible sequence of txs.
- With this test we aim at reproducing the same sequence of transactions
as long as we are provided with the same seed ... - 03:38 AM Revision 4a6cfcb3 (ceph): deterministicopseq: add collection_rename() support
- 03:38 AM Revision 01daa88d (ceph): test_idempotent_sequence: no need to reinject value that is already there
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:38 AM Revision a5a83950 (ceph): filestore: name internally
- We need to allow the perfcounter name to be controlled so that we can have
two instances of FileStore in the same pro... - 03:38 AM Revision 83227550 (ceph): TestFileStoreState: Represent a FileStore's state to be used by tests.
- Instead of having each test creating the same representation of a
FileStore's state, with a map/set of collections an... - 03:38 AM Revision 2f02a8a5 (ceph): DeterministicOpSequence: Generate a reproducible sequence of operations.
- Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com>
- 03:33 AM Revision 583c79af (ceph): filestore: fix collection_add argument names
- No functional changes, just fixing and clarifying argument names so that it
is less confusing/wrong.
Signed-off-by: ... - 03:33 AM Revision dfcadc94 (ceph): filestore: simple failure injections via --filestore-kill-at <n>
- This will make filestore suicide (_exit(1)) on the n'th potential failure
call site. We can potentially fail:
... - 03:33 AM Revision f4001790 (ceph): filestore: two-phase guard
- For certain operations (collection_add) we need a two-phase guard, and an
"in-progress" state.
- before exposing an... - 03:33 AM Revision 8d3acb5a (ceph): filestore: replay collection_move using add+remove
- This approximates the buggy collection_move. It is still buggy. It is
only there to replay old journals.
Rip out b... - 01:14 AM Revision c6551fb8 (ceph): test_rados.py: add basic tests for the Object class
- Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Reviewed-by: Greg Farnum <gregory.farnum@dreamhost.com> - 01:13 AM Revision 76799680 (ceph): rados.py: fix Object.read argument order
- 57cfd7673da66d191cc397689da35d0a1f3dc61a switched the argument order but didn't change this caller.
Signed-off-by: T...
04/13/2012
- 11:12 PM Revision 5ce8d71f (ceph): FileJournal: allow write_thread and write_finisher to run in parallel
- Previously, the write_thread and the write_finisher serialized on
write_lock limiting performance. aio_lock has been... - 11:03 PM Feature #1044 (Resolved): librbd: discard support
- 11:03 PM Feature #2163: qa: full xfstests on rbd
- 11:02 PM Feature #2052 (Resolved): librbd: caching
- 06:37 PM Feature #2052: librbd: caching
- This is passing long-running fsx with osd thrashing consistently, and all the other rbd tests. I think the branch (wi...
- 10:40 PM Linux kernel client Bug #2298 (Resolved): rbd: broken encode_op for big-endian hosts?
- ...
- 10:17 PM Subtask #2249: teuthology task (3)
- 09:26 PM Subtask #2237 (Resolved): failure+replay tester (8)
- 06:39 PM Bug #2278 (Resolved): librados: python read has arguments swapped
- Fixed by 76799680546a79fc73ad7bbc58960a31ae2290ad.
- 10:10 AM Bug #2278: librados: python read has arguments swapped
- 07:56 AM Bug #2278 (Resolved): librados: python read has arguments swapped
- Object.read from rados.py is passing arguments to ioctx.read in a wrong order.
--- rados.py.dist 2012-04-13 16:5... - 06:38 PM rbd Feature #2297 (New): ObjectCacher: mark buffers mergeable for ksm
- This is done with a simple madvise call, but we should test that it works with ksm and verify that all the buffers ar...
- 06:29 PM Feature #2296 (Resolved): librbd: allow resizing to arbitrary sizes
- Right now resizing to a non-object-size multiple will round down the remainder. With discard support, we support this...
- 06:25 PM Feature #2295 (Resolved): make qemu cache=writeback,writethrough option turn on librbd caching
- This will enable more familiar use of caching with qemu/rbd, and let people configure it with libvirt's existing xml.
- 05:51 PM rbd Feature #2294 (New): librbd: optionally cache entire objects, instead of only requesting the part...
- This may save many round trips for small read sizes (common to vms).
- 05:49 PM Feature #2113 (Fix Under Review): objectcacher perfcounters
- Okay, I checked and these work — if you run ceph-fuse -f and play around you can do a dump_perfcounters and see the v...
- 05:46 PM CephFS Bug #2293 (Resolved): admin sockets don't persist with ceph-fuse
- It looks like the admin socket is associated with the launching process, rather than the background process that cont...
- 05:46 PM rbd Feature #2292 (New): ObjectCacher: support sparse objects
- The ObjectCacher doesn't store which objects or parts of objects don't exist. This info could improve read performance.
- 05:46 PM Revision dc44950b (ceph): filestore: implement collection_move() as add + remove
- This ensures we get add and remove steps with different spos values, which
makes the guard work. The collection_move... - 05:46 PM Bug #2291 (Can't reproduce): objectcacher perfcounters don't work with test_librbd_fsx
- The admin socket perfcounters_dump command only outputs objecter data. I'm speculating that it has to do with the obj...
- 05:30 PM Feature #2290 (Resolved): ObjectCacher: handle read/write errors
- Currently the return value of the underlying read/write calls is ignored (I left TODO notes there). We should figure ...
- 05:02 PM rgw Bug #2289 (Resolved): rgw: listing a bucket hangs after removing inexisting object
- 04:59 PM Revision dc10d04e (ceph): OSD: improve information and format of OSDTracker messages
- Also, Message now has a timestamp indicating when the message
was fully recieved for use by OSDTracker.
Signed-off-b... - 03:12 PM Subtask #2235 (Resolved): generate deterministic sequence of transactions (5)
- 02:01 PM CephFS Bug #2288 (Resolved): libcephfs: setxattr returns EEXIST following removexattr
- running cephtest a couple of time (out of wip-testlibcephfs):...
- 01:48 PM Linux kernel client Bug #2287 (Resolved): rbd: crashes with 10Gbit network and fio
- From http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/5968:...
- 01:47 PM Bug #2286 (Resolved): mon: different full/near_full values on different monitors
- If you run vstart, you get...
- 01:39 PM CephFS Bug #2285 (Resolved): libcephfs: failure with empty name components
- the following in client/testceph.cc fails:
// test empty name components
my_fd = ret = ceph_open(cmount, "rea... - 11:19 AM rgw Feature #2284 (Resolved): rgw: bench based on rados_bench
- 11:17 AM rgw Feature #2171 (Rejected): rgw: asynchronously calculate md5
- 11:16 AM Feature #2283: The ceph command should time out
- 10:21 AM Feature #2283 (New): The ceph command should time out
- When using ceph to query certain parts of the cluster, there should be an option to time out after a certain set numb...
- 09:44 AM Subtask #2282 (Resolved): Handle map updates on a per-pg basis
- Currently, we advance all pgs to the next map at once. This requires us to flush the filestore queue and basically h...
- 09:27 AM Feature #2281 (Resolved): build big burnupi cluster for testing
- 09:23 AM Feature #2280 (Resolved): improve gitbuilder infrastructure
- * do not fill up local disk; sync results out immediately
* resolve branches immediately, not after each full pass
? - 09:20 AM rbd Feature #2279 (Resolved): rbd: trivial layering design doc
- - how parent images are marked read-only
- how parent/child relationship is represented
- possibly how this allow... - 09:16 AM Bug #2192 (Need More Info): ceph-mon hangs consuming 100% CPU
- 09:14 AM Feature #2246 (Resolved): force10s on sepia
- 09:13 AM Messengers Cleanup #2150 (Resolved): repair the Simple/Messenger interface
- 09:13 AM Feature #2240 (Resolved): osd: new default locations
- 06:16 AM Revision 3d11455b (ceph): osdmaptool: fix clitest for lpg_num 0
- Broke this in 11b93d3af2c0f1bca3e48fe773a14f07f479ccc3
Signed-off-by: Sage Weil <sage@newdream.net> - 05:56 AM Revision 55535d04 (ceph): move tasks:cfuse_workunit_suites_dbench.yaml to stress pending #1737 fix
- 05:36 AM Revision 1e76a871 (ceph): Merge remote branch 'gh/wip-defaults'
- 04:47 AM Revision 8a6c3c17 (ceph): msgr: dispatcher: Documentation of security functions
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
04/12/2012
- 11:24 PM Revision 9b2aff20 (ceph): FileStore: don't return ENOENT from object_map getters
- ENOENT in those contexts means that the object_map does not know
about the object, not that the object does not exist... - 11:17 PM Subtask #2237 (In Progress): failure+replay tester (8)
- 11:17 PM Subtask #2235: generate deterministic sequence of transactions (5)
- 11:15 PM Feature #2240: osd: new default locations
- 10:58 PM CephFS Bug #2277 (New): qa: flock test broken
- ubuntu@teuthology:/a/nightly_coverage_2012-04-12-b/687
ubuntu@teuthology:/a/nightly_coverage_2012-04-11-b/525
thi... - 10:50 PM Revision 241c5e5e (ceph): msgr: Reorder functions to reduce the number of privacy tags.
- No functional changes.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 10:48 PM CephFS Bug #1737: ceph-fuse crash in xlist::remove
- ubuntu@teuthology:/a/nightly_coverage_2012-04-12-b/717
- chef: null
- ceph: null
- ceph-fuse: null
- workunit:
... - 10:45 PM CephFS Bug #2187: pjd chown/00.t failed test 97
- 2012-04-12T13:09:27.496 INFO:teuthology.task.workunit.client.0.out:../pjd-fstest-20080816/tests/chown/00.t (Wstat: ...
- 10:45 PM Revision 5cf7628f (ceph): msgr: miscellaneous fixes.
- Indentation on a public label; various function documentation; remove
an inappropriate definition of a virtual functi... - 10:35 PM Bug #2276 (Rejected): osd: eat cpu on restart
- osd.856 on congress.
- 10:17 PM Revision 8e5f5625 (ceph): DBObjectMap: check in clone and link that source is not destination
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 09:35 PM Bug #2275 (Resolved): osd: crash in FileJournal::wrap_read_bl
- ...
- 07:16 PM Revision a5730c85 (ceph): Merge branch 'wip-2223'
- Reviewed-by: Sage Weil <sage@newdream.net>
- 05:59 PM Revision 9565a8ba (ceph): ObjectStore: Remove code duplication when dumping transactions.
- By using OStreamFormatter, we can have a single function responsible for
dumping a transaction. We keep the same old ... - 05:59 PM Revision ddbaf584 (ceph): filestore: remove commented code, fix whitespace
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:59 PM Revision 7223f00a (ceph): filestore: dump transactions sets, include seq and trans_num
- Lists of transactions commit as a unit; reflect that in the dump. Also
include seq and trans_num.
Signed-off-by: Sa... - 05:59 PM Revision 816a5128 (ceph): objectstore: tweak dump() a bit
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:59 PM Revision 308c1fc6 (ceph): filestore: use config observer to start/stop dumping/tracing
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:59 PM Revision 6ee0b718 (ceph): filejournal: dump injson
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:59 PM Revision 6a7bbe86 (ceph): filestore: dump transaction in json on error
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:29 PM Documentation #2274 (Closed): Basic Availability Model
- (1) Construct a continuous-time markov availability model for a basic cluster (3 mons, 4 osds, 2 copy)
(Petri ne... - 04:19 PM Documentation #2273 (Closed): basic reliability models
- 1. construct a probabilistic model for data loss in 1, 2, and 3 copy systems, assuming independent failures
2. plug ... - 04:13 PM RADOS Documentation #2272 (Closed): FAQs: RADOS reliability and availability
- I expect others to improve this, but this is just to capture the ideas.
It is probably more of a white paper than an... - 04:06 PM Documentation #2271 (Resolved): FAQ: BTRFS vs XFS
- I expect others to improve this list, but to start it out ...
what file systems we run on (and test on)
how you... - 12:15 PM Feature #2223 (Resolved): Tracing facility on FileStore
- 09:05 AM RADOS Feature #2268 (Resolved): crush: update item's position in crush map
- via crushtool and 'ceph osd crush ...'
- 03:55 AM Bug #2267 (Closed): Ceph client crashed after shutting down one mds and osd
- Ceph version: 0.44.1-1~bpo70+1
Kernel version: 3.2.12-1
Ceph config:
[global]
auth supported = cephx
keyri... - 12:38 AM Revision cfac4a97 (ceph): pgmon: clear out osd reports after the OSD has gone down
- Previously we never removed report times from last_osd_report. Do
so, in check_osd_map (which, on the leader, is call...
04/11/2012
- 11:00 PM Revision 119dd5ae (ceph): mkcephfs: update man page
- Signed-off-by: Sage Weil <sage@newdream.net>
- 11:00 PM Revision 4a4b7994 (ceph): ceph-authtool: update man page
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:53 PM Revision ab08fb8b (ceph): mkcephfs: note that btrfs (and --mkbtrfs) are optional and experimental
- And that --mkbtrfs will be deprecated soon.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 10:53 PM Revision ee39291a (ceph): ceph-authtool: add warning to man page
- - data is not encrypted over the wire
- intended for trusted environments
Signed-off-by: Sage Weil <sage.weil@dreamh... - 10:40 PM Revision 11b93d3a (ceph): osd: disable localized pgs by default
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:40 PM Revision 8836b81f (ceph): mon: alloc pgp_num adjustment up and down
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:39 PM Revision 83e1260b (ceph): mon: set pgp_num == pg_num (by default) for new pools
- For when pg_num is specified but not pgp_num. Thanks Greg!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 10:39 PM Revision 58671a4c (ceph): mon: command to disable localized pgs for a pool
- ceph osd pool disable_lpgs <poolname> --yes-i-really-mean-it
Grr, these should be off by default. We can't adjust t... - 08:35 PM Revision 7fdf25bc (ceph): debian: python-support -> dh_python2
- I followed the instructions on
http://wiki.debian.org/Python/TransitionToDHPython2
Signed-off-by: Sage Weil <sage@... - 07:35 PM Revision ed0653b4 (ceph): COPYING: doc/ CC BY-SA
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:33 PM Revision 6e83e119 (ceph): README: update
- - refer to COPYING, SubmittingPatches
- a word about dependencies
- building packages
- drop the list of built binari... - 06:42 PM Revision 838a7618 (ceph): ceph-rbdnamer: include in dist tarball and debs/rpms
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:42 PM Revision af502735 (ceph): obsync: include man page in tarball, packages
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:42 PM Revision 9678c097 (ceph): init-radosgw: start in runlevel 4
- Fixes lintian error
W: radosgw: init.d-script-missing-start etc/init.d/radosgw 4
Signed-off-by: Sage Weil <sage@new... - 06:42 PM Revision 84efc554 (ceph): debian: drop unnecessary conflicts on librgw
- Cut and paste baggage from libcephfs, it looks like.
Signed-off-by: Sage Weil <sage@newdream.net> - 06:18 PM Messengers Cleanup #2150 (In Progress): repair the Simple/Messenger interface
- I haven't done it, but I had enough time to glance over it and see at least a couple things that need fixing before t...
- 05:49 PM Feature #2113: objectcacher perfcounters
- Sage asked me to run it under an rbd mount and look at it. Need to get tests from Josh and then figure out how to do ...
- 04:30 PM Feature #2113 (Fix Under Review): objectcacher perfcounters
- Compile-tested.
- 10:51 AM Feature #2113 (In Progress): objectcacher perfcounters
- Yoink.
- 05:40 PM Revision 292898a8 (ceph): init-ceph: start at all runlevels
- This fixes lintian error:
W: ceph: init.d-script-missing-start etc/init.d/ceph 4
Signed-off-by: Sage Weil <sage@new... - 05:03 PM Revision b1946290 (ceph): Merge branch 'stable'
- 04:30 PM Bug #2266 (Resolved): teuthology: nuke after failure is failing
- it fails, and then fails to unlock, and eats up machines.
for example, ubuntu@teuthology:/a/nightly_coverage_2012-... - 03:08 PM Feature #2265 (Rejected): make sure objecter/kclient error out when localized pgs don't exist
- 11:02 AM Bug #2264 (Can't reproduce): mon: failed assert in bump_epoch
- During startup of a teuthology run on commit 1775301bb46379648f3f88914ef56aa1982db020 (before the cluster was healthy...
- 10:48 AM Bug #2263 (Resolved): obsync: move man page to section 1
- 09:25 AM Bug #2262 (Resolved): qa: osd-recovery tasks fails on flush_pg_stats
- consistently
- 08:09 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- Looks like the problem arose while running fsstress on the xfs loop
mount on top of a file on the ext2 filesystem.
... - 07:56 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- FYI, xfstests 49 tests running XFS on a loop device. I have to wait for a
reboot in order to see if I can tell at w... - 07:49 AM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- Looks like xfstests #49 is a reproducer for this problem, at least
after running the tests that lead up to it first ... - 05:47 AM Revision be5b25b6 (ceph): filestore: fix collection_move guard
- We had a sequence like:
1- write A block 1
2- write A block 2
3- write A block 3
4- write A block 4
5- move A -... - 05:47 AM Revision 4bd9d1bb (ceph): filestore: fix collection_add guard
- If we crash between the link() and setting the guard, we will get
EEXIST. Tolerate that.
Signed-off-by: Sage Weil <... - 05:47 AM Revision df4d7a47 (ceph): filestore: fix collection_rename guard
- If we crash between the rename and setting the guard, we can get EEXIST
or ENOTEMPTY on rename. Tolerate that.
Sign... - 05:47 AM Revision 85db25e8 (ceph): filestore: fix fd leak on collection_rename
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:47 AM Revision c3e4c5b7 (ceph): filestore: cleanup: flip sense of replay guard check
- The other are all if (_check_replay_guard(..)) do_it;. Make this one
match.
Signed-off-by: Sage Weil <sage.weil@dre... - 05:43 AM Revision 43de5e4f (ceph): FileStore: dumping transactions to a file
- Dump each queued transaction to a predefined file, specified with
--filestore-dump-file, in JSON format.
Signed-off... - 05:43 AM Revision cd4a760e (ceph): osd: fix heartbeat set_port()
- set_port() fails an assert if it isn't an in4 or in6 address, which a
default entity_addr_t is not.
Signed-off-by: S... - 05:29 AM Linux kernel client Bug #2261 (In Progress): paging error in libceph after crashed osd comes back online
- 05:22 AM Linux kernel client Bug #2261 (Can't reproduce): paging error in libceph after crashed osd comes back online
- ...
- 04:43 AM Revision 1775301b (ceph): osd: reenable clone on recovery
- This hasn't turned up problems in QA.
Fixes: #2002
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 02:25 AM Bug #2178: rbd: corruption of first block
- Well Sage,
I have a torture-test already :-D
OK, so it's independent from yours and that's good. It sounds, we ar...
04/10/2012
- 11:24 PM Feature #2223: Tracing facility on FileStore
- did some cleanup, changed the way the output is structured wrt the transaction lists, and tweaked a few other things....
- 11:23 PM Revision ddb98f77 (ceph): ceph_manager: don't try to start greenlet twice
- spawn already scheduled it. Trying to start it again hits an assert.
- 11:11 PM Revision 6fbac10d (ceph): osd: allow users to specify the osd heartbeat server address.
- Reported-by: Nick Bartos <nick@pistoncloud.com>
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Reviewed-by... - 10:23 PM Bug #2002 (Resolved): osd: racy push/pull for clones
- 10:19 PM Bug #2161 (Resolved): nonlinear scaling for PGMap::pg_stat encode
- commit:bd518e998c0ff12d611db19a8cff6da3622597cb
- 10:18 PM Bug #1953 (Resolved): teuthology: core files aren't archived when using valgrind
- it works!
- 10:10 PM Bug #2225 (Resolved): gitbuilder.ceph.com returning 503: Service Temporarily Unavailable.
- Yehuda found the bad apache option.. override it in the domain_service (maxconnperip=1000 param)
- 09:56 PM Revision 4f030e1b (ceph): osd_types: fix off by one error in is_temp
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Reviewed-by: Gregory Farnum <gregory.farnum@dreamhost.com> - 09:49 PM Messengers Cleanup #2150 (Resolved): repair the Simple/Messenger interface
- 09:49 PM Feature #1044 (Fix Under Review): librbd: discard support
- 09:48 PM Revision 31f16a4c (ceph): rgw: list multipart response fix
- LastModified was formatted outside of the Part block.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 09:06 PM Revision 89fecda6 (ceph): Makefile.am: remove some clutter
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 09:04 PM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- I'm going to have to look at this again in the morning, but I think
we're in this block of code:
#ifdef CONFIG_BL... - 08:37 PM Linux kernel client Bug #2260: libceph: null pointer dereference at try_write+0x638+0xfb0
- Here's a disassembled block of the code where the fault occurred.
The address listed corresponds to offset 3468 belo... - 08:10 PM Linux kernel client Bug #2260 (Resolved): libceph: null pointer dereference at try_write+0x638+0xfb0
- It's not an exact match but it's close enough that I wanted to reopen
bug 1793 or 1866, but found myself unable to. ... - 08:41 PM Revision 1ac5554d (ceph): kernel: kludge around mysterious 0-byte .git/HEAD files
- No idea where these are coming from, but they break nodes with behavior
like
ubuntu@plana08:~$ sudo install -d -m075... - 05:42 PM Revision 0aea1cb1 (ceph): v0.45
- 04:17 PM Revision 0d5918f8 (ceph): kernel: reset to remote firmware branch; don't pull
- Pull might merge if upstream rebases. Just make our branch match the
remote one. - 04:12 PM Revision 9b755fd6 (ceph): kernel: change git incantation for firmware pull
- The 'git pull <uri>' seemed to consistently fail on some nodes. Can't be
sure this was really the problem with them ... - 03:59 PM Revision 22b1f17f (ceph): ls: another newline
- 03:57 PM Revision 7757fbb9 (ceph): ls: remote stray newline
- 03:27 PM Feature #2246: force10s on sepia
- Fabric brought up by Networking group. Interfaces up, configured, and working (nuttcp shows 9.5GB/s or so with
defa... - 01:26 PM Feature #2111: msgr workloads
- I think the messenger tester may be at a point where we can call this bug satisfied.
- 01:18 PM Bug #2178: rbd: corruption of first block
- the good news is i see the problem. the bad news is its the exact bug we thought we fixed. the other good news is w...
- 07:38 AM Bug #2178: rbd: corruption of first block
- Hi Sage,
just in case, the reply from yesterday did not reach you:
--- 8-< ---
Good morning,
it's already... - 12:27 PM Feature #2258 (Resolved): use external leveldb package
- autoconf lets you use the installed library. not doing so by default to avoid the pain of building on older distros.
- 04:22 AM Revision 965f83d4 (ceph): Merge branch 'next'
- 04:20 AM Revision d348e1ab (ceph): configure: --with-system-leveldb
- Default to bundled leveldb. Optionally check.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 04:20 AM Revision 34cc308e (ceph): filestore: fix leveldb includes
- Signed-off-by: Sage Weil <sage@newdream.net>
- 03:23 AM Revision 0b2e1cd2 (ceph): cephfs: fix uninit var warning
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
04/09/2012
- 11:58 PM Revision 9906d5ed (ceph): Change to local mirror of linux-firmware repo to try to stop failures
- 11:17 PM Revision f79b95e5 (ceph): Makefile: add missing .h to tarball
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 10:56 PM Revision 8d5c87a8 (ceph): rgw: fix object name with slashes when vhost style bucket used
- Fixes issue #2259. The problem was that we were initializing the
object name, then in the case of a virtual host buck... - 09:02 PM Revision 853b0458 (ceph): OSD: use per-pg temp collections, bug #2255
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 07:08 PM Revision 36d42dea (ceph): buffer: allow advance() to move an iterator backward
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 07:08 PM Revision bd518e99 (ceph): encoding: fix iterator use for struct_len copy_in
- The end() iterator position does not record an offset when the list is
modified.
Signed-off-by: Sage Weil <sage.weil... - 04:30 PM rgw Bug #2259 (Resolved): rgw: object name cut after slash when virtual host style is used
- Fixed, commit:8d5c87a86e070b4e95ef0d58a469bdbbef4a826c.
- 03:42 PM rgw Bug #2259 (Resolved): rgw: object name cut after slash when virtual host style is used
- 09:32 AM Bug #2178: rbd: corruption of first block
- The missing piece of information is mapping the file offset to a block device offset. Can you, inside the VM,...
- 03:59 AM Revision 7951d7e4 (ceph): Merge remote branch 'gh/stable' into next
- 03:58 AM Revision dd8fd168 (ceph): configure: HAVE_FALLOCATE -> CEPH_HAVE_FALLOCATE
- /usr/include/linux/fs.h defines this on CentOS 5, even though it does not
in fact compile. This stupid workaround av...
04/08/2012
- 09:53 PM Feature #2258 (Resolved): use external leveldb package
- - make our configure take/require a --with-system-leveldb or similar to not use the bundled leveldb
- update the deb... - 08:31 AM Bug #2178: rbd: corruption of first block
- Hi Sage and *Happy easter*,
yesterday I had some "luck" after 10 tries....
Here is what I have for you:
first ...
04/06/2012
- 09:27 PM Feature #1692 (Duplicate): librbd: Support TRIM (hole punching) (userspace client)
- dup of #1044
- 09:07 PM Revision 8e1cc8ab (ceph): init-ceph: manage pid_file from init script
- With upstart the daemon shouldn't manage the pid file itself. Move this
out of the default config and into the legac... - 08:48 PM Revision 81d2cbeb (ceph): config: move /var/run and /var/log defaults to config_opts.h
- This flips the sense of the common_init defaults. Before, the alternate
defaults were filled in if it was a daemon. ... - 08:39 PM Revision dfa043df (ceph): config: {osd,mon}_data default to /var/lib/ceph/$type/$cluster-$id
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:26 PM Revision 2ceda946 (ceph): Merge branch 'stable'
- 06:44 PM Revision 7680cdad (ceph): dencoder, rgw: make ceph-dencoder load much faster
- by avoiding linking with unneeded shared objects.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 05:00 PM Revision 98326968 (ceph): encoding: use iterator to copy_in encoded length
- This gives us a pointer to the position into the list where the final
length value will be copied. Previously we use... - 03:47 PM rgw Feature #2257 (Rejected): rgw: detect fastcgi module 100-continue support automatically
- The current default that is used doesn't work with vanilla fastcgi module. It'd be great if that could be set automat...
- 02:46 PM rbd Feature #2256 (Resolved): rbd: parallelize deletions
- There are a few places where we delete things one at a time: resizing to a smaller size, deleting all snapshots, and ...
- 02:04 PM Feature #2240 (Fix Under Review): osd: new default locations
- wip-defaults
- 12:05 PM Bug #2161: nonlinear scaling for PGMap::pg_stat encode
- wip-encoding
- 09:18 AM Bug #2161: nonlinear scaling for PGMap::pg_stat encode
- Ake van der Meer wrote:
> My ceph-osd processes run at 100% CPU for many minutes at a time doing this: http://pasteb... - 08:25 AM Bug #2161: nonlinear scaling for PGMap::pg_stat encode
- My ceph-osd processes run at 100% CPU for many minutes at a time doing this: http://pastebin.com/wYnPKWeJ
In src/i... - 10:05 AM Feature #2246 (In Progress): force10s on sepia
- Ports being mapped yesterday and today in preparation for switch config review.
- 09:21 AM Bug #2255 (Resolved): osd: fix object name collisions between pools in temp collection
- 08:28 AM Feature #2223: Tracing facility on FileStore
- Made some changes to the ObjectStore.cc, regarding code duplication of the transaction's dump methods. Feedback would...
04/05/2012
- 09:55 PM Revision 689ac5d7 (ceph): v0.44.2
- 09:53 PM Revision e0c4db9e (ceph): FileStore: do not check dbobjectmap without option set
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 09:25 PM Revision 38e24b1e (ceph): config: include /etc/ceph/$cluster.keyring in keyring search path
- mkcephfs and the docs etc still write to /etc/ceph/keyring.
Signed-off-by: Sage Weil <sage@newdream.net> - 09:25 PM Revision 57dff032 (ceph): config: expand metavariables for --show-config, --show-config-value
- Signed-off-by: Sage Weil <sage@newdream.net>
- 09:08 PM Revision 90e88a08 (ceph): Merge branch 'wip-cluster'
- Reviewed-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com>
- 08:35 PM Revision cfee0333 (ceph): config: parse fsid uuid in config, not ceph_mon
- Use the new OPT_UUID type.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 08:35 PM Revision 2c14c8b2 (ceph): config: add distinct UUID type
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 08:32 PM Revision 2c0dc47e (ceph): global: add -C or --cluster early args to specify cluster name
- This will let you specify which cluster to talk to on the command line
(e.g., 'ceph -C foo ...' or when starting a da... - 08:32 PM Revision 930a669a (ceph): config: add cluster name as metavariable; use for config locations
- Add a cluster name (default "ceph") to the config structure, and expand
$cluster in all config values.
Make the defa... - 08:25 PM Revision bda562fb (ceph): config: implement --show-config and --show-config-value <option>
- Dump internal config value(s) to stdout and then exit.
Signed-off-by: Sage Weil <sage@newdream.net> - 08:04 PM Revision f18b219a (ceph): test_workload_gen: fix logging
- Signed-off-by: Sage Weil <sage@newdream.net>
- 08:01 PM Revision 32b5d0f8 (ceph): config: remove obsolete bdev_* options
- These were part of ebofs.
Signed-off-by: Sage Weil <sage@newdream.net> - 07:59 PM Revision 1b769535 (ceph): Merge remote-tracking branch 'gh/wip-log'
- 06:43 PM Revision 0e5d087c (ceph): README: update instructions
- Needed to add submodule instructions.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 03:49 PM Revision 3d7f1db7 (ceph): Kernel: Pull linux-firmware from git
- Signed-off-by: Mark Nelson <nhm@clusterfaq.org>
- 02:21 PM Feature #2248 (Resolved): cluster naming
- 02:20 PM Subtask #2236 (Resolved): filestore failure injection (3)
- wip-filestore-failure
I don't think enumerating/identifying the callers is needed here. For the idempotency teste... - 01:19 PM Feature #2226: osd: better filestore idempotency test
- Thought about the a bit more. The filestore failure injection is easiest to implement with an _exit(1) or something,...
- 01:13 PM Feature #1890 (Resolved): log: async log writeout
- 01:13 PM Feature #1889 (Resolved): log: structure log records
- 12:30 PM Feature #2254 (Resolved): doc: cephx
- pending improved documentation:
* was is, is not protected
* how to convert/upgrade a non-cephx cluster to cephx (e... - 12:22 PM Subtask #2235 (In Progress): generate deterministic sequence of transactions (5)
- 10:51 AM Bug #2178: rbd: corruption of first block
- Ok, my attempts to parse the log to find out of order replies is quickly snowballing. (complexity of dropped replies...
- 08:21 AM Bug #2178: rbd: corruption of first block
- Oliver Francke wrote:
> Uhm...
>
> ... I thought, we were talking about the same issue since the very beginning..... - 01:25 AM Bug #2178: rbd: corruption of first block
- Uhm...
... I thought, we were talking about the same issue since the very beginning... corruption of .rbd-blocks.....
04/04/2012
- 11:12 PM Revision 0df6fbd3 (ceph): rados: fix rados import
- This fixes issue #2253. Wrong param order to fread().
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 11:11 PM Feature #2248 (Fix Under Review): cluster naming
- 11:00 AM Feature #2248: cluster naming
- - new ocmmand line arg (-C, --cluster)
- controls default config files
- becomes another subst ($cluster) to be use... - 10:38 AM Feature #2248 (Resolved): cluster naming
- 08:56 PM Revision ba0fb3ed (ceph): cleanup-and-unlock.sh: helper to nuke and then unlock a set of nodes
- I usually do something like
teuthology-lock --list-targets --owner scheduled_sage@metropolis > /tmp/b
./cleanup-an... - 08:54 PM Revision 3adf2bf9 (ceph): schedule_suite.sh: helper to schedule a suite
- There's a bunch of stuff hardcoded in here, similar to the nightly, but
it's a useful starting point. - 04:09 PM Bug #2253 (Resolved): rados import: uploaded objects are empty
- Fixed, commit:0df6fbd3a66741ad02c7556b0c4026dc3577d797.
- 03:37 PM Bug #2253 (Resolved): rados import: uploaded objects are empty
- 03:33 PM rgw Documentation #1813: doc: document radosgw api diffs with s3
- We'd like to have it for the current sprint, or at least no later than the next sprint. 5/1 as an upperbound target d...
- 12:45 PM Bug #2233: Throttle when there are lots of large conccurent IOs
- Yeah, it's the failing gracefully bit that I'm interested in. :)
- 12:38 PM Bug #2233: Throttle when there are lots of large conccurent IOs
- Just the rados bench tool itself is allocating 16GB to feed into librados.
Now that you mention it, librados might... - 12:29 PM Bug #2233: Throttle when there are lots of large conccurent IOs
- Aha! The plana nodes appear to only have 8GB of ram and 8GB of swap.
Is the allocation of that memory part of libra... - 11:20 AM Linux kernel client Bug #2242: rbd: spinlock on wrong cpu
- OK, I think this problem arises because of the switch to a spinlock to
protect the client list. Doing so was the ri... - 09:53 AM Linux kernel client Bug #2242 (Resolved): rbd: spinlock on wrong cpu
- ...
- 11:19 AM Bug #2178: rbd: corruption of first block
- Oliver Francke wrote:
> Hi Sage,
>
> I was talking about the verbose logfiles from monday. TBH, I don't expect Ba... - 10:32 AM Bug #2178: rbd: corruption of first block
- Hi Sage,
I was talking about the verbose logfiles from monday. TBH, I don't expect BadThings without "rbd_writebac... - 09:49 AM Bug #2178: rbd: corruption of first block
- Oliver Francke wrote:
> Whew, that was fast,
>
> after second run I had some errors in one file with:
> [osd]
>... - 07:01 AM Bug #2178: rbd: corruption of first block
- Whew, that was fast,
after second run I had some errors in one file with:
[osd]
filestore fiemap threshol... - 05:43 AM Bug #2178: rbd: corruption of first block
- Well Sage,
its harder these days to reproduce, cause I think the current version has made "something more stable"(... - 10:57 AM Feature #2252 (Resolved): rgw long run kernels
- 10:54 AM Feature #2251 (Resolved): rgw long run workloads
- 10:53 AM Feature #2250 (Resolved): rgw long run raid config
- 10:47 AM Subtask #2249 (Resolved): teuthology task (3)
- 10:35 AM Feature #2246 (Resolved): force10s on sepia
- 10:32 AM Feature #2245 (Resolved): rgw long run ceph install
- 10:29 AM Messengers Feature #2244 (New): msgr: performance tester
- 09:54 AM Linux kernel client Bug #2243 (Resolved): btrfs: warning in orphan_commit_root
- 2012-04-04T01:02:59.191518-07:00 plana32 kernel: [ 8815.371555] ------------[ cut here ]------------
2012-04-04T01:0... - 09:45 AM Feature #2241 (Rejected): upstart
- 09:45 AM Feature #2240 (Resolved): osd: new default locations
- 09:42 AM Subtask #2239 (New): install + configure package everywhere
- chef!
- 09:42 AM Subtask #2238 (Rejected): vm for coredump archive
- 09:41 AM Subtask #2237 (Resolved): failure+replay tester (8)
- 09:39 AM Subtask #2236 (Resolved): filestore failure injection (3)
- add a hook to operations that we want to potentially fail.
need to identify the caller so that the tester can pote... - 09:38 AM Subtask #2235 (Resolved): generate deterministic sequence of transactions (5)
- 09:22 AM Bug #2234 (Resolved): Sometimes 'ceph -s' is unable to show pg data and crashes
- ceph -s / ceph -w sometimes gives me output as below:...
- 09:15 AM CephFS Feature #1237: mds caps limit mount to some subdir
- Nope — as with all the other MDS stuff, this is currently not a priority.
- 07:10 AM CephFS Feature #1237: mds caps limit mount to some subdir
- Is there any progress on this issue?
- 04:21 AM Revision 0921c062 (ceph): config: drop loud ERROR prefix
- This makes gitbuilder sad.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 12:03 AM Revision b9185bb2 (ceph): osdmap: allow row, room, datacenter, pool in conf for initial crush map
- These work just like host and rack, except that they are optional.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 12:02 AM Revision 4313a2d8 (ceph): crush: don't warn on skipped types
- It's perfectly okay to skip some.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 12:01 AM Revision 56a6aa7a (ceph): osdmap: set 'default' pool type correctly
- Got this wrong in e85961167eb1f37f80f263257799e4e901d17e74
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
04/03/2012
- 11:33 PM Revision dd7b84a5 (ceph): ceph-fuse: fix log reopen when -f is specified
- Don't restart if it wasn't stopped.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 10:56 PM Revision 1836d467 (ceph): Added assertion to check that targets > roles
- Signed-off-by: Mark Nelson <mark.nelson@dreamhost.com>
- 10:56 PM Revision 95294027 (ceph): nuke: don't run umount when no xargs args
- Gets rid of this noise:
INFO:teuthology.nuke:Unmount any osd data directories...
INFO:teuthology.orchestra.run.err:U... - 10:40 PM Revision e8596116 (ceph): osd: define more crush types
- We don't use these by default, but this way they are there should someone
want to use them.
Signed-off-by: Sage Weil... - 10:37 PM Messengers Bug #1674 (Need More Info): daemons crash when sent random data
- FWIW I was unable to reproduce this with the current code, with or without cephx enabled.
- 10:35 PM Revision 2dbdadbe (ceph): test_rewrite_latency: check return value
- Fixes warning
warning: test/test_rewrite_latency.cc:27:36: ignoring return value of ‘ssize_t pwrite(int, const void*... - 10:28 PM Revision 493344fd (ceph): Makefile: add mssing header
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:07 PM Bug #1627 (Can't reproduce): ceph-mon memleak if ceph-osd cluster ip is not reachable, but public...
- 09:21 PM Revision d57d8af7 (ceph): rgw: throttle at num_threads * 2
- If we throttle at num_threads, then nothing gets into the workqueue until
a worker thread is idle, which means you pa... - 08:44 PM Revision 1ef37ab8 (ceph): Merge remote-tracking branch 'gh/msgr-api-changes'
- Reviewed-by: Sage Weil <sage@newdream.net>
- 08:41 PM Revision a31efd9c (ceph): filestore: print Sequencer name in debug output
- And clean it up just a bit.
Signed-off-by: Sage Weil <sage@newdream.net> - 08:22 PM Revision 756621d5 (ceph): msgr: clean up Pipe::do_sendmsg.
- Document it as with the tcp stuff, remove an if(0)'d debugging block,
and remove the useless "sd" parameter since it'... - 08:22 PM Revision 9f10a991 (ceph): msgr: write minimal documentation for the tcp functions.
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 08:22 PM Revision e966c39d (ceph): msgr: make a bunch of stuff private.
- Why were all these data members public? They're accessed by Pipes
and the Accepter and stuff, so maybe that's why...b... - 08:22 PM Revision 096971d4 (ceph): msg: update the Dispatcher and Messenger documentation
- Clarify what mark_down() and mark_down_on_empty() actually do.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.... - 08:21 PM Revision 36ec8e93 (ceph): dispatcher: fix documentation for ms_handle_reset
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 08:21 PM Revision cbe13ab2 (ceph): msgr: rename set_ip() -> set_addr_unknowns()
- The generic interface shouldn't reference specifics like that.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.... - 08:13 PM Revision 607f35e7 (ceph): msgr: Remove _my_name and ms_addr, replace with direct access to my_inst.
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 08:13 PM Revision 77f45667 (ceph): msgr: store the entity_inst_t in the Messenger.
- Convert ms_addr and _my_name to be references to their fields in
the entity_inst_t my_inst.
This way we can use const... - 08:11 PM Revision 6374d064 (ceph): buffer: implement a contents_equal function on bufferlists
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 08:11 PM Revision 5681461b (ceph): msgr: change the signature of get_myaddr()
- Return a const reference to the actual address, instead of copying it.
All current users are happy with this, and I c... - 08:11 PM Revision 45a76eaf (ceph): msgr: get_connection() is required to establish a connection if none ex...
- Making an allowance for lossy server connections is silly. Just don't
ask for the Connection in that case. (There are... - 08:10 PM Revision e80126ea (ceph): test: fix monmaptool help text
- Broken by commit:15f0a3270fdcf09acce554313f2d0c0814a511e4
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 06:32 PM Revision e06436e9 (ceph): cls_rgw: guard decode
- thee were few cases where decode wasn't guarded.
Signed-off-by: Yehuda Sadeh <yehuda.sadeh@dreamhost.com> - 06:30 PM Revision ebb487a6 (ceph): cls_rgw: reset return code in some cases
- Beforehand the return code was ignored, so fixed the cases
where we erroneously return error instead of success.
Sig... - 05:12 PM Revision a8938422 (ceph): librados: fix exec test
- Return for read operations is now returned correctly.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 04:52 PM rgw Bug #1681: rgw: user rm with --purge doesn't remove data
- Maybe we should disallow removal of user that has data? We can suspend it instead.
- 04:06 PM Revision 57f52479 (ceph): doc: disable broken 'doxygenclass' class in librados c++ doc
- This is the last remaining gitbuilder error. Add it back when the C++
docs actually build.
Signed-off-by: Sage Weil... - 03:58 PM Revision 9d4fcd08 (ceph): Merge remote-tracking branch 'gh/stable'
- 03:57 PM Bug #1921 (Resolved): teuthology: silently continues when len(targets) != len(roles)
- 03:44 PM Revision e40cf8ca (ceph): test_workload_gen: fix Sequencer ctor
- Signed-off-by: Sage Weil <sage@newdream.net>
- 02:43 PM Feature #2226: osd: better filestore idempotency test
- 02:32 PM Documentation #2175 (Resolved): doc: fix doc build errors
- got this to yellow (only warnnings), yay!
- 01:39 PM Feature #1890: log: async log writeout
- 01:39 PM Feature #1889: log: structure log records
- 10:45 AM Feature #2134 (Resolved): qa: smoke suite
- 10:31 AM Bug #2178: rbd: corruption of first block
- Hi Oliver,
I have two things to try:
- 'rbd writeback window = 0'. I know it's not what you want to run, but t... - 10:29 AM Bug #2233: Throttle when there are lots of large conccurent IOs
- That is 16GB of RAM being allocated and used — I don't remember what hardware these are running on and have no idea w...
- 09:47 AM Bug #2233 (Won't Fix): Throttle when there are lots of large conccurent IOs
- When sending large amounts of data via a single client (ie 256 concurrent 64MB IOs) we can hit a bad_alloc on the cli...
- 09:15 AM Cleanup #2191 (Resolved): reexamine simple_spinlock
- 08:51 AM Feature #2087 (Resolved): lightweight filestore workload generator
- 05:04 AM Revision b5ca2fe0 (ceph): Merge remote-tracking branch 'gh/wip-name-sequencers'
- 05:03 AM Revision d70191a8 (ceph): Merge remote-tracking branch 'gh/wip-2087'
04/02/2012
- 08:24 PM Revision addc7446 (ceph): rgw: check for subuser existence
- This fixes #1856: looking up subuser that doesn't exist returns
user as long as subuser prefix defined existing user.... - 02:30 PM rgw Bug #1853 (Resolved): rgw: qa test to verify bucket recreation does not override bucket
- Implemented, commit:1551c5b08714b415c49fc759002b7c6a6d4d611a.
- 01:26 PM rgw Bug #1856 (Resolved): It is possible to look up an rgw user by a subuser that does not exist as l...
- Fixed, commit:addc744692f60885a747c4531cd12bf19b3a7f2a.
- 11:15 AM rgw Feature #2171: rgw: asynchronously calculate md5
- Thinking about it some more, it's probably not the best use of time and effort. We initiate the md5 calculation after...
- 08:29 AM Bug #2178: rbd: corruption of first block
- Hi Sage,
here we go again, with ceph-0.44.1-1-g41f84fa
One bad file with following infos:
20120402 171642.12... - 12:04 AM Revision e792cd93 (ceph): filestore: fix ZERO fallback write
- It helps if we write zeros!
Signed-off-by: Sage Weil <sage@newdream.net>
04/01/2012
- 11:24 PM Revision 8434caf5 (ceph): qa: test_rewrite_latency
- Tool to measure latency of overwriting a single block.
Signed-off-by: Sage Weil <sage@newdream.net> - 07:23 PM Bug #2221: Monitor setup bugs
- 2) ...
- 06:35 PM rbd Feature #2232: qemu: resize guest disk when rbd image is resized
- I tested this on Friday, and qemu rereads the size (at least when using virtio) when the guest requests it (i.e. echo...
- 04:21 PM rbd Feature #2232 (New): qemu: resize guest disk when rbd image is resized
- According to Christoph, this is probably just a matter of calling bdrv_truncate() with the new size. If that doesn't...
- 04:19 PM rbd Feature #2231 (Resolved): librbd: expose header change (resize?) via api
- we need a callback or something so that users (qemu) can be informed when the header changes. this will let them, sa...
03/31/2012
- 03:22 PM Feature #1655: gitbuilder aggregator page
- I took some inspiration from the updated aggregator script that is now at http://ceph.newdream.net/gitbuilder.cgi. I'...
- 03:31 AM Revision dbc70b9d (ceph): Merge remote branch 'gh/wip-mon_setup'
- Reviewed-by: Sage Weil <sage@newdream.net>
- 03:18 AM Revision f8a53869 (ceph): osd: fix error code return from class methods
- Don't shadow the result at function scope.
Fixes: #2148
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 12:22 AM Revision 15f0a327 (ceph): monmaptool: make clear you can set the fsid when making a new map.
- Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
- 12:07 AM Revision 208daeb3 (ceph): ceph_mon: fix fsid parsing.
- fsid is a field in the CephContext _conf structure and is parsed by
the standard options parsing library before it ge...
03/30/2012
- 11:15 PM Revision 9a69c3f3 (ceph): ceph.conf: enable 'osd recover clone overlap'
- to test the recovery cloning in qa. this was redone, but forgot to enable
it in qa. - 11:14 PM Revision aa31035e (ceph): osd: update_stats() on reads too
- Update pg stats on any op completion (read or write), not just writes. Do
the calls with log_op_stats() for consiste... - 11:11 PM Revision 28788654 (ceph): log: dump_recent in fatal signal handler
- Signed-off-by: Sage Weil <sage@newdream.net>
- 11:07 PM Revision f27acbc8 (ceph): Merge remote-tracking branch 'gh/wip-log'
- Conflicts:
src/common/config_opts.h - 11:00 PM Revision 374bef9c (ceph): Merge remote branch 'gh/wip-osd-hb'
- 10:37 PM Revision f7f65ebe (ceph): osd: fix typo in debug message
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:57 PM Revision 75e3b9b3 (ceph): Merge remote branch 'gh/wip-osd-recovery-sources'
- 09:23 PM Revision df5860fe (ceph): objectstore: name Sequencers
- Assign a (unique) name to each Sequencer. This will aid in debugging, and
can be useful when dumping traces of FileS... - 09:11 PM Cleanup #2230 (Resolved): deprecate 'btrfs devs'
- 09:00 PM rgw Feature #2229 (New): rgw: functional tests for rgw class
- A series of simple functional tests to verify the rgw class methods behave as they should.
- 08:58 PM Bug #2148 (Resolved): osd: class error return not propagated to client
- commit:f8a53869f6db4c76516ee525f00f87f930920692
- 06:57 PM Revision 29c01f25 (ceph): ceph_common.sh: Remove dead code.
- Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
- 06:27 PM Revision ba6bb4cf (ceph): man: Oops, update ceph-mon(8) for real. Sorry about that.
- Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
- 06:26 PM Revision 541a543c (ceph): man: Update ceph-mon(8) after reStructuredText syntax fixes.
- Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
- 06:16 PM Revision 2c542442 (ceph): doc: Remove duplicate anchor from (unused) overview doc.
- Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
- 06:11 PM Revision 1ec47db1 (ceph): doc: Convert the mailing list mention to not be a section heading.
- If toctree is inside a section, the subtree is inside the section too.
We don't want all of dev/* to be under "Mailin... - 06:11 PM Revision b162696b (ceph): doc: Fix reStructuredText syntax errors.
- Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com>
- 05:52 PM Revision 2d1a96d3 (ceph): add include/stringify.h
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 05:27 PM Bug #2221: Monitor setup bugs
- (1) is a problem due to options parsing collisions...fixed!
(2) is directly contradicted by my testing...?
(3) I ne... - 04:59 PM Revision b25817a5 (ceph): FileJournal: check pwrite return value when zeroing journal
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 04:52 PM Revision 41f84fac (ceph): filestore: set guard on collection_move
- During recovery we submit transactions like:
- delete a/foo
- move tmp/foo to a/foo
This prevents the EEXIST chec... - 04:25 PM Bug #2026 (Can't reproduce): osd: ceph::HeartbeatMap::check_touch_file
- 04:25 PM Bug #2045 (Can't reproduce): osd: dout_lock deadlock
- haven't seen this in a while.
also, this code is about to go away anyway with wip-log. - 04:16 PM Bug #2102 (Can't reproduce): osd: pg stuck in backfill
- 04:15 PM Bug #2102 (Duplicate): osd: pg stuck in backfill
- 04:14 PM Bug #2002: osd: racy push/pull for clones
- i take that back; this wasn't enabled in qa. adding to the teuthology ceph.conf file.
- 04:12 PM Bug #2002 (Resolved): osd: racy push/pull for clones
- haven't seen this in forever; looks fixed.
- 04:11 PM Bug #2209 (Resolved): osd: read kb stats not tracked?
- commit:aa31035e555129e56888320b84f16264f28bd7df
- 03:59 PM Bug #2116 (Resolved): Repeated messages of "heartbeat_check: no heartbeat from"
- fixed by commit:374bef9c97266600b4c6b83100485d7250363213
- 03:59 PM Bug #2165 (Resolved): osd: recovering ending with missing
- fixed with merge of commit:75e3b9b309e5365975e3e5855c065bd4fe28b64c
- 03:58 PM Bug #2178: rbd: corruption of first block
- 02:51 PM Bug #2178: rbd: corruption of first block
- Please build the current git stable branch, which includes 41f84fac1ae4b4c72bf9bfe07614c4066c916fd1. The version sho...
- 07:35 AM Bug #2178: rbd: corruption of first block
- Here the remaining timestamps from the other VM's with bad blocks:
VM-2:
20120330 105139.579830 filling block 171... - 07:12 AM Bug #2178: rbd: corruption of first block
- Hi *,
I needed a couple of runs, but managed now to provide some 81MiB/97MiB osd.X.log-files, where in between sh.... - 03:58 PM Bug #2164 (Resolved): osd: scrub missing _, snapset attrs
- commit:41f84fac1ae4b4c72bf9bfe07614c4066c916fd1
- 03:49 PM Revision f89f98df (ceph): osd: clear RECOVERING on start_peering_interval
- This prevents us from, say, getting into a recovering+stray state.
Signed-off-by: Sage Weil <sage@newdream.net> - 03:45 PM Revision 3cdd8d58 (ceph): osd: more heartbeat debug
- Signed-off-by: Sage Weil <sage@newdream.net>
- 03:45 PM Revision e1a58912 (ceph): osd: discard heartbeat_peer in note_down_osd
- Discard the heartbeat_peer as soon as we find out, along with queued
failures, or else the heartbeat_check may come a... - 03:45 PM Revision 21e6e2b8 (ceph): osd: ignore peer epoch of 0 on ping reply
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 03:45 PM Revision efc27f19 (ceph): osd: don't fail new heartbeat peers
- last_tx may be 0 because we just added this peer; don't mark them down
yet!
Signed-off-by: Sage Weil <sage.weil@drea... - 03:45 PM Revision 33b9187a (ceph): osd: rename hbin -> hbclient, hbout -> hbserver
- This is way less confusing.
Signed-off-by: Sage Weil <sage@newdream.net> - 03:44 PM Revision 4e2f0d14 (ceph): osd: simplify heartbeat logic
- Simplify heartbeats to use a simple request/reply model.
- avoid any weirdness with map update timing
- no from/to... - 03:44 PM Revision fe5f0331 (ceph): osd: send pings from hbin
- Fixes: #2212
Signed-off-by: Sage Weil <sage@newdream.net> - 02:32 PM Revision eebc9ec2 (ceph): test: test_workload_gen: Add callback for collection destruction.
- When we remove a collection, we must cleanup after the coll_entry_t we
once had on the available collections set. For... - 01:53 PM Revision 424b5b07 (ceph): ceph: --concise by default, add --verbose option
- It's time.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 12:50 PM Feature #2227 (Closed): QA: create a test to verify operation with non-default layouts
- I submitted a patch that modified ceph_calc_file_object_mapping()
in the ceph client, and when reviewing it Sage poi... - 09:53 AM Feature #2226 (Resolved): osd: better filestore idempotency test
- ...
- 03:31 AM Revision 409b648b (ceph): config: drop old debug_* items
- ...and replace code references with conf->subsys.should_gather().
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 02:16 AM Revision 5d981b15 (ceph): rgw: add unittest just to verify we link
- This will flush out references to stuff in libglobal.la, among other
things.
Signed-off-by: Sage Weil <sage.weil@dre... - 02:06 AM Revision 69b01726 (ceph): config: fix librados, libcephfs unit tests
- No more g_conf->debug.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 01:08 AM Revision 394d8b1e (ceph): Add test for object source marked down
- 01:08 AM Revision b4aa098f (ceph): make Thrasher not inherit from Greenlet
- 01:02 AM Revision 1c8ec702 (ceph): PG,ReplicatedPG: update missing_loc_sources with missing_loc
- In some cases missing_loc was updated without missing_loc_sources
Signed-off-by: Samuel Just <samuel.just@dreamhost.... - 01:02 AM Revision 05ef3ba6 (ceph): ReplicatedPG: fix loop in check_recovery_sources
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 12:35 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- I think I can be optimistic :)...
03/29/2012
- 10:06 PM Bug #2178: rbd: corruption of first block
- Okay, I suspect this is actually bug #2164, which was causing the _ xattr to get lost when ceph-osd restarts on non-b...
- 09:52 PM Bug #2225 (Resolved): gitbuilder.ceph.com returning 503: Service Temporarily Unavailable.
- I can't find any 503 in the apache logs on this machine. Could it be on the client side?
- 09:48 PM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- Well, I fixed one problem, but I can't see how it could have resulted in the log you posted.
Pushed a few more pat... - 11:36 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- I collected logs from 4 OSDs, they can be downloaded at: http://logger.ceph.widodh.nl/ceph/issues/2212/
At 10:13 t... - 09:21 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- Der.. do you have a log you can attach/post?
- 02:59 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- I reverted the extra debugging for the heartbeat stuff, but that didn't seem to consume all the CPU time.
The load... - 01:40 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- I just installted the code on my cluster and things do not seem to behave yet.
The cluster is still jumping around... - 08:54 PM Linux kernel client Bug #1940 (Resolved): locking cycle in ceph_osdc_start_request
- commit:ab434b60ab07f8c44246b6fb0cddee436687a09a
- 08:15 PM Revision 41a09bea (ceph): Merge remote branch 'upstream/wip_latency'
- 07:53 PM Linux kernel client Bug #1793 (Can't reproduce): NULL pointer dereference at try_write+0x627/0x1060
- Marking this Can't Reproduce. Will reopen if it shows up again.
- 03:21 PM Linux kernel client Bug #1793: NULL pointer dereference at try_write+0x627/0x1060
- Another 100 iterations of kernel_untar_build.sh using the current
master branch (c666601a935b94cc0f3310339411b6940de... - 07:51 AM Linux kernel client Bug #1793: NULL pointer dereference at try_write+0x627/0x1060
- Bugs 1793 and 2081 have a signature of a page fault/bad memory reference
from process_one_work() -> con_work(), and ... - 07:53 PM Linux kernel client Bug #2069 (Can't reproduce): client crash during kernel_untar_build rm -r step
- I just finished at least 150 iterations of kernel_untar.sh and never
hit this using the current master branch of cep... - 07:51 PM Linux kernel client Bug #2081 (Can't reproduce): msgr: spinlock badness?
- Marking this Can't Reproduce. Will reopen if it happens again.
- 07:43 PM Linux kernel client Bug #2081: msgr: spinlock badness?
- Another 100 iterations of kernel_untar_build.sh using the current
master branch (c666601a935b94cc0f3310339411b6940de... - 07:51 AM Linux kernel client Bug #2081 (Need More Info): msgr: spinlock badness?
- Bugs 1793 and 2081 have a signature of a page fault/bad memory reference
from process_one_work() -> con_work(), and ... - 07:50 PM Linux kernel client Bug #2174 (Can't reproduce): rbd: iozone thrashing failure
- OK, I'll go ahead and state that I can't reproduce this...
- 07:46 PM Linux kernel client Bug #2174: rbd: iozone thrashing failure
- Status was Verified. Changing it to Need More Info because I can't even
seem to reproduce it at this point. (I sup... - 07:44 PM Linux kernel client Bug #2174: rbd: iozone thrashing failure
- Another 12 iterations of suites/iozone.sh using the current
master branch (c666601a935b94cc0f3310339411b6940de751ba)... - 07:59 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
- I don't know whether we've adequately captured the signature or symptoms
of this problem. I believe though that it ... - 07:20 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
- I have been trying to reproduce this using the latest testing/master/for-linus
branch (they're the same right now) a... - 02:34 PM Revision c39ed568 (ceph): test: test_workload_gen: Fixing a memleak.
- Apparently, the FileStore does not cleanup after transactions once they
are applied, which may lead to huge memory le... - 09:27 AM Linux kernel client Bug #2224 (Rejected): Oops in __cfh_to_dentry
- I setup an HA pair of NFS servers which re-export Ceph to NFS clients.
The HA pair is in active/standby mode, using... - 07:42 AM Feature #2087: lightweight filestore workload generator
- Memory leak fixed.
Apparently, the FileStore does not cleanup after transactions once they are applied, which may ... - 06:21 AM Feature #2087 (In Progress): lightweight filestore workload generator
- Looks like some memory should be leaking bad, such that valgrind hangs on exit.
==19080==
==19080== HEAP SUMMARY... - 07:24 AM Linux kernel client Bug #2064 (Resolved): ceph-client: messenger: nocrc flag not implemented correctly
- Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved. - 07:12 AM Linux kernel client Bug #2157 (Resolved): ceph: xattr: fix nanosecond display on i_rctime
- Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved. - 07:12 AM Linux kernel client Bug #2156 (Resolved): ceph: xattr: fix a possible buffer overrun bug
- Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved. - 07:11 AM Linux kernel client Bug #2155 (Resolved): ceph: xattr: wrong value assumed for "no preferred PG"
- Linus pulled in the changes without any immediate trouble, so
I'm marking this and a few others resolved. - 05:56 AM Feature #2223 (Resolved): Tracing facility on FileStore
- Allow a user to specify a file onto which log the transactions that come through OSDs' FileStores.
This should all... - 05:47 AM Revision b3069e50 (ceph): ceph_argparse: drop useless declaration from unit test
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 12:16 AM Revision 4269f8d5 (ceph): ReplicatedPG: ctx might not contain an OpRequest
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 12:16 AM Revision 135a11ba (ceph): FileJournal: optionally zero journal on create
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 12:15 AM Revision 2486c61a (ceph): FileStore: Pass OpRequestRef into filestore in queue_transaction
- This allow us to track op progress through the filestore.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> - 12:15 AM Revision d026cdc7 (ceph): FileJournal: use DSYNC for directio path
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 12:15 AM Revision 533bbf7b (ceph): osd/: OpRequest implements TrackedOp for passing into filestore
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
03/28/2012
- 11:12 PM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- Ah, I see the bug now. Pushed a fix to wip-osd-hb, thanks!
Let us know if this behaves for you.. if so I'll pull ... - 04:23 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- It's quite large (222MB), so I uploaded the file, available at: http://logger.ceph.widodh.nl/ceph/osd.1.log_27-03-201...
- 10:51 PM Bug #2165: osd: recovering ending with missing
- see wip-osd-recovery-sources
- 10:46 PM CephFS Bug #1811: 2 pjd chown tests failed on cfuse
- ...
- 04:02 PM Revision 4f0d170a (ceph): test: test_workload_gen: Change CLI option and add '--help' usage.
- With this commit, we support the following options (and old ones are no
longer available):
--test-num-colls VAL ... - 03:34 PM Revision 18d219e5 (ceph): rgw: replace dout with ldout
- librgw can't use g_ceph_context
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net> - 03:21 PM Feature #2222: osd: distinguish between 'degraded' and 'misplaced'
- We should pick a designator that doesn't make it sound like the objects are lost.
- 02:27 PM Feature #2222 (Resolved): osd: distinguish between 'degraded' and 'misplaced'
- normal data migration happens with a acting set > the up set, so that we never drop below N replicas, but we still ca...
- 02:45 PM Feature #2087: lightweight filestore workload generator
- 02:07 PM Bug #2221 (Resolved): Monitor setup bugs
- Carl reported several configuration issues when creating new monitors (based on the instructions at http://ceph.newdr...
- 01:59 PM Revision a3bdf055 (ceph): test: test_workload_gen: Default arguments, and minor changes.
- Besides adding support for default arguments, passed onto global_init(),
this commit fixes a conflict in Makefile.am,... - 01:32 PM Revision 37cdbcd4 (ceph): log: fix up unittest
- Fewer entries; compile.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 08:35 AM rgw Bug #2220 (Resolved): rgw: librgw dep on g_ceph_context
- Fixed, commit:18d219e512a8e0f427a2229a71e15869cac3b593.
- 07:16 AM rgw Bug #2220 (Resolved): rgw: librgw dep on g_ceph_context
- from last night's qa,...
- 04:37 AM Bug #2219: OSD's commit suicide with 0.44
- I accidentally removed the core file(s) :(
Hope this one pops up again so I have a core file. - 04:11 AM Linux kernel client Tasks #2138: rbd: run xfstests on a local XFS filesystem over RBD
- After setting up two rbd devices and making some fairly simple changes
to xfstests, then setting up appropriate envi... - 04:04 AM Linux kernel client Bug #2155: ceph: xattr: wrong value assumed for "no preferred PG"
- This got rebased: 3489b42a72a41d477665ab37f196ae9257180abb
This has been sent as part of a pull request to Linus ... - 04:04 AM Linux kernel client Bug #2156: ceph: xattr: fix a possible buffer overrun bug
- This got rebased: 3489b42a72a41d477665ab37f196ae9257180abb
This has been sent as part of a pull request to Linus ... - 04:03 AM Linux kernel client Bug #2157: ceph: xattr: fix nanosecond display on i_rctime
- This got rebased: 3489b42a72a41d477665ab37f196ae9257180abb
This has been sent as part of a pull request to Linus ... - 04:01 AM Linux kernel client Bug #2064: ceph-client: messenger: nocrc flag not implemented correctly
- It got rebased once more, and this should be the last:
37675b0f42a8f7699c3602350d1c3b2a1698a3d3
This has been s... - 03:52 AM Bug #2178: rbd: corruption of first block
- Hi,
I decided to upgrade to "latest-n-greatest" in the test-cluster, to make sure, that if I hit the error again w... - 02:58 AM Revision 94e3abf8 (ceph): Merge branch 'stable'
- 12:22 AM Revision 8948ad01 (ceph): test: test_workload_gen: CodeStyle compliance and cleanup.
- This commit aims at the compliance with Ceph's CodeStyle, as well
as cleaning up some lingering unused code.
Also, n... - 12:22 AM Revision d172b40c (ceph): test: test_workload_gen: Destroy collections.
- 12:22 AM Revision 3770096a (ceph): test: test_workload_gen: Mimic an OSD's workload.
- In it's current state, the workload generator will queue a lot of
transactions onto the FileStore, and will wait if n... - 12:18 AM Revision 749826c2 (ceph): allow use of a separate journal block device
03/27/2012
- 11:44 PM Revision ffc468f2 (ceph): osdmap: less noisy about osd additions during buildmap
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 11:44 PM Revision 36c2f27d (ceph): osdmaptool: fix clitest conf filename
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 11:37 PM Revision ca1f79b5 (ceph): dout: no newlines on dout_emergency
- Preserve old behavior to avoid breaking all the cli tests.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 11:27 PM Revision d5360968 (ceph): throttle: fix off by one issue
- We were blocking only if we exceeded max count, not if
we reached it.
Signed-off-by: Yehuda Sadeh <yehuda@hq.newdrea... - 11:23 PM Revision a52d048a (ceph): rgw: throttle incoming requests
- Don't accept more than the number of threads, otherwise if cluster is
backed up for any reason we'd end up exhausting... - 11:16 PM Revision 30cadf01 (ceph): prebufferedstreambuf: fix typedef
- 'typename' not allowed here:
./common/PrebufferedStreambuf.h:27: error: using 'typename' outside of template
Signed... - 10:35 PM Revision 93ba4c00 (ceph): Merge branch 'wip-intent-fixes'
- 10:35 PM Revision ca4fab47 (ceph): Merge branch 'master' of ssh://github.com/ceph/ceph
- 10:35 PM Revision 16b60b3e (ceph): rgw: minor style fixes
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 10:12 PM Revision 4d74a7b2 (ceph): osd: fix handling of recovery sources when osds go down
- If a source osd goes down, we need to
- reset any pulls (already did that before)
- remove peer from missing_loc s... - 10:03 PM Revision 8fdde24c (ceph): osd: remove down osds from peer_*_requested maps
- This will leave less crap around to confuse recovery if a source osd goes
down and then up.
Signed-off-by: Sage Weil... - 10:02 PM Revision 1ee60873 (ceph): osd: maintain missing_loc_sources
- This is a superset of all missing_loc values... everywhere we might
pull an object from, or are currently pulling fro... - 09:37 PM Revision 5dbb9715 (ceph): rgw: all intent log operations are now async
- That includes removing a directory index object, and the removal of
the actual intent log object.
Signed-off-by: Yeh... - 09:20 PM Revision 0b1e3ed4 (ceph): osd: increase default heartbeat_interval to 6 seconds
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 09:12 PM Revision 69844496 (ceph): rgw: remove pool_list(), can't list_objects() on system buckets
- pool_list() was broken, replaced now with pool_iterate(). list_objects()
shouldn't be used any more with system bucke... - 09:04 PM Revision 2e9079cf (ceph): rgw: intent log processing uses new pool_iterate()
- intead of pool_list(), which is broken (assuming pgls results are
sorted, which are not).
Signed-off-by: Yehuda Sade... - 08:57 PM Revision 1814aac1 (ceph): Merge branch 'misc-fixes-for-review'
- 08:57 PM Revision d5c4015d (ceph): uclient: We want to release cache when we lose the CACHE cap, not gain it!
- Looks like this was detected as a problem back in
84644dc56183b67050793a1b8da07850508b29d6 but the fix wasn't complet... - 08:57 PM Revision c3b04644 (ceph): paxos: share_state sends every unknown value, including the stashed one
- Sage points out that the stashed object might not be the same as the
one we actually archive. For instance, OSDMonito... - 08:57 PM Revision 2acf4aea (ceph): mon: Paxos needs to store the latest version permanently on-disk.
- Previously it was only storing this m->latest_value in the stash,
which of course got overwritten. And then when some... - 08:57 PM Revision d0ba27ae (ceph): doc: add a short thing on kernel client troubleshooting.
- I just noticed this sitting uncommitted in my tree.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> - 08:02 PM Revision c89b7f22 (ceph): v0.44.1
- 06:35 PM Revision 6044c5b8 (ceph): hadoop: define subsystem, fix logging
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:31 PM CephFS Bug #2218: CephFS "mismatch between child accounted_rstats and my rstats!"
- The MDS log is at https://matthew.royhousehold.net/mds.a.log.1.gz (1505MB, md5 197ef232d50d27e2b7c2f62370c9c6b6)
- 02:45 PM CephFS Bug #2218 (Need More Info): CephFS "mismatch between child accounted_rstats and my rstats!"
- There's not enough info in the attached log to figure out what happened. I can tell you that your home directory beli...
- 06:20 PM Revision ce61a83f (ceph): log: throttle message submission, trim recent
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:20 PM Revision fe56818e (ceph): config: configure log thresholds
- - max new entries before we wait for flush
- max recent entries to keep around
Signed-off-by: Sage Weil <sage@newdre... - 06:05 PM Revision 339956df (ceph): log: don't spam -1 to syslog; add err_to_syslog for consistency
- This matches the stderr settings.
Signed-off-by: Sage Weil <sage@newdream.net> - 05:44 PM Revision 17a95c22 (ceph): log: use PrebufferedStreambuf
- It's faster than ostringstream!
Signed-off-by: Sage Weil <sage@newdream.net> - 05:44 PM Revision bfa2bcd7 (ceph): prebufferedstreambuf: fix get_str()
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:43 PM Revision 0e3c0c44 (ceph): bench_log: flush
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:42 PM Revision 3a87e452 (ceph): log/EntryQueue: no implicit trim
- dequeue() things explicitly if you want to remove them.
Signed-off-by: Sage Weil <sage@newdream.net> - 05:41 PM Revision f66e0750 (ceph): utime_t: sprintf() method
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:41 PM Revision 6ab85264 (ceph): do_autogen: control optimization level
- -O 2 -> -O2
Signed-off-by: Sage Weil <sage@newdream.net> - 05:41 PM Revision a4509273 (ceph): common: add PrebufferedStreambuf
- Simple streambuf that uses a preallocated buffer, and then spills over
into a std::string if necessary.
Signed-off-b... - 05:41 PM Revision 23f0af3c (ceph): test log performance with PreallocatedStreambuf
- - faster than ostringstream in optimistic case
- same as ostreamstream + std::string assignment in worst case (use
... - 05:41 PM Revision 8c5046fa (ceph): bench_log: simple util to time how long it takes to log stuff
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:41 PM Revision 362ca19b (ceph): log: move create_entry() into Log interface
- This will let us be smarter than putting it on the heap.
Signed-off-by: Sage Weil <sage@newdream.net> - 05:41 PM Revision c7242bfe (ceph): log: flush on_exit
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:41 PM Revision abfadb9b (ceph): assert: dump recent log entries on failed assertions
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:41 PM Revision f41887e3 (ceph): log: new logging infrastructure
- - explicitly defined subsystems, and ceph_subsys_FOO enums to go with them
- modular log system with Entry object
- s... - 04:26 PM rgw Bug #2197 (Resolved): rgw: need to throttle incoming requests
- Fixed, commit:a52d048ac429c3d2b6a9286d96253308f6588762.
- 04:10 PM Bug #2178: rbd: corruption of first block
- The next step is to reproduce the corruption on the test cluster with logs:
debug osd = 20
debug ms = 1
debug... - 08:37 AM Bug #2178: rbd: corruption of first block
- Well,
one more comment:
my guess would be, it has todo something with expansion of the "sparse-file" while writin... - 05:24 AM Bug #2178: rbd: corruption of first block
- Good morning ;)
meanwhile I have not been lazy. I've managed - with current setup in test-cluster - to produce "in... - 04:07 PM Bug #2164: osd: scrub missing _, snapset attrs
- wip-2164
it's a problem with the collection_move guard (or lack thereof) - 03:40 PM rgw Bug #2208 (Resolved): rgw: radosgw-admin temp remove failure
- Fixed, merged at commit:93ba4c004a9269148a75b67da2522855cb1842a3.
- 02:19 PM Bug #2219 (Need More Info): OSD's commit suicide with 0.44
- Can you look at the core file and 'thread apply all bt'?
- 05:57 AM Bug #2219: OSD's commit suicide with 0.44
- ...
- 05:03 AM Bug #2219 (Can't reproduce): OSD's commit suicide with 0.44
- I noticed this myself today, but on IRC somebody else came along:...
- 02:03 PM Bug #2199 (Resolved): mon: get_bl osdmap_full/9583 No such file or directory
- Merged to master in commit:1814aac17593dee0fa4c774d5b462f277f6698da, reviewed by Sage — even though I forgot to add t...
- 12:25 PM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- Can you attach the full osd.1 log?
- 12:36 AM Bug #2211: osd: entity_inst_t OSDMap::get_inst(int) const
- Over night I saw 16 OSD's go down with the same backtrace.
All OSD's were running with debug ms/osd set to 1, this... - 09:07 AM Linux kernel client Bug #2174: rbd: iozone thrashing failure
- I've been off on other things, but this problem apparently recurred
even if the latest check-in (Josh's change) in p... - 08:38 AM CephFS Bug #2217: sync and O_DIRECT writes only write first extent in iov vector
- The code should not be written that way.
However I think it doesn't matter at this point, because the only caller
...
03/26/2012
- 11:48 PM Revision 974a2013 (ceph): objecter: don't call op_throttle_ops.take(1) unconditionally
- Signed-off-by: Yehuda Sadeh <yehuda@hq.newdream.net>
- 11:17 PM Revision 679cd1fe (ceph): objecter: add in-flight ops throttling
- In addition to ops length, we also want to throttle it by
actual number of ops.
Signed-off-by: Yehuda Sadeh <yehuda@... - 10:02 PM Revision d6b0cbd4 (ceph): config: use our assert
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:40 PM Revision c3dc6a6e (ceph): msg: assert pipe->msgr == msgr
- Fixes: #2216
Signed-off-by: Sage Weil <sage@newdream.net> - 06:57 PM Revision e30b7710 (ceph): rbd: fix typo in default config
- pyflakes would have caught this if 'all' weren't a built-in function
- 06:43 PM Revision 483fcf80 (ceph): doc: include crush in toctree
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 06:30 PM Revision 3bd1f18e (ceph): doc: few notes on manipulating the crush map
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 06:24 PM CephFS Bug #2218 (Resolved): CephFS "mismatch between child accounted_rstats and my rstats!"
- The mismatch is detected at 2012-03-26 18:39:54.306661...
- 05:15 PM Revision 6db77158 (ceph): doc/dev/peering.rst: fix typo
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
- 04:39 PM Revision 1a0360cb (ceph): osd/: OpRequest is no longer a RefCountedObject, remove puts/gets
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 04:38 PM Revision ea377a08 (ceph): osd/: Convert OpRequest* to OpRequestRef
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 04:38 PM Revision 2cb6c7d0 (ceph): OSD: Add typedef for shared_ptr<OpRequest>
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 04:36 PM Revision 3ed784c9 (ceph): osd/: add mark_event to OpRequest and move tracking into OpTracker
- Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
- 03:51 PM Bug #2192: ceph-mon hangs consuming 100% CPU
- It was reproduced all the time, for 0.44 also. After I adjusted cluster to have only one monitor problem has gone. (U...
- 02:44 PM CephFS Bug #2217 (Resolved): sync and O_DIRECT writes only write first extent in iov vector
- static ssize_t ceph_aio_write(struct kiocb *iocb, const struct iovec *iov,
unsigned long nr_segs, loff_t po... - 01:34 PM Bug #2199 (Fix Under Review): mon: get_bl osdmap_full/9583 No such file or directory
- Re-pushed misc-fixes-for-review.
- 09:59 AM Bug #2199 (In Progress): mon: get_bl osdmap_full/9583 No such file or directory
- Sage pointed out the stash data structure isn't necessarily the same as the other stored data structures, so this nee...
- 12:47 PM Messengers Cleanup #2216 (Resolved): SimpleMessenger should make sure it owns passed-in Connections
- 10:50 AM Messengers Cleanup #2216 (Resolved): SimpleMessenger should make sure it owns passed-in Connections
- Otherwise we get weird issues like #2212.
- 12:38 PM Cleanup #2191: reexamine simple_spinlock
- my log branch drops this for the dout logging. the last user is the buffer.h debugging (enabled manually via a macro...
- 12:06 PM RADOS Bug #2047: crush: with a rack->host->device hierarchy, several down devices are likely to cause b...
- fwiw dropping the local search behavior fixes this bad behavior. the question is what probably was the local search ...
- 11:27 AM RADOS Bug #2047: crush: with a rack->host->device hierarchy, several down devices are likely to cause b...
- 11:27 AM Bug #2210 (Duplicate): osd: some PGs remains remapped or degraded
- this is actually a crush problem, see #2047.
- 09:45 AM Bug #2210: osd: some PGs remains remapped or degraded
- #2173 has some osd logs and related info for the same problem on a less clean cluster. Thanks for the detailed steps ...
- 10:36 AM CephFS Fix #2215 (Resolved): ceph-fuse does not invalidate page cache
- Right now the userspace client doesn't invalidate the page cache when it loses the cache capability on an inode. Appa...
- 09:58 AM Bug #2212 (Resolved): osd: FAILED assert(msgr->lock.is_locked())
- ah, i was using wrong msgr, fixing!
- 05:50 AM Bug #2212 (Resolved): osd: FAILED assert(msgr->lock.is_locked())
- With the new heartbeat code I noticed a couple of OSD's go down with:...
- 09:58 AM RADOS Bug #2214 (Resolved): crush: pgs only mapped to 2 devices with replication level 3
- This is from #2173. Note that all 3 osds are up....
- 09:43 AM Bug #2173 (Resolved): MDS crash when start with end of buffer
- 06:04 AM Feature #2213 (Resolved): rbd: shouldn't need config file to get help
- I just ran "rbd --help" on a pretty much un-configured machine and got:
global_init: unable to open config file.
... - 05:22 AM Bug #2211 (Resolved): osd: entity_inst_t OSDMap::get_inst(int) const
- While trying out the new heartbeat code I encountered this crash:...
- 03:28 AM Revision e478a758 (ceph): vstart: enable omap for xattrs
- Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
03/25/2012
- 08:39 PM Bug #2173: MDS crash when start with end of buffer
- Shall we colse this bug, as the mds server was recovered by providing an empty session map and we can not reproduced ...
- 08:39 PM Bug #2210 (Duplicate): osd: some PGs remains remapped or degraded
- Some PGs remains 'remapped' or 'degraded' status after adding an osd server.
The steps to re-produce the bugs:
1.... - 03:05 PM Revision f4b2097a (ceph): Merge remote branch 'gh/wip-doc-peering'
- 02:57 PM Revision d3bcac24 (ceph): Makefile: fix modules that cannot find pk11pub.h when compiling with NS...
- Signed-off-by: Jim Schutt <jaschut@sandia.gov>
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> - 02:57 PM Revision 3ab28950 (ceph): don't override CFLAGS
- leveldb adds -I flags to CFLAGS and CXXFLAGS, but if these macros are
overridden in the make command line, the flags ... - 09:54 AM Feature #2087: lightweight filestore workload generator
- Pushed a new commit to [1], making the code compliant with the CodeStyle and with Sage's suggestions on github.
[1... - 04:47 AM Revision ef17c8c9 (ceph): add smoke suite
- This could probably be collapsed into a bunch of singleton tasks to make
it simpler to track how many actual jobs res... - 04:20 AM Revision b5641ef3 (ceph): rgw: don't #include fcgi from rgw_common.h
- ceph-dencoder #includes rgw_common.h, and needs to build even when
--without-radosgw is specified and libfcgi isn't i... - 04:09 AM Revision 1c1192a9 (ceph): backfill: use 'rbd' pool instead of 'data'
- (data has a replay interval, which makes writes take longer to resume
after repeering) - 04:09 AM Revision 397e7f2f (ceph): add osd_recovery task to test divergent osd logs
Also available in: Atom