Project

General

Profile

Activity

From 01/24/2011 to 02/22/2011

02/22/2011

11:09 PM Revision 74346592 (ceph): ReplicatedPG: snap_trimmer should bail out while finalizing_scrub
Check to make sure !finalizing_scrub when relocking.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
11:09 PM Revision 4ddc5db6 (ceph): OSD,PG: fix race between processing scrub and dequeueing scrub
Previously, a second scrub could be scheduled between when the first is
dequeued and processed resulting in two scrub...
Samuel Just
08:45 PM Revision 89b743c4 (ceph): osd: fix recovery pointer when pulling head before snapid
If recovery wants to pull a snapped object and needs the head first, pull()
does that, but the caller doesn't ++skipp...
Sage Weil
08:20 PM Revision 48f69894 (ceph): osd: verify object version during push
Fail to push if the ondisk version doesn't match the version we want to
send.
This isn't supposed to happen. If it d...
Sage Weil
05:40 PM Revision e4b82742 (ceph): osd: improve up_thru request behavior
There is some epoch the OSD wants for up_thru, based on when the PG mapping
last changed. However, once the monitor ...
Sage Weil
05:27 PM Revision 4df36436 (ceph): pybind: update rados python bindings for new API
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:11 PM Revision 158889b2 (ceph): PGMap: make osd_full and nearfull ratios configurable.
These were previously set by #defines. Pretty stupid
when we have a nice config system already!
Signed-off-by: Greg ...
Greg Farnum
05:10 PM Revision 0e4e5a5b (ceph): osd: set correct epoch for up_thru osd->mon request
Put the epoch we need for up_thru in the request. Putting the most recent
epoch causes incorrect osdmap churn.
Fixe...
Sage Weil
04:09 PM Revision 2de8b7b8 (ceph): librados: add cluster handle to C API
Had to add a layer of indirection to the list context handles.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:09 PM Revision 1dc12e3e (ceph): move g_default_file_layout into ceph_fs.cc
It's defined in ceph_fs.h.
Fix a bunch of headers that use types without including the headers that
define those typ...
Colin Patrick McCabe
04:09 PM Revision 950e8c3e (ceph): librados: more API cleanup; rados_conf_ stubs
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:14 AM Bug #817: Implement custom propose logic for OSDMap/PGMap
This is less important since Sage figured out #824. Greg Farnum
09:07 AM Bug #824 (Resolved): Map churn with lots of PGs
commit:0e4e5a5b26c1027cfb99edce869fecc4a5a2d487 Sage Weil
07:46 AM Bug #824 (Resolved): Map churn with lots of PGs
Like the title says. This may be related to #817 but Sage seems to think there's also an issue with the OSD code prom... Greg Farnum
08:53 AM Bug #810 (Duplicate): 1). PG bits don't get recognized and 2). Takes too long for OSDs to boot up.
Setting pg bits is working properly, but you're using the wrong config name. :) Underscores should only be used on th... Greg Farnum
08:51 AM Feature #823 (Resolved): osd: make FULL and NEAR_FULL configurable
Made these configurable via mon_osd_full_ratio and mon_osd_nearfull_ratio in commit:158889b2e4089db2361b7f61d6e2e2c13... Greg Farnum
08:41 AM Bug #816: fs size underflowed
First step here is to figure out how to reproduce, and/or find/generate full osd logs of it happening. Sage Weil
08:34 AM Bug #752 (Resolved): High I/O wait when writing data
That's good news. I'm not really sure what was going wrong here before either. Let's see if this comes back. Sage Weil
06:09 AM Bug #752: High I/O wait when writing data
I've spent a lot of time testing and finding out where this could come from, but it seems to be done now for no good ... Wido den Hollander

02/21/2011

05:00 AM Revision 0a96e6a7 (ceph): Makefile: include ceph_argsparse.h in dist tarball
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:58 AM Revision 4ffe0698 (ceph): filestore: fix clone_range
This was broken by the safe_write() switchover; the success return value
is now 0, not the number of bytes written.
...
Sage Weil
04:36 AM Feature #823 (Resolved): osd: make FULL and NEAR_FULL configurable
FULL and NEAR_FULL are set in osd/osd_types.h:... Wido den Hollander

02/20/2011

09:54 PM Revision 349cb128 (ceph): keyring_init: don't print error when explicit key/keyfile is specified
e.g. when I am non-root and specify a key explicitly, no need to complain
about not being able to read root's /etc/ce...
Sage Weil
09:52 PM Revision 859aaf78 (ceph): Revert "keyring_init: g_conf.keyring is not a list"
This reverts commit 2fb6036aa53f5eb3173b80fd17b7240bd3daf156. Sage Weil
06:17 PM Revision 2a59b213 (ceph): common: Split argument parsing into ceph_argparse
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe

02/19/2011

12:01 AM Revision b96f1642 (ceph): Revert "Makefile.am: remove unused libs from linking with librbd tests ...
Same problem as 38f38a99149e88f18072fcbdbee316ac21f6f30f.
This reverts commit e5db46cea0997f3f959b2ae896c980585f079ac0.
Josh Durgin

02/18/2011

11:42 PM Revision a7929c5e (ceph): Merge branch 'pool_memory'
Greg Farnum
11:41 PM Revision f49909ba (ceph): PG: convert hash_maps to maps, remove unused.
waiting_for_[missing|degraded]_object don't need to be
hash_maps, and we don't use stat_object_temp_rd at all.
Swap t...
Greg Farnum
11:41 PM Revision 935eaf3a (ceph): PG: remove the object locking stubs and some dead code.
These are unused (#if 0'd, so no way to use them!) and require
a memory-hogging hash_map. Goodbye!
Signed-off-by: Gr...
Greg Farnum
11:41 PM Revision eea1f098 (ceph): OSD: convert waiting_for_pg from hash_map to map.
This doesn't need to be a hash_map; there will only be an entry
for each PG that gets a message request while it's no...
Greg Farnum
11:41 PM Revision 654dc79c (ceph): vstart: remove directories, too.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
11:41 PM Revision a2f0cd0d (ceph): test: Add new memory tests, move to own subdir.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
09:11 PM Revision 38f38a99 (ceph): Revert "Makefile.am: remove unused libs from linking with librbd"
librados doesn't export ceph::buffer_total_alloc
This reverts commit 9bbd6c32a59ce0a2e4cc21a498e0b04bcd4781ed.
Josh Durgin
08:22 PM Feature #400: mkcephfs: work on multiple nodes without using ssh
After talking with TV about this, I think this should look more like:
- run node-specific steps on every host. ge...
Sage Weil
07:17 PM Revision eadc3c87 (ceph): testlibrbdpp: fix off by one error in read test
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
07:12 PM Revision e5db46ce (ceph): Makefile.am: remove unused libs from linking with librbd tests and rbd
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:50 PM Revision 9bbd6c32 (ceph): Makefile.am: remove unused libs from linking with librbd
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:02 PM Revision 2fb6036a (ceph): keyring_init: g_conf.keyring is not a list
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:40 PM Revision f51040cc (ceph): Clock: remove unused mutex
We don't use a mutex in g_clock any more, so let's not construct one any
more.
Signed-off-by: Colin McCabe <colin.mc...
Colin Patrick McCabe
04:26 PM Feature #801: librados: allow access to multiple clusters
> the read and write functions have some strange parameters ordering:
> offset, then buffer, then length
It may ...
Colin McCabe
04:25 PM Feature #801: librados: allow access to multiple clusters
> BTW if we do end up changing this, we should be consistent with naming...
> some functions are rados_verb_noun an...
Colin McCabe
03:40 PM CephFS Cleanup #814: hadoop: refactor hadoop shim in terms of java libceph bindings
Sage,
I see the general benefits of a Java-based libceph interface, but what are your long-term plans for Hadoop o...
Noah Watkins
03:36 PM Revision 5db1e50d (ceph): debug.h: cleanup includes
Shouldn't need to include DoutStreambuf.h; that's all implementation.
Don't include Mutex.h, since we don't use it.
*...
Colin Patrick McCabe
03:25 PM Bug #698: cosd memory usage with large number of pools
I merged code which removes some hash_maps in commit:a7929c5e265d1b6502733ee9525fd93bbcfc739e; this takes per-PG memo... Greg Farnum
03:18 PM Feature #820: bufferlist memory pool
To elaborate: it appears when I compare the heap memory use reported by massif* and the resident memory usage reporte... Greg Farnum
03:05 PM Feature #820 (Closed): bufferlist memory pool
Investigate putting buffers in a separate memory pool. Sage Weil
03:16 PM Bug #822 (Resolved): fix i386 build of radostestpp
Hitting a gcc error on gitbuilder-i386 (ubuntu 10.10, gcc 4.4.5):
testradospp.cc: In function ‘int main(int, con...
Sage Weil
03:08 PM Bug #821 (Resolved): osd: fix osdmap caching
two issues (at least):
- we end up regenerating the osdmaps from disk data frequently
- we throw out cached copies ...
Sage Weil
02:40 PM rbd Bug #391: snap create/delete caused corruption
oh.. missed the external rbd tool call. That might have caused the problem, as doing it while there's a running vm is... Yehuda Sadeh
02:04 PM rbd Bug #391: snap create/delete caused corruption
> I haven't been able to reproduce this with the latest ceph and qemu-rbd. I'd like to upgrade the kvmtest cluster an... Andrew F
01:22 PM rbd Bug #391: snap create/delete caused corruption
I haven't been able to reproduce this with the latest ceph and qemu-rbd. I'd like to upgrade the kvmtest cluster and ... Josh Durgin
02:35 PM Revision d362874e (ceph): common: Move hex dump functions into hex.h
Move hex dump functions into hex.h. Remove unecessary includes from
debug.cc
Signed-off-by: Colin McCabe <colin.mcca...
Colin Patrick McCabe
02:28 PM Revision 61b661b2 (ceph): Makefile: version.cc should depend on ceph_ver
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:25 PM Revision ef0e07d9 (ceph): debug.h: move Ceph version stuff into version.h
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:07 PM rbd Bug #809 (Resolved): listing snapshots when none exist causes qemu to crash
Sage Weil
01:47 PM Messengers Bug #798: should have a empty() test on list<Messegger *>
This was before 940d58d94ef6dec16affea88c5c79b1f5eba16db? Can you try to reproduce with the latest?
Thanks!
Sage Weil
12:54 PM Linux kernel client Feature #819 (Closed): robust nfs re-export
Sage Weil
12:54 PM CephFS Feature #818 (Resolved): mds: robust lookuphash
Make the lookuphash be thorough about locating the directory inode. Sage Weil
11:06 AM Bug #817 (Won't Fix): Implement custom propose logic for OSDMap/PGMap
It's way too easy to generate map spam when peering. If I run "test/memuse/test_pool_memuse.sh 99 2" then in one exam... Greg Farnum
07:06 AM Revision 4231cef6 (ceph): Merge remote branch 'origin/max_commit_size'
Sage Weil
01:32 AM Revision 7171cf68 (ceph): librbd: hold image context lock minimally
Holding the image context lock during snapshot removal prevented the
client from responding to a notify, causing a de...
Josh Durgin

02/17/2011

07:53 PM Revision da416004 (ceph): mkcephfs: fix premature tmp directory deletion
Previously, the temp directory would be deleted after the first daemon
on a host was started leaving the second one t...
Samuel Just
07:24 PM Revision 04066325 (ceph): .gitignore: ignore testsnaps
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:02 PM Revision dfafa9f3 (ceph): .gitignore: ignore debian packaging outputs
Josh Durgin
06:53 PM Revision b56cafe4 (ceph): pybind/rados: write_full: remove silly extra param
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:48 PM Revision 3e4aaf95 (ceph): pybind/rados: implement Pool.write_full
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:24 PM Revision b93be8c6 (ceph): pybind/rados: implement Pool::change_auid
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:05 PM Revision 23df5ba3 (ceph): pybind/rados: add rados.version
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:58 PM Revision 8be3ad12 (ceph): pybind/rados: Add Rados.pool_exists
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:46 PM Revision 09e01fa4 (ceph): pybind/rados: Snap.name should be a py string
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:38 PM Revision aeac6a63 (ceph): pybind/rados: add snapshots
Add snapshot lookup, iteration, creation, destruction interface.
Add test.
Signed-off-by: Colin McCabe <colin.mccab...
Colin Patrick McCabe
04:46 PM Bug #816 (Can't reproduce): fs size underflowed
ceph version 0.25~rc (commit:0c97d79056ba982f571ef8e720c9d488e3982f81)... Josh Durgin
02:46 PM Revision ae702b3e (ceph): Add Pool::list_objects
Add a Pool::list_objects method. Add a test for this to pybind-test.py
Signed-off-by: Colin McCabe <colin.mccabe@dre...
Colin Patrick McCabe
02:21 PM Subtask #815: Remove globals & partition g_conf
Good point. Some things probably should be global. However, we have a lot of things that library clients *don't* want... Colin McCabe
09:11 AM Subtask #815: Remove globals & partition g_conf
So one issue that occurred to me with setting this up is that we have a number of things that really *ought* to be gl... Greg Farnum
09:08 AM Subtask #815 (Resolved): Remove globals & partition g_conf
This bug is to track the progress of removing globals and setting up g_conf so it can be used as a parameter rather t... Greg Farnum
12:41 PM Revision 0a3798ef (ceph): Add pybind-test
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:40 PM Revision f06604cd (ceph): rados-python bindings: Fix pool deletion a bit
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:10 AM CephFS Cleanup #814: hadoop: refactor hadoop shim in terms of java libceph bindings
I mean that, at the end of the day, we should probably have:
- a java Ceph interface binding that is identical to t...
Sage Weil
09:00 AM CephFS Cleanup #814: hadoop: refactor hadoop shim in terms of java libceph bindings
I'm not sure what you're after here -- you mean you want Java bindings for librados, and then the Hadoop patches shou... Greg Farnum
07:14 AM CephFS Cleanup #814 (Resolved): hadoop: refactor hadoop shim in terms of java libceph bindings
Refactor the hadoop code in terms of generic Java bindings for libceph, instead of mixing the SWIG crap in with the a... Sage Weil
09:24 AM Bug #698: cosd memory usage with large number of pools
So, yeah. All the heapdump commands were sent *after* ceph -w reported the PGs were all active+clean, although I didn... Greg Farnum
09:13 AM Feature #801: librados: allow access to multiple clusters
Do we want to implement support for multiple clusters in SimpleMessenger, or are we going to have basically full inca... Greg Farnum
07:13 AM Cleanup #813 (Resolved): libceph interface cleanup
A few things:
- clean up the API wrt to setting up a client instance (see #801)
- clean up packaging (/usr/includ...
Sage Weil
07:09 AM Feature #812 (Closed): switch up crypto library (again)
RHEL doesn't want to include crypto++ because it is not FIPS 140-2 certified (although they got a kick of out a DOE l... Sage Weil
05:36 AM Linux kernel client Bug #811 (Can't reproduce): rbd warning and crash
ceph-client
commit 9c01177349b435186025a088f612a6f5ce2f3de9
After execute command: echo "192.168.250.70,192.168.2...
Piotr S
12:47 AM Bug #810 (Duplicate): 1). PG bits don't get recognized and 2). Takes too long for OSDs to boot up.
1.
By default, (0.24.3) there are 264 number of placement groups per osd.
I've tried to change to something else...
DongJin Lee

02/16/2011

11:07 PM Revision 29b5ae0c (ceph): simplemessenger: Fix num_threads bug printout.
Also add documentation to get_num_threads since its contract
changed significantly.
Signed-off-by: Greg Farnum <greg...
Greg Farnum
09:22 PM Bug #698: cosd memory usage with large number of pools
Hmm, I just tried to run one of my tests using 2 OSDs instead of one for the first time. It looks like it's still pos... Greg Farnum
08:16 PM CephFS Feature #601: mds: order directory commits after rename
Err, I guess actually that should read:
8) a commits
9) hold on b/bar gets removed
10) commit b comes in
11) b co...
Greg Farnum
07:53 PM CephFS Feature #601: mds: order directory commits after rename
I haven't gotten too far into it yet, but my current line of attack is to see how feasible it is to place "holds" on ... Greg Farnum
05:07 PM CephFS Feature #601: mds: order directory commits after rename
I think this is going to be much ahrder than I originally imagined. For example:
- mv /a/foo /b/foo
- mv /b/bar ...
Sage Weil
03:51 PM CephFS Feature #601: mds: order directory commits after rename
I'm going to take a look at this while waiting for data collection to run as I work on #698. Greg Farnum
06:28 PM Revision fc1ce083 (ceph): monmaptool: fix command-line output
Don't check errno if it isn't set.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
05:52 PM Bug #799: mkcephfs error
cconf output many messages in debug mode,but some messages is useless,and affect cconf.
finally, i try to change ce...
longguang yue
12:13 PM Bug #799: mkcephfs error
can you attach your ceph.conf? Sage Weil
05:21 PM Revision 651fac6d (ceph): dout: don't print version when forcing fg logging.
dout: don't print version when forcing fg logging.
Fix tests that were assuming us to spew errors about /var/log, wh...
Colin Patrick McCabe
05:19 PM Revision e86e4cf8 (ceph): init-ceph: use do_cmd for pid_file dir creation
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:03 PM Bug #802: osd: failing to send heartbeats (btrfs hang?)
I think this is the same thing Jim is seeing. Sage Weil
04:57 PM Revision e4ecddd4 (ceph): dout initialization: remove unecessary flush()
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:22 PM Revision e4054d2f (ceph): dout: properly output ceph version on opening dout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:05 PM Revision 5676253c (ceph): monmaptool: set_foreground_logging
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:02 PM rbd Bug #809 (Resolved): listing snapshots when none exist causes qemu to crash
I'm guessing this is just a null pointer dereference. To reproduce:
* Create an rbd-backed vm.
* Run it in qemu. ...
Josh Durgin
03:16 PM Revision 03e56b77 (ceph): msgr: complain if there are > 1 threads, not 1
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:11 PM rbd Feature #808 (Resolved): rbd: store snapshot timestamp and vmclock
In the qemu monitor, 'info snapshots' will list the date and vmclock of each, in addition to snapshot id and tag. Josh Durgin
09:16 AM Bug #806 (Resolved): init-ceph should use do_cmd consistently, or not at all
this bug fixed by commit:e86e4cf
the ssh stuff kicks in only with -a. it's weird, yes, but incredibly convenient....
Sage Weil
03:32 AM Revision 3b03c5a3 (ceph): osd: fix population on unconnected_watchers on obc load
Fixes: #807
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil

02/15/2011

11:24 PM Revision b0862e32 (ceph): test/osd: Fix indentation on RadosModel.h and TestSnaps.cc
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:57 PM Revision 28bb6fb5 (ceph): testsnaps: add snapshot test
Uses RadosModel.h to check the results of a randomized sequence of
writes, reads, snapshots, snapshot removals, and r...
Samuel Just
07:28 PM Bug #807 (Resolved): unconnected_watchers assert fail
The (violated) invariant is that any item in oi.watchers must be either in obc->watchers or obc->unconnected_watchers... Sage Weil
04:47 PM Bug #807: unconnected_watchers assert fail
Here's roughly what I did that lead to this (I'm not sure exactly when cosd crashed):
- imported two images using rb...
Josh Durgin
04:18 PM Bug #807 (Resolved): unconnected_watchers assert fail
cosd crashed when running qemu off an rbd image calles kvm213_root:... Josh Durgin
07:02 PM Revision 6fb416b0 (ceph): common: thread: get number of threads from /proc
The kernel knows how many threads we have; just ask it. One less atomic
variable to carry around.
We will eventually...
Colin Patrick McCabe
05:52 PM Revision fec24834 (ceph): Remove ExportControl (we have better auth now)
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:16 PM Revision 26504682 (ceph): init-ceph: status: use daemon_is_running
daemon_is_running does some nice things like check /proc/$pid/cmdline.
Signed-off-by: Colin McCabe <colin.mccabe@dre...
Colin Patrick McCabe
05:09 PM Revision a9c330e5 (ceph): OSD: ignore osd_max_write_size if it's set to 0.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:01 PM Revision d496b651 (ceph): Journaler: add some checks for expire_pos.
I don't think these are necessary checks, but the expire_pos >= trim_pos
invariant got broken somehow by johnl, and t...
Greg Farnum
05:01 PM Revision bfe2c3cf (ceph): Journaler: call set_layout after init_headers.
set_layout modifies last_committed, but then init_headers
uses operator= and overwrites those changes. In this case
i...
Greg Farnum
04:39 PM Revision 76667c41 (ceph): init-ceph: fix status for multi-node clusters
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:58 PM Bug #806 (Resolved): init-ceph should use do_cmd consistently, or not at all
init-ceph sometimes uses do_cmd, and sometimes not.
Example:...
Colin McCabe
03:39 PM Bug #795 (Resolved): init-ceph assumes only one daemon per node
fixed by commit:76667c41e4bc242d089656485d02ba9c4f9b4fec Colin McCabe
01:49 PM Revision 8cd5b6a5 (ceph): Merge branch 'dout2'
Colin Patrick McCabe
01:33 PM Revision 31a17c6a (ceph): common/lockdep.cc: don't use dout unlocked
Lockdep should use the regular dout() interfaces, rather than going
around them. In particular, we shouldn't output t...
Colin Patrick McCabe
09:38 AM CephFS Bug #805 (Can't reproduce): mds startup: _replay journaler got error -22, aborting
Well, I can't figure out how the header could have gotten corrupted like that, but I've put in a few more checks for ... Greg Farnum

02/14/2011

04:26 PM CephFS Bug #805: mds startup: _replay journaler got error -22, aborting
Okay, somehow expire < trim. That shouldn't happen. Greg Farnum
02:29 PM CephFS Bug #805: mds startup: _replay journaler got error -22, aborting
Dumped and bzipped the journal here: http://johnleach.co.uk/downloads/ceph/805-journal.dump.tbz2 John Leach
01:21 PM CephFS Bug #805: mds startup: _replay journaler got error -22, aborting
You don't have any other logs from before, do you? At this point there's a problem with the way the MDS journal ended... Greg Farnum
01:02 PM CephFS Bug #805 (Resolved): mds startup: _replay journaler got error -22, aborting
As per #803, the assert is now fixed but starting up the cluster I now get:... John Leach
03:20 PM Revision 16cd9173 (ceph): assert: allow assertions inside calls to dout()
We should handle the situation where we assert() while already holding
the dout() lock. At the same time, we want to ...
Colin Patrick McCabe
01:56 PM Revision 37361926 (ceph): dout: Convert _dout_lock to plain pthread_mutex_t
Convert _dout_lock to plain pthread_mutex_t. This way, we don't have to
depend on the order of global constructor ini...
Colin Patrick McCabe
01:52 PM Revision 1543d364 (ceph): assert.cc: some cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:52 PM Revision dd21803a (ceph): common/debug.h: use std::string rather than string
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:51 PM Revision 65bb14f0 (ceph): common: Remove common/tls.cc
Using ELF TLS via the __thread keyword is much faster than using
pthread_getspecific and pthread_setspecific. It's al...
Colin Patrick McCabe
12:17 PM Revision db188f0c (ceph): dout: use DoutLocker rather than Mutex::Locker
Use DoutLocker rather than Mutex::Locker, in preparation for making the
dout_lock a plain old pthread_mutex_t.
Signe...
Colin Patrick McCabe
12:17 PM Revision a7b8e9c6 (ceph): os/FileStore: use derr/dendl for dout locking
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:05 PM Revision e7b584f0 (ceph): os/FileStore: use ceph_abort rather than abort
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:30 AM Revision 97809ee3 (ceph): lockdep: balance dout and dendl, fix whitespace
Make lockdep use dendl the same way as the other code. This is in
preparation for making lockdep use normal dout() ra...
Colin Patrick McCabe
10:41 AM Bug #777 (Resolved): mount hung, tid timed out messages in log
All right, looks like this is good! Greg Farnum
09:23 AM Bug #698: cosd memory usage with large number of pools
It seems the messages stacking up are just a result of massif slowing down operations too much.
According to massi...
Greg Farnum
09:16 AM Linux kernel client Bug #804: read performance slow
In general read performance is a little bit slower than write performance, for reasons we haven't managed to diagnose... Greg Farnum
03:50 AM Linux kernel client Bug #804 (Resolved): read performance slow
Ceph cluster is configured and running with 64 OSD’s daemon( 64 hard disk ), 1 Metadata servers(MDS) an... Mohamed Pakkeer

02/13/2011

10:16 PM Revision 4ee35b03 (ceph): Journaler: fix bad assert.
We can call reread_head during normal replay under
certain circumstances. So add the REREAD_HEAD state
as allowed.
S...
Greg Farnum
02:17 PM Bug #803 (Resolved): mds assert failed replaying journal after respawn
This was just a bad assert missing an allowed case. Looks like this got hit while going through error-handling code, ... Greg Farnum
12:49 PM Bug #803 (Resolved): mds assert failed replaying journal after respawn
... John Leach
08:57 AM Bug #777: mount hung, tid timed out messages in log
Tested with a build of master and it looks good. I'm writing files to a dir atm and am up to 430,000 entries and stil... John Leach
04:52 AM Messengers Bug #798: should have a empty() test on list<Messegger *>
a exception is throwed out when pop_front()
------gdb msg--------------------------
-----------------
[root@mods2 ...
longguang yue

02/12/2011

09:23 PM Revision da696695 (ceph): testlibrbd: fix printf args
Stupid me!
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:02 PM Revision 4f4306e2 (ceph): common: create dout_emergency interface and use it
Create the dout_emergency interface, which is safe to call from a signal
handler or from inside dout itself.
Signed-...
Colin Patrick McCabe
07:06 AM Revision 81beac55 (ceph): testlibrbd: check return values
Stupid printf!
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:06 AM Revision 6a800643 (ceph): cfuse: use safe_read and check return value
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:47 AM Revision a930c7af (ceph): debian: add python, python-dev build-deps
Might be overkill? The error I see from pbuilder is
checking for a Python interpreter with version >= 2.4... none
e...
Sage Weil
06:41 AM Revision 940d58d9 (ceph): msgr: clean up Pipe::queue_received locking
Ensure we maintain the invariant that a pipe has a non-empty queue IFF
the pipe is queued.
Prompted by #798.
Signed...
Sage Weil
01:03 AM Revision 12e14f70 (ceph): MDCache: add max_dir_commit_size.
Configured by setting mds_dir_max_commit_size in conf, or else
by looking at osd_max_write_size. This should lead to ...
Greg Farnum
01:03 AM Revision 3129581e (ceph): MDCache: switch CDir::_commit so that it can limit max write size.
This should fix #777.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
12:58 AM Revision 25c71ea2 (ceph): MDCache: switch CDir::_commit so that it can limit max write size.
This should fix #777.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
12:16 AM Revision fa4a9230 (ceph): MDS: Don't always _commit_full just because we have a complete dir.
Instead, commit if a certain percentage of the dentries are dirty.
Configurable via mds_dir_commit_ratio!
Signed-off...
Greg Farnum

02/11/2011

11:54 PM Revision 4328a550 (ceph): MDCache: add max_dir_commit_size.
Configured by setting mds_dir_max_commit_size in conf, or else
by looking at osd_max_write_size. This should lead to ...
Greg Farnum
11:38 PM Revision b7de2fd0 (ceph): .gitignore: py-compile
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:36 PM Revision cd859923 (ceph): librados: include headers in dist tarball
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:09 PM Bug #799: mkcephfs error
1. g++ with CXXFLAGS='-g -pg' FFLASGS='-g' CFLAGS='-g'
2. ceph.conf ,open all debug options at 20 level
so cconf w...
longguang yue
09:09 PM Bug #799: mkcephfs error
Which version are you running? What is your ceph.conf? What arguments did you pass to mkcephfs? Sage Weil
09:56 PM Revision 74c4d867 (ceph): MDS: Don't always _commit_full just because we have a complete dir.
Instead, commit if a certain percentage of the dentries are dirty.
Configurable via mds_dir_commit_ratio!
Signed-off...
Greg Farnum
09:50 PM Revision e21034e1 (ceph): Make git ignore librbd tests.
Tommi Virtanen
09:48 PM Feature #801: librados: allow access to multiple clusters
And while we're breaking backward compatibility, the read and write functions have some strange parameters ordering: ... Yehuda Sadeh
09:08 PM Feature #801: librados: allow access to multiple clusters
See librados_api branch.
BTW if we do end up changing this, we should be consistent with naming... some functions ...
Sage Weil
02:46 PM Feature #801 (Resolved): librados: allow access to multiple clusters
the current api assumes you are only talking to a single cluster. we need to have a 'cluster handle' returned by the... Sage Weil
09:48 PM Revision 0c97d790 (ceph): ceph.spec.in: add librbd
Josh Durgin
09:42 PM Revision 27a7b5a7 (ceph): ceph: add --concise flag
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:41 PM Revision 1c320ebe (ceph): Merge branch 'stable'
Conflicts:
configure.ac
Sage Weil
09:29 PM Revision 7fea444b (ceph): Merge remote branch 'origin/librbd'
Sage Weil
09:21 PM Revision b7d1d052 (ceph): objecter: set linger op target pg when a linger is resent
send_linger always creates a new Op, but op_submit does not fill in
the target pg if an existing session is passed in...
Josh Durgin
09:05 PM Messengers Bug #798 (In Progress): should have a empty() test on list<Messegger *>
Hi Longguang,
I took a look at this and think it's okay. Basically the invariant is that the pipe should be in the ...
Sage Weil
07:00 PM Messengers Bug #798: should have a empty() test on list<Messegger *>
void SimpleMessenger::dispatch_entry(){
...
pipe->pipe_lock.Lock();
list<Message *>& m_queue = pipe->in_q[priority...
longguang yue
08:58 PM Revision 718c42b9 (ceph): Make git ignore qa/workunits/direct_io_test.
Tommi Virtanen
08:55 PM Revision 09c019bb (ceph): monclient: fix weirdness when we explicitly start with non-master mon
Fix
$ ./ceph -m 10.0.1.252:6790 health
2011-02-11 12:30:38.132728 mon <- [health]
2011-02-11 12:30:38.133243 mon <- ...
Sage Weil
07:10 PM Revision 7e1ebd7a (ceph): Disable lockdep for ExportControl, ConfFile locks
Currently, we haven't read the configuration at the time we initialize
these locks. So we can't know whether lockdep ...
Colin Patrick McCabe
04:51 PM Bug #777: mount hung, tid timed out messages in log
Sage Weil wrote:
> And after thinking about it a bit, Greg, _commit_full should be pretty easily fixed up to behave ...
Greg Farnum
04:49 PM Tasks #796 (Resolved): Let CDir::_commit_full write in pieces
Pushed into master in commit: 3129581e89cb38f9ad2ff85673e50a033ec77d40 Greg Farnum
04:12 PM Tasks #796: Let CDir::_commit_full write in pieces
Pushed it into the max_commit_size branch. Testing it now. Greg Farnum
04:48 PM Feature #709: nagios plugin
The nagios machines are still etch so I think I'm going to push to get them updated to lenny before proceeding with t... Dallas Kashuba
02:40 PM Feature #709: nagios plugin
fwiw i've been creating a 'logical cluster' machine in teh dh db. see ceph-playground-fs, for example. although the ... Sage Weil
02:37 PM Feature #709 (In Progress): nagios plugin
I've started on this. I think it will look something like this...
A new "cephmonitor" service entry on a dummy ma...
Dallas Kashuba
04:04 PM CephFS Tasks #797 (Resolved): Don't _commit_full just because dir is_complete()
This got pushed in commit: fa4a9230c6ba68b9b66d1560abb89114caedf74b Greg Farnum
03:57 PM Bug #802 (Closed): osd: failing to send heartbeats (btrfs hang?)
mkcephfs -c /etc/ceph/ceph.conf --allhosts -k /etc/ceph/keyring.bin --mkbtrfs
service ceph -a start
watch osds flap...
Sage Weil
03:50 PM Revision cde53e95 (ceph): Merge branch 'pybind'
Colin Patrick McCabe
02:35 PM rbd Bug #391: snap create/delete caused corruption
Using ext3freezer on the guest during the snapshotting doesn't help either — as far as I can tell, simply taking/remo... Andrew F
11:48 AM rbd Bug #391: snap create/delete caused corruption
Managed to reproduce this by running this script (to create and delete snapshots) on the host:... Andrew F
11:33 AM rbd Bug #391 (In Progress): snap create/delete caused corruption
reopening Yehuda Sadeh
02:00 PM Revision f70d9047 (ceph): config: Remove g_conf.num_osd, num_mds, num_mon
These configuration options don't serve any use and were confusing.
Signed-off-by: Colin McCabe <colin.mccabe@dreamh...
Colin Patrick McCabe
12:23 PM rgw Tasks #774 (Rejected): move to radosgw.git
Sage Weil
12:03 PM Revision 8a1906a0 (ceph): MonitorStore: check for ENOENT, not EEXIST
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:51 AM Revision a350296d (ceph): common: bufferlist::read_file: use safe_read
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:31 AM Revision d4312952 (ceph): auth/Crypto.cc: use safe_read_exact
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
10:55 AM Revision 22dece17 (ceph): os/FileStore: remove unused variable
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe

02/10/2011

11:43 PM Revision 12a47f56 (ceph): s3.sh: simple bug fix
W Spikes
11:41 PM Revision 1021f5e2 (ceph): rados: add create op
Yehuda Sadeh
08:44 PM rgw Bug #768: eliminate any read/modify/write races
That should be accomplished using a new class: the 'rgw' class. This, however, won't fix the problem when using a dif... Yehuda Sadeh
08:42 PM rgw Bug #766: slow bucket creation times
It's confirmed. Bucket creation time can be slow. There is a problem with osd having to digest the new osdmap after e... Yehuda Sadeh
08:22 PM Feature #800 (Rejected): pool namespace
Pools should have namespaces. Yehuda Sadeh
06:24 PM Bug #799 (Closed): mkcephfs error
[root@localhost ceph]# mkcephfs -c /etc/ceph/ceph.conf --allhosts -k /etc/ceph/adminkeyring.bin
usage: cconf <-c fi...
longguang yue
06:02 PM Messengers Bug #798 (Can't reproduce): should have a empty() test on list<Messegger *>
void SimpleMessenger::dispatch_entry(){
...
pipe->pipe_lock.Lock();
list<Message *>& m_queue = pipe->in_q[pr...
longguang yue
05:49 PM Revision b60444b5 (ceph): make:add messages/MOSDRepScrub.h to NOINST_HEADERS
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:49 PM Revision 2cd2c56d (ceph): v0.24.3
Sage Weil
05:16 PM Revision e5fbf5ec (ceph): os/FileStore: fix error handling
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:16 PM Revision c7886ba5 (ceph): common/common_init: use safe_read
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:16 PM Revision 42b735c4 (ceph): rados.cc: use fwrite to output data to stdout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:16 PM Revision 378ba0a1 (ceph): mon/MonitorStore.cc: use safe_read
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:16 PM Revision c695a61f (ceph): cephtool: get_indata should use safe_read_exact
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:01 PM Revision 500edf7a (ceph): rbd: fix signed/unsigned comp on i386
We -D_FILE_OFFSET_BITS=64, so off_t is off64_t.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:53 PM Revision a476cceb (ceph): Removing files that look completely unused.
Tommi Virtanen
04:52 PM Revision 8f43d4e7 (ceph): osd: avoid message leak in error path
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:24 PM Revision 5d3a129f (ceph): config: fix -f option settings.
We don't want to force foreground logging just because we run in the
foreground -- use no-daemon for that!
Signed-of...
Greg Farnum
04:00 PM Revision ff85407b (ceph): common/DoutStreambuf: use safe_io.h
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:09 PM Revision 2aabb909 (ceph): tools/common: use fwrite to write stdout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:08 PM Revision 70a8b625 (ceph): osd: write_meta/read_meta: fix error handling
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:49 PM Revision 59874008 (ceph): msgr: better error checking for write_pid_file
Use safe_write. Check error code of close.
Fix what looks like a "lack of braces around multi-statement block"
error...
Colin Patrick McCabe
02:47 PM Revision c3f0f09d (ceph): common/buffer.cc: kill deadcode
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:47 PM Revision 635c535c (ceph): os/FileStore.cc: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:47 PM Revision 0d4244d8 (ceph): common/ConfUtils.cc: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:21 PM Revision d2491941 (ceph): MonitorStore::put_int: handle I/O errors
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:02 PM Revision 56943deb (ceph): common/safe_io.h: warn about unchecked returns
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:55 PM Revision 4a4c2eeb (ceph): mds/Dumper: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:55 PM Revision e8ac5aa2 (ceph): cfuse.cc: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:55 PM Revision ea80a066 (ceph): btrfs_ioc_test.c: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:49 PM Revision cd2a2d73 (ceph): test_disk_bw.cc: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:47 PM Revision 4de7fd49 (ceph): common: safe_io: allow C files to include safe_io
Don't use extern C unless we're compiling with C++.
This allows C files to include safe_io.h
Signed-off-by: Colin Mc...
Colin Patrick McCabe
01:14 PM Revision 84b90892 (ceph): client/fuse_ll: use safe_write
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:58 PM Revision 380ccfd7 (ceph): common: safe_io: handle EOF on read
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:58 PM Revision a743e314 (ceph): common:safe_io: create exact and non-exact variant
Create safe_read_exact, which fails with -EDOM unless it reads exactly
the requested number of bytes. More often, wha...
Colin Patrick McCabe
12:58 PM Revision b161e6a3 (ceph): os: convert some read/write uses to safe variants
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:57 AM CephFS Tasks #797 (Resolved): Don't _commit_full just because dir is_complete()
We should use some kind of metric to see if we should commit_partial or commit_full! Greg Farnum
11:57 AM Tasks #796 (Resolved): Let CDir::_commit_full write in pieces
CDir::_commit_full attempts to write out the whole directory to the PG, no matter how large. It doesn't need to do th... Greg Farnum
09:10 AM Bug #462 (Resolved): cephx: verify_authorizer_reply exception in decode_decrypt
Sage Weil
08:38 AM Bug #698: cosd memory usage with large number of pools
I ran the OSD through massif and in one of my tests, creating 2000 pools (with default 8 PGs), 1/3 of the ending memo... Greg Farnum
08:09 AM Bug #794 (Resolved): massif is unusable on current code
This seems to have been because of a bad setting in config for the foreground logging option. Simple enough to fix!
...
Greg Farnum
12:19 AM Revision af755be9 (ceph): filestore: fix fiemap
Yehuda Sadeh
12:19 AM Revision 65560eff (ceph): testlibrbd: change strcmp to memcmp, other stuff
Yehuda Sadeh

02/09/2011

10:30 PM Revision 82f5989d (ceph): librbd: use watch/notify internally to keep ImageCtx updated
Josh Durgin
09:45 PM Bug #777 (In Progress): mount hung, tid timed out messages in log
Greg Farnum wrote:
> It's not exactly a duplicate, but #594 is the latest open bug on fragment split/merge. When tha...
Sage Weil
04:50 PM Bug #777 (Duplicate): mount hung, tid timed out messages in log
It's not exactly a duplicate, but #594 is the latest open bug on fragment split/merge. When that's done and fragmenta... Greg Farnum
04:02 PM Bug #777: mount hung, tid timed out messages in log
.....oh. So you're trying to create a million files in a single directory?
Ceph stores inodes in the directory the...
Greg Farnum
03:45 PM Bug #777: mount hung, tid timed out messages in log
no ooms (the boxes are 6gig ram) and now I'm relatively sure the mds is not crashing, it seems to be an orderly respa... John Leach
03:41 PM Bug #777: mount hung, tid timed out messages in log
to be clear, this is not resulting in the "tid timed out" messages now, just the journal hangs, so it might be a sepa... John Leach
03:38 PM Bug #777: mount hung, tid timed out messages in log
got a debug log from the mds now. Looks like the mds decides to commit it's data to the osds and something goes wrong... John Leach
03:36 PM Bug #777: mount hung, tid timed out messages in log
Do you have any core dumps from the MDS? Or is this running in a memory-constrained system so it's getting OOM-killed? Greg Farnum
03:32 PM Bug #777: mount hung, tid timed out messages in log
I can reproduce this:
* format new cluster (in my case: 3 mons, 2 mds, 4 osds)
* mount filesystem using kernel cl...
John Leach
03:08 PM Bug #777 (In Progress): mount hung, tid timed out messages in log
I don't think the MDS can do that (and if it can, we need to put blocks in place to prevent that). I guess I'll need ... Greg Farnum
03:04 PM Bug #777: mount hung, tid timed out messages in log
Only the kernel client was in use, so am assuming it was the mds that made a write bigger than the journal size (120m... John Leach
08:39 PM Revision e26b194a (ceph): librbd: fully specify std::string in list function
Josh Durgin
08:39 PM Revision ec01b300 (ceph): librbd: add C++ interface tests
Josh Durgin
08:05 PM Linux kernel client Bug #750 (Won't Fix): run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"
I imagine this has the same cause as #742. Please re-open if you manage to reproduce while using btrfs for the cosd s... Greg Farnum
07:49 PM Linux kernel client Bug #750: run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"

hi ,
1.
i git ceph-client ,checkout unstable (commit 9c01177349b435186025a088f612a6f5ce2f3de9)
make menuconfig...
changping Wu
08:04 PM Linux kernel client Bug #742 (Won't Fix): ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale"...
This is an unfortunate result of running the kernel client on the same machine as the OSD:
The cosd process needs to...
Greg Farnum
07:56 PM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
ceph-client unstable:
kernel hacking:
select:
[] RCU debugging: sparse-based checks for pointer usage
< > tor...
changping Wu
07:55 PM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
Hi samuel,
osds are running on ext4 .
the steps:
1.
git ceph-client ,checkout unstable (commit 9c01177349b435...
changping Wu
07:28 PM Revision f7475de2 (ceph): dout: don't print time to syslog
Syslog itself prepends a time to messages, so don't duplicate that.
Use a faster snprintf-based method to generate th...
Colin Patrick McCabe
06:28 PM Bug #792 (Resolved): dout: don't output the date to syslog
implemented in f7475de2f672a39afda430af04a6f8557859abf5 Colin McCabe
10:01 AM Bug #792 (Resolved): dout: don't output the date to syslog
Dout shouldn't output the date to syslog. Syslog takes care of that itself, and outputting the date twice is a waste.... Colin McCabe
06:07 PM Revision 820d1a78 (ceph): On i386, pthread ids are 8 hexdigits. Adjust cli tests.
Tommi Virtanen
05:49 PM Revision 061b8d97 (ceph): pybind/rados: expand on error handling a bit
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:40 PM Revision 8284b6b0 (ceph): Add python bindings for rados
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Hannu Valtonen
02:48 PM Revision e45fd9f2 (ceph): Merge branch 'fix32'
Colin Patrick McCabe
02:39 PM Revision fe298fc3 (ceph): msg/SimpleMessenger.c: fix signed/unsigned compare
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:32 PM Revision 76fd853f (ceph): os/FileStore: fix signed/unsigned comparison warn
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:28 PM Revision af77fa4f (ceph): os/FileJournal: compare like types in MAX()
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:20 PM Bug #795 (Resolved): init-ceph assumes only one daemon per node
The init-ceph script (usually installed as /etc/init.d/ceph) assumes that there is only one daemon running on the nod... Colin McCabe
01:33 PM Bug #794 (Resolved): massif is unusable on current code
When starting up valgrind stuff via vstart, it now logs everything to the console. I presume this is because of the c... Greg Farnum
01:32 PM Revision 2ca337d0 (ceph): tools/gui.cc: fix signed/unsigned comparison
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
10:49 AM Bug #698 (In Progress): cosd memory usage with large number of pools
Taking a look now! Greg Farnum
10:21 AM Bug #759 (Resolved): osd: pgs spend a long time peering when marking osds out
Sage Weil
10:18 AM Bug #759: osd: pgs spend a long time peering when marking osds out
1a01e5ee1b88a217547873296e0371858be13f37 merged in a branch moving replica scrubbing to rep_scrub_wq with a new non-o... Samuel Just
10:21 AM Bug #793 (Resolved): osd: avoid blocking in scrub_wq
Sage Weil
10:05 AM Tasks #584: do throughput scaling tests on sepia
I don't remember: did we actually finish this and just not update the bug? If not we should decide what to do with it! Greg Farnum
10:01 AM Bug #644: rsync can be sloooow
Well, Sage's above patch makes it some better, but there are still issues with losing the I_COMPLETE flag due to pree... Greg Farnum
12:22 AM Revision 1a01e5ee (ceph): Merge branch 'stable'
Conflicts:
src/osd/OSD.cc
Sage Weil
12:22 AM Revision ec9d14c1 (ceph): Merge remote branch 'origin/rep_scrub_wq' into stable
Sage Weil

02/08/2011

10:04 PM Revision eb20de40 (ceph): rbd: fix watch tool
Yehuda Sadeh
09:39 PM Revision 1ce2a55a (ceph): rbd: modify watch
Yehuda Sadeh
07:50 PM Revision 5d7f4823 (ceph): librbd: fix completion cb mixup
Yehuda Sadeh
05:38 PM Revision 778902b4 (ceph): make:add messages/MOSDRepScrub.h to NOINST_HEADERS
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:44 PM Revision 0b862ecc (ceph): gitignore: ignore massif.out.*
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:41 PM Revision cc525b3a (ceph): osd: discard scrub reply if pg changed
build_scrub_map will bail out if the pg changed. Discard the result in
that case since the primary will ignore it an...
Sage Weil
04:41 PM Revision a948aa11 (ceph): osd: avoid map_lock for scrub_map reply
Using osd->osdmap->epoch without map_lock is dangerous. We can avoid it
entirely by replying on the same connection ...
Sage Weil
04:22 PM Revision 36097c3a (ceph): osd: never rewrite log after {advance,activate}_map
pg->dirty_log is never true, so this is dead code. And nothing in either
of those two methods updates the pg log.
S...
Sage Weil
04:22 PM Revision 3055d094 (ceph): osd: always write backlog after creation
dirty_log is never set to true, so we would set the log.backlog flag but
not write it to disk. If we restarted the O...
Sage Weil
04:22 PM Revision 19afe11c (ceph): osd: fix no missing inferance
Add missing continue in last_update==last_complete (no missing) case.
Signed-off-by: Sage Weil <sage.weil@dreamhost....
Sage Weil
04:07 PM Revision 7d5fc1ae (ceph): common_init: don't force profiling_logger = true
common_init: don't force profiling_logger = true for daemons. Users can
always set "profiling logger = true" when the...
Colin Patrick McCabe
03:48 PM Revision e028cc4b (ceph): config: kill more unused options
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:43 PM Revision 7d3e8afc (ceph): cfuse: remove high level fuse interface
Untested, unused, and inferior to the lowlevel implementation.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:02 PM Revision c0416af0 (ceph): common: Rename Logger to ProfilingLogger
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:52 PM CephFS Bug #791 (Resolved): ls -al waits for writes to complete
On Mon, 6 Dec 2010, Jim Schutt wrote:
> Hi Sage,
>
> On Sat, 2010-12-04 at 21:59 -0700, Sage Weil wrote:
> > >
>...
Greg Farnum
01:49 PM Bug #777 (Resolved): mount hung, tid timed out messages in log
In the absence of other evidence I'm going to go ahead and mark this as solved. Greg Farnum
01:11 PM Revision 2315759e (ceph): common: combine set_fg_logging into common_init
Combine set_fg_logging into common_init. This eliminates the problem
where we briefly try to apply the logging settin...
Colin Patrick McCabe
12:50 PM Bug #790 (Resolved): convert read, write, pread, pwrite to sage's safe versions
Convert users of read, write, pread, pwrite to sage's safe versions (unless they are already handling short reads/wri... Colin McCabe
10:21 AM Bug #758 (Resolved): dd-truncate elsewhere-dd again is slow
From Jim:
Also FWIW, I'm currently unable to reproduce the slow
write after truncate issue, using stable branch
on...
Greg Farnum
09:07 AM Documentation #789 (Resolved): Document config options
All ceph.conf config options for the different modules should be clearly documented. Yehuda Sadeh
08:51 AM Cleanup #765 (Resolved): remove config option cruft
removed lots of stuff in master branch. Sage Weil
07:36 AM Linux kernel client Bug #747 (Can't reproduce): ladder0 crash
Sage Weil
07:36 AM Bug #788 (Resolved): slow btrfs commits
Sage Weil
07:35 AM Linux kernel client Bug #730 (Closed): connection resets from kclient
Looking closer, this appears to (now at least) be due to slow btrfs commits on the OSD (e.g. 30-50 seconds) which mak... Sage Weil
04:56 AM Revision cb4fcfe3 (ceph): OSD: Add rep_scrub_wq
Previously, replica scrubs would be handled in sub_op_scrub in the op
queue. Replica scrubs will now be processed by...
Samuel Just
04:56 AM Revision aed279e6 (ceph): PG: added replica_scrub
Adds handler in PG for MOSDRepScrub messages. replica_scrub will
replace sub_op_scrub.
Signed-off-by: Samuel Just <...
Samuel Just
04:56 AM Revision 03c7b062 (ceph): OSD: Adds handler for MOSDRepScrub
handle_rep_scrub enqueues the message in rep_scrub_wq.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
04:56 AM Revision 212f977f (ceph): PG: switch _request_scrub_map to send MOSDRepScrub
Also switches sub_op_scrub_reply to sub_op_scrub_map to handle the
OSD_OP_SCRUB_MAP response.
Signed-off-by: Samuel ...
Samuel Just
04:56 AM Revision 41629202 (ceph): PG: remove sub_op_scrub
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:56 AM Revision 7245b6a1 (ceph): MOSDRepScrub: Adds a message for initiating a replica scrub
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
04:56 AM Revision 4cab2031 (ceph): rados: Adds CEPH_OSD_OP_SCRUB_MAP sub op
Previously, maps were requested with a sub_op and sent with a
sub_op_reply. As maps will now be requested using a di...
Samuel Just
01:21 AM Revision ea3266b4 (ceph): librbd: change tests to use new rbd_list and rbd_list_snaps interface
Josh Durgin
01:20 AM Revision 022facad (ceph): librbd: depends on librados
Yehuda Sadeh

02/07/2011

11:13 PM Revision 18c31893 (ceph): librados, librbd, radosacl: use new include paths
Josh Durgin
11:13 PM Revision 5e1e362b (ceph): librbd: use RBDClient directly in the C interface
Josh Durgin
11:13 PM Revision 42205985 (ceph): librbd: add set_snap and I/O to the C API
Josh Durgin
11:13 PM Revision f1d5f1c7 (ceph): librbd: add rbd prefix to C version of set_snap
Josh Durgin
11:13 PM Revision 0d870c47 (ceph): librbd: add C version of get_rados_pools
Josh Durgin
11:13 PM Revision b43a52c2 (ceph): librbd: aio_read, still buggy
Yehuda Sadeh
11:13 PM Revision 23426fc8 (ceph): librbd: fix aio_read
Yehuda Sadeh
11:13 PM Revision e8238bd4 (ceph): rbd: revert to the old(er) export implementation
the aio_write was just used for development Yehuda Sadeh
11:13 PM Revision 78c856c0 (ceph): librbd: change bufferlist namespace
Yehuda Sadeh
11:13 PM Revision 654a1f8a (ceph): librbd: finish C I/O api
Josh Durgin
11:13 PM Revision 52f0945d (ceph): librbd: add C api I/O tests
Josh Durgin
11:13 PM Revision 6a5c26ad (ceph): librbd: remove redundant casting in C aio completion functions
Josh Durgin
11:13 PM Revision e40bb79a (ceph): librbd: check aio return values in tests
Josh Durgin
11:13 PM Revision e7312bd0 (ceph): librbd: modify list_snaps C interface
Yehuda Sadeh
11:13 PM Revision d3e6222b (ceph): librbd: modify rbd_list() interface
Yehuda Sadeh
11:13 PM Revision 6ce5e888 (ceph): librbd, librados: cleanup
Yehuda Sadeh
11:13 PM Revision 0d8bcb14 (ceph): rbd: fix misc problems
Yehuda Sadeh
11:13 PM Revision 78fe3934 (ceph): librbd: update C interface and tests to use image contexts
Josh Durgin
11:13 PM Revision f18c7213 (ceph): librbd: add packages for librbd
Josh Durgin
11:13 PM Revision 9b9a3c09 (ceph): librbd: store snapshot names in image context and use them when listing...
Josh Durgin
11:13 PM Revision 6de5908c (ceph): librbd: remove unnecessary argument to trim_image
Josh Durgin
11:13 PM Revision 4994174e (ceph): librbd: re-read image header in get_snapc, and call it after snap creat...
This fixes a bug with image context in which the snap info
in the image header showed fewer snapshots than it should ...
Josh Durgin
11:13 PM Revision 8dd25006 (ceph): rbd: dynamically link with librbd
Josh Durgin
11:13 PM Revision 7aefb323 (ceph): librbd: initialize order in C API test
Josh Durgin
11:13 PM Revision 88a004aa (ceph): librbd: use a pointer instead of a globally constructed RBD instance in...
Josh Durgin
11:13 PM Revision a2109d41 (ceph): librbd: convert to dout
Josh Durgin
11:13 PM Revision c4ddbafa (ceph): librbd: aio write
Yehuda Sadeh
11:13 PM Revision 9bb5b551 (ceph): librbd: Install the headers in the development package
Signed-off-by: Wido den Hollander <wido@widodh.nl> Wido den Hollander
11:13 PM Revision 93d32c62 (ceph): rados, rbd: move include files to their own directories
In order to match the path being used with the packages Yehuda Sadeh
11:09 PM Revision a411c004 (ceph): librbd: synchronous read and write functions
also update the rbd tool Yehuda Sadeh
10:41 PM Revision 36c0c10c (ceph): rbd: get it compiled with updated librbd
Yehuda Sadeh
10:41 PM Revision 3ebefafa (ceph): librbd: pre-declare RBDClient to avoid a bunch of casts
Josh Durgin
10:41 PM Revision 3e9c1833 (ceph): librbd: remove do_ prefix from RBDClient methods
Josh Durgin
10:41 PM Revision 92a03de8 (ceph): librbd: return 0 on success in open_pools
Josh Durgin
10:41 PM Revision 691d23fb (ceph): librbd: update C interface and tests
Josh Durgin
10:41 PM Revision 58776ba1 (ceph): librbd: implement some more functionality
Yehuda Sadeh
10:41 PM Revision 99422875 (ceph): librbd: add ImageCtx and open/close image
RBDClient functions are partially converted to use it. Josh Durgin
10:41 PM Revision 747a5aca (ceph): librbd: make ImageCtx stuff compile and remove it from remove/copy/rename
Josh Durgin
10:41 PM Revision b443b7ac (ceph): librbd: remove set_assert_version on resize
Yehuda Sadeh
10:41 PM Revision 014ecb8a (ceph): librbd: remove pool_t parameter from some api calls
Yehuda Sadeh
10:41 PM Revision ccb86592 (ceph): librbd: use snapshot metadata from ImageCtx insead of refetching it on ...
Josh Durgin
10:41 PM Revision 74b5e97c (ceph): rbd: use updated interface with image_t
Josh Durgin
10:41 PM Revision 2ac597a5 (ceph): librbd: use helper to set ImageCtx snapid
Josh Durgin
10:41 PM Revision 41036c00 (ceph): librbd: initial implementation created from rbd.cc
Includes image and snapshot create/list/delete functions.
I/O will be added later.
Signed-off-by: Josh Durgin <josh....
Josh Durgin
10:41 PM Revision bd3d484e (ceph): librbd: add tests for the C api
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:41 PM Revision a5362c50 (ceph): rbd: update the command line rbd tool to use librbd
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:41 PM Revision 591fe952 (ceph): librbd: modify interface a bit
Yehuda Sadeh
10:41 PM Revision b93ddac8 (ceph): librbd: make it compile
Yehuda Sadeh
09:55 PM Linux kernel client Bug #784: kclient crash
Greg Farnum wrote:
> That backtrace was via gdb. I kept the stuff around but I'm not much good at handling linux cra...
Sage Weil
09:45 PM Linux kernel client Bug #784: kclient crash
That backtrace was via gdb. I kept the stuff around but I'm not much good at handling linux crashes so I just put it ... Greg Farnum
09:41 PM Linux kernel client Bug #784: kclient crash
if this was uml and you have a core file, gdb will give you a more useful backtrace... Sage Weil
09:01 PM Linux kernel client Bug #784 (Can't reproduce): kclient crash
I got this on a local rsync_better branch, which is built off of unstable commit 9c01177. As best I can tell it's not... Greg Farnum
09:07 PM rgw Feature #787 (Closed): rgw watchdog
a watchdog that periodically (every few seconds) runs a few operations (upload, download, list buckets, list content ... Yehuda Sadeh
09:03 PM rgw Feature #786 (Rejected): journal rgw operations to backup storage
Yehuda Sadeh
09:02 PM rgw Feature #785 (Rejected): rgw: fix filesystem backend
The filesystem backend for rgw is probably not working, we need to fix that. Also we should be able to configure easi... Yehuda Sadeh
08:12 PM Revision 07115999 (ceph): safe_io: move to C
Fixes unistd vs XOPEN_SOURCE define weirdness.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:59 PM Revision 79983c27 (ceph): add safe_{read/write} wrappers
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:59 PM Revision 9a442958 (ceph): ceph: use new safe_read wrapper
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:57 PM Revision 6799b864 (ceph): build_debian_image: include bind9-host
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:07 PM Revision 56e4aa46 (ceph): build_debian_image: fixes
- /dev/pts
- copy console helper scripts
- run from any directory
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
06:00 PM Revision e531ecd0 (ceph): common: call closelog before openlog
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:34 PM Revision 6bf53f18 (ceph): filejournal: fix return code check
pwrite() returns number of bytes, or -1.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:23 PM rgw Feature #783 (Rejected): separate rgw to a different source repository
Yehuda Sadeh
05:19 PM Revision 6ad4d101 (ceph): vstart.sh: -k doesn't take an argument
-k doesn't take an argument, so don't call shift a second time. (It's
already called once at the end of the loop.)
S...
Colin Patrick McCabe
05:18 PM Bug #754 (Resolved): mkfs script should clean up its entries in /tmp, and use TMPDIR
implemented by commit:09c38e8edadb3a5bf470ff9b0409172e0c61e012 and commit:aa8a25592f442ab640708115e144e6b8d95cb664 Colin McCabe
05:15 PM Revision be8cdb08 (ceph): qa: test for proper i_complete handling
This reproduces some buggery with rename vs I_COMPLETE on the kernel
client.
Signed-off-by: Sage Weil <sage.weil@dre...
Sage Weil
05:02 PM Revision 7e07ee05 (ceph): config: call openlog to set ident
Set the 'ident' field in our syslog output.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
05:02 PM Feature #753 (Resolved): Setting identity for syslog
Implemented by commit:7e07ee05ff9994b78456c700327d8fcef3ef1739 and commit:e531ecd02de7339984310b20e88a22335084b2f7 Colin McCabe
04:56 PM Revision 80198780 (ceph): build_debian_image: include libcrypto++ (need for runtime)
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:56 PM Revision 2635a3b8 (ceph): kernel console debug output setup scripts
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
01:54 PM Revision eb9e6197 (ceph): dout: force reopen after conf read, fg logging set
After the configuration is read, or a program sets foreground logging,
force those changes to be applied to the dout(...
Colin Patrick McCabe
01:17 PM Linux kernel client Bug #776 (Resolved): infinite loop on snap trace update
Sage Weil
01:17 PM Linux kernel client Bug #776: infinite loop on snap trace update
fixed by commit:e8e1ba96b207deba1339b09983f8b29f92cb1497 Sage Weil
01:06 PM Cleanup #781 (Resolved): make proper wrappers for read/write/etc.
Sage Weil
09:34 AM Cleanup #781 (Resolved): make proper wrappers for read/write/etc.
The TEMP_FAILURE_RETRY macro doesn't return a meaningful value for things like read/write. Also, and read/write call... Sage Weil
12:34 PM Revision aa8a2559 (ceph): mkcephfs: fix bug in multi-node setups
mkcephfs needs to create $TDIR on the other nodes in a multi-node setup
before trying to scp files into that director...
Colin Patrick McCabe
12:06 PM Bug #777: mount hung, tid timed out messages in log
I haven't fully determined everything going on here, but were you using anything besides the kernel client with this ... Greg Farnum
11:28 AM Feature #618 (Resolved): osd: allow reads from replicas
merged into master by commit:e7144e1458bb8b001ec8984471fe3989e5902de4 Colin McCabe
11:28 AM Feature #620 (Resolved): objecter: (optionally) read from replica if on localhost and primary is not
Merged into master at commit:e7144e1458bb8b001ec8984471fe3989e5902de4 Colin McCabe
10:12 AM Linux kernel client Bug #782 (Won't Fix): mount -o name=client.foo fails (name=foo succeeds)
Sage Weil
09:32 AM Bug #780 (Resolved): cosd crashes on old btrfs
commit:6bf53f18af087859013a5a5a57aac082cdf49beb Sage Weil
09:23 AM Bug #780: cosd crashes on old btrfs
This works fine on f54b793f3a3a63921bb47b1156667694604e68ee, btw. Greg Farnum
09:20 AM Bug #780 (Resolved): cosd crashes on old btrfs
... Greg Farnum
05:17 AM Revision b5f75927 (ceph): script: add script to build uml images
Set up a tap interface in your bridge, and then
./linux ubda=image mem=512M eth0=tuntap,tap0 ip=<ip> netmask=<netmas...
Sage Weil
05:13 AM Revision 390786ef (ceph): remove old script
Sage Weil
05:11 AM Revision 652fc3f0 (ceph): remove ancient web content
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:49 AM Revision d7af2102 (ceph): mon: ignore mds boot messages with zeroed port
On 0.24.2 I saw a zeroed port in the cmds log and in the mdsmap. Ignore
anything from a cmds with a zeroed port to p...
Sage Weil

02/06/2011

09:56 PM Revision 5a50d339 (ceph): client: more carefully gaurd local cache truncate
This fixes an assert when len=0 in file_to_extents when we get some weird
metadata from the MDS.
Fixes: #778
Signed-...
Sage Weil
09:37 PM Revision 5b754cb4 (ceph): msgr: make remove_pid_file fail silently if nothing to remove
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:04 PM Bug #778 (Resolved): csyn --syn walk fails on directory layout?
commit:5dde7be605d013109b48c0fcc66ca18244ea9ecf Sage Weil
10:14 AM Bug #778 (Resolved): csyn --syn walk fails on directory layout?
Sage Weil
10:15 AM Bug #779 (Can't reproduce): mds: loses port information
Found that the playground mds (0.24.2+) has a port of 0 in the mds log and mdsmap. Obviously nothing could connect. ... Sage Weil
04:43 AM Revision f77c889e (ceph): remove ancient mds_local_osd option
This used to add a few osds to the end of the osdmap, not included in the
crush map, with the assumption that they wo...
Sage Weil

02/05/2011

09:55 PM Revision d5fced13 (ceph): fakemessenger: remove fakemessenger and related dead code
FakeMessenger has hopelessly drifted from the current abstract interface;
all of this code is years old.
Signed-off-...
Sage Weil
09:50 PM Revision 0797b538 (ceph): clock: remove tare function
This was used way back when to set a clock's relative "zero" time to
simplify stat collection when running performanc...
Sage Weil
09:25 PM Revision ffd95ee9 (ceph): config: remove (most) unused options
Left in a few that may be used later:
client_unmount_timeout
osd_shed_reads*
osd_min_raid_width
osd_recovery_forget_...
Sage Weil
09:21 PM Revision 865d5cfb (ceph): client: set lru midpoint
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:09 PM Revision f975fdea (ceph): logger: don't calculate variance by default
Less memory, less CPU.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:19 PM Revision bd7a043f (ceph): msgr: check return values
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:18 PM Revision 6c47bde6 (ceph): msgr: check return values for write_pid_file
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:45 AM Bug #777 (Resolved): mount hung, tid timed out messages in log
I have a ceph cluster with 3 mons, 1 mds and 4 osds. I mounted the ceph filesystem on another machine using the defau... John Leach

02/04/2011

10:27 PM Revision b75846e5 (ceph): filestore: handle read_op_seq errors
Fixes: #775
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
09:42 PM Revision f54b793f (ceph): osd: Return error code when a write is too large.
This should prevent us from crashing the OSD by trying
to pass in a write that is larger than its journal can handle....
Greg Farnum
09:38 PM Bug #644: rsync can be sloooow
Try this?... Sage Weil
02:43 PM Bug #644: rsync can be sloooow
All right, ran into a wall unfortunately. :(
Because rsync does a transfer-to-temp-file-and-rename, and because rena...
Greg Farnum
08:55 PM Revision e22be43d (ceph): config: add new osd_max_write_size option.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:49 PM Linux kernel client Bug #776 (Resolved): infinite loop on snap trace update
on commit:42961d2333a1855c649fa3790e258ab4f0fa66a4... Sage Weil
07:00 PM Revision ff0e57d7 (ceph): msgr: SimpleMessenger::start: fix unchecked return
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:57 PM Revision a454f237 (ceph): msgr: remove deadcode, warnings
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:53 PM Revision 4b1d86c8 (ceph): msgr: remove_pid_file: fix unchecked returns
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:02 PM Revision e58c3718 (ceph): os: FileJournal::do_write: fix error handling
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:47 PM Revision a87a3c11 (ceph): Merge branch 'wannoying2'
Conflicts:
src/os/FileJournal.cc
Colin Patrick McCabe
05:28 PM Revision a2abe5f5 (ceph): assert.h: Add ceph_abort
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:17 PM Revision 23d733a2 (ceph): os: _check_disk_write_cache: fix error handling
Fix error handling; use popen instead of a temporary file.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
04:50 PM Revision 712f589a (ceph): cmds: check return code of chdir
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:45 PM Revision cf1b9694 (ceph): unittest_encoding: build without -fstrict-aliasing
unittest_encoding gets "this breaks strict-aliasing" warnings when
compiled under Ubuntu. So don't crank up optimizat...
Colin Patrick McCabe
04:28 PM Revision b5d27514 (ceph): FileStore::mount: fix error handling
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:27 PM Revision e1df71fb (ceph): os: fix zeroing first chunk of FileJournal
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:10 PM Bug #775 (Resolved): FileStore::read_op_seq doesn't check errors properly
commit:b75846e5d9983bd25435e33795e4b615354c2cab Sage Weil
02:14 PM Bug #775 (Resolved): FileStore::read_op_seq doesn't check errors properly
FileStore::read_op_seq should check ::read and ::open errors.
Currently it just prints a message but returns the b...
Colin McCabe
02:22 PM Revision f45d5566 (ceph): os: FileStore::mkjournal: fix error handling
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:16 PM Revision b1196b87 (ceph): os: FileJournal::create: fix error handling
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:09 PM rgw Tasks #774 (Rejected): move to radosgw.git
Sage Weil
02:08 PM Revision 38877bb8 (ceph): cmon: check return code of chdir
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:07 PM rgw Feature #772 (Rejected): correct 100-continue handling
Sage Weil
02:06 PM rgw Feature #772 (Rejected): correct 100-continue handling
Sage Weil
02:06 PM rgw Feature #773 (Resolved): rgw: efficient list-objects filtering
filter list object results efficiently (on osd side) so that O(num results) passes over the network instead of O(buck... Sage Weil
02:05 PM rgw Feature #771 (Resolved): rgw: POST
Sage Weil
02:05 PM rgw Feature #770 (Resolved): rgw: logging
Sage Weil
02:05 PM rgw Feature #769 (Closed): cache pool acls, pool list, etc.
We need to cache anything we need to in radosgw so that most requests translate into a single request on the backend. Sage Weil
02:04 PM rgw Bug #768 (Resolved): eliminate any read/modify/write races
i think mainly in bucket creation? user creation? Sage Weil
02:04 PM rgw Feature #767 (Resolved): rgw: incremental/large file uploads
Sage Weil
02:04 PM rgw Bug #766 (Resolved): slow bucket creation times
may not still be a problem; just need to confirm. this is a monitor tuning thing. Sage Weil
02:03 PM Cleanup #765 (Resolved): remove config option cruft
Lots of these are old, dead, or (if they're still there) should be dead. Sage Weil
01:06 PM Feature #760 (Resolved): librbd: Include headers in the development package
Actually rados/buffer.h is just a symlink to include/buffer.h and is actually under the ceph namespace and shared bet... Yehuda Sadeh
09:06 AM Feature #760: librbd: Include headers in the development package
It's declared in rados/buffer.h, so it should be librados?
Btw, you just added AIO write with a bufferlist without...
Wido den Hollander
12:43 PM Bug #756 (Resolved): OSD should have max write size
Pushed in commit:988d83881f53b3026fece3f043800327261acfb5
There's no client-side checking at all, unfortunately, b...
Greg Farnum
12:42 PM Revision 5be24ee2 (ceph): tools/common: EINTR/unchecked return fix
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:39 PM Revision 00bd2cb8 (ceph): rados: fix pedantic gcc warning on writing stdout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:24 PM Revision c956ee2e (ceph): filejournal: fix unchecked returns, modularize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe

02/03/2011

11:54 PM Revision b780b91a (ceph): filer: fix probe
ENOENT is allowed and expected at the end of a journal. It is treated the
same as a zero size object.
Fixes 800507e...
Sage Weil
11:24 PM Revision 5e6f5569 (ceph): filestore: fix wipe_subvol the rest of the way
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
11:09 PM Revision 9fd1ae4c (ceph): filestore: fix (part of) wipe_subvol
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:42 PM Revision c1108126 (ceph): msgr: add nonce parsing to entity_addr_t::parse()
Reported-by: John Leach <john@brightbox.co.uk>
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:20 PM Revision af9bc4fc (ceph): auth/ExportControl: fix allow_def leak
allow_def would be leaked if the first branch is not taken.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
07:58 PM Revision 989ff5b0 (ceph): Merge branch 'stable'
Conflicts:
src/common/signal.cc
src/os/FileStore.cc
Sage Weil
07:58 PM Revision 400813cc (ceph): ReplicatedPG: snap_trimmer fix leaked lock
Previous patch 7a02070b741d3482ff6b28827c1eb274a2134486 leaks the pg
lock.
Signed-off-by: Samuel Just <samuel.just@d...
Samuel Just
07:54 PM Revision e49dced7 (ceph): signal: fix redefine warnings
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:54 PM Revision 18d882bc (ceph): mds: make SETDIRLAYOUT start with ancestor policy
When we set a new policy on a directory, make the starting point the
current policy, as specified by an ancestor. Re...
Sage Weil
07:35 PM Revision 02ce7348 (ceph): MonMap: add,remove,contains can take const references
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
07:34 PM Revision 88313ee0 (ceph): SimpleMessenger: learned_addr can take a const ref
peer_addr_for_me can be passed as a const ref.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
06:31 PM Revision 7a02070b (ceph): ReplicatedPG:snap_trimmer should return if !clean or !active or !primary
The PG may become !clean or !active while in the osd snap_trim_wq.
Signed-off-by: Samuel Just <samuel.just@dreamhost...
Samuel Just
06:15 PM Revision 17788929 (ceph): cosd: check chdir return code
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:43 PM Revision 2daa45e3 (ceph): osd: Ager: remove Ebofs leftovers
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:15 PM Revision 52866751 (ceph): Add do_autogen.sh
It's handy when making builds with various different warning levels.
Also saves you from having to remember --prefix,...
Colin Patrick McCabe
04:03 PM Revision 09c38e8e (ceph): mkcephfs: clean up temp files, honor $TEMPDIR
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:51 PM Revision 87c022c6 (ceph): client/fuse_ll.c: const cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:50 PM Revision 5f18f20d (ceph): Revert "client/fuse_ll: fully initialize callback struct"
Older build environments may not have all the fuse function ptrs that
are present in newer ones. It's best to just le...
Colin Patrick McCabe
03:47 PM Revision f0059c25 (ceph): Revert "client/fuse.cc: explicity zero fuse function ptrs"
Older build environments may not have all the fuse function ptrs that
are present in newer ones. It's best to just le...
Colin Patrick McCabe
03:42 PM Revision 7b295033 (ceph): Merge branch 'master' of ssh://ceph.newdream.net/git/ceph
Colin Patrick McCabe
03:13 PM CephFS Feature #764 (Rejected): mds: make anchor table scale
The anchor table is current kept completely in memory. This won't scale forever for large numbers of anchors, especi... Sage Weil
02:46 PM Bug #763 (Resolved): fix nonce parsing in entity_addr_t::parse()
commit:c1108126b47b335098fa613583409c97d58b0a53 Sage Weil
02:44 PM Revision 6d1f3588 (ceph): os: FileStore::wipe_subvol: fix removing old dir
Older versions of btrfs which don't have BTRFS_IOC_SNAP_DESTROY need to
manually remove snapshots.
Signed-off-by: Co...
Colin Patrick McCabe
02:11 PM Revision 92ded7e1 (ceph): cosd.cc: better error message if osd->init fails
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:15 PM Bug #756 (In Progress): OSD should have max write size
Have a patch prepared to enable this on the OSD side, but I want to think a bit about if there's a good way to do thi... Greg Farnum
12:36 PM Revision 9ca0f7c3 (ceph): Merge branch 'wannoying'
Colin Patrick McCabe
12:14 PM Revision 7b43f6ba (ceph): mds: handle_client_setdirlayout: init fl_pg_pool
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Greg Farnum
12:14 PM Bug #759: osd: pgs spend a long time peering when marking osds out
the replica scrub needs to go in a different work queue (not op_wq). scrub_wq, or something else that's assigned to ... Sage Weil
12:02 PM Bug #759: osd: pgs spend a long time peering when marking osds out
this appears to be scrubbing related:
- we get a new osdmap. handle_osd_map tries to pause the op threadpool.
- ...
Sage Weil
10:24 AM Bug #759 (In Progress): osd: pgs spend a long time peering when marking osds out
Sage Weil
12:10 PM Revision 8961ce63 (ceph): librados: print out errs in do_import, check EINTR
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:10 PM Revision 7e4cec3a (ceph): osd/OSD: remove always-true assert
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:10 PM Revision e533a563 (ceph): mon/OSDMonitor: don't check if unsigned < 0
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:10 PM Revision 8b283ff5 (ceph): tools/gui.cc: remove empty else statement
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:10 PM Revision e9837847 (ceph): mount.ceph: put static at the beginning of decl
To squash a gcc warning.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
12:10 PM Revision 38b50013 (ceph): client/fuse.cc: explicity zero fuse function ptrs
Set unused function pointers to 0 explicitly to please gcc. (They were
already implicitly zeroed because the containi...
Colin Patrick McCabe
12:10 PM Revision 0bcbe7c5 (ceph): crush: eliminate signed/unsigned comparisons
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:10 PM Revision ff64562a (ceph): include/utime.h: avoid comparing float with 0
Avoid doing an exact comparison of a float with 0.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
12:10 PM Revision 0c977e88 (ceph): const-ify some unchanging static structs
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision 516ac99b (ceph): client/fuse_ll: fully initialize callback struct
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision 800507e8 (ceph): osdc/Filer: assert(0) on stat error
Remove broken error handling code and replace with an assert(0) and a
TODO.
Signed-off-by: Colin McCabe <colin.mccab...
Colin Patrick McCabe
11:54 AM Revision e7e4c440 (ceph): rgw/rgw_rest.cc: fix uninitialized static struct
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision c9f4060c (ceph): include/buffer.h:unsigned variables are always >=0
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision fa18b743 (ceph): crush/builder.c: fix signed/unsigned confusion
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision f4f066b8 (ceph): src/config.cc: fully initialize g_default_file_layout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision e251f51f (ceph): direct_io_test: fix signed/unsigned, printf flags
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision 5684a7ec (ceph): Client::readdir_r_cb: fix "always returns true"cmp
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision 81c173b0 (ceph): osdc/ObjectCacher.h: don't assert unsigned >= 0
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:54 AM Revision e0be4b52 (ceph): mds/locks: fully initialize lock states
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:53 AM Revision 16c4563c (ceph): Merge branch 'master' of ssh://ceph.newdream.net/git/ceph
Colin Patrick McCabe
11:39 AM Bug #761 (Resolved): librbd: Include path to librados
commit:ee20a7921253eddf52f91e5e0be08543d8e59037 should fix it. Did some reshuffling, now the include files within the... Yehuda Sadeh
05:30 AM Bug #761 (Resolved): librbd: Include path to librados
In librbd.hpp and librbd.h we have:
*librbd.h*...
Wido den Hollander
11:37 AM Feature #760: librbd: Include headers in the development package
Applying the first patch. Shouldn't the namespace in the second one be ceph and not librados? Yehuda Sadeh
05:52 AM Feature #760: librbd: Include headers in the development package
Not worth opening a new issue, see the second patch.
Use the right namespace for string and bufferlist.
Wido den Hollander
05:07 AM Feature #760 (Resolved): librbd: Include headers in the development package
Right now librbd.h and librbd.hpp are not added to librbd1-dev
Attached is a patch which includes them.
Wido den Hollander
10:51 AM Bug #762 (Can't reproduce): cmon: PGMonitor::encode_pending() assert failure
from ML:... Sage Weil

02/02/2011

10:55 PM Revision e3fe2dbc (ceph): RGW: RGW::GetObj - signed/unsigned bug
get_obj returns an int, len is unsigned
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
10:38 PM Revision d47789d6 (ceph): RBD.cc: pread returns a signed value
The check for len < 0 here always returned true since len is uint64_t.
Signed-off-by: Samuel Just <samuel.just@dream...
Samuel Just
10:38 PM Revision 763fb136 (ceph): RGW: get_service_secret returns a bool, not an int
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
10:38 PM Revision 10ceea99 (ceph): RGW: fix shadowed variable error
int ret and bool ret shadow this->ret, setting ret before the goto
therefore had no effect in these cases.
Signed-of...
Samuel Just
10:38 PM Revision 960cd6d0 (ceph): RGW: fix ret in RGWCopyObj::init_common
int ret shadows RGWCopyObj::ret.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
10:33 PM Revision 4587f1fe (ceph): mount.ceph: option parsing fix
Passing -o secretfile would cause a segfault since searching for = would
result in a null pointer. New version check...
Samuel Just
09:09 PM Bug #759 (Resolved): osd: pgs spend a long time peering when marking osds out
On the playground (with lots of data), I see that some PGs spend a long time in peering state after marking an OSD as... Sage Weil
07:02 PM Revision 2b9cb24e (ceph): Fix base64-decoding when input ends in newline.
It used to return -EINVAL because it thought the end was not aligned
to 4 bytes.
Clean up superfluous src < end test...
Tommi Virtanen
06:04 PM Revision acf40be9 (ceph): Fix inner loop index variable. i and j are too close together ;)
Before this fix, j would keep increasing well beyond in_len,
and lead to segfaults.
Tommi Virtanen
05:57 PM Revision 0b166c31 (ceph): mds: Remove redundant cap suppression code.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:57 PM Revision 0b7174f6 (ceph): rados: Make rados tool "put" objects in chunks.
Defaults to 4MB, but can be configured.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
05:57 PM Revision cef3ce82 (ceph): mds: Remove unused variable.
Fixes compile complaint. Was made useless in commit
6d29fc66326ff40443ac465c84d6f7bddf3cee4f.
Signed-off-by: Greg Fa...
Greg Farnum
05:57 PM Revision c7ac6f92 (ceph): osd: Fix compile-time warning.
store is properly initialized inside a try block, but the
compiler doesn't notice that and so thinks it may be used
u...
Greg Farnum
05:10 PM Revision 0972202f (ceph): Speling fixes
Trivial spelling fixes...
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Chris Dunlop
05:10 PM Revision eb3f65a6 (ceph): Improve mount.ceph.8
Seeing as I was wondering what "rbytes" might be, I guess others might
too...
Signed-off-by: Sage Weil <sage.weil@dr...
Chris Dunlop
03:50 PM Revision cb897a5b (ceph): confutils: check return values
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:50 PM Revision 8ebcc80b (ceph): mds: check return values in dumper
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:50 PM Revision 471b1897 (ceph): monitorstore: check return values
And use open(2) instead of fopen.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:59 PM Bug #644: rsync can be sloooow
Discovered that the kernel client is clearing its I_COMPLETE flag on every newly-created directory due to how the cap... Greg Farnum
12:25 PM Revision e7144e14 (ceph): Merge branch 'objecter_balance_reads'
Colin Patrick McCabe
12:00 PM Revision 05f26581 (ceph): os: use scoped variables for fds
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:56 AM Revision e6030151 (ceph): osd: change unfound handling when using localize
Change how cosd handles unfound objects when doing operations with
localize_reads. Specifically, don't wait for unfou...
Colin Patrick McCabe
11:56 AM Revision 2faf6a09 (ceph): csyn: implement localize_reads cmdline option
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:56 AM Revision a4781df7 (ceph): test: add test_csyn
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:56 AM Revision a28dbc93 (ceph): objecter: send read to random replica if BALANCE_READS flag is set
The flag is just an existing one; we may want to rename.
This assumes the OSD will discard any request sent prior to...
Sage Weil
11:56 AM Revision e165f8ea (ceph): objecter: implement FLAG_LOCALIZE_READS
Read from a local replica, if any.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:56 AM Revision 087d4159 (ceph): libceph: iface for CEPH_OSD_FLAG_LOCALIZE_READS
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:56 AM Revision 7912f5c7 (ceph): cfuse: add --localize-reads argument
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:56 AM Revision a4c9bdf7 (ceph): Hadoop: set localize_reads
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:29 AM Bug #758 (Resolved): dd-truncate elsewhere-dd again is slow
Jim Schutt has reported some troubling behavior apparently involved with client caps surrounding distributed (but not... Greg Farnum
11:05 AM Bug #757 (Resolved): creating multiple snapshots through librbd can lead to inconsistent state
Fixed in commit:a23903c5d8122d330cc159fb0f6ae0a7d72dd349 by rereading the header after creating or removing a snapshot. Josh Durgin
11:03 AM Bug #757 (Resolved): creating multiple snapshots through librbd can lead to inconsistent state
With the following sequence of events:
create image foo
create snap of foo
resize image foo
create snap of foo
...
Josh Durgin
10:54 AM Bug #756 (Resolved): OSD should have max write size
To prevent crashes if an incoming write is larger than the journal, we need an OSD max write size in the config. OSD ... Greg Farnum
09:57 AM Cleanup #755 (Resolved): Compartmentalize rados.cc
The rados tool code is pretty disgusting at this point. It consists entirely of a main function, with only usage() an... Greg Farnum
09:55 AM Feature #624 (Resolved): radostool: make 'put' write large objects in chunks
commit:0b7174f6aa072b6c0d8291a6c87a3991f8c401cd
Set chunk size with -b, defaults to 4MB.
Greg Farnum

02/01/2011

10:39 PM Revision ece4f61a (ceph): FileStore: fix double close
curr_fd is already closed if cp == cur_seq. This second close
occasionally ended up closing another thread's fd. Th...
Samuel Just
07:02 PM Revision 06fe1320 (ceph): FileStore: fix double close
curr_fd is already closed if cp == cur_seq. This second close
occasionally ended up closing another thread's fd. Th...
Samuel Just
06:54 PM Revision 16ffcf77 (ceph): common: config.cc: use "admin" as the default id
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:28 PM Revision 79ed51fd (ceph): mds: make --dump-journal preserve offset
Suggest user use tar -S to preserve sparseness.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:25 PM Revision ffd549ba (ceph): common: move init_g_conf into md_config_t ctor
Make sure that g_conf is initialized with default values before anything
else happens.
Signed-off-by: Colin McCabe <...
Colin Patrick McCabe
05:22 PM Revision 196b817b (ceph): common: config.cc: whitespace cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:09 PM Bug #754 (Resolved): mkfs script should clean up its entries in /tmp, and use TMPDIR
Our mkfs shell script creates stuff in /tmp and doesn't clean it up. Should use "trap" to ensure that this stuff gets... Colin McCabe
04:33 PM Revision a4375329 (ceph): common: config.cc: de-globalize g_fake_kill_after
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:13 PM Revision 6a03ffbe (ceph): common: config.cc: de-globalize show_config
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:06 PM Revision 68f929a6 (ceph): common: clean up g_conf.id initialization a bit
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
02:23 PM Revision 4380920b (ceph): common: remove ceph_set_default_id
ceph_set_default_id was only ever used to set the default ID to "admin",
which it already was.
Signed-off-by: Colin ...
Colin Patrick McCabe
12:37 PM Bug #752: High I/O wait when writing data
While running rsync I sometimes see these messages appearing inside the VM:... Wido den Hollander
12:13 PM Feature #753 (Resolved): Setting identity for syslog
Right now all syslog messages get their process name as their identity in syslog, for example:... Wido den Hollander
12:03 PM Revision 49e238b5 (ceph): FileStore: fix error handling for mkfs, umount
In FileStore::umount: check if FDs are valid before closing them. Make
them invalid after closing them. Shut down Fil...
Colin Patrick McCabe
11:08 AM Linux kernel client Bug #750: run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"
Jeff Wu wrote:
> ceph client driver is from ceph-client.git unstable
There are some error handling fixes recentl...
Sage Weil
09:38 AM Feature #719 (Resolved): mds journal dump should be simpler
commit:79ed51fd5bc30756f6a4f2e682c90a4e085f43d0 Sage Weil
05:10 AM Revision 2373972f (ceph): gitignore: ignore eclipse metadata
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:09 AM Revision a45e8f2d (ceph): remove ancient active/ stuff
Sage Weil
04:58 AM Revision f2713332 (ceph): osd: don't leak fd on error
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:55 AM Revision 4d50af76 (ceph): crypto: don't clobber errno
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:24 AM Revision 6dc8994b (ceph): Merge remote branch 'origin/ostimeo'
Sage Weil

01/31/2011

05:32 PM Bug #644: rsync can be sloooow
Started taking a look at this again. Greg Farnum
04:23 PM Bug #735: Manual drive pull testing hangs filesystem
merged by commit:6dc8994b750631c15e88553fd4fabdd9e4907989 Sage Weil
03:29 PM Bug #735: Manual drive pull testing hangs filesystem
The ioctl timeout is now implemented in the ostimeo branch ( commit:2a266bd09d0db3b8d8c4f33a101229de1a4301a3 ) Colin McCabe
03:37 PM Linux kernel client Cleanup #168 (Closed): new truncate sequence
we don't use the inode .truncate op.
Sage Weil
12:23 PM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
Were the osds running on top of btrfs? Also, could you post the ceph.conf? Samuel Just
11:22 AM Bug #752: High I/O wait when writing data
Oh, I forgot my bonnie++ results. Wido den Hollander
11:22 AM Bug #752: High I/O wait when writing data
I've done some benchmarks today in the VM and noticed something weird.
In the VM I ran:...
Wido den Hollander
10:40 AM CephFS Feature #630: release caps on inodes unlinked by other clients
Putting this back in the queue since we've pushed it back past 1.0. Greg Farnum
10:39 AM CephFS Bug #551 (Can't reproduce): cfuse crash on quick mds restart
When I created this Sage told me that this wasn't actually the cause, and it was a different bug. I can't seem to rep... Greg Farnum

01/30/2011

05:34 AM Revision 548f329f (ceph): Merge branch 'mds_reset'
Fixes: #602 Sage Weil
05:17 AM Revision 6137c789 (ceph): Merge remote branch 'origin/stable'
Conflicts:
src/osd/OSD.cc
Sage Weil

01/29/2011

09:41 PM CephFS Feature #602 (Resolved): mds: handle corrupt/missing journals
no repair flag, but journal discard and cluster reset are in place. not especially useful yet without the actual rep... Sage Weil
09:25 PM Bug #563 (Closed): osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
Sage Weil
08:33 PM Bug #740 (Can't reproduce): osd's not sharing osdmaps with osd_ping
Sage Weil
02:24 AM Revision 0f3198e8 (ceph): OSD: update_osd_stat take heartbeat_lock
Previously update_osd_stat had a race with code modifying heartbeat_from
causing the iterator increment to occasional...
Samuel Just
12:56 AM Revision cf59cdf7 (ceph): mds: skip a few more inodes during journal reset
To be safe...
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:55 AM Revision f8f2cae0 (ceph): mds: open mydir (along w/ root) inode from boot_start()
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
12:55 AM Revision e405d4a4 (ceph): mds: implement journal reset
This basically works. Remaining issues:
- mydir and root inodes are recreated from scratch but need to be
reconc...
Sage Weil
12:48 AM Revision 14c669c3 (ceph): Locker: Drop loner correctly!
Our previous check for if we want to drop the loner was incorrect.
Now, it's fixed. Resolves a serious bug with inode...
Greg Farnum
12:47 AM Revision 04728b85 (ceph): librados: fix C interface const, too.
See 561224e95d6c66661d1bd6dce0e3d9da6f4a7e13
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
12:47 AM Revision 9b42e8eb (ceph): Locker: Drop loner correctly!
Our previous check for if we want to drop the loner was incorrect.
Now, it's fixed. Resolves a serious bug with inode...
Greg Farnum

01/28/2011

11:24 PM Revision 02408ecf (ceph): config: remove dead stringtable cruft
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:57 PM Revision 9e4325b2 (ceph): mds: defer sending resolves until mdsmap.failed.empty()
There is no point sending resolves while there are still failed nodes,
since we can't complete. We also trigger an a...
Sage Weil
08:35 PM Revision f711508b (ceph): mds: defer sending resolves until mdsmap.failed.empty()
There is no point sending resolves while there are still failed nodes,
since we can't complete. We also trigger an a...
Sage Weil
05:45 PM Revision aed862b4 (ceph): mds: standardize option parsing
- Use the standard macros.
- Simply --hot-standby and --journal-check options (always specify rank).
- Update usage()...
Sage Weil
05:05 PM Revision 334a7564 (ceph): common: _dout_lock: initialize _dout_lock first
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:15 PM Feature #739 (Resolved): 'ceph health' should be warning when pg's are not all active
Fixed by commit:50976297fd227664f3b451d77a11863259a18f48 Colin McCabe
04:08 PM Bug #751 (Resolved): _dout_lock is used after been disctucted.
Fixed by commit:334a7564a21017e78f66ee0c73dcec4aacbb7244 Colin McCabe
01:56 PM Bug #751: _dout_lock is used after been disctucted.
I have confirmed that _dout_lock is *not* the last thing to be destroyed.
Jan 28 05:57:42 flab cconf: creating mu...
Colin McCabe
01:29 PM Revision e373d95e (ceph): common: Put debug.cc last in linking order
debug.cc needs to be last in the linking order so that its global
constructors will be initialized first and destroye...
Colin Patrick McCabe
01:17 PM Bug #752 (Resolved): High I/O wait when writing data
Like I said on IRC, I'm seeing a high load on my machine "noisy".
The setup is:
* Intel(R) Xeon(R) CPU 5110 1.6G...
Wido den Hollander
01:07 PM Bug #715 (Closed): btrfs: extent-tree.c ( btrfs_free_block_group & release_global_block_rsv)
Closing this one, seems after all this was a real btrfs bug. Fixed in the latest tree. Wido den Hollander
12:19 PM Revision 2a266bd0 (ceph): os: FileStore: Add commit timeout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:05 PM Bug #746: core dump on radostool failure
Although it wasn't apparent from my bug report, 027335afe30127f841a5ea875e173ffc4cd7cf91 was a parent revision of 5a1... Colin McCabe
10:53 AM Bug #746 (Resolved): core dump on radostool failure
this looks like it is fixed by commit:027335afe30127f841a5ea875e173ffc4cd7cf91. Sage Weil
12:01 PM Revision eda48faf (ceph): rbd: Rados::init: clean up after failure
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:27 AM Bug #714 (Won't Fix): ceph -s ,show osd number error
It still says 6 because the osd existed at one point in time. There isn't a command to mark it as nonexistent curren... Sage Weil
10:52 AM Linux kernel client Bug #730: connection resets from kclient
I'm hoping this is caused by the bad error handling in try_read() and try_write(). Need to do some more testing befo... Sage Weil
10:51 AM CephFS Bug #663 (Can't reproduce): cmds segfaults
Hopefully this will turn up later. The new asserts will let us catch it sooner and get a useful stack dump. Sage Weil
10:51 AM Bug #652: ReplicatedPG _scrub missing clone
Explanation: _scrub in the case that it hits a missing clone currently fails an assert and dies. It might be worth h... Samuel Just
10:46 AM CephFS Bug #733 (Can't reproduce): cmds crash: mds/LogEvent.cc:88: FAILED assert(p.end())
Hmm not enough to go on I'm afraid. If you see this again please let us know! Sage Weil
10:42 AM Revision 1ea8f2b3 (ceph): os:FileStore:use std::string rather than huge bufs
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
10:42 AM Revision 184048cb (ceph): os: FileStore: remove default param
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
10:42 AM Revision ba5cbb36 (ceph): os: FileStore: ctor should init all class vars
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:24 AM Revision 35442744 (ceph): osd: fix mutual exclusion for _dispatch
We want only one thread dispatching messages (either new or requeued), so
that we can preserve ordering. Previously ...
Sage Weil
05:33 AM Revision 50976297 (ceph): Merge remote branch 'origin/health2' into unstable
Sage Weil
01:12 AM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
ceph-client unstable:
make menuconfig:
kernel hacking:
i selected :
[*] RCU debugging: sparse-based checks ...
changping Wu
12:27 AM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
git ceph-client-standalone.git master-backport:
if printk "caps stale" ,then cosd blocked.
mon osd mds is at the ...
changping Wu
12:53 AM Linux kernel client Bug #750: run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"
ceph client driver is from ceph-client.git unstable changping Wu
12:46 AM Linux kernel client Bug #750: run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"
I try to reproduce this issue.
but ceph-client.git unstable + ceph 0.24.2 ,
one mon ,one mds ,two osd at the same h...
changping Wu

01/27/2011

09:56 PM Bug #749: test pipe queue if is empty??V0.24.2
Sage Weil wrote:
> Is there a stack dump in the log (/var/log/ceph/whatever.log)? That will include which revision ...
longguang yue
09:36 PM Bug #749: test pipe queue if is empty??V0.24.2
Is there a stack dump in the log (/var/log/ceph/whatever.log)? That will include which revision of the code you're r... Sage Weil
09:39 PM Bug #744: osd class changes take too long to propagate
I think we should use the 'subscribe' mechanism to pass class changes (just the names of updated classes?) to osds. ... Sage Weil
09:34 PM Linux kernel client Bug #750: run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"
From the error is sounds like the OSDs are down. Can you include 'ceph -s' output? Sage Weil
09:31 PM Bug #751 (Resolved): _dout_lock is used after been disctucted.
cconf -c /etc/ceph/ceph.conf -l mon
------------gdb-------------------------
common/Mutex.h: In function 'void Mute...
longguang yue
06:48 PM Revision adffe53e (ceph): units: add signals unit test
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:47 PM Revision bb72d95d (ceph): mds: cluster_fail instead of reset_cluster
Mark all cluster members as failed, and blacklist. Do not force up/failed
ranks to stopped, as that requires the adm...
Sage Weil
04:34 PM Revision e59007c5 (ceph): mon: add mdsmap DOWN flag to prevent mdsmap updates
This is intended to be set while doing critical cluster manipulation to
avoid cmds instances from starting up and get...
Sage Weil
04:34 PM Revision 9ebd7735 (ceph): mon: add 'mds reset_cluster' command
Reset an MDS cluster back to a single node. The idea is:
- wipe out mds journals
- maybe set recovery flag
- mds...
Sage Weil
04:34 PM Revision c3d4014f (ceph): mdsmap: add flags
Convert unused client_epoch field to flags to avoid a protocol change. It
is always 0 on current clusters. Lucky us!...
Sage Weil
03:54 PM Revision 73e76723 (ceph): .gitignore: vstart generated files
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:53 PM Revision 0c2ec2d0 (ceph): vstart: put tmp files in /tmp
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:23 AM Bug #735: Manual drive pull testing hangs filesystem
We need to be ready to handle unresponsive FileStores in general. Even if the underlying filesystem is 100% perfect (... Colin McCabe
08:52 AM CephFS Feature #602: mds: handle corrupt/missing journals
see mds_reset branch.
Still need a process to reset the individual mds instances. This should:
- advance the jou...
Sage Weil

01/26/2011

10:05 PM Revision 561224e9 (ceph): librados: Remove rados_pool_t& usage, and pointless consts.
For some reason when I wrote this I passed rados_pool_t by reference
in some functions instead of by value. It's just...
Greg Farnum
08:00 PM Linux kernel client Bug #750 (Won't Fix): run "dd", printk " libceph: osd1 172.16.10.68:6805 socket closed"
Hi
i git ceph-client-standalone.git master-backport
build and insmod it.
ceph server: ceph 0.24.2
OS:linux-2.6...
changping Wu
06:08 PM Revision 7d65f6ea (ceph): osd: restart if the osdmap client, heartbeat, OR cluster addrs don't match
If we somehow get ourselves into a situation where the OSDMap addresses do
not match our actual addresses, restart an...
Sage Weil
06:08 PM Revision fbcf6690 (ceph): osd: preserve ordering when ops are requeued
Requeue ops under osd_lock to preserve ordering wrt incoming messages.
Also drain the waiter queue when ms_dispatch t...
Sage Weil
06:08 PM Revision 47dc27a6 (ceph): osd: avoid extraneous send_boot() calls
Only send_boot() on osdmap update if we are restarting. Otherwise we can
end up with too many MOSDBoot messages in f...
Sage Weil
05:51 PM Bug #749 (Can't reproduce): test pipe queue if is empty??V0.24.2
void SimpleMessenger::dispatch_entry()
line 299
#0 0x00007fd99adc96c5 in raise () from /lib64/libc.so.6
Missing s...
longguang yue
05:29 PM Revision fce3c8d8 (ceph): test_unfound.sh: kill cosds rather than mark out
For this test, we need to kill cosds rather than mark them as out.
Otherwise, we cannot force objects to become unfou...
Colin Patrick McCabe
04:04 PM Revision 6685baeb (ceph): disable scrubs during test_unfound
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:45 PM Revision 8aff0dd1 (ceph): mon: implement PGMonitor::get_health
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:54 PM Revision 5c5408bf (ceph): mon: OSDMonitor::get_health: const cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:47 PM Revision 23a7e88d (ceph): mon: MonitorStore::mkfs: use run_cmd
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:25 PM Feature #748 (Rejected): Design/implement a RADOS copy
It would be nice if the OSDs could copy objects between themselves, rather than requiring the client to do a read and... Greg Farnum
01:09 PM Revision 9ecd5fe7 (ceph): os: FileStore: use run_cmd instead of system
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
01:08 PM Revision b68e1218 (ceph): common: Add run_cmd
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
12:53 PM Bug #737 (Resolved): don't use system() in daemons
fixed by commit:b68e121815fa1e551b074899517bf745795a2e75, commit:9ecd5fe7d001455c840e818e4e1f7b1f74bc1017, commit:23a... Colin McCabe
12:51 PM Feature #618: osd: allow reads from replicas
todo: add libceph interface to this Colin McCabe
11:41 AM Linux kernel client Bug #747 (Can't reproduce): ladder0 crash
... Sage Weil
11:34 AM Bug #746 (Resolved): core dump on radostool failure
radostool failed because of a problem with some daemons. However, instead of failing gracefully, it dumped core.
O...
Colin McCabe
10:53 AM Feature #745 (Resolved): qa: radosgw tests
creating, removing buckets. creating, fetching, removing objects. listing objects. Sage Weil
10:42 AM Bug #741 (Resolved): Removing rbd snapshots causes weird things to happen
That was fixed at commit:da2d50180dfdc0e30b4348f2acceb2be650f20b7. Yehuda Sadeh
10:40 AM Bug #744 (Won't Fix): osd class changes take too long to propagate
The class timeout is too long, we might want to consider some notification mechanism to let osds find out about new c... Yehuda Sadeh
10:18 AM Bug #743 (Resolved): osd: broken ordering when pg ops are requeued
commit:fbcf66906e67adbe6769ba7b1853dd0161e977c6 Sage Weil
10:03 AM Bug #743 (Resolved): osd: broken ordering when pg ops are requeued
Incoming messages race with requeued ops and get out of order. This is problematic for osd_sub_op's in partricular.
...
Sage Weil
01:02 AM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
if printk " ceph: mds0 caps renewed"
,then ,system hang.
[ 3495.283726] libceph: tid 139576 timed out on osd0,...
changping Wu

01/25/2011

11:49 PM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
run ceph server and ceph client at the same machine. changping Wu
11:48 PM Linux kernel client Bug #742: ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale",then oops
ceph server : ceph-0.24.2 changping Wu
11:47 PM Linux kernel client Bug #742 (Won't Fix): ceph-client.git unstable , tiotest ,printk "timeout, ceph: mds0 caps stale"...
hi ,
i git ceph-client.git ,checkout to unstable.
build it at ubuntu 10.04 ,make install,reboot,
then run as linux...
changping Wu
10:36 PM Revision ba998f05 (ceph): ReplicatedPG: _rollback_to fix the just cloned condition
_rollback_to in the case that head was just cloned and that clone
includes snapid does not need to do anything. Prev...
Samuel Just
10:12 PM Revision bc98a0fb (ceph): MDSMonitor: fix bugs with standby-replay assignment.
We were accidentally passing gid instead of rank into find_standby_for!
Also, if we got an MDS with rank -1 we went a...
Greg Farnum
10:12 PM Revision f19ef8eb (ceph): dumper: rework slightly to prevent incorrect usage of g_conf.id.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:40 PM Revision 81c0ec1a (ceph): mon: remove PGMap::pg_set
We don't need an additional data structure to hold the keys to pg_stat.
We can just look at the keys of pg_stat.
Sig...
Colin Patrick McCabe
05:14 PM Bug #741: Removing rbd snapshots causes weird things to happen
You're right. Thanks. Andrew F
05:03 PM Bug #741: Removing rbd snapshots causes weird things to happen
This might be something that we already fixed. The older rbd class (version 1.2) had this issue which was fixed in ve... Yehuda Sadeh
04:56 PM Bug #741 (Resolved): Removing rbd snapshots causes weird things to happen
A quick transcript should explain:... Andrew F
05:09 PM Revision 212289f6 (ceph): vstart: Add --standby_mds setting, for auto-creating standby-replays.
Greg Farnum
05:08 PM Revision 5ce82c94 (ceph): Merge branch 'standby_replay' into unstable
Greg Farnum
04:58 PM Revision b17de794 (ceph): mon: PGMap::apply_incremental must maintain pg_set
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:50 PM Revision dba7c1fd (ceph): Merge branch 'testing' into unstable
Conflicts:
configure.ac
src/Makefile.am
src/common/common_init.cc
src/common/debug.h
src/common/signal.cc
src/c...
Sage Weil
04:38 PM Revision 58637acc (ceph): debian: fix publish.sh for ubuntu
Sage Weil
03:24 PM Revision 5beec0b0 (ceph): os: readdir_r: read into PATH_MAX-sized buf
Fix the readdir_r uses in FileStore.cc
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
03:24 PM rbd Feature #699 (In Progress): support snapshot notify
changes done in commit:9c01177349b435186025a088f612a6f5ce2f3de9, still needs to get it upstream. Yehuda Sadeh
03:22 PM Linux kernel client Feature #738 (Resolved): ino32 mount option
done, commit:9c01177349b435186025a088f612a6f5ce2f3de9. Yehuda Sadeh
08:03 AM Linux kernel client Feature #738 (Resolved): ino32 mount option
expose 32-bit inos via readdir and stat. Sage Weil
03:22 PM Linux kernel client Feature #700 (Resolved): rbd: support snapshot notify
done, commit:fb021e0010a209bca2e4b206f7bfc47379f704e8 Yehuda Sadeh
03:20 PM Feature #298 (Resolved): osd: "watch" operation
done as of osd commit:7e206dfec07033d49770bafcc2c86f6dd736d7c2, ceph-client commit:fb021e0010a209bca2e4b206f7bfc47379... Yehuda Sadeh
03:06 PM Bug #727 (Rejected): sysfs: cannot create duplicate filename '/devices/rbd/0'
Yeah, probably a duplicate of #693, closing it (should be Rejected-Duplicate status but there isn't one yet). Yehuda Sadeh
03:00 PM Revision 0fbbbad8 (ceph): os: FileStore::mkfs error handling fixes
Clean up all resources on every exit path. Don't allocate multiple
PATH_MAX buffers on the stack when one will do. Fi...
Colin Patrick McCabe
02:30 PM Bug #736 (Resolved): Buffer overflow in FileStore::collection_list_partial
Fixed by commit:5beec0b00ccf1dce3358154367558d9a4e320e93 Colin McCabe
02:28 PM Bug #734 (Resolved): incorrect filestore errno checks
Addressed in commit:0fbbbad8dd53603f06d226dca2c309236d3d85b8 and commit:6b4ee9ed3ed0f19966269357cac31ea594c0bcd3 Colin McCabe
02:04 PM Bug #722 (Resolved): Fix use of g_conf.id
Fixed with standby-replay stuff and commit:f19ef8eb03521507cbb4f0071484d1419eafc03d Greg Farnum
01:36 PM Bug #740: osd's not sharing osdmaps with osd_ping
this is contention on map_lock. something is holding it for too long. Sage Weil
10:56 AM Bug #740 (Can't reproduce): osd's not sharing osdmaps with osd_ping
on playground, latest testing Sage Weil
12:57 PM Bug #735: Manual drive pull testing hangs filesystem
I have a Quarch box in the lab that I was just pointed to. It has an ssh interface to power cycle drives for failure... Brian Chrisman
08:56 AM Bug #735: Manual drive pull testing hangs filesystem
Shouldn't btrfs be able to detect that the disk is gone and return appropriate error codes itself, rather than hanging? Greg Farnum
08:29 AM Bug #735: Manual drive pull testing hangs filesystem
Yep, this is a problem. The errors are causing btrfs operations to hang instead of return error codes.
What sho...
Sage Weil
10:37 AM Feature #739 (Resolved): 'ceph health' should be warning when pg's are not all active
Sage Weil
10:32 AM CephFS Bug #733: cmds crash: mds/LogEvent.cc:88: FAILED assert(p.end())
Odd, I can't repro this anymore. It was either fixed by some change between 785bf0fcbfb69efa8dd97340c8ee0079bb5ad55e ... Ravi Pinjala
08:31 AM CephFS Bug #733: cmds crash: mds/LogEvent.cc:88: FAILED assert(p.end())
Can you restart the mds with 'debug mds = 20' so we can see what events are getting replayed and which decode is fail... Sage Weil
09:06 AM Feature #687 (Resolved): Let the monitor auto-set idle MDSes as standbys
Merged to unstable in commit:5ce82c94a1df22accefba18d04eab9c7672e7c02.
Just set "mds standby replay = true" in the...
Greg Farnum
09:05 AM Feature #686 (Resolved): Specify standby behavior in configuration file
Merged to unstable in commit:5ce82c94a1df22accefba18d04eab9c7672e7c02. Greg Farnum

01/24/2011

08:53 PM Revision f7572de5 (ceph): v0.24.2
Sage Weil
07:06 PM Revision bf749eee (ceph): Merge branch 'unstable' into standby_replay
Greg Farnum
06:59 PM Revision 943fd14f (ceph): filestore: don't wait min sync interval on explicit sync()
Also, if we do wait longer, wait on the same cond.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:59 PM Revision 3a30eb75 (ceph): msgr: include con in debug output
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:59 PM Revision 4a49a87d (ceph): msgr: make connection pipe reset atomic
Close a small and unlikely race.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:57 PM Revision b7bc4711 (ceph): MDSMap: split up find_standby_for into multiple functions.
Usage of this function is rapidly diverging, in terms of what
is desired.
We now have "find_standby_for", which selec...
Greg Farnum
06:57 PM Revision 95ed2e4a (ceph): MDSMonitor: Try to assign unassigned standby-replay MDSes during tick()
We can now specify an MDS as standby-replay and let the monitor
assign it to any MDS. The monitor will only assign it...
Greg Farnum
06:57 PM Revision 74ce6e1b (ceph): MDSMonitor: On restarting MDSes; set to standby-replay if appropriate.
This way, if the primary MDS crashes and is replaced, but is supposed
to standby-replay its secondary on recovery, it...
Greg Farnum
06:57 PM Revision 76fbdf17 (ceph): MDSMap: Update/fix print function.
It previously didn't look at standby_for_name unless standby_for_rank
was set!
Also, we now let it print out standby...
Greg Farnum
06:57 PM Revision 69447f79 (ceph): man: Update cmds manual.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
06:57 PM Revision a9824b04 (ceph): MDS: MDSMonitor: Make MDS set standby-replay preferences, not MDSMonitor.
The MDS has more information about its configuration than the MDSMonitor
does. Therefore, encode that information int...
Greg Farnum
06:57 PM Revision 1b45088e (ceph): MDSMonitor: be more conservative with use of pending_mdsmap.
Use the current mdsmap when looking for MDSes to standby-replay for,
as that way we know the other MDS is already up....
Greg Farnum
06:57 PM Revision 785c4987 (ceph): MDSMonitor: remove unused code.
Greg Farnum
06:57 PM Revision f4a07df9 (ceph): MDSMonitor: Don't create new map for standby-replay spam.
If an MDS is unable to get into the standby-replay state for some
reason (MDS it should be following doesn't exist ye...
Greg Farnum
06:57 PM Revision cc207ca5 (ceph): MDSMonitor: Adjust handling of MDSes asking for standby-replay.
1) If the MDS does not specify an MDS to follow, we mark them as
standing-by for -2. MDSMap::find_standby_for() has b...
Greg Farnum
06:57 PM Revision 54bdac70 (ceph): mds: Adjust replay state changes and options parsing.
The MDS used to interpret g_conf.id as a rank. It no longer does
so and requires that standby ranks/names be set via ...
Greg Farnum
05:45 PM Revision ea2266d0 (ceph): os: fix minor typo in function defs
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:44 PM Revision 6b4ee9ed (ceph): os: fix some obvious error handling problems
Fix some errors like checking errno when it may not have been set, doing
other operations which may change the value ...
Colin Patrick McCabe
05:43 PM Bug #737 (Resolved): don't use system() in daemons
We should replace system() with a simple wrapper function that just does fork + exec.
The use of system() is wrong...
Colin McCabe
05:02 PM Bug #376: File corruption after cluster crashes
if the bug is resolved? longguang yue
04:16 PM Bug #736 (Resolved): Buffer overflow in FileStore::collection_list_partial
FileStore::collection_list_partial calls readdir_r without actually allocating space for the dirent. A properly craft... Colin McCabe
03:44 PM Revision 017495db (ceph): Makefile: use CXXFLAGS more consistently
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
03:44 PM Revision 76a1e09a (ceph): Makefile: remove unecessary header check
We already check for libcrypto++ using PKG_CHECK_MODULES; we don't need
to fish for header files.
Signed-off-by: Col...
Colin Patrick McCabe
03:44 PM Revision ffb208e3 (ceph): Makefile: use new Spirit headers where available
Use new boost::spirit header files where available, to eliminate the
annoying compiler warning on newer systems.
Sig...
Colin Patrick McCabe
03:43 PM Revision a2a7d039 (ceph): test: Add test_rw
Test reading and writing lots of objects from the object store.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost....
Colin Patrick McCabe
02:42 PM Feature #619: objecter: optionally read from replicas
Sigh. Reposting patch for clarity:... Colin McCabe
02:41 PM Feature #619 (Resolved): objecter: optionally read from replicas
I ran test_rw.sh with the following patch:
diff --git a/src/osdc/Objecter.h b/src/osdc/Objecter.h
index 37a23fb.....
Colin McCabe
12:31 PM Bug #735 (Resolved): Manual drive pull testing hangs filesystem
It appears that drive failure problems in my configuration are not making their way up through the stack to kill off ... Brian Chrisman
11:27 AM Bug #734 (Resolved): incorrect filestore errno checks
Sage Weil
 

Also available in: Atom