Project

General

Profile

Activity

From 12/13/2010 to 01/11/2011

01/11/2011

11:16 PM Revision 9a5a7fe4 (ceph): Git ignored files cleanup.
Make gitignore entries not match recursively.
I wanted to introduce a directory "osdmaptool" to contain cli tests
fo...
Tommi Virtanen
10:49 PM Bug #704 (Closed): it hang both in client and osd
1.# sync ;in client,it hang ......never return
2.in osd0,
# ls /date/osd0 ;hang there,never return.
longguang yue
09:21 PM Bug #703: 0.23.1update to 0.24.1,old ceph can not run
Core was generated by `/usr/bin/cosd -i 0 -c /tmp/ceph.conf.7906'.
Program terminated with signal 11, Segmentation f...
longguang yue
09:06 PM Bug #703: 0.23.1update to 0.24.1,old ceph can not run
Core was generated by `/usr/bin/cosd -i 0 -c /tmp/ceph.conf.7906'.
Program terminated with signal 11, Segmentation f...
longguang yue
08:54 PM Bug #703: 0.23.1update to 0.24.1,old ceph can not run
meanwhile,osd dead,but ceph -s show 2 osds is up and in. longguang yue
08:50 PM Bug #703 (Resolved): 0.23.1update to 0.24.1,old ceph can not run
so after i mkcephfs
1.mds {0=up:creating},
2.osd can not run,even i run cosd -i -c ceph.conf.
osd.log:
journal ...
longguang yue
06:29 PM Revision 61bd155f (ceph): osd: OSD::queue_pg_for_deletion: avoid double del
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
06:22 PM Revision 27f172f4 (ceph): osd: OSD::queue_pg_for_deletion: avoid double del
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
05:50 PM Revision 6e6c22ea (ceph): mds: avoid double-pinning stray inodes
We make multiple iterations through populate_mydir(). Only pin each stray
once. Fixes #689 and crashes like
mds/CI...
Sage Weil
05:22 PM Revision 83d3fb2c (ceph): debian: make update_pbuilder.sh a bit smarter
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:32 PM Bug #701 (Rejected): osd: scrub crash, failed clone_size assert
was running old code, problem went away with latest testing branch Sage Weil
03:25 PM Bug #701: osd: scrub crash, failed clone_size assert
on kvmtest-osd1 and 2 Sage Weil
03:24 PM Bug #701 (Rejected): osd: scrub crash, failed clone_size assert
... Sage Weil
03:31 PM Bug #702 (Resolved): osd: failed interval_set assertr in snap_trimmer FAILED assert(p->first > st...
... Sage Weil
12:48 PM Bug #696 (Resolved): osd: _put_pool, assert(p->num_pg > 0)
Colin McCabe
11:29 AM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
Tried your fix, seems to be working, but like I said on IRC, I can't be 100% sure. I couldn't reproduce it today, so ... Wido den Hollander
10:05 AM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
See if this fixes it up: commit:27f172f4c7d1afafeda8c9ec1d530f5e02ea1b7a
in the latest unstable.
Colin McCabe
07:32 AM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
Colin McCabe wrote:
> Also, Sage, is there a reason why PG::~PG() is doing a pool->put(), but OSD::_remove_pg() also ...
Greg Farnum
05:21 AM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
Hi Colin,
The logging was indeed low. But I was able to reproduce the crash yesterday with a higher logging enable...
Wido den Hollander
10:57 AM Linux kernel client Feature #700 (Resolved): rbd: support snapshot notify
Sage Weil
10:55 AM rbd Feature #699 (Resolved): support snapshot notify
Sage Weil
10:49 AM Bug #698 (Resolved): cosd memory usage with large number of pools
I reported this on the mailing list a week ago but never filed it here. Still present in 0.24.1.
I've got a 3 nod...
John Leach
10:00 AM CephFS Cleanup #588 (In Progress): Allow Inodes to have multiple parent Dentries
Okay, going to swap this with an implementation using sets rather than lists, then done! Greg Farnum
09:36 AM CephFS Bug #689 (Resolved): cmds crash: mds/CInode.h:1088: FAILED assert(ref_set.count(by) == 0)
Thanks, Ravi. There's no corruption. Fixed by commit:6e6c22ea23918e72211eb9cbdac64aec4abf7e7c. Sage Weil
12:30 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Okay, everything is fine now... got rid of ENOSPC and had a scrub error that would not repair. I upgraded to 0.24.1 a... ar Fred

01/10/2011

11:51 PM Revision e9a70f15 (ceph): keyring: can parse plain text keyring files
Yehuda Sadeh
11:41 PM Revision e189222f (ceph): ReplicatedPG: Fix bug in rollback
Previously, _rollback_to assumed that the rollback was a noop if
ctx->clone_obc was set and it's prior version matche...
Samuel Just
11:20 PM Revision 1229af51 (ceph): Port encoding tests over to gtest.
Tommi Virtanen
11:19 PM Revision 9b0d5776 (ceph): Use Google Test framework for unit tests.
Use ``make check`` to run the tests.
The src/gtest directory comes from ``svn export
http://googletest.googlecode.co...
Tommi Virtanen
11:07 PM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
Hi Wido,
I took a quick glance at the logs in osd_3_crash_rgw_pool_remove.tar.gz, but it looks like those are the ...
Colin McCabe
07:41 PM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
It looks like we already have "_get_pool" and "_put_pool" printed out in the logs when the relevant functions are cal... Colin McCabe
12:47 AM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
I'm able to reproduce the crash on my 'noisy' machine.
After setting the logging to 20 I got:...
Wido den Hollander
10:07 PM Revision 1a9ef3fa (ceph): Make git ignore generated files.
Tommi Virtanen
11:30 AM Feature #697: create rbd library
We've discussed it on IRC, might be usefull as reference: http://irclogs.ceph.widodh.nl/index.php?date=2011-01-10 Wido den Hollander
11:25 AM Feature #697 (Resolved): create rbd library
As suggested by Wido: we can create a new library -- librbd (similar to librados) that'll allow modifying rbd images.... Yehuda Sadeh
10:32 AM Bug #694: no osd journal causes cosd segfault
fixed by commit:1846355176ecd5b60e93da8cf20cea0ef6ed4326 Colin McCabe

01/09/2011

10:23 PM CephFS Bug #689: cmds crash: mds/CInode.h:1088: FAILED assert(ref_set.count(by) == 0)
Log sent in email due to paranoia about security ;)
Also, I've verified that I get the same crash with 0.24.1. The...
Ravi Pinjala
09:34 PM Revision 18463551 (ceph): os: don't crash on no-journal case
JournalingObjectStore::commit_start should handle the case where journal is
null. This will occur if the user doesn't...
Colin Patrick McCabe
08:28 PM Bug #696: osd: _put_pool, assert(p->num_pg > 0)
We may want to add get/put pool debug prints (if there aren't any already) so we can nail this one down... Sage Weil
01:36 AM Bug #696 (Resolved): osd: _put_pool, assert(p->num_pg > 0)
Yesterday I tried to remove some pools from my Ceph system on 'noisy', running unstable commit eace4398cb163a670ff6bb... Wido den Hollander

01/08/2011

08:02 PM Support #695 (Closed): two questions about osd and kclient,important!!
This is a bug tracker, not a question-answer forum or a wiki. Please send your questions to the developer list and we... Greg Farnum
07:03 PM Support #695 (Closed): two questions about osd and kclient,important!!
1.ceph-client-standalone.git ,ceph-client.git
what is the different of the client.git???i know ceph-client-standal...
longguang yue
02:11 PM CephFS Bug #689: cmds crash: mds/CInode.h:1088: FAILED assert(ref_set.count(by) == 0)
Can you attach the whole log? Sage Weil
02:08 PM CephFS Bug #689: cmds crash: mds/CInode.h:1088: FAILED assert(ref_set.count(by) == 0)
Here's (a portion of) the mds log with debug mds = 20:
2011-01-08 14:16:35.724241 b68bdb70 mds0.cache.ino(100) pro...
Ravi Pinjala
01:45 PM CephFS Tasks #366 (Resolved): test snaptests against clustered mds failures
Sage Weil
01:35 PM Linux kernel client Bug #634 (Can't reproduce): Kernel client takes too long to recover after a MDS restart
Sage Weil
01:34 PM Linux kernel client Cleanup #683 (Rejected): confused of struct ceph_xattr
I'm not too concerned about the naming here; all xattrs are inode xattrs. Sage Weil
12:50 AM Revision 630565f3 (ceph): v0.24.1
Sage Weil

01/07/2011

11:01 PM Revision eace4398 (ceph): test_split.sh: add many_pools test
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
10:25 PM Revision a64ddbb6 (ceph): ReplicatedPG: get_object_context ssc refcount leak
If obc->obs.ssc is non-null, the second get_snapset_context ends up
leaking a snapset reference.
Signed-off-by: Samu...
Samuel Just
10:17 PM Revision 7f357cf2 (ceph): mds: force dirfrag fragmention when replaying metablob
We can have non-auth (and thus ambiguously fragmented) dirs in our cache.
When those get replayed, adjust our fragmen...
Sage Weil
10:17 PM Revision 63125adb (ceph): mds: make thrash_exports select random frags
We were always picking the first frag.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:17 PM Revision e6ee6c4f (ceph): mds: take rdlocks on bounding dftlocks; clean up migrator lock code
We need to take an rdlock on bounding dirfrags during migration for a
rather irritating reason: when we export the bo...
Sage Weil
10:17 PM Revision fde4b97e (ceph): mds: fix _dout_lock recursion recursion
The get_snaps() method also something to dout. We need to take care to
not do that as part of the ostream operator<<...
Sage Weil
09:45 PM Revision d2dbe9e9 (ceph): ReplicatedPG: register_object_context and register_snapset_context cleanup
Previously, get_object_context and get_snapset_context did not register
the resulting objects. In some cases, these ...
Samuel Just
08:40 PM Revision 2da3ac14 (ceph): librados: check for initialization before doing certain operations
Yehuda Sadeh
07:22 PM Revision 7b20c290 (ceph): librados: fix api declaration
Yehuda Sadeh
06:50 PM Revision baae4bc7 (ceph): librados: add rados->version, include librados.h from .hpp
Yehuda Sadeh
06:46 PM Revision a3424776 (ceph): librados-config: add man page
Yehuda Sadeh
06:45 PM Revision 98105ad7 (ceph): librados-config: added a command line tool to dump librados version
Yehuda Sadeh
03:50 PM Bug #694 (Resolved): no osd journal causes cosd segfault
When ceph.conf doesn't include an "osd journal" configuration option, cosd will throw a segfault. Noah Watkins
03:36 PM Bug #646 (Can't reproduce): [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git +...
Sage Weil
02:45 PM Linux kernel client Bug #693: Adding a non-existent image creates a bad sysfs node
commit:ee0450e4ede0dde6b4c721dfbd0a578b5e13053c in the unstable branch. Yehuda Sadeh
02:41 PM Linux kernel client Bug #693: Adding a non-existent image creates a bad sysfs node
That's commit:64f2c53e25165d78808f482651a92685fd28ffe9. Yehuda Sadeh
02:41 PM Linux kernel client Bug #693 (Resolved): Adding a non-existent image creates a bad sysfs node
Fixed in 64f2c53e25165d78808f482651a92685fd28ffe9. Had a bad cleanup sequence. Yehuda Sadeh
11:48 AM Linux kernel client Bug #693 (Resolved): Adding a non-existent image creates a bad sysfs node
To reproduce:
echo "10.0.1.247 name=admin rbd doesnotexist" > /sys/bus/rbd/add
/sys/bus/rbd/devices/0 is create...
Josh Durgin
02:25 PM Bug #666: OSD: ReplicatedPG::eval_repop
FWIW, I seem to be able to reproduce this at will.
I've been trying to understand what I'm seeing in the logs - it...
Jim Schutt
02:00 PM CephFS Bug #684 (Resolved): unstable fails some workunits
fixed by commit:fde4b97e8c4fe5e7a657815ba59449a66e85ecba Sage Weil
12:00 PM CephFS Bug #684: unstable fails some workunits
Attached the full-thread backtrace of the MDS after it hangs on snaptest-upchildrealms. Greg Farnum
12:47 PM Bug #691 (Resolved): librados: Segfault when open_pool is run when not initialized
Yehuda Sadeh
12:25 PM Bug #691: librados: Segfault when open_pool is run when not initialized
Fixed with commit 2da3ac1499944e7aac358a47a4d5651f3a1257e0. Yehuda Sadeh
04:14 AM Bug #691 (Resolved): librados: Segfault when open_pool is run when not initialized
While working with phprados I experienced a crash when I tried to open a pool when I did not initialize librados.
...
Wido den Hollander
11:13 AM Bug #673 (Resolved): cmon: SimpleMessenger::Pipe::discard_queue
Sage Weil
11:03 AM Feature #334 (Resolved): Version in librados
Sage Weil
11:03 AM Feature #334 (In Progress): Version in librados
Sage Weil
10:43 AM Feature #334: Version in librados
rados->version() implemented in commit:baae4bc7c5390681d4549bdfb9ec4204aa515eb1.
Yehuda Sadeh
10:02 AM Feature #334: Version in librados
Thanks, but this has only been done in the librados C library, not in the C++. Has this been done un purpose (just to... Wido den Hollander
04:30 AM phprados Feature #692 (Resolved): Throw an exception if we try to open a pool before we are initialized
Fixed by commit b5d2a5871c8cf7da6f350060055a7d83da9f3330
We now throw an exception if we try to open a pool before...
Wido den Hollander
04:15 AM phprados Feature #692 (Resolved): Throw an exception if we try to open a pool before we are initialized
Related to #691 Wido den Hollander
04:25 AM phprados Bug #690 (Resolved): Segfault when a xattr has no value during listing
Both fixed by bb3c344b69133a9307e076cca4523af38a233e21 Wido den Hollander
04:17 AM phprados Bug #690 (In Progress): Segfault when a xattr has no value during listing
Same goes for getxattr, example:... Wido den Hollander
04:06 AM phprados Bug #690 (Resolved): Segfault when a xattr has no value during listing
Right now phprados segfaults during the return value array assignment when we list the xattr's of a object, where one... Wido den Hollander

01/06/2011

11:59 PM Revision 86653700 (ceph): ReplicatedPG: clone_overlap should contain one entry per clone
Previously, writefull and _delete_head would remove the last
entry from snapset.clone_overlap. Now, the last entry b...
Samuel Just
11:44 PM Revision 3a4afd98 (ceph): mon: Introduce Monitor::leader_since
Introduce Monitor::leader_since to keep track of when the current
monitor became the leader.
Signed-off-by: Colin Mc...
Colin Patrick McCabe
11:44 PM Revision a9420716 (ceph): osd: Rename osd_mon_report_interval
Rename osd_mon_report_interval to osd_mon_report_interval_min.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
Colin Patrick McCabe
11:44 PM Revision 668c0d96 (ceph): mon: remove obsolete comment
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:44 PM Revision 5947769e (ceph): mon: don't allow Monitor to be copied
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:44 PM Revision fc032afd (ceph): osd: Introduce osd_mon_report_interval_max
After every g_conf.osd_mon_report_interval_max seconds, we send out a PG
stat update even if nothing has changed. Thi...
Colin Patrick McCabe
11:44 PM Revision 396dce9e (ceph): mon: Always forward the PGStats to the leader
Always forward the PGStats to the leader, even if they are the same as
the old PGStats. The leader will mark as down ...
Colin Patrick McCabe
11:44 PM Revision 494b812c (ceph): mon: mark osds down for not sending MOSDPGStat
PGMonitor::prepare_pg_stats should check to see if the stats in the
MOSDPgStats message are the same as the ones we a...
Colin Patrick McCabe
11:44 PM Revision 2a7aad1b (ceph): osd: Create all_osds_die test
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:39 PM Revision 0ef19e6b (ceph): Merge branch 'standby_replay' into unstable
Greg Farnum
11:37 PM Revision 3df23749 (ceph): mds: Add is_any_replay() method and fill it in as appropriate.
This way we don't need to remember to call all three of is_replay(),
is_standby_replay(), is_oneshot_replay().
Signe...
Greg Farnum
10:50 PM Revision b1b4c03e (ceph): Merge remote branch 'origin/unstable' into standby_replay
Conflicts:
src/cmds.cc
src/mds/MDS.cc
src/mds/MDS.h
Greg Farnum
10:43 PM Revision 8dbe529e (ceph): librados: add library api versioning
Yehuda Sadeh
09:44 PM CephFS Bug #689: cmds crash: mds/CInode.h:1088: FAILED assert(ref_set.count(by) == 0)
Apparently the objdump is a bit big to post as an attachment. I've uploaded it at http://p-static.net/cmds-objdump.gz Ravi Pinjala
09:42 PM CephFS Bug #689 (Resolved): cmds crash: mds/CInode.h:1088: FAILED assert(ref_set.count(by) == 0)
I just upgraded to Ceph 0.24, and when I started up the cluster I got
this crash.
2011-01-06 21:23:48.744864 b685...
Ravi Pinjala
07:12 PM Revision 92824b64 (ceph): MDSMonitor: Do not set the rank of an MDS in standby-replay
or oneshot-replay modes.
This was causing issues with identification in various circumstances,
and turns out to be u...
Greg Farnum
07:12 PM Revision 7561f809 (ceph): MDSMonitor: Remove STATE_ONESHOT_REPLAY from takeover logic in tick().
If something dies during a journal-check we shouldn't have anybody
doing standby for them, so assert out!
Signed-off...
Greg Farnum
07:12 PM Revision f84a0909 (ceph): lru: change control flow and an assert to keep purpose clearer.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
07:12 PM Revision d14c7dd4 (ceph): elist: add a clear_list function.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
07:12 PM Revision ae4dda41 (ceph): mdcache: adjust trim() to handle running during standby-replay.
This just means it needs to handle files on the open list and not
trim them. Add a check for that with an assert, and...
Greg Farnum
07:12 PM Revision 8ca291c0 (ceph): mdlog: Add some helper functions for accessing segments map data.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
07:12 PM Revision b8b172d2 (ceph): MDS: make standby_trim_segments functional. Hurray, hot standbys work!
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
07:12 PM Revision ff7340aa (ceph): mds: After probing the journal, reset if we've fallen behind.
Previously, if the journal got trimmed and we missed log entries,
we failed out in the journaling step and stopped.
T...
Greg Farnum
07:12 PM Revision bf0675fa (ceph): journaler: Add init_headers function, call when reading head off disk.
Uninitialized headers were causing a failed assert during replay,
and there's no good reason to leave them set at the...
Greg Farnum
07:12 PM Revision 8d88f3b2 (ceph): mdlog: return EAGAIN if replay falls off the tail of the journal.
This can happen when we're following an active journal, and
would previously cause the MDS to shut down. Now we retur...
Greg Farnum
07:12 PM Revision 7472806b (ceph): MDS: add gids to the logger file names.
This is just to make differentiating between the standby's files
and stuff easier.
Signed-off-by: Greg Farnum <gregf...
Greg Farnum
07:12 PM Revision cd4cd01a (ceph): MDS: rename replay Contexts -- they were ambiguous at best.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
07:12 PM Revision 60a4189d (ceph): mdcache: change replay trimming a bit.
Previously we were re-inserting dentrys on the open list. But if
there weren't any other available dentrys to trim, t...
Greg Farnum
07:12 PM Revision b0676cb9 (ceph): journaler: delete Contexts on finish() in new functions.
Previously we weren't, and leaked memory.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
07:12 PM Revision 71f6a089 (ceph): MDS: miscellaneous standby-replay fixes and cleanups.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
07:12 PM Revision 3cb8c080 (ceph): Journaler: Remove the unused read_pos field.
Rename it to unused_field, fill the in-memory read_pos
from header.expire_pos, and fill unused_field with the expire_...
Greg Farnum
07:12 PM Revision b9aaa117 (ceph): MDLog: don't change expire_pos or read_pos on replay.
These are unnecessary or rendered irrelevant by previous commit
removing read_pos from the on-disk Header.
Signed-of...
Greg Farnum
07:12 PM Revision 3f5371cd (ceph): MDS: MDSMonitor: if MDS is in standby-replay and its leader goes down,
take over as the MDS!
This means we can now exit standby-replay.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
07:12 PM Revision 7dcbfb81 (ceph): MDS: make use of the hooks to start standby-replay.
This doesn't include trim, and there's no way to exit the replay!
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
07:09 PM Revision d55a6579 (ceph): osd, rados: pgls filter cleanups
Yehuda Sadeh
06:52 PM Revision cfd87cee (ceph): objecter: use raw_pg_to_pg when needed
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Sage Weil
06:35 PM Revision e1675f32 (ceph): Journaler: set state to STATE_ACTIVE in _finish_probe_end.
This was never actually getting set, although it doesn't matter
since STATE_ACTIVE and STATE_PROBING are defined to b...
Greg Farnum
06:35 PM Revision 8de5681d (ceph): Journaler: remove unused vector<snapid_t> snaps from recover().
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
06:35 PM Revision 25d6576a (ceph): Journaler: add new reread_head function and state.
This is to facilitate the forthcoming up_shadow MDS state.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
06:35 PM Revision 93597910 (ceph): Journaler: Add a read-only setting, and asserts to make it fail on writ...
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
06:35 PM Revision cfd839d2 (ceph): Journaler: Add reset() function, which returns it to the immediate post...
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
06:35 PM Revision fb074f45 (ceph): Journaler: Add function reprobe, to search for the new end of log.
Add new REPROBING state and split up new function probe() from _finish_read_head.
Signed-off-by: Greg Farnum <gregf@...
Greg Farnum
06:35 PM Revision 14829c41 (ceph): Journaler: use uint64_6 instead of int64_t.
Since the values can never be negative, this is far more appropriate,
and it results in fewer casts than the other wa...
Greg Farnum
06:35 PM Revision 18d89d13 (ceph): Journaler: Set the privacy of new functions correctly.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
06:35 PM Revision c9e1f992 (ceph): Journaler: redefine states to make them all unique.
Apparently PROBING and ACTIVE being identical was a mistake.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
06:35 PM Revision 8da26fde (ceph): Journaler: make reread_head an asynchronous function.
This better fits the spirit of the other functions, and the MDS itself.
Signed-off-by: Greg Farnum <gregf@hq.newdrea...
Greg Farnum
06:35 PM Revision a34a0cf8 (ceph): Journaler: make reprobe() an asynchronous function.
This better fits the spirit of the other functions, and the MDS itself.
Signed-off-by: Greg Farnum <gregf@hq.newdrea...
Greg Farnum
06:35 PM Revision cc0eef4b (ceph): mds: Create new STATE_ONESHOT_REPLAY for the MDS.
This takes over the previous behavior of STATE_STANDBY_REPLAY,
allowing standby-replay to be used for the upcoming co...
Greg Farnum
06:35 PM Revision 02d46401 (ceph): mds: rename is_standby_replay() to is_oneshot_replay.
This better represents its current purpose. Greg Farnum
06:35 PM Revision e30cbaa9 (ceph): mds: extend the use of uint64_t instead of (signed) loff_t, et al.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> Greg Farnum
06:35 PM Revision 6337842c (ceph): mds: add expire_pos to the ESubtreeMap.
This will allow more efficient trimming during standby_replay.
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
06:35 PM Revision 87e40c0f (ceph): journaler: add reread_head_and_probe function.
It does both so callers don't need to implement
intermediate bottom-half handlers.
Signed-off-by: Greg Farnum <gregf...
Greg Farnum
06:35 PM Revision a8a7e730 (ceph): MDS: Implement the hooks for standby_replay.
This commit adds the necessary state checks and machinery
for the MDS to go through a "looping" replay.
It does not y...
Greg Farnum
06:31 PM Revision 4f15b8fa (ceph): objecter, librados: propagate extra pgls info to client
Yehuda Sadeh
04:47 PM CephFS Bug #684: unstable fails some workunits
Logs for each case are on kai:/home/gregf/logs Greg Farnum
10:13 AM CephFS Bug #684: unstable fails some workunits
Updated with snaptest-2 issue. Greg Farnum
08:53 AM CephFS Bug #684 (Resolved): unstable fails some workunits
This is on both cfuse and the kclient, so I assume it's an MDS issue.
v0.24 worked correctly; bisecting this is diff...
Greg Farnum
04:40 PM Bug #688 (Resolved): should add dependency on libedit to rpm, deb, configure.ac
should add dependency on libedit to rpm, deb, configure.ac(04:35:35 PM)
One user saw this error because his libed...
Colin McCabe
03:02 PM Feature #609: osd: query pool/pg for objects with given xattr
yehudasa@skinny:~/ceph/src$ ./rados ls -p metadata --filter="parent 0x100"
602.00000000
dirino=100 dname=stray2 v=...
Yehuda Sadeh
03:01 PM Feature #687 (Resolved): Let the monitor auto-set idle MDSes as standbys
If we have more MDS daemons than configured MDSes, let the monitor set the idle ones as standbys automatically.
This...
Greg Farnum
03:00 PM Feature #686 (Resolved): Specify standby behavior in configuration file
We need ways to set certain MDSes as standbys of each other, in the config file. Greg Farnum
02:54 PM Bug #629 (Resolved): cosd segfaults when deleting a pool containing degraded objects
Sage Weil
02:52 PM Feature #334 (Resolved): Version in librados
Fixed in commit:42c240d389cd2811f61d0f85aa5422d5473bb7dd. Will be useful for qemu rbd. Yehuda Sadeh
02:49 PM Bug #666: OSD: ReplicatedPG::eval_repop
I have to use our managed transfer app for that; you should be getting an email about the transfer.
Let me know if...
Jim Schutt
12:46 PM Bug #666 (In Progress): OSD: ReplicatedPG::eval_repop
Sage Weil
12:46 PM Bug #666: OSD: ReplicatedPG::eval_repop
Can you post logs?
Thanks!
Sage Weil
10:59 AM Bug #666: OSD: ReplicatedPG::eval_repop
Hit another instance of this on current unstable (d386ea2ea0b).
I've got logs from debug osd = 20 + debug filestor...
Jim Schutt
02:47 PM CephFS Feature #91 (Resolved): mds: up:shadow mode
Well, this seems to be working as best I can tell.
There are some odd issues with virtual memory usage growing by ...
Greg Farnum
02:15 PM Feature #685 (Duplicate): libcephmon: interact with ceph monitors via a library
Hi,
Right now librados nor libceph give you a full internal look of how the Ceph system is behaving.
Some infor...
Wido den Hollander
10:36 AM CephFS Cleanup #588: Allow Inodes to have multiple parent Dentries
When the client changes are merged into testing it also passes snaptest-upchildrealms and snaptest-2. I suspect this ... Greg Farnum
09:19 AM CephFS Cleanup #588: Allow Inodes to have multiple parent Dentries
Pushed the branch uclient_dentries; it's only a few commits.
There were actually surprisingly few things that need...
Greg Farnum
10:35 AM Bug #682 (Resolved): higher pgp_bits setting leads to strange radostool errors
Looks like we have a winner...
Submitted the patch as commit:cfd87ceefb46358adaa1751975c8d3a6b063bdf9
C.
Colin McCabe
10:00 AM Feature #94 (Resolved): mon: monitor some osds
Implemented in the osd_timeo branch.
C.
Colin McCabe
02:16 AM Revision d386ea2e (ceph): common: dout_create_rank_symlink: init if needed
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
12:50 AM Revision 2d781ff6 (ceph): osd, rados: pgls filter fixes
Yehuda Sadeh
12:48 AM Revision 1f3b9c69 (ceph): mds: remove stray reference& in FragmentMarking context
Led to confusing occasional(!) crashes on marking completion.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
12:48 AM Revision 17bc2935 (ceph): mds: change refragment journaling/store strategy
We had a serious problem before where we were updating the cache and
redivvying up the dentries among fragments, but ...
Sage Weil
12:48 AM Revision d1d26a53 (ceph): mds: make adjust_dir_fragments always adjust fragtree
If we have the inode but no dirfrags, we still need to adjust the
inode dirfragtree.
Signed-off-by: Sage Weil <sage@...
Sage Weil
12:48 AM Revision cdf6af04 (ceph): Merge remote branch 'origin/testing' into unstable
Sage Weil
12:48 AM Revision 21d49d91 (ceph): mds: fix can_authpin assert on post-fragment commit
We wan to ignore the authpinnability check here; we already have the
(old) frag frozen, so no worries about starvatio...
Sage Weil
12:48 AM Revision 15c5e0c2 (ceph): mds: add mds_debug_frag option
Verify dirfragtree matches any open dirfrags.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil

01/05/2011

11:53 PM Revision 8adaa047 (ceph): common: make command-line programs log to stderr
command-line programs (as opposed to daemons) should send their logs to
stderr rather than to a log file, syslog, etc...
Colin Patrick McCabe
11:30 PM Linux kernel client Cleanup #683 (Rejected): confused of struct ceph_xattr
file xattrs.c,line 371,376
struct ceph_xattr ,it should be struct ceph_inode_xattr
longguang yue
11:29 PM Revision bba356b8 (ceph): rgw_admin: call common_set_defaults as non-daemon
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
10:10 PM Bug #682: higher pgp_bits setting leads to strange radostool errors
try this?... Sage Weil
05:01 PM Bug #682: higher pgp_bits setting leads to strange radostool errors
by the way, it should be 100% reproducible, just pull the latest unstable and run "./test/test_unfound.sh run" Colin McCabe
04:47 PM Bug #682 (Resolved): higher pgp_bits setting leads to strange radostool errors
git-bisect shows that commit:8f104243f823c3b5c827a4c7e59d637d38846e3f broke test_unfound.sh, test_lost.sh, and probab... Colin McCabe
08:40 PM Revision f3771689 (ceph): debian: update scripts to build ubuntu (maverick, lucid) packages too
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:33 PM Revision 6dce7b0f (ceph): mds: move flock types into separate header
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:18 PM Revision 4b2ba49e (ceph): rados tool: Remove duplicate line in usage
Signed-off-by: Wido den Hollander <wido@widodh.nl>
Signed-off-by: Sage Weil <sage@newdream.net>
Wido den Hollander
03:14 PM Feature #609 (In Progress): osd: query pool/pg for objects with given xattr
Yehuda Sadeh
12:34 PM Feature #679: build .debs for ubuntu releases
commit:f377168905b3d9530cca380b1c06ed34e7e343a1 Sage Weil
12:34 PM Feature #679 (Resolved): build .debs for ubuntu releases
Sage Weil
09:04 AM Cleanup #681 (Resolved): Incorrect usage radostool, duplicate line for 'getxattr'
thanks. commit:4b2ba49e287b6afe50e99772173f0c1482fffc13 Sage Weil
06:01 AM Cleanup #681 (Resolved): Incorrect usage radostool, duplicate line for 'getxattr'
Just a small thing I noticed today:... Wido den Hollander
02:25 AM Revision 065e2acb (ceph): common: generic_dout needs to take the dout mutex
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
01:18 AM Revision c46d8d5d (ceph): osd: add pgls filtering by parent ino
Yehuda Sadeh
01:02 AM Revision edd5686e (ceph): common: handle_fatal_signal: print threadid in hex
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
12:14 AM Revision 4a1c1b4a (ceph): mds: fix ancestor backtrace encoding
Use explicit types to capture the encoding. Include object ino in the
inode_backtrace_t so that the xattr can stand ...
Sage Weil

01/04/2011

11:27 PM Revision f66d4728 (ceph): frag: const cleanup fragset_t
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:27 PM Revision 4c53c726 (ceph): mds: make get_dirfrags_under behave when dirfragtree is not coherent wi...
This is (currently) the case during replay/resolve, although it's not
clear that it should be.
Signed-off-by: Sage W...
Sage Weil
11:27 PM Revision a5f9a36c (ceph): mds: make resolve adjust dir fragmentation as needed
During resolve, adjust dir fragmentation as needed based on the subtrees
the sender explicitly claims. The given fra...
Sage Weil
11:27 PM Revision 3d04d96b (ceph): mds: force fragmentation for ambiguous imports as well
Handle needed refragmentation for processing ambiguous bounds. That means
forcing the peers' subtree root fragmentat...
Sage Weil
11:01 PM Revision 1b0bb7a1 (ceph): osd, objecter: pgls filtering option
Yehuda Sadeh
10:38 PM Revision fab61391 (ceph): PG: Fixes bug in _scrub with checking clones
I introduced this bug in
4a4a1e53c7d380cd0b582c1d0685fd0ef4ef1711.
curclone++ not curclone--.
Signed-off-by: Samuel ...
Samuel Just
07:29 PM Revision 8f104243 (ceph): osd: set default pg_bits higher; pgp_bits to old value
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:50 PM Revision aafb79c7 (ceph): assert: print thread id in hex
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:27 PM Revision 4a4a1e53 (ceph): PG: Fix bug in scrub when checking clone sizes
Previosly, _scrub checked:
assert(p->second.size == snapset.clone_size[curclone])
curclone was, however, an index in...
Samuel Just
06:20 PM Revision 3d4e1d99 (ceph): client: fix frag selection code
Calling fragtree_t::contains() on a non-frag_t is nonsense and will crash.
And a fragtree is a complete partition of ...
Sage Weil
06:18 PM Revision 062c4cc8 (ceph): vstart.sh: specify keyring in ceph.conf
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:16 PM Revision d2578107 (ceph): remove ancient uofs.h
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:09 PM Revision 9befd1ea (ceph): mkcephfs: Clarified numosd message
Signed-off-by: Matthew Roy <matthew@royhousehold.net>
Signed-off-by: Sage Weil <sage@newdream.net>
Matthew Roy
03:10 PM CephFS Bug #594: mds: frag split/merge vs replay
commit:3d04d96b5cd0ab9d94904d035347c00fe19f687a fixes things for resolve. rejoin still needs work. Sage Weil
02:47 PM Feature #609 (Resolved): osd: query pool/pg for objects with given xattr
implemented it as a new PGLS_FILTER operation, but reuses the PGLS operation code. There's no class extension now, bu... Yehuda Sadeh
09:01 AM Support #680 (Closed): what's device_open,uofs_*
uofs is an ancient implementation OBFS, a userspace object file system from UCSC. It should probably be removed from... Sage Weil
05:39 AM Support #680 (Closed): what's device_open,uofs_*
what is uofs? device_open longguang yue
05:24 AM Revision 78f35a64 (ceph): osd: send sufficient log to compensate for replicas with last_complate ...
If a replica has last_complete < log.tail and no backlog, send enough log
for them to get back into a consistent stat...
Sage Weil
05:24 AM Revision ff035ab3 (ceph): osd: generate backlog if needed to get last_complete >= log.tail || bac...
If primary or a replica has a mistrimmed pg log, we need to generate the
backlog during peering. This sucks, because...
Sage Weil
05:24 AM Revision 462cb841 (ceph): osd: clean up backlog generation checks a bit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:24 AM Revision 88c445b1 (ceph): mds: start new log segment resolve start, not replay finish
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:24 AM Revision 6c73da0a (ceph): mds: assert no submit_entry during replay state
We should never submit items to the journal during replay.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
01:44 AM Revision 16c603b2 (ceph): common: Implement max open files
In init-ceph, call ulimit -n if the user has set a maximum number of
open files, and the current maximum number of fi...
Colin Patrick McCabe
12:23 AM Bug #665: mds(uo:replay),clent can not mount,return error 5 input/output
how to locate error in logs?
mds.log is attached.
Sage Weil wrote:
> You can't mount until mds replay completes. ...
longguang yue
12:14 AM Revision 5dc66244 (ceph): osd: Make g_conf.osd_max_notify_timeout a uint32_t
Make g_conf.osd_max_notify_timeout a uint32_t. Squashes an annoying
compiler warning and avoids the awkward issue of ...
Colin Patrick McCabe

01/03/2011

11:15 PM Revision aad9ffec (ceph): Merge branch 'testing' into unstable
Sage Weil
10:33 PM Revision b40e7dc0 (ceph): mds: load root inode on replay if auth
If we are auth for the root inode, load it's initial value off of disk. We
may not see it in the log if it has not be...
Sage Weil
10:15 PM Revision 20593b0d (ceph): msgr: Unlock dispatch_queue.lock when short-circuiting queue_received.
Previously we left the mutex locked, which is obviously bad bad bad!
I believe this was the cause of #673.
Signed-of...
Greg Farnum
09:14 PM Revision 4efa3006 (ceph): filestore: assert on out of order journal pipeline submissions
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:14 PM Revision 259c509a (ceph): filestore: fix wake condition when journal submission blocks
We only want to wake up if we are at the front of the line, in order to
preserve journal submission pipeline ordering...
Sage Weil
09:12 PM CephFS Bug #663: cmds segfaults
added assert in commit:6c73da0a996421b108f19d7f1b56157cec281642 that will catch this sooner. Sage Weil
09:12 PM Bug #590 (Resolved): osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
fixed in commit:ff035ab31c524cfe63cc75d81ab706b5b7d35448 Sage Weil
08:22 PM Revision 62e4cdea (ceph): common: print thread ID in sig handlers and assert
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
07:52 PM Feature #609: osd: query pool/pg for objects with given xattr
Can we use the list objects operation (LSPG?) as a model here? That's basically what it does.. it grabs up to N item... Sage Weil
05:27 PM Feature #609: osd: query pool/pg for objects with given xattr
The problem that I see is that this is going to be a very slow operation, and we need to figure out how not to timeou... Yehuda Sadeh
07:50 PM Revision 15dcc651 (ceph): mds: fix purge_stray for directories, zeroed layouts
- We don't want to purge file content on directories
- Don't fall over if a file has a zero period
Reported-by: Paul...
Sage Weil
07:37 PM Revision 266cc6c6 (ceph): rbd: add watch option for rbd tool
Yehuda Sadeh
06:30 PM Revision 6cdfa304 (ceph): osd: PG::Info::History: init last_epoch_clean
It seems that we have not been zeroing
PG::Info::History:last_epoch_clean when the History structure is
created. This...
Colin Patrick McCabe
06:24 PM Revision 7e83b570 (ceph): Merge branch 'testing' into unstable
Conflicts:
configure.ac
Sage Weil
06:24 PM Revision 56861bd5 (ceph): Merge remote branch 'origin/keyring_cleanup' into unstable
Sage Weil
06:15 PM Revision 1d5a69ff (ceph): debian: try to update pbuild env as needed
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:14 PM Revision 9ad05cf7 (ceph): SimpleMessenger.cc: Fixes a dispatch_throttler leak in queue_received
when the pipe has been halted.
Signed-off-by: Samuel Just <samuelj@hq.newdream.net>
Samuel Just
06:09 PM Bug #674: tiobench stress test , OSD timeout

Hi ,
ceph 0.24, git ceph-client-standalone:master-backport.
It's very easy to reproduce this issue with tiotest.
...
changping Wu
03:02 PM Bug #674: tiobench stress test , OSD timeout
are the cosd instances running? what does 'ceph -s' show? it looks like it's just blocking because the osds aren't ... Sage Weil
05:11 PM Feature #668 (Resolved): Change number of open files in init script
Implemented by commit:4018fb2f3bb5ee546134f93904ea8608617a8f2a Colin McCabe
02:19 PM CephFS Bug #671 (Resolved): mds: fragstat problem after mds restart
Sage Weil
02:19 PM CephFS Bug #671: mds: fragstat problem after mds restart
fixed by commit:b40e7dc0f73bc24631e8cd79d68d790733aafb99 Sage Weil
01:58 PM Bug #673: cmon: SimpleMessenger::Pipe::discard_queue
I believe this should be fixed by commit:20593b0d38d5357c89b93fac8c06e2083fa56df9. Greg Farnum
01:01 PM Bug #672 (Resolved): osd: crash from ballpit0
fixed by commit:259c509a8941bf7cdad8bd4ede0ccd73ca8a83d3 Sage Weil
12:44 PM Bug #672: osd: crash from ballpit0
It seems like we are getting some out-of-order messages, just like in bug 666.
Here is the one that crashed us:
2...
Colin McCabe
01:01 PM Bug #666 (Resolved): OSD: ReplicatedPG::eval_repop
fixed by commit:259c509a8941bf7cdad8bd4ede0ccd73ca8a83d3 Sage Weil
12:27 PM CephFS Bug #678 (Resolved): What if period == 0 in MDCache...purge_stray?
fixed by commit:15dcc65199fc825ca8c51a31de3be01410aca9c1 Sage Weil
11:36 AM CephFS Bug #678: What if period == 0 in MDCache...purge_stray?
Based on what I was doing when this asplode I'd say it's either a directory or a file created but not yet written to ... Paul Komkoff
09:53 AM CephFS Bug #678: What if period == 0 in MDCache...purge_stray?
The question is whether this was a regular file (shouldn't have a 0 period!) or a directory (we need to delete the me... Sage Weil
10:55 AM Bug #676 (Resolved): inconsistent handling of g_conf.keyring
Sage Weil
10:46 AM Bug #669 (Rejected): testradospp doesn't build
Sage Weil
09:47 AM Bug #665: mds(uo:replay),clent can not mount,return error 5 input/output
You can't mount until mds replay completes. Probably it is blocked on osd io? Can uou repeat with 'debug ms = 1' in... Sage Weil

01/02/2011

08:52 PM Revision 924355df (ceph): auth: CEPH_KEYRING overrides g_conf.keyring
Allow users to choose different keyring files by setting an environment
variable, CEPH_KEYRING.
Signed-off-by: Colin...
Colin Patrick McCabe
08:20 PM Feature #679 (Resolved): build .debs for ubuntu releases
should just be a matter of setting up pbuilder roots? Sage Weil
08:19 PM Revision 13e8507b (ceph): auth: make g_conf.keyring a plain old string
Make g_conf.keyring a plain old string rather than an array of strings.
Don't do substitution using the user's HOME v...
Colin Patrick McCabe

01/01/2011

05:04 AM CephFS Bug #678 (Resolved): What if period == 0 in MDCache...purge_stray?
Program terminated with signal 8, Arithmetic exception.
#0 0x0000...
Paul Komkoff

12/30/2010

11:15 PM Revision 789cb9a4 (ceph): osdmaptool: better error handling
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:12 PM Revision e2c0f404 (ceph): common: bufferlist: handle EINTR, check close rval
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
10:50 PM Revision 51462d6b (ceph): common: bufferlist::read_file: return read errors
Don't ignore errors when reading a file with buffer::list.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
Colin Patrick McCabe
06:38 PM Revision adbfd95c (ceph): cconf: code cleanup
cconf: add a better usage() message, with examples. Give more helpful
error messages when the usage is wrong. Put dif...
Colin Patrick McCabe
06:38 PM Revision 323ac1a7 (ceph): qa: rbd: disable automatic keyring discovery
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
06:38 PM Revision 634cca95 (ceph): qa: rbd: Update rbd test to use the new rbd sysfs
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
06:38 PM Revision c846615d (ceph): qa: rbd: small optimization to script
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
12:38 PM Linux kernel client Bug #677 (Resolved): rbd snapshot creation takes too long
Yehuda Sadeh
12:37 PM Linux kernel client Bug #677: rbd snapshot creation takes too long
Fixed with commit:b6a9e5db67e3685e1fbac9c92de2963bff541fc1. We called the notify under the ctl_mutex. The notify trig... Yehuda Sadeh
12:31 PM Linux kernel client Bug #677 (Resolved): rbd snapshot creation takes too long
Apparently the notify doesn't get a completion event, even though the osd sends it. It only timeouts away from the wait. Yehuda Sadeh
10:32 AM Bug #676 (Resolved): inconsistent handling of g_conf.keyring
In some places we treat g_conf.keyring as a single string. In others, we treat it as a comma-separated array. Sometim... Colin McCabe
02:33 AM Revision e1a1e1e6 (ceph): auth: Fix buggy parsing of g_conf.keyring
Previously, we weren't correctly handling comma-separated lists of
values in g_conf.keyring.
Signed-off-by: Colin Mc...
Colin Patrick McCabe
01:58 AM Revision 27cba781 (ceph): common: str_list: const cleanup
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
01:02 AM Revision a4223d4f (ceph): osd: watch request doesn't use version
also fix watch-notify to send the current user_version Yehuda Sadeh
01:02 AM Revision 7e206dfe (ceph): osd, librados: configurable notify timeout
Yehuda Sadeh

12/29/2010

11:55 PM Bug #662 (Resolved): update rbd qa scripts to use new sysfs interface
Implemented by commit:3914ee5e3ca70252e305805190226d60b5cf0dc7
C.
Colin McCabe
11:36 PM Revision de8f0212 (ceph): qa/rbd: minor fixes
* qa/rbd/common: create tempdir for script
* qa/rbd/common: try harder to find monhost (try "mon addr" and also
"mon...
Colin Patrick McCabe
11:33 PM Revision ace90971 (ceph): makefile: ignore mkdir -p errors in install
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
10:34 PM Revision fddbd903 (ceph): cleanup: don't use __PRETTY_FUNCTION__
Its output isn't as pretty as I had hoped.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
Colin Patrick McCabe
07:13 PM Revision bbd0ce25 (ceph): logging: clean up some error handling
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
04:56 PM Feature #389 (Resolved): Synchronize header modifications between clients
configurable timeouts implemented as of commit:7e206dfec07033d49770bafcc2c86f6dd736d7c2.
I'm closing this issue, t...
Yehuda Sadeh
04:52 PM Feature #572 (Resolved): Implement lingering osd requests
this was already completed. Yehuda Sadeh
01:07 AM Revision 22828b9a (ceph): osd: PG::Info::History: init last_epoch_clean
It seems that we have not been zeroing
PG::Info::History:last_epoch_clean when the History structure is
created. This...
Colin Patrick McCabe

12/28/2010

11:25 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Very nice, thanks. I can almost say it worked!
... _almost_ because the recovery went well for a long time until o...
ar Fred
09:44 PM Support #675 (Closed): hot to adjust replicate level,disk not the same size
some disk is 3T,while some disks less 50G,
who can tell me the meaning of crush file.
# types
type 0 device
type...
longguang yue
06:35 PM Bug #674: tiobench stress test , OSD timeout
git the lastest ceph-client-standalone:master-backport:
run:
dd if=/dev/zero of=/mnt/ceph bs=4k count=500000
a...
changping Wu
02:04 AM Revision b3bb8922 (ceph): signals: handle_fatal_signal: use SA_NODEFER
SA_RESETHAND | SA_NODEFER allows the "re-trigger default signal handler"
trick to work for signals other than SIGSEGV...
Colin Patrick McCabe
01:51 AM Revision cbeddc23 (ceph): signals: backtrace some more exotic fatal signals
We're not likely to see these, but if we do, we want it in the logs!
Signed-off-by: Colin McCabe <colinm@hq.newdream...
Colin Patrick McCabe
01:38 AM Revision ce81e6a5 (ceph): signals: test infinite_recursion
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
01:34 AM Revision ff6e4d4d (ceph): common: make generic_usage a little prettier
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
01:32 AM Revision 247b11b7 (ceph): signals: Handle SIGILL, SIGBUS, SIGFPE.
Print out a backtrace when we get SIGILL, SIGBUS, or SIGFPE. Fix a bug
where we failed to install a SIGABRT handler.
...
Colin Patrick McCabe

12/27/2010

09:34 PM Revision a410360a (ceph): test: add TestSignalHandlers
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
04:51 PM Bug #674: tiobench stress test , OSD timeout
printk many timed out messages, then ,tiobench test fail. changping Wu
01:45 AM Bug #674 (Can't reproduce): tiobench stress test , OSD timeout
Hi,
we do multi-thread stress test for ceph 0.23.1 , ceph client printk osd timeout.
1. test tool: tiobench-0.3....
changping Wu
12:31 AM Bug #673 (Resolved): cmon: SimpleMessenger::Pipe::discard_queue
On noisy I saw my monitor go down with:... Wido den Hollander

12/25/2010

07:24 PM Bug #672 (Resolved): osd: crash from ballpit0
see attached Sage Weil

12/24/2010

08:46 AM CephFS Bug #671 (Resolved): mds: fragstat problem after mds restart
... Sage Weil
08:24 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Ok, may not look like it, but this is actaully progress. Add this:... Sage Weil

12/23/2010

11:43 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Thanks for the patch, back to osd1 vs pg 3.70...
osd1 crashed:
2010-12-23 22:17:29.289927 7fef0ec40700 osd1 1292 ...
ar Fred
09:12 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Sigh... this one shoheuld have t conditions match and be a bit more spacecase programmer proof. Also prints peer inf... Sage Weil
08:26 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Hi and thanks again for the patch. I guess it allowed things to go a bit further... at least I don't see the same cra... ar Fred
09:44 PM Support #670: what is the condition of run testrados.
,i run testrados ,but it stop at rados_write function.
it hant there.
my cluster'osd lose many data,log says 1000...
longguang yue

12/22/2010

08:11 PM Support #670 (Closed): what is the condition of run testrados.
This isn't really a bug, you should send questions like this to the list.
Anyway, it's just a silly little program...
Greg Farnum
01:03 AM Support #670 (Closed): what is the condition of run testrados.
it needs ceph cluster running?
or it must run on osd?or mds? or mon?
-------------------
because my ceph error,i r...
longguang yue
03:58 PM Feature #389: Synchronize header modifications between clients
main stuff left to be done:
- configurable timeouts (on client side?)
- more debugging
Yehuda Sadeh
12:49 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Looks like it got past the previous problem pg but bombed later... I forgot to test for backlog. This should do it?
...
Sage Weil
03:52 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Thanks for the patch, which I immediately tested:
osd2 crashes first, then osd1.
osd2:
#0 0x00007fb4b3db4ba5 ...
ar Fred
12:58 AM Bug #669: testradospp doesn't build
you have to give -I(headder dir) -l(redos) ,when compile
look gcc options
Greg Farnum wrote:
> gregf@kai:~/ceph/sr...
longguang yue

12/21/2010

09:56 PM Revision 532522da (ceph): rgw_admin: fix compile error in usage
generic_usage was changed in c4b5c33b123edfc7770e156b4b7f183f8b0af0c9
but this use was not updated. Should call gene...
Samuel Just
08:50 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Oh, the case above it catching it. Can you please try... Sage Weil
02:11 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
I just tried 180a417603552 which is v0.24.
Same crash, and no occurrence of
@dout(10) << "activate peer osd" << p...
ar Fred
06:48 PM Revision 61f964c6 (ceph): librados: Fix compile error by adding std:: namespace
No idea how this got left out or passed any testing?
Signed-off-by: Greg Farnum <gregf@hq.newdream.net>
Greg Farnum
05:36 PM CephFS Bug #663: cmds segfaults
Can you attach that output.txt please? We need to see what path it being taken that submits the journal item in teh ... Sage Weil
02:02 AM CephFS Bug #663: cmds segfaults
After even more advice on IRC, here's what happens on the commandline:
cmds -n mds.dango -D --debug_mds 20 > outpu...
Alexander Rødseth
10:55 AM Bug #669 (Rejected): testradospp doesn't build
gregf@kai:~/ceph/src$ g++ testradospp.cc
In file included from include/object.h:28,
from include/...
Greg Farnum
06:00 AM Feature #668 (Resolved): Change number of open files in init script
On a busy system with multiple OSD's your OSD's can reach the max number of open files.
It should be useful IMHO i...
Wido den Hollander
05:55 AM Feature #667 (Closed): Feature: ceph osd pool get data get pg_size
Hi,... Wido den Hollander
02:40 AM Bug #666: OSD: ReplicatedPG::eval_repop
Forgot to mention, this was on the RC branch, commit: 180a4176035521940390f4ce24ee3eb7aa290632 (v0.24) Wido den Hollander
02:33 AM Bug #666 (Resolved): OSD: ReplicatedPG::eval_repop
On my 'noisy' machine I wanted to see how fast writing a 10GB file would go, but this caused some crashes.
This ma...
Wido den Hollander

12/20/2010

11:58 PM Revision 180a4176 (ceph): v0.24
Sage Weil
09:22 PM Revision 69940e27 (ceph): osd: compensate for replicas with tail > last_complete
Normally we shouldn't ever have a last_complete < log.tail (&& !backlog).
But maybe we do (old bugs, whatever; see #5...
Sage Weil
09:20 PM Bug #665 (Closed): mds(uo:replay),clent can not mount,return error 5 input/output
mount.ceph mon:/ /mnt
return error 5 input/output
-------------
[root@tcmaster ~]# ceph -s
2010-12-21 05:17:54.05...
longguang yue
07:34 PM Revision 0e510dbe (ceph): objectcacher: Fix erroneous reference to "lock" with "flock."
This looks to be an old bug introduced years ago in
267679abc7e29e73655da7367d87e22a0a0d2375, and left
undiscovered d...
Greg Farnum
06:50 PM Bug #664 (Closed): osd auto dead,zombie state.
osd.1.log
2010-12-21 02:51:11.996619 7f1fe40e2720 filestore(/spare/osd1) lock_fsid failed to lock /spare/osd1/fsid, ...
longguang yue
03:49 PM Feature #620: objecter: (optionally) read from replica if on localhost and primary is not
first pass just compares the ip address, commit:f40feef994a5c50fec5230af091f949979422298 Sage Weil
03:13 PM Linux kernel client Bug #657 (Resolved): nfs-reexport null deref in ceph_init_dentry
Sage Weil
03:06 PM Linux kernel client Bug #657: nfs-reexport null deref in ceph_init_dentry
fix was simpler than that.. commit:92cf765237e2787eb168096305c448caf25ac7f8 Sage Weil
03:12 PM Tasks #653 (Resolved): get playground radosgw up and running again
Sage Weil
03:12 PM Bug #655 (Resolved): class objects are being stripped (debian packages)
commit:1e291fc9ef36a5c7ca36053012aec3e9a89f9861 Sage Weil
03:11 PM Bug #656: ceph
... or at least the crash in ceph_crush.txt is. It's not clear from the report why the 'ls' is hafornging. Sage Weil
03:09 PM Bug #656 (Closed): ceph
This is fixed in the latest code. Thanks for the report! Sage Weil
01:12 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Okay, i see the problem. Or at least teh symptom. Please try the latest 'rc' branch, commit:69940e2.
I'm not cer...
Sage Weil
02:34 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
Hi, I produced these logs with the latest rc (b04b6f48237fcbd22716279726f827237a3eb4af).
I have not seen the prima...
ar Fred
11:26 AM Bug #644: rsync can be sloooow
Like I promised, I've done some rsync benchmarking on my Noisy machine, first a small hardware list:
* Dual-Core Int...
Wido den Hollander
10:23 AM Bug #658 (Resolved): osd: scrub crash
Sage Weil
10:14 AM CephFS Bug #663: cmds segfaults
Uploaded another crash .tar.gz:
http://68.178.169.4:81/mds_crash2.tar.gz
(approx 125M)
Alexander Rødseth
07:27 AM CephFS Bug #663 (Can't reproduce): cmds segfaults
Hello.
One of my cmds daemons segfaults.
After asking for advice on #ceph/irc.oftc.net, I used cdebugpack -c /e...
Alexander Rødseth

12/19/2010

02:03 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
Sage Weil wrote:
> Stefan Majer wrote:
> > Hi,
> >
> > we see similar problems with kernel 2.6.37-rc6 and ceph b...
Stefan Majer

12/18/2010

02:49 PM Bug #662 (Resolved): update rbd qa scripts to use new sysfs interface
...and run them on one or more of the ceph* nodes!
see Documentation/ABI/testing/sysfs-bus-rbd in latest -rc to se...
Sage Weil
10:09 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
Sage, like you asked:... Wido den Hollander
05:02 AM Revision b04b6f48 (ceph): mds: make nested scatterlock state change check more robust
The predirty_journal_parents() calls wrlock_start() with nowait=true
because it has a journal entry open and we don't...
Sage Weil
12:52 AM Revision d64616b9 (ceph): mds: set layout on files created by MKNOD instead of CREATE
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:52 AM Revision 7d33830a (ceph): mds: set a writeable client range on regular files created via MKNOD
If the client reexports ceph via nfs, file creations come through as
a MKNOD followed by OPEN. If it's a MKNOD on a ...
Sage Weil
12:52 AM Revision df7e3ffd (ceph): mds: issue caps on regular files created via MKNOD
Give the client some caps to write with. This is similar to the current
MKDIR behavior, but with different bits.
Si...
Sage Weil
12:02 AM Revision 7df46936 (ceph): cephx: fix uninitialized value
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

12/17/2010

11:31 PM Revision 285f351b (ceph): mon: build_initial_monmap: fix mismatched alloc
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:30 PM Revision 3a235b0f (ceph): filestore: make OpSequencer::flush() work for writeahead journaling items
It was only waiting for items in the op_queue to complete. The goal is
to wait for anything we've called queue_trans...
Sage Weil
11:26 PM Revision caa46093 (ceph): common: cleanups
common_init: avoid (mismatched) heap allocation
ConfFile::_parse: avoid memory leak on error path
ConfFile: NULL fi...
Colin Patrick McCabe
11:14 PM Revision 2f916086 (ceph): auth: avoid mismatched allocation
Can't pair strdup and free.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
Colin Patrick McCabe
11:14 PM Revision 795811d6 (ceph): hadoop: fix a bunch of mismatched allocations
Using array new means you need array delete.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
Colin Patrick McCabe
11:14 PM Revision f7dc1a92 (ceph): rgw: fix fd leak on error path
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
11:14 PM Revision 28bcf0bc (ceph): osd: PG::choose_acting: fix major iterator mistake
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
10:15 PM Revision 5184db44 (ceph): filestore: add per-sequencer flush operation
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:15 PM Revision 3c7d30f1 (ceph): osd: flush pg writes to disk before starting scrub scan
This avoids two races:
- we just completed recovery by pushing objects to the replica, and the
replica starts sca...
Sage Weil
09:09 PM CephFS Bug #661 (Resolved): mds: mdlog recursion / scatter crash
commit:b04b6f48237fcbd22716279726f827237a3eb4af Sage Weil
08:56 PM CephFS Bug #661 (Resolved): mds: mdlog recursion / scatter crash
... Sage Weil
08:51 PM Revision 2fb60daf (ceph): osd: debug scan_list and scrub a bit better
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:28 PM Revision c3a24fc5 (ceph): osd: refactor _get_pool / _put_pool a little bit
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
08:00 PM Revision 39aff221 (ceph): osd: OSD::_lookup_pool: avoid double lookup
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
07:52 PM Revision bf4529b9 (ceph): test: create test_pools.sh
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
06:59 PM Revision 1cfad2ea (ceph): osd: clear INCONSISTENT if scrub detects no errors
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:36 PM Revision b1908755 (ceph): osd: add assert that we're replica
ar Fred saw a crash where we got into merge_log as a stray, which really
shouldn't ever happen! See #590.
Signed-of...
Sage Weil
04:59 PM CephFS Cleanup #660 (Closed): mds: use helpers in mknod, mkdir, openc paths
Lots of not-quite-duplicate code here that can be cleaned up Sage Weil
04:43 PM Bug #531: Journaling Causes System Hang
This is strange.
I tried a very large write.
root@cephnode01:/mnt# dd if=/dev/zero of=cephtest01 bs=1M count=1M...
Bryan Tong
12:27 PM Bug #531: Journaling Causes System Hang
Okay,
I got everything updated and it appears the cluster hung right about 180GB of data written. Which was double...
Bryan Tong
11:41 AM Bug #531: Journaling Causes System Hang
Just wanted to make an update.
I just got a chance to get back on the cluster today.
I have upgraded the cluste...
Bryan Tong
04:31 PM Revision 1e291fc9 (ceph): debian: don't strip rados classes
Signed-off-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@newdream.net>
Laszlo Boszormenyi
04:30 PM Revision 9c173bb4 (ceph): debian: rename ceph.lintian -> ceph.lintian-overrides
Signed-off-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@newdream.net>
Laszlo Boszormenyi
04:15 PM Feature #659 (Resolved): direct io unit test
For qa/workunits:
Test directio reads and writes where
- the file offset is 512-byte and not page aligned
- the ...
Sage Weil
01:29 PM Bug #629: cosd segfaults when deleting a pool containing degraded objects
This shouldn't happen again commit:c3a24fc5d31d53e3db911be900b9067584f0e07e
It still might be interesting to see t...
Colin McCabe
10:48 AM Bug #658 (Resolved): osd: scrub crash
... Sage Weil
10:40 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
ar Fred wrote:
> 2010-12-03 16:29:00.261788 7fe2f28a9710 osd1 1059 pg[3.70( v 660'2955057 lc 567'2954755 (643'295484...
Sage Weil
01:47 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
I spent some time yesterday trying the latest rc (914f6ddebd899667b1937dfe9d5f1a94537dc500), and I still hit the same... ar Fred
09:33 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
Stefan Majer wrote:
> Hi,
>
> we see similar problems with kernel 2.6.37-rc6 and ceph build from yesterday (29480...
Sage Weil
12:21 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
Hi,
we see similar problems with kernel 2.6.37-rc6 and ceph build from yesterday (29480f42be8551f47d79282b7376a10a...
Stefan Majer
09:02 AM Linux kernel client Bug #657 (Resolved): nfs-reexport null deref in ceph_init_dentry
Al Viro observes:... Sage Weil
08:00 AM Revision b2746c94 (ceph): logging: eliminate out-of-range dout calls
Eliminate calls to dout that use non-existent log levels, like negative
levels less than -1. Also trigger a compiler ...
Colin Patrick McCabe
12:25 AM Revision c4b5c33b (ceph): common: more cerr -> derr conversions
cmds: cerr -> derr
DoutStreambuf: primitive_log: just write to the stdout fd rather than cerr
assert: don't write o...
Colin Patrick McCabe

12/16/2010

11:31 PM Bug #656 (Closed): ceph
i have 2 osd,[2.6.36,ceph-0.23.1]
#ceph osd down 0
#ls /mnt/ceph {ceph's mount dir}
hanging here,
after a while ...
longguang yue
10:57 PM Revision 30f752cd (ceph): gceph,ceph: replace cerr->derr
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
10:05 PM Revision c76379fd (ceph): cosd: replace cerr with derr
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
09:06 PM Revision 73669d87 (ceph): PG.cc:
sub_op_scrub must set finalizing_scrub on the replica
before waiting for last_update_applied to catch up to
info.la...
Samuel Just
06:32 PM Revision 4644247c (ceph): osd: FileJournal: use derr
Use derr to announce errors in FileJournal.
Handle EINTR where necessary (still haven't fixed
read/write/pread/pwrit...
Colin Patrick McCabe
06:32 PM Revision 4fc1af5e (ceph): logging: re-introduce derr
Re-introduce derr as a special log level (level -1) which will show up
in all logs, and on stderr. These messages are...
Colin Patrick McCabe
05:10 PM Tasks #653: get playground radosgw up and running again
S3 gateway is up and running (I just recreated my user and tested it, it was working before). We still need to recrea... Yehuda Sadeh
03:02 PM Bug #655 (Resolved): class objects are being stripped (debian packages)
When installing the latest debian packages, the resulting /usr/lib/rados-classes/*.so are completely stripped, thus w... Yehuda Sadeh
02:56 PM Tasks #654 (Resolved): get playground ladder0 mounted
mounted, was just missing name=username on the mount command. Also created the users. Yehuda Sadeh
08:11 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
My rsync (196k files, 74GB data) finished succesfully, but the btrfs warning repeated itself twice. Wido den Hollander
02:09 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
Compiled from the kernel GIT ( a4851d8f7d6351a395d36ae8fdcf41745a832d76 ) last night and then started a rsync this mo... Wido den Hollander
01:23 AM Revision 29480f42 (ceph): ReplicatedPG.cc:
_scrub must set head when it encounters a head snap
curclone counts down, not up
Signed-off-by: Samuel Just <samuel...
Samuel Just
12:33 AM Revision 1e490eff (ceph): osd: timed out watcher is added to unconnected map
Yehuda Sadeh
12:33 AM Revision c321620e (ceph): osd: send notify message only to unexpired watchers
Yehuda Sadeh
12:26 AM Revision 619b45ad (ceph): logging: close file when reloading global config
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe

12/15/2010

11:23 PM Revision 060fd428 (ceph): osd: fix watch timer, locking
Yehuda Sadeh
09:39 PM Revision 914f6dde (ceph): filestore: detect final version of async ioctl SNAP_CREATE_V2
Li's revised interface for the async snap ioctl is more flexible. Update
the ioctl call sites and detection code acc...
Sage Weil
09:15 PM Tasks #654 (Resolved): get playground ladder0 mounted
Get it mounted (it was being weird on me). And recreate the users (probably servicectl ladder0:user config from yakko?) Sage Weil
09:14 PM Tasks #653 (Resolved): get playground radosgw up and running again
I'm not sure what needs to be done to configure the radosgw pools..
Also we need to recreate the users/buckets fr...
Sage Weil
09:10 PM Bug #648 (Resolved): monclient: PGMap::apply_incremental
trimming changed by commit:89d5c91e7d207d646651f8959ee37a15ea199d1b Sage Weil
09:10 PM Bug #631 (Won't Fix): OSD: FileJournal::committed_thru
Sage Weil wrote:
> Okay, the first crash you saw is due to #645. I think it's a kernel bug causing that ioctl to fa...
Sage Weil
09:07 PM Revision 06a2d7a2 (ceph): mds: Save straydn in mdr so it's consistent across retry attempts.
Otherwise, we could choose new stray dirs and fail to get all
the locks we needed (while leaving old strays locked fo...
Greg Farnum
09:06 PM Cleanup #650 (Resolved): objecter: refactor request tracking to be per-osd instead of per-pg
commit:d54a854811a51a5730b548da712d59761057fa58 Sage Weil
08:44 PM Revision e31f0a47 (ceph): tools: don't start msgr thread before daemonize
Calling messenger->add_dispatcher_head() has the side-effect of starting
the messenger thread. So we must not do it b...
Colin Patrick McCabe
07:02 PM Revision d54a8548 (ceph): Merge branch 'objecter' into unstable
Sage Weil
07:02 PM Revision 065cdf52 (ceph): objecter: track pending requests by osd, not pg
This is a big cleanup. Also
- switch to keeping per-osd Connection *'s
- make requests time out independently (not...
Sage Weil
07:02 PM Revision f6dc5d9f (ceph): objecter: cleanup: rename op maps
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:02 PM Revision 32a8aed9 (ceph): objecter: add reopen_session helper
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:02 PM Revision 530083cc (ceph): objecter: check for pg mapping changes in each incremental; refactor mi...
We need to detect when a pg mapping changes but the primary stays the same.
That means we can't just look at the fina...
Sage Weil
07:01 PM Revision 5d44d599 (ceph): msgr: mark down by Connection*
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:01 PM Revision 07e593c4 (ceph): mds: fix inode ancestor attr encoding
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:15 PM Revision fdbd85e4 (ceph): automake: ignore rmdir errors during uninstall
We don't want to fail "make distcheck" for a silly reason.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
Colin Patrick McCabe
06:01 PM Bug #652 (Resolved): ReplicatedPG _scrub missing clone
ReplicatedPG.cc:4016 = handle the missing clone case Samuel Just
03:18 PM Linux kernel client Bug #552 (Resolved): Samba with kernel oplocks=on produces lots of corrupt mds entries in dmesg
Closing this out unless we hear about more issues. Greg Farnum
01:38 PM Feature #643 (Resolved): filestore: update btrfs ioctl interface for soon-to-be-pushed SNAP_CREAT...
commit:914f6ddebd899667b1937dfe9d5f1a94537dc500 Sage Weil
01:01 PM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
I took the for-linus branch, since both next-rc and master wouldn't compile against 2.6.37-rc5. Due to this error:
...
Wido den Hollander
10:39 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
I've just hit the bug again, this while I was running a rsync to my Ceph cluster.... Wido den Hollander
09:48 AM Feature #95: mon: adjust overload based on osd disk utilization
Implemented reweight-by-utilization in the overload branch.
C.
Colin McCabe
01:00 AM Revision 7b5e923c (ceph): osd: send pending notification for reconnected watcher
Yehuda Sadeh
12:28 AM Revision f9694648 (ceph): automake: add osd/Watch.h to noinst_HEADERS
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe

12/14/2010

09:55 PM Linux kernel client Bug #651 (Resolved): osd_client: need to recalculate request mapping for every osdmap incremental
Currently if we get an osdmap message with multiple incrementals, and a request maps to a different osd and then back... Sage Weil
08:47 PM Revision c8d9b20c (ceph): Merge branch 'sync2' into unstable
Sage Weil
07:50 PM Revision 89d5c91e (ceph): mon: trim pgmap less aggressively
This will make observer crashes due to missed states (#648) much harder to
hit. Eventually the pgmap state trim prob...
Sage Weil
07:02 PM Revision 056e91e0 (ceph): librados: drop watch_lock
Use the existing lock to do protect all of this.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:00 PM Revision d4420a8a (ceph): objecter: drop linger_info_mutex
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:51 PM Revision b989087d (ceph): crypto: catch cryptopp decrypt/encrypt exceptions
Yehuda Sadeh
06:48 PM Revision 215f3320 (ceph): objecter: simplify linger register
Drop single-use helper; make unregister_linger part of the public
interface.
Signed-off-by: Sage Weil <sage@newdream...
Sage Weil
06:47 PM Revision b60a9abf (ceph): objecter: fix up linger ack/commit to trigger first time only
We only want the user-provided ack/commit callbacks to trigger the first
time we register the lingering op. Same goe...
Sage Weil
06:35 PM Revision 8a75086d (ceph): objecter: clean up linger interface
Put LingerOp on heap. Use xlist to attach to PGs. Add in/out bufferlists.
Signed-off-by: Sage Weil <sage@newdream....
Sage Weil
05:55 PM Revision 96b32382 (ceph): Merge remote branch 'origin/unstable' into sync2
Conflicts:
src/auth/Crypto.cc
src/osd/ReplicatedPG.cc
src/osd/ReplicatedPG.h
src/osd/osd_types.h
Sage Weil
03:03 PM Feature #562 (Closed): separate gui into separate binary, package
Colin McCabe
02:56 PM Revision 3e076c39 (ceph): logging: use Mutex::Locker
Use Mutex::Locker to make logging exception-safe. That is, if you are
doing "dout() << foo() << dendl;" and foo throw...
Colin Patrick McCabe
02:20 PM Revision bf31f3f1 (ceph): logger: Fix DoutStreambuf::create_rank_symlink
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
02:13 PM Revision 4c377199 (ceph): cephtool: rename tools files
Rename tools files to be more consistent. For example, the main()
function for ./ceph should be in ceph.cc.
Signed-o...
Colin Patrick McCabe
12:50 PM rbd Feature #341: libvirt bindings
A "network" disk type was introduced in "036ad5052b43fe9f0d197e89fd16715950408e1d":http://libvirt.org/git/?p=libvirt.... Josh Durgin
11:44 AM Bug #646: [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git + ceph 0.23
I'm going to get the -standalone.git repo updated to the latest code (it's a bit out of date) so we can confirm this ... Sage Weil
11:36 AM Cleanup #650 (Resolved): objecter: refactor request tracking to be per-osd instead of per-pg
Sage Weil
10:40 AM Bug #649 (Resolved): OSD: CryptoPP::StreamTransformationFilter::LastPut
Fixed in b989087ddf8775588ddbb6234d099398a2e18072. CryptoPP threw an exception when failed to decode message (probabl... Yehuda Sadeh
02:34 AM Bug #649 (Resolved): OSD: CryptoPP::StreamTransformationFilter::LastPut
This morning on my test machine (noisy.ceph.widodh.nl, 1 MON, 1 MDS, 3 OSD) all three OSD's died at exact the same mo... Wido den Hollander
09:58 AM Revision a3fcf908 (ceph): logging: Fix use-before-access in debug.cc
Signed-off-by: Vangelis Koukis <vkoukis@cslab.ece.ntua.gr>
Signed-off-by: Constantinos Venetsanopoulos <cven@cslab.ec...
Vangelis Koukis
09:53 AM Revision 3932f084 (ceph): osd: PG::prior_set_affected: const cleanup
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> Colin Patrick McCabe
09:14 AM Bug #648: monclient: PGMap::apply_incremental
This is a known issue, caused by the pg state trimming. It'll go away eventually with #647. In the meantime, I'll m... Sage Weil
01:31 AM Bug #648 (Resolved): monclient: PGMap::apply_incremental
I left my laptop on last night with a 'ceph -w' on one of my test machines, this morning I saw:... Wido den Hollander

12/13/2010

08:23 PM Feature #647 (Duplicate): mon: refactor paxos interaction
We currently have a paxos instance per state machine, which is silly for a bunch of reasons. The big one is that a m... Sage Weil
07:28 PM Bug #646: [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git + ceph 0.23
master-backport ,reproduce it.
unstable-backport,can't reproduce it ,
i'm not sure whether this issue had been fixed.
changping Wu
07:00 PM Bug #646: [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git + ceph 0.23
/ceph-client-standalone$ git branch
master
* master-backport
changping Wu
06:45 PM Bug #646 (Can't reproduce): [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git +...
1. ceph-client-standalone: git from git://ceph.newdream.net/git/ceph-client-standalone.git
2. ceph: ceph-0.23
3.OS:...
changping Wu
10:18 AM Feature #640: support log rotation
I guess when I filed this I was thinking of a setup where there was a small tmpfs partition where the logs went to, w... Colin McCabe
 

Also available in: Atom