Project

General

Profile

Activity

From 05/22/2010 to 06/20/2010

06/20/2010

09:41 PM Revision bf3d52a4 (ceph): journal: initialize applied_seq during journal replay
This should avoid
#0 0x00007f41b1a18a75 in raise () from /lib/libc.so.6
#1 0x00007f41b1a1c5c0 in abort () from /li...
Sage Weil
04:45 PM Revision 2540ea48 (ceph): mkcephfs: push conf to remote machines
Signed-off-by: Fred Ar <ar.fred@yahoo.com>
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
04:44 PM Revision 3bfabd41 (ceph): mon: allow 10ms clock drift before complaining
Sage Weil
02:53 PM Bug #215: osd crash: FAILED assert(seq >= last_committed_seq)
this should be fixed by commit:bf3d52a4b725a0f2d3db39ea9ad5b412171ea0ad... can you please confirm?
thanks!
Sage Weil
10:39 AM Bug #215 (Resolved): osd crash: FAILED assert(seq >= last_committed_seq)
this is ceph unstable c626ac384678661b765c1ae1dee8db48b2c70993... ar Fred
10:34 AM Bug #214: don't fail on assertion when mkcephfs is mis-used
handle_osd_map should log an error and return if the fsid doesn't match Sage Weil
09:37 AM Bug #214 (Resolved): don't fail on assertion when mkcephfs is mis-used
3 boxes, each with 1 mon 1 mds 1 osd
I wanted a clean base for further testing, so on each boxes, I did a:...
ar Fred

06/19/2010

03:41 PM Revision 775506e9 (ceph): debian: include cclass
ar Fred
03:40 PM Revision d4173fa7 (ceph): initscript: fix for dash
Signed-off-by: Fred Ar <ar.fred@yahoo.com> ar Fred
03:39 PM Revision 1f2725bc (ceph): cclass: fix for dash
Signed-off-by: Fred Ar <ar.fred@yahoo.com> ar Fred
08:30 AM Bug #213 (Resolved): non-idempotent transactions (clone) under ext3 may not replay correct result
The writeahead journaling will restore the store to a known state regardless of which operations have committed, but ... Sage Weil
04:26 AM Revision 6d833475 (ceph): initscript: remove class loading for now
- only need to do it once, by connecting to a random monitor, not for
each monitor
- not sure we should try it ever...
Sage Weil
02:12 AM Revision b3dc5ecf (ceph): vstart: only need to load classes once, not for each monitor
Sage Weil

06/18/2010

11:32 PM Revision c626ac38 (ceph): Merge branch 'rbd-class' into unstable
Sage Weil
11:31 PM Revision 3a2c9169 (ceph): Revert "class: able to add classes when doing mkfs"
This reverts commit 65371482dd661c6dc3d251fb9fd4e01812bdaa0e. Sage Weil
10:59 PM Revision fb1ad6e7 (ceph): filestore: op_start when op is _queued_, so that q is drained on commit
We need the store in a consistent state on commit, which means flushing
transactions such that we have all ops <= a g...
Sage Weil
10:09 PM Revision 6a261eb1 (ceph): journal: maintain applied_seq, use that value during the commit.
Track which seq's have successfully applied.
This fixes writeahead mode. Parallel mode is still broken...
Sage Weil
09:30 PM Revision ed768c9a (ceph): buffer: fix includes for writev/iovec stuff in write_fd()
Sage Weil
09:09 PM Revision a3e906e3 (ceph): filestore: make sync() and friends actually sync!
Sage Weil
08:56 PM Revision ae90ba18 (ceph): initscripts: fix verbose output for root commands
Sage Weil
08:06 PM Revision 10d41d2c (ceph): filejournal: only pad to align for data segments above 64 KB
Sage Weil
08:00 PM Revision f98c0b4f (ceph): filejournal: make alignment optional (<0 == no preferred alignment)
Sage Weil
07:49 PM Revision 2be1480a (ceph): Merge branch 'unstable' of ceph.newdream.net:git/ceph into unstable
Sage Weil
07:48 PM Revision 3fbc2a2a (ceph): buffer: remove debug cruft
Sage Weil
07:47 PM Revision d1626090 (ceph): filejournal: pad entries on disk based on preferred alignment
Adjust pre/post padding to align entry payload with it's preferred
alignment.
Sage Weil
07:46 PM Revision f59d77be (ceph): objectstore: track offset, alignment of largest data segment [encoding ...
Sage Weil
07:44 PM Revision 0995f28f (ceph): buffer: list::rebuild_page_aligned()
Rebuild bufferlist such that all segments/ptrs are page sized/aligned.
(Avoids copying/reallocating segments that are...
Sage Weil
06:38 PM Revision 25116b27 (ceph): client: if checking caps on a snap, return, don't assert
Greg Farnum
05:10 PM Revision bd4188a0 (ceph): filejournal: use buffer::write_fd() instead of open coding
Avoid dup code, and also handle short writes. Sage Weil
04:40 PM Revision ca9b7563 (ceph): osd: implement bench command
'bench [bsize] [total bytes]'
Result is reported via logclient.
Sage Weil
04:24 PM Bug #212 (Resolved): osd: journal vs filestore sync in writeahead move
Sage Weil
04:24 PM Bug #212: osd: journal vs filestore sync in writeahead move
many commits, culminating in commit:fb1ad6e746a52a99d82d864d4a6efeda09be2440 Sage Weil
02:39 PM Bug #212 (Resolved): osd: journal vs filestore sync in writeahead move
The journal is incorrectly dropping entries in writeahead mode Sage Weil
04:19 PM Revision e4b9dd4b (ceph): msgr: avoid calculating data crc is peer didn't either
Otherwise we're just wasting CPU cycles! Sage Weil
03:34 PM Revision 2bb94b36 (ceph): msgr: Fix crash in policy.throttler->get(message_size) when message_siz...
Signed-off-by: Simone Gotti <simone.gotti@gmail.com>
Signed-off-by: Sage Weil <sage@newdream.net>
Simone Gotti
12:08 PM CephFS Bug #188 (Resolved): cfuse crashes on snapshot file read
This was a simple fix, just remove a dumb assert! Done in 25116b27388293ce19aef0472b1a5bd6c42527c3.
However, leads...
Greg Farnum
12:08 PM CephFS Bug #211 (Resolved): Cfuse won't unmount properly with snapshot activity
Perform the steps in #188, then ctrl-C cfuse. It won't unmount!
If you unmount and remount before creating the sna...
Greg Farnum
10:32 AM Linux kernel client Bug #210 (Resolved): GPF in ceph_con_revoke_message+0x2c/0x152
osd was repeated being restarted, probably doing weird things.... Sage Weil
10:02 AM Bug #98: reserved identifier violation
I don't actually remember how they were created, but according to the Readme file, it's javah, yes.
Those files aren...
Greg Farnum
09:49 AM Bug #98: reserved identifier violation
Do you use the tool ""javah":http://java.sun.com/javase/7/docs/technotes/tools/solaris/javah.html" for these header f... Markus Elfring
09:35 AM Bug #98: reserved identifier violation
Markus Elfring wrote:
> Do you mean that a tool for the "Java™ Native Interface":http://java.sun.com/javase/6/docs/t...
Sage Weil
09:18 AM Bug #98: reserved identifier violation
Do you mean that a tool for the "Java™ Native Interface":http://java.sun.com/javase/6/docs/technotes/guides/jni/index... Markus Elfring
09:03 AM Bug #98: reserved identifier violation
Markus Elfring wrote:
> Who is responsible for the tool that generates the affected header files?
Oh.. if those a...
Sage Weil
08:55 AM Bug #98: reserved identifier violation
Who is responsible for the tool that generates the affected header files? Markus Elfring
08:48 AM Bug #98: reserved identifier violation
Markus Elfring wrote:
> Would you also like to adjust any symbols with the prefix "_Included"?
> Example: source:sr...
Sage Weil
08:41 AM Bug #98: reserved identifier violation
Would you also like to adjust any symbols with the prefix "_Included"?
Example: source:src/client/hadoop/org_apache_...
Markus Elfring

06/17/2010

08:44 PM Revision 35095434 (ceph): qa: another mds test fix... wait long enough for the previous death to ...
Sage Weil
08:37 PM Revision c6067131 (ceph): msgr: ref count Pipe to avoid use after free
The Connection has a Pipe pointer to facilitate
send_message(Message, Connection)
but the reaper() clears that point...
Sage Weil
08:09 PM Revision 920b2d89 (ceph): qa: fix mdstable failure tests
Sage Weil
05:51 PM Revision 4d66a72a (ceph): ceph: fix straggler header ifdef guards
Sage Weil
05:47 PM Revision f4b9d9d8 (ceph): Bug #98: Unique names for include guards
A couple of preprocessor symbols for include guards tampered with the reserved namespace.
Signed-off-by: Markus Elfr...
Markus Elfring
05:36 PM Revision 6de2dde7 (ceph): ceph: remove leading _ from macro
Reported-by: Markus Elfring <elfring@users.sourceforge.net>
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:34 PM Revision 9013832c (ceph): Bug #98: Movement of an underscore for a data structure identifier
A leading underscore was moved to the back of a data structure identifier.
Signed-off-by: Markus Elfring <elfring@us...
Markus Elfring
05:13 PM Revision 8c5ee65b (ceph): osd: fix snap_trimmer AccessMode interaction
Recheck access mode every time we retake the pg lock (each object). Sleep
if necessary.
Sage Weil
05:13 PM Revision af2d023c (ceph): osd: don't requeue null OSDOp on apply_and_flush_repops
(this triggers during snap trimming) Sage Weil
05:13 PM Revision e6b033ac (ceph): osd: make clean_up_local remove hardlinks from collection dirs
Sage Weil
05:13 PM Revision 081bbe15 (ceph): mds: fix cdir head/snap items/null accounting
We were mixing up primary vs remote, partly due to the weird placement of
the counter adjustments. Make those sane a...
Sage Weil
05:13 PM Revision 57d79aaa (ceph): Revert "mkcephfs: cosd mkfs as root, then chown"
This reverts commit 0383878c4f5133f43ec16067a8917d3080adc690. Sage Weil
05:04 PM Revision f6501564 (ceph): Merge branch 'unstable' of ssh://ceph.newdream.net/home/sage/ceph.newdr...
Sage Weil
04:29 PM Revision b30ff171 (ceph): logclient: clean up interaction with monclient, monitor
Use monclient where available. Otherwise, we are a monitor, so send to
ourselves.
Conflicts:
src/mon/MonmapMonito...
Sage Weil
04:29 PM Revision b4727229 (ceph): osd: optionally periodically check pglog for corruption
...at least until we find this bug, periodically check pglog for corruption
and log it to the monitor when it happens...
Sage Weil
04:29 PM Revision 583f9f42 (ceph): mkcephfs: error out of journal dev is included in btrfs devs
Sage Weil
04:29 PM Revision f976bbdc (ceph): filejournal: journal write cache is safe on >= 2.6.33
Sage Weil
02:16 PM CephFS Bug #172 (Closed): OSD and MDS crash on rm -r
Closing this one. The osd crash was a snap_trimmer bug fixed a few days ago.
Added a qa workunit that repeats t...
Sage Weil
02:01 PM Bug #209 (Rejected): mkdir freezes on cfuse
I think I missed when diagnosing this one; looks like it was an error in my patch that I didn't actually compile out ... Greg Farnum
11:47 AM Bug #209 (Rejected): mkdir freezes on cfuse
This suddenly cropped up while testing my solution to 188 for no apparent reason, and now I can't get rid of it no ma... Greg Farnum
11:57 AM Bug #98 (Resolved): reserved identifier violation
Okay, applied these with a few fixes (and without the trailing gibberish in #1). Thanks Markus! Sage Weil
11:43 AM Bug #208 (Resolved): Can't create snapshots in cfuse on mds cluster
While trying to reproduce 188, I keep getting "mkdir: cannot create directory `mnt/a/.snap/1': No such file or direct... Greg Farnum
10:42 AM Bug #179 (Closed): corrupted LogEntry in mon data
Sage Weil
10:34 AM Linux kernel client Bug #204 (Resolved): crush update crash
fixed by commit:cebc5be6b6c82a99231e9c9af451e9e3d3399ec6 Sage Weil
10:18 AM Bug #203 (Rejected): osd journal full deadlock?
Sage Weil
10:18 AM Bug #173 (Resolved): Throttle client requests on OSD
Sage Weil
10:02 AM Linux kernel client Bug #207 (Resolved): NULL pointer reference when adding a block device
Actually this is probably due to the interface change in ceph_calc_raw_layout.
Should be fixed with commit 7a06cc4c2...
Yehuda Sadeh
01:43 AM Linux kernel client Bug #207 (Resolved): NULL pointer reference when adding a block device
I've been seeing this error a few times lately, but it seems i have some more information about it.
Building the r...
Wido den Hollander
09:40 AM Bug #205 (Resolved): remove journal write cache warning?
warning updated in commit:f976bbdcf700e52c9ef92964551b2c73b014686c Sage Weil

06/16/2010

09:15 PM Revision d7b9f39d (ceph): filejournal: log if journal item is larger than journal
Sage Weil
05:53 PM Revision 5dfc4119 (ceph): osd: warn when failing to mount journal on ENOTTY
suggest that the problem was due to journal size not configured Yehuda Sadeh
05:14 PM Revision d5995661 (ceph): conf: fix buffer initalization
This fixes a bug where we didn't initialize the variable
post processing buffer, which would sometimes lead to
garbag...
Yehuda Sadeh
05:14 PM Revision 5c001f4e (ceph): conf: can specify environment variables for substitution
Yehuda Sadeh
05:13 PM Revision ce909c2c (ceph): osd: error out when failing to read journal header
This fixes bug #201. Yehuda Sadeh
02:08 PM Bug #203: osd journal full deadlock?
maybe the journal was just too small:
> > > 10.06.11 21:46:37.335878 40cec950 journal check_for_full at 4694016 :
...
Sage Weil
09:30 AM Bug #203 (Rejected): osd journal full deadlock?
Sage Weil
11:56 AM Linux kernel client Feature #206 (New): make a 'soft' mode
On Wed, 16 Jun 2010, Peter Niemayer wrote:
> Hi,
>
> trying to "umount" a formerly mounted ceph filesystem that h...
Sage Weil
11:53 AM CephFS Bug #188 (In Progress): cfuse crashes on snapshot file read
Greg Farnum
11:43 AM Feature #138: Try out tcmalloc
All right, this is on hold while we work through some of the bugs that have been reported recently. Greg Farnum
10:03 AM Bug #205 (Resolved): remove journal write cache warning?
Sage Weil
09:57 AM Linux kernel client Bug #204 (Resolved): crush update crash
> mkcephfs -c /etc/ceph/ceph.conf --mkbtrfs (1mds, 1mon, 2osd (2 phyiscal disks$
> start ceph
> mount ceph fs
> wr...
Sage Weil

06/15/2010

12:39 PM Bug #201 (Resolved): Floating point exception when journal path is a directory
Fixed by 862bb864908ec22ec70b616ee2eddcf31e497742.
Now it'll exit cleanly and logs should be more descriptive abou...
Yehuda Sadeh
06:04 AM Bug #201 (Resolved): Floating point exception when journal path is a directory
When trying to add a new OSD i forgot that the journal had to be a file or a blockdevice, but instead a created a dir... Wido den Hollander
12:04 PM Bug #202: OSD crash during reads from cluster
I tried doing a fresh mkfs of only osd5, this is what i did:
* Removed all data in /srv/ceph/osd5
* Ran cosd with -...
Wido den Hollander
09:49 AM Bug #202 (Can't reproduce): OSD crash during reads from cluster
Today i noticed one crashing OSD during read operations (rsync) from my cluster.
I don't know if it matters, but t...
Wido den Hollander

06/14/2010

07:22 PM Revision d971bb06 (ceph): osd: mkfs failures are colored red
Yehuda Sadeh
04:07 PM Linux kernel client Bug #200: umount hangs with clustered mds
Seems to be waiting forever here:
wait_event(mdsc->cap_flushing_wq, check_cap_flush(mdsc, want_flush));
Yehuda Sadeh
03:21 PM Linux kernel client Bug #200 (Resolved): umount hangs with clustered mds
Happens on both current master and the unstable branches (6/14). The umount at the following scenario never exits:
...
Yehuda Sadeh
06:38 AM Bug #199 (Closed): OSD crash when rebalancing data
Today i wanted to expand my number of OSD's from 5 to 7.
After i loaded my new crushmap the rebalancing of data st...
Wido den Hollander

06/13/2010

10:45 AM Linux kernel client Bug #194: MOSDMap memory leak?
fixed by commit:ae32be31341a5fecfa16c5b3eb78095207182cce Sage Weil
10:43 AM Linux kernel client Bug #194 (Resolved): MOSDMap memory leak?
Sage Weil

06/12/2010

12:10 PM Bug #98: reserved identifier violation
Would you like to integrate any changes from the appended name adjustments into your source code repository? Markus Elfring

06/11/2010

10:10 PM Bug #198 (Can't reproduce): mds: repeated probing of inode
see 1000008b414.00000000 on current sepia.a mds instance Sage Weil
09:05 PM Revision bdda73fd (ceph): monc: handle mon0 or mon.a syntax when pull mon addrs from .conf
Sage Weil
09:03 PM Revision aabaa065 (ceph): monc: more debug info on monmap receipt
Sage Weil
08:56 PM Revision 690ea1c1 (ceph): mon: MONNAMES feature is supported by all userland
Sage Weil
08:34 PM Revision ddf739f6 (ceph): mon: simplify request forwarding
Use the MonSession to track proxy state, NOT the PaxosServiceMessage header
hackery.
Sage Weil
05:15 PM Revision 8cc096f6 (ceph): monmaptool: don't allow dup names, addrs to be added to the map
Sage Weil
05:12 PM Revision e5ff5189 (ceph): mds: add force_wrlock to lock states; use for filemax changes
This lets us write out a max_size change when we revoke caps from a client
(thus excl->sync) and the client releases ...
Sage Weil
05:12 PM Revision 658d51da (ceph): mon: make send_reply always consume reply ref (even if not sent); fix c...
Sage Weil
05:08 PM Revision 6a40008c (ceph): mon: make send_reply always consume reply ref (even if not sent); fix c...
Sage Weil
04:57 PM Revision 33850a22 (ceph): monmaptool: fix usages tring
Sage Weil
04:56 PM Revision 9fddee98 (ceph): mkcephfs: behave with new mon naming scheme
Sage Weil
10:46 AM Bug #197 (Resolved): osd: FAILED assert(state == DELAYED || state == RMW)
Sage Weil
10:18 AM Bug #197 (Resolved): osd: FAILED assert(state == DELAYED || state == RMW)
... Sage Weil

06/10/2010

11:01 PM Revision 3336d9d0 (ceph): rgw: put_obj uses write_full
Yehuda Sadeh
10:26 PM Revision 0b0009cf (ceph): cls_rbd: use write_full when writing image header
Yehuda Sadeh
10:26 PM Revision 5d2e85d2 (ceph): class: implement write_full interface
Yehuda Sadeh
10:15 PM Bug #35: osd: pg split should queue transaction(s) under appropriate sequencer(s)
splitting is broken in other ways as well.. postpone for v0.22. Sage Weil
10:15 PM Bug #192 (Resolved): inconsistent md5sum on repeated read of file
Sage Weil
05:08 PM Bug #192: inconsistent md5sum on repeated read of file
Problem seem to be rbd branch related, introduced by the splitting of calc_layout to helper function: plen is not upd... Yehuda Sadeh
12:48 PM Bug #192 (Resolved): inconsistent md5sum on repeated read of file
BTW: I think the .snap portion is not required for this to fail.... Sage Weil
10:14 PM Feature #195 (Resolved): monitor naming and removal support
see mon-remove branch. let's merge this after v0.21. Sage Weil
10:13 PM Linux kernel client Bug #193: protocol error after control-c
Yehuda Sadeh wrote:
> This was on the rbd branch, does it also happen on the unstable branch? The wait_for_completio...
Sage Weil
05:18 PM Linux kernel client Bug #193: protocol error after control-c
This was on the rbd branch, does it also happen on the unstable branch? The wait_for_completion_killable() might have... Yehuda Sadeh
04:42 PM Linux kernel client Bug #193 (Resolved): protocol error after control-c
Saw this on wido's machine:... Sage Weil
10:11 PM Linux kernel client Bug #194 (Resolved): MOSDMap memory leak?
Code audit looks ok. Could this be a false alarm somehow?... Sage Weil
09:45 PM Revision 80d2afc1 (ceph): cclsinfo: utility to display class information
Yehuda Sadeh
09:45 PM Revision e3b23bd0 (ceph): cclsinfo: some fixes
Yehuda Sadeh
09:45 PM Revision d071cb1c (ceph): cclsinfo: replace error message
Yehuda Sadeh
09:45 PM Revision 203af22e (ceph): cls: export relevant functions
Yehuda Sadeh
09:45 PM Revision 057ccbbf (ceph): cclsinfo.sh: shell script to dump class info
Yehuda Sadeh
09:45 PM Revision d87c057f (ceph): cclsinfo.sh: better architecture detection
Yehuda Sadeh
09:45 PM Revision 65371482 (ceph): class: able to add classes when doing mkfs
Yehuda Sadeh
09:45 PM Revision e737e635 (ceph): class: more flexible class loading, new cclass.sh script
Yehuda Sadeh
09:45 PM Revision 7584fef1 (ceph): class: modify scripts, fix configuration
Yehuda Sadeh
09:45 PM Revision 3bb153c3 (ceph): class: modify startup scripts to add classes on mon startup
Yehuda Sadeh
09:45 PM Revision ae5bb842 (ceph): cclass: add some options
Yehuda Sadeh
09:45 PM Revision 912e3c85 (ceph): osd: fix rollback when head points at the rolled back snapshot
Yehuda Sadeh
09:45 PM Revision f34c5b91 (ceph): cls_rbd: fix snapshots seq number
Yehuda Sadeh
09:45 PM Revision b58e1b9a (ceph): rados: update documentation to mention mkpool and rmpool
Greg Farnum
09:45 PM Revision 4716b486 (ceph): rbd: snapshots management via class mechanism
Yehuda Sadeh
09:45 PM Revision c5abf58d (ceph): class: add CLS_LOG macro
Yehuda Sadeh
09:45 PM Revision 2991c300 (ceph): rbd: add snapshot creation via rbd class
Yehuda Sadeh
09:45 PM Revision a6e50c24 (ceph): rbd: implement snap revert header manipulation via class
Yehuda Sadeh
09:45 PM Revision e297aa0b (ceph): rbd: snap revert header manipulation fixes
Yehuda Sadeh
09:45 PM Revision 94308d13 (ceph): cls-rbd: some adjustments, debug info
Yehuda Sadeh
09:42 PM Revision 7739cafb (ceph): testradospp: add a test for librados trunc
Yehuda Sadeh
09:41 PM Revision 42ff45b4 (ceph): librados: add trunc operation
Yehuda Sadeh
09:12 PM Revision 3de397fc (ceph): testradospp: update test, use write_full
Yehuda Sadeh
09:04 PM Revision d1c779c9 (ceph): librados: implement write_full
Yehuda Sadeh
06:55 PM Revision 92f2ecd3 (ceph): osd: fix null dereference in dout
Yehuda Sadeh
05:17 PM Revision effe5db3 (ceph): mds: be extra careful
Probably implied by the refs == 0, but be explicit anyway. Sage Weil
05:17 PM Revision 607ba5dd (ceph): mds: fix another instance of bad stray trimming code to use proper helper
Sage Weil
04:10 PM Revision 673fdc7a (ceph): mds: remove erroneous return statement
Greg Farnum
04:06 PM Revision 8be5b029 (ceph): mds: a checker function like _is_dir_nonempty should only report results.
Remove the call to reply_request and fix up the callers that need it. Greg Farnum
03:47 PM Feature #184: librados support for truncate, writefull
And related rgw commit:
3336d9d0fab7513c12a48879ab9a50cba2ab475e
Yehuda Sadeh
03:43 PM Feature #184 (Resolved): librados support for truncate, writefull
implemented.
commits:
d1c779c96a0afeaf245ade6c5d4c8de818fcd66b
42ff45b4eec31b992a1ecc1cc27a0f513f6cadde
Yehuda Sadeh
12:52 PM rbd Feature #41: Support snapshots
The qemu BlockDriver interface includes a bdrv_snapshot_goto callback that is being used to select a snapshot. Snapsh... Yehuda Sadeh
12:44 PM rbd Feature #41: Support snapshots
Yehuda Sadeh wrote:
> Then kvm-rbd snapshots implementation also needs to include snapshots rollback.
How would t...
Sage Weil
12:35 PM rbd Feature #41: Support snapshots
Then kvm-rbd snapshots implementation also needs to include snapshots rollback. Yehuda Sadeh
12:39 PM Feature #191 (Rejected): rbd needs locking mechanism
Certain operations on rbd image cannot happen without a proper locking mechanism. This includes all rbd image header ... Yehuda Sadeh
12:32 PM Feature #160 (Resolved): rbd revert-to-snapshot
snap_rollback is implemented and seems to be working. Actually, there's no need for the rbd-class specific code for i... Yehuda Sadeh
12:28 PM Linux kernel client Feature #190: krbd: DISCARD support
Yeah, it's called 'discard'. In order to get a block device to support it we need to do something like:
queue_flag_...
Yehuda Sadeh
10:26 AM CephFS Bug #177 (Resolved): unlinked inode during try_to_expire()
Hit this one myself (with full mds logs) and found the bug. Fixed in commit:607ba5dd14e11fe28f1a37cf8a29d4c48d181e0b Sage Weil
05:17 AM Revision 214a4279 (ceph): mds: handle dup anchorclient ACKs gracefully
One recovery scenario sees crisscrossing 'agree' and 'commit':
C->S --> commit1
S->C --> agree
C<-S <== agree
...
Sage Weil

06/09/2010

09:43 PM Revision c35cb294 (ceph): mds: fix export caps
Don't mangle sessionmap version by incrementing twice; pv is returned by
prepare_force_open_sessions().
Sage Weil
09:19 PM Revision 0349c789 (ceph): mds: allow import caps pin from multiple replicas at once
e.g. on dentry unlink, replicas send their caps to the auth. There may
be multiple replicas.
Sage Weil
08:50 PM Revision 1a3fac44 (ceph): osd: return EINVAL when snapc is invalid and trying to write
Yehuda Sadeh
08:30 PM Revision e74250d8 (ceph): osd: don't clobber racing pg_info on active pg
Wido saw a pg go active, but an activate log+info update crossed paths with
a pg_notify info, and the primary overwro...
Sage Weil
07:29 PM Revision 42738093 (ceph): osd: refine SnapContext::is_valid()
Sage Weil
07:27 PM Revision 2648e403 (ceph): osd: implement SnapContext::is_valid()
Sage Weil
05:56 PM Revision 7c780592 (ceph): init-ceph: don't fail if we fail to umount
It may not be mounted to begin with. Sage Weil
02:40 PM Linux kernel client Feature #190 (Resolved): krbd: DISCARD support
TRIM does exist, somewhere in Linux. RBD should support it so if the client system is using a supporting filesystem, ... Greg Farnum
05:34 AM Revision 12798de3 (ceph): osd: add --dump-pg-log command
Sage Weil
05:25 AM Revision 464fcd0e (ceph): monmap: use this here vector size to be clear
Sage Weil
04:52 AM Revision 16a46710 (ceph): mon: send old format monmap to old clients (backward compatible)
Also add some Connection smarts to monitor session. Sage Weil
04:36 AM Revision fb1c3b92 (ceph): osd: clear newly_removed_snaps on osdmap update if unchanged
We only want to apply _newly_ removed snaps once, or else we try to trim
the same snaps multiple times, and crash lik...
Sage Weil

06/08/2010

11:43 PM Revision 15a7a839 (ceph): mds: use helper to send message to client; fix send to null connection
Sometimes session->connection is NULL; use session->inst in that case. Sage Weil
11:33 PM Revision c992d020 (ceph): mds: remove erroneous bracket
Greg Farnum
11:25 PM Revision 2a88e2e5 (ceph): add checks for being a snapshot root to dir_is_nonempty
Greg Farnum
11:11 PM Revision 0d8c2975 (ceph): Merge branch 'unstable' into mon-remove
Sage Weil
11:03 PM Revision e3c4459e (ceph): monclient: track cur_mon by name, not rank
Sage Weil
10:59 PM Linux kernel client Bug #189 (Resolved): leaked dentry
Running unstable, commit:e041c5f
I think this is triggered by bonnie.sh. The bonnie.sh below is the dir where the...
Sage Weil
10:53 PM Bug #114: osd: corrupted pglog
Running 'src/script/check_pglog.sh $osd_data_dir' (from ceph.git) periodically will let you check the osd for any cor... Sage Weil
10:48 PM Revision 1d162972 (ceph): mon: rename whoami rank, simplify rank change logic
Sage Weil
10:37 PM Revision 0dc95695 (ceph): mds: fix stale lease trimming xlist iterator abuse
Sage Weil
10:37 PM Revision 26a4d0ea (ceph): throttle: allow take(0)
Sage Weil
10:17 PM Revision 2fa4a069 (ceph): mon: identify monitors by name, not rank
Sage Weil
10:13 PM Revision c196851b (ceph): osd: init auid to CEPH_AUTH_UID_DEFAULT in case authorizer doesn't set it.
We should probably also require the authorizer to set it for us. Greg Farnum
10:13 PM Revision 76fb75e9 (ceph): buffer: fix padding distances
Greg Farnum
10:13 PM Revision 7c856462 (ceph): mon: fix memory-leaked messages
Greg Farnum
09:48 PM Bug #56 (Closed): osd: crash on repop completion
Sage Weil
09:48 PM Bug #56 (Rejected): osd: crash on repop completion
I haven't seen this in forever. Closing out. Sage Weil
08:50 PM Feature #138: Try out tcmalloc
Going to implement this as a compile-time option (based on available libraries) once I've audited the mds and osd for... Greg Farnum
08:38 PM Revision 80c42d06 (ceph): mds: scan stray dir, eval strays on mds startup
Sage Weil
08:31 PM Bug #179 (In Progress): corrupted LogEntry in mon data
Sage Weil
08:31 PM Bug #179 (Rejected): corrupted LogEntry in mon data
Hmm, yeah i give up on this one. I see that it's corrupt, but not in any particularly suggestive way. No idea what ... Sage Weil
12:49 AM Bug #179: corrupted LogEntry in mon data
I also had some problems using gdb...
gdb won't work if cmon is not in /usr/bin and debug symbols for cmon (which I ...
ar Fred
06:39 PM Revision da520d7b (ceph): mon: clean up monmapmonitor warnings, style
Sage Weil
06:37 PM Revision 76191d23 (ceph): logclient: clean up interaction with monclient, monitor
Use monclient where available. Otherwise, we are a monitor, so send to
ourselves.
Sage Weil
06:27 PM Revision 0ab54c70 (ceph): monclient: make get_monmap_privately() clean up after itself
- set cur_mon=-1 when we're done
- clean up connections
Sage Weil
06:22 PM Revision 0b44ef02 (ceph): monclient: Make MonClient update cur_mon on getting new monmap
A MonClient starting with an incorrect monmap (i.e. mon id in the
starting map does not match the actual mon id found...
Sage Weil
04:42 PM Revision 2e1b0d35 (ceph): mon: make mon lease clock check protocol change backward compatible
Sage Weil
04:29 PM Bug #186 (Resolved): BUG: failed to decode message of type 66 v1: buffer::exception
Sage Weil
09:38 AM Bug #186 (In Progress): BUG: failed to decode message of type 66 v1: buffer::exception
This was due to commit:29a42efe2e4c789092f59b98b29632bdc4b88a80, which made a protocol change. I'll fix it up today ... Sage Weil
06:41 AM Bug #186 (Resolved): BUG: failed to decode message of type 66 v1: buffer::exception
While my cluster was in a degraded state due to a disk failure at random 2 out of my 3 monitors crashed.
I really ...
Wido den Hollander
04:25 PM Feature #180 (Resolved): Return ENOTEMPTY when trying to remove a directory which has a snapshot
As of 2a88e2e54ee0b9449e86cec02315e2809b75ca8b it will return ENOTEMPTY if you try to delete a dir which roots a snap... Greg Farnum
04:23 PM CephFS Bug #188 (Resolved): cfuse crashes on snapshot file read
gregf@pudgy:~/ceph/src$ sudo mkdir mnt/a
[sudo] password for gregf:
gregf@pudgy:~/ceph/src$ sudo mkdir mnt/a/b
gr...
Greg Farnum
03:18 PM Bug #181 (Resolved): monitor eats 8G of memory before beeing oom killed
Found and fixed many more monitor memory leaks in 7c85646240a02a3e82a727045de6e4432cc2ed9e. Valgrind is a lot happier... Greg Farnum
01:00 AM Bug #181: monitor eats 8G of memory before beeing oom killed
Ok, here is an update 1 day after posting comment #2:
mon0 is dead, mon1 is also dead, both OOM-killed I guess (no...
ar Fred
10:42 AM RADOS Bug #187 (Rejected): crush: high variance, latency for straw buckets
Sage Weil
09:47 AM Cleanup #103: Introduction of namespaces
Markus Elfring wrote:
> Can a bit of "C++ namespaces advice":http://stackoverflow.com/questions/713698/c-namespaces-...
Sage Weil
03:06 AM Cleanup #103: Introduction of namespaces
Can a bit of "C++ namespaces advice":http://stackoverflow.com/questions/713698/c-namespaces-advice help to clarify yo... Markus Elfring
07:49 AM Revision bc9fba0e (ceph): Introduced ceph mon remove command
Added ceph mon remove <ip>:<port> command. The command will remove
the target monitor from the monmap and shutdown th...
Paul Chiang
05:18 AM Revision 58fe4b8d (ceph): qa: add untar_snap_rm.sh
Sage Weil
05:05 AM Revision ac10d837 (ceph): osd: print rollback osd_op nicely
Sage Weil
03:55 AM Bug #99: Check return codes everywhere
The C/C++ programming language makes it easy to "overlook unused return values":http://stackoverflow.com/questions/12... Markus Elfring
02:31 AM Feature #101: Conversion of pointer parameters into references
> ... and because it makes the pass-by-reference more explicit.
I find this opinion questionable. I assume that yo...
Markus Elfring
02:10 AM Cleanup #146: Complete build options for Pthread API
... Markus Elfring

06/07/2010

11:03 PM Revision 470a6fde (ceph): mds: wire Connection to Session when Session already exists on connect
Sage Weil
11:03 PM Revision 6d770abe (ceph): mds: funnel mds->client messages through single Session* helper
Simplify callers where possible. Sage Weil
10:52 PM Feature #6 (Rejected): libceph could use a backward-compatible-to function
Usually this is handled via the shared object versioning scheme. The bit that doesn't address (I think) is when the ... Sage Weil
10:50 PM Bug #98: reserved identifier violation
The qemu guys were worried about this when we submitted the rbd driver. Simply changing __FOO_H to CEPH_FOO_H throug... Sage Weil
10:48 PM Bug #99: Check return codes everywhere
Markus Elfring wrote:
> Would you like to reuse any class library?
>
> I do not like "assert" for consistent erro...
Sage Weil
10:42 PM Cleanup #103: Introduction of namespaces
Markus Elfring wrote:
> This issue corresponds to my previous "feature request":https://sourceforge.net/tracker/?fun...
Sage Weil
10:42 PM Revision 29a42efe (ceph): mon: simplify clock drift checks
Ignore lease sent vs lease_ack receive times bc multiple lease msgs may
be in flight and the ack may be from a previo...
Sage Weil
10:40 PM Feature #101 (Rejected): Conversion of pointer parameters into references
Markus Elfring wrote:
> This issue corresponds to my previous "feature request":https://sourceforge.net/tracker/?fun...
Sage Weil
10:38 PM Cleanup #146: Complete build options for Pthread API
Markus Elfring wrote:
> Would you like to integrate the appended changes into your source code repository?
This g...
Sage Weil
10:25 PM Bug #149 (Closed): Stale NFS Handle when copying from snapshot
Still finding bugs with this basic workload, but I haven't seen ESTALE pretty much ever. Closing this one out for no... Sage Weil
10:04 PM Revision 527d5fd7 (ceph): monc: behave in ms_handle_reset if cur_mon is < 0
Sage Weil
10:03 PM Revision 6ff2a876 (ceph): msgr: don't throttle.get 0
Sage Weil
10:00 PM Revision 191cb2e4 (ceph): throttle: allow put(0)
Still returns a consistent value for the count. Sage Weil
09:59 PM Revision e505fb5a (ceph): msgr: don't thottle.put 0
Sage Weil
09:47 PM Revision d2740973 (ceph): Merge remote branch 'origin/msgr' into unstable
Sage Weil
07:05 PM Revision 4ecd8fac (ceph): mds: use cap on head if there is none on the snapped inode
This is needed, in particular, when we're flushing snap data on an inode
that already got COWed.
Sage Weil
06:40 PM Revision 5be26609 (ceph): osd: use low-level helper getting obc in sub_op_push
find_object_context does all sorts of stuff we don't need here: we know
which object the context is for. Just set it...
Sage Weil
03:09 PM Feature #185: mds: set file layout policy on directory hierarchy
If we use an xattr for this, one issue will be keeping the ancestor directly policy consistent across any mds node ma... Sage Weil
03:07 PM Feature #185: mds: set file layout policy on directory hierarchy
Alex Nelson wrote:
> One complication that may arise: If directory xattrs are what are used, what would be the sema...
Sage Weil
03:04 PM Feature #185: mds: set file layout policy on directory hierarchy
One complication that may arise: If directory xattrs are what are used, what would be the semantics of a file hard-l... Alex Nelson
03:03 PM Feature #185 (Resolved): mds: set file layout policy on directory hierarchy
It woud be helpful to have a way to specify multiple files' object sizes. Currently there is an ioctl for controllin... Alex Nelson
02:37 PM CephFS Bug #177: unlinked inode during try_to_expire()
The problem is that the dirty_inodes list assumes the inodes are either linked or base inodes. That should be the ca... Sage Weil
02:32 PM Bug #179 (In Progress): corrupted LogEntry in mon data
Okay, I can't make heads or tails of your core file on my system for some reason. Can you try this on your machine? ... Sage Weil
12:55 PM Revision 61555cce (ceph): throtle: add asserts on max and change parameters where appropriate
Greg Farnum
12:54 PM Revision 8413ed49 (ceph): throttle: fix assert count to actually use count
Greg Farnum
12:04 PM Linux kernel client Bug #182 (Resolved): VFS: Busy inodes after unmount of ceph.
This was actually an mds bug. It wasn't responding to a client_caps flushsnap. Fixed in ceph.git commit:4ecd8facd91... Sage Weil
09:43 AM Linux kernel client Bug #182 (Resolved): VFS: Busy inodes after unmount of ceph.
Am Sun, 06 Jun 2010 21:10:28 -0700 schrieb Sage Weil:
> On Sat, 5 Jun 2010, Thomas Mueller wrote:
>> hi
>>
>> ...
Sage Weil
11:35 AM Feature #184 (Resolved): librados support for truncate, writefull
We need to add librados support for truncate and writefull. We should make sure that the S3 gateway handles object re... Yehuda Sadeh
11:11 AM CephFS Bug #52 (Resolved): mds: dentry versionlock
Sage Weil
11:11 AM Feature #112 (Resolved): osd: snap rollback object op
Sage Weil
11:10 AM Bug #176 (Resolved): osd: make_clone needs to duplicate xattrs
Sage Weil
11:09 AM Feature #183 (Resolved): qa: xfstests workunit
simple script that will run a subset of xfstests in the current directory. Sage Weil
09:49 AM Bug #181 (In Progress): monitor eats 8G of memory before beeing oom killed
Guess I'll look at this a bit more. Greg Farnum
01:22 AM Bug #181: monitor eats 8G of memory before beeing oom killed
Hi, thanks for the fixes.
I just finished testing the new version, and my monitor survived (eating 6.8G memory, mo...
ar Fred
05:15 AM Revision 21a97d1e (ceph): mon: don't leak MAuth
Sage Weil
05:15 AM Revision d57b6296 (ceph): crypto: don't leak memory in CryptoAES::encrypt()
Sage Weil
05:15 AM Revision 520a2c37 (ceph): crypto: don't clean up EVP table on every decrypt()
Don't think that's appropriate? And certainly doesn't happen for the
encrypt() case.
Sage Weil
01:47 AM Revision ba63a7a4 (ceph): Removed all copies of the whoami value
Paul Chiang

06/06/2010

10:24 PM Bug #181 (Resolved): monitor eats 8G of memory before beeing oom killed
fixed two leaks, commit:21a97d1e7ce329fac07b5e69362d27bb7edb31f5 and commit:d57b629699158abacdcc3880d43111291a6fdf77 ... Sage Weil
05:08 AM Bug #181 (Resolved): monitor eats 8G of memory before beeing oom killed
Hi, I installed the latest ceph 0c38b3d63dd24fb8b86283de5e00f260a03d4024, and the latest qemu-rbd e6d8dbce416bfdba880... ar Fred
02:08 PM Feature #138 (In Progress): Try out tcmalloc
Okay, this is definitely something we need to look into more. I tried running with tcmalloc and the standard malloc, ... Greg Farnum
06:23 AM Cleanup #146: Complete build options for Pthread API
Would you like to integrate the appended changes into your source code repository? Markus Elfring

06/05/2010

12:01 AM Revision 989c9ee1 (ceph): throttle: use signed counters and assert that count never drops below 0
Greg Farnum

06/04/2010

11:42 PM Revision 46040a5f (ceph): msgr: switch to get/set functions for Message:throttler
Greg Farnum
11:42 PM Revision 246415b3 (ceph): osd: fix compile issues
Greg Farnum
11:42 PM Revision 3b333f7a (ceph): msgr: put throttler usage on Message destruct
Greg Farnum
11:42 PM Revision 800da082 (ceph): msgr: Fix uses of get_[data, payload, middle] to use throttling-aware f...
Greg Farnum
11:42 PM Revision 0d4bdfac (ceph): osd: add osd_client_message_size_cap option to config; default 500MB
And change the name in cosd to be that Greg Farnum
11:32 PM Revision a76d8fc6 (ceph): objectcacher: cleanup formatting
Sage Weil
11:32 PM Revision dff7cb33 (ceph): objectcacher: fix stat accounting when resizing bufferheads
Must keep stats in mind when adjusting bufferheads! Sage Weil
11:32 PM Revision 0c38b3d6 (ceph): objectcacher: add verify_stats() debugging helper
Sage Weil
11:32 PM Revision 12a5d7b2 (ceph): objectcacher: match states before merging in map_write
The caller is going to set us to dirty, so we don't care what state we
have, so long as the left and right bits we're...
Sage Weil
11:32 PM Revision 462552ab (ceph): objectcacher: fix use of invalid iterator in map_write()
The p points to bh, which is removed by merge_left. Move it back to final,
so we can advance to the new next a few l...
Sage Weil
11:23 PM Revision 522c12e5 (ceph): osd: fix rollback when head points at the rolled back snapshot
Yehuda Sadeh
08:23 PM Revision 33b947cd (ceph): msg: remove copy_payload and copy_data functions; change set to use thr...
Greg Farnum
08:10 PM Revision 8d1e7739 (ceph): Merge branch 'rbd' into unstable
Sage Weil
08:09 PM Revision 7b6aea6a (ceph): osd: clean up rollback debug output
Sage Weil
08:01 PM Revision 1b5920f8 (ceph): uclient: handle inode with no caps from mds
This happens when you readdir and some inodes are in a different snaprealm. Sage Weil
07:57 PM Revision e79a3fae (ceph): osd: filter_xattrs on a rollback op
Greg Farnum
07:55 PM Revision 48555f52 (ceph): osd: fix naughty iterator usage after invalidating it
Greg Farnum
07:49 PM Revision c730b85c (ceph): osd: add filter_xattrs function to remove non-user xattrs from a map of...
Greg Farnum
07:49 PM Revision a70a3668 (ceph): osd: _make_clone now properly duplicates xattrs
Greg Farnum
07:04 PM Revision f60be8e3 (ceph): progress
Greg Farnum
06:07 PM Revision 84b279a4 (ceph): mds: fix straydn->first part deux
9ed0c30ecf6611193db52e1facc1f46b37f04bc4 forgot to remove the old code. Sage Weil
04:45 PM Bug #173: Throttle client requests on OSD
Pushed to msgr, commit 800da082ad8aad032ff5299b5ad0c05bc378a1e3.
This definitely, definitely does not set a hard m...
Greg Farnum
04:31 PM Bug #178 (Resolved): cfuse fails dbench
fixed this, and some related ObjectCacher bugs. dbench exercised a lot of code that simple tests in the past had not... Sage Weil
09:50 AM Bug #178 (Resolved): cfuse fails dbench
From Thomas Mueller <thomas@chaschperli.ch>:
cfuse fails dbench with ceph.git testing. See logs:
Debian testing...
Sage Weil
02:04 PM Bug #179: corrupted LogEntry in mon data
Doh.. so it looks like the piece of info I need was in the logm directory. If you still have it, great. If not, I c... Sage Weil
11:13 AM Bug #179 (Closed): corrupted LogEntry in mon data
this is after a restart due to the update of all ceph daemons to c4e6482d302aa288031ced6cd845d60ba655e5c8
#0 0x...
ar Fred
12:58 PM Bug #176: osd: make_clone needs to duplicate xattrs
Pushed to unstable; and updated _rollback_to in the rbd branch to filter properly. Greg Farnum
11:50 AM CephFS Bug #165 (Resolved): cmds crash
Sage Weil
11:39 AM CephFS Bug #165: cmds crash
Indeed, can't reproduce the crash with the latest unstable.
I did 3-4 restart of all mds and it worked fine, that'...
ar Fred
11:43 AM Feature #180 (Resolved): Return ENOTEMPTY when trying to remove a directory which has a snapshot
When the following command sequence is used, valuable data in a snapshot could go lost:... Wido den Hollander
10:37 AM CephFS Bug #172: OSD and MDS crash on rm -r
Sage Weil wrote:
> Wido den Hollander wrote:
> > Today i ran the same test again, almost the same result.
> >
> ...
Wido den Hollander
09:53 AM CephFS Bug #172: OSD and MDS crash on rm -r
Wido den Hollander wrote:
> Today i ran the same test again, almost the same result.
>
> Before i ran the test i ...
Sage Weil
06:22 AM CephFS Bug #172: OSD and MDS crash on rm -r
Today i ran the same test again, almost the same result.
Before i ran the test i created a fresh fs with mkcephfs....
Wido den Hollander
03:25 AM CephFS Bug #177 (Resolved): unlinked inode during try_to_expire()
After trying to recover from bug #172 my MDS started to crash on their recovery.
Both mds0 and mds1 crashed while ...
Wido den Hollander
01:22 AM Revision 97f00aec (ceph): debugging output
Greg Farnum
01:22 AM Revision d3863272 (ceph): rados: print out pool instead of object
Greg Farnum
12:33 AM Revision 9ead80f8 (ceph): mds: fix CDir::take_sub_waiting vs dnwaiter pin
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:33 AM Revision 074a9b10 (ceph): mds: make discover work for multiversion inodes (e.g. dirs)
If we don't have the specific snap, look up the head and see if it's
multiversion.
This doesn't give us a "range" lo...
Sage Weil
12:33 AM Revision ec0aa43a (ceph): mds: don't export stray (~mdsfoo/stray), and ignore in balancer
We _must_ keep mdsdir and stray on local mds for normal operations.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
12:33 AM Revision 9ed0c30e (ceph): mds: set straydn first to match inode on unlink
Sage Weil
12:33 AM Revision c4e6482d (ceph): mds: only purge dentries with no extra refs (besides dirty)
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:33 AM Revision 551a12f5 (ceph): mds: fix cap clone logic to look at matching first, not last
The cap->client_follows is set to follows+1 by flushsnap, since the real
follows value isn't convenient. But it is e...
Sage Weil
12:33 AM Revision 791ca282 (ceph): mds: kill open_foreign_stray; but open remote mdsdirs instead
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

06/03/2010

11:45 PM Revision ff0e8715 (ceph): libatomic: fix assert.h compilation
Yehuda Sadeh
11:40 PM Revision 900d4c6c (ceph): msgr: add Throttle pointer to Policy
Greg Farnum
11:20 PM Revision 1facfe0f (ceph): Merge branch 'unstable' into msgr
Greg Farnum
09:14 PM Revision 62b900f5 (ceph): mds: open past snap parents at end of rejoin phase
We really need past parents open before we go active or else anything
that needs to build a snap context will fail.
Sage Weil
09:12 PM Revision 3989ae40 (ceph): osd: make sure we don't return EAGAIN to client
Greg Farnum
08:48 PM Revision 26449e7c (ceph): mdsmap: show individual mds states in summary
Sage Weil
08:26 PM Revision 09185a00 (ceph): osd: improve snap_trimmer debug output
Sage Weil
08:24 PM Revision 2b33d99b (ceph): mds: another cap_exports message/mdcache encoding fix
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:08 PM Revision 55da048f (ceph): mds: only adjust dn->first on lock msg if !multiversion
The multiversion dn->first references a range of inode versions; don't
drag it forward. Fixes 38cb2403c043e6676b5631...
Sage Weil
07:03 PM Revision 5f905961 (ceph): mds: more fix cap_exports typing
Sage Weil
06:59 PM Revision 054669ab (ceph): mds: fix scatter_nudge infinite loop
Sage Weil
06:08 PM Revision 40b23227 (ceph): mds: fix ESessions type
Sage Weil
06:04 PM Revision 5cd7919a (ceph): mds: drag in->first forward with straydn in handle_dentry_unlink
Sage Weil
05:38 PM Revision 394d9c3d (ceph): mds: fix anchorclient dup lookups, again
Sage Weil
05:17 PM Revision 980f234f (ceph): mds: only log successful requests as completed
Sage Weil
05:09 PM Revision fa1e5603 (ceph): mds: anchor dir on mksnap
Sage Weil
04:45 PM Revision c09d610c (ceph): mkcephfs: error when creating journal file in a directory that differs...
mkcephfs creates osd data directory automatically, but it doesn't create a
directory for the osd journal file.
When ...
CC Lien
04:40 PM Revision 5dd4a2d6 (ceph): mds: fix mismatched cap_exports type between msg and MDCache
The types need to match because they are encoded/decoded interchangeably.
See MMDSCacheRejoin::decode() and MDCache::...
Sage Weil
04:33 PM Revision 609e6572 (ceph): mds: fix trim_unlinked iterator badness
We may remove the next inode in the map. Queue up unlinked roots first,
which we know remove_inode_recursive() won't...
Sage Weil
04:28 PM Revision 915ab3ca (ceph): mds: define MDS_REF_SET in unstable
Sage Weil
04:27 PM Revision ef095e1f (ceph): mds: clear dirtyscattered in remove_inode()
Sage Weil
04:17 PM Revision 26822162 (ceph): mds: allow dup lookups in anchorclient
It's not practical for callers to avoid dups, particularly since they may
be unaware of each other. And it's trivial...
Sage Weil
04:13 PM Bug #176 (Resolved): osd: make_clone needs to duplicate xattrs
Sage Weil
04:01 PM Revision 8a2a9bd6 (ceph): assert: fix assert vs atomic_ops.h breakage
This was causing us to use the system assert, not the ceph one. Sage Weil
03:23 PM Feature #175 (Resolved): Make the system large-object safe
This will require extensive work throughout the system, especially in OSD recovery code. Right now, Ceph assumes that... Greg Farnum
03:20 PM rgw Feature #174 (Resolved): Support large files better
Right now, the rados gateway just dumps a given file into the RADOS store as a single large dump. If somebody's stori... Greg Farnum
03:19 PM Revision f5ccc662 (ceph): mds: ensure past snap parents get opened before doing file recovery
Otherwise we can fail to get_snaps() when we start the recovery:
#0 0x00007fa037625f55 in *__GI_raise (sig=<value o...
Sage Weil
03:17 PM Bug #173 (In Progress): Throttle client requests on OSD
Yep, working on it now. Greg Farnum
02:44 PM Bug #173 (Resolved): Throttle client requests on OSD
See Jim Schutt's issue on the mailing list and simple patch to illustrate the problem.
Namely, overzealous clients c...
Greg Farnum
03:16 PM Bug #149: Stale NFS Handle when copying from snapshot
Putting this back in the mix since Sage has been handling a lot of bugs from this test case today. Greg Farnum
03:04 PM Revision c0e9d210 (ceph): mds: relax lock state before encoding export (and lock state)
We can't fuss with lock state in the finish method because we already
encoded the old state to the new auth, and we a...
Sage Weil
01:54 PM Linux kernel client Bug #111 (Resolved): handle EAGAIN from osd
Looks to me like this can't actually happen. The function ReplicatedPG::find_object_context can return EAGAIN, and an... Greg Farnum
01:39 PM CephFS Bug #170 (Rejected): null pointer dereference in journal_cow_dentry causes assertion failure
Sage Weil
10:33 AM CephFS Bug #170: null pointer dereference in journal_cow_dentry causes assertion failure
Unfortunately I don't -- on Yehuda's suggestion I recompiled with optimization off and have been trying to reproduce ... Greg Farnum
01:39 PM CephFS Bug #171 (Resolved): mds: MDSTableClient::_logged_ack(version_t) FAILED assert(pending_commit.cou...
Sage Weil
07:42 AM CephFS Bug #171: mds: MDSTableClient::_logged_ack(version_t) FAILED assert(pending_commit.count(tid))
fixed by commit:3768ef941e67d17ecd710994b2c88960ba60627d Sage Weil
06:07 AM Revision 3768ef94 (ceph): mds: do not bother tableserver until it is active
We resend these requests when the TS does go active, and if we send dups
things get all screwed up (see partial log b...
Sage Weil
05:14 AM Revision 7c0df054 (ceph): mds: do not reset filelock state when checking max_size during recovery
This was broken by d5574993 (probably, that commit fixed a similar
problem). The rejoin_ack initializes replica stat...
Sage Weil
05:11 AM CephFS Bug #172 (Closed): OSD and MDS crash on rm -r
I'm still using my test script which unpacks the kernel source and then removes it again with a few steps in between.... Wido den Hollander
04:33 AM Revision 15c6651f (ceph): mds: lock->sync replica state is lock, not sync
It's not readable yet. And after the lock->sync gather completes we send
out a SYNC.
Fixes failed assertion like:
...
Sage Weil
02:37 AM Revision 1c930f9b (ceph): msg: add missing msg_types.cc
Sage Weil

06/02/2010

11:09 PM Linux kernel client Bug #111: handle EAGAIN from osd
I agree. Though we should differentiate between two cases. One is that we initiate the EAGAIN (e.g., when reached a l... Yehuda Sadeh
11:00 PM Linux kernel client Bug #111: handle EAGAIN from osd
Yehuda Sadeh wrote:
> We should make the client handle it, but we should also try to make sure that the osd doesn't ...
Sage Weil
10:51 PM Linux kernel client Bug #111: handle EAGAIN from osd
We should make the client handle it, but we should also try to make sure that the osd doesn't ever return it (at leas... Yehuda Sadeh
10:56 PM CephFS Bug #171 (Resolved): mds: MDSTableClient::_logged_ack(version_t) FAILED assert(pending_commit.cou...
... Sage Weil
10:40 PM Linux kernel client Bug #38 (Resolved): rm -r failure
I'm going to chalk this one up to commit:13a4214cd9ec14d7b77e98bd3ee51f60f868a6e5 (the d_subdirs ordering problem) an... Sage Weil
10:37 PM Linux kernel client Bug #69: ceph: ffff88001976ba50 auth cap (null) not mds0 ???
For a multi-mds system, this can be caused if we are between an export and import on a cap.
But when I saw this th...
Sage Weil
10:24 PM CephFS Bug #165: cmds crash
This looks an awful lot like it might be fixed by commit:15c6651ff57b88722b5c896f5698bf1d033e1f98. And possibly prev... Sage Weil
01:31 AM CephFS Bug #165: cmds crash
just got the same crash of mds2 using b441fbdc9fdca271ed3bd100fc3c98c800b509b1
please find the full logs of each m...
ar Fred
09:36 PM CephFS Bug #170: null pointer dereference in journal_cow_dentry causes assertion failure
this is actually a failed assertion, not a null deref. it looks like gdb is having trouble resolving the symbols pro... Sage Weil
03:04 PM CephFS Bug #170 (Rejected): null pointer dereference in journal_cow_dentry causes assertion failure
I've seen this a few times today.
Using the latest unstable servers(08afc8df680dc0cd5ad26f3f89152aa25a72b639), and m...
Greg Farnum
07:40 PM Revision 5262a96a (ceph): mds: add export_dir command
Sage Weil
07:40 PM Revision 4075b95c (ceph): mds: add MDCache::cache_traverse()
Sage Weil
06:50 PM Revision a3323c98 (ceph): tcp: parse ipv4 and ipv6 addresses
Sage Weil
06:50 PM Revision 0d1e5dbf (ceph): move addr parse() into entity_addr_t
Sage Weil
06:50 PM Revision eac36cb5 (ceph): initscript: unmount btrfs if we mounted it
Sage Weil
06:34 PM Revision 08afc8df (ceph): mon: fix unsynchronized clock logic;
change output for clarity Greg Farnum
11:19 AM Feature #169 (Resolved): osd: start up despite corrupted pg log(s)
Catch decoding, memory alloc exceptions, and skip corrupt pgs so the osd can still start up. Log the errors. Sage Weil
10:02 AM Bug #149 (In Progress): Stale NFS Handle when copying from snapshot
Greg Farnum

06/01/2010

11:34 PM Revision b441fbdc (ceph): mds: lookup exact snap dn on import
Sage Weil
11:33 PM Revision 38cb2403 (ceph): mds: update dn->first too when lock state adjusts inode->first
This keeps dn->first in sync with inode->first Sage Weil
10:23 PM Revision 9248cd9e (ceph): mds: don't change lock states on replicated inode
The reconnect will infer some client caps, which will affect what lock
states we want. If we're not replicated, fine...
Sage Weil
10:02 PM Revision afadb122 (ceph): mds: fix root null deref in recalc_auth_bits
Root may be null if we don't have any subtrees besides ~mds$id. Sage Weil
09:14 PM Revision 364f3cb0 (ceph): mds: adjust subtree map when unlinking dirs
Otherwise we get subtree bounds in the stray dir and get confused down
the line.
Sage Weil
08:18 PM CephFS Bug #30: multimds: slave_request on getattr
I think the problem is that we authpin anything we rdlock... is that really necessary? Sage Weil
07:57 PM Revision c4bbb000 (ceph): mds: discover snapped paths on retried ops
This is intended to mitigate a livelock issue with traversing to snapped
metadata. The client specifies all snap req...
Sage Weil
06:39 PM Revision 464e46c8 (ceph): mon: add wiggle room for clock synchronization check
Greg Farnum
05:30 PM Revision 7f8a743c (ceph): mds: add case for CEPH_LOCK_DVERSION to LockType
Greg Farnum
03:23 PM CephFS Bug #165: cmds crash
I pushed a fix to unstable that _might_ fix the root cause of this, but it's hard to say. Can you leave 'debug mds =... Sage Weil
12:58 PM CephFS Bug #165: cmds crash
ar Fred wrote:
> A bit later, I restarted the whole cluster, mds0 and mds2 crashed with the same stack trace, mds1 w...
Sage Weil
03:08 PM Linux kernel client Cleanup #168 (Closed): new truncate sequence
The new truncate sequence was merged for 2.6.35-rc1. (->truncate is deprecated?)
We need to see what updates (i...
Sage Weil
03:02 PM CephFS Bug #167 (Resolved): mds crash
fixed by commit:afadb1224516fc3a615d0cc51fe7560fcc0b5e7c Sage Weil
01:21 PM CephFS Bug #167 (Resolved): mds crash
Core was generated by `/usr/bin/cmds -i r1-11 -c /tmp/fetched.ceph.conf.5518'.
Program terminated with signal 11, Se...
ar Fred
12:48 PM Linux kernel client Bug #166 (Can't reproduce): Failing some pjd tests?
Best guess is an unsychronized client/server clock. Greg Farnum
11:55 AM Linux kernel client Feature #42: Resize of rbd image
There is a refresh /sys/class/.. interface, however, resizing of an image should be lock protected, and probably shou... Yehuda Sadeh
10:28 AM Linux kernel client Bug #164 (Resolved): memory leak in statfs
Fixed.
commit: 5d97634a3b824ed746ba0d5441bf3d1d65f490a0
Yehuda Sadeh

05/31/2010

06:22 AM CephFS Bug #165: cmds crash
A bit later, I restarted the whole cluster, mds0 and mds2 crashed with the same stack trace, mds1 was fine. ar Fred
03:04 AM CephFS Bug #165 (Resolved): cmds crash
one of my 3 mds crashed quickly after startup of the whole cluster:
this is using latest unstable (00c3dafd5afe6461f...
ar Fred
03:41 AM Linux kernel client Bug #166 (Can't reproduce): Failing some pjd tests?
Failed Test Stat Wstat Total Fail Failed List of Failed
----------------------------------------...
Greg Farnum

05/30/2010

01:36 AM Revision 00c3dafd (ceph): xlist: add assert to catch invalid iterator usage
Greg Farnum

05/29/2010

10:07 PM Linux kernel client Bug #144: GPF at con_close_socket+0x40/0x9f
Yeah, i think this is related to #163, but i still don't know how that would cause this problem. The basic issue is ... Sage Weil
09:58 PM Linux kernel client Bug #163 (Resolved): put_osd on umount can use client after free
fixed by commit:a922d38fd10d55d5033f10df15baf966e8f5b18c Sage Weil
04:40 PM Linux kernel client Bug #163: put_osd on umount can use client after free
That would explain bug #144:
[12836.065773] Last user: [<ffffffffa01106b9>](put_osd+0x3f/0x82 [ceph])
Yehuda Sadeh
09:25 AM Linux kernel client Bug #163 (Resolved): put_osd on umount can use client after free
the connection can be put after ceph_client is freed, at which point this will dereference a bad pointer... Sage Weil
09:57 PM Linux kernel client Bug #164 (Resolved): memory leak in statfs
workload dbench
master branch...
Sage Weil
06:06 PM Revision 79b39625 (ceph): ObjectCacher: do not try to deref an invalidated xlist::iterator
Fixes #159 Greg Farnum
11:09 AM Bug #159 (Resolved): cfuse abort on file delete (0.20.2)
All right, fixed in 0d437a205b4c239cb85f08ad6976868d84bf9ab4.
The ObjectCacher wasn't properly cleaning up objects i...
Greg Farnum

05/28/2010

08:21 PM Revision 83094d97 (ceph): paxos: fix store_state fix
Sage Weil
07:59 PM Revision 62e290e8 (ceph): msgr: print bind errors to stderr
Sage Weil
07:56 PM Revision 6060bdd8 (ceph): rbd: some fixes to conform with qemy code style
Yehuda Sadeh
07:50 PM Revision 3a705ded (ceph): paxos: cleanup
Sage Weil
07:48 PM Revision 3c3e82e0 (ceph): paxos: only store committed values in store_state
The uncommitted value is handled specially by handle_last() Sage Weil
07:41 PM Revision 187011cd (ceph): initscript: fix typo with $lockfile stuff
Sage Weil
07:37 PM Revision 6b72d70b (ceph): paxos: set last_committed in share_state()
It wasn't getting set for LAST message, which broke recovery somewhat.
Broken by 8e76c5a1d827e01f77149245679bd00ba27...
Sage Weil
01:44 PM Linux kernel client Bug #162 (Can't reproduce): list bug during shrink_dcache_for_umount
ceph3, rsync workload.
unstable circa 5/25...
Sage Weil
01:12 PM Linux kernel client Bug #141 (Resolved): ERESTARTSYS on mds update operations cause bad results
Sage Weil
10:49 AM Linux kernel client Bug #141: ERESTARTSYS on mds update operations cause bad results
I assume that switching to wait_for_completion_killable() fixed this one?
related commit: 0ec773c7f9ecbff4b75c3c68...
Yehuda Sadeh
12:58 PM Bug #158 (Resolved): cmon silently fails if addr is wrong in ceph.conf (0.20.1)
fixed by commit:62e290e87fa2ce5b33a847e0837b2198bac6842b Sage Weil
08:42 AM Bug #158 (Resolved): cmon silently fails if addr is wrong in ceph.conf (0.20.1)
Sage Weil
12:47 PM Bug #161: Monitor crashes on begin
actually, commit:3c3e82e0f5feacef5f191a5ce34bf96c15fdaed5 Sage Weil
12:37 PM Bug #161 (Resolved): Monitor crashes on begin
fixed by commit:6b72d70be42823e32bb8bcec033ac3a62943e089 Sage Weil
11:39 AM Bug #161 (Resolved): Monitor crashes on begin
On an assert:
assert(begin->last_committed == last_committed);
(gdb) bt
#0 0x00007f39eacfdf45 in *__GI_raise (sig...
Greg Farnum
12:07 PM Linux kernel client Bug #148 (Resolved): iozone failure
yeah, this has survived 24 hours, whereas before it was failing after an hour or two. Sage Weil
12:00 PM Linux kernel client Bug #144: GPF at con_close_socket+0x40/0x9f
What was the specific scenario? Can it be reproduced? Yehuda Sadeh
11:40 AM Bug #159 (In Progress): cfuse abort on file delete (0.20.2)
Greg Farnum
08:43 AM Bug #159 (Resolved): cfuse abort on file delete (0.20.2)
Sage Weil
11:17 AM Linux kernel client Bug #150: order:1 page allocation failure
Too many dirty pages? Too many pending osd requests?
We should probably try to get how many osds requests were in-fl...
Yehuda Sadeh
11:07 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
nfs uses the rpc code, which, if I understand it correctly initializes a work queue for socket allocation and connect... Yehuda Sadeh
10:36 AM Feature #160 (Resolved): rbd revert-to-snapshot
Need to fully implement revert-to-snapshot functionality. Currently there's a partial implementation in the rbd-class... Yehuda Sadeh
12:12 AM Revision 8c448257 (ceph): osd: fix compilation
Yehuda Sadeh

05/27/2010

11:32 PM Revision 4b797745 (ceph): mds: fix null dn deref during anchor_prepare
Sage Weil
11:14 PM Revision bb8b1398 (ceph): mds: fix invalid use of connection
Greg Farnum
10:25 PM Revision 93804416 (ceph): mds: switch some session->inst send_message calls to session->connection;
switch an MDS broadcast from instance-based to Connection *-based send. Greg Farnum
10:02 PM Support #156: Example debug levels in sample.ceph.conf
Looks quite useful, thank you. There are also some logging directories available, e.g. logger sym. May those be add... Alex Nelson
09:49 PM Support #156 (Resolved): Example debug levels in sample.ceph.conf
Sage Weil
09:59 PM Revision 330e1e21 (ceph): osd: warn, don't crash, on purged_snaps shrinkage
Sage Weil
09:59 PM Revision a1a13502 (ceph): mkcephfs: pass -c to cmon --mkfs
Sage Weil
09:59 PM Revision 0a1d526b (ceph): osdmap: assert maxrep >= minrep
Sage Weil
09:59 PM Revision 594d4568 (ceph): osdmaptool: include raw, up, acting mappings
Sage Weil
09:59 PM Revision 892a0e25 (ceph): config: parse in $host from conf file
So you can do stuff like
log dir = /data/$host
Sage Weil
09:58 PM Revision d2c40055 (ceph): initscript: incorporate Josef's fedora fixes
Add 'status' command.
Add chkconfig line.
Do lockfile stuff only if /var/run/subsys exists.
Still specifying the run...
Sage Weil
09:31 PM Revision b83b0733 (ceph): rados: add op for rollback
Greg Farnum
09:31 PM Revision e935b8ec (ceph): osd: add rollback to ceph_osd_op_name
Greg Farnum
09:31 PM Revision 23336561 (ceph): osd: create _delete_head function, move CEPH_OSD_OP_DELETE handling to it.
Greg Farnum
09:31 PM Revision be1030d8 (ceph): rados: add snap.snapid to ceph_osd_op, to replace use of MOSDOp's snapid
Greg Farnum
09:31 PM Revision b82ba820 (ceph): osd: implement rollback functionality
Greg Farnum
09:31 PM Revision 91fb924a (ceph): objecter: add rollback_object function, which rolls back a single objec...
Greg Farnum
09:31 PM Revision 0292f2e6 (ceph): librados: add rollback_object functions.
Greg Farnum
09:31 PM Revision 9dd35584 (ceph): rados: add rollback functionality to rados
Greg Farnum
09:31 PM Revision bd9cf968 (ceph): osd: set clone_overlaps properly on rollback
Greg Farnum
09:31 PM Revision edffc122 (ceph): librados: update C header file to proper name for rollback function
Greg Farnum
09:31 PM Revision 7cc3ab62 (ceph): rados.h: should use __le64 instead of __u64
Yehuda Sadeh
09:26 PM Linux kernel client Bug #148: iozone failure
I think this may have been caused by the mds request signal handling? It isn't happening on the latest unstable. Sage Weil
09:24 PM Revision 08f69663 (ceph): ceph.spec: build-required libatomic_ops-devel, not libatomic_ops
And no perl-devel. Sage Weil
09:23 PM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
We could have a pool of preallocated sockets.. but that could be exhausted.
Or duplicate a bunch of socket creation ...
Sage Weil
06:23 PM Revision f95e1e0a (ceph): mds: add Connection * to Session
Greg Farnum
06:22 PM Revision 53523267 (ceph): Merge branch 'unstable' into msgr
Greg Farnum
04:34 PM CephFS Feature #45: Investigate adding Connection * to mds Session
Added Connection *; now testing my send_message conversions to make sure I'm not trying to use any Connection *s whil... Greg Farnum
02:57 PM Linux kernel client Bug #157 (Resolved): fix auth_x memory leak
fixed by 'ceph: fix leak of osd authorizer'. the osd_client put_osd() didn't clean up the ceph_authorizer. Sage Weil
01:14 PM Linux kernel client Bug #157 (Resolved): fix auth_x memory leak
this is on ceph1, qa loopall.sh workload, unstable branch.... Sage Weil
04:47 AM Revision a3dc4bda (ceph): sample.ceph.conf: include debug options, commented out
Sage Weil

05/26/2010

11:58 PM Revision 78375cfd (ceph): mon: add crush_rule data member to MPoolOp; use it in new pool creation...
Greg Farnum
11:58 PM Revision a9e17271 (ceph): objecter: add optional crush_rule parameter; set in pool_op_submit as n...
Greg Farnum
11:58 PM Revision 8044f7ac (ceph): librados: add crush_rule parameter to create_pool functions
Greg Farnum
11:58 PM Revision 05256bb0 (ceph): rados: you can now set the crush rule to use when creating a pool
Greg Farnum
09:54 PM Support #156 (In Progress): Example debug levels in sample.ceph.conf
We should still add a wiki page with debugging information. I can include info about debug options, and also other s... Sage Weil
09:53 PM Support #156 (Resolved): Example debug levels in sample.ceph.conf
good idea. commit:a3dc4bdac2057c2d0fcd27cab9c416c5089b4c76 Sage Weil
05:20 PM Support #156 (Resolved): Example debug levels in sample.ceph.conf
The debug options for ms, osd, etc. could afford to be listed in the sample.ceph.conf file, even commented out. Ther... Alex Nelson
09:47 PM Revision a92df208 (ceph): mds: include LAZYIO in CEPH_CAP_ANY set
Sage Weil
09:47 PM Revision a13b5b1c (ceph): mds: include LAYZIO cap in sync->mix and mix->sync transitions
Sage Weil
09:47 PM Revision 297d3ecd (ceph): client: update ioctl.h (lazyio, invalidate_range)
Sage Weil
09:47 PM Revision 648ce976 (ceph): mds: LAYZIO is not liked, but it is allowed
Sage Weil
09:35 PM Revision 9b4d25b9 (ceph): mon: detect and warn on clock synchronization problems;
change MMonPaxos::lease_expire to lease_timestamp Greg Farnum
09:35 PM Revision 75de2723 (ceph): mon: warn to log, not just dout, on clock drift
Greg Farnum
09:11 PM Revision bee74a1e (ceph): ceph: add conversion to qemu coding style
Hi Yehuda,
I've added a small hack to make push_to_qemu.pl convert tabs to spaces.
Christian
Christian Brunner
05:59 PM Revision a1c99811 (ceph): paxos: use helper to store committed state; fix master mon catch up usi...
The catch up logic in handle_last didn't handle the stashed state, so we
crashed and burned if it was the master that...
Sage Weil
05:01 PM Revision c0df916a (ceph): cfuse: bail out on mount() errors
Sage Weil
04:58 PM Feature #135 (Resolved): Specify crush rules
Added crush_rule parameters/data members as appropriate to OSDMonitor pool creation functions, objecter, librados, an... Greg Farnum
02:47 PM Feature #33: O_LAZY or equivalent
Sage Weil
02:32 PM Feature #105 (Resolved): mon: warn on clock drift
It warns to dout and the logger:
1) when the slave notices the leader is behind by >(mon_lease - latency), or
2) wh...
Greg Farnum
11:33 AM CephFS Bug #52: mds: dentry versionlock
merged into unstable Sage Weil
11:32 AM Bug #37 (Rejected): osd: recover missing clone object
this could have been related to the osd recovery fixes (wrt snapdir). haven't seen this in weeks. dropping it for now. Sage Weil
11:03 AM Feature #112: osd: snap rollback object op
Greg Farnum
11:01 AM Bug #151 (Resolved): cmon crash in PGMonitor::update_from_paxos at mon/PGMonitor.cc:90
fixed by commit:a1c99811bae2199a4ef3eef8681ac70ccfa128f5 Sage Weil
06:39 AM Bug #151 (Resolved): cmon crash in PGMonitor::update_from_paxos at mon/PGMonitor.cc:90
one of my 3 monitors crashed today, the whole ceph cluster was idle at that time.
cmon compiled at f7708dea1f, ple...
ar Fred
10:44 AM Documentation #155 (Resolved): document ceph auth
Sage Weil
10:14 AM Feature #154 (Closed): support IPv6 addresses
most of the infrastructure is there... Sage Weil
10:02 AM Bug #152 (Resolved): cfuse problem
fixed by commit:c0df916a790f9560d487c74c22152a7e16e6f226 Sage Weil
06:50 AM Bug #152 (Resolved): cfuse problem
Hi,
compiled at 7ecf493fd2c
it seems cfuse starts fuse when it fails connecting to a monitor (which is dead in ...
ar Fred
10:00 AM Bug #145: Check build dependencies for FastCGI
Will any adjustments be needed to check for required header files also in a subdirectory like "fastcgi"?
Does a conf...
Markus Elfring
09:54 AM CephFS Bug #153 (Resolved): mds: fix snap dentry replication vs readdir on frag auth
The request may be something like #123//foo/some/path/to/dir, dir lives in the stray dir, and is auth on another node... Sage Weil

05/25/2010

11:40 PM Revision 32d34f06 (ceph): Merge branch 'lazyio' into unstable
Conflicts:
src/mds/locks.c
Sage Weil
09:44 PM Revision e6b9055f (ceph): interval_set: fix union_of, intersection_of size accounting
Sage Weil
08:47 PM Revision 2b9ef644 (ceph): init-ceph: use = not == for comparison operator
Sage Weil
08:13 PM Revision fc228b5b (ceph): Merge branch 'mds_dentries' into unstable
Sage Weil
08:01 PM Revision 701d2672 (ceph): mds: better debugging on rmdir
Sage Weil
08:01 PM Revision 29ca21f5 (ceph): mds: fix scatterlock gather, writebehind
We stopped overloading the virutal is_updated() when we renamed to
is_dirty.
broken by 7f19ee1ac36095cd4d4c169858d93...
Sage Weil
03:42 PM Linux kernel client Bug #143 (Resolved): avoid resending requests on mon ticket renewal
fixed by 'ceph: do not resend mon requests on auth ticket renewal' and 'ceph: renew auth tickets before they expire' Sage Weil
02:37 PM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
What it actually means is that sock_alloc_inode is being called under the kswapd context and it does an allocation wi... Yehuda Sadeh
10:34 AM Linux kernel client Bug #147 (Resolved): lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and c...
... Sage Weil
02:15 PM Cleanup #146: Complete build options for Pthread API
If there is an environment where the -lpthread isn't sufficient, sure. Send a patch! :) Sage Weil
06:15 AM Cleanup #146 (Rejected): Complete build options for Pthread API
Would you like to combine "your check for this programming interface":http://ceph.newdream.net/git/?p=ceph.git;a=blob... Markus Elfring
02:07 PM Bug #145: Check build dependencies for FastCGI
Something nonstandard with Suse then? On debian it's
fatty:src 02:05 PM $ dpkg -S /usr/include/fcgiapp.h
libfcgi...
Sage Weil
01:41 AM Bug #145: Check build dependencies for FastCGI
I wonder why a header is not found because the file "/usr/include/fastcgi/fcgiapp.h" is available from the package "F... Markus Elfring
02:01 PM Linux kernel client Bug #150 (Can't reproduce): order:1 page allocation failure
workload was rsync to a ceph mount.
ceph3 mounting cosd0:/
not sure which version. probably unstable from last wee...
Sage Weil
11:09 AM Bug #149: Stale NFS Handle when copying from snapshot
Reproduced on version:
kclient: current unstable branch (240ed68eb567d80dd6bab739341999a5ab0ad55d)
server: current ...
Yehuda Sadeh
11:05 AM Bug #149 (Closed): Stale NFS Handle when copying from snapshot
Happens in the following scenario:
mount ceph
cd /mnt
mkdir a; cd a
tar xvfj ~/linux-2.6.xx.tar.bz2 (^C after a f...
Yehuda Sadeh
10:37 AM Bug #134 (Resolved): rbdtool segfaults when listing
Might have been due to protocol change in the pool-op that didn't get a protocol version bumped up. Resolving it unti... Yehuda Sadeh
10:35 AM Linux kernel client Bug #148 (Resolved): iozone failure
on ceph4, running
* rbd 3a6e756 ceph-rbd: snapshots support...
Sage Weil
10:32 AM Linux kernel client Bug #106 (Resolved): msgpool depletion?
Sage Weil
10:28 AM Linux kernel client Bug #106: msgpool depletion?
On what version did it happen? Do we have any reproducible scenario? Yehuda Sadeh

05/24/2010

11:51 PM Revision f8f9e6c4 (ceph): mds: make export targets stay in mdsmap for a while
This limits the mdsmap churn some. Keep old targets around for at least
min-max iterations before removing them.
Sage Weil
11:51 PM Revision 7f0ef1cd (ceph): mds: balancer cleanup
Sage Weil
11:50 PM Revision da42d061 (ceph): mds: warn on dn release that dne
Sage Weil
11:11 PM Revision 06b86ea4 (ceph): rbd: modify rbd on-disk header
Yehuda Sadeh
10:58 PM Revision 7cf48614 (ceph): rbd: fix push_to_qemu.pl
Yehuda Sadeh
10:56 PM Revision be082f0d (ceph): filestore: make mkfs() zap any file or dirs it finds
Sage Weil
10:56 PM Revision 7113775b (ceph): mon: roll mkmonfs functionality into cmon --mkfs
Sage Weil
10:55 PM Revision 5e8a6096 (ceph): rbd: modify header, add utility to ease sync with qemu tree
Yehuda Sadeh
09:00 PM Revision a9b494c4 (ceph): mon: no need for 'whoami' file in store
The monitor rank is provided during startup. No need to verify it against
the monitor store, especially since the st...
Sage Weil
09:00 PM Revision 0d98fc6f (ceph): osd: keep recovery ops in sync with pull
Call start_recovery_op from pull() instead of fixing every caller (some
were wrong). This keeps the recovery state i...
Sage Weil
03:31 PM Bug #132 (Resolved): slow mon recovery after operating degraded for too long
fixed by commit:bf1cb87d255b88d8e06b2988b6700e400ceb1b92 and commit:357aa0334436da79065dc67b270ff78f8899493f Sage Weil
03:30 PM Cleanup #121 (Resolved): roll mkmonfs functionality into cmon
commit:752a0fd5630aba92dedc3bb30fccec0ec837fa59 Sage Weil
02:25 PM Bug #133 (Resolved): mds crash on snapshot
The crash I saw here (related to an anchor table lookup) is fixed by commit:51c5823472ef8208c1b7a6b094f1655ccdc1190e
Sage Weil
02:23 PM Bug #145: Check build dependencies for FastCGI
Hmm, there is a rule in configure.ac checking for FCGX_Init. Is that rule broken, or is checking FCGX_Init insuffici... Sage Weil
02:14 PM Bug #145 (Resolved): Check build dependencies for FastCGI
I stumble on the following messages for my compilation try.... Markus Elfring

05/23/2010

10:13 PM Revision 56c4043a (ceph): reword blacklisted output so it's clearly discussing MDSes and not OSDs
Greg Farnum

05/22/2010

04:56 PM Revision f7708dea (ceph): uclient: don't unlink null dentry when getting null linkage in mds reply
This broke semi-recently when the mds started returning null linkages (and
associated leases).
Sage Weil
10:02 AM Bug #140 (Resolved): Cfuse crashes when mv-ing a file
fixed by commit:f7708dea1f2db5d3be31ddc2aaf1500e1d50746d
Sage Weil
09:47 AM Bug #140 (In Progress): Cfuse crashes when mv-ing a file
Sage Weil
 

Also available in: Atom