Activity

From 04/11/2010 to 05/10/2010

05/10/2010

10:10 PM Cleanup #113 (In Progress): audit mds_client locking, esp reply handler
Sage Weil
03:25 PM Cleanup #113: audit mds_client locking, esp reply handler
see also #66 Sage Weil
10:09 PM Bug #64: crash in handle_mds_map (corrupt s_waiting list?)
fixed by commit:1c0806d2caacc683c56a587eaf1502769a7c0698 Sage Weil
04:35 PM Bug #64 (Resolved): crash in handle_mds_map (corrupt s_waiting list?)
fixed by 'ceph: fix locking, error paths when waking reconnect requests' Sage Weil
10:09 PM Bug #66: BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
fixed by commit:9abf82b8bc93dd904738a71ca69aa5df356d4d24 Sage Weil
04:34 PM Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
fixed by 'ceph: fix locking, error paths when waking reconnect requests' Sage Weil
03:24 PM Bug #66: BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
unable to reproduce... but, see #113 Sage Weil
04:44 PM Feature #119 (New): avoid looping connect/retry errors on console
we should try to avoid filling up logs with stuff like this:... Sage Weil
04:35 PM Bug #78 (Resolved): bdi_init list bug
Sage Weil
04:13 PM Feature #18 (Resolved): reconnect fixups
Sage Weil
03:25 PM Bug #50 (Resolved): osd timeout reset leaves some ops hanging
Sage Weil
10:31 AM Bug #50: osd timeout reset leaves some ops hanging
Sage Weil
10:31 AM Bug #50: osd timeout reset leaves some ops hanging
finally found this, fixed by commit:77eb74b92fee7340d104b24a9ee2800196b0f140 Sage Weil

05/07/2010

03:01 PM Bug #116 (Resolved): can we drop user. xattr prefix for magic ceph xattrs?
Sage Weil
01:43 PM Bug #115 (Resolved): rbdtool --list on empty pool should return correct message
Fixed, pushed to unstable. Yehuda Sadeh
12:39 PM Bug #115: rbdtool --list on empty pool should return correct message
Here's the output:... Wido den Hollander
12:03 PM Bug #115 (Resolved): rbdtool --list on empty pool should return correct message
It currently returns some weird error.
Reported by wido.
Yehuda Sadeh
09:51 AM Cleanup #113 (Resolved): audit mds_client locking, esp reply handler
what does mdsc->mutex protect? s_mutex? which protects requests? Sage Weil
09:39 AM Bug #65 (Resolved): crash in tcp_sendpage
may have also been related to #109.
closing this one, since we haven't seen it in a while.
Sage Weil
09:38 AM Bug #109: kernel bugs out with bad osd caps
osd errors weren't unregistering the request. fixed by commit:a40355b39e006459b1ffba052c53084d20d64209 Sage Weil
09:37 AM Bug #109 (Resolved): kernel bugs out with bad osd caps
Sage Weil
08:19 AM Bug #111 (Resolved): handle EAGAIN from osd
currently we just return this to the caller, when we should retry.
not that the osd returns this very often (ever?)
Sage Weil
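
The retry behavior asked for in #111 can be illustrated with a minimal userspace sketch in Python. This is not the kernel client's code: the names `submit_with_retry`, `send_op`, and `max_attempts` are hypothetical, and it assumes an operation that returns 0 on success or a negative errno value. The bounded retry count is also an assumption; the ticket does not say whether retries should be limited.

```python
import errno


def submit_with_retry(send_op, max_attempts=5):
    """Call send_op() until it returns something other than -EAGAIN.

    send_op is a hypothetical callable that returns 0 on success or a
    negative errno value, mimicking kernel-style return codes.
    """
    result = -errno.EAGAIN
    for _ in range(max_attempts):
        result = send_op()
        if result != -errno.EAGAIN:
            return result  # success, or an error that should not be retried
    return result  # still -EAGAIN after max_attempts tries


# Example: the first two attempts report EAGAIN, the third succeeds.
replies = iter([-errno.EAGAIN, -errno.EAGAIN, 0])
status = submit_with_retry(lambda: next(replies))
```

Only EAGAIN triggers another attempt here; any other error is handed back to the caller unchanged on the first occurrence.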

05/06/2010

02:37 PM Bug #109: kernel bugs out with bad osd caps
... Sage Weil
02:34 PM Bug #109 (Resolved): kernel bugs out with bad osd caps
e.g.,... Sage Weil
12:30 PM Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
again:... Sage Weil

05/05/2010

09:39 PM Bug #78: bdi_init list bug
i suspect this was fixed by commit:5dfc589a8467470226feccdc50f1b32713318e7b Sage Weil
09:38 PM Cleanup #79 (Closed): use bdi setup and register helper
no. i did rename the bdi ceph-%d, though. Sage Weil
09:37 PM Bug #104 (Resolved): bdi problem on EPERM from osd
Sage Weil
09:36 PM Bug #104: bdi problem on EPERM from osd
problem was use of invalid wbc in completion. fixed by commit:54ad023ba8108d0163acc931ed4b5e4a8a3a7327 Sage Weil
04:53 PM Bug #104: bdi problem on EPERM from osd
the problem is when writepages_finish gets a -1 result code. Sage Weil
03:03 PM Bug #104 (In Progress): bdi problem on EPERM from osd
Sage Weil
02:57 PM Bug #104: bdi problem on EPERM from osd
ladder0:
mount -a
echo asdf > /c/foo
sync
<crash>
Sage Weil
03:56 PM Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
hmm, s_cap_lock usage sites look okay... don't think it's a leaked spinlock Sage Weil
03:03 PM Bug #107 (Resolved): lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
on master. rsync workload.
[168476.538425] BUG: soft lockup - CPU#0 stuck for 61s! [kswapd0:318]
[168476.538430] ...
Sage Weil
11:45 AM Bug #106 (Resolved): msgpool depletion?
which pool is it?
[104608.030333] ceph: msgpool_get ffff88010f1fa370 now 0/0, may fail
[104608.036614] ----------...
Sage Weil

05/04/2010

03:46 PM Bug #104 (Resolved): bdi problem on EPERM from osd
last sysfs file: /sys/class/net/lo/operstate
CPU 1
Modules linked in: ceph [last unloaded: ceph]
Pid: 5724, comm:...
Sage Weil
11:32 AM Bug #28: gracefully fail on fill_trace errors
this includes ENOMEM on xattr blob Sage Weil

05/03/2010

10:10 PM Bug #54 (Resolved): do dentry offset assignment when dentry becomes non-null
Sage Weil
10:09 PM Bug #54: do dentry offset assignment when dentry becomes non-null
added to unstable. Sage Weil
04:13 PM Cleanup #79 (Closed): use bdi setup and register helper
See commit:e6d086d83cf7f102d48c006f58172a69ec0c15a4
This will make our /sys/kernel/debug/bdi directory pretty (cep...
Sage Weil
04:10 PM Bug #78 (Resolved): bdi_init list bug
There were 2 clients mounted here, so it's unclear which was which. One was behaving fine.
The other was forcefully unm...
Sage Weil
12:10 PM Bug #47 (Closed): gfp at ceph_update_snap_trace+0x16a/0x419
Sage Weil
12:10 PM Bug #38 (Closed): rm -r failure
Sage Weil
12:10 PM Bug #22 (Closed): BUG at fs/ceph/caps.c:253
Sage Weil
12:10 PM Bug #4 (Closed): lockdep warning in socket code
Sage Weil
12:10 PM Bug #3 (Closed): leaked dentry ref on umount
Sage Weil
12:10 PM Bug #2 (Closed): BUG at fs/ceph/caps.c:2178
Sage Weil
12:09 PM Bug #1 (Closed): gpf in tcp_sendpage
Sage Weil
11:13 AM Bug #65: crash in tcp_sendpage
this is probably a problem with the backport.. it went away when we switched to 2.6.34-rc3 on issdm Sage Weil

04/26/2010

08:46 PM Bug #69 (Can't reproduce): ceph: ffff88001976ba50 auth cap (null) not mds0 ???
no apparent malfunction. on master branch. dmesg is
ceph2 login: [249643.959209] ceph: get_reply unknown tid 260...
Sage Weil

04/23/2010

08:22 PM Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
[ 6447.063496] ------------[ cut here ]------------
[ 6447.065210] kernel BUG at fs/ceph/mds_client.c:1841!
[ 6447....
Sage Weil
04:13 PM Bug #65 (Resolved): crash in tcp_sendpage
on issdm. master branch (standalone).
[ 2900.360800] ceph: osd16 weight 0x10000 (in)
[ 2900.360802] ceph: osd17 ...
Sage Weil
12:28 PM Bug #64 (Resolved): crash in handle_mds_map (corrupt s_waiting list?)
unstable branch.
ceph: mds0 caps stale
ceph: mds0 caps stale
ceph: mds0 hung
ceph: mds0 came back
ceph: mds0 c...
Sage Weil
10:55 AM Bug #63 (Resolved): dentry_info slab not empty
[97009.315064] slab error in kmem_cache_destroy(): cache `ceph_dentry_info': Can't free all objects
[97009.324159] P...
Sage Weil

04/22/2010

09:46 AM Feature #42: Resize of rbd image
rbdtool part is done. i think the driver just needs to have a 'refresh' function to reload the image metadata. Sage Weil

04/20/2010

01:35 PM Feature #23: fcntl/flock advisory lock support
All right, designed a basic interface between the client and the MDS. Going to implement the MDS and messaging parts ... Greg Farnum
10:07 AM Bug #54 (Resolved): do dentry offset assignment when dentry becomes non-null
see rename_vs_i_complete branch on fatty for partial solution Sage Weil

04/19/2010

03:51 PM Bug #50 (Resolved): osd timeout reset leaves some ops hanging
i see an osd reset:
[ 7769.416465] ceph: tid 65961 timed out on osd4, will reset osd
followed by stray replies...
Sage Weil
10:26 AM Feature #23 (In Progress): fcntl/flock advisory lock support
Greg Farnum
10:20 AM Bug #36 (Rejected): uml crash in sendpage
can't reproduce. bad compilation? Sage Weil
10:19 AM Bug #47 (Resolved): gfp at ceph_update_snap_trace+0x16a/0x419
bug in realm split. fixed by commit:be23f8ad6fccba9a5535a7d3013eb492c20e39cb Sage Weil
08:59 AM Bug #47 (In Progress): gfp at ceph_update_snap_trace+0x16a/0x419
Sage Weil

04/16/2010

04:07 PM Bug #38 (Resolved): rm -r failure
the problem is (mostly) that d_move reorders d_subdirs. so just clear I_COMPLETE on rename.
but also:
- we shoul...
Sage Weil
12:30 PM Bug #47 (Closed): gfp at ceph_update_snap_trace+0x16a/0x419
after running snaptest1.sh twice:
[90471.092551] general protection fault: 0000 [#1] PREEMPT SMP
[90471.095369] ...
Sage Weil

04/15/2010

10:30 AM Feature #42 (Resolved): Resize of rbd image
We want to be able to resize the rbd image. Snapshots should be maintained and should have a size associated with them. Yehuda Sadeh
10:30 AM Bug #38 (In Progress): rm -r failure
reliably reproducible with kernel_untar_build.sh workunit. looks like problem with dcache_readdir. from the rm -r's... Sage Weil

04/14/2010

03:34 PM Bug #38 (Resolved): rm -r failure
seeing this one again on current clients (commit:a6a5349) Sage Weil
03:31 PM Bug #36 (Rejected): uml crash in sendpage
running kernel untar qa workunit on fatty.
with valgrind on osd
Sage Weil

04/13/2010

02:09 PM Bug #4 (Resolved): lockdep warning in socket code
Sage Weil
02:09 PM Bug #4: lockdep warning in socket code
use separate class for ceph socket.
commit:79cb735c
Sage Weil
11:50 AM Bug #1 (Resolved): gpf in tcp_sendpage
Sage Weil
11:50 AM Bug #2 (Resolved): BUG at fs/ceph/caps.c:2178
Sage Weil
11:44 AM Bug #2 (Closed): BUG at fs/ceph/caps.c:2178
Sage Weil
11:44 AM Bug #2: BUG at fs/ceph/caps.c:2178
Fixed by commit:e130642ba Sage Weil
11:50 AM Bug #3 (Resolved): leaked dentry ref on umount
Sage Weil
11:05 AM Bug #3 (Closed): leaked dentry ref on umount
Sage Weil
11:49 AM Bug #22 (Resolved): BUG at fs/ceph/caps.c:253
Sage Weil
11:04 AM Bug #22 (Closed): BUG at fs/ceph/caps.c:253
fixed by commit:d29b86892. Sage Weil
11:03 AM Feature #27: ACLs
Add ACL support in kclient. I suspect this just means wiring things up to the generic acl helper code (the actual ac... Sage Weil
09:48 AM Feature #27 (Resolved): ACLs
Sage Weil
09:49 AM Bug #28 (Won't Fix): gracefully fail on fill_trace errors
If fill_trace runs into problems when processing a reply, we should be sure to fail gracefully. Namely, if we can't ... Sage Weil
09:47 AM Feature #26 (Rejected): statlite
see -fsdevel thread from spring 09 for discussion (during LSF'09). Sage Weil
09:46 AM Feature #25 (New): mdsc: mempool for cap writeback?
Sage Weil
09:46 AM Feature #24 (New): mdsc: preallocate reply msgs
We should preallocate space for replies to our MDS messages. Sage Weil
09:45 AM Feature #23 (Resolved): fcntl/flock advisory lock support
Sage Weil

04/12/2010

02:25 PM Bug #3 (Resolved): leaked dentry ref on umount
fixed ref leak in dcache readdir, commit:2844a76a25 Sage Weil
01:46 PM Bug #1 (Closed): gpf in tcp_sendpage
Sage Weil
01:45 PM Bug #1 (Resolved): gpf in tcp_sendpage
Sage Weil
10:11 AM Bug #1: gpf in tcp_sendpage
Most likely kernel/module mismatch. Objdump didn't match oops message, I think this should be resolved-invalid unless... Yehuda Sadeh
01:07 PM Bug #22 (Closed): BUG at fs/ceph/caps.c:253
#12 0x0000000070c80b96 in ceph_add_cap (inode=0x6a5589b8, session=0x6fba8000, cap_id=236951, fmode=-1, issued=3413, w... Yehuda Sadeh
 
