Activity
From 04/11/2010 to 05/10/2010
05/10/2010
- 10:10 PM Cleanup #113 (In Progress): audit mds_client locking, esp reply handler
- 03:25 PM Cleanup #113: audit mds_client locking, esp reply handler
- see also #66
- 10:09 PM Bug #64: crash in handle_mds_map (corrupt s_waiting list?)
- fixed by commit:1c0806d2caacc683c56a587eaf1502769a7c0698
- 04:35 PM Bug #64 (Resolved): crash in handle_mds_map (corrupt s_waiting list?)
- fixed by 'ceph: fix locking, error paths when waking reconnect requests'
- 10:09 PM Bug #66: BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
- fixed by commit:9abf82b8bc93dd904738a71ca69aa5df356d4d24
- 04:34 PM Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
- fixed by 'ceph: fix locking, error paths when waking reconnect requests'
- 03:24 PM Bug #66: BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
- unable to reproduce... but, see #113
- 04:44 PM Feature #119 (New): avoid looping connect/retry errors on console
- we should try to avoid filling up logs with stuff like this:...
- 04:35 PM Bug #78 (Resolved): bdi_init list bug
- 04:13 PM Feature #18 (Resolved): reconnect fixups
- 03:25 PM Bug #50 (Resolved): osd timeout reset leaves some ops hanging
- 10:31 AM Bug #50: osd timeout reset leaves some ops hanging
- 10:31 AM Bug #50: osd timeout reset leaves some ops hanging
- finally found this, fixed by commit:77eb74b92fee7340d104b24a9ee2800196b0f140
05/07/2010
- 03:01 PM Bug #116 (Resolved): can we drop user. xattr prefix for magic ceph xattrs?
- 01:43 PM Bug #115 (Resolved): rbdtool --list on empty pool should return correct message
- Fixed, pushed to unstable.
- 12:39 PM Bug #115: rbdtool --list on empty pool should return correct message
- Here's the output:...
- 12:03 PM Bug #115 (Resolved): rbdtool --list on empty pool should return correct message
- It currently returns some weird error.
Reported by wido. - 09:51 AM Cleanup #113 (Resolved): audit mds_client locking, esp reply handler
- what does mdsc->mutex protect? s_mutex? which protects requests?
- 09:39 AM Bug #65 (Resolved): crash in tcp_sendpage
- may have also been related to #109.
closing this one, since we haven't seen it in a while.
- 09:38 AM Bug #109: kernel bugs out with bad osd caps
- osd errors weren't unregistering the request. fixed by commit:a40355b39e006459b1ffba052c53084d20d64209
- 09:37 AM Bug #109 (Resolved): kernel bugs out with bad osd caps
- 08:19 AM Bug #111 (Resolved): handle EAGAIN from osd
- currently we just return this to the caller, when we should retry.
not that the osd returns this very often (ever?)
05/06/2010
- 02:37 PM Bug #109: kernel bugs out with bad osd caps
- ...
- 02:34 PM Bug #109 (Resolved): kernel bugs out with bad osd caps
- e.g.,...
- 12:30 PM Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
- again:...
05/05/2010
- 09:39 PM Bug #78: bdi_init list bug
- i suspect this was fixed by commit:5dfc589a8467470226feccdc50f1b32713318e7b
- 09:38 PM Cleanup #79 (Closed): use bdi setup and register helper
- no. i did rename the bdi ceph-%d, though.
- 09:37 PM Bug #104 (Resolved): bdi problem on EPERM from osd
- 09:36 PM Bug #104: bdi problem on EPERM from osd
- problem was use of invalid wbc in completion. fixed by commit:54ad023ba8108d0163acc931ed4b5e4a8a3a7327
- 04:53 PM Bug #104: bdi problem on EPERM from osd
- the problem is when writepages_finish gets a -1 result code.
- 03:03 PM Bug #104 (In Progress): bdi problem on EPERM from osd
- 02:57 PM Bug #104: bdi problem on EPERM from osd
- ladder0:
mount -a
echo asdf > /c/foo
sync
<crash> - 03:56 PM Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
- hmm, s_cap_lock usage sites look okay... don't think it's a leaked spinlock
- 03:03 PM Bug #107 (Resolved): lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
- on master. rsync workload.
[168476.538425] BUG: soft lockup - CPU#0 stuck for 61s! [kswapd0:318]
[168476.538430] ... - 11:45 AM Bug #106 (Resolved): msgpool depletion?
- which pool is it?
[104608.030333] ceph: msgpool_get ffff88010f1fa370 now 0/0, may fail
[104608.036614] ----------...
05/04/2010
- 03:46 PM Bug #104 (Resolved): bdi problem on EPERM from osd
- last sysfs file: /sys/class/net/lo/operstate
CPU 1
Modules linked in: ceph [last unloaded: ceph]
Pid: 5724, comm:... - 11:32 AM Bug #28: gracefully fail on fill_trace errors
- this includes ENOMEM on xattr blob
05/03/2010
- 10:10 PM Bug #54 (Resolved): do dentry offset assignment when dentry becomes non-null
- 10:09 PM Bug #54: do dentry offset assignment when dentry becomes non-null
- added to unstable.
- 04:13 PM Cleanup #79 (Closed): use bdi setup and register helper
- See commit:e6d086d83cf7f102d48c006f58172a69ec0c15a4
This will make our /sys/kernel/debug/bdi directory pretty (cep... - 04:10 PM Bug #78 (Resolved): bdi_init list bug
- There were 2 clients mounted, here, so unclear what what was. One was behaving fine.
The other was forcefully unm... - 12:10 PM Bug #47 (Closed): gfp at ceph_update_snap_trace+0x16a/0x419
- 12:10 PM Bug #38 (Closed): rm -r failure
- 12:10 PM Bug #22 (Closed): BUG at fs/ceph/caps.c:253
- 12:10 PM Bug #4 (Closed): lockdep warning in socket code
- 12:10 PM Bug #3 (Closed): leaked dentry ref on umount
- 12:10 PM Bug #2 (Closed): BUG at fs/ceph/caps.c:2178
- 12:09 PM Bug #1 (Closed): gpf in tcp_sendpage
- 11:13 AM Bug #65: crash in tcp_sendpage
- this is probably a problem with the backport.. it went away when we switch to 2.6.34-rc3 on issdm
04/26/2010
- 08:46 PM Bug #69 (Can't reproduce): ceph: ffff88001976ba50 auth cap (null) not mds0 ???
- no apparent malfunction. on master branch. dmesg is
ceph2 login: [249643.959209] ceph: get_reply unknown tid 260...
04/23/2010
- 08:22 PM Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
- [ 6447.063496] ------------[ cut here ]------------
[ 6447.065210] kernel BUG at fs/ceph/mds_client.c:1841!
[ 6447.... - 04:13 PM Bug #65 (Resolved): crash in tcp_sendpage
- on issdm. master branch (standalone).
[ 2900.360800] ceph: osd16 weight 0x10000 (in)
[ 2900.360802] ceph: osd17 ... - 12:28 PM Bug #64 (Resolved): crash in handle_mds_map (corrupt s_waiting list?)
- unstable branch.
ceph: mds0 caps stale
ceph: mds0 caps stale
ceph: mds0 hung
ceph: mds0 came back
ceph: mds0 c... - 10:55 AM Bug #63 (Resolved): dentry_info slab not empty
- [97009.315064] slab error in kmem_cache_destroy(): cache `ceph_dentry_info': Can't free all objects
[97009.324159] P...
04/22/2010
- 09:46 AM Feature #42: Resize of rbd image
- rbdtool part is done. i think the driver just needs to have a 'refresh' function to reload the image metadata.
04/20/2010
- 01:35 PM Feature #23: fcntl/flock advisory lock support
- All right, designed a basic interface between the client and the MDS. Going to implement the MDS and messaging parts ...
- 10:07 AM Bug #54 (Resolved): do dentry offset assignment when dentry becomes non-null
- see rename_vs_i_complete branch on fatty for partial solution
04/19/2010
- 03:51 PM Bug #50 (Resolved): osd timeout reset leaves some ops hanging
- i see an osd reset:
[ 7769.416465] ceph: tid 65961 timed out on osd4, will reset osd
followed by stray replies... - 10:26 AM Feature #23 (In Progress): fcntl/flock advisory lock support
- 10:20 AM Bug #36 (Rejected): uml crash in sendpage
- can't reproduce. bad compilation?
- 10:19 AM Bug #47 (Resolved): gfp at ceph_update_snap_trace+0x16a/0x419
- bug in realm split. fixed by commit:be23f8ad6fccba9a5535a7d3013eb492c20e39cb
- 08:59 AM Bug #47 (In Progress): gfp at ceph_update_snap_trace+0x16a/0x419
04/16/2010
- 04:07 PM Bug #38 (Resolved): rm -r failure
- the problem is (mostly) that d_move reorders d_subdirs. so just clear I_COMPLETE on rename.
but also:
- we shoul... - 12:30 PM Bug #47 (Closed): gfp at ceph_update_snap_trace+0x16a/0x419
- after running snaptest1.sh twice:
[90471.092551] general protection fault: 0000 [#1] PREEMPT SMP
[90471.095369] ...
04/15/2010
- 10:30 AM Feature #42 (Resolved): Resize of rbd image
- We want to be able to resize the rbd image. Snapshots should be maintained and should have a size associated with them.
- 10:30 AM Bug #38 (In Progress): rm -r failure
- reliably reproducible with kernel_untar_build.sh workunit. looks like problem with dcache_readdir. from the rm -r's...
04/14/2010
- 03:34 PM Bug #38 (Resolved): rm -r failure
- seeing this one again on current clients (commit:a6a5349)
- 03:31 PM Bug #36 (Rejected): uml crash in sendpage
- running kernel untar qa workunit on fatty.
with valgrind on osd
04/13/2010
- 02:09 PM Bug #4 (Resolved): lockdep warning in socket code
- 02:09 PM Bug #4: lockdep warning in socket code
- use separate class for ceph socket.
commit:79cb735c - 11:50 AM Bug #1 (Resolved): gpf in tcp_sendpage
- 11:50 AM Bug #2 (Resolved): BUG at fs/ceph/caps.c:2178
- 11:44 AM Bug #2 (Closed): BUG at fs/ceph/caps.c:2178
- 11:44 AM Bug #2: BUG at fs/ceph/caps.c:2178
- Fixed by commit:e130642ba
- 11:50 AM Bug #3 (Resolved): leaked dentry ref on umount
- 11:05 AM Bug #3 (Closed): leaked dentry ref on umount
- 11:49 AM Bug #22 (Resolved): BUG at fs/ceph/caps.c:253
- 11:04 AM Bug #22 (Closed): BUG at fs/ceph/caps.c:253
- fixed by commit:d29b86892.
- 11:03 AM Feature #27: ACLs
- Add ACL support in kclient. I suspect this just means wiring things up to the generic acl helper code (the actual ac...
- 09:48 AM Feature #27 (Resolved): ACLs
- 09:49 AM Bug #28 (Won't Fix): gracefully fail on fill_trace errors
- If fill_trace runs into problems when processing a reply, we should be sure to fail gracefully. Namely, if we can't ...
- 09:47 AM Feature #26 (Rejected): statlite
- see -fsdevel thread from spring 09 for discussion (during LSF'09).
- 09:46 AM Feature #25 (New): mdsc: mempool for cap writeback?
- 09:46 AM Feature #24 (New): mdsc: preallocate reply msgs
- We should preallocate space for replies to our MDS messages.
- 09:45 AM Feature #23 (Resolved): fcntl/flock advisory lock support
04/12/2010
- 02:25 PM Bug #3 (Resolved): leaked dentry ref on umount
- fixed ref leak in dcache readdir, commit:2844a76a25
- 01:46 PM Bug #1 (Closed): gpf in tcp_sendpage
- 01:45 PM Bug #1 (Resolved): gpf in tcp_sendpage
- 10:11 AM Bug #1: gpf in tcp_sendpage
- Most likely kernel/module mismatch. Objdump didn't match oops message, I think this should be resolved-invalid unless...
- 01:07 PM Bug #22 (Closed): BUG at fs/ceph/caps.c:253
- #12 0x0000000070c80b96 in ceph_add_cap (inode=0x6a5589b8, session=0x6fba8000, cap_id=236951, fmode=-1, issued=3413, w...
Also available in: Atom