Activity
From 10/20/2014 to 11/18/2014
11/18/2014
- 11:51 PM Bug #10131: kclient: dentry still in use on umount
- it's a VFS bug. fixed by...
- 11:04 PM Bug #10131 (In Progress): kclient: dentry still in use on umount
- 09:20 AM Bug #10131 (Resolved): kclient: dentry still in use on umount
- ...
- 03:40 PM Fix #10135 (Resolved): OSDMonitor: allow adding cache pools to cephfs pools already in use
- Right now we disallow this with _check_remove_tier(), I believe because we were worried about coordinating the switch...
- 02:37 PM Feature #1398: qa: multiclient file io test
- Answering my own question: Item 2 above. It looks like this can all be done from python.
11/13/2014
- 06:58 AM Bug #10092 (Resolved): multiple_rsync.sh + ceph-fuse timing out on firefly
- greg is right, these time out semi-regularly. increased the timeout on master, giant, firefly.
11/12/2014
- 08:59 PM Bug #10092 (Resolved): multiple_rsync.sh + ceph-fuse timing out on firefly
- teuthology-2014-11-11_23:04:01-fs-firefly-distro-basic-multi/598145
teuthology-2014-11-11_23:04:01-fs-firefly-distro...
11/11/2014
- 02:59 PM Bug #8090: multimds: mds crash in check_rstats
- ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-11-10_23:18:02-multimds-giant-testing-basic-multi/595393
11/10/2014
- 11:46 PM Bug #10041: ceph-fuse: never exit when no MDS server is available
- Just wanted to add that lack of timeout causes havoc all over the place... Autofs, backup scrips mounting CephFS on d...
- 04:05 PM Bug #10041: ceph-fuse: never exit when no MDS server is available
- Although it terminates on "Ctrl+C" a timeout would be _very_ useful because it would prevent system from hanging on b...
- 11:11 AM Bug #10041: ceph-fuse: never exit when no MDS server is available
- Was it blocking in the foreground? Did SIGKILL (ie, control-C) work on it?
We can add a configurable timeout but I... - 01:07 AM Bug #10041 (Resolved): ceph-fuse: never exit when no MDS server is available
- I'm attempting to mount CephFS using Fuse client (i.e. _ceph-fuse_) which do not exit if all MDS servers are down (I ...
- 10:57 PM Bug #10061 (New): uclient: MDS: output cap data in messages
- MClientCaps messages don't dump the caps they're updating, and generally neither does anything else. We need to optio...
- 10:55 PM Feature #10060 (New): uclient: warn about stuck cap flushes
- It can be hard to diagnose issues that involve cap state. To help with that, the client should keep track of its cap ...
- 10:40 PM Bug #9977 (Resolved): cephfs-journal-tool falsely reports invalid start_ptr
- In next branch as commit:65c33503c83ff8d88781c5c3ae81d88d84c8b3e4 and in giant as commit:fc5354dec55248724f8f6b795e3a...
- 09:36 PM Bug #9341: MDS: very slow rejoin
- Thanks.
- 09:27 PM Bug #9341 (Resolved): MDS: very slow rejoin
- This is backported to giant as of commit:97e423f52155e2902bf265bac0b1b9ed137f8aa0. The test for it also got backporte...
- 09:26 PM Bug #9800 (Resolved): client-limits test is not passing
- Backported in commit:387efc5fe1fb148ec135a6d8585a3b8f8d97dbf8
- 05:20 PM Bug #10025 (Resolved): Journal undump causes MDS to crash when start pos is not on object boundary
- Merged into next in commit:69be8e9b30c18e47c17ff7dafc4ac8fbe00d48e7, and the appropriate backport bits were merged la...
- 11:24 AM Bug #9997: test_client_pin case is failing
- http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-09_23:04:01-fs-next-testing-basic-multi/593068/
- 11:23 AM Bug #6613: samba is crashing in teuthology
- Still happening: http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-09_23:14:01-samba-next-testing-basic-multi/59...
11/09/2014
- 10:41 PM Bug #9995 (Resolved): failing test_filelock
- 09:19 AM Bug #9341: MDS: very slow rejoin
- Greg Farnum wrote:
> Hmm, we didn't put this in Giant initially because we were trying not to perturb it. Master has...
11/08/2014
- 08:07 AM Bug #9977 (Fix Under Review): cephfs-journal-tool falsely reports invalid start_ptr
- Backport to giant PR at:
https://github.com/ceph/ceph/pull/2887
11/07/2014
- 04:27 PM Bug #10011: Journaler: failed on shutdown or EBLACKLISTED
- Should be resolved by commit:6977d02f0d31c453cdf554a8f1796f290c1a3b89. We may want to backport once it's been through...
- 04:16 PM Feature #4138 (Resolved): MDS: forward scrub: add functionality to verify disk data is consistent
- This one ticket at least is definitely fulfilled by commit:daa9f9ffe82a811b5e0e69ef52241c4e0b7556bc
11/06/2014
- 11:43 PM Bug #9995: failing test_filelock
- 12:16 AM Bug #9995: failing test_filelock
- https://github.com/ceph/ceph-qa-suite/pull/228
- 09:46 PM Bug #9977 (Pending Backport): cephfs-journal-tool falsely reports invalid start_ptr
- Merged to next in commit:574c1d4bad37514ba941e3ae83e33a7d926697d9
Yes, let's please backport. - 05:49 PM Bug #9674: nightly failed multiple_rsync.sh
- I messed up (didn't set sudo everywhere), newer commits will hopefully make it all good. giant:f66bf31b6743246fb1c882...
- 11:16 AM Bug #10025 (Resolved): Journal undump causes MDS to crash when start pos is not on object boundary
Related ML thread from Jasper Siero, who first encountered the issue on firefly (http://lists.ceph.com/pipermail/ce...
11/05/2014
- 08:58 AM Bug #9995: failing test_filelock
- We'll need to update the test then so that it detects this situation and aborts quietly instead of raising an error.
- 05:42 AM Bug #10011: Journaler: failed on shutdown or EBLACKLISTED
- Ah... I've just realised why the "respawn on blacklist" thing I put in a while back isn't kicking in here: because Jo...
- 04:32 AM Bug #10011: Journaler: failed on shutdown or EBLACKLISTED
mon.a says:...
11/04/2014
- 10:59 PM Bug #9995: failing test_filelock
- ...
- 08:54 PM Bug #9995: failing test_filelock
- Is there something we can do as a workaround to prevent this blocking things? I expect people are going to use new ce...
- 07:36 PM Bug #9995 (Won't Fix): failing test_filelock
- it's a bug in old version of libfuse, it calls our setlk callback for both fcntl setlk and flock requests
- 05:46 PM Bug #9994: ceph-qa-suite: nfs mount timeouts
- teuthology-2014-11-03_23:10:01-knfs-giant-testing-basic-multi/585658/
- 05:40 PM Bug #10011 (Resolved): Journaler: failed on shutdown or EBLACKLISTED
- http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-03_23:08:01-kcephfs-giant-testing-basic-multi/585648/
teuth... - 06:53 AM Bug #9869 (Resolved): Client: not handling cap_flush_ack messages properly
11/03/2014
- 07:55 PM Feature #1398: qa: multiclient file io test
- A first pass of this is in origin/wip-multiclientio-wusui
- 12:10 PM Bug #9997 (Resolved): test_client_pin case is failing
- http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-02_23:04:01-fs-next-testing-basic-multi/583588/
RuntimeErro... - 12:05 PM Bug #9995 (Resolved): failing test_filelock
- http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-02_23:04:01-fs-next-testing-basic-multi/583589/
It's gettin... - 11:43 AM Bug #9994: ceph-qa-suite: nfs mount timeouts
- http://qa-proxy.ceph.com/teuthology/teuthology-2014-10-31_23:10:01-knfs-giant-testing-basic-multi/582459/
http://q... - 11:34 AM Bug #9994 (Resolved): ceph-qa-suite: nfs mount timeouts
- ...
- 11:27 AM Bug #9977: cephfs-journal-tool falsely reports invalid start_ptr
- https://github.com/ceph/ceph/pull/2853
- 11:27 AM Bug #9977 (Fix Under Review): cephfs-journal-tool falsely reports invalid start_ptr
- PR up for next, probably also worth backporting to giant as without it journal-tool is pretty useless on filesystems ...
10/31/2014
- 05:10 PM Tasks #3680 (Rejected): deduplication in ceph
- we should discuss this on the email list
- 10:48 AM Bug #9977 (Resolved): cephfs-journal-tool falsely reports invalid start_ptr
This is happening when the journal expire_pos isn't at an object boundary. The expected start_ptr counter is being...- 10:03 AM Feature #1398: qa: multiclient file io test
- ...
10/30/2014
- 10:11 AM Feature #1398: qa: multiclient file io test
- A task that implements this could be useful for testing calamari as well (I manually did some of the things needed he...
- 10:08 AM Feature #1398 (In Progress): qa: multiclient file io test
- 09:37 AM Feature #9881 (In Progress): mds: admin command to flush the mds journal
10/29/2014
- 09:34 PM Feature #9940: uclient: be more robust when dealing with outstanding RADOS IO and stale caps
- While in the general case it is necessary to fence clients that have become unresponsive to the MDS, this type of "so...
- 09:23 PM Feature #9940 (New): uclient: be more robust when dealing with outstanding RADOS IO and stale caps
- If we've given IO to the Objecter and our caps go stale, we need to do something to handle it.
- 09:06 PM Bug #1666 (Resolved): hadoop: time-related meta-data problems
- We now take client timestamps for almost everything, so this should no longer be a problem and I'm closing it unless ...
- 11:04 AM Bug #9935: client: segfault on ceph_rmdir path "/"
- Yes, EBUSY is what a local filesystem gives you, so that sounds right to me.
- 10:48 AM Bug #9935 (Resolved): client: segfault on ceph_rmdir path "/"
- A segfault occurs when removing the root directory. What is the expected behavior? I think -EBUSY is what makes sense.
10/28/2014
- 12:43 PM Bug #9900 (Duplicate): Failure in multiple_rsync (directories wrongly appear changed)
- I imagine this is a dup of #9894?
- 12:18 PM Bug #9800 (Pending Backport): client-limits test is not passing
- I don't know that we need/want to try and push this in before release (although since it's all guarded inside of a br...
- 05:29 AM Bug #9800 (Resolved): client-limits test is not passing
- ...
- 11:12 AM Bug #8255 (Fix Under Review): mds: directory with missing object cannot be removed
- https://github.com/ceph/ceph/pull/2821
10/27/2014
- 06:17 PM Feature #4138 (Fix Under Review): MDS: forward scrub: add functionality to verify disk data is co...
- This bit at least has been isolated and put into a PR:
https://github.com/ceph/ceph/pull/2814 - 04:23 PM Bug #9870 (Resolved): kernel: not handling cap_flush_ack messages properly
- 10:28 AM Bug #9904 (Resolved): Don't crash MDS on clients sending messages with bad seq
- Currently in Server::handle_client_session, we do this:...
- 10:14 AM Feature #9903 (Resolved): Recover lost dirfrag via data pool
[While the MDS cluster is offline and journal has been flushed if necessary]
Given that a particular dirfrag obj...- 09:36 AM Bug #9900 (Duplicate): Failure in multiple_rsync (directories wrongly appear changed)
http://pulpito.ceph.com/teuthology-2014-10-24_23:08:01-kcephfs-giant-testing-basic-multi/570840/
http://pulpito.ce...- 06:05 AM Bug #9800: client-limits test is not passing
- https://github.com/ceph/ceph/pull/2809
http://pulpito.front.sepia.ceph.com/john-2014-10-27_13:05:29-fs:recovery-wip-...
10/24/2014
- 11:14 AM Bug #9884: too many files in /usr for multiple_rsync.sh
- Yeah, just cutting it down to a more predictable/smaller directory sounds good to me.
- 10:50 AM Bug #9884: too many files in /usr for multiple_rsync.sh
- one failure http://pulpito.ceph.com/teuthology-2014-10-20_23:04:01-fs-giant-distro-basic-multi/562537/
- 10:49 AM Bug #9884 (Closed): too many files in /usr for multiple_rsync.sh
- for example, plana81 has 60k files in /usr, but plana90 has 90k files in /usr. perhaps multiple_rsync should /usr/src...
- 09:53 AM Feature #3882 (Rejected): Hide snapshot directory name in mount/mtab
- we can now restrict snap access by uid...
- 09:49 AM Feature #9883 (Resolved): journal-tool: smarter scavenge (conditionally update dir objects)
- 09:42 AM Feature #9881 (Resolved): mds: admin command to flush the mds journal
- 09:41 AM Feature #9880 (Resolved): mds: more gracefully handle EIO on missing dir object
10/23/2014
- 01:47 PM Bug #9869 (Pending Backport): Client: not handling cap_flush_ack messages properly
- I tested this manually with a patch that sets the starting tid value to 65535 and looking at the logs. That causes im...
- 12:47 PM Bug #9870: kernel: not handling cap_flush_ack messages properly
10/22/2014
- 05:34 PM Bug #9870 (Resolved): kernel: not handling cap_flush_ack messages properly
- This is the analogue to #9869, which Zheng tells me is also a problem in the kernel. We need to downcast the message ...
- 05:30 PM Bug #9869: Client: not handling cap_flush_ack messages properly
- Waiting for this to build so it can be tested.
- 05:28 PM Bug #9869 (Resolved): Client: not handling cap_flush_ack messages properly
- We saw a log segment that contained this:...
10/21/2014
- 03:22 PM Feature #9557 (Fix Under Review): mds: verify backtrace on fetch_dir
- 10:44 AM Feature #9557 (In Progress): mds: verify backtrace on fetch_dir
- 11:43 AM Bug #8809 (Can't reproduce): uclient: memory leak
- maybe fixed by 2313ce1d024361fd7f4d2cbca789010f0fe0faad
- 10:55 AM Bug #9674: nightly failed multiple_rsync.sh
- commit:477073aba1da880dfd0b8c82f4792788579f28b9 in master and commit:44ce33c12443909b02c7ee451ad45400f55d53c9 in giant
10/20/2014
- 01:23 PM Feature #414 (Resolved): ceph-fuse: implement file locking
- 01:22 PM Bug #8576: teuthology: nfs tests failing on umount
- teuthology commit:4f2957c42d0f76a399cb26c660ede9243c095779 runs those commands as well as the previous ones.
- 01:02 PM Bug #9679 (Closed): Ceph hadoop terasort job failure
- Fixed in cephfs-hadoop repo.
- 11:15 AM Bug #9800: client-limits test is not passing
Same failure:
http://pulpito.front.sepia.ceph.com/teuthology-2014-10-17_23:04:02-fs-giant-distro-basic-multi/555...
Also available in: Atom