Activity

From 10/20/2014 to 11/18/2014

11/18/2014

11:51 PM Bug #10131: kclient: dentry still in use on umount
It's a VFS bug. Fixed by... Zheng Yan
11:04 PM Bug #10131 (In Progress): kclient: dentry still in use on umount
Zheng Yan
09:20 AM Bug #10131 (Resolved): kclient: dentry still in use on umount
... Greg Farnum
03:40 PM Fix #10135 (Resolved): OSDMonitor: allow adding cache pools to cephfs pools already in use
Right now we disallow this with _check_remove_tier(), I believe because we were worried about coordinating the switch... Greg Farnum
02:37 PM Feature #1398: qa: multiclient file io test
Answering my own question: Item 2 above. It looks like this can all be done from python. Anonymous

11/13/2014

06:58 AM Bug #10092 (Resolved): multiple_rsync.sh + ceph-fuse timing out on firefly
Greg is right, these time out semi-regularly. Increased the timeout on master, giant, firefly. Sage Weil

11/12/2014

08:59 PM Bug #10092 (Resolved): multiple_rsync.sh + ceph-fuse timing out on firefly
teuthology-2014-11-11_23:04:01-fs-firefly-distro-basic-multi/598145
teuthology-2014-11-11_23:04:01-fs-firefly-distro...
Sage Weil

11/11/2014

02:59 PM Bug #8090: multimds: mds crash in check_rstats
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-11-10_23:18:02-multimds-giant-testing-basic-multi/595393 Sage Weil

11/10/2014

11:46 PM Bug #10041: ceph-fuse: never exit when no MDS server is available
Just wanted to add that the lack of a timeout causes havoc all over the place... Autofs, backup scripts mounting CephFS on d... Dmitry Smirnov
04:05 PM Bug #10041: ceph-fuse: never exit when no MDS server is available
Although it terminates on "Ctrl+C", a timeout would be _very_ useful because it would prevent the system from hanging on b... Dmitry Smirnov
11:11 AM Bug #10041: ceph-fuse: never exit when no MDS server is available
Was it blocking in the foreground? Did SIGINT (i.e., Control-C) work on it?
We can add a configurable timeout but I...
Greg Farnum
01:07 AM Bug #10041 (Resolved): ceph-fuse: never exit when no MDS server is available
I'm attempting to mount CephFS using the FUSE client (i.e. _ceph-fuse_), which does not exit if all MDS servers are down (I ... Dmitry Smirnov
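Until a configurable timeout exists in the client itself, a caller such as a backup script could bound the wait from the outside. The following is a minimal sketch of that workaround, not a ceph-fuse feature: `run_with_timeout` is a hypothetical helper, and it assumes the mount command blocks in the foreground as described in the report.

```python
import subprocess
import sys


def run_with_timeout(cmd, seconds):
    """Run cmd and return its exit code, or None if it was killed after `seconds`.

    subprocess.run() kills the child and waits for it when the timeout expires,
    so no stray process is left behind by this call.
    """
    try:
        return subprocess.run(cmd, timeout=seconds).returncode
    except subprocess.TimeoutExpired:
        return None


if __name__ == "__main__":
    # Stand-in for a blocking mount; a real caller might pass something like
    # ["ceph-fuse", "/mnt/cephfs"] instead (mount point is an assumption).
    blocking = [sys.executable, "-c", "import time; time.sleep(30)"]
    if run_with_timeout(blocking, 2) is None:
        print("command did not finish in time; giving up")
```

A caller can treat `None` as "MDS unavailable" and skip the job instead of hanging indefinitely.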
10:57 PM Bug #10061 (New): uclient: MDS: output cap data in messages
MClientCaps messages don't dump the caps they're updating, and generally neither does anything else. We need to optio... Greg Farnum
10:55 PM Feature #10060 (New): uclient: warn about stuck cap flushes
It can be hard to diagnose issues that involve cap state. To help with that, the client should keep track of its cap ... Greg Farnum
10:40 PM Bug #9977 (Resolved): cephfs-journal-tool falsely reports invalid start_ptr
In next branch as commit:65c33503c83ff8d88781c5c3ae81d88d84c8b3e4 and in giant as commit:fc5354dec55248724f8f6b795e3a... Greg Farnum
09:36 PM Bug #9341: MDS: very slow rejoin
Thanks. Dmitry Smirnov
09:27 PM Bug #9341 (Resolved): MDS: very slow rejoin
This is backported to giant as of commit:97e423f52155e2902bf265bac0b1b9ed137f8aa0. The test for it also got backporte... Greg Farnum
09:26 PM Bug #9800 (Resolved): client-limits test is not passing
Backported in commit:387efc5fe1fb148ec135a6d8585a3b8f8d97dbf8 Greg Farnum
05:20 PM Bug #10025 (Resolved): Journal undump causes MDS to crash when start pos is not on object boundary
Merged into next in commit:69be8e9b30c18e47c17ff7dafc4ac8fbe00d48e7, and the appropriate backport bits were merged la... Greg Farnum
11:24 AM Bug #9997: test_client_pin case is failing
http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-09_23:04:01-fs-next-testing-basic-multi/593068/ Greg Farnum
11:23 AM Bug #6613: samba is crashing in teuthology
Still happening: http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-09_23:14:01-samba-next-testing-basic-multi/59... Greg Farnum

11/09/2014

10:41 PM Bug #9995 (Resolved): failing test_filelock
Zheng Yan
09:19 AM Bug #9341: MDS: very slow rejoin
Greg Farnum wrote:
> Hmm, we didn't put this in Giant initially because we were trying not to perturb it. Master has...
Dmitry Smirnov

11/08/2014

08:07 AM Bug #9977 (Fix Under Review): cephfs-journal-tool falsely reports invalid start_ptr
Backport to giant PR at:
https://github.com/ceph/ceph/pull/2887
John Spray

11/07/2014

04:27 PM Bug #10011: Journaler: failed on shutdown or EBLACKLISTED
Should be resolved by commit:6977d02f0d31c453cdf554a8f1796f290c1a3b89. We may want to backport once it's been through... Greg Farnum
04:16 PM Feature #4138 (Resolved): MDS: forward scrub: add functionality to verify disk data is consistent
This one ticket at least is definitely fulfilled by commit:daa9f9ffe82a811b5e0e69ef52241c4e0b7556bc Greg Farnum

11/06/2014

11:43 PM Bug #9995: failing test_filelock
Zheng Yan
12:16 AM Bug #9995: failing test_filelock
https://github.com/ceph/ceph-qa-suite/pull/228 Zheng Yan
09:46 PM Bug #9977 (Pending Backport): cephfs-journal-tool falsely reports invalid start_ptr
Merged to next in commit:574c1d4bad37514ba941e3ae83e33a7d926697d9
Yes, let's please backport.
Greg Farnum
05:49 PM Bug #9674: nightly failed multiple_rsync.sh
I messed up (didn't set sudo everywhere), newer commits will hopefully make it all good. giant:f66bf31b6743246fb1c882... Greg Farnum
11:16 AM Bug #10025 (Resolved): Journal undump causes MDS to crash when start pos is not on object boundary

Related ML thread from Jasper Siero, who first encountered the issue on firefly (http://lists.ceph.com/pipermail/ce...
John Spray

11/05/2014

08:58 AM Bug #9995: failing test_filelock
We'll need to update the test then so that it detects this situation and aborts quietly instead of raising an error. Greg Farnum
05:42 AM Bug #10011: Journaler: failed on shutdown or EBLACKLISTED
Ah... I've just realised why the "respawn on blacklist" thing I put in a while back isn't kicking in here: because Jo... John Spray
04:32 AM Bug #10011: Journaler: failed on shutdown or EBLACKLISTED

mon.a says:...
John Spray

11/04/2014

10:59 PM Bug #9995: failing test_filelock
... Zheng Yan
08:54 PM Bug #9995: failing test_filelock
Is there something we can do as a workaround to prevent this blocking things? I expect people are going to use new ce... Greg Farnum
07:36 PM Bug #9995 (Won't Fix): failing test_filelock
It's a bug in old versions of libfuse: it calls our setlk callback for both fcntl setlk and flock requests. Zheng Yan
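For context, fcntl (POSIX) locks and flock locks are separate interfaces, which is why routing both through one callback is a problem. The sketch below illustrates the distinction from userspace using Python's `fcntl` module; it says nothing about libfuse internals, `lock_kinds_demo` is a hypothetical name, and the behavior described in the comments is what a local Linux filesystem is expected to do (other platforms may differ).

```python
import fcntl
import os
import tempfile


def lock_kinds_demo(path):
    """Take a flock on one descriptor, then try both lock kinds on a second one.

    Returns (second_flock_granted, fcntl_lock_granted).
    """
    fd1 = os.open(path, os.O_RDWR)
    fd2 = os.open(path, os.O_RDWR)
    try:
        # flock locks belong to the open file description behind fd1.
        fcntl.flock(fd1, fcntl.LOCK_EX)
        try:
            # A second exclusive flock through a separate open() conflicts.
            fcntl.flock(fd2, fcntl.LOCK_EX | fcntl.LOCK_NB)
            second_flock = True
        except OSError:
            second_flock = False
        try:
            # lockf() issues an fcntl-style record lock, a different lock type.
            fcntl.lockf(fd2, fcntl.LOCK_EX | fcntl.LOCK_NB)
            posix_lock = True
        except OSError:
            posix_lock = False
        return second_flock, posix_lock
    finally:
        os.close(fd1)
        os.close(fd2)


if __name__ == "__main__":
    with tempfile.NamedTemporaryFile() as tmp:
        print(lock_kinds_demo(tmp.name))
```

If a filesystem treats the two request types as one, results like these can change, which is the kind of discrepancy a file-lock test would notice.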
05:46 PM Bug #9994: ceph-qa-suite: nfs mount timeouts
teuthology-2014-11-03_23:10:01-knfs-giant-testing-basic-multi/585658/ Greg Farnum
05:40 PM Bug #10011 (Resolved): Journaler: failed on shutdown or EBLACKLISTED
http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-03_23:08:01-kcephfs-giant-testing-basic-multi/585648/
teuth...
Greg Farnum
06:53 AM Bug #9869 (Resolved): Client: not handling cap_flush_ack messages properly
Greg Farnum

11/03/2014

07:55 PM Feature #1398: qa: multiclient file io test
A first pass of this is in origin/wip-multiclientio-wusui Anonymous
12:10 PM Bug #9997 (Resolved): test_client_pin case is failing
http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-02_23:04:01-fs-next-testing-basic-multi/583588/
RuntimeErro...
Greg Farnum
12:05 PM Bug #9995 (Resolved): failing test_filelock
http://qa-proxy.ceph.com/teuthology/teuthology-2014-11-02_23:04:01-fs-next-testing-basic-multi/583589/
It's gettin...
Greg Farnum
11:43 AM Bug #9994: ceph-qa-suite: nfs mount timeouts
http://qa-proxy.ceph.com/teuthology/teuthology-2014-10-31_23:10:01-knfs-giant-testing-basic-multi/582459/
http://q...
Greg Farnum
11:34 AM Bug #9994 (Resolved): ceph-qa-suite: nfs mount timeouts
... Greg Farnum
11:27 AM Bug #9977: cephfs-journal-tool falsely reports invalid start_ptr
https://github.com/ceph/ceph/pull/2853 John Spray
11:27 AM Bug #9977 (Fix Under Review): cephfs-journal-tool falsely reports invalid start_ptr
PR up for next, probably also worth backporting to giant as without it journal-tool is pretty useless on filesystems ... John Spray

10/31/2014

05:10 PM Tasks #3680 (Rejected): deduplication in ceph
we should discuss this on the email list Sage Weil
10:48 AM Bug #9977 (Resolved): cephfs-journal-tool falsely reports invalid start_ptr

This is happening when the journal expire_pos isn't at an object boundary. The expected start_ptr counter is being...
John Spray
10:03 AM Feature #1398: qa: multiclient file io test
... Anonymous

10/30/2014

10:11 AM Feature #1398: qa: multiclient file io test
A task that implements this could be useful for testing calamari as well (I manually did some of the things needed he... Anonymous
10:08 AM Feature #1398 (In Progress): qa: multiclient file io test
Anonymous
09:37 AM Feature #9881 (In Progress): mds: admin command to flush the mds journal
John Spray

10/29/2014

09:34 PM Feature #9940: uclient: be more robust when dealing with outstanding RADOS IO and stale caps
While in the general case it is necessary to fence clients that have become unresponsive to the MDS, this type of "so... John Spray
09:23 PM Feature #9940 (New): uclient: be more robust when dealing with outstanding RADOS IO and stale caps
If we've given IO to the Objecter and our caps go stale, we need to do something to handle it. Greg Farnum
09:06 PM Bug #1666 (Resolved): hadoop: time-related meta-data problems
We now take client timestamps for almost everything, so this should no longer be a problem and I'm closing it unless ... Greg Farnum
11:04 AM Bug #9935: client: segfault on ceph_rmdir path "/"
Yes, EBUSY is what a local filesystem gives you, so that sounds right to me. John Spray
10:48 AM Bug #9935 (Resolved): client: segfault on ceph_rmdir path "/"
A segfault occurs when removing the root directory. What is the expected behavior? I think -EBUSY is what makes sense. Noah Watkins
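To compare against what a local filesystem reports, the error can be observed directly with the standard library. This is a small sketch that only probes the local kernel, not libcephfs; `rmdir_errno` is a hypothetical helper name, and the attempt is harmless because the call is refused.

```python
import errno
import os


def rmdir_errno(path):
    """Attempt os.rmdir(path) and return the symbolic errno name on failure.

    Returns None if the directory was actually removed.
    """
    try:
        os.rmdir(path)
    except OSError as exc:
        # Map the numeric code (e.g. errno.EBUSY) to its name when known.
        return errno.errorcode.get(exc.errno, str(exc.errno))
    return None


if __name__ == "__main__":
    # On a local Linux filesystem this is expected to report EBUSY,
    # matching the comments above; other platforms may report differently.
    print(rmdir_errno("/"))
```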

10/28/2014

12:43 PM Bug #9900 (Duplicate): Failure in multiple_rsync (directories wrongly appear changed)
I imagine this is a dup of #9894? Greg Farnum
12:18 PM Bug #9800 (Pending Backport): client-limits test is not passing
I don't know that we need/want to try and push this in before release (although since it's all guarded inside of a br... Greg Farnum
05:29 AM Bug #9800 (Resolved): client-limits test is not passing
... John Spray
11:12 AM Bug #8255 (Fix Under Review): mds: directory with missing object cannot be removed
https://github.com/ceph/ceph/pull/2821 Zheng Yan

10/27/2014

06:17 PM Feature #4138 (Fix Under Review): MDS: forward scrub: add functionality to verify disk data is co...
This bit at least has been isolated and put into a PR:
https://github.com/ceph/ceph/pull/2814
Greg Farnum
04:23 PM Bug #9870 (Resolved): kernel: not handling cap_flush_ack messages properly
Zheng Yan
10:28 AM Bug #9904 (Resolved): Don't crash MDS on clients sending messages with bad seq
Currently in Server::handle_client_session, we do this:... John Spray
10:14 AM Feature #9903 (Resolved): Recover lost dirfrag via data pool

[While the MDS cluster is offline and journal has been flushed if necessary]
Given that a particular dirfrag obj...
John Spray
09:36 AM Bug #9900 (Duplicate): Failure in multiple_rsync (directories wrongly appear changed)

http://pulpito.ceph.com/teuthology-2014-10-24_23:08:01-kcephfs-giant-testing-basic-multi/570840/
http://pulpito.ce...
John Spray
06:05 AM Bug #9800: client-limits test is not passing
https://github.com/ceph/ceph/pull/2809
http://pulpito.front.sepia.ceph.com/john-2014-10-27_13:05:29-fs:recovery-wip-...
John Spray

10/24/2014

11:14 AM Bug #9884: too many files in /usr for multiple_rsync.sh
Yeah, just cutting it down to a more predictable/smaller directory sounds good to me. Greg Farnum
10:50 AM Bug #9884: too many files in /usr for multiple_rsync.sh
one failure http://pulpito.ceph.com/teuthology-2014-10-20_23:04:01-fs-giant-distro-basic-multi/562537/ Zheng Yan
10:49 AM Bug #9884 (Closed): too many files in /usr for multiple_rsync.sh
For example, plana81 has 60k files in /usr, but plana90 has 90k files in /usr. Perhaps multiple_rsync should /usr/src... Zheng Yan
09:53 AM Feature #3882 (Rejected): Hide snapshot directory name in mount/mtab
we can now restrict snap access by uid... Sage Weil
09:49 AM Feature #9883 (Resolved): journal-tool: smarter scavenge (conditionally update dir objects)
Sage Weil
09:42 AM Feature #9881 (Resolved): mds: admin command to flush the mds journal
Sage Weil
09:41 AM Feature #9880 (Resolved): mds: more gracefully handle EIO on missing dir object
Sage Weil

10/23/2014

01:47 PM Bug #9869 (Pending Backport): Client: not handling cap_flush_ack messages properly
I tested this manually with a patch that sets the starting tid value to 65535 and by looking at the logs. That causes im... Greg Farnum
12:47 PM Bug #9870: kernel: not handling cap_flush_ack messages properly
Zheng Yan

10/22/2014

05:34 PM Bug #9870 (Resolved): kernel: not handling cap_flush_ack messages properly
This is the analogue to #9869, which Zheng tells me is also a problem in the kernel. We need to downcast the message ... Greg Farnum
05:30 PM Bug #9869: Client: not handling cap_flush_ack messages properly
Waiting for this to build so it can be tested. Greg Farnum
05:28 PM Bug #9869 (Resolved): Client: not handling cap_flush_ack messages properly
We saw a log segment that contained this:... Greg Farnum

10/21/2014

03:22 PM Feature #9557 (Fix Under Review): mds: verify backtrace on fetch_dir
Zheng Yan
10:44 AM Feature #9557 (In Progress): mds: verify backtrace on fetch_dir
Greg Farnum
11:43 AM Bug #8809 (Can't reproduce): uclient: memory leak
maybe fixed by 2313ce1d024361fd7f4d2cbca789010f0fe0faad Zheng Yan
10:55 AM Bug #9674: nightly failed multiple_rsync.sh
commit:477073aba1da880dfd0b8c82f4792788579f28b9 in master and commit:44ce33c12443909b02c7ee451ad45400f55d53c9 in giant Greg Farnum

10/20/2014

01:23 PM Feature #414 (Resolved): ceph-fuse: implement file locking
Zheng Yan
01:22 PM Bug #8576: teuthology: nfs tests failing on umount
teuthology commit:4f2957c42d0f76a399cb26c660ede9243c095779 runs those commands as well as the previous ones. Greg Farnum
01:02 PM Bug #9679 (Closed): Ceph hadoop terasort job failure
Fixed in cephfs-hadoop repo. Noah Watkins
11:15 AM Bug #9800: client-limits test is not passing

Same failure:
http://pulpito.front.sepia.ceph.com/teuthology-2014-10-17_23:04:02-fs-giant-distro-basic-multi/555...
John Spray
 
