Project

General

Profile

Activity

From 08/18/2014 to 09/16/2014

09/16/2014

05:37 PM Fix #9435 (Resolved): prevent use of cache pools as metadata or data pools
Merged into giant branch in commit:eb1b2e0072bf605095f4104c2b6c2abfba216dbe Greg Farnum
02:57 AM Fix #9435 (Fix Under Review): prevent use of cache pools as metadata or data pools
https://github.com/ceph/ceph/pull/2507 John Spray
02:16 PM Feature #9466: kclient: Extend CephFSTestCase tests to cover kclient
Got these passing at least once by hand using IPMI to work around #9477, suite scheduled:
http://pulpito.front.sep...
John Spray

09/15/2014

02:02 PM Bug #9444 (Resolved): "unmatched rstat" exception after firefly->master upgrade
if mds_verify_scatter isn't enabled, the MDS will fix rstat mismatch atomically. Zheng Yan
10:45 AM Bug #9444: "unmatched rstat" exception after firefly->master upgrade
I think you're right, John. I'm not sure why we never saw this before though — Zheng, what changed that we're looking... Greg Farnum
02:45 AM Bug #9444: "unmatched rstat" exception after firefly->master upgrade
Is this actually fixed, in the case of filesystems created using old code? It seems like the patch prevents creating... John Spray
01:04 PM Fix #9435: prevent use of cache pools as metadata or data pools
First half here: https://github.com/ceph/ceph/tree/wip-9435 (no handling of tiering updates yet) John Spray
12:47 PM Fix #9435 (In Progress): prevent use of cache pools as metadata or data pools
John Spray
12:32 PM Feature #9477: Handle kclient shutdown with dead network more gracefully

Ah, this *only* happens if I have some dirty state from userspace at the time. In this instance it's my Mount.open...
John Spray
11:59 AM Feature #9477 (Closed): Handle kclient shutdown with dead network more gracefully
... John Spray
10:14 AM Bug #9423 (Resolved): failure in client_recovery task
John Spray
10:14 AM Bug #9423: failure in client_recovery task

Fixed merged to giant....
John Spray
08:07 AM Bug #9423: failure in client_recovery task
John Spray
09:50 AM Feature #9466 (In Progress): kclient: Extend CephFSTestCase tests to cover kclient
John Spray
03:43 AM Feature #9466: kclient: Extend CephFSTestCase tests to cover kclient
kclient instrumentation to enable implementing KernelClient::get_global_id (mapping local mount to the ID we see on t... John Spray
03:38 AM Feature #9466 (Resolved): kclient: Extend CephFSTestCase tests to cover kclient

Currently the mds_client_recovery and mds_client_limits tasks in ceph-qa-suite only run against the fuse client, be...
John Spray
08:06 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
https://github.com/ceph/ceph-qa-suite/pull/140 John Spray
08:04 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
John Spray

09/12/2014

05:40 PM Bug #9427 (Resolved): osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_writte...
Merged to master in commit:e06f4251ac36503d33f203567ada1b096119ab80.
Immediately cherry-picked to giant in commit:c3...
Greg Farnum
11:35 AM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)
For posterity, the manual test procedure for the changes to rewrite that fix this issue:... John Spray
06:57 AM Bug #9427 (Fix Under Review): osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= las...
https://github.com/ceph/ceph/pull/2469 John Spray
05:31 PM Bug #9444 (Resolved): "unmatched rstat" exception after firefly->master upgrade
aha, yep! thanks Sage Weil
05:29 PM Bug #9444 (Fix Under Review): "unmatched rstat" exception after firefly->master upgrade
git cherry-pick da17394941386dab88ddbfed4af2c8cb6b5eb72f
https://github.com/ceph/ceph/pull/2479
Zheng Yan
06:45 AM Bug #9444 (Resolved): "unmatched rstat" exception after firefly->master upgrade

Create filesystem with firefly, then restart system with master binaries plus wip-9427-rewrite....
John Spray
01:57 PM Bug #9280 (Resolved): valgrind failures in ceph-fuse
commit:46bbe30e6895311e4ce5f9cf2dea3438db99188e Sage Weil
01:53 PM Fix #9435: prevent use of cache pools as metadata or data pools
Yes, that's what I'm hoping as well. That's what _check_remove_pool() is; we'd need to add an equivalent for tiering.... Greg Farnum
01:48 PM Fix #9435: prevent use of cache pools as metadata or data pools
I lean toward setting data pool to the base pool too. I worry about having to stand up so many guard rails, though. ... Sage Weil
01:43 PM Fix #9435: prevent use of cache pools as metadata or data pools
The user pointed out that right now we prevent assigning EC pools to CephFS. I believe this is the result of a user w... Greg Farnum
01:24 PM Bug #9423: failure in client_recovery task
Can we pull out the fix so we can merge it and have it run against giant going forward? Greg Farnum

09/11/2014

01:32 PM Fix #9435: prevent use of cache pools as metadata or data pools
This conversation is getting split across several mediums, but this shouldn't prevent specifying the use of a base po... Greg Farnum
01:13 PM Fix #9435: prevent use of cache pools as metadata or data pools
My vote is NAK on this. THis is exactly what I want to do on my cluster and I this is the only way EC can be used fo... Sage Weil
01:07 PM Fix #9435: prevent use of cache pools as metadata or data pools
Yeah, that's the simple solution. I was also wondering though if we wanted to do something more sophisticated trying ... Greg Farnum
12:48 PM Fix #9435: prevent use of cache pools as metadata or data pools
I mean something like this (although I'm not positive I got all the requirements right):... Joao Eduardo Luis
12:39 PM Fix #9435: prevent use of cache pools as metadata or data pools
would checking the nature of pools during 'fs new' on the monitor and failing if any of the specified pools (data or ... Joao Eduardo Luis
11:30 AM Fix #9435 (Resolved): prevent use of cache pools as metadata or data pools
From the mailing list... Greg Farnum
11:50 AM Feature #9437 (Resolved): make 'ceph tell mds.* ...' work, deprecate 'ceph mds tell * ...'
Sage Weil
09:26 AM Bug #9428 (Resolved): mds: tight mon reconnect loop
Sage Weil
08:35 AM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)
It doesn't need to be an absolute offset that gets fed to the standby-replay MDS, as long as it can use the informati... Greg Farnum
06:55 AM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)
So rewriting the truncate_finish part isn't too hard if we want to do that:
https://github.com/ceph/ceph/commit/4ae6...
John Spray
03:40 AM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)

The history:...
John Spray
07:10 AM Bug #9341: MDS: very slow rejoin
I re-built and re-deployed ceph with fuse patch; re-configured all kernel clients to use fuse client; re-mounted Ceph... Dmitry Smirnov
06:56 AM Feature #9375 (Fix Under Review): Send single 'many clients' health warning instead of N warnings...
John Spray
06:56 AM Feature #9189 (Fix Under Review): Expose client identifying metadata to MDS, e.g. hostname
John Spray

09/10/2014

10:09 PM Bug #9428 (Resolved): mds: tight mon reconnect loop
... Sage Weil
10:08 PM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)
wip-mds has hacky workaround Sage Weil
09:38 PM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)
ESubtreeMap has an expire_pos field, and we set it in ESubtreeMap::replay() if it is > the current expire pos. I thi... Sage Weil
09:30 PM Bug #9427: osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_written.expire_pos)
... Sage Weil
08:52 PM Bug #9427 (Resolved): osdc/Journaler.cc: 405: FAILED assert(last_written.write_pos >= last_writte...
... Sage Weil
07:02 PM Bug #9341: MDS: very slow rejoin
that patch is for kernel client. here is the patch for ceph-fuse Zheng Yan
05:23 PM Bug #9341: MDS: very slow rejoin
Zheng Yan wrote:
> are you using kernel client? If you are, please try the attached patch. I hope it will improve re...
Dmitry Smirnov
06:19 AM Bug #9341: MDS: very slow rejoin
are you using kernel client? If you are, please try the attached patch. I hope it will improve rejoin speed. Zheng Yan
03:56 PM Bug #9178: samba: ENOTEMPTY on "rm -rf"
the fix https://github.com/ceph/ceph/pull/2431 hasn't been merged yet Zheng Yan
01:54 PM Bug #9178: samba: ENOTEMPTY on "rm -rf"
/a/teuthology-2014-09-08_23:14:02-samba-master-testing-basic-multi/474551/ Greg Farnum
03:15 PM Bug #9423: failure in client_recovery task
Had seen this previously at http://pulpito.ceph.com/teuthology-2014-09-05_23:04:02-fs-master-testing-basic-multi/4701... John Spray
01:27 PM Bug #9423: failure in client_recovery task
http://pulpito.ceph.com/teuthology-2014-09-08_23:04:01-fs-master-testing-basic-multi/474441/ Greg Farnum
01:27 PM Bug #9423 (Resolved): failure in client_recovery task
... Greg Farnum
02:01 PM Bug #8427: ceph-fuse: Dumpling "cache still has 0+1 items, waiting (for caps to release?)" on shu...
/a/teuthology-2014-09-09_19:06:01-fs-dumpling-testing-basic-multi/475752
I copied the server logs to it
Greg Farnum
01:54 PM Bug #9177: ceph-fuse: failing MPI mdtest runs
/teuthology-2014-09-08_19:06:01-fs-dumpling-testing-basic-multi/473897/
/teuthology-2014-09-08_19:06:01-fs-dumpling-...
Greg Farnum
01:54 PM Bug #9280: valgrind failures in ceph-fuse
/teuthology-2014-09-08_23:04:01-fs-master-testing-basic-multi/474458/
/teuthology-2014-09-08_23:04:01-fs-master-test...
Greg Farnum
01:46 PM Bug #8576: teuthology: nfs tests failing on umount
... Greg Farnum
05:58 AM Feature #7316: improve mds state dumps (memory usage, completeness)
NB as follow up to our new health checks (9282, 9284) we should ensure we add admin socket commands for dumping the s... John Spray

09/09/2014

07:18 AM Bug #8055 (Can't reproduce): knfs: NFS: nfs4_discover_server_trunking unhandled error -5. Exiting...
Sage Weil
07:17 AM Bug #7613 (Can't reproduce): mds/MDCache.cc: 216: FAILED assert(inode_map.count(in->vino()) == 0)
Sage Weil
07:12 AM Bug #8757 (Won't Fix): no need to hold write lock on hardlink's dir while creating anchortable entry
the anchor table is no more, yay! Sage Weil
07:08 AM Bug #8576 (Need More Info): teuthology: nfs tests failing on umount
Sage Weil
07:07 AM Bug #9280: valgrind failures in ceph-fuse
Sage Weil
07:07 AM Bug #9341 (Need More Info): MDS: very slow rejoin
Sage Weil
07:04 AM Bug #5382 (Can't reproduce): mds: failed objecter assert on shutdown
Sage Weil
03:30 AM Bug #9178: samba: ENOTEMPTY on "rm -rf"
... Zheng Yan

09/08/2014

06:50 PM Bug #9358 (Resolved): mds: standby-replay mds crashes
Zheng Yan
01:11 PM Bug #9178: samba: ENOTEMPTY on "rm -rf"
http://qa-proxy.ceph.com/teuthology/teuthology-2014-09-07_23:14:01-samba-next-testing-basic-multi/471653/ Greg Farnum
01:10 PM Bug #6613: samba is crashing in teuthology
Still popping up:
http://qa-proxy.ceph.com/teuthology/teuthology-2014-09-07_23:14:01-samba-next-testing-basic-multi/...
Greg Farnum
01:01 PM Bug #9341: MDS: very slow rejoin
If you set the "mds dump cache after rejoin" config option to true it will dump out the cache to a file which we can ... Greg Farnum
11:29 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
http://qa-proxy.ceph.com/teuthology/teuthology-2014-09-05_23:04:02-fs-master-testing-basic-multi/470162/ Greg Farnum
11:29 AM Bug #8576: teuthology: nfs tests failing on umount
This is still showing up frequently. One example: http://qa-proxy.ceph.com/teuthology/teuthology-2014-09-07_23:10:01-... Greg Farnum
10:18 AM Feature #9283: mds: limit inodes with caps to <100% of cache
I don't think we specified any policies when discussing this. If you're thinking something like requesting each clien... Greg Farnum
06:59 AM Feature #9283: mds: limit inodes with caps to <100% of cache
Can't remember if we discussed policies for this at all -- could this be as simple as applying a some fraction to mds... John Spray
06:45 AM Feature #9375 (Resolved): Send single 'many clients' health warning instead of N warnings for N c...

Recently added per-client health warnings for 9282, 9284, these should (for some configurable threshold N of client...
John Spray
06:43 AM Feature #9284 (Fix Under Review): mds: warn when clients are not responding to cache pressure
John Spray
06:43 AM Feature #9282 (Resolved): mds: warn (and kill?) sessions to clients which aren't revoking caps
Merged the patch that adds clog warnings, addition "ceph status" warnings added in the 9284 branch. John Spray
04:08 AM Bug #9280: valgrind failures in ceph-fuse
More:
http://pulpito.ceph.com/teuthology-2014-09-05_23:04:02-fs-master-testing-basic-multi/470180/
http://pulpito...
John Spray

09/07/2014

11:02 PM Bug #9341: MDS: very slow rejoin
To the best of my knowledge this has nothing to do with #9264 and #8962.
Here is MDS restart log:...
Dmitry Smirnov

09/05/2014

04:40 PM Bug #9341: MDS: very slow rejoin
how may data files do you have? I guess the MDS is doing backpointer lookup, we can make the client first trim its ca... Zheng Yan
02:21 PM Bug #9341: MDS: very slow rejoin
I wonder if this is the same as some of our own recent tickets (see #9264 and #8962). What size journal does the MDS ... Greg Farnum
01:07 AM Bug #9358 (Resolved): mds: standby-replay mds crashes
... Zheng Yan

09/04/2014

11:13 PM Bug #4746 (Resolved): client: invalidate callback can deadlock
client does async invalidate now Zheng Yan
10:49 PM Bug #8648 (Resolved): Standby MDS leaks memory over time
fixed by commit eae88dad4c32e4bb5fb255ec4bf1be18b09d498e Zheng Yan
05:25 AM Feature #9282 (Fix Under Review): mds: warn (and kill?) sessions to clients which aren't revoking...
https://github.com/ceph/ceph/pull/2395 Zheng Yan
01:52 AM Bug #9341 (Resolved): MDS: very slow rejoin
I had fiasco trying to use CephFS as network share: today restart of MDS (i.e. down time) took ~3 hours most of which... Dmitry Smirnov

09/03/2014

09:08 PM Feature #9026 (Resolved): client: vxattr support for rctime, rsize, etc.
Merged in with commit:1bcf40459c48af5724c5eba672fdf2cc976c42ea Greg Farnum
06:04 PM Feature #5486: kclient: make it work with selinux
I think cephfs part is ready for selinux support. but ceph is not included in selinux policy. Zheng Yan
11:20 AM Feature #9338 (Resolved): A read timeout argument for cephfs mounts (fuse client especially)
When a cluster's cephfs mount hangs, for reasons such as IO pausing due to a cluster going full or osds crashing, the... JuanJose Galvez
08:09 AM Bug #9123 (Can't reproduce): kceph: had 130k+ inodes with write caps
going with zheng's theory :) Sage Weil
08:08 AM Bug #9121 (Resolved): mds: inode stuck recovering after client restart
Sage Weil
08:07 AM Bug #8962 (Resolved): kcephfs: client does not release revoked cap
old bug; restarted those clients. Sage Weil
06:36 AM Bug #9212 (Won't Fix): mon election delays mds beacon
... Zheng Yan
12:04 AM Feature #3289 (Resolved): ceph-fuse: somehow exert pressure on the VFS to remove dentries from th...
resolved by commit 1d0c62facf client: try shrinking kernel inode cache when trimming session caps Zheng Yan

09/02/2014

03:24 PM Documentation #8258 (Resolved): 0.80~rc1: outdated MDS man page

Oops, forgot to close this at the time....
John Spray
03:23 PM Bug #9252 (Resolved): Worker thread to advance MDS in absence of messages
... John Spray
03:23 PM Bug #9152 (Resolved): mds: beacon needs to not take mds_lock
... John Spray
03:23 PM Bug #9151 (Resolved): mds should log/error/warn when segments are NOT getting trimmed
... John Spray
11:06 AM Feature #9312 (Resolved): kclient: support signatures in kernel code
While refactoring authentication docs, Sage pointed out in review comments that "somewhere here we should note that t... John Wilkins
06:29 AM Feature #9284 (In Progress): mds: warn when clients are not responding to cache pressure
John Spray
02:54 AM Bug #8648 (Fix Under Review): Standby MDS leaks memory over time
https://github.com/ceph/ceph/pull/2375 Zheng Yan

09/01/2014

11:20 PM Feature #9026 (Fix Under Review): client: vxattr support for rctime, rsize, etc.
https://github.com/ceph/ceph/pull/2373 Zheng Yan
05:18 PM Bug #9216: mds may regard active clients as stale due to slow pg recovery
maybe it's the same as #9212 Zheng Yan
04:05 AM Bug #9297 (Duplicate): mds: null deref in _session_logged
Dupe of #9173: the fix hadn't landed at the point these tests were run. John Spray

08/31/2014

07:54 PM Bug #7750 (Can't reproduce): Attempting to mount a kNFS export of a sub-directory of a CephFS fil...
can't reproduce on 3.16 kernel Zheng Yan

08/30/2014

09:42 AM Feature #9299 (New): mds: improve cache lru
it appears to be very easy for a single task/client to push everything out of the cache. we have a stratified lru, b... Sage Weil
09:31 AM Bug #9297 (Duplicate): mds: null deref in _session_logged
ubuntu@teuthology:/var/lib/teuthworker/archive/john-2014-08-29_03:49:04-fs-wip-jcsp-test-testing-basic-multi/459392<p... Sage Weil

08/29/2014

10:08 AM Bug #9260 (Resolved): hadoop fs gets EINVAL
... John Spray
09:59 AM Bug #9252 (Fix Under Review): Worker thread to advance MDS in absence of messages
John Spray
09:59 AM Bug #9151 (Fix Under Review): mds should log/error/warn when segments are NOT getting trimmed
John Spray
07:18 AM Feature #9287 (Rejected): qa: hadoop: add big top tests to suite
Sage Weil
07:18 AM Feature #9286 (Rejected): qa: hadoop: test 2.x with teuthology
Sage Weil
07:02 AM Feature #9284 (Resolved): mds: warn when clients are not responding to cache pressure
Sage Weil
07:00 AM Feature #9283 (New): mds: limit inodes with caps to <100% of cache
Sage Weil
07:00 AM Feature #9282 (Resolved): mds: warn (and kill?) sessions to clients which aren't revoking caps
We need better ways of dealing with clients who aren't following our instructions. The most obvious of them is to sim... Greg Farnum
06:37 AM Bug #9280 (Resolved): valgrind failures in ceph-fuse

/a/john-2014-08-29_03:49:04-fs-wip-jcsp-test-testing-basic-multi
Valgrind: client (Leak_DefinitelyLost, Leak_Ind...
John Spray
12:43 AM Bug #9123: kceph: had 130k+ inodes with write caps
Zheng Yan
12:43 AM Bug #9123: kceph: had 130k+ inodes with write caps
I saw 10.214.137.25 in the mds log, 10.214.137.25 is gitbuilder-archive if IP hasn't changed. Maybe the issue and #89... Zheng Yan
12:15 AM Bug #8962: kcephfs: client does not release revoked cap
... Zheng Yan

08/28/2014

05:46 PM Bug #9266 (Resolved): ceph_test_libcephfs pool name failures
... John Spray
11:59 AM Bug #9266 (In Progress): ceph_test_libcephfs pool name failures
John Spray
09:20 AM Bug #9266 (Resolved): ceph_test_libcephfs pool name failures

http://pulpito.front.sepia.ceph.com/teuthology-2014-08-25_23:04:01-fs-master-testing-basic-multi/451157/
Also fa...
John Spray
05:36 PM Bug #9276 (New): Client::get_file_extent_osds asserts in object_locator_to_pg if osd map is out o...

This would happen if files in the filesystem had layouts referring to pools that were no in the OSD map, which can ...
John Spray
04:21 PM Bug #9264 (Duplicate): mds: occasionally log segments can't trim
Sage Weil
04:20 PM Bug #9264: mds: occasionally log segments can't trim
... Sage Weil
04:16 PM Bug #9264: mds: occasionally log segments can't trim
... Sage Weil
07:00 AM Bug #9264 (Duplicate): mds: occasionally log segments can't trim
it happened with latest lab mds restart yesterday; we have the logs (for another 6 days or so)... Sage Weil
11:59 AM Bug #9260: hadoop fs gets EINVAL
Could be related to #9266, a recurrence of something trying to look up pool names before osdmap is loaded in client. ... John Spray
10:08 AM Bug #9260: hadoop fs gets EINVAL
Duh, that last exception was just libcephfs-java not being installed. John Spray
09:46 AM Bug #9260: hadoop fs gets EINVAL

Hmm, apparently there's more than one way this can fail:...
John Spray
11:31 AM Bug #9178: samba: ENOTEMPTY on "rm -rf"
It might not need to if there's a client bug somewhere. (Or some other issue?) Greg Farnum
06:40 AM Bug #9178: samba: ENOTEMPTY on "rm -rf"
The strange thing is that the MDS never reply ENOTEMPTY. Zheng Yan
09:17 AM Bug #9173 (Resolved): Crash in Server::_session_logged
John Spray
08:09 AM Bug #8962: kcephfs: client does not release revoked cap
... Sage Weil
06:19 AM Bug #9252: Worker thread to advance MDS in absence of messages
Testing on wip-jcsp-test John Spray
06:19 AM Bug #9151: mds should log/error/warn when segments are NOT getting trimmed
Testing on wip-jcsp-test John Spray

08/27/2014

05:02 PM Bug #9260 (Resolved): hadoop fs gets EINVAL
This will fail on hadoop fs -put with EINVAL. No apparent problems in the libcephfs log.... Sage Weil
02:37 PM Feature #4583 (Resolved): libcephfs: add test that kills a client and verifies mds cleans it up
... John Spray
02:22 PM Feature #7810 (Resolved): libcephfs: add a test that freezes + unfreezes a client, and then verif...
... John Spray
02:21 PM Feature #4886 (Resolved): teuthology: add tests that use the MDS dumper
John Spray
02:21 PM Feature #4886: teuthology: add tests that use the MDS dumper
... John Spray
10:03 AM Bug #8962: kcephfs: client does not release revoked cap
... Sage Weil
06:51 AM Bug #9252 (Resolved): Worker thread to advance MDS in absence of messages

As we move dispatchers outside of the MDS (first Objecter, now Beacon), there are some cases that don't progress pr...
John Spray

08/26/2014

04:07 PM Bug #9105: ~ObjectCacher behaves poorly on EBLACKLISTED
Sage Weil
04:04 PM Bug #9105: ~ObjectCacher behaves poorly on EBLACKLISTED
NB the handing for this case in rbd landed with wip-objecter, keep this ticket open for general purpose ObjectCacher ... John Spray
04:05 PM Bug #9238: Floating point exception in Locker::calc_new_client_ranges
Loic: if the original change was on firefly too then yes John Spray
11:34 AM Bug #9238: Floating point exception in Locker::calc_new_client_ranges
Should this be backport to firefly ? Loïc Dachary
11:05 AM Bug #9238 (Resolved): Floating point exception in Locker::calc_new_client_ranges
Sage Weil
09:50 AM Bug #9238 (Fix Under Review): Floating point exception in Locker::calc_new_client_ranges
https://github.com/ceph/ceph/pull/2331 John Spray
09:29 AM Bug #9238 (Resolved): Floating point exception in Locker::calc_new_client_ranges

On master, MDS starts fine first time and then crashes on second start.
Floating point error on Locker::calc_new...
John Spray
06:26 AM Bug #9212: mon election delays mds beacon
mds sent beacon... Zheng Yan
06:09 AM Bug #9152 (Fix Under Review): mds: beacon needs to not take mds_lock
John Spray

08/25/2014

05:07 PM Bug #8878 (Resolved): mds lock cycle (wip-objecter)
Sage Weil
03:56 PM Bug #9216: mds may regard active clients as stale due to slow pg recovery
I haven't got that far yet, but if I had to guess I'd say it is not about caps, since when this happens, all existing... Alexandre Oliva
10:26 AM Bug #9216: mds may regard active clients as stale due to slow pg recovery
Interesting. Did you establish the mechanism by which the clients are being stale? Do they have a renew caps request ... Greg Farnum
01:59 AM Bug #9216 (New): mds may regard active clients as stale due to slow pg recovery
I occasionally get fuse and ceph.ko mounts into weird states, and I can generally track them down to the mds's decidi... Alexandre Oliva
10:29 AM Bug #9212: mon election delays mds beacon
Sage Weil
10:21 AM Bug #9212: mon election delays mds beacon
Did we identify why it was blacklisted? I don't think we have any tests that should make it that slow or whatever. Greg Farnum
09:30 AM Bug #9212 (Rejected): mon election delays mds beacon
EBLACKLISTED Sage Weil

08/24/2014

09:25 AM Bug #9212 (Won't Fix): mon election delays mds beacon
ubuntu@teuthology:/a/teuthology-2014-08-22_23:04:01-fs-master-testing-basic-multi/444359... Sage Weil

08/22/2014

02:57 AM Bug #4545: error creating empty object store. Invalid argument.
i maybe found the problem.
before you mkcephfs,you should ensure the dir(/var/lib/ceph/osd/ceph-0) empty.
once i wr...
cache china

08/21/2014

04:44 PM Bug #5762 (Resolved): teuthology: Failed MPI runs lead to a hung test instead of a failure
Sage Weil
09:55 AM Bug #9152 (In Progress): mds: beacon needs to not take mds_lock
wip-9152 John Spray
09:50 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
The compiler is spitting out a warning about getcwd -- no evidence that that's what it's actually hitting in this ins... John Spray
08:53 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
http://qa-proxy.ceph.com/teuthology/teuthology-2014-08-20_23:04:01-fs-next-testing-basic-multi/439228/ Greg Farnum
08:29 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
How did you track it down to getcwd? If that is the issue there are a bunch of avenues of attack here, and we should ... Greg Farnum
06:31 AM Bug #9177: ceph-fuse: failing MPI mdtest runs
mdtest has a getcwd call into an unzeroed buffer that it doesn't check the error of. If fuse is failing the getcwd f... John Spray
06:56 AM Bug #9151 (In Progress): mds should log/error/warn when segments are NOT getting trimmed
John Spray
05:56 AM Feature #9189 (Resolved): Expose client identifying metadata to MDS, e.g. hostname

Currently, when doing e.g. a "session ls" on an MDS's admin socket, we get client IDs and IP addresses. It would b...
John Spray
05:35 AM Bug #9173 (Fix Under Review): Crash in Server::_session_logged

https://github.com/ceph/ceph/pull/2297
John Spray

08/20/2014

10:47 AM Bug #9173: Crash in Server::_session_logged
Better log. John Spray
06:30 AM Bug #9173 (Resolved): Crash in Server::_session_logged

Hit by mds_client_recovery task...
John Spray
10:33 AM Bug #9178: samba: ENOTEMPTY on "rm -rf"
http://qa-proxy.ceph.com/teuthology/teuthology-2014-08-10_23:14:02-samba-next-testing-basic-plana/415869/
Greg Farnum
10:30 AM Bug #9178 (Resolved): samba: ENOTEMPTY on "rm -rf"
... Greg Farnum
10:14 AM Bug #9177 (Resolved): ceph-fuse: failing MPI mdtest runs
... Greg Farnum

08/19/2014

09:13 AM Bug #9152: mds: beacon needs to not take mds_lock
Hmm, the beacon send code doesn't need to hold the lock on its own, but it's triggered by the SafeTimer, which is jus... Greg Farnum
09:05 AM Bug #9151: mds should log/error/warn when segments are NOT getting trimmed
What kind of logging do we want? I assume you mean journal segments, and this is a bog standard operation...
If it's...
Greg Farnum
06:49 AM Fix #4286: SLES 11 - cfuse: disable 'big_writes'and 'atomic_o_trunc
Ian Colle
 

Also available in: Atom