Project

General

Profile

Activity

From 12/06/2016 to 01/04/2017

01/04/2017

01:57 PM Bug #16914: multimds: pathologically slow deletions in some tests
Right, but users are going to do this -- it needs to work. John Spray
10:31 AM Backport #18413 (In Progress): jewel: lookup of /.. in jewel returns -ENOENT
Nathan Cutler
10:26 AM Backport #18413 (Resolved): jewel: lookup of /.. in jewel returns -ENOENT
https://github.com/ceph/ceph/pull/12783 Nathan Cutler

01/03/2017

08:13 PM Bug #16397: nfsd selinux denials causing knfs tests to fail
Ok, sorry for the delay on this. Finally got around to opening a RHBZ:
https://bugzilla.redhat.com/show_bug.cg...
Jeff Layton
05:54 PM Bug #18408: lookup of /.. in jewel returns -ENOENT
The gory details of this problem are available here:
https://bugzilla.redhat.com/show_bug.cgi?id=1408989
Jeff Layton
05:52 PM Bug #18408 (Pending Backport): lookup of /.. in jewel returns -ENOENT
Jeff Layton
05:52 PM Bug #18408 (Resolved): lookup of /.. in jewel returns -ENOENT
This is a problem for nfs-ganesha, which needs to be able to perform a lookup of ".." in the root directory in order ... Jeff Layton
02:38 PM Bug #17563 (Resolved): extremely slow ceph_fsync calls
Kernel patches merged for v4.10. Jeff Layton
02:27 PM Bug #18131: ceph-fuse not clearing setuid/setgid bits on chown
Sorry for the late response. At the very least, we need these commits:... Jeff Layton
09:36 AM Bug #18396 (Resolved): Test Failure: kcephfs test_client_recovery.TestClientRecovery
http://pulpito.ceph.com/teuthology-2016-12-31_11:20:02-kcephfs-kraken-testing-basic-smithi/678546/
Looks like the ...
Zheng Yan
09:09 AM Bug #18361: Test failure: test_session_reject (tasks.cephfs.test_sessionmap.TestSessionMap)
https://github.com/ceph/ceph/pull/12757 Zheng Yan
03:55 AM Bug #18157 (Resolved): ceph-fuse segfaults on daemonize
Zheng Yan
03:47 AM Bug #18179 (Fix Under Review): MDS crashes on missing metadata object
https://github.com/ceph/ceph/pull/12749 Zheng Yan
03:02 AM Bug #18047 (Resolved): assertion in MDSMap::get_up_features()
Zheng Yan
02:19 AM Bug #18362: Test failure: test_evict_client (tasks.cephfs.test_volume_client.TestVolumeClient)
http://qa-proxy.ceph.com/teuthology/teuthology-2016-12-31_17:15:02-fs-master---basic-smithi/679504/
http://qa-proxy....
Zheng Yan

12/30/2016

06:10 AM Bug #18211 (Resolved): test_snapshot_remove (tasks.cephfs.test_strays.TestStrays) failed at data ...
Zheng Yan
06:08 AM Bug #18363: Test failure: test_ops_throttle (tasks.cephfs.test_strays.TestStrays)
Oops, The log files no long exist. looks like someone is actively deleting the logs Zheng Yan

12/29/2016

09:58 AM Bug #18363 (Can't reproduce): Test failure: test_ops_throttle (tasks.cephfs.test_strays.TestStrays)
http://qa-proxy.ceph.com/teuthology/teuthology-2016-12-24_17:15:02-fs-master---basic-smithi/663459/teuthology.log Zheng Yan
09:55 AM Bug #18362 (Duplicate): Test failure: test_evict_client (tasks.cephfs.test_volume_client.TestVolu...
http://qa-proxy.ceph.com/teuthology/teuthology-2016-12-27_11:10:01-fs-kraken---basic-smithi/669343/
probably it'...
Zheng Yan
09:53 AM Bug #18361: Test failure: test_session_reject (tasks.cephfs.test_sessionmap.TestSessionMap)
http://qa-proxy.ceph.com/teuthology/teuthology-2016-12-27_11:10:01-fs-kraken---basic-smithi/669334/ Zheng Yan
09:49 AM Bug #18361 (Fix Under Review): Test failure: test_session_reject (tasks.cephfs.test_sessionmap.Te...
https://github.com/ceph/ceph/pull/12708 Zheng Yan
09:16 AM Bug #18361 (Resolved): Test failure: test_session_reject (tasks.cephfs.test_sessionmap.TestSessio...
Zheng Yan

12/28/2016

10:19 AM Bug #16914: multimds: pathologically slow deletions in some tests
The reason is that you use "rm -rf delete_me/*" to delete files. ceph-fuse needs to do a lookup "delete_me" for each ... Zheng Yan

12/27/2016

11:33 AM Bug #11482 (Resolved): kclient: intermittent log warnings "client.XXXX isn't responding to mclien...
Nathan Cutler
11:32 AM Backport #13932 (Rejected): hammer: kclient: intermittent log warnings "client.XXXX isn't respond...
Nathan Cutler

12/24/2016

02:45 PM Bug #16914: multimds: pathologically slow deletions in some tests
I have a nice simple reproducer for this now (even with fuse default permissions = false it has the slowdown).
It'...
John Spray

12/22/2016

05:26 PM Bug #18309: TestVolumeClient.test_evict_client failure creating pidfile
Alternative approach: https://github.com/ceph/ceph/pull/12628 Nathan Cutler
02:17 PM Bug #18314 (Resolved): commit 41d46e492 "osd/ReplicatedPG: limit omap request by bytes" breaks la...
Sage Weil
01:58 PM Bug #18314: commit 41d46e492 "osd/ReplicatedPG: limit omap request by bytes" breaks large directory
http://tracker.ceph.com/issues/18334 to track a proper fix for OMAP_GETKEYS Sage Weil
10:14 AM Backport #17478 (Resolved): jewel: MDS goes damaged on blacklist (failed to read JournalPointer: ...
Loïc Dachary
10:14 AM Backport #17582 (Resolved): jewel: monitor assertion failure when deactivating mds in (invalid) f...
Loïc Dachary
10:14 AM Backport #17615 (Resolved): jewel: mds: false "failing to respond to cache pressure" warning
Loïc Dachary
10:14 AM Backport #17617 (Resolved): jewel: [cephfs] fuse client crash when adding a new osd
Loïc Dachary
10:14 AM Backport #17697 (Resolved): jewel: MDS long-time blocked ops. ceph-fuse locks up with getattr of ...
Loïc Dachary
10:14 AM Backport #17706 (Resolved): jewel: multimds: mds entering up:replay and processing down mds aborts
Loïc Dachary
10:14 AM Backport #17720 (Resolved): jewel: MDS: false "failing to respond to cache pressure" warning
Loïc Dachary
10:13 AM Backport #17841 (Resolved): jewel: mds fails to respawn if executable has changed
Loïc Dachary
10:13 AM Backport #17885 (Resolved): jewel: "[ FAILED ] LibCephFS.InterProcessLocking" in jewel v10.2.4
Loïc Dachary

12/21/2016

08:02 PM Bug #18309 (Fix Under Review): TestVolumeClient.test_evict_client failure creating pidfile
-https://github.com/ceph/ceph/pull/12606- Nathan Cutler
06:20 PM Bug #18309: TestVolumeClient.test_evict_client failure creating pidfile
The problem is that global_init_prefork is calling pidfile_write, and we started using that from the client in 83aaa5... John Spray
06:17 PM Backport #18308 (Resolved): ceph-fuse not clearing setuid/setgid bits on chown
Nathan Cutler
05:20 PM Backport #18308 (New): ceph-fuse not clearing setuid/setgid bits on chown
Nathan Cutler
04:36 PM Backport #18308: ceph-fuse not clearing setuid/setgid bits on chown
h3. original description
I had some test failures that showed up in my most recent fs suite run here:
http...
Nathan Cutler
06:17 PM Bug #18131: ceph-fuse not clearing setuid/setgid bits on chown
*master PR*: https://github.com/ceph/ceph/pull/12331 Nathan Cutler
04:38 PM Bug #18131: ceph-fuse not clearing setuid/setgid bits on chown
@Jeff: Which commit fixes the issue/should be backported to jewel? Nathan Cutler
05:18 PM Bug #18254 (Resolved): path restricted cephx caps not working correctly
Nathan Cutler
05:16 PM Bug #18254: path restricted cephx caps not working correctly
*master PR*: https://github.com/ceph/ceph/pull/12505 Nathan Cutler
04:52 PM Bug #18254: path restricted cephx caps not working correctly
@Jeff: We have a system/service in place for backporting bugfixes to our stable releases. Patches backported via this... Nathan Cutler
05:18 PM Backport #18307: path restricted cephx caps not working correctly
(removed attachments that are available at #18254) Nathan Cutler
05:17 PM Backport #18307 (Resolved): path restricted cephx caps not working correctly
Nathan Cutler
04:39 PM Backport #18307 (New): path restricted cephx caps not working correctly
h3. original description
Ramana noticed this first while testing my ganesha patches to allow restricting exports. ...
Nathan Cutler
02:24 PM Bug #18314 (Fix Under Review): commit 41d46e492 "osd/ReplicatedPG: limit omap request by bytes" b...
https://github.com/ceph/ceph/pull/12599 Zheng Yan
08:37 AM Bug #18314: commit 41d46e492 "osd/ReplicatedPG: limit omap request by bytes" breaks large directory
... Zheng Yan
08:32 AM Bug #18314 (Resolved): commit 41d46e492 "osd/ReplicatedPG: limit omap request by bytes" breaks la...
OPTION(osd_max_omap_bytes_per_request, OPT_U64, 4<<20)
4M can only carry about 5k dentries. It's too small
Zheng Yan
11:14 AM Bug #15921 (Can't reproduce): segfault in cephfs-journal-tool (TestJournalRepair failure)
Haven't seen this failure in a long time. John Spray
11:11 AM Bug #2375 (Closed): rrdtoll data malfuntion..
Ancient, closing. John Spray
11:09 AM Bug #1206 (Closed): NFS reexport file creation lags 1-3 seconds
Closing this because it's ancient (and if NFS creates were super-slow we'd notice on the knfs suite) John Spray

12/20/2016

06:54 PM Backport #18307: path restricted cephx caps not working correctly
PR is up here:
https://github.com/ceph/ceph/pull/12592
Jeff Layton
01:00 PM Backport #18307 (Resolved): path restricted cephx caps not working correctly
https://github.com/ceph/ceph/pull/12592 Jeff Layton
06:06 PM Bug #18311 (Fix Under Review): Decode errors on backtrace will crash MDS
https://github.com/ceph/ceph/pull/12588 John Spray
03:16 PM Bug #18311 (Resolved): Decode errors on backtrace will crash MDS
Noticed by inspection:... John Spray
05:46 PM Bug #18225 (Resolved): MDS doesn't release memory after exceeding its cache size limit
John Spray
05:34 PM Bug #9935 (Fix Under Review): client: segfault on ceph_rmdir path "/"
https://github.com/ceph/ceph/pull/12550 Michal Jarzabek
01:19 PM Bug #18309 (Resolved): TestVolumeClient.test_evict_client failure creating pidfile
Consistent on master
http://pulpito.ceph.com/jspray-2016-12-19_21:05:25-fs-master-distro-basic-smithi/648157
I ...
John Spray
01:13 PM Backport #18308 (Resolved): ceph-fuse not clearing setuid/setgid bits on chown
https://github.com/ceph/ceph/pull/12591 Jeff Layton
01:01 PM Bug #18131 (Pending Backport): ceph-fuse not clearing setuid/setgid bits on chown
Jeff Layton
12:59 PM Bug #18254 (Pending Backport): path restricted cephx caps not working correctly
Jeff Layton
12:21 PM Bug #18254: path restricted cephx caps not working correctly
Patch merged. We'll also want this backported to jewel. Jeff Layton
11:16 AM Bug #18306 (Resolved): segfault in handle_client_caps
http://pulpito.ceph.com/jspray-2016-12-19_21:05:25-fs-master-distro-basic-smithi/648247... John Spray

12/16/2016

02:42 PM Backport #18283 (Closed): kraken: monitor cannot start because of "FAILED assert(info.state == MD...
Nathan Cutler
02:42 PM Backport #18282 (Resolved): jewel: monitor cannot start because of "FAILED assert(info.state == M...
https://github.com/ceph/ceph/pull/13123 Nathan Cutler

12/14/2016

10:52 PM Bug #18254: path restricted cephx caps not working correctly
The patch turns out to be pretty trivial:... Jeff Layton
09:26 PM Bug #18254: path restricted cephx caps not working correctly
Revised test program here, in patch format so it can build in tree. We should probably roll this up into a regression... Jeff Layton
08:56 PM Bug #18254: path restricted cephx caps not working correctly
Thanks Greg, I'll take a look at how all of that stuff gets set. FWIW, here's the log with the client debugging crank... Jeff Layton
08:44 PM Bug #18254: path restricted cephx caps not working correctly
Did you check the client log to see where it's failing out at?
I'd check the code flow from Client::mount() to the M...
Greg Farnum
08:28 PM Bug #18254: path restricted cephx caps not working correctly
The program logs this in the MDS logs when run. I'm definitely passing in a real path there:... Jeff Layton
08:12 PM Bug #18254: path restricted cephx caps not working correctly
Oh, and you will need to overwrite the key in the reproducer program with the one for "alice". Jeff Layton
08:05 PM Bug #18254 (Resolved): path restricted cephx caps not working correctly
Ramana noticed this first while testing my ganesha patches to allow restricting exports. It appears that attempting t... Jeff Layton
07:45 PM Bug #18119 (Closed): mds: check and get latest current logsegment to avoid trimming logsegment cr...
This does not seem to be an issue in either master nor Jewel. Greg Farnum
01:54 PM Feature #12132 (Resolved): cephfs-data-scan: Cleanup phase
https://github.com/ceph/ceph/pull/12337#pullrequestreview-12909728... John Spray
12:56 PM Bug #18166 (Pending Backport): monitor cannot start because of "FAILED assert(info.state == MDSMa...
John Spray

12/13/2016

10:47 PM Bug #18151: Incorrect report of size when quotas are enabled.
As commented in the users ML, we will be updating to 10.2.5 in early January. Will provide feedback once that is done. Goncalo Borges
07:58 PM Bug #18151: Incorrect report of size when quotas are enabled.
In fact the quota tree changes already got backported so I think this is resolved. http://tracker.ceph.com/issues/16313 Greg Farnum
06:17 AM Bug #18151 (In Progress): Incorrect report of size when quotas are enabled.
Well, this code changed a fair bit between Jewel and master, as Zheng ripped out the quota trees. However, there appe... Greg Farnum
05:59 PM Bug #18131: ceph-fuse not clearing setuid/setgid bits on chown
I'll lower the priority to Normal now.
Ok, this should be fixed in mainline kernels and coming to stable series ke...
Jeff Layton
02:24 PM Bug #18159 (Fix Under Review): "Unknown mount option mds_namespace"
https://github.com/ceph/ceph/pull/12465 John Spray
02:21 PM Bug #16691 (Resolved): sepia LRC lost directories
John Spray
02:21 PM Feature #17853 (Resolved): More deterministic timing for directory fragmentation
John Spray
01:47 PM Bug #18238 (Can't reproduce): TestDataScan failing due to log "unmatched rstat on 100"

This is almost certainly just something where we need to update the log whitelist, but I'm curious about how we got...
John Spray
10:14 AM Bug #17270: [cephfs] fuse client crash when adding a new osd
@Henrik: The fix appears to be to revert https://github.com/ceph/ceph/commit/1a48a8a2b222e41236341cb1241f0885a1b0b9d8... Nathan Cutler
09:39 AM Bug #17270: [cephfs] fuse client crash when adding a new osd
Is there are chance to get this backported to hammer too? We had same ceph-fuse crashes recently (0.94.9 ceph-fuse an... Henrik Korkuc
08:17 AM Bug #18211: test_snapshot_remove (tasks.cephfs.test_strays.TestStrays) failed at data pool empty ...
Zheng Yan
08:17 AM Bug #18211: test_snapshot_remove (tasks.cephfs.test_strays.TestStrays) failed at data pool empty ...
https://github.com/ceph/ceph-client/commit/6899bb08e4173b7dfc0aa232e589541da869411f Zheng Yan

12/12/2016

02:48 PM Bug #18157: ceph-fuse segfaults on daemonize
We worked around this somewhat badly in master/kraken, but Kefu's Preforker change is a better option. Greg Farnum
02:48 PM Bug #18159: "Unknown mount option mds_namespace"
Let's just make it silent on this case (unkonwn option) and let kernel reject it John Spray
02:15 PM Bug #17193 (Pending Backport): truncate can cause unflushed snapshot data lose
John Spray
02:07 PM Bug #9935 (In Progress): client: segfault on ceph_rmdir path "/"
John Spray
12:03 PM Bug #18225 (Fix Under Review): MDS doesn't release memory after exceeding its cache size limit
https://github.com/ceph/ceph/pull/12443 John Spray
11:37 AM Bug #18225 (Resolved): MDS doesn't release memory after exceeding its cache size limit

In some circumstances the MDS may fail to enforce its own cache size limits. Because boost::pools are used for all...
John Spray

12/09/2016

02:05 PM Bug #18211: test_snapshot_remove (tasks.cephfs.test_strays.TestStrays) failed at data pool empty ...
the object is from pool permission check. it's kernel version of http://tracker.ceph.com/issues/13782 Zheng Yan
09:51 AM Bug #18211 (Resolved): test_snapshot_remove (tasks.cephfs.test_strays.TestStrays) failed at data ...
http://pulpito.ceph.com/jspray-2016-12-06_12:37:38-kcephfs:recovery-master-testing-basic-smithi/611141/... Zheng Yan
09:49 AM Bug #17193: truncate can cause unflushed snapshot data lose
2016-12-06T13:28:03.559 INFO:tasks.cephfs_test_runner: self.assertTrue(self.fs.data_objects_absent(file_a_ino, siz... Zheng Yan

12/08/2016

05:38 PM Bug #18166 (Fix Under Review): monitor cannot start because of "FAILED assert(info.state == MDSMa...
https://github.com/ceph/ceph/pull/12395 John Spray
04:23 PM Bug #18166: monitor cannot start because of "FAILED assert(info.state == MDSMap::STATE_STANDBY)"
It looks like MDSMonitor::maybe_promote_standby is iterating over pending_fsmap.standby_daemons, but inside the loop ... John Spray
06:01 AM Bug #18166: monitor cannot start because of "FAILED assert(info.state == MDSMap::STATE_STANDBY)"
The attachment is the log of crash monitor.
Thanks!
guotao Yao
03:37 PM Bug #18131: ceph-fuse not clearing setuid/setgid bits on chown
Merged the chown part of this, and I think I have sorted out the problems I was having with the truncate and write co... Jeff Layton
07:08 AM Backport #18195 (Resolved): jewel: cephfs: fix missing ll_get for ll_walk
https://github.com/ceph/ceph/pull/13125 Loïc Dachary
07:05 AM Backport #18192 (Resolved): jewel: standby-replay daemons can sometimes miss events
https://github.com/ceph/ceph/pull/13126 Loïc Dachary

12/07/2016

08:06 PM Bug #18179 (Resolved): MDS crashes on missing metadata object
Saw this crash happening on a Jewel 10.2.3 MDS when it was missing a object in the metadata pool:... Wido den Hollander
05:01 PM Bug #18166: monitor cannot start because of "FAILED assert(info.state == MDSMap::STATE_STANDBY)"
So this cluster is freshly-created with version 10.2.3?
Can you upload the monitor log with ceph-post-file? (Prefera...
Greg Farnum
09:43 AM Bug #18166 (Resolved): monitor cannot start because of "FAILED assert(info.state == MDSMap::STATE...

ceph version: v10.2.3
operation system: ubuntu 14.04
linux kernel version: 3.13.0
Description:
I test for c...
guotao Yao
02:14 PM Bug #17954 (Pending Backport): standby-replay daemons can sometimes miss events
John Spray
02:14 PM Bug #16924 (Resolved): Crash replaying EExport
Not backporting because it's multi-mds John Spray
02:11 PM Bug #18016 (Duplicate): cephtool-test-mds.sh waiting for an active MDS daemon (intermittent)
Will assume this is duplicate of https://github.com/ceph/ceph/pull/12234 unless we can see evidence otherwise -- this... John Spray
12:53 PM Feature #17980: MDS should reject connections from OSD-blacklisted clients
Yes, these two should work together: 9754 to blacklist things, and then this ticket to enforce that blacklist on the ... John Spray
12:06 PM Bug #16771: mon crash in MDSMonitor::prepare_beacon on ARM
Hmm, still nothing's jumping out at me.
It is noteworthy that mds_gid_t is a BOOST_STRONG_TYPEDEF (unlike other th...
John Spray
09:58 AM Feature #17835 (In Progress): mds: enable killpoint tests for MDS-MDS subtree export
Vishal Kanaujia
08:16 AM Bug #18157: ceph-fuse segfaults on daemonize
an alternative fix at https://github.com/ceph/ceph/pull/12358 Kefu Chai
01:02 AM Documentation #18040 (Resolved): Documentation says not to run multiple MDS, but we can do that now
John Spray

12/06/2016

11:39 PM Bug #18159 (Resolved): "Unknown mount option mds_namespace"
I think this is just a spurious message coming from src/mount/mount.ceph.c because it was not updated when mds_namesp... John Spray
11:16 PM Bug #18157 (Fix Under Review): ceph-fuse segfaults on daemonize
https://github.com/ceph/ceph/pull/12347 Greg Farnum
10:20 PM Bug #18157: ceph-fuse segfaults on daemonize
Not detected in nightlies because of #18158 Greg Farnum
10:16 PM Bug #18157 (Resolved): ceph-fuse segfaults on daemonize
... Greg Farnum
06:52 PM Bug #17193: truncate can cause unflushed snapshot data lose
It looks like the patch hasn't eliminated the failure:
http://pulpito.ceph.com/jspray-2016-12-06_12:37:38-kcephfs:re...
John Spray
03:34 PM Bug #18131: ceph-fuse not clearing setuid/setgid bits on chown
Testing a PR now that fixes the setattr codepaths. That's pretty simple to do from those codepaths since we're sendin... Jeff Layton
02:58 PM Feature #18154 (Fix Under Review): qa: enable mds thrash exports tests

Currently:...
John Spray
05:40 AM Bug #18086 (Pending Backport): cephfs: fix missing ll_get for ll_walk
Greg Farnum
04:34 AM Bug #18151: Incorrect report of size when quotas are enabled.
Due to other bug / issue, I've run ceph-fuse in debug mode with 'debug client = 20'. Right after launching ceph-fuse,... Goncalo Borges
01:21 AM Bug #18151: Incorrect report of size when quotas are enabled.
1) My environment:
- ceph/cephfs in 10.2.2.
- All infrastructure is in the same version (rados cluster, mons, m...
Goncalo Borges
01:15 AM Bug #18151 (Resolved): Incorrect report of size when quotas are enabled.
Goncalo Borges
 

Also available in: Atom