Activity

From 01/13/2016 to 02/11/2016

02/11/2016

11:48 PM Bug #14716: "Thread.cc: 143: FAILED assert(status == 0)" in fs-hammer---basic-smithi
This one's odd. The problem in #14697 is different; it's actually calling timer.shutdown() twice there. Here, that is... Greg Farnum
11:39 PM Bug #14697: mds: assert in SafeTimer while suiciding
Greg Farnum
11:38 PM Bug #14697: mds: assert in SafeTimer while suiciding
> 2016-02-08T02:00:34.587 INFO:tasks.ceph.mds.a-s.mira064.stderr:Thread::join(): pthread_join failed with error 22
...
Greg Farnum
10:07 PM Bug #14735 (Resolved): ceph-fuse does not mount at boot on Debian Jessie
Greg Farnum
01:58 PM Bug #14735: ceph-fuse does not mount at boot on Debian Jessie
Fixed by https://github.com/ceph/ceph/pull/7607 Florent B
08:58 AM Bug #14735 (Resolved): ceph-fuse does not mount at boot on Debian Jessie
I have a problem with ceph-fuse on Debian Jessie.
I have this in my fstab:
@id=my_user,daemonize=false,mon_hos...
Florent B
10:06 PM Bug #14365 (Resolved): unsafe handle_config_change() methods
Fixed in https://github.com/ceph/ceph/pull/7312 and https://github.com/ceph/ceph/pull/7581 (both required). Greg Farnum
04:37 AM Bug #14384 (Resolved): fsx failed to compile
Loïc Dachary
04:37 AM Bug #13777 (Resolved): Ceph file system is not freeing space
Loïc Dachary
04:37 AM Bug #13714 (Resolved): Segmentation fault accessing file using fuse mount
Loïc Dachary
04:14 AM Bug #14732 (Duplicate): open returns EACCES when O_TRUNC is specified and write permission is den...
See http://tracker.ceph.com/issues/14692#note-2 for a few runs with this issue... Loïc Dachary

02/10/2016

06:08 AM Backport #13889 (Resolved): infernalis: Segmentation fault accessing file using fuse mount
Loïc Dachary
06:07 AM Backport #13931 (Resolved): infernalis: kclient: intermittent log warnings "client.XXXX isn't res...
Loïc Dachary
06:07 AM Backport #14067 (Resolved): infernalis : Ceph file system is not freeing space
Loïc Dachary
06:06 AM Backport #14490 (Resolved): infernalis: fsx failed to compile
Loïc Dachary

02/09/2016

11:58 PM Bug #14716 (Won't Fix): "Thread.cc: 143: FAILED assert(status == 0)" in fs-hammer---basic-smithi
Jobs:
http://qa-proxy.ceph.com/teuthology/teuthology-2016-02-09_08:45:22-fs-hammer---basic-smithi/1624/teuthology....
Yuri Weinstein
09:41 PM Bug #14714 (Won't Fix): three jobs in samba suite failing for hammer v0.94.6 QE validation
Runs:
mira
http://pulpito.ceph.com/teuthology-2016-02-09_13:11:14-samba-hammer---basic-mira/
smithi
http://p...
Yuri Weinstein
06:44 PM Bug #14698: Test failure: test_full_fsync (tasks.cephfs.test_full.TestQuotaFull)
http://pulpito.ceph.com/gregf-2016-02-08_22:02:06-fs-greg-fs-testing-27-1---basic-mira/1009/
Also this failure see...
Greg Farnum
03:06 PM Bug #14698: Test failure: test_full_fsync (tasks.cephfs.test_full.TestQuotaFull)
Dunno — it's QuotaFull rather than ClusterFull so I created a new ticket. Greg Farnum
11:17 AM Bug #14698: Test failure: test_full_fsync (tasks.cephfs.test_full.TestQuotaFull)
Different to http://tracker.ceph.com/issues/14258 ? John Spray
08:16 AM Backport #13932 (New): hammer: kclient: intermittent log warnings "client.XXXX isn't responding t...
Loïc Dachary
08:02 AM Backport #13932 (In Progress): hammer: kclient: intermittent log warnings "client.XXXX isn't resp...
Loïc Dachary
07:59 AM Backport #13932: hammer: kclient: intermittent log warnings "client.XXXX isn't responding to mcli...
I think the PR to backport is https://github.com/ceph/ceph/pull/6432 - it was merged on November 30. Nathan Cutler
06:04 AM Backport #13932: hammer: kclient: intermittent log warnings "client.XXXX isn't responding to mcli...
Zheng, was there something wrong with that PR or something? Greg Farnum
06:01 AM Backport #13932 (New): hammer: kclient: intermittent log warnings "client.XXXX isn't responding t...
The proposed PR has been closed, assuming no work is otherwise being done. Loïc Dachary
05:59 AM Backport #14668: hammer: Wrong ceph get mdsmap assertion
It is unfortunately too late, v0.94.6 is already frozen and being tested. Loïc Dachary
01:12 AM Bug #13980: all nfs v3 mounts fail in ovh lab
and in http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-02-07_20:10:01-knfs-hammer-testing-basic-openstack/ Yuri Weinstein

02/08/2016

07:04 PM Bug #14698 (Duplicate): Test failure: test_full_fsync (tasks.cephfs.test_full.TestQuotaFull)
http://pulpito.ceph.com/gregf-2016-02-08_00:50:24-fs-greg-fs-testing-27-1---basic-mira/10806/
http://pulpito.ceph.co...
Greg Farnum
06:32 PM Bug #14697 (Resolved): mds: assert in SafeTimer while suiciding
http://pulpito.ceph.com/gregf-2016-02-08_00:50:24-fs-greg-fs-testing-27-1---basic-mira/10767/... Greg Farnum
04:11 PM Bug #14672 (Rejected): MDS crashes with FAILED assert(inode_map.count(in->vino()) == 0) in 9.2.0
Yeah, I'm pretty sure about that. Greg Farnum
11:55 AM Bug #14672: MDS crashes with FAILED assert(inode_map.count(in->vino()) == 0) in 9.2.0
Was this the system that you ran newfs on? I think I was concerned that this might be from running newfs while the m... John Spray
07:25 AM Backport #14624 (Resolved): hammer: fsx failed to compile
Loïc Dachary
05:09 AM Backport #14690 (Rejected): infernalis: Client::_fsync() on a given file does not wait unsafe req...
Loïc Dachary
05:06 AM Bug #13443 (Resolved): Ceph-fuse won't start correctly when the option log_max_new in ceph.conf s...
Loïc Dachary

02/07/2016

08:20 AM Bug #14685 (New): dbench hang on native cifs mount
http://pulpito.ceph.com/teuthology-2016-02-03_19:14:02-samba-jewel---basic-mira/5717/
http://pulpito.ceph.com/teutho...
Zheng Yan
05:34 AM Bug #14684 (Resolved): test_scrub_checks fails
http://qa-proxy.ceph.com/teuthology/teuthology-2016-02-03_14:03:10-fs-jewel---basic-smithi/5365/teuthology.log
<pr...
Zheng Yan

02/05/2016

11:13 PM Backport #14668: hammer: Wrong ceph get mdsmap assertion
Hammer backport is staged. Maybe it's not too late to squeeze it into 0.94.6 - we'll see. Nathan Cutler
11:12 PM Backport #14668 (In Progress): hammer: Wrong ceph get mdsmap assertion
Nathan Cutler
11:08 PM Backport #14668: hammer: Wrong ceph get mdsmap assertion
h3. Original description
One of our hammer clusters won't start now after running ceph mds getmap.
I did:...
Nathan Cutler
11:04 PM Backport #14668: hammer: Wrong ceph get mdsmap assertion
https://github.com/ceph/ceph/pull/4203
I will stage hammer backport.
Nathan Cutler
10:54 AM Backport #14668: hammer: Wrong ceph get mdsmap assertion
This was fixed in >=infernalis, but apparently never backported.... John Spray
08:57 AM Backport #14668: hammer: Wrong ceph get mdsmap assertion
After a few more retries, the mons eventually started. Dan van der Ster
08:51 AM Backport #14668 (Resolved): hammer: Wrong ceph get mdsmap assertion
https://github.com/ceph/ceph/pull/7542 Dan van der Ster
11:05 PM Bug #14681 (Resolved): Wrong ceph get mdsmap assertion
https://github.com/ceph/ceph/pull/4203 Nathan Cutler
03:46 PM Bug #14672: MDS crashes with FAILED assert(inode_map.count(in->vino()) == 0) in 9.2.0
Added log. Kenneth Waegeman
03:45 PM Bug #14672 (Rejected): MDS crashes with FAILED assert(inode_map.count(in->vino()) == 0) in 9.2.0
Full log attached.
-9> 2016-02-05 15:26:29.015197 7f177de2f700 10 mds.0.locker got rdlock on (ipolicy sync r...
Kenneth Waegeman
10:45 AM Bug #13583: Client::_fsync() on a given file does not wait unsafe requests that create/modify the...
@yan do you think it should be backported to infernalis as well? It showed up when running tests:
* http://pulpit...
Loïc Dachary
10:41 AM Bug #13583 (Pending Backport): Client::_fsync() on a given file does not wait unsafe requests tha...
Loïc Dachary

02/04/2016

11:56 PM Bug #14641: don't let users specify 0 on stripe count or object size
This is based on http://www.spinics.net/lists/ceph-users/msg25363.html
Perhaps the cluster is old and they broke i...
Greg Farnum
11:43 PM Bug #14641: don't let users specify 0 on stripe count or object size
We already validate layouts in Server.cc. We *could* also validate them on load from disk or during scrub...? John Spray
03:15 PM Bug #14641 (Duplicate): don't let users specify 0 on stripe count or object size
Those are fairly nonsensical. Amongst other things, they induce a divide-by-zero in the StrayManager right now. Greg Farnum
03:16 PM Feature #14642 (New): Validate layouts everywhere we load them
See _calculate_ops_required, wherein we divide by their product without checking it's non-zero. Greg Farnum
02:34 PM Bug #14607 (Duplicate): Some fuse tests fail with: failed to remove ‘/home/ubuntu/cephtest’: Dire...
Dupe of: http://tracker.ceph.com/issues/13422 John Spray
07:13 AM Bug #14607: Some fuse tests fail with: failed to remove ‘/home/ubuntu/cephtest’: Directory not empty
I'm seeing this a *bunch* now as well. eg http://pulpito.ceph.com/gregf-2016-02-03_09:19:17-fs-greg-fs-testing-23---b... Greg Farnum
07:56 AM Feature #14640 (New): nfs: qa: evaluate connectathon NFS tests for applicability to our suite
https://fedorapeople.org/cgit/steved/public_git/cthon04.git/tree/README
Is there anything in there that might be u...
Greg Farnum

02/03/2016

09:37 AM Backport #14624 (In Progress): hammer: fsx failed to compile
Nathan Cutler
09:32 AM Backport #14624 (Resolved): hammer: fsx failed to compile
https://github.com/ceph/ceph/pull/7501 Nathan Cutler
08:49 AM Bug #12710 (Resolved): fsstress.sh fails
Nathan Cutler
06:40 AM Bug #10436 (Fix Under Review): ceph-fuse: snapshot flushing from page cache to Client is not cohe...
https://github.com/ceph/ceph/pull/7495 Zheng Yan
05:46 AM Backport #14584 (Resolved): hammer: fsstress.sh fails
Loïc Dachary

02/02/2016

06:32 PM Bug #13903 (Resolved): Failure in TestStrays.test_ops_throttle
Whoops, merged this last week. Greg Farnum
02:29 PM Bug #14608 (Can't reproduce): snaptests.yaml failure: [WRN] open_snap_parents has:" in cluster log

http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-01-30_18:04:02-fs-master---basic-openstack/13302/
John Spray
02:18 PM Bug #14607 (Duplicate): Some fuse tests fail with: failed to remove ‘/home/ubuntu/cephtest’: Dire...

e.g. http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-01-30_18:04:02-fs-master---basic-openstack/13319/
& ...
John Spray
01:57 PM Bug #10436 (In Progress): ceph-fuse: snapshot flushing from page cache to Client is not coherent
Zheng Yan
11:44 AM Bug #10436: ceph-fuse: snapshot flushing from page cache to Client is not coherent
Zheng Yan
09:21 AM Bug #14557 (Duplicate): snaps: failed snaptest-multiple-capsnaps.sh
... Zheng Yan

02/01/2016

03:33 AM Backport #14584 (In Progress): hammer: fsstress.sh fails
Loïc Dachary
03:30 AM Backport #14584 (Resolved): hammer: fsstress.sh fails
https://github.com/ceph/ceph/pull/7454 Loïc Dachary
03:26 AM Bug #12710 (Pending Backport): fsstress.sh fails
Loïc Dachary

01/29/2016

10:25 AM Backport #14067 (In Progress): infernalis : Ceph file system is not freeing space
Abhishek Varshney
10:18 AM Backport #14490 (In Progress): infernalis: fsx failed to compile
Abhishek Varshney
05:19 AM Bug #14557 (Duplicate): snaps: failed snaptest-multiple-capsnaps.sh
This is on a testing branch, but it's about to get merged to master.
http://pulpito.ceph.com/gregf-2016-01-26_15:35:...
Greg Farnum
05:15 AM Backport #12350 (In Progress): Provided logrotate setup does not handle ceph-fuse correctly
h3. original description
(moved here because Backport issues only have a link to the PR in the description)
OS: ...
Loïc Dachary

01/28/2016

02:50 AM Bug #13926 (Need More Info): lockup in multithreaded application
Did not find anything in the log. Zheng Yan

01/27/2016

06:01 AM Bug #13926: lockup in multithreaded application
Zheng, anything come out of this? Greg Farnum
05:59 AM Bug #10336 (Can't reproduce): hung ffsb test
Greg Farnum
05:57 AM Bug #13546 (Resolved): mv of directories hung Ceph filesystem
Greg Farnum

01/25/2016

05:10 AM Backport #14490 (Resolved): infernalis: fsx failed to compile
https://github.com/ceph/ceph/pull/7429 Loïc Dachary
04:10 AM Bug #14489 (New): ovh: ENOSPC on multiple_rsync.sh as part of cfuse_workunit_misc
http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-01-16_18:04:01-fs-master---basic-openstack/4272/ Greg Farnum
04:07 AM Bug #14488: ovh: ENOSPC in kernel_untar_build
http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-01-18_20:08:01-kcephfs-master-testing-basic-openstack/6824/ Greg Farnum
03:58 AM Bug #14488 (New): ovh: ENOSPC in kernel_untar_build
http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-01-16_20:08:02-kcephfs-master-testing-basic-openstack/4363/ Greg Farnum
03:50 AM Bug #13977: ovh: sambatorture.scan-pipe_number fails on ENOMEM
http://teuthology.ovh.sepia.ceph.com/teuthology/teuthology-2016-01-16_23:14:01-samba-master---basic-openstack/4432/ Greg Farnum
03:48 AM Bug #14486 (New): ovh: samba test filled up disk
http://pulpito.ovh.sepia.ceph.com:8081/teuthology-2016-01-17_23:14:01-samba-hammer---basic-openstack/5996/ Greg Farnum

01/21/2016

08:15 PM Feature #14456 (Resolved): mon: prevent older/incompatible clients from mounting the file system
If the kernel client mounts something with quotas set, we should provide server-side warning messages. For now, this ... Greg Farnum
01:32 PM Support #14450 (New): ceph-fuse does not work without loaded ceph/rbd kernel module writing big data
Hi,
I was installing different OSes (CentOS 6, CentOS 7, Debian 8).
I was installing ceph-fuse only (hammer r...
Oliver Dzombc
01:32 PM Bug #14365 (Fix Under Review): unsafe handle_config_change() methods
John Spray
01:01 AM Bug #14395: cephfs_journal_tool fails
Sorry, I made a mistake. xie xingguo

01/20/2016

02:42 PM Bug #13903 (Fix Under Review): Failure in TestStrays.test_ops_throttle
Zheng Yan
02:41 PM Bug #13903: Failure in TestStrays.test_ops_throttle
https://github.com/ceph/ceph/pull/7297 Zheng Yan
02:55 AM Bug #13903: Failure in TestStrays.test_ops_throttle
Zheng, please take a look. Greg Farnum
02:12 PM Bug #14365: unsafe handle_config_change() methods
Also, MDSDaemon::handle_conf_change was broken because it was taking mds_lock, when mds_lock is already taken in the ... John Spray
01:00 PM Bug #14365: unsafe handle_config_change() methods
Turns out there's a pre-existing lock cycle issue similar to #14374 here; we just never noticed because live config c... John Spray
12:58 PM Bug #14365 (In Progress): unsafe handle_config_change() methods
John Spray
11:43 AM Bug #14374 (Fix Under Review): MDS asok handlers trigger lock cycle assertion if they take mds_lock
https://github.com/ceph/ceph/pull/7295 John Spray
02:49 AM Bug #14395 (Resolved): cephfs_journal_tool fails
Greg Farnum
02:24 AM Bug #14377 (Resolved): [ FAILED ] LibCephFS.DirLs
Greg Farnum
02:17 AM Bug #11517 (Resolved): Libcephfs: Doesn't check file's open mode when do read/write
Greg Farnum
02:13 AM Bug #14254 (Resolved): failed pjd chown test 117
Greg Farnum

01/19/2016

07:42 PM Feature #14427 (Resolved): qa: run snapshot tests under thrashing
We run snapshot tests in their own subsuite right now to verify they keep functioning, but we do not test them under ... Greg Farnum
07:41 PM Bug #10436: ceph-fuse: snapshot flushing from page cache to Client is not coherent
Once this is fixed we need to re-enable snaptest-snap-rm-cmp.sh in the snaptests.yaml qa-suite config fragment. Greg Farnum
07:39 PM Feature #3819 (Resolved): mds: re-add snaptests to qa suite
https://github.com/ceph/ceph-qa-suite/pull/678 Greg Farnum
07:34 PM Bug #13903: Failure in TestStrays.test_ops_throttle
http://qa-proxy.ceph.com/teuthology/gregf-2016-01-18_19:56:11-fs-greg-fs-speculative-118---basic-mira/32912/ Greg Farnum
03:01 PM Bug #14255: qa: we are filling smithi disks with ffsb workloads
ffsb (config is at qa/workunits/suites/random_write.32.ffsb) only uses about 13G of space, no matter how long it runs. I ... Zheng Yan
02:02 AM Bug #14255: qa: we are filling smithi disks with ffsb workloads
I haven't looked yet but I suspect we're just running for a set period of time and the smithis are so much faster tha... Greg Farnum
08:45 AM Bug #14357 (Resolved): Delay in clientreplay on quiet clusters
Zheng Yan

01/18/2016

08:39 PM Bug #14365: unsafe handle_config_change() methods
I think these should be okay, but it's easy to fix. John, can you establish that we don't need locks or else set them... Greg Farnum
08:26 AM Bug #14395 (Fix Under Review): cephfs_journal_tool fails
https://github.com/ceph/ceph-qa-suite/pull/801 Zheng Yan
08:21 AM Bug #14395 (Resolved): cephfs_journal_tool fails
http://qa-proxy.ceph.com/teuthology/teuthology-2016-01-13_14:03:02-fs-jewel---basic-smithi/28198/teuthology.log
<p...
Zheng Yan
07:06 AM Bug #14380 (Fix Under Review): "ceph mds setmap" crashes mon on invalid input
https://github.com/ceph/ceph/pull/7262 Zheng Yan

01/15/2016

02:38 PM Bug #14384 (Pending Backport): fsx failed to compile
I sent a patch upstream and pushed a patch to the workunit in master branch so that it checks out a working commit in... Greg Farnum
01:39 PM Bug #14384 (Resolved): fsx failed to compile
http://teuthology.ovh.sepia.ceph.com/teuthology/teuthology-2016-01-11_23:04:01-fs-master---basic-openstack/2232/
htt...
Zheng Yan
01:12 PM Bug #14379 (Fix Under Review): Add confirmation flag to "ceph mds rmfailed"
https://github.com/ceph/ceph/pull/7248 Zheng Yan
12:30 AM Bug #14379 (Resolved): Add confirmation flag to "ceph mds rmfailed"

It's horribly dangerous but has a rather non-threatening name.
John Spray
09:27 AM Bug #13546 (Fix Under Review): mv of directories hung Ceph filesystem
I added it to https://github.com/ceph/ceph/pull/7199 Zheng Yan
09:22 AM Bug #14377 (Fix Under Review): [ FAILED ] LibCephFS.DirLs
https://github.com/ceph/ceph/pull/7246 Zheng Yan
06:09 AM Bug #14377: [ FAILED ] LibCephFS.DirLs
It's caused by duplicated entries in the readdir result. This can happen when readdir requires several mds requests and t... Zheng Yan
12:35 AM Bug #14380 (Resolved): "ceph mds setmap" crashes mon on invalid input
Needs exception handling around MDSMap::decode John Spray

01/14/2016

07:27 PM Feature #13569 (Resolved): ceph-fuse: support direct IO
Greg Farnum
07:25 PM Bug #13546 (In Progress): mv of directories hung Ceph filesystem
Do you have a PR or any other commits to go with that? Is it safe to unilaterally not drop caps in a case like this? Greg Farnum
06:18 PM Bug #14377: [ FAILED ] LibCephFS.DirLs
Zheng, you've been fixing a lot here lately, please take a look! Greg Farnum
06:18 PM Bug #14377 (Resolved): [ FAILED ] LibCephFS.DirLs
http://pulpito.ceph.com/gregf-2016-01-12_23:23:33-fs-greg-fs-testing-1-12-1---basic-mira/26500/
This failed the Di...
Greg Farnum
03:41 PM Bug #14374 (Resolved): MDS asok handlers trigger lock cycle assertion if they take mds_lock

http://pulpito.ceph.com/gregf-2016-01-12_23:29:42-fs-greg-fs-speculative---basic-mira/26556
The asok handler is ...
John Spray
03:14 AM Bug #14319: Double decreased the count to trim caps which will cause failing to respond to cache ...
This issue also exists on master, so I closed the original PR and created this new one.
https://github.com/ceph/ceph/pull/...
Zhi Zhang
02:39 AM Backport #12350 (Pending Backport): Provided logrotate setup does not handle ceph-fuse correctly
Zhi Zhang
12:42 AM Bug #14365 (Resolved): unsafe handle_config_change() methods
The handle_conf_change() methods in these files look potentially unsafe (modifying data without locks):... Josh Durgin

01/13/2016

02:21 PM Bug #13903: Failure in TestStrays.test_ops_throttle
If anyone has time, yes -- given enough time I can figure it out but it might be more obvious to someone more familia... John Spray
01:11 AM Bug #13903: Failure in TestStrays.test_ops_throttle
I think you talked about this in standup but I'm forgetting — do you need somebody else to look over the caps stuff h... Greg Farnum
02:20 PM Bug #13546: mv of directories hung Ceph filesystem
I have an explanation for this.

See https://github.com/ukernel/ceph/commit/a750c361cd631a1f87ee152083d2a42c49fd02b6
Zheng Yan
02:19 AM Bug #13546: mv of directories hung Ceph filesystem
Bumping this up as we ought to examine it and I think it's been lost in the shuffle. Greg Farnum
12:05 PM Bug #14357 (Fix Under Review): Delay in clientreplay on quiet clusters
https://github.com/ceph/ceph/pull/7216 John Spray
10:38 AM Bug #14357 (Resolved): Delay in clientreplay on quiet clusters
Because we are checking for clientreplay_done at the end of _dispatch, if a request is completing via a commit contex... John Spray
08:01 AM Bug #11517 (Fix Under Review): Libcephfs: Doesn't check file's open mode when do read/write
https://github.com/ceph/ceph/pull/7209 Zheng Yan
02:24 AM Bug #14256 (Resolved): mds: objecter assert on shutdown
Sage Weil
 
