Project

General

Profile

Activity

From 09/30/2013 to 10/29/2013

10/29/2013

07:45 PM Bug #6608 (Rejected): samba teuthology dbench failure
running dbench on local FS in parallel also results in similar failures. Zheng Yan
11:43 AM Bug #6608: samba teuthology dbench failure
http://qa-proxy.ceph.com/teuthology/teuthology-2013-10-27_19:01:26-fs-dumpling-testing-basic-plana/71285/
http://qa-...
Greg Farnum

10/28/2013

09:03 PM Bug #6613: samba is crashing in teuthology
tail of client log:
---
2013-10-22 08:05:27.405155 7ff1167fc700 20 client.4105 trim_cache size 0 max 0
2013-10-22 ...
Zheng Yan
10:05 AM Bug #6613: samba is crashing in teuthology
This is happening regularly on dumpling and next, but I don't think I've seen it on cuttlefish. We've clearly done so... Greg Farnum
10:04 AM Bug #6608: samba teuthology dbench failure
http://qa-proxy.ceph.com/teuthology/teuthology-2013-10-25_23:01:10-fs-master-testing-basic-plana/69202/
http://qa-pr...
Greg Farnum
09:40 AM Bug #6655 (Need More Info): readdir() fails on CephFS mount symlinked directories
Sage Weil

10/26/2013

04:38 AM Bug #6655: readdir() fails on CephFS mount symlinked directories
I can't reproduce this locally. Which kernel did you use? please try ceph-fuse and recent kernel. Zheng Yan
02:12 AM Bug #6655: readdir() fails on CephFS mount symlinked directories
If struggling to reproduce, it seems like readdir() works directly after other access to the symlink, but only once.
...
Pieter Steyn
01:38 AM Bug #6655 (Can't reproduce): readdir() fails on CephFS mount symlinked directories
Background:
* Ubuntu Server 12.04 64bit.
* CephFS Dumpling 0.67.4
* We moved from local filesystem to CephFS ...
Pieter Steyn

10/24/2013

01:33 AM Feature #3541 (Resolved): mds: robust ino lookup using file backpointers
Zheng Yan
01:33 AM Feature #4295 (Resolved): mds: Actually purge deleted directories
Zheng Yan

10/23/2013

06:49 PM Bug #6609: teuthology rsync workunit failure
files were synced appropriately. rsync only sync directory share/doc/ 's timestamp or mode when it was executed for t... Zheng Yan
03:18 PM Bug #6609: teuthology rsync workunit failure
I didn't look at the details much (even to figure out what the file transfer issues were). What kind of timestamp iss... Greg Farnum
05:43 PM Bug #6623 (Resolved): mds: update backtraces on existing clusters
The backtrace code doesn't update existing clusters as it touches them, unless the paths actually change.
Zheng fi...
Greg Farnum

10/22/2013

05:10 PM Bug #6613 (Closed): samba is crashing in teuthology
At the end of the run:... Greg Farnum
05:03 PM Bug #6608: samba teuthology dbench failure
/a/teuthology-2013-10-21_19:01:05-fs-dumpling-testing-basic-plana/63428/ Greg Farnum
10:39 AM Bug #6599 (Pending Backport): client: invalid iterator dereference in Client::trim_caps
Sage Weil
04:17 AM Bug #6609: teuthology rsync workunit failure
both tests only sent directory share/doc (but didn't sent files in share/doc) when rsync was executed for the second ... Zheng Yan

10/21/2013

10:33 PM Bug #5411: teuthology: bad object dereference
Still seeing this sometimes, for the record: /a/teuthology-2013-10-20_19:01:21-fs-dumpling-testing-basic-plana/61470/ Greg Farnum
10:31 PM Bug #6608: samba teuthology dbench failure
/a/teuthology-2013-10-20_19:01:21-fs-dumpling-testing-basic-plana/61466/ Greg Farnum
10:24 PM Bug #6608: samba teuthology dbench failure
/a/teuthology-2013-10-20_02:13:10-fs-next-testing-basic-plana/60931/ Greg Farnum
10:05 PM Bug #6608 (Can't reproduce): samba teuthology dbench failure
... Greg Farnum
10:28 PM Bug #6599: client: invalid iterator dereference in Client::trim_caps
Test suite passed on this branch; just some spurious valgrind stuff. Greg Farnum
10:22 PM Bug #6609: teuthology rsync workunit failure
/a/teuthology-2013-10-20_02:13:31-kcephfs-next-testing-basic-plana/60994
/a/teuthology-2013-10-20_02:13:10-fs-next-t...
Greg Farnum
10:19 PM Bug #6609 (Can't reproduce): teuthology rsync workunit failure
... Greg Farnum
01:24 AM Bug #6450 (Closed): Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
Zheng Yan
01:17 AM Bug #6450: Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
We have separated Ceph and OpenStack hosts, upgraded the Ceph hosts to 3.12-rc5 and so far things seem to hold up...
...
Jens-Christian Fischer
12:24 AM Bug #4714 (Duplicate): kclient: ceph_sync_{read,write} only accept single buffer.
dup #2217 Zheng Yan
12:23 AM Bug #2217 (Resolved): sync and O_DIRECT writes only write first extent in iov vector
by commit 53d028160f (ceph: implement readv/preadv for sync operation) and commit 2f0a7a1808 (ceph: Implement writev/... Zheng Yan

10/20/2013

02:37 AM Bug #6599 (Resolved): client: invalid iterator dereference in Client::trim_caps
#0 0x00000034bfc0eebb in raise () from /lib64/libpthread.so.0
#1 0x000000000067c2c9 in reraise_fatal (signum=6) at...
Zheng Yan

10/16/2013

05:48 PM Bug #6450: Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
Jens-Christian Fischer wrote:
> we have now upgraded the complete cluster to Dumpling (0.67.3) (also due to other pr...
Sage Weil
04:25 PM Bug #6279 (Resolved): creating a new fs on pools from an old fs can lead to lost MDS Tables
Oh, this is already backported, commit:bd073eeac28d8cba969e5746c5e6adcb95820fdf Greg Farnum
04:23 PM Bug #4405 (Resolved): MDCache::populate_mydir can loop forever
Cherry-picked to dumpling branch in commit:299ddd31b29e332dc5e76bc4f871e4769698665d Greg Farnum
03:47 PM Fix #4708: MDS: journaler pre-zeroing is dangerous
Like Sage said, blacklisting. :)
It's been a while but I think the scenario I envisioned here is one in which the or...
Greg Farnum
02:58 PM Fix #4708: MDS: journaler pre-zeroing is dangerous
https://github.com/ceph/ceph/pull/733 Zheng Yan
09:55 AM Fix #4708: MDS: journaler pre-zeroing is dangerous
Possibly related to #6548. Greg Farnum
09:54 AM Bug #6458: journaler: journal too short during replay
Hmmm, definitely possible. That ticket isn't exactly a certain diagnosis either, though. :( Greg Farnum
07:13 AM Bug #6458: journaler: journal too short during replay
maybe this is the same as #4708. Two MDS (one mds is supposed to be dead, but it's not ) modified the log at the same... Zheng Yan

10/15/2013

07:07 AM Bug #3738: kclient fsx truncate/write multi-client race
by commit 755581977c2bc9eb81c9d9d955024cbedded2161 Zheng Yan
07:01 AM Bug #1117: mds: rename rollback broken on slaves during replay
by commit 844cd46c77274ee7726ded8bf0d83e7f586da00e Zheng Yan

10/14/2013

08:24 PM Bug #5025: samba smbtorture lock test fails on kclient
in commit:f5685013a4ff4f569413945f380dd48b6cdbfaad Greg Farnum
07:01 PM Bug #5025 (Resolved): samba smbtorture lock test fails on kclient
Zheng Yan
09:41 AM Bug #6396 (Resolved): mds: recovery hits assert(!segment.empty()) when reissuing caps
Backported to dumpling as commit:cd1c3c9e00e90b19e83c1f11a48e516a7de93665 Greg Farnum
09:27 AM Bug #1117: mds: rename rollback broken on slaves during replay
Which patch[es]? Greg Farnum
07:29 AM Bug #1117 (Resolved): mds: rename rollback broken on slaves during replay
Zheng Yan
09:25 AM Bug #3738: kclient fsx truncate/write multi-client race
Is this a side effect of 3c3b2ceb03e7294704f5bf3e1e420012a0166585, or some other patch? Please reference them when cl... Greg Farnum
07:24 AM Bug #3738 (Resolved): kclient fsx truncate/write multi-client race
now mds revokes Fw before doing truncate Zheng Yan
07:40 AM Bug #2385 (Can't reproduce): max mds = 2, mds hang and crash
Zheng Yan
07:34 AM Bug #5036 (Resolved): `ls` hangs on random folder
Zheng Yan
07:34 AM Bug #2019 (Resolved): mds: CInode::filelock stuck in sync->mix
Zheng Yan
07:31 AM Bug #3088 (Resolved): NULL pointer dereference at ceph_d_prune
Zheng Yan
07:28 AM Bug #429 (Resolved): mds: fix rstat propogation into past parents
haven't seen any rstat error for months Zheng Yan
07:25 AM Bug #3681 (Resolved): kclient fsx fails nightly
Zheng Yan

10/11/2013

09:10 AM Feature #6511 (Rejected): MDS: add special purging options for testing
We do a lot of stuff async with our journal, stray inodes, etc, that we need a good way to test. Let this ticket serv... Greg Farnum
07:33 AM Bug #4405 (Pending Backport): MDCache::populate_mydir can loop forever
Sage Weil

10/10/2013

10:42 PM Bug #6396 (Pending Backport): mds: recovery hits assert(!segment.empty()) when reissuing caps
This seems to be showing up in our nightlies a fair bit still; it should be backported. Greg Farnum
01:18 PM Bug #4405 (Fix Under Review): MDCache::populate_mydir can loop forever
He's got a patch in wip-4405 that he thinks would solve the loading problem; I think he might be right. The other iss... Greg Farnum
11:05 AM Bug #6460: ceph-fuse: xlist crash in ~Inode on osd_op_reply
In general I'm not usually too worried about FS backports since anybody using the FS should be keeping up to date wit... Greg Farnum
04:27 AM Bug #6460 (Resolved): ceph-fuse: xlist crash in ~Inode on osd_op_reply
Sage Weil

10/09/2013

08:16 PM Bug #6460 (Pending Backport): ceph-fuse: xlist crash in ~Inode on osd_op_reply
Probably already fixed by https://github.com/ceph/ceph/pull/590. Do we need to backport MDS fixes to old version? Zheng Yan
02:02 PM Bug #4832: mds: failed auth_unpin assert
I've backported these two patches to the cuttlefish branch as well.
(Plus a cuttlefish-4832 branch on top of v0.61.8...
Greg Farnum

10/04/2013

08:41 AM Bug #5290 (Can't reproduce): mds: crash whilst trying to reconnect
Sage Weil
07:24 AM Bug #5290: mds: crash whilst trying to reconnect
I certainly haven't been hit by this again, so if you consider it resolved... Damien Churchill

10/03/2013

07:50 PM Bug #5250: ceph-mds 0.61.2 aborts on start
I just updated to 0.67.3 and I want to confirm that I still have to recompile to get around the abort.
0> 201...
Jérôme Poulin
06:11 PM Bug #6473: multimds + ceph-fuse: fsstress gets ENOTEMPTY on final rm -r
ubuntu@teuthology:/a/sage-2013-10-03_17:25:57-marginal:multimds-master-testing-basic-/32499 Sage Weil
06:11 PM Bug #6473 (Can't reproduce): multimds + ceph-fuse: fsstress gets ENOTEMPTY on final rm -r
... Sage Weil
01:34 PM Bug #5025: samba smbtorture lock test fails on kclient
http://qa-proxy.ceph.com/teuthology/teuthology-2013-10-02_23:01:08-fs-master-testing-basic-plana/31442/ Sage Weil
01:33 PM Bug #5025: samba smbtorture lock test fails on kclient
oops, still fails on kclient. Sage Weil
01:33 PM Bug #2825 (Resolved): File lock doesn't work properly
Sage Weil

10/02/2013

10:21 PM Bug #5753: ceph-fuse: segfault when getting back a traceless rename op
and /a/teuthology-2013-10-01_23:01:09-fs-next-testing-basic-plana/29640 Greg Farnum
01:00 PM Bug #5753: ceph-fuse: segfault when getting back a traceless rename op
Showed up again, /a/teuthology-2013-09-27_19:01:27-fs-dumpling-testing-basic-plana/21601 Greg Farnum
09:53 PM Bug #6458 (New): journaler: journal too short during replay
Wow, that explanation of what was going on was so very wrong. Now I'm just not sure how this could have occurred. Greg Farnum
06:27 PM Bug #6458: journaler: journal too short during replay
Made a pull request too: https://github.com/ceph/ceph/pull/683 Greg Farnum
06:26 PM Bug #6458: journaler: journal too short during replay
Yep, it ran pjd successfully; plenty of journal commits there! Greg Farnum
06:19 PM Bug #6458 (Fix Under Review): journaler: journal too short during replay
Pushed a patch to wip-journaler-safety, commit:a0ba5c66162af720627fcf7ba63fdc76ac97f568. I'm setting up a basic funct... Greg Farnum
05:58 PM Bug #6458 (In Progress): journaler: journal too short during replay
This is a bit more complicated than we described — we do not in fact blindly write the write_pos to our head object; ... Greg Farnum
11:19 AM Bug #6458 (New): journaler: journal too short during replay
Urgh, that last comment was mistaken. Greg Farnum
10:56 AM Bug #6458 (Rejected): journaler: journal too short during replay
That is not what happened; the underlying objects were inconsistent in RADOS. Greg Farnum
10:07 AM Bug #6458 (Can't reproduce): journaler: journal too short during replay
Got a report on irc from a user whose log was 611 bytes shorter than the header indicated it should be. His guess was... Greg Farnum
02:44 PM Bug #1596 (Can't reproduce): mds crash during ffsb on kernel client in CInode::is_frozen
Sage Weil
02:44 PM Bug #1601 (Can't reproduce): mds crash during snaps workunit
Sage Weil
02:44 PM Bug #1752 (Can't reproduce): ceph-fuse isn't releasing caps without flushing data?
Sage Weil
02:43 PM Bug #3601: client: With multiple clients, file remove doesn't free up space
repushing to wip-fuse for testing Sage Weil
02:38 PM Bug #5025 (Resolved): samba smbtorture lock test fails on kclient
lock test was fixed a while ago, commit:476e4902907dfadb3709ba820453299ececf990b test is reenabled in the suite. Sage Weil
12:54 PM Bug #6460 (Resolved): ceph-fuse: xlist crash in ~Inode on osd_op_reply
... Greg Farnum
12:18 PM Bug #3681: kclient fsx fails nightly
added fsx back into the kcephfs test suite. reportedly fsx now passes, but we should verify before closing this bug. Sage Weil
12:13 PM Bug #5037 (Can't reproduce): Ceph-MDS asserts after upgrade 0.56.2 -> 0.56.6
Sage Weil
12:12 PM Bug #5162 (Can't reproduce): File is locked unexpected and not released anymore
Sage Weil
12:07 PM Bug #5033 (Can't reproduce): oops in ceph_put_wrbuffer_cap_refs
Sage Weil
12:07 PM Bug #5290: mds: crash whilst trying to reconnect
i'm inclined to call this can't reproduce. there was a locking fix recently that covered the session_map too that co... Sage Weil
12:05 PM Bug #5418 (Resolved): kceph: crash in remove_session_caps
Sage Weil
11:44 AM Feature #6332 (Resolved): mds: add config option disabling snapshots by default
This got merged to master; I'm going to call that done since we're releasing Emperor soon-ish. Greg Farnum
02:34 AM Bug #6450: Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
we have now upgraded the complete cluster to Dumpling (0.67.3) (also due to other problems we have experienced in the... Jens-Christian Fischer

10/01/2013

09:11 PM Bug #6396 (Resolved): mds: recovery hits assert(!segment.empty()) when reissuing caps
yeah looks good! Sage Weil
09:07 PM Bug #6396: mds: recovery hits assert(!segment.empty()) when reissuing caps
Zheng Yan
09:07 PM Bug #6396: mds: recovery hits assert(!segment.empty()) when reissuing caps
By reading the mds log for #5458, I think these issues should be mostly fixed by commit b144170544(mds: properly retu... Zheng Yan
09:09 PM Bug #6349 (Duplicate): MDS: failed assert !segments.empty() while rejoining after being standby-r...
dup #6396 Zheng Yan
09:08 PM Bug #5458 (Duplicate): mds: standby-replay -> replay takeover does not handle racing expire/trim
dup #6396 Zheng Yan
06:47 AM Bug #6450: Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
Jens-Christian Fischer wrote:
> Can I run a 0.67.3 MDS with the rest of the infrastructure on 0.61.8?
yes, you ca...
Zheng Yan
05:34 AM Bug #6450: Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
Can I run a 0.67.3 MDS with the rest of the infrastructure on 0.61.8?
We are using the rc1 kernels in order to run...
Jens-Christian Fischer
05:28 AM Bug #6450 (Need More Info): Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
The first warning was caused by a MDS bug. (you can try upgrading MDS 0.67.3 ) The rest BUGs did not look like ceph r... Zheng Yan
04:57 AM Bug #6450 (Closed): Kernel bugs in 3.12-rc1, taking 2 hosts (and 1 following) down
We are running 10 hosts with 74 OSDs on Ubuntu 13.04, Ceph 0.61.8 and Kernel 3.12-rc1
root@h5:~# ceph --version
...
Jens-Christian Fischer
 

Also available in: Atom