Activity

From 12/11/2012 to 01/09/2013

01/09/2013

02:58 PM Bug #3773 (Can't reproduce): mds crashed at LogEvent::decode
ceph version: 0.56.1 (e4a541624df62ef353e754391cbbb707f54b16f7)
I had a cluster [burnupi06, burnupi07, burnupi08] ...
Tamilarasi muthamizhan
12:05 PM Feature #3570 (In Progress): teuthology: mds thrasher
Sam Lang
10:58 AM Bug #3681: kclient fsx fails nightly
Proposed fix to set i_size before the setattr request:
This will resolve the above issue, because the cap flush on...
Sam Lang

01/08/2013

04:29 PM Bug #3597: ceph-fuse: denying root access
Is root actually a member of the fuse group? If not that would be correct behavior. Greg Farnum
12:04 PM Feature #626 (Closed): qa: add IOR, rompio, or other parallel workloads suite
Added tests to the _marginal_ qa suite that run IOR, mdtest, and fsx-mpi. Sam Lang
09:39 AM Feature #3543: mds: new encoding
I'm going to get started on this (mostly just figuring out current state, probably) today. Greg Farnum

01/07/2013

04:04 PM Feature #3749 (Resolved): Remove forced synchronization from Java bindings
Remove "synchronized" keyword from native interface. This was originally added when we were seeing some pthread mutex... Noah Watkins
03:26 PM Bug #3746 (Rejected): kclient mmap doesn't zero past EOF
Error coming from fsx:
INFO:teuthology.orchestra.run.out:Mapped Write: non-zero data past EOF (0xb826) page offset...
Sam Lang
12:19 PM Cleanup #3742 (Resolved): Remove old Hadoop wrappers and configuration options
I think it's likely that the current Hadoop shim is at least at feature parity with the old wrappers. Noah Watkins
10:02 AM Bug #3726 (Resolved): Enforce Ceph's minimum stripe size in the java bindings
Noah Watkins
10:02 AM Bug #3726 (Closed): Enforce Ceph's minimum stripe size in the java bindings
Noah Watkins
09:21 AM Bug #3738 (Resolved): kclient fsx truncate/write multi-client race

This bug is similar to #3681, but occurs only in the non-exclusive case (multiple clients), where a truncate doesn'...
Sam Lang
09:09 AM Bug #3681: kclient fsx fails nightly
The race here is between a truncate down, and completion of osd write ops triggering a cap flush. The exact order th... Sam Lang

01/04/2013

07:54 PM Bug #3666 (Resolved): Segfault running test_libcephfs
commit:3a9408742a8a6cbc870cba543a208285f1a6cec1 Sage Weil
03:25 PM Bug #3666: Segfault running test_libcephfs
I pushed a new wip-client-shutdown. This switches the clean-up order of client/messenger in libcephfs, rather than mo... Noah Watkins
01:36 PM Bug #3666: Segfault running test_libcephfs
Right, I think your fix will work, but it breaks the interface abstraction (messenger is created above the client, de... Sam Lang
01:16 PM Bug #3666: Segfault running test_libcephfs
This is what I'm running to reproduce the error. It's been running now for an hour on wip-client-shutdown without any... Noah Watkins
12:57 PM Bug #3666: Segfault running test_libcephfs
Rather than moving messenger shutdown into client shutdown? Noah Watkins
12:48 PM Bug #3666: Segfault running test_libcephfs
A similar issue was just handled in the ceph_fuse.cc code. There we just delay deleting the client till the end. Yo... Sam Lang
10:41 AM Bug #3666: Segfault running test_libcephfs
During unmount, the client is shut down and freed before the messenger. If any messages are delivered after the clien... Noah Watkins
03:29 PM Feature #3730 (Closed): Support replication factor in Hadoop
In order to support per-file replication values in Hadoop we need to specify that a new file should be generated in a... Noah Watkins
01:54 PM Bug #3726: Enforce Ceph's minimum stripe size in the java bindings
Also, name it something along the lines of get_stripe_granularity() and not .._min(imum)_ as that isn't entirely accu... Anonymous
01:40 PM Bug #3726: Enforce Ceph's minimum stripe size in the java bindings
After a discussion on jabber, the decision is to go with exposing a function call in libcephfs and then using that in... Anonymous
11:09 AM Bug #3726 (Resolved): Enforce Ceph's minimum stripe size in the java bindings
The Hadoop bindings are using the blocksize as the stripe size. If a block size is explicitly passed down, it ends up... Anonymous
01:00 PM Bug #3718: multi-client dbench gets stuck over NFS exported cephfs
Heads up, Zheng Yan's patches on the mds fix issues related to running multiclient dbench tests. Sam Lang
12:24 PM Feature #3626: mds: debug mode to generate traceless replies to clients
Hmm, okay. I wasn't real clear on the previous bugs so I'll need to look at it more if I end up taking this, but soun... Greg Farnum
11:46 AM Feature #3626: mds: debug mode to generate traceless replies to clients
Greg Farnum wrote:
> Hurray, it is. Nobody except the client looks at the trace_bl and setting that is the only thin...
Sage Weil
11:35 AM Feature #3626: mds: debug mode to generate traceless replies to clients
Hurray, it is. Nobody except the client looks at the trace_bl and setting that is the only thing set_trace() does. Ex... Greg Farnum
11:17 AM Feature #3626: mds: debug mode to generate traceless replies to clients
Greg Farnum wrote:
> Am I reading it correctly that this is just going to be doing the config and wrapper work to no...
Sage Weil
09:01 AM Feature #3626: mds: debug mode to generate traceless replies to clients
Am I reading it correctly that this is just going to be doing the config and wrapper work to not call set_trace() in ... Greg Farnum
12:20 PM Feature #3543: mds: new encoding
Sage Weil
12:20 PM Feature #3728: mds: draft design for lookup by ino
Sage Weil
12:14 PM Feature #3728 (Resolved): mds: draft design for lookup by ino
Sage Weil
12:20 PM Feature #3570: teuthology: mds thrasher
Sage Weil
12:06 PM Feature #3727 (Resolved): mds: refactor EMetablob encoding paths
Right now, the EMetaBlob sub-structures — for performance reasons — use an encoding pattern that doesn't match anythi... Sage Weil
11:42 AM Cleanup #89: mds: put inode dirty fields in dirty_bits_t to reduce memory footprint
Greg Farnum wrote:
> I briefly scanned the CInode and inode_t structs and it wasn't obvious to me what this should e...
Sage Weil
09:34 AM Cleanup #89: mds: put inode dirty fields in dirty_bits_t to reduce memory footprint
I briefly scanned the CInode and inode_t structs and it wasn't obvious to me what this should encompass. Are you talk... Greg Farnum
11:41 AM Subtask #547: mds: define fsck strategy, required metadata
This was a whiteboard discussion 2 years ago. Nothing was written down. We should reopen new and more detailed issu... Sage Weil
09:29 AM Subtask #547: mds: define fsck strategy, required metadata
Where are the results of this bug? It's marked resolved but I don't see any fsck references in the git tree, and ther... Greg Farnum
11:38 AM Cleanup #3677: libcephfs, mds: test creation/addition of data pools, create policy
Greg Farnum wrote:
> Do we have a separate bug for the library calls this needs?
#685, which would take the clien...
Sage Weil
09:27 AM Cleanup #3677: libcephfs, mds: test creation/addition of data pools, create policy
Do we have a separate bug for the library calls this needs? Greg Farnum
11:36 AM Feature #3244: qa: integrate Ganesha into teuthology testing to regularly exercise Ganesha CephFS...
Greg Farnum wrote:
> And for this one as well: setting up Ganesha in teuthology, run tests against it? Not using the...
Sage Weil
09:24 AM Feature #3244: qa: integrate Ganesha into teuthology testing to regularly exercise Ganesha CephFS...
And for this one as well: setting up Ganesha in teuthology, run tests against it? Not using the Ceph shim or anything... Greg Farnum
11:35 AM Feature #3243: qa: test samba reexport via libcephfs vfs plugin in teuthology
Greg Farnum wrote:
> Is this a matter of setting up (via teuthology) a Samba server which sits on top of a Ceph moun...
Sage Weil
09:24 AM Feature #3243: qa: test samba reexport via libcephfs vfs plugin in teuthology
Is this a matter of setting up (via teuthology) a Samba server which sits on top of a Ceph mount and then running tes... Greg Farnum
11:34 AM Feature #3426: ceph-fuse: build/run on os x
Greg Farnum wrote:
> Noah has done some work on this in the wip-osx branch; last I heard you could compile and get a...
Sage Weil
09:22 AM Feature #3426: ceph-fuse: build/run on os x
Noah has done some work on this in the wip-osx branch; last I heard you could compile and get a cluster going with vs... Greg Farnum
11:32 AM Feature #3542: mds: migration path for existing anchors, anchortables, etc.
Greg Farnum wrote:
> What all does this encompass? Design? Implementation? Does it need to be an online switch or ca...
Sage Weil
09:13 AM Feature #3542: mds: migration path for existing anchors, anchortables, etc.
What all does this encompass? Design? Implementation? Does it need to be an online switch or can it be an offline job? Greg Farnum
11:30 AM Feature #3541: mds: robust ino lookup using file backpointers
Greg Farnum wrote:
> Is this bug supposed to encompass the anchor table replacement work as well? I wouldn't expect ...
Sage Weil
09:12 AM Feature #3541: mds: robust ino lookup using file backpointers
Is this bug supposed to encompass the anchor table replacement work as well? I wouldn't expect so, but the presence o... Greg Farnum
11:23 AM Feature #3540: mds: maintain per-file backpointers on first file object
Greg Farnum wrote:
> Do we have any kind of design for this? We've talked about it some and it's conceptually simple...
Sage Weil
09:08 AM Feature #3540: mds: maintain per-file backpointers on first file object
Do we have any kind of design for this? We've talked about it some and it's conceptually simple, but splitting up the... Greg Farnum
11:15 AM Feature #626 (In Progress): qa: add IOR, rompio, or other parallel workloads suite
Yeah, that's what slang's working on to enable this. Assigning this to him. Sage Weil
08:57 AM Feature #626: qa: add IOR, rompio, or other parallel workloads suite
SamL has done some work on getting MPI going under teuthology, and on running some multi-client FS tests. I'm not sur... Greg Farnum
11:13 AM Feature #3621 (Resolved): qa: add knfsd reexport tests to qa suite
Sage Weil
09:43 AM Feature #3399: java: add accessor to Ceph version numbers
Oh, those are librados specific numbers, aren't they. So this bug is to create and expose a libceph version, then. Wh... Greg Farnum
09:35 AM Feature #3399: java: add accessor to Ceph version numbers
In libcephfs there is a call to get Ceph version (yes, just expose this). But, I recall Sage mentioning that it might... Noah Watkins
09:19 AM Feature #3399: java: add accessor to Ceph version numbers
This is just exposing the librados version() function to Java, right? Greg Farnum
09:41 AM Cleanup #660: mds: use helpers in mknod, mkdir, openc paths
What kind of helpers are you talking about with this? inode fetchers and lock grabbers? In a quick scan over handle_c... Greg Farnum
09:36 AM Feature #603: mds: repair directory hierarchy
This is part of #82 fsck, right? Do we have a more detailed algorithm anywhere? Greg Farnum

01/03/2013

01:59 PM Bug #3597: ceph-fuse: denying root access
I believe that we can reproduce this error. We are running Ubuntu 12.04 LTS Server on both the client and on the Cep... Graham Hemingway
12:56 PM Bug #3719 (Can't reproduce): pjd test 145 failed in the nightly runs
logs: ubuntu@teuthology:/a/teuthology-2013-01-02_19:00:03-regression-next-testing-basic/33621... Tamilarasi muthamizhan
12:48 PM Bug #3718 (Rejected): multi-client dbench gets stuck over NFS exported cephfs
When running qa/workunit dbench.sh the dbench 1 passes, but the dbench 10 gets hung up.
We should check this with ...
David Zafman
12:28 PM Feature #3621 (In Progress): qa: add knfsd reexport tests to qa suite
David Zafman
09:32 AM Bug #3681: kclient fsx fails nightly
It's most likely all the same bug, but fsx fails in different ways each time (always because of a truncate down). The... Sam Lang
09:27 AM Feature #3543: mds: new encoding
Right. About 80% complete; see wip-mds-encoding. Sage Weil
09:22 AM Feature #3543: mds: new encoding
What is this task? Switching to use our versioned encoding scheme? Greg Farnum

01/02/2013

09:45 AM Bug #3700: mds: FAILED assert(!item_session_list.is_on_list())
fixed by revert of bad fix, see commit:6711a4c4038dbdf843f9dfe42c7809c5c37ae534 Sage Weil
09:37 AM Bug #3700 (Resolved): mds: FAILED assert(!item_session_list.is_on_list())
Sage Weil

12/30/2012

06:08 PM Fix #3630: mds: broken closed connection cleanup
... Sage Weil
06:06 PM Fix #3630: mds: broken closed connection cleanup
The con re-use looks like this:
- client connects
- mds ms_verify_authorizer creates a new session
- msgr see ex...
Sage Weil
06:04 PM Bug #3696 (Resolved): mds: FAILED assert(session_map.count(s->inst.name) == 0)
See #3630; let's fix this properly. Sage Weil

12/29/2012

02:39 PM Bug #3700 (Resolved): mds: FAILED assert(!item_session_list.is_on_list())
logs: ubuntu@teuthology:/a/teuthology-2012-12-29_03:00:03-regression-master-testing-gcov/30039... Tamilarasi muthamizhan
02:32 PM Bug #3696: mds: FAILED assert(session_map.count(s->inst.name) == 0)
ubuntu@teuthology:/a/teuthology-2012-12-29_03:00:03-regression-master-testing-gcov/30036 Tamilarasi muthamizhan
09:43 AM Bug #3696: mds: FAILED assert(session_map.count(s->inst.name) == 0)
reverted the broken fix, reproducing the original problem again. Sage Weil

12/28/2012

09:11 PM Bug #3696: mds: FAILED assert(session_map.count(s->inst.name) == 0)
Sage Weil
06:42 PM Bug #3696 (Resolved): mds: FAILED assert(session_map.count(s->inst.name) == 0)
This occurred shortly after startup when trying to reproduce another bug on the master branch:... Josh Durgin
06:21 PM Fix #3630: mds: broken closed connection cleanup
Sage Weil

12/26/2012

09:59 AM Bug #3681 (Resolved): kclient fsx fails nightly
... Sage Weil
08:39 AM Feature #3679 (Closed): Any API to get metadata?
Yep! See libcephfs. There is... Sage Weil
01:08 AM Feature #3679 (Closed): Any API to get metadata?
Hello there.
I am wondering if there is any API to get the metadata of a file.
I have the ceph file system run by ...
lollipop king
01:10 AM Tasks #3680 (Rejected): deduplication in ceph
I am wondering how to do deduplication in ceph...the big problem is how to get the metadata of a file
and how to mod...
lollipop king

12/24/2012

02:58 PM Feature #1448 (In Progress): test hadoop on sepia
Sage Weil
02:58 PM Cleanup #814 (Resolved): hadoop: refactor hadoop shim in terms of java libceph bindings
Sage Weil

12/23/2012

09:12 PM Cleanup #3677 (Closed): libcephfs, mds: test creation/addition of data pools, create policy
The create data pool argument is tested only with the default pools. Once a lib is in place for the unit/functional... Sage Weil
09:06 PM Bug #3663 (Rejected): ceph kernel client is getting stuck on xstat* operations
No worries. Let us know if you do come across behavior that looks like a bug! Sage Weil
08:59 PM Bug #3663: ceph kernel client is getting stuck on xstat* operations
Hi Sage,
I am very sorry for taking your time with this issue, I feel like an idiot :(
The buggy client is runnin...
Roman Hlynovskiy

12/21/2012

02:39 PM Documentation #3672 (Resolved): doc: how to mount ceph-fuse from fstab
There's a new mount helper in bobtail for this. It contains these comments:... Josh Durgin
10:20 AM Bug #3666 (Resolved): Segfault running test_libcephfs
... Noah Watkins
08:36 AM Bug #3655 (Can't reproduce): client: hang in fsstress
I ran this test throughout the day yesterday and couldn't reproduce it, with message delays enabled. Marking as can'... Sam Lang
07:52 AM Bug #3663: ceph kernel client is getting stuck on xstat* operations
Hi Roman-
The logging levels are right, but in both mds logs neither mds was ever active; both were in the up:stan...
Sage Weil

12/20/2012

10:19 PM Bug #3663: ceph kernel client is getting stuck on xstat* operations
Hello Sage,
added 4 logs:
screen output from console of the laggy client. it ends up on 'jroger@pr02:~/data$ cp...
Roman Hlynovskiy
09:07 PM Bug #3663 (Need More Info): ceph kernel client is getting stuck on xstat* operations
Hmm. It's actually just saying it's the oldest client; it's not actually too old (yet). The looping connect attempts... Sage Weil
08:48 PM Bug #3663 (Rejected): ceph kernel client is getting stuck on xstat* operations
there are 2 kernel clients happily working with ceph. as soon as I try mounting ceph from the third client, it's gett... Roman Hlynovskiy

12/19/2012

11:19 PM Bug #3655 (Can't reproduce): client: hang in fsstress
fsstress stuck in _read_sync()
#0 pthread_cond_wait@@GLIBC_2.3.2 ()
at ../nptl/sysdeps/unix/sysv/linux/x86_6...
Sam Lang
04:03 PM Bug #3637: client: not issuing caps for with clients doing shared writes
Proposed fix in wip-3637. The client's max size request in MClientCaps gets dropped if the file lock is in a non-sta... Sam Lang
12:30 PM Bug #3625: client: EEXIST error on multiple clients to create
Pushed fixes to wip-3625 (ceph and ceph-client repos) that implement #3 (mds sends back the created flag in reply to ... Sam Lang
12:29 PM Bug #3625: client: EEXIST error on multiple clients to create
David and I have posted comments on github about the fix to allow multiple
clients opening the same file to get a va...
Sam Lang

12/18/2012

08:59 PM Bug #3637: client: not issuing caps for with clients doing shared writes
The hang occurs because a client requests a max size increase, but doesn't have write caps, so the mds puts it on the... Sam Lang
07:53 AM Bug #3637 (Resolved): client: not issuing caps for with clients doing shared writes
With 3 clients running ceph-fuse, running the ior command:
/tmp/cephtest/binary/usr/local/bin/ior -e -w -r -W -b 1...
Sam Lang
04:23 PM Feature #3645 (Resolved): Requesting the ability to rename CephFS snapshots inside the ".snap"-di...
I believe the ability to rename CephFS snapshots can come in handy in many cases. For example, if one wants to imple... Oliver Daudey
12:01 PM Bug #3639 (Duplicate): kclient: hit EOF prematurely

Moved to #3641
Sam Lang
10:56 AM Bug #3639 (Duplicate): kclient: hit EOF prematurely
Failures seen when running IOR on the kernel client:
WARNING: Task 1 requested transfer of 1048576 bytes,
...
Sam Lang
12:00 PM Bug #3641 (Resolved): kclient: hit EOF prematurely

Failures seen when running IOR on the kernel client:
WARNING: Task 1 requested transfer of 1048576 bytes,
...
Sam Lang
11:57 AM Bug #3640 (Duplicate): kclient: hang and kernel panic

Creating a placeholder for the following issue reported by Eric Renfro on the mailing list:
http://thread.gmane....
Sam Lang

12/16/2012

09:48 AM Fix #3630 (Resolved): mds: broken closed connection cleanup
Consider:
- client->mds REQUEST_CLOSE
- mds->client CLOSE
- client closes con
- mds see fault, goes to stan...
Sage Weil

12/14/2012

06:31 PM Bug #3625: client: EEXIST error on multiple clients to create
I made some commits to wip-3625, which resolve the EEXIST, but now the test returns an EIO... Sam Lang
03:31 PM Bug #3625 (Resolved): client: EEXIST error on multiple clients to create
Discovered with IOR shared file test on ceph-fuse, if multiple clients attempt to create a file at the same time (do ... Sam Lang
03:51 PM Feature #3626 (Resolved): mds: debug mode to generate traceless replies to clients
Sage Weil
11:13 AM Bug #3610 (Resolved): client: Possible lock cycle in client/objectcacher
Sam Lang
11:13 AM Bug #3610: client: Possible lock cycle in client/objectcacher
Merged wip-3610 to next. Sam Lang

12/13/2012

07:49 PM Bug #3610 (Fix Under Review): client: Possible lock cycle in client/objectcacher
This appears to be related to the fork() in the caps.cc test messing up data structures in the lockdep code when its ... Sam Lang
03:53 PM Feature #3568 (Resolved): client: Allow hold_caps_until to be configured
Sage Weil
03:49 PM Feature #3621 (Closed): qa: add knfsd reexport tests to qa suite
Sage Weil
03:46 PM Cleanup #3423 (Resolved): Install java libraries into the correct directory
Sage Weil
01:28 PM Bug #3559 (Resolved): mds: not issuing RDCACHE to exclusive client for some files
Thought more about it, and I think it's right. Committed something to master that describes the logic in a big comment. Sage Weil
10:11 AM Bug #3559 (In Progress): mds: not issuing RDCACHE to exclusive client for some files
After discussion this apparently needs a bit more thought. Greg Farnum
10:04 AM Bug #3559 (Resolved): mds: not issuing RDCACHE to exclusive client for some files
Sage Weil

12/12/2012

09:00 PM Bug #2288: libcephfs: setxattr returns EEXIST following removexattr
Ah, I wasn't aware of this bug. The commit you mentioned is 323a52ee909621ed0169b86e158370394ba36f62. It makes remo... Sam Lang
06:40 PM Bug #2288: libcephfs: setxattr returns EEXIST following removexattr
SamL did some stuff involving projected xattrs; was this problem included in that set of changes or is it more compli... Greg Farnum
06:38 PM Bug #3369 (Resolved): journaled two client session close events
Sage merged that workaround a long time ago, and I think Zheng's recent patches might have fixed some potential root ... Greg Farnum
09:20 AM Bug #3610 (Resolved): client: Possible lock cycle in client/objectcacher
Teuthology reported this lock cycle while running test_libcephfs. It was triggered by the Caps.ReadZero test.
For...
Sam Lang

12/11/2012

02:10 PM Bug #3370 (Resolved): All nfsd hung trying to lock page(s) on export of kclient ceph
commit: 2978257c56935878f8a756c6cb169b569e99bb91 David Zafman
09:28 AM Bug #3597 (Can't reproduce): ceph-fuse: denying root access
I don't see this behavior with fuse 2.9.0 and latest ceph. Does it happen only on some files? What are the permissi... Sam Lang
 
