Project

General

Profile

Activity

From 10/10/2010 to 11/08/2010

11/08/2010

09:32 AM Bug #561: snaptest-2 doesn't execute properly
Okay, looks like this may be an issue with the test rather than Ceph. I just copied it into the root of the ceph moun... Greg Farnum
09:07 AM Bug #561 (Resolved): snaptest-2 doesn't execute properly
Checked it on cfuse and kclient:... Greg Farnum
09:20 AM Cleanup #427 (Rejected): mds: tie scatter pins directly to freeze machinery
no more scatterpins, yay! Sage Weil
07:39 AM Feature #560 (Resolved): mds: alternate directory hashing
Currently dentries are hashed among dirfrags using the linux dcache's hash function, which is pretty trivial. The pr... Sage Weil

11/07/2010

10:08 AM Feature #495 (Resolved): mds: add MIX_STALE
merged in commit:0b1909209800229f5098cdc848fc3901508c1e19. best part of this is MIX_STALE went away. yay! Sage Weil
08:11 AM Bug #556 (Resolved): clustered mds: rename
various hangs with thrash-exports and pjd rename tests. Sage Weil

11/05/2010

01:55 PM Feature #495: mds: add MIX_STALE
Sage Weil
01:02 PM Bug #551 (Can't reproduce): cfuse crash on quick mds restart
Program terminated with signal 11, Segmentation fault.
#0 0x00000000004704ad in Client::kick_flushing_caps (this=0x...
Greg Farnum
10:14 AM Feature #91: mds: up:shadow mode
Update the journaler interface to allow the MDS to 'tail' the journal... periodically check to see if it's been exten... Sage Weil
10:10 AM Feature #548 (Resolved): mds: shadowreplay one-shot mode
Make sure the current mechanism still works. Clean it up if needed. Sage Weil
09:19 AM Subtask #547 (Resolved): mds: define fsck strategy, required metadata
Sage Weil
09:19 AM Feature #340 (Closed): large directories, directory fragmenting
Sage Weil
09:19 AM Feature #519 (Closed): mds: dirfrag merge
Sage Weil

11/04/2010

09:56 AM Feature #545 (Resolved): mds: use bloom filter to supplement dirfrag COMPLETE flag
Currently we need the complete flag (or a cached negative dentry) to conclude a name does not exist in a frag before ... Sage Weil

11/03/2010

11:27 PM Bug #478 (Can't reproduce): MDS crash: LogEvent::decode()
Sage Weil
11:27 PM Bug #478: MDS crash: LogEvent::decode()
From the mds dump in the debugpacks, it looks like there were MDS daemons on two different nodes. I'm inclined to ch... Sage Weil
10:49 PM Bug #542 (Resolved): mds journal corruption
Sage Weil
10:49 PM Bug #542: mds journal corruption
commit:1c934ebd6ff3a3a7000671821a12e83c609f1e27 Sage Weil
10:24 PM Bug #542: mds journal corruption
Mystery solved.. this was actually a takeover:
- where the old mds was blacklisted
- new mds probed and read jour...
Sage Weil
09:38 PM Bug #542 (Resolved): mds journal corruption
I saw this on the playground.
THe last bit of the replay log:...
Sage Weil
09:12 PM Bug #472 (Resolved): mds: fragstat crash
Sage Weil
04:40 PM Feature #495 (Resolved): mds: add MIX_STALE
commit:fc9059e5270380c3266f7f958da6a8cc9b042f22 Sage Weil
04:05 PM Feature #495: mds: add MIX_STALE
Sage has been working on this today. Greg Farnum
03:42 PM Feature #541 (Resolved): mds: tempsync
Integrate this into the filelock state machine, and then use it when appropriate (namely, unscatter) Sage Weil

11/02/2010

10:18 AM Bug #529 (Resolved): Cfuse: Software caused connection abort
There were a sequence of commits in this, some of which were one step forward and two steps back. The testing branch ... Greg Farnum
05:51 AM Bug #529: Cfuse: Software caused connection abort
I was going to apply the patch to my version but I noted that my src/client/Client.h line 516 already says "truncate_... Ed Burnette

11/01/2010

10:36 PM Bug #451 (Can't reproduce): mds: replay error
Sage Weil
10:35 PM Bug #523: cfuse locks don't wake on mds reconnect?
This might be the same issue as #535 (which looks to me like it's waiting on tcp_read/poll?). Sage Weil
10:33 PM Bug #529: Cfuse: Software caused connection abort
Hey Greg, this looks like client truncation stuff again. This was biting me today, almost immediately. These two pa... Sage Weil
07:40 AM Bug #529 (Resolved): Cfuse: Software caused connection abort
After using ceph for a few minutes it gets into a state where I can no longer access the cfuse mount point. It also s... Ed Burnette

10/29/2010

04:39 PM Feature #340: large directories, directory fragmenting
split and merge rewritten and working. now for the stress testing. Sage Weil
02:42 PM Bug #520: mds: change ifile state mix->sync on (many) lookups?
Nothing wrong on the client.. it's just that the mds has / (a subtree root) in the MIX state, and file_eval doesn't d... Sage Weil
01:02 PM Bug #360 (Resolved): mds: head/snapped snap_cap linkage may cross mdss
For now, let's just auth_pin(). resolved by commit:440cc43956f367e6c8fb1077c83693ff568c9d2c Sage Weil
12:38 PM Bug #525 (Closed): Audit CInode creation code for initialization
Looks all good now! Greg Farnum
09:16 AM Bug #525 (Closed): Audit CInode creation code for initialization
Specifically, it seems there are some times when truncate_size (and truncate_seq?) aren't geting set. Check if there ... Greg Farnum
11:31 AM Bug #329 (Resolved): mds: mislinked dentry found during journal replay
The multi-mds fix has been pushed to mds_journal branch commit:aa83e11c67165878e1ca1b0fe66ff9b8c3a906c8. Then merged ... Greg Farnum

10/28/2010

02:00 PM Bug #523 (Can't reproduce): cfuse locks don't wake on mds reconnect?
Don't know the exact cause, but I was running clustered mds tests using snaptest-2.sh and once the MDSes had failed a... Greg Farnum

10/26/2010

07:30 PM Bug #520 (Closed): mds: change ifile state mix->sync on (many) lookups?
I'm seeing this on csyn --syn makefiles 1000 1 0... Sage Weil

10/25/2010

09:08 PM Feature #519 (Closed): mds: dirfrag merge
Sage Weil

10/22/2010

04:44 PM Feature #340: large directories, directory fragmenting
We still need to add a wrlock of the dirfragtreelock.
Sage Weil
04:23 PM Cleanup #514 (Rejected): Optimize MIX/MIX_STALE reconnects, etc
Right now the MDS puts locks into the MIX_STALE state whenever it loads from disk. This is safe but unnecessary. Fix! Greg Farnum
04:11 PM Feature #495 (In Progress): mds: add MIX_STALE
A first pass is done and pushed to the mix_stable branch. Testing and debugging now, but that may take a while. Greg Farnum

10/18/2010

08:37 AM Bug #500 (Closed): mds: FAILED assert("shouldn't be called if we are already xlockable" == 0)
nevermind, old code. Sage Weil
08:36 AM Bug #500 (Closed): mds: FAILED assert("shouldn't be called if we are already xlockable" == 0)
... Sage Weil

10/15/2010

11:15 AM Cleanup #493 (Rejected): mds: allow scatter_pinned inode to go from mix -> sync
We're going to skill scatter_pins instead, see #495 Sage Weil
11:13 AM Feature #495 (Resolved): mds: add MIX_STALE
... Sage Weil

10/14/2010

09:48 PM Bug #329: mds: mislinked dentry found during journal replay
I suspect the solution (for the clustered case) is something like:
- trim_non_auth and a _subtree_ when we replay...
Sage Weil
09:39 PM Bug #329: mds: mislinked dentry found during journal replay
This can come up with multiple MDSs. (Wido saw it with one MDS; not sure how that happened.)
With multiple MDSs, ...
Sage Weil
03:07 PM Cleanup #493 (Rejected): mds: allow scatter_pinned inode to go from mix -> sync
Sage Weil

10/13/2010

08:55 AM Feature #483 (Resolved): mds: add timestamp to LogEvent
Would be nice if every log even had an mtime associated with it. Sage Weil
01:55 AM Bug #478 (Can't reproduce): MDS crash: LogEvent::decode()
On both my MDS'es I'm seeing the following crash:... Wido den Hollander

10/11/2010

05:50 PM Bug #472: mds: fragstat crash
Well, this seems to have gotten rid of the first assert issue -- and made pjd last a bit longer -- and it's a bit mor... Greg Farnum
04:50 PM Bug #472: mds: fragstat crash
let's try... Sage Weil
04:32 PM Bug #472: mds: fragstat crash
Applied patch you gave me. Got new crash:
#0 0x0000000000000000 in ?? ()
#1 0x0000000000a1e317 in sigabrt_handler...
Greg Farnum
09:51 AM Bug #472: mds: fragstat crash
Similarly:
#0 0x0000000000000000 in ?? ()
#1 0x0000000000a1e2e7 in sigabrt_handler (signum=6) at config.cc:238
#...
Greg Farnum
 

Also available in: Atom