Activity
From 12/06/2010 to 01/04/2011
01/04/2011
- 11:27 PM Revision f66d4728 (ceph): frag: const cleanup fragset_t
- Signed-off-by: Sage Weil <sage@newdream.net>
- 11:27 PM Revision 4c53c726 (ceph): mds: make get_dirfrags_under behave when dirfragtree is not coherent wi...
- This is (currently) the case during replay/resolve, although it's not
clear that it should be.
Signed-off-by: Sage W... - 11:27 PM Revision a5f9a36c (ceph): mds: make resolve adjust dir fragmentation as needed
- During resolve, adjust dir fragmentation as needed based on the subtrees
the sender explicitly claims. The given fra... - 11:27 PM Revision 3d04d96b (ceph): mds: force fragmentation for ambiguous imports as well
- Handle needed refragmentation for processing ambiguous bounds. That means
forcing the peers' subtree root fragmentat... - 11:01 PM Revision 1b0bb7a1 (ceph): osd, objecter: pgls filtering option
- 10:38 PM Revision fab61391 (ceph): PG: Fixes bug in _scrub with checking clones
- I introduced this bug in
4a4a1e53c7d380cd0b582c1d0685fd0ef4ef1711.
curclone++ not curclone--.
Signed-off-by: Samuel ... - 07:29 PM Revision 8f104243 (ceph): osd: set default pg_bits higher; pgp_bits to old value
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:50 PM Revision aafb79c7 (ceph): assert: print thread id in hex
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:27 PM Revision 4a4a1e53 (ceph): PG: Fix bug in scrub when checking clone sizes
- Previosly, _scrub checked:
assert(p->second.size == snapset.clone_size[curclone])
curclone was, however, an index in... - 06:20 PM Revision 3d4e1d99 (ceph): client: fix frag selection code
- Calling fragtree_t::contains() on a non-frag_t is nonsense and will crash.
And a fragtree is a complete partition of ... - 06:18 PM Revision 062c4cc8 (ceph): vstart.sh: specify keyring in ceph.conf
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:16 PM Revision d2578107 (ceph): remove ancient uofs.h
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:09 PM Revision 9befd1ea (ceph): mkcephfs: Clarified numosd message
- Signed-off-by: Matthew Roy <matthew@royhousehold.net>
Signed-off-by: Sage Weil <sage@newdream.net> - 03:10 PM CephFS Bug #594: mds: frag split/merge vs replay
- commit:3d04d96b5cd0ab9d94904d035347c00fe19f687a fixes things for resolve. rejoin still needs work.
- 02:47 PM Feature #609 (Resolved): osd: query pool/pg for objects with given xattr
- implemented it as a new PGLS_FILTER operation, but reuses the PGLS operation code. There's no class extension now, bu...
- 09:01 AM Support #680 (Closed): what's device_open,uofs_*
- uofs is an ancient implementation OBFS, a userspace object file system from UCSC. It should probably be removed from...
- 05:39 AM Support #680 (Closed): what's device_open,uofs_*
- what is uofs? device_open
- 05:24 AM Revision 78f35a64 (ceph): osd: send sufficient log to compensate for replicas with last_complate ...
- If a replica has last_complete < log.tail and no backlog, send enough log
for them to get back into a consistent stat... - 05:24 AM Revision ff035ab3 (ceph): osd: generate backlog if needed to get last_complete >= log.tail || bac...
- If primary or a replica has a mistrimmed pg log, we need to generate the
backlog during peering. This sucks, because... - 05:24 AM Revision 462cb841 (ceph): osd: clean up backlog generation checks a bit
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:24 AM Revision 88c445b1 (ceph): mds: start new log segment resolve start, not replay finish
- Signed-off-by: Sage Weil <sage@newdream.net>
- 05:24 AM Revision 6c73da0a (ceph): mds: assert no submit_entry during replay state
- We should never submit items to the journal during replay.
Signed-off-by: Sage Weil <sage@newdream.net> - 01:44 AM Revision 16c603b2 (ceph): common: Implement max open files
- In init-ceph, call ulimit -n if the user has set a maximum number of
open files, and the current maximum number of fi... - 12:23 AM Bug #665: mds(uo:replay),clent can not mount,return error 5 input/output
- how to locate error in logs?
mds.log is attached.
Sage Weil wrote:
> You can't mount until mds replay completes. ... - 12:14 AM Revision 5dc66244 (ceph): osd: Make g_conf.osd_max_notify_timeout a uint32_t
- Make g_conf.osd_max_notify_timeout a uint32_t. Squashes an annoying
compiler warning and avoids the awkward issue of ...
01/03/2011
- 11:15 PM Revision aad9ffec (ceph): Merge branch 'testing' into unstable
- 10:33 PM Revision b40e7dc0 (ceph): mds: load root inode on replay if auth
- If we are auth for the root inode, load it's initial value off of disk. We
may not see it in the log if it has not be... - 10:15 PM Revision 20593b0d (ceph): msgr: Unlock dispatch_queue.lock when short-circuiting queue_received.
- Previously we left the mutex locked, which is obviously bad bad bad!
I believe this was the cause of #673.
Signed-of... - 09:14 PM Revision 4efa3006 (ceph): filestore: assert on out of order journal pipeline submissions
- Signed-off-by: Sage Weil <sage@newdream.net>
- 09:14 PM Revision 259c509a (ceph): filestore: fix wake condition when journal submission blocks
- We only want to wake up if we are at the front of the line, in order to
preserve journal submission pipeline ordering... - 09:12 PM CephFS Bug #663: cmds segfaults
- added assert in commit:6c73da0a996421b108f19d7f1b56157cec281642 that will catch this sooner.
- 09:12 PM Bug #590 (Resolved): osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- fixed in commit:ff035ab31c524cfe63cc75d81ab706b5b7d35448
- 08:22 PM Revision 62e4cdea (ceph): common: print thread ID in sig handlers and assert
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 07:52 PM Feature #609: osd: query pool/pg for objects with given xattr
- Can we use the list objects operation (LSPG?) as a model here? That's basically what it does.. it grabs up to N item...
- 05:27 PM Feature #609: osd: query pool/pg for objects with given xattr
- The problem that I see is that this is going to be a very slow operation, and we need to figure out how not to timeou...
- 07:50 PM Revision 15dcc651 (ceph): mds: fix purge_stray for directories, zeroed layouts
- - We don't want to purge file content on directories
- Don't fall over if a file has a zero period
Reported-by: Paul... - 07:37 PM Revision 266cc6c6 (ceph): rbd: add watch option for rbd tool
- 06:30 PM Revision 6cdfa304 (ceph): osd: PG::Info::History: init last_epoch_clean
- It seems that we have not been zeroing
PG::Info::History:last_epoch_clean when the History structure is
created. This... - 06:24 PM Revision 7e83b570 (ceph): Merge branch 'testing' into unstable
- Conflicts:
configure.ac - 06:24 PM Revision 56861bd5 (ceph): Merge remote branch 'origin/keyring_cleanup' into unstable
- 06:15 PM Revision 1d5a69ff (ceph): debian: try to update pbuild env as needed
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:14 PM Revision 9ad05cf7 (ceph): SimpleMessenger.cc: Fixes a dispatch_throttler leak in queue_received
- when the pipe has been halted.
Signed-off-by: Samuel Just <samuelj@hq.newdream.net> - 06:09 PM Bug #674: tiobench stress test , OSD timeout
Hi ,
ceph 0.24, git ceph-client-standalone:master-backport.
It's very easy to reproduce this issue with tiotest.
...- 03:02 PM Bug #674: tiobench stress test , OSD timeout
- are the cosd instances running? what does 'ceph -s' show? it looks like it's just blocking because the osds aren't ...
- 05:11 PM Feature #668 (Resolved): Change number of open files in init script
- Implemented by commit:4018fb2f3bb5ee546134f93904ea8608617a8f2a
- 02:19 PM CephFS Bug #671 (Resolved): mds: fragstat problem after mds restart
- 02:19 PM CephFS Bug #671: mds: fragstat problem after mds restart
- fixed by commit:b40e7dc0f73bc24631e8cd79d68d790733aafb99
- 01:58 PM Bug #673: cmon: SimpleMessenger::Pipe::discard_queue
- I believe this should be fixed by commit:20593b0d38d5357c89b93fac8c06e2083fa56df9.
- 01:01 PM Bug #672 (Resolved): osd: crash from ballpit0
- fixed by commit:259c509a8941bf7cdad8bd4ede0ccd73ca8a83d3
- 12:44 PM Bug #672: osd: crash from ballpit0
- It seems like we are getting some out-of-order messages, just like in bug 666.
Here is the one that crashed us:
2... - 01:01 PM Bug #666 (Resolved): OSD: ReplicatedPG::eval_repop
- fixed by commit:259c509a8941bf7cdad8bd4ede0ccd73ca8a83d3
- 12:27 PM CephFS Bug #678 (Resolved): What if period == 0 in MDCache...purge_stray?
- fixed by commit:15dcc65199fc825ca8c51a31de3be01410aca9c1
- 11:36 AM CephFS Bug #678: What if period == 0 in MDCache...purge_stray?
- Based on what I was doing when this asplode I'd say it's either a directory or a file created but not yet written to ...
- 09:53 AM CephFS Bug #678: What if period == 0 in MDCache...purge_stray?
- The question is whether this was a regular file (shouldn't have a 0 period!) or a directory (we need to delete the me...
- 10:55 AM Bug #676 (Resolved): inconsistent handling of g_conf.keyring
- 10:46 AM Bug #669 (Rejected): testradospp doesn't build
- 09:47 AM Bug #665: mds(uo:replay),clent can not mount,return error 5 input/output
- You can't mount until mds replay completes. Probably it is blocked on osd io? Can uou repeat with 'debug ms = 1' in...
01/02/2011
- 08:52 PM Revision 924355df (ceph): auth: CEPH_KEYRING overrides g_conf.keyring
- Allow users to choose different keyring files by setting an environment
variable, CEPH_KEYRING.
Signed-off-by: Colin... - 08:20 PM Feature #679 (Resolved): build .debs for ubuntu releases
- should just be a matter of setting up pbuilder roots?
- 08:19 PM Revision 13e8507b (ceph): auth: make g_conf.keyring a plain old string
- Make g_conf.keyring a plain old string rather than an array of strings.
Don't do substitution using the user's HOME v...
01/01/2011
- 05:04 AM CephFS Bug #678 (Resolved): What if period == 0 in MDCache...purge_stray?
- Program terminated with signal 8, Arithmetic exception.
#0 0x0000...
12/30/2010
- 11:15 PM Revision 789cb9a4 (ceph): osdmaptool: better error handling
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:12 PM Revision e2c0f404 (ceph): common: bufferlist: handle EINTR, check close rval
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 10:50 PM Revision 51462d6b (ceph): common: bufferlist::read_file: return read errors
- Don't ignore errors when reading a file with buffer::list.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 06:38 PM Revision adbfd95c (ceph): cconf: code cleanup
- cconf: add a better usage() message, with examples. Give more helpful
error messages when the usage is wrong. Put dif... - 06:38 PM Revision 323ac1a7 (ceph): qa: rbd: disable automatic keyring discovery
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 06:38 PM Revision 634cca95 (ceph): qa: rbd: Update rbd test to use the new rbd sysfs
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 06:38 PM Revision c846615d (ceph): qa: rbd: small optimization to script
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 12:38 PM Linux kernel client Bug #677 (Resolved): rbd snapshot creation takes too long
- 12:37 PM Linux kernel client Bug #677: rbd snapshot creation takes too long
- Fixed with commit:b6a9e5db67e3685e1fbac9c92de2963bff541fc1. We called the notify under the ctl_mutex. The notify trig...
- 12:31 PM Linux kernel client Bug #677 (Resolved): rbd snapshot creation takes too long
- Apparently the notify doesn't get a completion event, even though the osd sends it. It only timeouts away from the wait.
- 10:32 AM Bug #676 (Resolved): inconsistent handling of g_conf.keyring
- In some places we treat g_conf.keyring as a single string. In others, we treat it as a comma-separated array. Sometim...
- 02:33 AM Revision e1a1e1e6 (ceph): auth: Fix buggy parsing of g_conf.keyring
- Previously, we weren't correctly handling comma-separated lists of
values in g_conf.keyring.
Signed-off-by: Colin Mc... - 01:58 AM Revision 27cba781 (ceph): common: str_list: const cleanup
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 01:02 AM Revision a4223d4f (ceph): osd: watch request doesn't use version
- also fix watch-notify to send the current user_version
- 01:02 AM Revision 7e206dfe (ceph): osd, librados: configurable notify timeout
12/29/2010
- 11:55 PM Bug #662 (Resolved): update rbd qa scripts to use new sysfs interface
- Implemented by commit:3914ee5e3ca70252e305805190226d60b5cf0dc7
C. - 11:36 PM Revision de8f0212 (ceph): qa/rbd: minor fixes
- * qa/rbd/common: create tempdir for script
* qa/rbd/common: try harder to find monhost (try "mon addr" and also
"mon... - 11:33 PM Revision ace90971 (ceph): makefile: ignore mkdir -p errors in install
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 10:34 PM Revision fddbd903 (ceph): cleanup: don't use __PRETTY_FUNCTION__
- Its output isn't as pretty as I had hoped.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 07:13 PM Revision bbd0ce25 (ceph): logging: clean up some error handling
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 04:56 PM Feature #389 (Resolved): Synchronize header modifications between clients
- configurable timeouts implemented as of commit:7e206dfec07033d49770bafcc2c86f6dd736d7c2.
I'm closing this issue, t... - 04:52 PM Feature #572 (Resolved): Implement lingering osd requests
- this was already completed.
- 01:07 AM Revision 22828b9a (ceph): osd: PG::Info::History: init last_epoch_clean
- It seems that we have not been zeroing
PG::Info::History:last_epoch_clean when the History structure is
created. This...
12/28/2010
- 11:25 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Very nice, thanks. I can almost say it worked!
... _almost_ because the recovery went well for a long time until o... - 09:44 PM Support #675 (Closed): hot to adjust replicate level,disk not the same size
- some disk is 3T,while some disks less 50G,
who can tell me the meaning of crush file.
# types
type 0 device
type... - 06:35 PM Bug #674: tiobench stress test , OSD timeout
- git the lastest ceph-client-standalone:master-backport:
run:
dd if=/dev/zero of=/mnt/ceph bs=4k count=500000
a... - 02:04 AM Revision b3bb8922 (ceph): signals: handle_fatal_signal: use SA_NODEFER
- SA_RESETHAND | SA_NODEFER allows the "re-trigger default signal handler"
trick to work for signals other than SIGSEGV... - 01:51 AM Revision cbeddc23 (ceph): signals: backtrace some more exotic fatal signals
- We're not likely to see these, but if we do, we want it in the logs!
Signed-off-by: Colin McCabe <colinm@hq.newdream... - 01:38 AM Revision ce81e6a5 (ceph): signals: test infinite_recursion
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 01:34 AM Revision ff6e4d4d (ceph): common: make generic_usage a little prettier
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 01:32 AM Revision 247b11b7 (ceph): signals: Handle SIGILL, SIGBUS, SIGFPE.
- Print out a backtrace when we get SIGILL, SIGBUS, or SIGFPE. Fix a bug
where we failed to install a SIGABRT handler.
...
12/27/2010
- 09:34 PM Revision a410360a (ceph): test: add TestSignalHandlers
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 04:51 PM Bug #674: tiobench stress test , OSD timeout
- printk many timed out messages, then ,tiobench test fail.
- 01:45 AM Bug #674 (Can't reproduce): tiobench stress test , OSD timeout
- Hi,
we do multi-thread stress test for ceph 0.23.1 , ceph client printk osd timeout.
1. test tool: tiobench-0.3.... - 12:31 AM Bug #673 (Resolved): cmon: SimpleMessenger::Pipe::discard_queue
- On noisy I saw my monitor go down with:...
12/25/2010
- 07:24 PM Bug #672 (Resolved): osd: crash from ballpit0
- see attached
12/24/2010
- 08:46 AM CephFS Bug #671 (Resolved): mds: fragstat problem after mds restart
- ...
- 08:24 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Ok, may not look like it, but this is actaully progress. Add this:...
12/23/2010
- 11:43 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Thanks for the patch, back to osd1 vs pg 3.70...
osd1 crashed:
2010-12-23 22:17:29.289927 7fef0ec40700 osd1 1292 ... - 09:12 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Sigh... this one shoheuld have t conditions match and be a bit more spacecase programmer proof. Also prints peer inf...
- 08:26 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Hi and thanks again for the patch. I guess it allowed things to go a bit further... at least I don't see the same cra...
- 09:44 PM Support #670: what is the condition of run testrados.
- ,i run testrados ,but it stop at rados_write function.
it hant there.
my cluster'osd lose many data,log says 1000...
12/22/2010
- 08:11 PM Support #670 (Closed): what is the condition of run testrados.
- This isn't really a bug, you should send questions like this to the list.
Anyway, it's just a silly little program... - 01:03 AM Support #670 (Closed): what is the condition of run testrados.
- it needs ceph cluster running?
or it must run on osd?or mds? or mon?
-------------------
because my ceph error,i r... - 03:58 PM Feature #389: Synchronize header modifications between clients
- main stuff left to be done:
- configurable timeouts (on client side?)
- more debugging - 12:49 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Looks like it got past the previous problem pg but bombed later... I forgot to test for backlog. This should do it?
... - 03:52 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Thanks for the patch, which I immediately tested:
osd2 crashes first, then osd1.
osd2:
#0 0x00007fb4b3db4ba5 ... - 12:58 AM Bug #669: testradospp doesn't build
- you have to give -I(headder dir) -l(redos) ,when compile
look gcc options
Greg Farnum wrote:
> gregf@kai:~/ceph/sr...
12/21/2010
- 09:56 PM Revision 532522da (ceph): rgw_admin: fix compile error in usage
- generic_usage was changed in c4b5c33b123edfc7770e156b4b7f183f8b0af0c9
but this use was not updated. Should call gene... - 08:50 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Oh, the case above it catching it. Can you please try...
- 02:11 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- I just tried 180a417603552 which is v0.24.
Same crash, and no occurrence of
@dout(10) << "activate peer osd" << p... - 06:48 PM Revision 61f964c6 (ceph): librados: Fix compile error by adding std:: namespace
- No idea how this got left out or passed any testing?
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> - 05:36 PM CephFS Bug #663: cmds segfaults
- Can you attach that output.txt please? We need to see what path it being taken that submits the journal item in teh ...
- 02:02 AM CephFS Bug #663: cmds segfaults
- After even more advice on IRC, here's what happens on the commandline:
cmds -n mds.dango -D --debug_mds 20 > outpu... - 10:55 AM Bug #669 (Rejected): testradospp doesn't build
- gregf@kai:~/ceph/src$ g++ testradospp.cc
In file included from include/object.h:28,
from include/... - 06:00 AM Feature #668 (Resolved): Change number of open files in init script
- On a busy system with multiple OSD's your OSD's can reach the max number of open files.
It should be useful IMHO i... - 05:55 AM Feature #667 (Closed): Feature: ceph osd pool get data get pg_size
- Hi,...
- 02:40 AM Bug #666: OSD: ReplicatedPG::eval_repop
- Forgot to mention, this was on the RC branch, commit: 180a4176035521940390f4ce24ee3eb7aa290632 (v0.24)
- 02:33 AM Bug #666 (Resolved): OSD: ReplicatedPG::eval_repop
- On my 'noisy' machine I wanted to see how fast writing a 10GB file would go, but this caused some crashes.
This ma...
12/20/2010
- 11:58 PM Revision 180a4176 (ceph): v0.24
- 09:22 PM Revision 69940e27 (ceph): osd: compensate for replicas with tail > last_complete
- Normally we shouldn't ever have a last_complete < log.tail (&& !backlog).
But maybe we do (old bugs, whatever; see #5... - 09:20 PM Bug #665 (Closed): mds(uo:replay),clent can not mount,return error 5 input/output
- mount.ceph mon:/ /mnt
return error 5 input/output
-------------
[root@tcmaster ~]# ceph -s
2010-12-21 05:17:54.05... - 07:34 PM Revision 0e510dbe (ceph): objectcacher: Fix erroneous reference to "lock" with "flock."
- This looks to be an old bug introduced years ago in
267679abc7e29e73655da7367d87e22a0a0d2375, and left
undiscovered d... - 06:50 PM Bug #664 (Closed): osd auto dead,zombie state.
- osd.1.log
2010-12-21 02:51:11.996619 7f1fe40e2720 filestore(/spare/osd1) lock_fsid failed to lock /spare/osd1/fsid, ... - 03:49 PM Feature #620: objecter: (optionally) read from replica if on localhost and primary is not
- first pass just compares the ip address, commit:f40feef994a5c50fec5230af091f949979422298
- 03:13 PM Linux kernel client Bug #657 (Resolved): nfs-reexport null deref in ceph_init_dentry
- 03:06 PM Linux kernel client Bug #657: nfs-reexport null deref in ceph_init_dentry
- fix was simpler than that.. commit:92cf765237e2787eb168096305c448caf25ac7f8
- 03:12 PM Tasks #653 (Resolved): get playground radosgw up and running again
- 03:12 PM Bug #655 (Resolved): class objects are being stripped (debian packages)
- commit:1e291fc9ef36a5c7ca36053012aec3e9a89f9861
- 03:11 PM Bug #656: ceph
- ... or at least the crash in ceph_crush.txt is. It's not clear from the report why the 'ls' is hafornging.
- 03:09 PM Bug #656 (Closed): ceph
- This is fixed in the latest code. Thanks for the report!
- 01:12 PM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Okay, i see the problem. Or at least teh symptom. Please try the latest 'rc' branch, commit:69940e2.
I'm not cer... - 02:34 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Hi, I produced these logs with the latest rc (b04b6f48237fcbd22716279726f827237a3eb4af).
I have not seen the prima... - 11:26 AM Bug #644: rsync can be sloooow
- Like I promised, I've done some rsync benchmarking on my Noisy machine, first a small hardware list:
* Dual-Core Int... - 10:23 AM Bug #658 (Resolved): osd: scrub crash
- 10:14 AM CephFS Bug #663: cmds segfaults
- Uploaded another crash .tar.gz:
http://68.178.169.4:81/mds_crash2.tar.gz
(approx 125M) - 07:27 AM CephFS Bug #663 (Can't reproduce): cmds segfaults
- Hello.
One of my cmds daemons segfaults.
After asking for advice on #ceph/irc.oftc.net, I used cdebugpack -c /e...
12/19/2010
- 02:03 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- Sage Weil wrote:
> Stefan Majer wrote:
> > Hi,
> >
> > we see similar problems with kernel 2.6.37-rc6 and ceph b...
12/18/2010
- 02:49 PM Bug #662 (Resolved): update rbd qa scripts to use new sysfs interface
- ...and run them on one or more of the ceph* nodes!
see Documentation/ABI/testing/sysfs-bus-rbd in latest -rc to se... - 10:09 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- Sage, like you asked:...
- 05:02 AM Revision b04b6f48 (ceph): mds: make nested scatterlock state change check more robust
- The predirty_journal_parents() calls wrlock_start() with nowait=true
because it has a journal entry open and we don't... - 12:52 AM Revision d64616b9 (ceph): mds: set layout on files created by MKNOD instead of CREATE
- Signed-off-by: Sage Weil <sage@newdream.net>
- 12:52 AM Revision 7d33830a (ceph): mds: set a writeable client range on regular files created via MKNOD
- If the client reexports ceph via nfs, file creations come through as
a MKNOD followed by OPEN. If it's a MKNOD on a ... - 12:52 AM Revision df7e3ffd (ceph): mds: issue caps on regular files created via MKNOD
- Give the client some caps to write with. This is similar to the current
MKDIR behavior, but with different bits.
Si... - 12:02 AM Revision 7df46936 (ceph): cephx: fix uninitialized value
- Signed-off-by: Sage Weil <sage@newdream.net>
12/17/2010
- 11:31 PM Revision 285f351b (ceph): mon: build_initial_monmap: fix mismatched alloc
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:30 PM Revision 3a235b0f (ceph): filestore: make OpSequencer::flush() work for writeahead journaling items
- It was only waiting for items in the op_queue to complete. The goal is
to wait for anything we've called queue_trans... - 11:26 PM Revision caa46093 (ceph): common: cleanups
- common_init: avoid (mismatched) heap allocation
ConfFile::_parse: avoid memory leak on error path
ConfFile: NULL fi... - 11:14 PM Revision 2f916086 (ceph): auth: avoid mismatched allocation
- Can't pair strdup and free.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 11:14 PM Revision 795811d6 (ceph): hadoop: fix a bunch of mismatched allocations
- Using array new means you need array delete.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 11:14 PM Revision f7dc1a92 (ceph): rgw: fix fd leak on error path
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:14 PM Revision 28bcf0bc (ceph): osd: PG::choose_acting: fix major iterator mistake
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 10:15 PM Revision 5184db44 (ceph): filestore: add per-sequencer flush operation
- Signed-off-by: Sage Weil <sage@newdream.net>
- 10:15 PM Revision 3c7d30f1 (ceph): osd: flush pg writes to disk before starting scrub scan
- This avoids two races:
- we just completed recovery by pushing objects to the replica, and the
replica starts sca... - 09:09 PM CephFS Bug #661 (Resolved): mds: mdlog recursion / scatter crash
- commit:b04b6f48237fcbd22716279726f827237a3eb4af
- 08:56 PM CephFS Bug #661 (Resolved): mds: mdlog recursion / scatter crash
- ...
- 08:51 PM Revision 2fb60daf (ceph): osd: debug scan_list and scrub a bit better
- Signed-off-by: Sage Weil <sage@newdream.net>
- 08:28 PM Revision c3a24fc5 (ceph): osd: refactor _get_pool / _put_pool a little bit
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 08:00 PM Revision 39aff221 (ceph): osd: OSD::_lookup_pool: avoid double lookup
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 07:52 PM Revision bf4529b9 (ceph): test: create test_pools.sh
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 06:59 PM Revision 1cfad2ea (ceph): osd: clear INCONSISTENT if scrub detects no errors
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:36 PM Revision b1908755 (ceph): osd: add assert that we're replica
- ar Fred saw a crash where we got into merge_log as a stray, which really
shouldn't ever happen! See #590.
Signed-of... - 04:59 PM CephFS Cleanup #660 (Closed): mds: use helpers in mknod, mkdir, openc paths
- Lots of not-quite-duplicate code here that can be cleaned up
- 04:43 PM Bug #531: Journaling Causes System Hang
- This is strange.
I tried a very large write.
root@cephnode01:/mnt# dd if=/dev/zero of=cephtest01 bs=1M count=1M... - 12:27 PM Bug #531: Journaling Causes System Hang
- Okay,
I got everything updated and it appears the cluster hung right about 180GB of data written. Which was double... - 11:41 AM Bug #531: Journaling Causes System Hang
- Just wanted to make an update.
I just got a chance to get back on the cluster today.
I have upgraded the cluste... - 04:31 PM Revision 1e291fc9 (ceph): debian: don't strip rados classes
- Signed-off-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@newdream.net> - 04:30 PM Revision 9c173bb4 (ceph): debian: rename ceph.lintian -> ceph.lintian-overrides
- Signed-off-by: Laszlo Boszormenyi <gcs@debian.hu>
Signed-off-by: Sage Weil <sage@newdream.net> - 04:15 PM Feature #659 (Resolved): direct io unit test
- For qa/workunits:
Test directio reads and writes where
- the file offset is 512-byte and not page aligned
- the ... - 01:29 PM Bug #629: cosd segfaults when deleting a pool containing degraded objects
- This shouldn't happen again commit:c3a24fc5d31d53e3db911be900b9067584f0e07e
It still might be interesting to see t... - 10:48 AM Bug #658 (Resolved): osd: scrub crash
- ...
- 10:40 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- ar Fred wrote:
> 2010-12-03 16:29:00.261788 7fe2f28a9710 osd1 1059 pg[3.70( v 660'2955057 lc 567'2954755 (643'295484... - 01:47 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- I spent some time yesterday trying the latest rc (914f6ddebd899667b1937dfe9d5f1a94537dc500), and I still hit the same...
- 09:33 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- Stefan Majer wrote:
> Hi,
>
> we see similar problems with kernel 2.6.37-rc6 and ceph build from yesterday (29480... - 12:21 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- Hi,
we see similar problems with kernel 2.6.37-rc6 and ceph build from yesterday (29480f42be8551f47d79282b7376a10a... - 09:02 AM Linux kernel client Bug #657 (Resolved): nfs-reexport null deref in ceph_init_dentry
- Al Viro observes:...
- 08:00 AM Revision b2746c94 (ceph): logging: eliminate out-of-range dout calls
- Eliminate calls to dout that use non-existent log levels, like negative
levels less than -1. Also trigger a compiler ... - 12:25 AM Revision c4b5c33b (ceph): common: more cerr -> derr conversions
- cmds: cerr -> derr
DoutStreambuf: primitive_log: just write to the stdout fd rather than cerr
assert: don't write o...
12/16/2010
- 11:31 PM Bug #656 (Closed): ceph
- i have 2 osd,[2.6.36,ceph-0.23.1]
#ceph osd down 0
#ls /mnt/ceph {ceph's mount dir}
hanging here,
after a while ... - 10:57 PM Revision 30f752cd (ceph): gceph,ceph: replace cerr->derr
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 10:05 PM Revision c76379fd (ceph): cosd: replace cerr with derr
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:06 PM Revision 73669d87 (ceph): PG.cc:
- sub_op_scrub must set finalizing_scrub on the replica
before waiting for last_update_applied to catch up to
info.la... - 06:32 PM Revision 4644247c (ceph): osd: FileJournal: use derr
- Use derr to announce errors in FileJournal.
Handle EINTR where necessary (still haven't fixed
read/write/pread/pwrit... - 06:32 PM Revision 4fc1af5e (ceph): logging: re-introduce derr
- Re-introduce derr as a special log level (level -1) which will show up
in all logs, and on stderr. These messages are... - 05:10 PM Tasks #653: get playground radosgw up and running again
- S3 gateway is up and running (I just recreated my user and tested it, it was working before). We still need to recrea...
- 03:02 PM Bug #655 (Resolved): class objects are being stripped (debian packages)
- When installing the latest debian packages, the resulting /usr/lib/rados-classes/*.so are completely stripped, thus w...
- 02:56 PM Tasks #654 (Resolved): get playground ladder0 mounted
- mounted, was just missing name=username on the mount command. Also created the users.
- 08:11 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- My rsync (196k files, 74GB data) finished succesfully, but the btrfs warning repeated itself twice.
- 02:09 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- Compiled from the kernel GIT ( a4851d8f7d6351a395d36ae8fdcf41745a832d76 ) last night and then started a rsync this mo...
- 01:23 AM Revision 29480f42 (ceph): ReplicatedPG.cc:
- _scrub must set head when it encounters a head snap
curclone counts down, not up
Signed-off-by: Samuel Just <samuel... - 12:33 AM Revision 1e490eff (ceph): osd: timed out watcher is added to unconnected map
- 12:33 AM Revision c321620e (ceph): osd: send notify message only to unexpired watchers
- 12:26 AM Revision 619b45ad (ceph): logging: close file when reloading global config
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
12/15/2010
- 11:23 PM Revision 060fd428 (ceph): osd: fix watch timer, locking
- 09:39 PM Revision 914f6dde (ceph): filestore: detect final version of async ioctl SNAP_CREATE_V2
- Li's revised interface for the async snap ioctl is more flexible. Update
the ioctl call sites and detection code acc... - 09:15 PM Tasks #654 (Resolved): get playground ladder0 mounted
- Get it mounted (it was being weird on me). And recreate the users (probably servicectl ladder0:user config from yakko?)
- 09:14 PM Tasks #653 (Resolved): get playground radosgw up and running again
- I'm not sure what needs to be done to configure the radosgw pools..
Also we need to recreate the users/buckets fr... - 09:10 PM Bug #648 (Resolved): monclient: PGMap::apply_incremental
- trimming changed by commit:89d5c91e7d207d646651f8959ee37a15ea199d1b
- 09:10 PM Bug #631 (Won't Fix): OSD: FileJournal::committed_thru
- Sage Weil wrote:
> Okay, the first crash you saw is due to #645. I think it's a kernel bug causing that ioctl to fa... - 09:07 PM Revision 06a2d7a2 (ceph): mds: Save straydn in mdr so it's consistent across retry attempts.
- Otherwise, we could choose new stray dirs and fail to get all
the locks we needed (while leaving old strays locked fo... - 09:06 PM Cleanup #650 (Resolved): objecter: refactor request tracking to be per-osd instead of per-pg
- commit:d54a854811a51a5730b548da712d59761057fa58
- 08:44 PM Revision e31f0a47 (ceph): tools: don't start msgr thread before daemonize
- Calling messenger->add_dispatcher_head() has the side-effect of starting
the messenger thread. So we must not do it b... - 07:02 PM Revision d54a8548 (ceph): Merge branch 'objecter' into unstable
- 07:02 PM Revision 065cdf52 (ceph): objecter: track pending requests by osd, not pg
- This is a big cleanup. Also
- switch to keeping per-osd Connection *'s
- make requests time out independently (not... - 07:02 PM Revision f6dc5d9f (ceph): objecter: cleanup: rename op maps
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:02 PM Revision 32a8aed9 (ceph): objecter: add reopen_session helper
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:02 PM Revision 530083cc (ceph): objecter: check for pg mapping changes in each incremental; refactor mi...
- We need to detect when a pg mapping changes but the primary stays the same.
That means we can't just look at the fina... - 07:01 PM Revision 5d44d599 (ceph): msgr: mark down by Connection*
- Signed-off-by: Sage Weil <sage@newdream.net>
- 07:01 PM Revision 07e593c4 (ceph): mds: fix inode ancestor attr encoding
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:15 PM Revision fdbd85e4 (ceph): automake: ignore rmdir errors during uninstall
- We don't want to fail "make distcheck" for a silly reason.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 06:01 PM Bug #652 (Resolved): ReplicatedPG _scrub missing clone
- ReplicatedPG.cc:4016 = handle the missing clone case
- 03:18 PM Linux kernel client Bug #552 (Resolved): Samba with kernel oplocks=on produces lots of corrupt mds entries in dmesg
- Closing this out unless we hear about more issues.
- 01:38 PM Feature #643 (Resolved): filestore: update btrfs ioctl interface for soon-to-be-pushed SNAP_CREAT...
- commit:914f6ddebd899667b1937dfe9d5f1a94537dc500
- 01:01 PM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- I took the for-linus branch, since both next-rc and master wouldn't compile against 2.6.37-rc5. Due to this error:
... - 10:39 AM Bug #563: osd: btrfs, warning at inode.c ( btrfs_orphan_commit_root )
- I've just hit the bug again, this while I was running a rsync to my Ceph cluster....
- 09:48 AM Feature #95: mon: adjust overload based on osd disk utilization
- Implemented reweight-by-utilization in the overload branch.
C. - 01:00 AM Revision 7b5e923c (ceph): osd: send pending notification for reconnected watcher
- 12:28 AM Revision f9694648 (ceph): automake: add osd/Watch.h to noinst_HEADERS
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
12/14/2010
- 09:55 PM Linux kernel client Bug #651 (Resolved): osd_client: need to recalculate request mapping for every osdmap incremental
- Currently if we get an osdmap message with multiple incrementals, and a request maps to a different osd and then back...
- 08:47 PM Revision c8d9b20c (ceph): Merge branch 'sync2' into unstable
- 07:50 PM Revision 89d5c91e (ceph): mon: trim pgmap less aggressively
- This will make observer crashes due to missed states (#648) much harder to
hit. Eventually the pgmap state trim prob... - 07:02 PM Revision 056e91e0 (ceph): librados: drop watch_lock
- Use the existing lock to do protect all of this.
Signed-off-by: Sage Weil <sage@newdream.net> - 07:00 PM Revision d4420a8a (ceph): objecter: drop linger_info_mutex
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:51 PM Revision b989087d (ceph): crypto: catch cryptopp decrypt/encrypt exceptions
- 06:48 PM Revision 215f3320 (ceph): objecter: simplify linger register
- Drop single-use helper; make unregister_linger part of the public
interface.
Signed-off-by: Sage Weil <sage@newdream... - 06:47 PM Revision b60a9abf (ceph): objecter: fix up linger ack/commit to trigger first time only
- We only want the user-provided ack/commit callbacks to trigger the first
time we register the lingering op. Same goe... - 06:35 PM Revision 8a75086d (ceph): objecter: clean up linger interface
- Put LingerOp on heap. Use xlist to attach to PGs. Add in/out bufferlists.
Signed-off-by: Sage Weil <sage@newdream.... - 05:55 PM Revision 96b32382 (ceph): Merge remote branch 'origin/unstable' into sync2
- Conflicts:
src/auth/Crypto.cc
src/osd/ReplicatedPG.cc
src/osd/ReplicatedPG.h
src/osd/osd_types.h - 03:03 PM Feature #562 (Closed): separate gui into separate binary, package
- 02:56 PM Revision 3e076c39 (ceph): logging: use Mutex::Locker
- Use Mutex::Locker to make logging exception-safe. That is, if you are
doing "dout() << foo() << dendl;" and foo throw... - 02:20 PM Revision bf31f3f1 (ceph): logger: Fix DoutStreambuf::create_rank_symlink
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:13 PM Revision 4c377199 (ceph): cephtool: rename tools files
- Rename tools files to be more consistent. For example, the main()
function for ./ceph should be in ceph.cc.
Signed-o... - 12:50 PM rbd Feature #341: libvirt bindings
- A "network" disk type was introduced in "036ad5052b43fe9f0d197e89fd16715950408e1d":http://libvirt.org/git/?p=libvirt....
- 11:44 AM Bug #646: [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git + ceph 0.23
- I'm going to get the -standalone.git repo updated to the latest code (it's a bit out of date) so we can confirm this ...
- 11:36 AM Cleanup #650 (Resolved): objecter: refactor request tracking to be per-osd instead of per-pg
- 10:40 AM Bug #649 (Resolved): OSD: CryptoPP::StreamTransformationFilter::LastPut
- Fixed in b989087ddf8775588ddbb6234d099398a2e18072. CryptoPP threw an exception when failed to decode message (probabl...
- 02:34 AM Bug #649 (Resolved): OSD: CryptoPP::StreamTransformationFilter::LastPut
- This morning on my test machine (noisy.ceph.widodh.nl, 1 MON, 1 MDS, 3 OSD) all three OSD's died at exact the same mo...
- 09:58 AM Revision a3fcf908 (ceph): logging: Fix use-before-access in debug.cc
- Signed-off-by: Vangelis Koukis <vkoukis@cslab.ece.ntua.gr>
Signed-off-by: Constantinos Venetsanopoulos <cven@cslab.ec... - 09:53 AM Revision 3932f084 (ceph): osd: PG::prior_set_affected: const cleanup
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:14 AM Bug #648: monclient: PGMap::apply_incremental
- This is a known issue, caused by the pg state trimming. It'll go away eventually with #647. In the meantime, I'll m...
- 01:31 AM Bug #648 (Resolved): monclient: PGMap::apply_incremental
- I left my laptop on last night with a 'ceph -w' on one of my test machines, this morning I saw:...
12/13/2010
- 08:23 PM Feature #647 (Duplicate): mon: refactor paxos interaction
- We currently have a paxos instance per state machine, which is silly for a bunch of reasons. The big one is that a m...
- 07:28 PM Bug #646: [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git + ceph 0.23
- master-backport ,reproduce it.
unstable-backport,can't reproduce it ,
i'm not sure whether this issue had been fixed. - 07:00 PM Bug #646: [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git + ceph 0.23
- /ceph-client-standalone$ git branch
master
* master-backport
- 06:45 PM Bug #646 (Can't reproduce): [OOPS] mount.ceph monip:6789: /mnt/ceph ,ceph-client-standalone.git +...
- 1. ceph-client-standalone: git from git://ceph.newdream.net/git/ceph-client-standalone.git
2. ceph: ceph-0.23
3.OS:... - 10:18 AM Feature #640: support log rotation
- I guess when I filed this I was thinking of a setup where there was a small tmpfs partition where the logs went to, w...
12/12/2010
- 10:40 PM Revision 9add26be (ceph): mds: fix replay/resent vs completed request check
- If it is a _replayed_ request, we should always send a simple ack if it is
completed, because the client doesn't not ... - 09:51 PM rbd Tasks #421 (Resolved): get rbd support into qemu upstream
- 09:50 PM Linux kernel client Bug #473 (Can't reproduce): Kernel panic: ceph_pagelist_append
- 09:49 PM Linux kernel client Bug #304 (Can't reproduce): GPF in writepages_finish
- 09:47 PM Linux kernel client Feature #642 (Rejected): fill in s_uuid on superblock
- nevermind, i misread the nfs thread.. this is an extN thing.
- 02:33 PM CephFS Cleanup #638 (Resolved): mds: verify open+create resent/replayed event exception
- commit:9add26b
12/11/2010
- 09:58 PM Revision 0e08cb0f (ceph): osd: return ENOSPC for non-mds if full flag is set in osdmap
- Signed-off-by: Sage Weil <sage@newdream.net>
- 04:09 PM Revision 239b7677 (ceph): Merge remote branch 'origin/syslog' into unstable
- Conflicts:
src/mon/Paxos.cc
src/osd/PG.cc - 04:04 PM Revision 46242586 (ceph): Merge branch 'gceph' into unstable
- 08:20 AM Feature #562: separate gui into separate binary, package
- merged everything so far in commit:46242586eddcc948f71260f8c1ea2e8b1845a9f8
- 07:48 AM Feature #562: separate gui into separate binary, package
- Looks good! The one thing I'd change is to rename ceph.cc tool-common.cc or something along those lines, and cmd.cc ...
- 08:20 AM Feature #245 (Resolved): Logging to syslog
- merged in commit:239b7677e7a9df86d35cbfb25226c3f1a06771c5
- 08:09 AM CephFS Feature #630: release caps on inodes unlinked by other clients
- Sage Weil wrote:
> Or, the MDS needs to delete file data as soon as a stray's wanted drops to 0.
That won't work,... - 08:06 AM RADOS Feature #433: improve osd reweighting
- I think the thing to do here is extend the CrushWrapper interface (probably by wrapping something in mapper.c or buil...
- 04:34 AM Revision 292414c5 (ceph): gceph: Add gceph to rpm, deb
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 04:30 AM Revision 71a19a94 (ceph): gceph: run shutdown functions at exit
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 04:25 AM Revision bb82fd3d (ceph): gceph: fix compile
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 04:15 AM Revision 1a201f85 (ceph): gceph: add -h argument
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 04:12 AM Revision b4ceb194 (ceph): ceph tool: Create gceph
- Put the gui into a separate binary.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
12/10/2010
- 08:25 PM Feature #562: separate gui into separate binary, package
- I implemented this in the gceph branch.
cheers,
Colin - 04:28 PM Bug #631: OSD: FileJournal::committed_thru
- Okay, the first crash you saw is due to #645. I think it's a kernel bug causing that ioctl to fail.
This is actuall... - 03:18 AM Bug #631: OSD: FileJournal::committed_thru
- To answer your question, I never saw a crash with 2.6.37, I just rebooted back into 2.6.32
Just rebooting into 2.6... - 04:25 PM Bug #645 (Closed): intermittent failure of snap ioctl
- We're occasionally getting back an EINVAL from teh snap create ioctl.
extra debugging in place on the sepia test... - 04:15 PM Bug #644: rsync can be sloooow
- Copy of my notes file:
FS->FS
gregf@kai:~/ceph/src$ time rsync -r /btrfs/gregf/ceph-client/ mnt/ceph-client
skippi... - 04:15 PM Bug #644 (Closed): rsync can be sloooow
- This is probably due to metadata ops being fairly expensive, but we should inspect an rsync run over Ceph to make sur...
- 04:02 PM Feature #643 (Resolved): filestore: update btrfs ioctl interface for soon-to-be-pushed SNAP_CREAT...
- Chris is going to push Li's revision of the async ioctl for the next -rc.
12/09/2010
- 11:57 PM Revision 49844738 (ceph): librados, objecter: fix unwatch operation
- 10:38 PM Revision 346a2aac (ceph): rpm: update changelog
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 10:35 PM Revision e23d6200 (ceph): rpm: fix ceph.spec to work with gcephtool
- Don't try to package gui_resources unless we are building the GUI.
Get GUI dependencies correct.
Signed-off-by: Coli... - 09:55 PM Revision e5769b06 (ceph): objecter: resend_linger copies ops
- 09:45 PM Revision 83612ef7 (ceph): Fix overflow in FileJournal::_open_file()
- [ The following text is in the "iso-8859-7" character set. ]
[ Your display is set for the "iso-8859-1" character... - 09:09 PM Revision 329ae1bc (ceph): ReplicatedPG: snap_trimmer now acquires a read lock on the osd map
- before calling share_pg_info.
Signed-off-by: Samuel Just <samuelj@hq.newdream.net> - 09:09 PM Revision d0fbc30a (ceph): ReplicatedPG.cc: Fixes a bug in snap_trimmer where a pointer to a stack
- Cond is left in the mode.waiting_cond list.
Signed-off-by: Samuel Just <samuelj@hq.newdream.net> - 07:43 PM Bug #631: OSD: FileJournal::committed_thru
- Okay, pretty sure this was caused by a bug in 2.6.37-rc that was doing an async commit even for the sync snap/subvol ...
- 01:45 PM Bug #631: OSD: FileJournal::committed_thru
- Okay, confirmed it was the same bug you were seeing:...
- 07:18 PM Revision f68e6e7d (ceph): rpm: don't try to package radosacl
- radosacl is just a test binary, so unless we build with --with-debug, we
won't get it.
Signed-off-by: Colin McCabe <... - 07:18 PM Revision 6722b0c8 (ceph): rpm: add pkgconfig to BuildRequires
- You can't build without pkgconfig.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 06:28 PM Revision 9df18d19 (ceph): rpm: set files-attr for radosgw
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:13 AM Bug #639: RHEL6 beta packaging breaks on 'libcls_rbd.so.1.0.0.debug'
- The purpose of touching it is to prevent the installed .so from being stripped. Which means we don't need it in the ...
- 09:11 AM Linux kernel client Feature #642 (Rejected): fill in s_uuid on superblock
- 09:09 AM Feature #640 (Closed): support log rotation
- see logrotate.conf. it doesn't cap logs by size, but follows the standard scheme used by just about everything else ...
- 08:48 AM Feature #640 (Closed): support log rotation
- Here I'm talking about the log messages generated by dout() and friends.
We should allow users to set up log rotat... - 08:50 AM Feature #641 (Rejected): allow logs to be piped to an external program
- We already support sending logs to syslog, to stdout, or to a file. We could pretty easily support a fourth option, w...
- 02:10 AM Revision b4264fbb (ceph): filejournal: reset last_commited_seq if we find journal to be invalid
- If we read an event that's later than our expected entry, we set read_pos
to -1 and discard the journal. If that hap... - 12:02 AM Revision cc78bbf1 (ceph): objecter: create a new op for resending lingering requests
12/08/2010
- 09:51 PM Revision 027d5bfd (ceph): logger: tweak cmon log output a bit
- Make the output of cmon on stderr a little bit less verbose.
Signed-off-by: Colin McCabe <colinm@hq.newdream.net> - 08:25 PM Revision fdc7414e (ceph): logging: DoutStreambuf: handle daemonizing better
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 07:36 PM Revision 5cba1e63 (ceph): objecter: a few lingering fixes
- 07:12 PM Revision a9c098df (ceph): mon: use helper for clock drift check; log relative instead of absolute...
- Signed-off-by: Sage Weil <sage@newdream.net>
- 06:54 PM Revision 986c2af4 (ceph): logging: debug.h: use DoutStreambuf
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 03:52 PM Cleanup #299: catch std::bad_alloc and die with helpful error in log on ENOMEM
- This is kind of a tricky thing to really handle correctly.
In practice, most people run with memory overcommit tur... - 01:56 PM Feature #245: Logging to syslog
- 11:19 AM Bug #639 (Resolved): RHEL6 beta packaging breaks on 'libcls_rbd.so.1.0.0.debug'
- /usr/lib/debug/usr/lib64/rados-classes/libcls_rbd.so.1.0.0.debug is
installed but unpackaged.
In spec file I see ... - 11:11 AM CephFS Cleanup #638 (Resolved): mds: verify open+create resent/replayed event exception
- These look a bit fishy to me.
- 11:11 AM CephFS Bug #637 (Resolved): mds: check replica scatterlock flush on rejoin
- This needs to behave consistent with the start_flush/finish_flush hooks.
- 11:06 AM Bug #636 (Can't reproduce): RHEL6 beta packaging breaks on 'gui_resources'
- 'gui_resources' not getting into BUILDROOT correctly.
ceph.spec.in:128
%{_datadir}/ceph_tool/gui_resources/*
Co... - 10:31 AM Bug #635 (Resolved): RHEL6 beta packaging breaks unexpected characters
- ceph.spec.in uses @VERSION@ directly, which chokes rpmbuild when ''Version:' uses certain characters (for example 0.2...
- 07:27 AM Revision c53ffafb (ceph): logging: Remove _dout_check_log
- _dout_check_log is unneeded, since every invocation of dout makes the
same check.
Signed-off-by: Colin McCabe <colin... - 07:17 AM Revision 8fdd0f44 (ceph): logging: debug.h: minor cleanup
- Don't put std::ostream into the global namespace. Copyright update.
Signed-off-by: Colin McCabe <colinm@hq.newdream.... - 06:52 AM Revision aeba6bca (ceph): logging: eliminate dbeginl
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 03:09 AM Revision 0f0cb46a (ceph): logging: Implement rank symlinks
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:54 AM Revision 6c4a7d58 (ceph): logging: Support isym_path
- Support instance symlinks, which are activated when we are using
g_conf.log_per_instance.
Signed-off-by: Colin McCab... - 02:51 AM Revision e597d02d (ceph): logging: rename_output_file -> handle_pid_change
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:49 AM Revision 116478a3 (ceph): logging: _calculate_opath: use g_conf.log_dir
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:47 AM Revision ea3414d4 (ceph): logging: DoutStreambuf: better debug output
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:45 AM Revision 627399f7 (ceph): logging: create_symlink:sometimes use rel symlinks
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:43 AM Revision b00baab1 (ceph): logging: implement get_dirname, move get_basename
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 02:41 AM Revision ef223664 (ceph): logging: fix normalize_relative
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 12:44 AM Revision 2000f69e (ceph): mds: no not choose lock state on replicas
- The lock state has already been set during rejoin.
Signed-off-by: Sage Weil <sage@newdream.net> - 12:44 AM Revision fe103003 (ceph): mds: sync->mix replica state is sync->mix(2)
- When auth first moves to sync->mix,
- auth sends AC_MIX to replicas
- replicas go to sync->mix
- replicas finish g... - 12:44 AM Revision 4f643994 (ceph): mds: introduce rejoin_invent_dirfrag() helper
- Signed-off-by: Sage Weil <sage@newdream.net>
- 12:44 AM Revision f97660ff (ceph): mds: fix LOOKUPHASH to avoid creating bogus replica CDir
- We can't create the CDir if we are non-auth.
Signed-off-by: Sage Weil <sage@newdream.net> - 12:44 AM Revision 681b010f (ceph): mds: clear EXPORTINGCAPS on export_reverse
- We need to reverse the effects of encode_export_inode_caps(), which is just
the pin and state bit.
The original prob... - 12:44 AM Revision 9bbb33b4 (ceph): mds: send LOCKFLUSHED to trigger finish_flush on replicas
- Since f741766a we have triggered start_flush and finish_flush on replicas.
The problem is that the finish_flush didn'... - 12:44 AM Revision c681ed75 (ceph): mds: explicitly pass scatterlock dirty flag to auth on gather
- This ensures that if the replica is thinks it is flushing something the
auth will always do a scatter_writebehind.
S... - 12:44 AM Revision 39c5933d (ceph): mds: add missing try_clear_more() to scatterlock
- Signed-off-by: Sage Weil <sage@newdream.net>
- 12:44 AM Revision b5fd2e4d (ceph): mds: open undef dirfrags during rejoin
- Any invented dirfrags have a version of 0. This will cause problems later
if we pre_dirty() anything in that dir bec... - 12:44 AM Revision 2ea9b2d7 (ceph): mds: fix replay of already-journaled requests
- Check for already-completed tids for both retried and replayed requests.
Signed-off-by: Sage Weil <sage@newdream.net> - 12:44 AM Revision 9b9b8693 (ceph): mds: rev mds cluster internal protocol
- The lock encoding changed with the dirty bit on scatterlocks.
Signed-off-by: Sage Weil <sage@newdream.net> - 12:44 AM Revision 3825c4b8 (ceph): mds: small rejoin cleanup
- Signed-off-by: Sage Weil <sage@newdream.net>
12/07/2010
- 11:47 PM Revision 42464fb7 (ceph): logging: Add symlink helper functions
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 10:00 PM Revision e2ba601b (ceph): logger: fix EINTR handling
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:57 PM Revision bacdd493 (ceph): logging: rename_output_file: fix bug
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:55 PM Revision d70851ef (ceph): logging: DoutStreambuf: Implement log-to-file
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:46 PM Revision 95211145 (ceph): logging: Add log_to_file option
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 09:40 PM CephFS Bug #15 (Resolved): mds rejoin: invented dirfrags (MDCache.cc:3469)
- commit:b5fd2e4d4ee4bf02a993e75b756a3775b2d566e5
- 08:11 PM Revision df5d4e62 (ceph): logging: DoutStreambuf improvements
- Write to stdout_fileno directly rather than using a buffer, which we
would then have to flush. Fix a bug in the buffe... - 06:56 PM Revision 1e2e4aa0 (ceph): automake: in scripts, use sysconfdir as-is
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 06:48 PM Revision 10b6887e (ceph): automake: in deb pkg, use --syconfdir=/etc
- When building the debian packages, use --sysconfdir=/etc.
Also, don't fudge sysconfdir in the init-ceph script.
Sig... - 03:25 PM CephFS Feature #91: mds: up:shadow mode
- Okay, this seems to be working now. Had to adjust how the Journaler treated read_pos and to fix a few of my new re-re...
- 08:49 AM Linux kernel client Bug #634: Kernel client takes too long to recover after a MDS restart
- It's also possible (though unlikely) that the client isn't getting an updated MDSMap quickly enough or that the MDS t...
- 07:58 AM Linux kernel client Bug #634: Kernel client takes too long to recover after a MDS restart
- The client doesn't 'reconnect' until the MDS reaches the up:reconnect state. That's preceeded by up:replay (journal ...
- 07:53 AM Linux kernel client Bug #634 (Can't reproduce): Kernel client takes too long to recover after a MDS restart
- [208292.940934] libceph: mds0 192.168.1.11:6800 socket closed
[208293.050282] libceph: mds0 192.168.1.11:6800 connec... - 07:46 AM Revision d4043e81 (ceph): logging: add DoutStreambuf::set_prio
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 07:14 AM Revision 6c7735f6 (ceph): logging: DoutStreambuf must handle stdout + stderr
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 07:03 AM Revision 12544a49 (ceph): logging: Add log_to_syslog option
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 06:18 AM Revision 57bcdc54 (ceph): mkcephfs: require -k; update man page
- Force users to specify keyring location; update man page accordingly.
Signed-off-by: Sage Weil <sage@newdream.net>
12/06/2010
- 11:57 PM Revision 5ac581df (ceph): Rename SyslogStreambuf -> DoutStreambuf
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:53 PM Bug #631: OSD: FileJournal::committed_thru
- Oh yes, there seems to be a issue with the IPv6 connectivity where 'noisy' is at.
I added debug journal = 20 (I in... - 10:30 PM Bug #631: OSD: FileJournal::committed_thru
- I updated the above comment but I suspect you only looked at the email notification? In any case, can you reproduce ...
- 03:13 AM Bug #631: OSD: FileJournal::committed_thru
- Yes, simply starting the OSD's again gave me the same crash on one OSD.
I've attached the log, but here are the la... - 11:38 PM Revision 9811fbd0 (ceph): logging: Replace derr with dout
- derr was really just an alias for STDERR. Unfortunately, after we call
daemonize, STDERR is connected to /dev/null. S... - 11:38 PM Revision c94e0d2d (ceph): logging: optimize with likely/unlikely macros
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:38 PM Revision d1e0a2ae (ceph): logging: debug.h: move some debug functions
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:38 PM Revision ab18aaec (ceph): logging: add g_conf.clog_to_syslog
- Add a new configuration option that allows you to send central log
messages to syslog.
Signed-off-by: Colin McCabe <... - 11:35 PM Revision ab61823e (ceph): logging: LogEntry: don't pass enums by reference
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:30 PM Revision 4ef069c3 (ceph): logging:Move LogEntry.h into common with LogClient
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:30 PM Revision 82fa7f2d (ceph): logging: LogClient: refactor handle_log_ack
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:30 PM Revision fcae8a7a (ceph): logging: MLog.h: const cleanup
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 11:29 PM Revision f2ead26e (ceph): logging: better syntax for LogClient
- Rather than having to write logclient.log(LOG_ERROR, ss), coders can now
write clog.error() << "str". Auto-flushing, ... - 11:25 PM Revision 87545d06 (ceph): configure: detect crypto++ library
- 10:01 PM Revision ebcc9395 (ceph): osd: drop not-quite-copy constructor for object_info_t
- Making a copy-like constructor that doesn't actaully copy is confusing
and error prone. In this case, we initialized... - 08:49 PM Revision d69f3dd3 (ceph): MDS: Encode a full ancestor trace on inodes, not just the immediate par...
- 07:17 PM Revision 11c7dc03 (ceph): librados: fix the C++ interface init
- 07:17 PM Revision b1afea51 (ceph): librados: fix error path in rados_deinitialize
- Signed-off-by: Colin McCabe <colinm@hq.newdream.net>
- 07:16 PM Revision aa3dda61 (ceph): librados: fix the C++ interface init
- 06:31 PM Revision 9a604816 (ceph): librados: fix C interface error handling in init code
- 06:28 PM Revision 130b8b3f (ceph): librados: fix C interface error handling in init code
- 05:59 PM Revision bf030ca2 (ceph): client: resync ioctl header from ceph-client.
- Previous change to the CEPH_IOCTL_MAGIC in fbbf448 was incorrect!
Signed-off-by: Greg Farnum <gregf@hq.newdream.net> - 12:42 PM CephFS Feature #600 (Resolved): mds: store full trace on directories
- Done in commit:d69f3dd327730a61b614c9f41f6155626bc07686. Just loops through the parents and encodes them sequentially...
- 11:04 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- If you're not running btrfs, you can also copy a pg directory to another disk and symlink it. Just be sure to preser...
- 11:01 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- Hi Fred,
We don't support deleting things manually from the object store while cosd is running.
Are you running... - 12:52 AM Bug #590: osd/PG.cc:1645: FAILED assert(info.last_complete >= log.tail || log.backlog)
- At the time, when I said "by hand", it was even worse:
As the journal was full, doing a rm -Rf while the osd was s... - 11:03 AM CephFS Bug #451 (Closed): mds: replay error
- It look like the cluster is running pretty old code (0.22~rc). This particular problem was fixed by 1c934ebd (0.23).
- 10:22 AM CephFS Bug #451 (In Progress): mds: replay error
- 03:59 AM CephFS Bug #451: mds: replay error
- <removed sensitive info>
- 10:25 AM Bug #633 (Resolved): librados crashes when init failed
- Fixed with commit:9a60481681d86065979d4353305cdaad74fe1a01
- 10:25 AM Bug #633 (Resolved): librados crashes when init failed
- When the librados init fails, specifically when using the C interface, the subsequent call to deinitialize() crashes.
- 06:20 AM Revision 4e3c2011 (ceph): Tune Debian packaging for the upcoming v0.24 release.
- Including switch OpenSSL dependency to Crypto++ as its being used instead of
the former; remove radosacl as its not c...
Also available in: Atom