Project

General

Profile

Activity

From 07/06/2010 to 08/04/2010

08/04/2010

11:45 PM Revision aa5eced1 (ceph): osd: add a mapext operation for mapping extents (fiemap)
Yehuda Sadeh
07:19 PM Revision e2bb2e79 (ceph): rbd: import using fiemap
still requires some cleanup, but basically works Yehuda Sadeh
06:40 PM Revision 5b5c0066 (ceph): osdmap: only decode internal addresses if map is new enough version
Greg Farnum
05:37 PM Revision 8c4d2689 (ceph): Merge branch 'testing' into unstable
Sage Weil
05:01 PM Revision 6f7523a0 (ceph): rados: -i and -o are still there for other commands
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:56 PM Revision a7038da7 (ceph): ceph.spec.in: fix docdir paths
_docdir is /usr/share/doc, not /usr/share/doc/ceph (as it is with debian). Sage Weil
04:51 PM Revision 1ecda511 (ceph): rados: fix usage
Takuya ASADA
04:51 PM Revision 1cf45017 (ceph): ceph.spec.in: fix sample.ceph.conf chown
Reported-by: Xiaoguang Liu <syslxg@gmail.com> Sage Weil
04:51 PM Revision ef711e2e (ceph): osd: fix heartbeat to/from map updates
- always make _inst match _to and _from (don't adjust in note_down_osd)
- only mark down old hb connection in update_...
Sage Weil
01:03 PM Bug #331 (Closed): OSD crash: OSDMap::Incremental::decode
Fixed by commit:5b5c0066f1bbfdc8c03cfacffab8969e23377f90. Only applied if you upgraded your machines with an in-place... Greg Farnum
11:58 AM Bug #331: OSD crash: OSDMap::Incremental::decode
Fixed by 5b5c0066f1bbfdc8c03cfacffab8969e23377f90 Wido den Hollander
11:31 AM Bug #331 (Closed): OSD crash: OSDMap::Incremental::decode
After upgrading to the latest unstable i'm seeing a OSD crash in my whole cluster (30 OSD's).... Wido den Hollander
12:12 PM Bug #332 (Resolved): class operations hangs when out of disk space
osds can't load class in if there's not enough space to write down the temporary library file. It should return some ... Yehuda Sadeh
10:36 AM Bug #330: Crash on OSD::_share_map_outgoing(const entity_inst_t&)
Fixed (more) by commit:ef711e2eead039b9819b8380f7b1ea6ebd84160d Sage Weil
12:59 AM Bug #330: Crash on OSD::_share_map_outgoing(const entity_inst_t&)
The commit did not work, my OSD's kept crashing.
I place three new coredumps (preserved the timestamp) in the same...
Wido den Hollander
10:25 AM Bug #247 (Resolved): osd: allow separate interface for internal cluster traffic
Merged this into unstable. Greg Farnum

08/03/2010

11:13 PM Revision fa27b31e (ceph): osd: cover map_in_progress by osd_lock instead of an individual lock.
The separate lock is no longer needed and removing it reduces lock contention
and makes it easier to avoid races else...
Greg Farnum
10:21 PM Revision 207a1bdc (ceph): osd: Unlock osd_lock in _dispatch before locking handle_map_lock
Greg Farnum
09:56 PM Revision 595543f8 (ceph): osd: when checking OSD identities via require_same_or_newer_map, use cl...
Greg Farnum
09:56 PM Revision 24989feb (ceph): OSD: use get_cluster_inst instead of get_inst when checking other OSD i...
Greg Farnum
09:56 PM Revision 42c00850 (ceph): osdmap: remove unused versions of get_[cluster_]inst
Greg Farnum
09:56 PM Revision d7d615bc (ceph): osd: use the client_messenger for the logclient.
Greg Farnum
09:56 PM Revision b588a56e (ceph): config: Error out when you can't parse an addr from the config file.
Also, fix up a few other addr parsing cases to break properly! Greg Farnum
09:56 PM Revision c26cb727 (ceph): cosd: adjust client/cluster messenger setup to be more consistent
and poison better Gregory Farnum
09:56 PM Revision dfb85d23 (ceph): osd: adjust map_in_progress guarding
Greg Farnum
09:56 PM Revision c7d23529 (ceph): osd: guard _dispatch to prevent execution of new messages while the map...
Greg Farnum
09:56 PM Revision 2fbf1e9f (ceph): osd: switching dispatch_running to be a counter
Now it works with multiple messengers dispatching. Greg Farnum
09:56 PM Revision bc9cb931 (ceph): osdmap: make room in incremental for separate cluster/client OSD addrs.
Greg Farnum
09:56 PM Revision ded7b8e2 (ceph): osd: make new cluster_messenger, client_messenger pointers.
Don't remove old messenger, just alias it to client_messenger. Greg Farnum
09:56 PM Revision 7f701120 (ceph): osdmap: add osd_cluster_addr to full map
Include it in lookups and add get_cluster_inst et al to match get_inst et al Greg Farnum
09:56 PM Revision abff0990 (ceph): osd: use cluster/client_messenger as appropriate.
Modify send_incremental_map to choose based on the type of the entity. Greg Farnum
09:56 PM Revision ef9a4eeb (ceph): osd: remove messenger pointer, fix calls to it in PG and ReplicatedPG
Greg Farnum
09:56 PM Revision ed28d5e6 (ceph): osdmap: Make sure osd_cluster_addr matches osd_addr size on decode
Greg Farnum
09:56 PM Revision 26ff6cb5 (ceph): osdmap: move osd_cluster_addr decoding, and make sure that size matches...
Greg Farnum
09:56 PM Revision 6abf48da (ceph): config: rename g_my_addr to g_public_addr, add g_cluster_addr
Greg Farnum
09:56 PM Revision eb708208 (ceph): msgr: set up bind and Accepter::bind to take in addresses and bind to them
Greg Farnum
09:56 PM Revision 7eeca9b8 (ceph): cosd: create separate cluster/client messengers if separate addresses a...
Greg Farnum
09:56 PM Revision d290a046 (ceph): cosd: move messenger existence checks to where they're useful
Greg Farnum
09:56 PM Revision 382290ef (ceph): messages: add cluster_addr to MOSDBoot, use it in OSD and OSDMonitor
Greg Farnum
09:56 PM Revision d0f7412e (ceph): config: add OPT_ADDR type, move "public_addr"/"cluster_addr" processing...
Necessary side effect: move public_addr/cluster_addr into g_conf. Greg Farnum
09:56 PM Revision d6c8f8c8 (ceph): config: correctly notify when OPT_ADDR config options are found.
Greg Farnum
09:56 PM Revision 582f9285 (ceph): osd: register the OSD on cluster_messenger if needed
Greg Farnum
09:56 PM Revision 28b54973 (ceph): osd: be more careful with cluster/client messenger use.
Greg Farnum
09:56 PM Revision cec513ff (ceph): osd: add Mutex and flag to cover execution of handle_map_lock.
Fixes a crash when the cluster and client messenger both dispatch an
OSDMap and their handling gets intertwined.
Greg Farnum
06:07 PM Revision 4ea24d78 (ceph): osd: fix replies to dup/already-committed requests
We can safely reply if oldv is _older_ than what's on disk. Otherwise the
request ends up on the list and never gets...
Sage Weil
04:43 PM Revision 930ef4e5 (ceph): Merge branch 'testing' into unstable
Sage Weil
04:42 PM Revision 22a7d42b (ceph): assert: only include acconfig.h ifdef __CEPH__
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:40 PM Revision 0ffb08c8 (ceph): librados.hpp fix
[ The following text is in the "UTF-8" character set. ]
[ Your display is set for the "iso-8859-1" character set....
Takuya ASADA
04:39 PM Revision 4bcba121 (ceph): osd: fix map sharing with old heartbeat peers
The addr is in old_inst, since it's an old peer. Fixes a crash like
osd/OSD.cc: In function 'void OSD::_share_map_o...
Sage Weil
04:39 PM Revision be7d84b2 (ceph): mon: fix signed/unsigned compile warning
Sage Weil
04:17 PM Bug #247: osd: allow separate interface for internal cluster traffic
Am running the osd_msgrs branch in qa today, unless that finds problems that weren't exposed by a bunch of system sta... Greg Farnum
03:45 PM CephFS Bug #316: mds: track dirty snapped metadata without faking a cap
this illustrates the problem:... Sage Weil
10:57 AM Feature #138: Try out tcmalloc
let's turn this on for cmds and cosd.
and update configure.ac to detect it.
and set debian/control and ceph.spec...
Sage Weil
10:56 AM CephFS Feature #321: allow dns names for monitor switch
see MonClient::create_initial_monmap, and mount/mount.ceph.c for an example of using getaddrinfo() Sage Weil
10:52 AM Feature #82: mon: osd failure smarts
A simple approach would be to index the osd peers/heartbeat graph in the PGMonitor and only mark an osd down when som... Sage Weil
10:10 AM Linux kernel client Feature #23 (Resolved): fcntl/flock advisory lock support
This stuff got pushed into unstable yesterday. Greg Farnum
09:47 AM Bug #330 (Resolved): Crash on OSD::_share_map_outgoing(const entity_inst_t&)
fixed by commit:4bcba121ac165939b3a825564d67fbc1f2e74cf1 Sage Weil
01:46 AM Bug #330: Crash on OSD::_share_map_outgoing(const entity_inst_t&)
I got the cluster working again, by starting it in the following order:
* Kill monitor and mds
* Start all the OS...
Wido den Hollander
01:22 AM Bug #330 (Resolved): Crash on OSD::_share_map_outgoing(const entity_inst_t&)
When upgrading to the latest unstable all my OSD's (30 in total) crashed with the following message:... Wido den Hollander
12:07 AM CephFS Bug #312: MDS crash: LogSegment::try_to_expire(MDS*)
I just saw this crash again, uploaded the log (mds.0.log.1.gz) and corefile (core.cmds.node13.18006) to *logger.ceph.... Wido den Hollander

08/02/2010

11:27 PM Revision 6573635b (ceph): mds: fix cap encoding skipping when snaprealm differs
We were avoiding adding a new cap, but _any_ issue is problematic, not just
adds (which we can't really be sure about...
Sage Weil
11:01 PM Revision ccdfcd39 (ceph): osdmap: show flags
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:36 PM Revision d9a97ac4 (ceph): Merge branch 'locks' into unstable
Greg Farnum
10:35 PM Revision aed240ca (ceph): ceph_fs: use __le64 instead of __u64 for over-the-wire data
Greg Farnum
10:35 PM Revision 0b964c3d (ceph): msgr: Add CEPH_FEATURE_FLOCK to default features set, remove extraneous...
Greg Farnum
10:35 PM Revision f30efb28 (ceph): mds: new low-level debug output, and turn down previous debugs
Greg Farnum
08:27 PM Revision 3e42191b (ceph): debian: add ceph-client-tools dep to ceph
Signed-off-by: Thomas Mueller <thomas@chaschperli.ch>
Signed-off-by: Sage Weil <sage@newdream.net>
Thomas Mueller
08:26 PM Revision f725f203 (ceph): debian: move utils rbd,rados,csyn to ceph-client-tools
Signed-off-by: Thomas Mueller <thomas@chaschperli.ch>
Signed-off-by: Sage Weil <sage@newdream.net>
Thomas Mueller
08:26 PM Revision c61a6c32 (ceph): debian: move mount.ceph to ceph-client-tools
Signed-off-by: Thomas Mueller <thomas@chaschperli.ch>
Signed-off-by: Sage Weil <sage@newdream.net>
Thomas Mueller
08:26 PM Revision 3fc47c2e (ceph): Merge branch 'testing' into unstable
Sage Weil
08:26 PM Revision 9ccf9cb6 (ceph): debian: move fuse-utils dep to ceph-fuse
Reported-by: Thomas Mueller <thomas@chaschperli.ch>
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:23 PM Revision 4a7c3553 (ceph): osd: avoid most of handle_osd_map if there are no new maps in message
We don't want to take the time to flush things to disk and shut down all
the worker threads if there is nothing new i...
Sage Weil
08:23 PM Revision 9bfb8da9 (ceph): osd: send latest osdmap when we stop sending heartbeats to peer
Otherwise, they may still expect them, and may mark us down. Sage Weil
08:23 PM Revision baa69ad7 (ceph): cclass: fix .so loading file
The only thing I've found, is that the *.so files are included in the -devel
rpm. This leads to the fact that the 'fi...
Christian Brunner
08:14 PM Revision 025bce64 (ceph): osd: send heartbeat request from update_heartbeat_peers
This keeps our from_stamps in a consistent state, and sends the hb request
sooner rather than later.
Signed-off-by: ...
Sage Weil
08:13 PM Revision 90a2f201 (ceph): mon: make backward-compat mon id's start at '0', not 'a'
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:13 PM Revision 30cdc3dd (ceph): mon: drop redundant pg_temp mappings
Sage Weil
07:32 PM Revision 88c17f42 (ceph): osd: add asserts when indexing pg log by reqid
This should help catch callers that submit dup entries with the same
request id, causing other problems down the line.
Sage Weil
05:39 PM Revision df813369 (ceph): mds: locking. remove zero-valued client held/waiting lock counts from t...
In the unlikely event that you have a very large cluster doing a lot of
locking of different files amongst the whole ...
Greg Farnum
05:39 PM Revision b443f130 (ceph): mds: locking. Keep trap of neighbors and coalesce them
Greg Farnum
05:39 PM Revision 8dd1960b (ceph): mds: locks. Fix possible naughty dereference, and fix existence checks
Greg Farnum
05:39 PM Revision 2cfbddc2 (ceph): mds: Fix ceph_lock_state_t to use uint64_t
Greg Farnum
05:39 PM Revision 0d4bfea5 (ceph): mds: locking. remove ceph_filelock* lists and replace with iter lists
This allows for more efficient deletion of elements. Greg Farnum
05:39 PM Revision fa058c41 (ceph): mds: handle file locking in MClientCaps on a reconnect.
Greg Farnum
05:39 PM Revision fe0c5119 (ceph): mds: need to size values read from a bufferlist
Greg Farnum
05:39 PM Revision 5850363d (ceph): mds: Determine if there are file locks based on flockbl size, not dirty...
Greg Farnum
05:39 PM Revision b4e9e4c0 (ceph): mds: replace accidnetal __u64 with uint64_t
Greg Farnum
05:39 PM Revision 40bb86ab (ceph): mds: better printing on lock messages
Greg Farnum
05:39 PM Revision 8d85f344 (ceph): mds: drop distributed locks on waiting filelock messages, so they don't...
Greg Farnum
05:39 PM Revision c0eccba3 (ceph): mds: locking. Fix waiting lock count map deletion
Greg Farnum
05:39 PM Revision eb669ad2 (ceph): mds: locking. Compare iterator to right list!
Greg Farnum
05:39 PM Revision e501da4b (ceph): mds: debug output and some comments.
Greg Farnum
05:39 PM Revision e555ac67 (ceph): mds: recover filelocks properly on client reconnect
Greg Farnum
05:39 PM Revision eb3d9b44 (ceph): msg: rename MClientReply::dir_bl to extra_bl. Fix uses
Greg Farnum
05:39 PM Revision aab26ff6 (ceph): mds: add messenging support for advisory file locking
Greg Farnum
05:39 PM Revision f60e20e5 (ceph): mds: fcntl. add skeleton [add, remove]_lock functions.
Implement helper functions to be used later. Greg Farnum
05:39 PM Revision b75fe3b8 (ceph): mds: fcntl. implement add_lock function
Still need to write some of the helpers Greg Farnum
05:39 PM Revision 27ff76bd (ceph): mds: fnctl. implement adjust_locks method, with helpers
Greg Farnum
05:39 PM Revision 1fe8ab34 (ceph): mds: fnctl. implement half of remove_lock.
Still need to handle all the waiting_locks and set them if proper. Greg Farnum
05:39 PM Revision e41056ac (ceph): mds: fnctl. add get_waiting_overlaps function; fix get_overlapped locks
so that it won't do bad things if there aren't any. Greg Farnum
05:39 PM Revision 18b1d772 (ceph): mds: fnctl. finish remove_locks so it'll activate whichever locks it can.
This is lazy but it should work. Greg Farnum
05:39 PM Revision b7b349e2 (ceph): mds: fcntl. add proper encoding to ceph_filelock and ceph_lock_state_t
Greg Farnum
05:39 PM Revision 4520fdfa (ceph): mds: add distributed lock support for fcntl/flock support
Greg Farnum
05:39 PM Revision 4ea76e8d (ceph): mds: Implement file lock message handling.
Greg Farnum
05:39 PM Revision 68e592a8 (ceph): mds: cleanup
Sage Weil
05:39 PM Revision 5d375ea7 (ceph): mds: ceph_filelock and filelock_change need to be explicitly sized
Greg Farnum
05:39 PM Revision 0dbcdd84 (ceph): mds: implement look_for_lock, for fcntl F_GETLK-style things
Greg Farnum
05:39 PM Revision cc652fe7 (ceph): mds: fcntl. Keep track of how many locks/waiting locks each client has.
Greg Farnum
05:39 PM Revision 71cf9d6a (ceph): mds: fcntl locks can efficiently remove all locks from one client.
Greg Farnum
05:39 PM Revision 80412537 (ceph): mds: Now actually requeues blocked locks whenever locks are removed.
Greg Farnum
05:39 PM Revision 6a3600ae (ceph): mds: Remove unnecessary if check.
We already know that head_in exists, or we would have entered the previous
if-block and returned from the function.
Greg Farnum
05:39 PM Revision c14fa1d5 (ceph): ceph_fs: Add cap bits for advisory file locks
Greg Farnum
05:39 PM Revision cc845462 (ceph): mds: use the right size mask for file cap bits
Greg Farnum
05:39 PM Revision 367ff978 (ceph): ceph_fs: CEPH_LOCK_UNLOCK should have its own bit, for safety
Greg Farnum
05:39 PM Revision 8315b5db (ceph): mds: Move lock print function; add debugging; fix fencepost errors.
Greg Farnum
05:39 PM Revision a11a3197 (ceph): mds: conditionally include flock metadata in MClientCaps
Sage Weil
05:39 PM Revision fd61e1b4 (ceph): mds: conditionally include flock metadata in MClientReconnect
mds: move flockbl into each reconnect record
Simpler for the client side.
mds: keep old reconnect struct
mds: put ...
Sage Weil
05:21 PM Revision 6be6c48e (ceph): mds: remove size, mtime, atime from cap reconnect
Sage Weil
04:57 PM Revision 2136ee76 (ceph): mds: make mislinked dentry during replay workaround tolerate directorie...
This should probably all go away when the original bug is fixed! Sage Weil
04:28 PM Linux kernel client Bug #283 (Resolved): ceph_add_cap: couldn't find snap realm, NULL ptr deref
This is a server-side problem with CInode::encode_inodestat, fixed by commit:6573635ba48a9b6c4f364e8f8b7132c90ea2e8e9... Sage Weil
04:15 PM Revision 67236ae7 (ceph): move logrotate conf out of debian/ to fix ceph.spec
Sage Weil
02:12 PM Bug #306 (Resolved): osd: adjusting replication should be fast, keep data available
This was a problem with handle_osd_map doing all the slow steps even when there was no new information in the osdmap.... Sage Weil
02:11 PM Bug #282 (Resolved): osd: heartbeat can't keep up with large cluster changes
Sage Weil
02:11 PM Bug #282: osd: heartbeat can't keep up with large cluster changes
fixed what i think is the last issue here in commit:9bfb8da9f925642bca46528a999124cd8b28ba2a Sage Weil
01:47 PM Cleanup #324 (Resolved): standalone ceph-client-tools package
Sage Weil
12:15 PM Bug #326: OSD crash PG::IndexedLog::unindex
Saw this crash again, just added a new core file (core.node10.2629) to the logger machine. Also uploaded the log from... Wido den Hollander
10:04 AM CephFS Bug #328 (Resolved): MDS crash: MDCache::remove_inode(CInode*)
Fixed the replay workaround in commit:2136ee763659e84f5715974450b89e8dea31a717
The original source of the problem,...
Sage Weil
09:43 AM CephFS Bug #328 (Resolved): MDS crash: MDCache::remove_inode(CInode*)
Today i tried to sync kernel.org again, this went fine, until my log parition filled up and mds0 got stuck.
Someho...
Wido den Hollander
10:03 AM CephFS Bug #329 (Resolved): mds: mislinked dentry found during journal replay
There is a FIXME error that is logged during replay when we encounter a dentry that is already linked and a journal e... Sage Weil

07/31/2010

04:14 PM Revision 1d221964 (ceph): auth: fix cast
Wido den Hollander
12:14 AM Revision e3721638 (ceph): rgw: Support a loglevel of 0
Wido den Hollander

07/30/2010

11:35 PM Revision 309f0652 (ceph): cauthtool: correct man page sample of minimal fs client caps
Sage Weil
11:12 PM Revision 71532447 (ceph): rgw: fix setting acls on the bucket
Yehuda Sadeh
07:06 PM Revision b502be7a (ceph): move mkcephfs to /sbin
Sage Weil
07:05 PM Revision 7465b5e3 (ceph): debian: more cleanups
https://code.launchpad.net/~clint-fewbar/review-new-branches/ceph-new-pkg/+merge/31303 Sage Weil
05:24 PM Revision 8b8fd7e9 (ceph): auth: define AES iv in common header
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:34 PM Bug #327 (Closed): OSD crash PG::IndexedLog::print
I'm not sure if this is the same issue as #326, that's why i open a new issue.
At the same moment #326 took place,...
Wido den Hollander
04:27 PM Bug #326 (Resolved): OSD crash PG::IndexedLog::unindex
I've just seen this crash on one of my OSD's running with the latest unstable.
Have no idea what went wrong (Was j...
Wido den Hollander
04:19 PM Revision 325b41a3 (ceph): add files to .gitignore
add generated files to gitignore.
Signed-off-by: Thomas Mueller <thomas@chaschperli.ch>
Signed-off-by: Sage Weil <sa...
Thomas Mueller
04:18 PM Revision bcbad88d (ceph): redirect stdout to /dev/null when reloading
the reload command prints to stdout. example:
# service ceph reload
=== mon.0 ===
Reloading Ceph mon0 on hoe001...ki...
Thomas Mueller
09:25 AM Bug #325 (Can't reproduce): writes starve reads
From: Henry C Chang <henry_c_chang@tcloudcomputing.com>
To: ceph-devel@vger.kernel.org
Subject: writes-starving-rea...
Sage Weil
09:25 AM Cleanup #324 (Resolved): standalone ceph-client-tools package
Sage Weil

07/29/2010

10:49 PM Revision b72c1bb6 (ceph): monclient: use default port if not specified via -m
Reported-by: Thomas Mueller <thomas@chaschperli.ch>
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:32 PM Revision 728c7669 (ceph): mon: we support MONNAMES feature
Sage Weil
08:24 PM Revision 9052986f (ceph): Merge branch 'unstable' into mon-remove
Conflicts:
src/cmon.cc
src/include/ceph_fs.h
src/mon/MonClient.cc
src/mon/Monitor.cc
Sage Weil
06:37 PM Revision 11dc1af1 (ceph): mon: fix signed/unsigned compile warning
Sage Weil
06:32 PM Revision 719aa186 (ceph): debian: add update_pbuilder.sh script; don't do it during build
Sage Weil
04:38 PM Revision f3ac22ab (ceph): bump unstable version to v0.22~rc
Sage Weil
04:37 PM Revision ea2ce9d4 (ceph): cclass.in: fix libdir definition
Yehuda Sadeh
04:37 PM Revision 090436f5 (ceph): v0.21
Sage Weil
04:29 PM Revision fd4029cb (ceph): debian/ceph.logrotate: enhance postrotate for "service" util
[ The following text is in the "UTF-8" character set. ]
[ Your display is set for the "iso-8859-1" character set....
Thomas Mueller
04:17 PM Revision 6db89756 (ceph): mon: trim old logm states
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:14 PM Revision 183984e1 (ceph): ceph.spec.in: create /var/log/ceph/stat dir
the logrotate config expects a var/log/ceph/stat dir - create it.
Signed-off-by: Sage Weil <sage@newdream.net>
Thomas Mueller
04:14 PM Revision e1a97685 (ceph): debian: ceph-cfuse: move ceph from Recommends to Suggests
on a standalone fuse client ceph serverside is not needed. if it's
"Recommended" the default behaviour is to install ...
Thomas Mueller
03:50 PM CephFS Feature #321: allow dns names for monitor switch
the port part of htis is fixed by commit:b72c1bb6e9b77e1ab6c21bfd761f7fe3cc493647 Sage Weil
09:27 AM CephFS Feature #321 (Resolved): allow dns names for monitor switch
if the -m switch is set to a dns name cfuse fails:
# cfuse -m node001:6879 /mnt
unable to parse addrs in 'node001...
Thomas Mueller
02:45 PM Bug #323 (Duplicate): Get the wireshark dissector up to date
Yehuda Sadeh
02:16 PM rbd Bug #322: Can't create an image via rbd or qemu-kvm
My bad, this is a duplicate of #263, but the solution of #263 doesn't apply here. Wido den Hollander
02:12 PM rbd Bug #322 (Closed): Can't create an image via rbd or qemu-kvm
... Wido den Hollander
01:28 PM Feature #195 (Resolved): monitor naming and removal support
merged in commit:9052986fd3fb5fc065fd9781bbfdb131194e845a Sage Weil
04:57 AM rgw Feature #313: Support loglevel
One small glitch in the patch, setting a loglevel of 0 was not supported due to a wrong if-condition.
Attached a p...
Wido den Hollander
04:51 AM rgw Cleanup #320 (Resolved): Remove some duplicates and unused declarations
I've done some small cleanups in the gateway, mostly unused declarations, double things or just unneeded comment line... Wido den Hollander
04:04 AM Revision ce07f280 (ceph): ceph.spec.in: rbdtool -> rbd
Reported-by: markn1 Sage Weil

07/28/2010

11:28 PM Revision b0616f85 (ceph): debian: drop gcc, g++ build deps
Sage Weil
11:25 PM Revision 47688a43 (ceph): debian: fix weak-library-dev-dependency
E: ceph source: weak-library-dev-dependency librados1-dev on librados1
E: ceph source: weak-library-dev-dependency li...
Sage Weil
11:25 PM Revision 07d880ea (ceph): debian: update to latest standards version
Sage Weil
11:25 PM Revision 39a3db6b (ceph): debian: fix debhelper depend version
dh_lintian was introduced in 6.0.7 Sage Weil
11:25 PM Revision 58181fe5 (ceph): debian: add trailing newline
Sage Weil
11:25 PM Revision 487f0e85 (ceph): debian: remove gcc/g++ build-depends
(as per Clint Byrum) Sage Weil
11:25 PM Revision 9c6ae54c (ceph): release: drop wildcard
Sage Weil
10:41 PM Revision 3ed08a33 (ceph): check_version: don't globber .git_version if we're not in a git repo
namely, when you extract the release tarball! Sage Weil
10:00 PM Revision 8b0bf8e1 (ceph): filestore: add option to disable use of btrfs clone range ioctl
Sage Weil
09:57 PM Revision eb1f5874 (ceph): filestore: fix fallback clone_range implementation on short files
If we read less data from the source, don't loop forever... return -ERANGE
instead.
Sage Weil
08:57 PM Revision 43d28dbc (ceph): release: put .git_version in release tarball
Sage Weil
08:37 PM Revision 9b8e2b4f (ceph): pgmon: Remove creating_pgs which are from deleted pools.
Greg Farnum
07:47 PM Revision 425e052e (ceph): osd: drop broken (and useless) PG::merge_log assert
The assert should be olog.backlog, not log.backlog. But it's the same
condition as the if guard that's 3 lines up, m...
Sage Weil
06:55 PM Revision 05a18e20 (ceph): mds: handle this log replay error better:
Log it to the central log so it gets noticed, and
delete the "wrong inode" instead of just unlinking it, to prevent l...
Greg Farnum
06:41 PM Revision 801150a7 (ceph): rgw: Support loglevel settings
The gateway now supports changing the verbosity of the logging, this can be achieved by changing the
RGW_LOG_LEVEL en...
Wido den Hollander
06:41 PM Revision 593937d6 (ceph): librados: quiet down some log messages
Yehuda Sadeh
06:41 PM Revision b01cc38f (ceph): rgw: set default log level to 20
Yehuda Sadeh
11:56 AM CephFS Bug #312 (Can't reproduce): MDS crash: LogSegment::try_to_expire(MDS*)
Unfortunately we can't solve the root cause of this because we don't have detailed logging when the inodes in questio... Greg Farnum
10:12 AM CephFS Bug #312 (In Progress): MDS crash: LogSegment::try_to_expire(MDS*)
Greg Farnum
09:54 AM rgw Feature #313: Support loglevel
I gave this some work today and created a patch, see the attachment.
The gateway will read the evirionment variabl...
Wido den Hollander

07/27/2010

08:22 PM Revision 6bd40ac9 (ceph): qa: consistent snaptest-%d.sh naming
Sage Weil
07:53 PM Revision 9127cd9d (ceph): mds: fix uninitialized LeaseStat for null lease
./include/encoding.h: In member function void
Locker::issue_client_lease(CDentry*, client_t, ceph::bufferlist&, utime...
Sage Weil
07:49 PM Revision 3de9c8d2 (ceph): client: some whitespace cleanup
Sage Weil
07:49 PM Revision 7fbe1655 (ceph): debug: no name symlink when explicit --log-file
Sage Weil
07:49 PM Revision 5c714bfb (ceph): osd: log when we get marked down but aren't
Sage Weil
07:13 PM Revision 8195899a (ceph): qa: add localized version of Thomas Mueller's snaptest-2.sh
http://github.com/vinzent/ceph-testsuite/blob/master/tests/snaptest-2 Greg Farnum
06:18 PM Revision 2d35d247 (ceph): rgw: exit after 5 seconds from SIGUSR1 anyway
Yehuda Sadeh
05:28 PM Revision a2ba0d35 (ceph): rgw: make sure error structure is initialized
Yehuda Sadeh
05:28 PM Revision 58cf95bb (ceph): rgw: Make sure all times are converted to GMT, so we can compare them c...
Wido den Hollander
05:28 PM Revision 3d1f3072 (ceph): rgw: fix returned error code in If-None-Match
Yehuda Sadeh
05:28 PM Revision 50a89fcb (ceph): rgw: handle sigusr1, get notified for shutdown requests
Yehuda Sadeh
05:28 PM Revision a48b1504 (ceph): rgw: fix that we return data when partial content is requested
Wido den Hollander
05:28 PM Revision 0acfd014 (ceph): rgw: support partial content (206) response
Couldn't get this one to work yet, as apache (or probably one of
the apache filters) rewrites the content-length valu...
Yehuda Sadeh
05:28 PM Revision 234b6916 (ceph): We shoud always return the Accept-Ranges header, with a GET or HEAD req...
This is to let the client know we accept requests for partial content.
On HEAD requests we should also return the Co...
Wido den Hollander
05:28 PM Revision 017588ed (ceph): We should sent back the Last-Modified header in GMT as specified by RFC...
Wido den Hollander
05:28 PM Revision 577095b3 (ceph): rgw: fix error status code
Yehuda Sadeh
05:28 PM Revision d9ff7e09 (ceph): rgw: fix return code for prepare_get_obj() on the fs storage
Yehuda Sadeh
05:28 PM Revision b628ba19 (ceph): rgw: fix date/time format string
Wido den Hollander
02:14 PM CephFS Bug #319 (Closed): allows snaps in root directory
Sage Weil
12:57 PM Linux kernel client Bug #317 (Resolved): sync hangs when doing concurrent syncs
Fixed by commit:e33b35cf88cd06d69332bd399d5720edc85c17fa.
Problem was that we didn't use complete_all and wake_up_...
Yehuda Sadeh
11:38 AM Linux kernel client Bug #317 (Resolved): sync hangs when doing concurrent syncs
For example, running iozone and doing a sync concurrently, sync will hang (as iozone does sync internally). Yehuda Sadeh
12:01 PM CephFS Bug #288 (Closed): cmds disappears under snapshot load
All right, it works on one MDS. Opened #318 to track issues with the multi-mds cluster. Greg Farnum
12:01 PM CephFS Bug #318 (Resolved): Multi-MDS snaptest-2 fails
See #288. Looks like a problem with flushing (or other handling) of snapshotted inodes. Greg Farnum
11:11 AM CephFS Bug #316 (Resolved): mds: track dirty snapped metadata without faking a cap
Sage Weil
10:57 AM rgw Feature #174: Support large files better
Probably the ETag calculation is not being done right. Should be done in chunks too. Yehuda Sadeh
10:56 AM rgw Feature #302 (Resolved): Support If-Modified-Since
Yehuda Sadeh
10:54 AM rgw Bug #310 (Resolved): rgw fcgi processes don't go down
Should be fixed now with commit:50a89fcb5206e5afde88c01cd5370518b86c470d and commit:2d35d247844d030fe628b227c836ff657... Yehuda Sadeh
10:47 AM Feature #315 (Resolved): mds disaster recovery tools
things like
- extract mds journal
- edit/wipe mds journal
Sage Weil
10:47 AM Feature #314 (Closed): osd disaster recovery tools
things like:
- edit or wipe pg logs
- "hide" problematic pgs to get the system online
- throttle osd cluster recov...
Sage Weil
10:14 AM Linux kernel client Feature #206: make a 'soft' mode
Make 'hard' and 'soft' mount options, ala nfs. Default is 'hard'.... Sage Weil
10:07 AM rgw Bug #309 (Resolved): Error status is not correctly initialized
Yehuda Sadeh
07:29 AM rgw Bug #309: Error status is not correctly initialized
I can confirm it has been fixed. States now work, no more "caching" like i saw. Wido den Hollander
10:03 AM rgw Feature #301 (Resolved): Support partial content response
Yehuda Sadeh
10:03 AM rgw Feature #301: Support partial content response
Great! I modified your fix a bit. We should only return the data if there's no actual error, and pushed it. Yehuda Sadeh
12:49 AM rgw Feature #301: Support partial content response
There was still a problem that with partial content no data was returned. This was due to a error in rgw_rest.cc.
...
Wido den Hollander
08:06 AM rgw Feature #313 (Resolved): Support loglevel
Like spoken about yesterday, it would be usefull if we could set a loglevel for the RADOS Gateway.
We would use th...
Wido den Hollander
04:33 AM CephFS Bug #312 (Can't reproduce): MDS crash: LogSegment::try_to_expire(MDS*)
This morning i upgraded my cluster to the latest unstable, afterwards i tried to mount the cluster, which failed.
...
Wido den Hollander

07/26/2010

11:43 PM Revision e2b1a4ee (ceph): mds: Use get_oldest_snap() (not first) in handle_client_lssnap.
Greg Farnum
11:13 PM Revision 49c21837 (ceph): rgw: remove unused label 'done' (compiler warning)
Greg Farnum
10:58 PM Revision 0ae06c81 (ceph): msgr: fix direct send to loopback connection
If we send directly to the local/loopback connection, we need to
queue it for local delivery, not go through the usua...
Sage Weil
10:03 PM Revision 997d67e5 (ceph): mon: revamp subscribe protocol [backward compatible protocol change]
Before, we would provide "have" and a bool "onetime" flag. The struct was
also screwed up with an extra __le64. The...
Sage Weil
06:44 PM Revision fa48a1a8 (ceph): mds: restore CInode::get_oldest_snap()
It's either 'first', for the first for the oldest old_inode. Sage Weil
05:27 PM Revision bc9a7955 (ceph): mds: defer anchortable updates before CLIENTREPLAY, not ACTIVE
Otherwise we hang trying to replay an op that interacts with the
anchortable.
Sage Weil
05:24 PM Revision ad6f92c0 (ceph): Merge remote branch 'origin/large_objects' into unstable
Sage Weil
05:24 PM Revision 0775c30b (ceph): mds: get rid of old lease mask cruft
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:24 PM Revision ac9305f8 (ceph): mds: fix lease revocation mask vs type
Sage Weil
04:43 PM CephFS Bug #288: cmds disappears under snapshot load
Switched back to using get_oldest_snap, works on a single-MDS install as of commit: e2b1a4ee119a68b403582ae3bc15b54e9... Greg Farnum
03:23 PM RADOS Cleanup #311 (New): osd: remove read(len=0) full object behavior
..,after the objecter doesn't need it (see #303) Sage Weil
03:23 PM Feature #175 (Resolved): Make the system large-object safe
the recovery portion of this is done.
Remaining bits include #303, and then removing the '0 == full object' logic ...
Sage Weil
03:22 PM Bug #308 (Resolved): osd can't get full map history
resolved by commit:997d67e5b1ce6df1b467a9c8b284f1289fbee22a Sage Weil
12:10 PM Bug #308 (Resolved): osd can't get full map history
10.07.26_20:55:17.810053 7f424cf8d710 osd10 0 handle_osd_map epochs [78,78], i have 0
10.07.26_20:55:17.810093 7f424...
Greg Farnum
03:05 PM rgw Feature #302: Support If-Modified-Since
The patch seems fine, just a small issue is that you changed the return code in the unmodified-since case. This would... Yehuda Sadeh
02:01 PM rgw Feature #302: Support If-Modified-Since
See the attached patch, this will convert all times to GMT, so they can be compared correctly. Wido den Hollander
03:20 AM rgw Feature #302: Support If-Modified-Since
There is some more to this and it might need some more work to do so.
Right now, all the times stored by the RADOS...
Wido den Hollander
02:58 PM rgw Bug #310 (Resolved): rgw fcgi processes don't go down
Need to implement a signal handler for the SIGUSR1 signal, to receive apache requests to go down. Yehuda Sadeh
02:57 PM rgw Bug #309: Error status is not correctly initialized
should be fixed by commit:21c8144c0d631530704acb1f08f6dddfa6c83eb1
Yehuda Sadeh
02:55 PM rgw Bug #309 (Resolved): Error status is not correctly initialized
Reported by wido: once getting a "304" response, we keep on getting that error, even when operating on different obje... Yehuda Sadeh

07/24/2010

04:50 PM Revision 9932e4b4 (ceph): osd: on dups, reply if committed, or wait until ondisk
Sage Weil
04:49 PM Revision 47fd7fe2 (ceph): osd: helper should reply with ACK on error, op flags on success
Sage Weil
04:37 PM Revision b0332590 (ceph): osd: maintain pg last_update_ondisk
Only defined when is_active(). Only printed when it lags (i.e. we are
in the process of replicating and/or writing a...
Sage Weil
04:37 PM Revision fb359c8e (ceph): osd: map indexed caller_ops back to relevant entry
This let's us look up a version from the osd_reqid_t. Sage Weil
10:11 AM Bug #307 (Resolved): osd: fix wrnoop on dup op
commit:9932e4b40713df2e33f53583274d9ff35adca03b Sage Weil

07/23/2010

11:05 PM Revision 3733b177 (ceph): initscripts: fix type.id check in ceph.conf
Sage Weil
10:53 PM Revision 8f9f57a2 (ceph): Merge branch 'unstable' into large_objects
Sage Weil
10:52 PM Revision b2194357 (ceph): osd: make assemble_backlog more tolerant of races
The assemble_backlog is merging data generated while racing with online
updates. It needs to tolerate races with thi...
Sage Weil
10:51 PM Revision dd684835 (ceph): osd: clean up size learning during pull
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:51 PM Revision 1d0c91d5 (ceph): osd: populate in-memory snapset context with just-pulled SnapSet
This can come from the SNAPDIR or NOSNAP (head) object.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:51 PM Revision c6111924 (ceph): osd: create missing temp collection/dir
Sage Weil
10:51 PM Revision b59cb10d (ceph): osd: skip degraded objects we're already recoverying in recover_replicas
Otherwise we'll start_recovery_op, dup the push, etc.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:50 PM Revision 9520917d (ceph): mds: drop remove_client_lease return value
Unused Sage Weil
10:50 PM Revision 74fcb6bf (ceph): mds: fix dentry lease mask/lockid disconnect
There is only one lock here, no need to be tricky with the mask. Sage Weil
10:50 PM Revision c33171c0 (ceph): mds: no return value for issue_client_lease()
Return value is unused. Also drop obsolete inode lease method declaration.
Signed-off-by: Sage Weil <sage@newdream....
Sage Weil
10:50 PM Revision 34957ff3 (ceph): mds: fix lease issue mask
We define 1 to be the only lease "mask" we currently support (for dentry)
and divorce ourselves from the CEPH_LOCK na...
Sage Weil
10:50 PM Revision 15abbfbe (ceph): mds: create leases with non-zero seq
We want to avoid the situation:
- mds issues lease, seq 0
- mds times out lease, seq 0
- client sends seq 0 release
...
Sage Weil
10:50 PM Revision 92273f70 (ceph): osd: log/debug hb set changes
Sage Weil
10:50 PM Revision b33b53b1 (ceph): osdmap: recalc num_osds after applying incremental
Sage Weil
10:50 PM Revision 4ab66fa4 (ceph): osd: clean_up_local should ignore snapped, snapdir objects
The snap_trimmer is responsible for removing any/all snap objects.
snapdir only exists if snapped objects exist and ...
Sage Weil
10:50 PM Revision fdd13cb2 (ceph): filestore: make stat print correct errno
Sage Weil
10:50 PM Revision 622e9b47 (ceph): osd: simplify heartbeat checks
- Only check heartbeats when we have heartbeat_lock and osdmap rdlocked,
and thus _know_ heartbeat info and map are i...
Sage Weil
10:49 PM Revision 9e3650be (ceph): osd: red is good enough; don't blink
Sage Weil
09:37 PM Revision f731d4d6 (ceph): mds: fix snap lookups in CInode encoding
Greg Farnum
09:37 PM Revision 7bba8447 (ceph): qa: runoncfuse shouldn't start cfuse in background any more.
Greg Farnum
09:37 PM Revision 9c1ceb1e (ceph): mds: only try and lssnaps on a dir for as long as it's been around
Greg Farnum
08:21 PM Revision ff53e9b3 (ceph): mon: don't crash on 'osd pool set' command
Sage Weil
06:46 PM Revision e986721b (ceph): osd: zero ondisklog pointers when starting pg deletion
This fixes a problem where the osd stops part way through pg cleanup. It
seens the old ondisklog bounds, but then fa...
Sage Weil
06:43 PM Revision 16c0fea2 (ceph): librados: remove use of snapid_t
Yehuda Sadeh
04:42 PM Revision 6b7693bf (ceph): objecter: default debug level 0 (no 'laggy' messages)
Sage Weil
04:31 PM Bug #307 (In Progress): osd: fix wrnoop on dup op
Sage Weil
03:27 PM Bug #307 (Resolved): osd: fix wrnoop on dup op
- need to return success, but not until the request has definitely committed.
- shouldn't do the old noop nonsense.....
Sage Weil
02:35 PM rgw Feature #302: Support If-Modified-Since
Seems that the times were not parsed correctly and converted to GMT.
Attached patch will fix this and make the if-...
Wido den Hollander
01:20 PM rgw Feature #302: Support If-Modified-Since
Yes, Amazon does that. The RFC specifies that when you do so, you have to return a 304 Not Modified header.
Like y...
Wido den Hollander
09:52 AM rgw Feature #302: Support If-Modified-Since
Actually we do implement the If-Modified-Since, if it's not working that's a bug. One difference is that we don't ret... Yehuda Sadeh
08:24 AM rgw Feature #302 (Resolved): Support If-Modified-Since
"The If-Modified-Since request-header field is used with a method to make it conditional: if the requested variant ha... Wido den Hollander
02:13 PM Bug #306 (Resolved): osd: adjusting replication should be fast, keep data available
it's currently not... something isn't working as intended. Sage Weil
01:24 PM Bug #305 (Resolved): Monitor crash on "ceph osd pool set"
fixed in commit:ff53e9b33d1911d047d8be0cdaa65e1e6c62d7f1 Sage Weil
01:18 PM Bug #305 (Resolved): Monitor crash on "ceph osd pool set"
When running "ceph osd pool set" with no arguments, the monitor who receives this command crashes.... Wido den Hollander
01:13 PM Linux kernel client Bug #304: GPF in writepages_finish
the first crash is addr.c:534,
WARN_ON(!PageUptodate(page));
bad page pointer page=5b5b5b.., i=0.
Sage Weil
01:13 PM Linux kernel client Bug #304: GPF in writepages_finish
another node got this, probably the same bug?... Sage Weil
01:02 PM Linux kernel client Bug #304 (Can't reproduce): GPF in writepages_finish
untar_snap_rm.sh... Sage Weil
12:58 PM CephFS Bug #288 (In Progress): cmds disappears under snapshot load
Looks like there's more to it than this, I got another crash farther on. Continuing to study.
Also there might be ...
Greg Farnum
11:57 AM Linux kernel client Bug #150 (Can't reproduce): order:1 page allocation failure
we've fixed a bunch of memory leaks. haven't seen this recently. Sage Weil
11:53 AM Feature #303 (Resolved): objecter: limit in-flight ops and/or bytes written
Sage Weil
11:51 AM Bug #279 (Resolved): OSD crash on empty pglog
Sage Weil
11:51 AM Bug #279: OSD crash on empty pglog
i think this is fixed by commit:e986721b06973a9379a37961e49ce334c480fa1a. not a disk full problem at all. Sage Weil
11:49 AM Bug #300: osd: make pg removal atomic
actaully commit:e986721b06973a9379a37961e49ce334c480fa1a Sage Weil
11:48 AM Bug #300 (Resolved): osd: make pg removal atomic
commit:9a3c990d3708b3ec67a6a980c93fa814aece1c2e Sage Weil
11:44 AM Feature #33 (Closed): O_LAZY or equivalent
Sage Weil
10:20 AM Bug #282: osd: heartbeat can't keep up with large cluster changes
Greg Farnum wrote:
> Do we still think this is an issue after commit:856999eda434fa9b7d93b152427cf7c82240f220 ("osd:...
Sage Weil
10:17 AM Bug #282: osd: heartbeat can't keep up with large cluster changes
Do we still think this is an issue after commit:856999eda434fa9b7d93b152427cf7c82240f220 ("osd: clear failure_queue w... Greg Farnum
10:06 AM Linux kernel client Bug #139: BUG ceph_dentry_info: Objects remaining on kmem_cache_close()
actually, commit:252af5214682191e34e57204e1a31924fb82c207 Sage Weil
07:43 AM rgw Feature #301: Support partial content response
Uh, my bad, the Last-Modified header value was not a GMT timestamp. The attached patch fixes this by using "gmtime()"... Wido den Hollander
07:06 AM rgw Feature #301: Support partial content response
I've done some work on this today, while doing so i found some more issues.
The attached patch fixes that with GET...
Wido den Hollander

07/22/2010

09:27 PM Revision d665d494 (ceph): initscripts: use $type.$id, not $name, when possible
Sage Weil
09:26 PM Revision de9262e9 (ceph): initscripts: accept osd0 or osd.0, regardless of what conf uses
Sage Weil
09:17 PM Revision d9a7acbf (ceph): Merge remote branch 'origin/large_objects' into unstable
Sage Weil
07:01 PM Revision ad1cd81f (ceph): mds: be careful obeying REQRDLOCK
Only do a simple_sync() if we are stable, auth, and not already sync. The
client request can race with other state c...
Sage Weil
05:53 PM Revision b8c1bda8 (ceph): debian: fix typo in libcrush description
Reported-by: Wido Sage Weil
04:30 PM rgw Feature #301 (Resolved): Support partial content response
When playing rgw-hosted movie via vlc, can't fast forward/set movie position. This happens because we don't reply wit... Yehuda Sadeh
04:14 PM Bug #300 (Resolved): osd: make pg removal atomic
currently we can start pg removal, delete pglog, then crash while deleting objects, then fail on restart in load_pgs ... Sage Weil
03:31 PM CephFS Bug #30: multimds: slave_request on getattr
In order to change this, we need a way to wake up the replica waiter when we find out the auth changed, so we can res... Sage Weil
02:37 PM Bug #273 (Can't reproduce): OSD crashes during replication level change
Sage Weil
02:30 PM Bug #290 (Resolved): take osd1 or osd.1 to initscript
commit:de9262e995085e892df481bb5607b39510a616a7 Sage Weil
02:19 PM CephFS Bug #297 (Closed): MDS crash on Objecter::handle_osd_op_reply
this is just out of memory. opened up #299 to improve logging. Sage Weil
06:20 AM CephFS Bug #297: MDS crash on Objecter::handle_osd_op_reply
I've tried restarting the MDS'es multiple times, all resulting in the same crash again from both MDS'es.
The core-...
Wido den Hollander
05:49 AM CephFS Bug #297: MDS crash on Objecter::handle_osd_op_reply
Forgot to note my cluster state:... Wido den Hollander
05:49 AM CephFS Bug #297 (Closed): MDS crash on Objecter::handle_osd_op_reply
While doing a rsync of kernel.org again, both my MDS'es crashed.... Wido den Hollander
02:14 PM Linux kernel client Bug #291 (Resolved): leaked dentry (?) in ceph_readdir_prepopulate when ceph_init_dentry fails
fixed by commit:68301be0b91242139ffa215c1163a78f3e18db38 Sage Weil
02:11 PM Linux kernel client Bug #139 (Resolved): BUG ceph_dentry_info: Objects remaining on kmem_cache_close()
ah, finally fixed by commit:59bb6d659605044c2c62e2f5c76b712db36f7da8 Sage Weil
01:33 PM Feature #245: Logging to syslog
Thought about this some. A few things:
the dout() macros just need to use a temp stringstream (not *_dout), then ...
Sage Weil
01:28 PM Cleanup #299 (New): catch std::bad_alloc and die with helpful error in log on ENOMEM
Sage Weil
12:12 PM CephFS Bug #288: cmds disappears under snapshot load
Sage should have got this in commit:1271fdd0e345d64493c386167e38e3bfea7c52e6. Will test and confirm. Greg Farnum
12:04 PM CephFS Bug #296 (Resolved): Locker::simple_sync assert(0)
fixed by commit:ad1cd81fd00067a2da88daf053b82683a2e64a0b Sage Weil
11:17 AM Feature #298 (Resolved): osd: "watch" operation
- returns when object changes
- client needs to support long-running ops (w/o timing out or whatever)
- need to exp...
Sage Weil

07/21/2010

08:30 PM Revision 1271fdd0 (ceph): mds: initialize snaprealm created, current_parent_since on creation
Need to initialize created and current_parent_since on new snaprealms
when they are created, or else we get incorrect...
Sage Weil
08:30 PM Revision 6aa32f8c (ceph): Merge remote branch 'origin/debian' into unstable
Sage Weil
06:03 PM Revision 93a2ba8e (ceph): rgw: fix calls to finish_get_obj() with uninitialized handle
Yehuda Sadeh
04:31 PM Revision 0bd671f7 (ceph): debian: drop unused *.modules.in
Sage Weil
04:27 PM Revision ef2ecdab (ceph): fuse.cc: make license GPL
Originally based on Miklos' fusexmp.c, which is GPL. Keep that license.
LGPL makes no sense here anyway.
Sage Weil
04:27 PM Revision dd9d23b3 (ceph): debian: cleanup from Clint Byrum <clint@ubuntu.com>
With a few changes by me (changelog, copyright clarification) Sage Weil
04:27 PM Revision 68fc3fc8 (ceph): debian: no more ceph-kclient-soruce
Sage Weil
03:26 PM Linux kernel client Bug #271: dropping dirty+flushing AxFxw state for
It does look like a multiple mds only bug. This message can occur when ci->i_caps is empty while the ci->i_flushing_i... Yehuda Sadeh
02:40 AM Linux kernel client Bug #271: dropping dirty+flushing AxFxw state for
I just saw this again, saved the logs on *logger.ceph.widodh.nl* in */srv/ceph/issues/issue_271*... Wido den Hollander
01:35 AM rgw Feature #174: Support large files better
I think i was a bit to early with reporting. A few minutes later the file showed up with the correct filesize and a E... Wido den Hollander
01:27 AM rgw Feature #174: Support large files better
After that commit the Content-Length works fine.
Uploading large files still fails, for example:...
Wido den Hollander

07/20/2010

10:07 PM Revision 24c75167 (ceph): clean up new pool crush rule values [backward compatible protocol change]
Default to -1 (not 0!), which means 'server default'. Sage Weil
10:07 PM Revision 6cb3ea76 (ceph): mon: specify default new pool size/rule/pg_num in conf
Sage Weil
08:24 PM Revision 856999ed (ceph): osd: clear failure_queue when marked down
This prevents bleed through of failures (due to not getting hearbeats, due
to us being marked down) so they don't get...
Sage Weil
08:24 PM Revision e8eec86c (ceph): mon: set/clear pause{rd,wr} osdmap flags
Sage Weil
08:24 PM Revision 9ddb6ab2 (ceph): osd: infer missing map is empty when last_complete==last_update
Sage Weil
07:39 PM Revision 618f7f25 (ceph): osd: clean up pulling first/complete business
Sage Weil
07:30 PM Revision 46247640 (ceph): osd: push in chunks
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:13 PM Revision f3eb9645 (ceph): rgw: fix content-length reporting
and also some other various issues that came up while at it Yehuda Sadeh
05:38 PM Revision 190df1c6 (ceph): debug: fix log append
ios::ate does not work as advertised!
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:36 PM Revision 538ceedc (ceph): debug: say 'append' or 'new' when opening log
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:25 PM Revision 2483096e (ceph): osd: wait for non-existant pools (w/o crashing); clean up handle_op()
Make sure the pool exists before calling OSDMap::raw_pg_to_pg(). Fixes
crash on operations shortly after pg pools ar...
Sage Weil
04:37 PM CephFS Bug #296 (Resolved): Locker::simple_sync assert(0)
Started seeing this today while working on #288. Once it strikes a run it sticks around, but it's intermittent about ... Greg Farnum
04:33 PM CephFS Bug #295 (Can't reproduce): mds: can't rmdir due to dir size underflow
[dir 10000181ec7 /ceph1/untar_snap_rm.sh/ [3b,head] auth v=5988 cv=5988/5988 state=1073741826|complete f(v0 m10.07.2... Sage Weil
11:55 AM rgw Feature #174: Support large files better
commit:f3eb96457b193b1f5d79cf2b41a3cda690c0eab0 fixes the content length issue. Yehuda Sadeh
02:54 AM rgw Feature #174: Support large files better
Removing the following lines in *rgw_rados.cc* the content-length is returned correctly:
*Line 621:**...
Wido den Hollander
02:08 AM rgw Feature #174: Support large files better
The commit from last night seems to have broken the Content-Length header, see the attached screenshot.
There is a...
Wido den Hollander
11:21 AM Linux kernel client Feature #294 (Resolved): honor PAUSE* flags in osdmap
Sage Weil
10:43 AM Bug #293 (Can't reproduce): cmon crash during paxos update
hmm, i fixed this by fixing logm/last_committed to have the actual last committed state (3141 i think). I'm not sure... Sage Weil
08:50 AM Bug #293 (Can't reproduce): cmon crash during paxos update
Today i experienced some crashes of my monitors and mds'es due to my disks filling up with logs.
I had to re-start...
Wido den Hollander
10:29 AM Bug #292 (Resolved): OSD crash raw_pg_to_pg
fixed by commit:2483096e664c772478bb7dcf718297f7779fe5c7 Sage Weil
02:06 AM Bug #292 (Resolved): OSD crash raw_pg_to_pg
In my cluster osd28 just got marked as down, i assume the heartbeat problem again. ( I was playing with the S3 Gatewa... Wido den Hollander
12:20 AM Revision a0cd80cf (ceph): rgw: fix copy
Yehuda Sadeh

07/19/2010

11:50 PM Revision 925e2092 (ceph): rgw: get/put read and write by chunks
Yehuda Sadeh
11:22 PM Revision 1e30deaa (ceph): osd: refactor push code
- send_push_op() does a push, nothing else
- push_start() starts a primary->replica push, tracks state
- push_to_repl...
Sage Weil
11:20 PM Revision eed7f726 (ceph): rgw: lay groundwork for chunky read/write
Yehuda Sadeh
09:44 PM Revision 1d201832 (ceph): osd: recover degraded objects _before_ modifying it
This will slow down writes to degraded objects because we will wait for it
to recover before applying the write. OTO...
Sage Weil
08:42 PM Revision 2e521b82 (ceph): osd: make client op version actual at_version
I.e. this should take into account any cloning. So set it after we call
prepare_transaction, not before.
Sage Weil
08:42 PM Revision f7cc6e8f (ceph): osd: fix subop version in snaptrimmer
The at_version should end equal to last_update, not +1. Sage Weil
08:08 PM Revision 71ff557f (ceph): osd: observe osd_recovery_max_chunk when pulling objects to primary
Sage Weil
08:06 PM Revision be8359ad (ceph): intervalset: implement span_of()
Sage Weil
05:48 PM Revision 2e137c9a (ceph): osd: clean out temp pg on startup
Sage Weil
05:44 PM Revision e0b62557 (ceph): osd: create temp collection during mkfs
Sage Weil
05:40 PM Revision a108774e (ceph): osd: give coll_t an explicit type in {meta,temp,pg}
Sage Weil
05:20 PM Revision bf8aa18d (ceph): rbd: add removal of snapshots
Yehuda Sadeh
05:15 PM Revision b4b6a824 (ceph): osd: add osd_recovery_max_chunk
Sage Weil
05:13 PM Revision 0b366744 (ceph): mon: check command length on osdmon
Fixes crash on 'ceph osd tell'.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
04:30 PM rgw Feature #174: Support large files better
The the gateway should now (following commit:925e2092486bbc78f011065172524d6c550ae7c6) do its get and put operations ... Yehuda Sadeh
04:09 PM Linux kernel client Bug #291 (Resolved): leaked dentry (?) in ceph_readdir_prepopulate when ceph_init_dentry fails
Sage Weil
03:06 PM Linux kernel client Bug #289 (Closed): can't handle ipv4 and ipv6 localhost
The parse bug is fixed.
The v4+v6 localhost is really a question of whether we want to bind to multiple addresses ...
Sage Weil
09:02 AM Linux kernel client Bug #289 (Closed): can't handle ipv4 and ipv6 localhost
if I mount ceph with localhost,
> mount -t ceph localhost:/ /mnt
it fails with:
mount error 22 = Invalid arg...
Greg Farnum
02:16 PM CephFS Bug #288: cmds disappears under snapshot load
Tried this with cfuse and got an mds crash with core dump. Looks like there's an issue with selecting the proper snap... Greg Farnum
09:31 AM CephFS Bug #288 (In Progress): cmds disappears under snapshot load
Greg Farnum
09:00 AM CephFS Bug #288 (Closed): cmds disappears under snapshot load
the ceph.git/unstable cmds gets killed by my snaptest-2 (http://
github.com/vinzent/ceph-testsuite/blob/master/tests...
Greg Farnum
01:11 PM Bug #287 (Resolved): cmon crash when "osd tell" is given with no arguments
fixed by commit:0b366744aeed614229b8ad6cc05bccb576c86202 Sage Weil
12:54 PM Bug #290 (Resolved): take osd1 or osd.1 to initscript
if conf has [osd1], the init script won't take osd.1. Sage Weil
12:34 PM Linux kernel client Bug #271: dropping dirty+flushing AxFxw state for
While this was going on, one of my MDS'es was pulling 92% CPU, with the following cluster state:... Wido den Hollander
12:33 PM Linux kernel client Bug #271: dropping dirty+flushing AxFxw state for
Just had this issue again.
Situation:
* Last night my 2 mons crashed during two rsync's (have not reported this c...
Wido den Hollander
09:57 AM Bug #281 (Resolved): remove snapshots via rbd tool
Yehuda Sadeh
09:56 AM Bug #281: remove snapshots via rbd tool
done, commit:bf8aa18d4801e9713af6098f9697b5e9e21c0230 Yehuda Sadeh

07/17/2010

12:55 PM Bug #287 (Resolved): cmon crash when "osd tell" is given with no arguments
... Wido den Hollander

07/16/2010

11:18 PM Revision 344c4318 (ceph): rgw: rgw_rados: initialize the position in list_buckets_init!
Greg Farnum
11:18 PM Revision 7315f4fc (ceph): rgw: rgw_rest: Properly return the content-length.
From wido. Greg Farnum
11:18 PM Revision 9810f7ea (ceph): librados: don't read attributes until the callback is done!
Greg Farnum
09:51 PM Feature #154 (Closed): support IPv6 addresses
Sage Weil
09:05 PM Revision f00992c9 (ceph): librados: librados.hpp should be standalone
removed references to object_t, declared librados::SnapContext Yehuda Sadeh
09:03 PM Revision bee69c88 (ceph): monc: be quiet about 'continuing hunt' too
Sage Weil
07:03 PM Revision 0656d0f0 (ceph): mds: small EMetablob fixup
Push to the back, update the back.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:02 PM Revision fdd79de6 (ceph): osd: always set snapset context if can_create
The most common code paths loaded it, but not the case where the obc was
already loaded.
Signed-off-by: Sage Weil <s...
Sage Weil
06:43 PM Revision 00fae3a2 (ceph): mds: remove bogus 'oldest snap' floor on lssnap result
I suspect the intent was to exclude snaps from parents from before we
existed. However, get_snap_info() already does...
Sage Weil
06:43 PM Revision 74e3d796 (ceph): mds: simplify rstat/dirstat debug printing
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:43 PM Revision 63642224 (ceph): client: drop unnecessary #include
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:43 PM Revision 1ac265f8 (ceph): mds: helpful journal replay debug output
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:43 PM Revision 8ea9cd59 (ceph): mds: journal dirty items in order
There was some weird thing where dirty items were added to the front of
the list in the EMetaBlob, dating from 2007. ...
Sage Weil
06:38 PM Revision eb173870 (ceph): lower loglevel for successfull mon hunting
I'm receving log output on every rados_initialize (... found mon0).
Since this makes the qemu-iotests fail, I'd sugge...
Christian Brunner
04:56 PM Revision cf02db42 (ceph): messages: fix MClientRequest::is_write(), rename to may_write().
Also remove unused can_forward(), auth_is_best(). Greg Farnum
04:15 PM rgw Bug #270 (Closed): Content-Type not returned / saved correctly
Greg Farnum
04:14 PM rgw Bug #270: Content-Type not returned / saved correctly
Done in commit:e6ec74fb3b8198feb427d33a9ef115e66589e494. The getxattrs function got broken a few months ago. Greg Farnum
04:14 PM rgw Bug #260 (Closed): Segmentation fault when listing buckets without uid
Fixed in commit:64b00ecf928d4ffc0dd58e1f3137c82872cacb17.
Turns out the state wasn't getting properly initialized......
Greg Farnum
02:17 PM Bug #262: osd: make sync work on ext4
BLKFLSBUF ioctl? Sage Weil
12:59 PM Feature #286 (Rejected): librados: set read chunk size
Make a max read size, such that if you do read(len=0) it will read the object in chunks.
g_conf.objecter_read_chun...
Sage Weil
12:05 PM rbd Bug #284 (Resolved): qemu-kvm can't create snapshots with multiple rbd devices
ok, actually fixed by commit:d5d1d04228ca28540cc88893ccf7c33aa9e88f01 Yehuda Sadeh
11:11 AM rbd Bug #284: qemu-kvm can't create snapshots with multiple rbd devices
At this point we'll be using the snapshot name as the unique snapshot id. Should be fixed with commit:a56b7fd513f6e88... Yehuda Sadeh
11:57 AM Linux kernel client Bug #271 (Can't reproduce): dropping dirty+flushing AxFxw state for
If you see this again, can you save the mds logs? It looks like one of the MDS sessions is closing, but i have no id... Sage Weil
11:46 AM CephFS Bug #29 (Resolved): mds: rfiles underflow
fixed by commit:8ea9cd5956c7c158fe93076930701927e13bfa03. woot! Sage Weil
11:46 AM CephFS Bug #16 (Resolved): mds restart vs dbench
also commit:8ea9cd5956c7c158fe93076930701927e13bfa03 Sage Weil
10:35 AM Bug #285 (Resolved): client4110 specified ino X but mds0 allocated Y
This was a bad commit in hte master branch, commit:f0e3b1ffe4b7c631f14ffa43da518d357d776602. It's fixed now, replace... Sage Weil
05:53 AM Bug #285 (Resolved): client4110 specified ino X but mds0 allocated Y
I just did a mkcephfs and started re-creating my fs tree.
When doing so, i noticed the following messages:
<pre...
Wido den Hollander
09:56 AM Linux kernel client Bug #275 (Can't reproduce): Unable to remove module when monitors or mds'es are down
Sage Weil
05:04 AM Bug #279: OSD crash on empty pglog
Same issue occured on *osd12*, i've uploaded the logs + core-dump to the same directory on "logger.ceph.widodh.nl"
...
Wido den Hollander
03:02 AM Bug #279: OSD crash on empty pglog
Btw, old logs are not available due to the NFS issues i had yesterday.
There is also enough disk-space on osd5:
...
Wido den Hollander
02:57 AM Bug #279: OSD crash on empty pglog
I'm seeing this crash on another osd right now. (osd5)... Wido den Hollander

07/15/2010

09:51 PM Revision 5f6d077a (ceph): mds: throw std::bad_alloc on boost:pool<> depletion
A bit more informative than a segfault from this=NULL in the constructor.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:51 PM Revision 0d9b998d (ceph): msgr: put brackets around ipv6 addrs
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:51 PM Revision 2d358a4c (ceph): msgr: use in6addr_any extern for is_blank_addr()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:50 PM Revision 1f7e11aa (ceph): rbd: fix usage
Yehuda Sadeh
09:35 PM Revision b0a5cc2d (ceph): rbd: update man page, other external tools
Yehuda Sadeh
09:35 PM Revision d565ae3d (ceph): rbd: rbd tool can parse embedded snapshots name
also update man page Yehuda Sadeh
06:05 PM Revision 851a7341 (ceph): mon: limit size of MOSDMap messages when subscribing
Send multiple MOSDMap messages (max of 100 incrementals per) when
clients who are way behind subscribe.
Signed-off-b...
Sage Weil
05:54 PM Revision 17a31fd5 (ceph): debug: fix append to log file
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:51 PM Revision cf6cc8b4 (ceph): osd: oops, fix pool thing
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:43 PM Revision 787ecb8f (ceph): osd: MOSDOp: print snapc if snap_seq != 0, not when snap list is empty
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:43 PM Revision e8991f19 (ceph): osd: use client or pool snapc based on _pool_ snap mode, not client op
The pool snap mode determines where the snapc should come from. In
particular, if the client specifies no snap seq, ...
Sage Weil
04:43 PM rbd Bug #284 (Resolved): qemu-kvm can't create snapshots with multiple rbd devices
There is a problem with multiple devices when trying to create a snapshot. Qemu tries to reuse the same snapshot id w... Yehuda Sadeh
03:23 PM Revision af65f15b (ceph): rgw: delete bucket objects from root pool on destruction.
This allows a bucket with that name to be created later. Greg Farnum
02:51 PM Bug #280 (Resolved): need to update man page for the rbd tool
done, commit:b0a5cc2db2cb73c4776243d0d2869fd6cc0f4903 Yehuda Sadeh
02:44 PM Linux kernel client Bug #283 (Resolved): ceph_add_cap: couldn't find snap realm, NULL ptr deref
mds restart.
rmdir dir (ENOTEMPTY)
ls dir/.snap
then crash....
Sage Weil
02:33 PM Bug #277 (Closed): "No space left on device", while fs is not full
Yeah, that's the problem. You should weight the OSD in crush based on the disk size. Or take it out of the mix enti... Sage Weil
02:29 PM Bug #277: "No space left on device", while fs is not full
Since my cluster is in a bad shape at the moment, my OSD won't be worth much, but here it is:... Wido den Hollander
10:23 AM Bug #277: "No space left on device", while fs is not full
Also, the flags for full are (at present) set once a single OSD reaches 95% of its disk space. It could potentially b... Greg Farnum
09:25 AM Bug #277: "No space left on device", while fs is not full
can you dump your osdmap (ceph osd getmap -o /tmp/foo) and post it here? there are 'fs is full' and 'fs is almost fu... Sage Weil
12:23 AM Bug #277: "No space left on device", while fs is not full
I checked the dmesg, but there were no messages about that. (Actually, no messages at all)
The diskspace of the OS...
Wido den Hollander
02:10 PM CephFS Bug #274 (Closed): OSD crash during rsync
Sage Weil
02:06 PM CephFS Bug #274: OSD crash during rsync
Yes, i was overcommitting at the client side too, since this was running OOM too. But it seems these values were a bi... Wido den Hollander
11:45 AM CephFS Bug #274: OSD crash during rsync
Sage Weil
11:45 AM CephFS Bug #274: OSD crash during rsync
Most of the crashes are definitely OOM. (The prepare_null_dentry one i'm not 100% sure about, but does look that way... Sage Weil
02:08 PM Bug #279: OSD crash on empty pglog
I don't indeed (no log), but i've checked, there are multiple files with a filesize of 0 (pglog_*) on osd20 and other... Wido den Hollander
12:36 PM Bug #279: OSD crash on empty pglog
The real question is how the pglog got to be 0 in the first place. You don't have an older osd.20.log, do you?
(T...
Sage Weil
01:52 PM Linux kernel client Bug #271: dropping dirty+flushing AxFxw state for
Multiple, 2 to be exact. Wido den Hollander
11:53 AM Linux kernel client Bug #271: dropping dirty+flushing AxFxw state for
single or multiple mds's? Sage Weil
01:49 PM Bug #282 (Resolved): osd: heartbeat can't keep up with large cluster changes
in wido's case, a new crushmap makes osds flap. Sage Weil
11:47 AM Linux kernel client Bug #275: Unable to remove module when monitors or mds'es are down
Yeah, you should really do umount -f. If that fails with 'filesystem busy' then kill -9 on the running procs should ... Sage Weil
11:10 AM Bug #272: Monitor crashes during replication level change
Sage Weil
11:09 AM Bug #272: Monitor crashes during replication level change
It looks like your system generated a bazillion osdmap updates for some reason. And then when osds start up, they te... Sage Weil
10:46 AM Bug #278 (Resolved): data corruption after rbd rollback
fixed by commit:e8991f19526939ee843c7b04c167fe290f113602 Sage Weil
08:21 AM rgw Bug #265 (Closed): Removing a bucket doesn't clean up object in .rgw pool
Fixed in commit af65f15b855819188aae9952d38a2cc035265417 Greg Farnum

07/14/2010

11:59 PM Revision 27bd9018 (ceph): rbdtool: rename rbdtool to 'rbd' tool and change usage
Yehuda Sadeh
10:52 PM Bug #281 (Resolved): remove snapshots via rbd tool
Yehuda Sadeh
10:51 PM Bug #280 (Resolved): need to update man page for the rbd tool
Yehuda Sadeh
08:19 PM Revision 14542911 (ceph): cfuse: return EROFS when attempting to modify snapshots
Greg Farnum
04:52 PM Bug #277: "No space left on device", while fs is not full
Will be nice to have the kernel log, see where this error code is coming from. Might be that one of the osds got fill... Yehuda Sadeh
08:25 AM Bug #277 (Closed): "No space left on device", while fs is not full
While doing a rsync of kernel.org i got the message that there was no space left on the device:... Wido den Hollander
02:50 PM Linux kernel client Bug #275: Unable to remove module when monitors or mds'es are down
I'm not sure that the -l here is in place. It just gives you an illusion that it actually did anything, but it did no... Yehuda Sadeh
01:37 PM Bug #279 (Resolved): OSD crash on empty pglog
During a crushmap change, one of my OSD's crashed on a empty pglog... Wido den Hollander
01:27 PM Bug #278 (Closed): data corruption after rbd rollback
Andrew created a kvm-rbd image, took a snapshot (from the rbdtool while the rbd-kvm was running, however, I don't thi... Yehuda Sadeh
01:21 PM CephFS Bug #269 (Closed): mds: don't crash when attempting to write to a snapshotted dir
Fixed in commit:9c90f7b79479604b0065df7de508145e5167f1cb Greg Farnum
01:20 PM CephFS Bug #268 (Closed): cfuse: return EROFS when writing to snaps
Fixed in commit:145429118d79d532852c386b4260d22b8b6b08a3. Greg Farnum
07:22 AM Feature #276 (Resolved): Possibility to dump/list xattrs from RADOS object
While hunting for #270 i've found out that i couldn't dump the xattrs of a RADOS object.
It might be easy if you c...
Wido den Hollander
06:24 AM rgw Bug #270: Content-Type not returned / saved correctly
Searching further i found that it seems to go wrong in "rgw_rados.cc"
Line 475:...
Wido den Hollander
05:59 AM rgw Bug #270: Content-Type not returned / saved correctly
I've spent some time searching for the cause of this bug.
It seems the issue is in "rgw_rest.cc" in method "RGWGet...
Wido den Hollander

07/13/2010

06:35 PM Revision 9c90f7b7 (ceph): mds: Check all write ops to make sure they aren't in a snapshot; return...
Greg Farnum
01:40 AM CephFS Bug #274: OSD crash during rsync
Ofcourse, the title had to be "MDS crash during rsync".
I just tried to sync kernel.org again, (after a mkcephfs),...
Wido den Hollander
01:00 AM Linux kernel client Bug #275 (Can't reproduce): Unable to remove module when monitors or mds'es are down
When a cluster fails you can unmount the filesystem with "umount -lf /path/to/ceph", that works fine.
But then "rm...
Wido den Hollander

07/12/2010

10:48 PM CephFS Bug #274: OSD crash during rsync
Starting the MDS'es again also fails. When doing so, the client reconnects to mds1, which then crashes after a few se... Wido den Hollander
10:40 PM CephFS Bug #274 (Closed): OSD crash during rsync
Trying to replicate issue #272 and #273 i started a rsync to sync kernel.org and the Ubuntu releases (running at the ... Wido den Hollander
02:17 PM Bug #273 (Can't reproduce): OSD crashes during replication level change
In reference to #272 i create this issue.
During this replication change i had some crashes at the OSD's. On almos...
Wido den Hollander
01:51 PM Bug #272: Monitor crashes during replication level change
Just tried to start the monitors again, they both crashed again:
mon0...
Wido den Hollander
01:43 PM Bug #272 (Closed): Monitor crashes during replication level change
Today i tried to change my replication level from 2 to 3 on my "data" and "metadata" pools.
This caused various cr...
Wido den Hollander
01:09 PM Linux kernel client Bug #271 (Can't reproduce): dropping dirty+flushing AxFxw state for
This weekend i tried to rsync kernel.org to my cluster, just to get some data on it.
At first it all seems to go f...
Wido den Hollander
11:54 AM CephFS Bug #269 (In Progress): mds: don't crash when attempting to write to a snapshotted dir
Greg Farnum

07/11/2010

04:07 AM rgw Bug #270 (Closed): Content-Type not returned / saved correctly
When setting the contentType when uploading, the gateway does not return this correctly.
Putting a file:...
Wido den Hollander

07/10/2010

12:07 AM Revision 4884f742 (ceph): rbdtool: fix add-snap
Yehuda Sadeh

07/09/2010

11:24 PM Revision 2b02afd6 (ceph): rbdtool: rename should first check source
that is, source image doesn't exist error should have priority
over destination already exists.
Yehuda Sadeh
11:24 PM Revision 8b96c3ca (ceph): rbdtool: rearrange usage
Yehuda Sadeh
10:51 PM Revision d19f05a4 (ceph): debug: fix debug rank symlink rotate when log_per_instance
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:51 PM Revision e61ceee1 (ceph): debian: override lintian warning unstripped-binary-or-object
Sage Weil
10:37 PM Revision e0229b0a (ceph): rbdtool: implement copy
Yehuda Sadeh
10:31 PM Revision 8f0da0ae (ceph): rbdtool: use canonical image name (pool/image)
Yehuda Sadeh
10:31 PM Revision 98bdd8da (ceph): rbdtool: can read data using snapshot
Yehuda Sadeh
09:47 PM Revision 3cbf578e (ceph): mds: issue all snap_caps allowed given locks, not by current cap
For snap_caps, who cares if the live inode has a cap for this client. We
always want to issue the maximum set of cap...
Sage Weil
09:47 PM Revision 372db051 (ceph): mds: fix acquire_lock() calls in handle_client_open()
We want to acquire_locks exactly once per path through the function.
Don't look at mdr->* yet if we haven't locked ye...
Sage Weil
09:47 PM Revision 8921a18e (ceph): mds: fix up snap cap issue logic comments
Sage Weil
08:51 PM Revision f5c4fc61 (ceph): fix FCGI detection and specfile
configure is building radosgw as soon as libfcgi is installed, even
if --without-radosgw is given. Here is a fix for ...
Christian Brunner
08:14 PM Revision 878a7de5 (ceph): mon: avoid OOB m->cmd access in auth proprocess_command
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:27 PM Revision 71b9daa8 (ceph): debug: only truncate log file on open if log_per_instance is set
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:27 PM Revision c91c8a31 (ceph): msgr: brackets (not braces) for ipv6 addrs
Sigh
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:09 PM Revision 876cca9e (ceph): mds: termiante client sessions before flushign log during shutdown
We need to close out client sessions, and caps, to empty the open_files
lists, so that we can properly trim the EOpen...
Sage Weil
04:53 PM CephFS Bug #16: mds restart vs dbench
hopefully fixed by commit:696bab43d66532768a9f85f71a86ddef2bff527f 'ceph: generate request paths only once'?? needs m... Sage Weil
04:14 PM CephFS Bug #16: mds restart vs dbench
aha, when teh rename request is replayed, the target(?) filename changes:... Sage Weil
03:50 PM CephFS Bug #16 (In Progress): mds restart vs dbench
whoops, no, hit it.... Sage Weil
03:04 PM CephFS Bug #16 (Can't reproduce): mds restart vs dbench
this has apparently been fixed since april.. can't reproduce it now. Sage Weil
04:38 PM Revision 84d4c11f (ceph): ceph.spec.in: create /var/log/ceph directory
I've missed this one in my previous patch series.
Regards,
Christian
Signed-off-by: Sage Weil <sage@newdream.net>
Christian Brunner
04:38 PM Revision bf047a64 (ceph): ceph.spec.in: add /var/lib/ceph/
Signed-off-by: Sage Weil <sage@newdream.net> Christian Brunner
04:38 PM Revision 8ec5134a (ceph): ceph.spec.in: add a build option for radosgw (--with radosgw)
Signed-off-by: Sage Weil <sage@newdream.net> Christian Brunner
04:38 PM Revision 19886eb6 (ceph): ceph.spec.in: add logrotate configuration
Signed-off-by: Sage Weil <sage@newdream.net> Christian Brunner
04:38 PM Revision 293299f0 (ceph): ceph.spec.in: set correct path for rbd classes
Signed-off-by: Sage Weil <sage@newdream.net> Christian Brunner
03:18 PM Bug #224 (Resolved): rbdtool copy / rename
copy implemented at commit:e0229b0a1f9f46a6f6140873c17420d9bd85c87c Yehuda Sadeh
03:12 PM Feature #87 (Closed): osd: separate interface/messenger for backend (osd to osd) network
Sage Weil
02:49 PM Linux kernel client Bug #226 (Resolved): kclient can hang waiting for snap caps
Sage Weil
02:47 PM Linux kernel client Bug #226: kclient can hang waiting for snap caps
should be fixed by commit:3cbf578e5e4ec6a79c78fc18d9104d4e71aab014 Sage Weil
11:30 AM Linux kernel client Bug #226 (In Progress): kclient can hang waiting for snap caps
Sage Weil
12:43 PM CephFS Bug #269 (Closed): mds: don't crash when attempting to write to a snapshotted dir
I think this is a matter of returning EROFS for any write op if mdr->snapid != CEPH_NOSNAP.
Probably fix this befo...
Sage Weil
12:30 PM CephFS Bug #268 (Closed): cfuse: return EROFS when writing to snaps
Lots of places to check.. setattr, create, mkdir, mknod, symlink, link, unlink, rename, setxattr... If the inode or d... Sage Weil
12:22 PM Linux kernel client Bug #193 (Resolved): protocol error after control-c
I think this was caused by the message revocation bug fixed by commit:ed98adad3d87594c55347824e85137d1829c9e70, #252.... Sage Weil
12:11 PM Linux kernel client Bug #139: BUG ceph_dentry_info: Objects remaining on kmem_cache_close()
Haven't seen this bug for a while, might be resolved by accident? Wido den Hollander
11:30 AM Feature #267 (Resolved): Support IPv6 in monmaptool
Sage Weil
11:30 AM Feature #267: Support IPv6 in monmaptool
parsing was broken, fixed by commit:c91c8a3115347b8102ccf6a061b57a3a9c4ebd94
works for me:...
Sage Weil
11:12 AM Feature #267 (Resolved): Support IPv6 in monmaptool
Adding a IPv6 monitor to the monmap fails:... Wido den Hollander
11:28 AM Bug #263 (Resolved): failed to assign a block name for image (during create)
Sage Weil
11:17 AM Bug #263: failed to assign a block name for image (during create)
Seems fixed since today. (unstable branch) (Had to install binutils though)
Had to run "cclass -a" to fix it.
Wido den Hollander
09:52 AM Linux kernel client Feature #261: Support IPv6 while mounting
my bad, there was a fix in unstable that wasn't in master. moved that over, and it gets rid of the -97 error for me. Sage Weil
05:52 AM Linux kernel client Feature #261: Support IPv6 while mounting
Tried mounting with IPv6 today, but it failed, the config:... Wido den Hollander

07/08/2010

10:41 PM Revision 9b74eb6f (ceph): osd: print debug write before adding to transaction
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:55 PM Revision 4701e838 (ceph): msgr: return error and dethrottle if message decode fails
This was broken by the addition of the throttling stuff.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:27 PM Revision b81273d5 (ceph): debian: make helper scripts remember most recent release
So you don't have to cut&paste it on the command line
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:34 PM Revision f456b51d (ceph): osd: fix delayed pg creation use after free
Use local acting[], not the one stashed in creating_pgs[pgid]. And drop
the duplication .erase().
Signed-off-by: Sa...
Sage Weil
06:34 PM Revision 6612a1f1 (ceph): vstart: specify osd class tmp dir
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:56 PM Revision 9fd3f593 (ceph): monc: make hunt, ping rates configuration
3s for hunt rate, 10s for ping rate.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:49 PM Revision a396fcb2 (ceph): client: increase client mount timeout
This needs to be longer than the monclient hunt rate (currently 10s) so
that we succeed if we initially pick a down m...
Sage Weil
05:14 PM Revision 89c7d920 (ceph): filestore: improve lseek64 error checking, and use buffer::list::write_...
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:09 PM Revision a3fa9660 (ceph): buffer: make write_fd() const
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:49 PM Revision 303a1e3c (ceph): mount.ceph: handle ipv6 addrs, colons, braces
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:38 PM Revision 25ff6e4c (ceph): ceph.spec: require libedit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:36 PM Revision e0d443cc (ceph): ceph.spec.in: require binutils
cclass needs /usr/bin/nm
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:33 PM Revision c5d86f9d (ceph): debian: require binutils
cclass needs 'nm' to extract symbols from rados classes.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:29 PM Revision fd8ba5d5 (ceph): debian: explicit postinst/postrm for shlibs
The dh_shlibdeps sees the .so in 'ceph' and incorrectly adds ldconfig to
postinst and postrm. Which is useless and u...
Sage Weil
11:06 AM Cleanup #258 (Resolved): make logger output rotatable, cleaner
fixed by commit:83d1ea6636dd432dcbb6a0c6046d551bee7be5c6 Sage Weil
11:05 AM Linux kernel client Feature #261 (Resolved): Support IPv6 while mounting
fixed by commit:54b4fe8b92c7983ccb9b588420951e11aa5aa87c on the kenrel side. there is also a ceph.git patch commit:3... Sage Weil
09:04 AM rgw Bug #264 (Resolved): Creating a bucket gives ErrorBucketAlreadyExists
Yehuda Sadeh
08:56 AM rgw Bug #264: Creating a bucket gives ErrorBucketAlreadyExists
This is actually a bug in the s3 client. Should be fixed with the attached patch. Yehuda Sadeh
01:43 AM rgw Bug #264 (Resolved): Creating a bucket gives ErrorBucketAlreadyExists
When trying to create a bucket, you get the message that it already exists, when it doesn't:... Wido den Hollander
08:59 AM CephFS Feature #266 (New): mount.ceph: specify secret via name=foo and keyring=bar
It can run cauthtool -p keyringfile to extract the secret. Sage Weil
01:46 AM rgw Bug #265 (Closed): Removing a bucket doesn't clean up object in .rgw pool
When removing a bucket, the corresponding object in the .rgw pool doesn't get removed, thus preventing us from creati... Wido den Hollander
01:01 AM Bug #263: failed to assign a block name for image (during create)
Also fails with qemu-kvm:... Wido den Hollander
12:43 AM Bug #263 (Resolved): failed to assign a block name for image (during create)
Trying to create a image with rbdtool, this fails:... Wido den Hollander

07/07/2010

11:55 PM Revision 07646308 (ceph): rbdtool: should support files larger than 2GB
Yehuda Sadeh
11:34 PM Revision ae82dd5a (ceph): rbdtool: fix export
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:33 PM Revision 1e08e4bf (ceph): mon: create new pools using rjenkins hash
Otherwise they get -1, which always maps to an invalid (-1) pgid.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:56 PM Revision b977f4e5 (ceph): debian: _do_ strip cosd
Sage Weil
10:56 PM Revision cf38e86e (ceph): class: make class tmp directory configurable, not /tmp
/tmp usually gets mounted with 'noexec', making it a poor choice.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:00 PM Revision d5e3962a (ceph): config: can use '-' as a config option
Yehuda Sadeh
09:41 PM Revision 2096d8d7 (ceph): logger: shut up
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:33 PM Revision 01989903 (ceph): msgr: remove fake orig_source_addr fakery
This makes get_orig_*() match get_*().
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:32 PM Revision 6ae95b6c (ceph): mon: fix message forwarding to use Connection to fake source addr
This make get_source_addr() work (not just get_orig_source_addr()).
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:50 PM Revision 243491ef (ceph): debian: do not strip cosd
Sage Weil
07:50 PM Revision ed66dd57 (ceph): classhandler: print error when dlopen fails
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:34 PM Bug #262 (Resolved): osd: make sync work on ext4
FileStore currently does a single fsync() to push all dirty data to disk (data=ordered(/journal?)) behavior. ext4 do... Sage Weil
03:24 PM Revision cc5fcb99 (ceph): ceph: drop version from ceph_fs.h, use autoconf VERSION macro instead
There's no reason to define this in the code anymore. Since the kernel
side doesn't use it it just confuses matters....
Sage Weil
02:47 PM Revision 6cca139f (ceph): shell script fixes
ar Fred
02:38 PM Revision 9432a958 (ceph): mds: fix logger init on startup, rank symlink creation
Busted by 83d1ea6636dd432dcbb6a0c6046d551bee7be5c6.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:36 PM Revision b0e6e4f1 (ceph): cclass: $libdir/rados-classes
Sage Weil
02:36 PM Revision 929f6721 (ceph): debian: include rados-classes
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
02:36 PM Revision 8d2393c0 (ceph): osd: consolidate osd stats in _refresh_my_stats
Also keep local copy of osd_stat (send with pg stats update to monitor).
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:36 PM Revision 0bf19e5a (ceph): debian: put rbd classes in ceph, not librados1 package
The cluster classes are a server-side thing, not client lib thing.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
02:36 PM Revision 4fb67c31 (ceph): Makefile: cleanup
Sage Weil
02:36 PM Revision a321ae43 (ceph): debian: add postinst, postrm for ceph package
This will keep debain from using the default, which thinks it needs to run
ldconfig. Which it doesn't.
Sage Weil
02:36 PM Revision f6e1fd53 (ceph): debian: do not strip rados classes
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:04 AM Linux kernel client Feature #261 (Resolved): Support IPv6 while mounting
Right now the client doesn't support mounting over IPv6 yet, so it's not possible to run with IPv6 only.
In order ...
Wido den Hollander
07:41 AM CephFS Bug #259 (Resolved): MDS crash during log initialize
sorry, fixed for real by commit:9432a9588972860aa2fdb3f9ea18eb88073ace9a Sage Weil
03:20 AM CephFS Bug #259: MDS crash during log initialize
Seems to be in commit 83d1ea6636dd432dcbb6a0c6046d551bee7be5c6, reverting to 1ca446dd9ac2a03c47b3b6f8cc7007660da911ec... Wido den Hollander
02:37 AM CephFS Bug #259 (Resolved): MDS crash during log initialize
Running with the latest unstable (83d1ea6636dd432dcbb6a0c6046d551bee7be5c6) my MDS'es crash while initializing their ... Wido den Hollander
07:08 AM rgw Bug #260 (Closed): Segmentation fault when listing buckets without uid
When listing buckets without a uid, radosgw_admin segfaults:... Wido den Hollander
04:12 AM Revision 1ca446dd (ceph): auth: rename supported_auth -> auth_supported
It should be 'auth supported' for naming consistency.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
04:12 AM Revision 83d1ea66 (ceph): logger: huge rewrite, cleanup
- name *.log
- name after name, not rank
- reopen on sighup
- remove pointer on shutdown
- clean up interface
- imple...
Sage Weil

07/06/2010

10:14 PM Revision 0c345a98 (ceph): client: put MDSSession pointers in the msd_session map, rather than the...
This is a bit cleaner and clears the way to make it more closely mimic the kernel. Greg Farnum
10:14 PM Revision 74bd8d77 (ceph): client: handle_client_session now careful opening mds sessions while un...
Greg Farnum
10:00 PM Revision 5b097603 (ceph): msgr: immediately drop messages with null dest
This avoids spawning a new pipe and then failing to connect to a bad
address.
Signed-off-by: Sage Weil <sage@newdrea...
Sage Weil
10:00 PM Revision c39ee594 (ceph): paxos: send summary if observer has fallen behind first_committed
This avoids a ceph crash like
mon/PGMap.h: In function 'void PGMap::apply_incremental(PGMap::Incremental&)':
mon/PGM...
Sage Weil
09:48 PM Revision a40631fb (ceph): mds: fix handle_dentry_unlink to handle remote unlink (no straydn)
The straydn is optional, and NULL if we're unlinking a remote dentry.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:34 PM Revision 0fbb8eeb (ceph): mds: be less noisy about memory usage in log
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:02 PM Revision 08f74066 (ceph): ceph: --in-file, not --in-data
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:47 PM Revision 89a40b3f (ceph): osd: track pg_num by type in logger
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:27 PM Revision 013df04a (ceph): librados: add pool set_snap_context
Set the snap context for a given pool handle.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:02 PM Revision 84bca030 (ceph): client: take the first inode as root, rather than whatever comes out
with ino=1.
Also, translate root inode properly in fuse_ll.
Greg Farnum
06:53 PM Revision 4a80e5e0 (ceph): ceph.spec: install /etc/logrotate.d/ceph
Sage Weil
05:59 PM Revision cff705b5 (ceph): config: allow optional parameters
Yehuda Sadeh
05:59 PM Revision fdcc2413 (ceph): rbdtool: modify usage
Yehuda Sadeh
05:53 PM Revision de668159 (ceph): osd: document some of the logger stats
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:24 PM Revision 100b6776 (ceph): mds: do not include snapped inodes in EOpen or LogSegment::open_files
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:22 PM Revision 0f8099a0 (ceph): radosgw: fix usage syntax
That fixes bug #256. Yehuda Sadeh
05:13 PM Revision a497c616 (ceph): mds: print useful error on malformed EOpen event
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:10 PM Revision 1d450742 (ceph): Makefile: distribute check_version
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:10 PM Revision 29505805 (ceph): debian: simplify release.sh
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:10 PM Revision 26b67ff8 (ceph): Makefile: drop unnecessary dist-hook for make_version
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:27 PM Cleanup #258 (Resolved): make logger output rotatable, cleaner
Sage Weil
03:45 PM Bug #247 (In Progress): osd: allow separate interface for internal cluster traffic
I'm just going to begin by setting up the OSD code to accept and work with multiple messengers.
Other parts may be a...
Greg Farnum
03:28 PM CephFS Cleanup #236 (Resolved): cfuse: clean up mds session handling
Done in commit:74bd8d777df932922007ec69d3bd9b8dfe5c1394. Greg Farnum
12:50 PM Feature #257 (Resolved): osd: separte pg type Logger counts
Sage Weil
12:17 PM Feature #257 (Resolved): osd: separte pg type Logger counts
primary, replica, stray Sage Weil
12:01 PM CephFS Feature #235 (Resolved): cfuse: allow specifying server path to mount on command line
Done in commit:84bca03089673985ec9341ae73c53e23932b854b Greg Farnum
10:28 AM CephFS Bug #255 (Resolved): MDS crash while during journal replay
fixed by commit:100b6776ddb095c43cf20734b48e399d359d7b1b Sage Weil
04:39 AM CephFS Bug #255 (Resolved): MDS crash while during journal replay
While updating my cluster to the latest unstable i saw a crash on both my MDS'es.
The backtrace (same on both):
...
Wido den Hollander
10:11 AM Linux kernel client Bug #254 (Resolved): leaked authorizer memory
fixed by commit:22b1de06c9fe128ca3de72560c3e8c2cabf2927a Sage Weil
10:03 AM rgw Cleanup #256 (Resolved): Usage of radosgw_admin reports rgw_admin
Fixed with commit:0f8099a088cab90c49a4c016c5ec6505ae922d53.
Yehuda Sadeh
09:48 AM rgw Cleanup #256 (Resolved): Usage of radosgw_admin reports rgw_admin
... Wido den Hollander
 

Also available in: Atom