Project

General

Profile

Activity

From 04/29/2010 to 05/28/2010

05/28/2010

08:21 PM Revision 83094d97 (ceph): paxos: fix store_state fix
Sage Weil
07:59 PM Revision 62e290e8 (ceph): msgr: print bind errors to stderr
Sage Weil
07:56 PM Revision 6060bdd8 (ceph): rbd: some fixes to conform with qemy code style
Yehuda Sadeh
07:50 PM Revision 3a705ded (ceph): paxos: cleanup
Sage Weil
07:48 PM Revision 3c3e82e0 (ceph): paxos: only store committed values in store_state
The uncommitted value is handled specially by handle_last() Sage Weil
07:41 PM Revision 187011cd (ceph): initscript: fix typo with $lockfile stuff
Sage Weil
07:37 PM Revision 6b72d70b (ceph): paxos: set last_committed in share_state()
It wasn't getting set for LAST message, which broke recovery somewhat.
Broken by 8e76c5a1d827e01f77149245679bd00ba27...
Sage Weil
01:44 PM Linux kernel client Bug #162 (Can't reproduce): list bug during shrink_dcache_for_umount
ceph3, rsync workload.
unstable circa 5/25...
Sage Weil
01:12 PM Linux kernel client Bug #141 (Resolved): ERESTARTSYS on mds update operations cause bad results
Sage Weil
10:49 AM Linux kernel client Bug #141: ERESTARTSYS on mds update operations cause bad results
I assume that switching to wait_for_completion_killable() fixed this one?
related commit: 0ec773c7f9ecbff4b75c3c68...
Yehuda Sadeh
12:58 PM Bug #158 (Resolved): cmon silently fails if addr is wrong in ceph.conf (0.20.1)
fixed by commit:62e290e87fa2ce5b33a847e0837b2198bac6842b Sage Weil
08:42 AM Bug #158 (Resolved): cmon silently fails if addr is wrong in ceph.conf (0.20.1)
Sage Weil
12:47 PM Bug #161: Monitor crashes on begin
actually, commit:3c3e82e0f5feacef5f191a5ce34bf96c15fdaed5 Sage Weil
12:37 PM Bug #161 (Resolved): Monitor crashes on begin
fixed by commit:6b72d70be42823e32bb8bcec033ac3a62943e089 Sage Weil
11:39 AM Bug #161 (Resolved): Monitor crashes on begin
On an assert:
assert(begin->last_committed == last_committed);
(gdb) bt
#0 0x00007f39eacfdf45 in *__GI_raise (sig...
Greg Farnum
12:07 PM Linux kernel client Bug #148 (Resolved): iozone failure
yeah, this has survived 24 hours, whereas before it was failing after an hour or two. Sage Weil
12:00 PM Linux kernel client Bug #144: GPF at con_close_socket+0x40/0x9f
What was the specific scenario? Can it be reproduced? Yehuda Sadeh
11:40 AM Bug #159 (In Progress): cfuse abort on file delete (0.20.2)
Greg Farnum
08:43 AM Bug #159 (Resolved): cfuse abort on file delete (0.20.2)
Sage Weil
11:17 AM Linux kernel client Bug #150: order:1 page allocation failure
Too many dirty pages? Too many pending osd requests?
We should probably try to get how many osds requests were in-fl...
Yehuda Sadeh
11:07 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
nfs uses the rpc code, which, if I understand it correctly initializes a work queue for socket allocation and connect... Yehuda Sadeh
10:36 AM Feature #160 (Resolved): rbd revert-to-snapshot
Need to fully implement revert-to-snapshot functionality. Currently there's a partial implementation in the rbd-class... Yehuda Sadeh
12:12 AM Revision 8c448257 (ceph): osd: fix compilation
Yehuda Sadeh

05/27/2010

11:32 PM Revision 4b797745 (ceph): mds: fix null dn deref during anchor_prepare
Sage Weil
11:14 PM Revision bb8b1398 (ceph): mds: fix invalid use of connection
Greg Farnum
10:25 PM Revision 93804416 (ceph): mds: switch some session->inst send_message calls to session->connection;
switch an MDS broadcast from instance-based to Connection *-based send. Greg Farnum
10:02 PM Support #156: Example debug levels in sample.ceph.conf
Looks quite useful, thank you. There are also some logging directories available, e.g. logger sym. May those be add... Alex Nelson
09:49 PM Support #156 (Resolved): Example debug levels in sample.ceph.conf
Sage Weil
09:59 PM Revision 330e1e21 (ceph): osd: warn, don't crash, on purged_snaps shrinkage
Sage Weil
09:59 PM Revision a1a13502 (ceph): mkcephfs: pass -c to cmon --mkfs
Sage Weil
09:59 PM Revision 0a1d526b (ceph): osdmap: assert maxrep >= minrep
Sage Weil
09:59 PM Revision 594d4568 (ceph): osdmaptool: include raw, up, acting mappings
Sage Weil
09:59 PM Revision 892a0e25 (ceph): config: parse in $host from conf file
So you can do stuff like
log dir = /data/$host
Sage Weil
09:58 PM Revision d2c40055 (ceph): initscript: incorporate Josef's fedora fixes
Add 'status' command.
Add chkconfig line.
Do lockfile stuff only if /var/run/subsys exists.
Still specifying the run...
Sage Weil
09:31 PM Revision b83b0733 (ceph): rados: add op for rollback
Greg Farnum
09:31 PM Revision e935b8ec (ceph): osd: add rollback to ceph_osd_op_name
Greg Farnum
09:31 PM Revision 23336561 (ceph): osd: create _delete_head function, move CEPH_OSD_OP_DELETE handling to it.
Greg Farnum
09:31 PM Revision be1030d8 (ceph): rados: add snap.snapid to ceph_osd_op, to replace use of MOSDOp's snapid
Greg Farnum
09:31 PM Revision b82ba820 (ceph): osd: implement rollback functionality
Greg Farnum
09:31 PM Revision 91fb924a (ceph): objecter: add rollback_object function, which rolls back a single objec...
Greg Farnum
09:31 PM Revision 0292f2e6 (ceph): librados: add rollback_object functions.
Greg Farnum
09:31 PM Revision 9dd35584 (ceph): rados: add rollback functionality to rados
Greg Farnum
09:31 PM Revision bd9cf968 (ceph): osd: set clone_overlaps properly on rollback
Greg Farnum
09:31 PM Revision edffc122 (ceph): librados: update C header file to proper name for rollback function
Greg Farnum
09:31 PM Revision 7cc3ab62 (ceph): rados.h: should use __le64 instead of __u64
Yehuda Sadeh
09:26 PM Linux kernel client Bug #148: iozone failure
I think this may have been caused by the mds request signal handling? It isn't happening on the latest unstable. Sage Weil
09:24 PM Revision 08f69663 (ceph): ceph.spec: build-required libatomic_ops-devel, not libatomic_ops
And no perl-devel. Sage Weil
09:23 PM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
We could have a pool of preallocated sockets.. but that could be exhausted.
Or duplicate a bunch of socket creation ...
Sage Weil
06:23 PM Revision f95e1e0a (ceph): mds: add Connection * to Session
Greg Farnum
06:22 PM Revision 53523267 (ceph): Merge branch 'unstable' into msgr
Greg Farnum
04:34 PM CephFS Feature #45: Investigate adding Connection * to mds Session
Added Connection *; now testing my send_message conversions to make sure I'm not trying to use any Connection *s whil... Greg Farnum
02:57 PM Linux kernel client Bug #157 (Resolved): fix auth_x memory leak
fixed by 'ceph: fix leak of osd authorizer'. the osd_client put_osd() didn't clean up the ceph_authorizer. Sage Weil
01:14 PM Linux kernel client Bug #157 (Resolved): fix auth_x memory leak
this is on ceph1, qa loopall.sh workload, unstable branch.... Sage Weil
04:47 AM Revision a3dc4bda (ceph): sample.ceph.conf: include debug options, commented out
Sage Weil

05/26/2010

11:58 PM Revision 78375cfd (ceph): mon: add crush_rule data member to MPoolOp; use it in new pool creation...
Greg Farnum
11:58 PM Revision a9e17271 (ceph): objecter: add optional crush_rule parameter; set in pool_op_submit as n...
Greg Farnum
11:58 PM Revision 8044f7ac (ceph): librados: add crush_rule parameter to create_pool functions
Greg Farnum
11:58 PM Revision 05256bb0 (ceph): rados: you can now set the crush rule to use when creating a pool
Greg Farnum
09:54 PM Support #156 (In Progress): Example debug levels in sample.ceph.conf
We should still add a wiki page with debugging information. I can include info about debug options, and also other s... Sage Weil
09:53 PM Support #156 (Resolved): Example debug levels in sample.ceph.conf
good idea. commit:a3dc4bdac2057c2d0fcd27cab9c416c5089b4c76 Sage Weil
05:20 PM Support #156 (Resolved): Example debug levels in sample.ceph.conf
The debug options for ms, osd, etc. could afford to be listed in the sample.ceph.conf file, even commented out. Ther... Alex Nelson
09:47 PM Revision a92df208 (ceph): mds: include LAZYIO in CEPH_CAP_ANY set
Sage Weil
09:47 PM Revision a13b5b1c (ceph): mds: include LAYZIO cap in sync->mix and mix->sync transitions
Sage Weil
09:47 PM Revision 297d3ecd (ceph): client: update ioctl.h (lazyio, invalidate_range)
Sage Weil
09:47 PM Revision 648ce976 (ceph): mds: LAYZIO is not liked, but it is allowed
Sage Weil
09:35 PM Revision 9b4d25b9 (ceph): mon: detect and warn on clock synchronization problems;
change MMonPaxos::lease_expire to lease_timestamp Greg Farnum
09:35 PM Revision 75de2723 (ceph): mon: warn to log, not just dout, on clock drift
Greg Farnum
09:11 PM Revision bee74a1e (ceph): ceph: add conversion to qemu coding style
Hi Yehuda,
I've added a small hack to make push_to_qemu.pl convert tabs to spaces.
Christian
Christian Brunner
05:59 PM Revision a1c99811 (ceph): paxos: use helper to store committed state; fix master mon catch up usi...
The catch up logic in handle_last didn't handle the stashed state, so we
crashed and burned if it was the master that...
Sage Weil
05:01 PM Revision c0df916a (ceph): cfuse: bail out on mount() errors
Sage Weil
04:58 PM Feature #135 (Resolved): Specify crush rules
Added crush_rule parameters/data members as appropriate to OSDMonitor pool creation functions, objecter, librados, an... Greg Farnum
02:47 PM Feature #33: O_LAZY or equivalent
Sage Weil
02:32 PM Feature #105 (Resolved): mon: warn on clock drift
It warns to dout and the logger:
1) when the slave notices the leader is behind by >(mon_lease - latency), or
2) wh...
Greg Farnum
11:33 AM CephFS Bug #52: mds: dentry versionlock
merged into unstable Sage Weil
11:32 AM Bug #37 (Rejected): osd: recover missing clone object
this could have been related to the osd recovery fixes (wrt snapdir). haven't seen this in weeks. dropping it for now. Sage Weil
11:03 AM Feature #112: osd: snap rollback object op
Greg Farnum
11:01 AM Bug #151 (Resolved): cmon crash in PGMonitor::update_from_paxos at mon/PGMonitor.cc:90
fixed by commit:a1c99811bae2199a4ef3eef8681ac70ccfa128f5 Sage Weil
06:39 AM Bug #151 (Resolved): cmon crash in PGMonitor::update_from_paxos at mon/PGMonitor.cc:90
one of my 3 monitors crashed today, the whole ceph cluster was idle at that time.
cmon compiled at f7708dea1f, ple...
ar Fred
10:44 AM Documentation #155 (Resolved): document ceph auth
Sage Weil
10:14 AM Feature #154 (Closed): support IPv6 addresses
most of the infrastructure is there... Sage Weil
10:02 AM Bug #152 (Resolved): cfuse problem
fixed by commit:c0df916a790f9560d487c74c22152a7e16e6f226 Sage Weil
06:50 AM Bug #152 (Resolved): cfuse problem
Hi,
compiled at 7ecf493fd2c
it seems cfuse starts fuse when it fails connecting to a monitor (which is dead in ...
ar Fred
10:00 AM Bug #145: Check build dependencies for FastCGI
Will any adjustments be needed to check for required header files also in a subdirectory like "fastcgi"?
Does a conf...
Markus Elfring
09:54 AM CephFS Bug #153 (Resolved): mds: fix snap dentry replication vs readdir on frag auth
The request may be something like #123//foo/some/path/to/dir, dir lives in the stray dir, and is auth on another node... Sage Weil

05/25/2010

11:40 PM Revision 32d34f06 (ceph): Merge branch 'lazyio' into unstable
Conflicts:
src/mds/locks.c
Sage Weil
09:44 PM Revision e6b9055f (ceph): interval_set: fix union_of, intersection_of size accounting
Sage Weil
08:47 PM Revision 2b9ef644 (ceph): init-ceph: use = not == for comparison operator
Sage Weil
08:13 PM Revision fc228b5b (ceph): Merge branch 'mds_dentries' into unstable
Sage Weil
08:01 PM Revision 701d2672 (ceph): mds: better debugging on rmdir
Sage Weil
08:01 PM Revision 29ca21f5 (ceph): mds: fix scatterlock gather, writebehind
We stopped overloading the virutal is_updated() when we renamed to
is_dirty.
broken by 7f19ee1ac36095cd4d4c169858d93...
Sage Weil
03:42 PM Linux kernel client Bug #143 (Resolved): avoid resending requests on mon ticket renewal
fixed by 'ceph: do not resend mon requests on auth ticket renewal' and 'ceph: renew auth tickets before they expire' Sage Weil
02:37 PM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
What it actually means is that sock_alloc_inode is being called under the kswapd context and it does an allocation wi... Yehuda Sadeh
10:34 AM Linux kernel client Bug #147 (Resolved): lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and c...
... Sage Weil
02:15 PM Cleanup #146: Complete build options for Pthread API
If there is an environment where the -lpthread isn't sufficient, sure. Send a patch! :) Sage Weil
06:15 AM Cleanup #146 (Rejected): Complete build options for Pthread API
Would you like to combine "your check for this programming interface":http://ceph.newdream.net/git/?p=ceph.git;a=blob... Markus Elfring
02:07 PM Bug #145: Check build dependencies for FastCGI
Something nonstandard with Suse then? On debian it's
fatty:src 02:05 PM $ dpkg -S /usr/include/fcgiapp.h
libfcgi...
Sage Weil
01:41 AM Bug #145: Check build dependencies for FastCGI
I wonder why a header is not found because the file "/usr/include/fastcgi/fcgiapp.h" is available from the package "F... Markus Elfring
02:01 PM Linux kernel client Bug #150 (Can't reproduce): order:1 page allocation failure
workload was rsync to a ceph mount.
ceph3 mounting cosd0:/
not sure which version. probably unstable from last wee...
Sage Weil
11:09 AM Bug #149: Stale NFS Handle when copying from snapshot
Reproduced on version:
kclient: current unstable branch (240ed68eb567d80dd6bab739341999a5ab0ad55d)
server: current ...
Yehuda Sadeh
11:05 AM Bug #149 (Closed): Stale NFS Handle when copying from snapshot
Happens in the following scenario:
mount ceph
cd /mnt
mkdir a; cd a
tar xvfj ~/linux-2.6.xx.tar.bz2 (^C after a f...
Yehuda Sadeh
10:37 AM Bug #134 (Resolved): rbdtool segfaults when listing
Might have been due to protocol change in the pool-op that didn't get a protocol version bumped up. Resolving it unti... Yehuda Sadeh
10:35 AM Linux kernel client Bug #148 (Resolved): iozone failure
on ceph4, running
* rbd 3a6e756 ceph-rbd: snapshots support...
Sage Weil
10:32 AM Linux kernel client Bug #106 (Resolved): msgpool depletion?
Sage Weil
10:28 AM Linux kernel client Bug #106: msgpool depletion?
On what version did it happen? Do we have any reproducible scenario? Yehuda Sadeh

05/24/2010

11:51 PM Revision f8f9e6c4 (ceph): mds: make export targets stay in mdsmap for a while
This limits the mdsmap churn some. Keep old targets around for at least
min-max iterations before removing them.
Sage Weil
11:51 PM Revision 7f0ef1cd (ceph): mds: balancer cleanup
Sage Weil
11:50 PM Revision da42d061 (ceph): mds: warn on dn release that dne
Sage Weil
11:11 PM Revision 06b86ea4 (ceph): rbd: modify rbd on-disk header
Yehuda Sadeh
10:58 PM Revision 7cf48614 (ceph): rbd: fix push_to_qemu.pl
Yehuda Sadeh
10:56 PM Revision be082f0d (ceph): filestore: make mkfs() zap any file or dirs it finds
Sage Weil
10:56 PM Revision 7113775b (ceph): mon: roll mkmonfs functionality into cmon --mkfs
Sage Weil
10:55 PM Revision 5e8a6096 (ceph): rbd: modify header, add utility to ease sync with qemu tree
Yehuda Sadeh
09:00 PM Revision a9b494c4 (ceph): mon: no need for 'whoami' file in store
The monitor rank is provided during startup. No need to verify it against
the monitor store, especially since the st...
Sage Weil
09:00 PM Revision 0d98fc6f (ceph): osd: keep recovery ops in sync with pull
Call start_recovery_op from pull() instead of fixing every caller (some
were wrong). This keeps the recovery state i...
Sage Weil
03:31 PM Bug #132 (Resolved): slow mon recovery after operating degraded for too long
fixed by commit:bf1cb87d255b88d8e06b2988b6700e400ceb1b92 and commit:357aa0334436da79065dc67b270ff78f8899493f Sage Weil
03:30 PM Cleanup #121 (Resolved): roll mkmonfs functionality into cmon
commit:752a0fd5630aba92dedc3bb30fccec0ec837fa59 Sage Weil
02:25 PM Bug #133 (Resolved): mds crash on snapshot
The crash I saw here (related to an anchor table lookup) is fixed by commit:51c5823472ef8208c1b7a6b094f1655ccdc1190e
Sage Weil
02:23 PM Bug #145: Check build dependencies for FastCGI
Hmm, there is a rule in configure.ac checking for FCGX_Init. Is that rule broken, or is checking FCGX_Init insuffici... Sage Weil
02:14 PM Bug #145 (Resolved): Check build dependencies for FastCGI
I stumble on the following messages for my compilation try.... Markus Elfring

05/23/2010

10:13 PM Revision 56c4043a (ceph): reword blacklisted output so it's clearly discussing MDSes and not OSDs
Greg Farnum

05/22/2010

04:56 PM Revision f7708dea (ceph): uclient: don't unlink null dentry when getting null linkage in mds reply
This broke semi-recently when the mds started returning null linkages (and
associated leases).
Sage Weil
10:02 AM Bug #140 (Resolved): Cfuse crashes when mv-ing a file
fixed by commit:f7708dea1f2db5d3be31ddc2aaf1500e1d50746d
Sage Weil
09:47 AM Bug #140 (In Progress): Cfuse crashes when mv-ing a file
Sage Weil

05/21/2010

11:17 PM Revision bf1cb87d (ceph): mon: trim pgmap states even when we don't have a full quorum
Sage Weil
11:17 PM Revision 357aa033 (ceph): paxos: recover using stashed latest when state histories don't overlap
If we don't have incremental states to catch up, jump to the latest. Sage Weil
09:55 PM Revision 51c58234 (ceph): mds: anchor multiversion inode before unlinking it
If we are going to create a remote dentry linking to a multiversion inode
we're unlinking, make sure it's anchored!
...
Sage Weil
08:44 PM Revision fbbff743 (ceph): librados.h: add other TMAP definitions
also add a comment in rados.h about the defines in librados.h Yehuda Sadeh
07:08 PM Revision 3ec46059 (ceph): monc: hunting by default
Otherwise if we fail to connect to the first mon we try, we never retry. Sage Weil
07:08 PM Revision cd1b0710 (ceph): monc: pick a different mon when repicking
Sage Weil
07:08 PM Revision 929048f9 (ceph): mds: fix readdir pingpong on snapped dir with multiple mds
Our traverse helper will follow the auth if we're looking at snapped
metadata, but we _don't_ want that for readdir b...
Sage Weil
01:39 PM Linux kernel client Bug #141: ERESTARTSYS on mds update operations cause bad results
It seems pretty important to me that users be able to abort MDS requests -- if for some reason part of the filesystem... Greg Farnum
08:53 AM Linux kernel client Bug #141 (Resolved): ERESTARTSYS on mds update operations cause bad results
- process does a create
- gets signal and returns ERESTARTSYS before reply comes back
- kernel retries the operatio...
Sage Weil
12:56 PM Linux kernel client Cleanup #142 (Resolved): reuse message for mon subscribe
Sage Weil
12:20 PM Linux kernel client Cleanup #142 (Resolved): reuse message for mon subscribe
no need to allocate a fresh message each time around Sage Weil
12:50 PM Linux kernel client Bug #144 (Can't reproduce): GPF at con_close_socket+0x40/0x9f
... Sage Weil
12:31 PM Linux kernel client Bug #143 (Resolved): avoid resending requests on mon ticket renewal
Sage Weil
12:22 PM Linux kernel client Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
Sage Weil
12:22 PM Linux kernel client Bug #66 (In Progress): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
Sage Weil
08:58 AM Linux kernel client Bug #139: BUG ceph_dentry_info: Objects remaining on kmem_cache_close()
Looks this isn't fixed after all (see #63). Maybe a dentry is allocated but never added to the dcache? Sage Weil
02:08 AM Linux kernel client Bug #139 (Resolved): BUG ceph_dentry_info: Objects remaining on kmem_cache_close()
After unmounting my Ceph filesystem and removing my kernel module i got the following message:... Wido den Hollander
02:33 AM Bug #140 (Resolved): Cfuse crashes when mv-ing a file
Hello,
I try to set up a little test ceph cluster, based on the testing branch.
I encounter a problem using the c...
Jean Adrien

05/20/2010

11:19 PM Revision 16ab067c (ceph): librados: update librados to define CEPH_OSD_TMAP_SET
Yehuda Sadeh
06:14 PM Revision 0050dd84 (ceph): mon: fix mon injectargs, and simplify
Sage Weil
05:58 PM Revision 9e4e53e0 (ceph): osd: simplify --mkjournal, add --flush-journal
Sage Weil
05:39 PM Revision 41b26060 (ceph): Merge branch 'osd_snapdir' into unstable
Sage Weil
05:39 PM Revision 1d9ab261 (ceph): osd: nicer debug output
Sage Weil
05:33 PM Revision f3ab812b (ceph): interval_set: fix union_of _size accounting; optimize ==
Sage Weil
01:50 PM Bug #98: reserved identifier violation
Markus Elfring wrote:
> I suggest to append a kind of "UUID":http://en.wikipedia.org/wiki/Universally_Unique_Identif...
Sage Weil
01:49 PM Bug #98: reserved identifier violation
The header ifdef guards should definitely be fixed. I think
#ifndef CEPH_FOO_H
#define CEPH_FOO_H
should be s...
Sage Weil
10:59 AM Cleanup #137 (Resolved): osd: --apply-journal, --mkjournal?
Sage Weil
10:40 AM Bug #76 (Resolved): osd: snapdir object recovery doesn't work
Sage Weil
09:08 AM Cleanup #67: add 'autoscan' items to configure.ac
Would you like to integrate any changes from the appended update suggestion into your source code repository? Markus Elfring

05/19/2010

11:43 PM Revision e162aab3 (ceph): mds: fix interval_set copy of projected_free
Sage Weil
11:42 PM Revision 155efe24 (ceph): mds: fix interval_set copy of projected_free
Sage Weil
11:11 PM Revision f6c48274 (ceph): osd: use blank reqid for snapdir events, too
Make reqid_is_indexed() less weird. Sage Weil
11:10 PM Revision 8dfe74f6 (ceph): osd: do not index by reqid if reqid not defined
Sage Weil
10:42 PM Revision 4ed3acb0 (ceph): osd: update purged_snaps in PG::Info on trim completion; and replicate
Sage Weil
08:17 PM Revision 9149dfa1 (ceph): rbd: fix snap_seq type in rbd_header
Yehuda Sadeh
06:49 PM Revision 99690f63 (ceph): initscripts: remove 'flushoncommit' from default btrfs mount options
Sage Weil
05:53 PM Revision 46891dd0 (ceph): osd: trim snaps via replicated osd ops
Sage Weil
05:42 PM Revision ff94c3a4 (ceph): osd: make build_removed_snaps, is_removed_snap consistent
Sage Weil
05:35 PM Revision e0315485 (ceph): rados: update documentation to mention mkpool and rmpool
Greg Farnum
03:25 PM Feature #138 (Resolved): Try out tcmalloc
The cosd daemon seems to eat up fragmented memory or something, since heap size stays fairly consistent but top memor... Greg Farnum
02:20 PM Cleanup #137 (Resolved): osd: --apply-journal, --mkjournal?
onetime style commands (do something, then exit), ala --mkfs. Sage Weil
11:39 AM Feature #136 (Rejected): sensible grammar for monitor commands
- clean up commands so the syntax is more intuitive
- some sort of help dump (full or partial?) grammar so you can s...
Sage Weil
10:46 AM Feature #135 (Resolved): Specify crush rules
The ability to create many CRUSH rules isn't very helpful if you can't use them all! Implement all appropriate ways t... Greg Farnum
05:58 AM Bug #134 (Resolved): rbdtool segfaults when listing
I'm experiencing a segfault with rbdtool when listing after a fresh mkcephfs.... Wido den Hollander

05/18/2010

11:00 PM Bug #130: Build needs more configuration files.
> see commit:a7769755c18882a259af6b8756f227bf2e71561e
I can not see it so far. When will this change be also publi...
Markus Elfring
09:41 AM Bug #130 (Resolved): Build needs more configuration files.
Ah, that makes sense. Thanks! Sage Weil
09:30 AM Bug #130: Build needs more configuration files.
It seems that the error messages about missing files will be displayed if somebody like me tries to regenerate the sc... Markus Elfring
08:03 AM Bug #130: Build needs more configuration files.
I added the automake args, and fixed the INCLUDES thing, but I don't see the other errors (on automake 1.10.1). and ... Sage Weil
06:18 AM Bug #130 (Resolved): Build needs more configuration files.
I would like to update the "build configuration template":http://ceph.newdream.net/git/?p=ceph.git;a=blob;f=configure... Markus Elfring
10:04 PM Revision ee218a1e (ceph): osd: fix peer_info updates on active primary
Sage Weil
07:18 PM Revision baba34bc (ceph): msgr: remove unused utime_t now
Greg Farnum
03:01 PM Revision a7769755 (ceph): automake: some clean up
Sage Weil
02:40 PM Bug #133 (Resolved): mds crash on snapshot
(09:45:41 AM) wido: i'm experiencing a MDS crash regarding snapshots.
(09:46:11 AM) wido: Test case: download the Li...
Sage Weil
12:20 PM rgw Support #8 (Closed): Document differences from S3
It's on the wiki, now, anyway. Greg Farnum
11:49 AM Linux kernel client Feature #23: fcntl/flock advisory lock support
Ahah, file_lock's fl_nspid pointer isn't filled in before calling the filesystem's lock handlers. I've fixed that so ... Greg Farnum
10:03 AM Linux kernel client Feature #23 (In Progress): fcntl/flock advisory lock support
Found some issues with recovery after all; working on them now. Greg Farnum
09:03 AM Bug #132 (Resolved): slow mon recovery after operating degraded for too long
need to trim even when degraded. and make sure recovery works using stashed latest. Sage Weil
08:37 AM Linux kernel client Feature #19 (Resolved): rbd
Sage Weil
08:35 AM Feature #71 (Resolved): msgr: throttle incoming messages
Sage Weil
08:04 AM Feature #131 (Resolved): bring wireshark plugin is up to date
Sage Weil

05/17/2010

11:29 PM Revision 1ea0f858 (ceph): poolop: fix MPoolOpReply decoding
Yehuda Sadeh
09:53 PM Revision 736d837e (ceph): throttle: allow large items if we're under our max
Normally we stay under max, but for large items, take it as long as we're
currently below max. This avoids deadlock.
Sage Weil
04:14 PM Bug #74 (Resolved): make removed_snaps contiguous
Sage Weil
03:43 PM Linux kernel client Feature #23 (Resolved): fcntl/flock advisory lock support
It should support flock and fcntl locks now. Currently there are no caps for this, so all locking requests are routed... Greg Farnum
02:04 PM Linux kernel client Bug #63 (Resolved): dentry_info slab not empty
Sage Weil
11:30 AM Linux kernel client Bug #63: dentry_info slab not empty
hopefully fixed by commit:7a597c3f4aa58d30d1236b1c1bf980e28a899578 Sage Weil
01:40 PM Linux kernel client Feature #26: statlite
http://marc.info/?t=123908749900004&r=1&w=2
http://marc.info/?t=123914651100002&r=1&w=2
http://marc.info/?l=linux-f...
Sage Weil

05/16/2010

03:55 AM Revision d006ae93 (ceph): osd: purged_snaps in PG::Info, queue snap trim on primary
Sage Weil

05/14/2010

09:57 PM Linux kernel client Bug #38 (In Progress): rm -r failure
Sage Weil
09:57 PM Linux kernel client Bug #38: rm -r failure
Sage Weil
09:18 PM Revision 47ba928b (ceph): osd: include snapdir objects in pg log for proper replication, recovery
Sage Weil
06:36 PM Revision ffd72a98 (ceph): strings: clean up pool op names
Sage Weil
06:32 PM Revision d1c78fcb (ceph): mds: allow readdir result limit in bytes
This will allow the client to bound the size of the reply it gets Sage Weil
04:43 PM Revision 7be27f43 (ceph): debian: put proper distribution in debian changelog
Sage Weil
04:14 PM Revision 33bf1a2b (ceph): version: use next version ~rc for unstable branch
This makes unstable always sort after stable, testing releases:
0.21~rc-unstable... > 0.20.1-testing...
Sage Weil
01:21 PM Linux kernel client Bug #126 (Resolved): qemu rbd driver doesn't work with virtio
Fixed, pushed. Yehuda Sadeh
12:01 PM Cleanup #129 (Rejected): msgr: separate message encoding into sections
instead front, middle, data, just break the message encoding into N sections, identified by some integer. this makes... Sage Weil
11:31 AM rbd Feature #41: Support snapshots
Read-only snapshots are now implemented on the kernel client. Still need to have the kvm-rbd implementation. Yehuda Sadeh
09:24 AM Linux kernel client Bug #127 (Resolved): fix r_aborted locking
Sage Weil
09:10 AM Linux kernel client Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
Sage Weil

05/13/2010

08:28 PM Revision 0e177d28 (ceph): radosgw_admin: die after first bad argument
Sage Weil
05:47 PM Revision c54d6cde (ceph): objecter: separately track pgmap, osdmap state machine version
Mixing these up can make our request hang on the monitor indefinitely. Sage Weil
05:47 PM Revision 52e544bf (ceph): mon: return correct state machine epoch in replies
Sage Weil
05:46 PM Revision ecc4f686 (ceph): testrados: fix aio api usage
Sage Weil
05:46 PM Revision c3a8adaa (ceph): librados: implement rados_stat_pool()
Sage Weil
01:04 PM Linux kernel client Bug #127 (In Progress): fix r_aborted locking
Sage Weil
11:10 AM Linux kernel client Bug #127: fix r_aborted locking
no, there needs to be some locking. if we abort and return to the caller, we need to know that fill_trace isn't doin... Sage Weil
08:08 AM Linux kernel client Bug #127 (Resolved): fix r_aborted locking
r_aborted is protected by mdsc->mutex (along with r_reply, r_err), but it tested during fill_trace under s_mutex. is... Sage Weil
10:50 AM Feature #128 (Resolved): librados: implement get pool stats
Sage Weil
09:22 AM Feature #128 (Resolved): librados: implement get pool stats
Sage Weil
07:57 AM Linux kernel client Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
here are the final two crashes i got on this, presuambly because i had the wrong version of the module loaded:
<pr...
Sage Weil
07:28 AM Linux kernel client Bug #66 (In Progress): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
hit this again, on commit:e84346b726ea90a8ed470bc81c4136a7b8710ea5
workload was kernel compilation....
Sage Weil

05/12/2010

11:13 PM Revision 4ee1e261 (ceph): Merge branch 'unstable' of ceph.newdream.net:git/ceph into unstable
Sage Weil
11:13 PM Revision e741d43f (ceph): rados: fix typo
Sage Weil
11:06 PM Revision 342fc871 (ceph): osd: add CEPH_PG_MAX_SIZE to header
Sage Weil
11:06 PM Revision 7f43cf8a (ceph): filestore: update btrfs ioctl.h
Sage Weil
11:06 PM Revision 856bdf2f (ceph): client: un-"fix" u64 types in client/ioctl.h
Sage Weil
11:06 PM Revision c9b1aee1 (ceph): msgr: fix possible overflow when sending seq
Sage Weil
11:06 PM Revision 65074e5a (ceph): msgr: print message encoding version to aid debugging
Sage Weil
10:10 PM Revision a902bf02 (ceph): cmpxattr: null termination fixes
Yehuda Sadeh
10:10 PM Revision 80dcc28a (ceph): ceph_fs.h: checkpatch fixes
Yehuda Sadeh
10:10 PM Revision afa1993e (ceph): ceph_strings: checkpatch fix
Yehuda Sadeh
06:55 PM Revision 07fdde4b (ceph): rados: add 'tmap dump'
Sage Weil
06:55 PM Revision b5029166 (ceph): osd: set obs.exists in projected object state during recovery
Sage Weil
06:55 PM Revision dbcb4f82 (ceph): mds: warn, don't crash, on trailing garbage in dir objects
Sage Weil
03:36 PM Linux kernel client Bug #126 (Resolved): qemu rbd driver doesn't work with virtio
rbd images work fine with virtio off, but with it on I just get:... Andrew F
12:03 PM Bug #122 (Resolved): msgr: msgvec should go on heap
Sage Weil
11:26 AM Feature #125 (Resolved): log rotation
Sage Weil
04:30 AM Revision 8fbabe03 (ceph): msgr: put msgvec on heap
It can get too big for the stack. Sage Weil
04:14 AM Revision 876a0ccd (ceph): msgr: tolerate incoming seq #'s that skip ahead
This is necessary because the kclient may pull messages out of the out/sent
queues, and we can't renumber previously ...
Sage Weil

05/11/2010

09:46 PM Revision a576e6e2 (ceph): ceph: return error code returned by server
Sage Weil
09:43 PM Cleanup #124 (Resolved): msgr: change protocol handshake to exchange in_seq
This will allow peer to only requeue sent messages that weren't actually received. No need to resend stuff that will... Sage Weil
09:42 PM Linux kernel client Bug #123 (Resolved): fix msgr message retry seq numbering
fix:
- we now allow seq #'s to jump forward
- we only assign seq # once after ceph_con_send'ing a message. if it r...
Sage Weil
09:07 PM Linux kernel client Bug #123 (Resolved): fix msgr message retry seq numbering
we currently assign seq #'s when we send the message over the wire. this numbering breaks when we reconnect because ... Sage Weil
08:28 PM Revision 12f7c0b4 (ceph): mds: drop 'closed' bit from MClientReconnect
Sage Weil
08:13 PM Revision 7a23b5fb (ceph): msgr: set outgoing msg connection before encoding
This allows encode_payload to adjust behavior based on the target peer's
feature bits.
Sage Weil
04:59 PM Revision f6c2e1c4 (ceph): poolop: make new encoding backward compatible
This makes cea221c64 behave when messages using the old encoding (that is,
older versions of the client talk to us).
Sage Weil
04:48 PM Bug #122 (Resolved): msgr: msgvec should go on heap
it can be too large for the stack for big messages Sage Weil
04:36 PM Revision 0ebf2599 (ceph): osd: fix layout return type
Sage Weil
04:36 PM Revision ffc3e63f (ceph): osd: fix compile error from cmpxattr, cleanup.
I think the xattr bufferlist still needs to be null terminated... Sage Weil
04:36 PM Revision 4a7118e2 (ceph): msgr: be less noisy about msgr throttling
Sage Weil
01:40 PM Cleanup #58 (Resolved): kill nstring/cstring, use std::basic_string instead
Sage Weil
12:49 PM Linux kernel client Bug #107 (Resolved): lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
fixed by 'ceph: fix cap removal race' commit:d855b8010914b52d8dd596f6d22c162bf81ccf21 Sage Weil
11:56 AM Linux kernel client Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
finaly caught it!... Sage Weil
11:43 AM Linux kernel client Cleanup #113 (Resolved): audit mds_client locking, esp reply handler
Sage Weil
11:42 AM Linux kernel client Bug #116 (Resolved): can we drop user. xattr prefix for magic ceph xattrs?
yes. see 'ceph: use ceph. prefix for virtual xattrs'. Sage Weil
10:54 AM Cleanup #121 (Resolved): roll mkmonfs functionality into cmon
We can just do 'cmon --mkfs ...' instead of 'mkmonfs', similar to how the cosd initializes its local storage. Sage Weil
10:41 AM Bug #120 (Resolved): monitor cluster expansion broken
Sage Weil

05/10/2010

11:18 PM Revision f857a2e1 (ceph): osd: add cmpxattr op handling
Yehuda Sadeh
11:00 PM Revision 4d667dde (ceph): debian: remove pull.sh calls from helper .sh scripts
Sage Weil
10:53 PM Revision 324fe827 (ceph): atomic: cast away const on read()
(Only needed to build on lenny, this can go away someday) Sage Weil
10:10 PM Linux kernel client Cleanup #113 (In Progress): audit mds_client locking, esp reply handler
Sage Weil
03:25 PM Linux kernel client Cleanup #113: audit mds_client locking, esp reply handler
see also #66 Sage Weil
10:09 PM Linux kernel client Bug #64: crash in handle_mds_map (corrupt s_waiting list?)
fixed by commit:1c0806d2caacc683c56a587eaf1502769a7c0698 Sage Weil
04:35 PM Linux kernel client Bug #64 (Resolved): crash in handle_mds_map (corrupt s_waiting list?)
fixed by 'ceph: fix locking, error paths when waking reconnect requests' Sage Weil
10:09 PM Linux kernel client Bug #66: BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
fixed by commit:9abf82b8bc93dd904738a71ca69aa5df356d4d24 Sage Weil
04:34 PM Linux kernel client Bug #66 (Resolved): BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
fixed by 'ceph: fix locking, error paths when waking reconnect requests' Sage Weil
03:24 PM Linux kernel client Bug #66: BUG_ON(req->r_reply) at fs/ceph/mds_client.c:1841!
unable to reproduce... but, see #113 Sage Weil
10:04 PM Bug #120 (Resolved): monitor cluster expansion broken
The wiki procedure at http://ceph.newdream.net/wiki/Monitor_cluster_expansion does not work. It crashes with:
<pr...
Sage Weil
08:40 PM Revision 99cdd525 (ceph): osd: 'stop' command
Sage Weil
04:44 PM Linux kernel client Feature #119 (New): avoid looping connect/retry errors on console
we should try to avoid filling up logs with stuff like this:... Sage Weil
04:35 PM Linux kernel client Bug #78 (Resolved): bdi_init list bug
Sage Weil
04:13 PM Linux kernel client Feature #18 (Resolved): reconnect fixups
Sage Weil
03:42 PM CephFS Feature #118 (Rejected): kclient: clean pages when throwing out dirty metadata on session teardown
see 'ceph: throw out dirty caps metadata, data on session teardown' Sage Weil
03:25 PM Linux kernel client Bug #50 (Resolved): osd timeout reset leaves some ops hanging
Sage Weil
10:31 AM Linux kernel client Bug #50: osd timeout reset leaves some ops hanging
Sage Weil
10:31 AM Linux kernel client Bug #50: osd timeout reset leaves some ops hanging
finally found this, fixed by commit:77eb74b92fee7340d104b24a9ee2800196b0f140 Sage Weil
03:23 PM Bug #117 (Rejected): osd: lone osd might not notice new peers of simultaneously marked down
Sage Weil
10:35 AM Bug #117 (Rejected): osd: lone osd might not notice new peers of simultaneously marked down
see sepia.a's osd6 epoch 3206 Sage Weil

05/07/2010

10:57 PM Revision d37c9dc4 (ceph): mds: fix max_size==1 bug
We should provide a minimum of one stripe unit. Sage Weil
10:00 PM Revision 4d2d5d03 (ceph): Merge branch 'string' into unstable
Conflicts:
src/client/Client.h
src/messages/MAuthReply.h
src/messages/MClientLease.h
src/osd/ReplicatedPG.cc
src...
Sage Weil
09:55 PM Revision 6f00c8f7 (ceph): types: clean up printf format warnings
bah Sage Weil
09:50 PM Revision 2741e183 (ceph): ceph.spec.in: --without-hadoop --without-debug
to sync up with f13 Sage Weil
09:50 PM Revision 737b5043 (ceph): endian: simplify __le* type hackery
Instead of preventing linux/types.h from being included, instead name
our types ceph_le*, and remap using #define _on...
Sage Weil
09:50 PM Revision edc92490 (ceph): types: standardize on uint64_t
The problem is that on some platforms __u64 == uint64_t (x86_64), and on
others it's doesn't (ppc64). Which means we...
Sage Weil
09:50 PM Revision 0aa2208f (ceph): rgw: clean up dup #define
Sage Weil
08:49 PM Revision 91b9f01a (ceph): rbdtool: --list on empty pool returns correct message
and also command doesn't hang. This fixes bug #115. Yehuda Sadeh
06:58 PM Revision 404938c4 (ceph): time: include _ between date and time
Sage Weil
06:58 PM Revision a8579ebd (ceph): osd: remove bad assert in get_object_context()
This has always been fishy, and it's original purpose isn't quite clear.
The '!obc->obs.ssc' is clearly the create ca...
Sage Weil
06:58 PM Revision 906d3e67 (ceph): mds: always release snap cap after flushsnap
The old condition makes no sense. It appears to have been broken since
2008, commit a5d5e679? Yikes.
The problem i...
Sage Weil
04:42 PM Feature #93 (Closed): osd: how to tell osd to cleanly shut down, via monitor
ceph osd tell 0 stop
commit:5e4700aeb03a6771df7f8b36f8f4cb1619fe9814
Sage Weil
04:34 PM Revision 76e40e3d (ceph): osd: don't put() null ctx->op (message)
Sage Weil
04:14 PM Bug #110 (Rejected): utime_t should be UTC, not local time
Sage Weil
10:27 AM Bug #110: utime_t should be UTC, not local time
it is utc; its converted to local time only when printed. maybe the print method should include a tz abbrev? Sage Weil
04:05 PM Revision 8580aa59 (ceph): scripts: fix check_pglog.sh
Sage Weil
03:59 PM Bug #59 (Resolved): max_size of 1
finally found this, fixed by commit:d37c9dc4958398fb327d9e36b99d60319b09af48 Sage Weil
03:52 PM Revision fe7a8be1 (ceph): script: add check_pglog.sh
Check pglog for zero segments, normally caused by a race between
pg log replacement (rm log, rewrite log) vs append r...
Sage Weil
03:44 PM Cleanup #58: kill nstring/cstring, use std::basic_string instead
you can't give std::string() a NULL char * ... hopefully i found them all! Sage Weil
03:01 PM Linux kernel client Bug #116 (Resolved): can we drop user. xattr prefix for magic ceph xattrs?
Sage Weil
01:43 PM Linux kernel client Bug #115 (Resolved): rbdtool --list on empty pool should return correct message
Fixed, pushed to unstable. Yehuda Sadeh
12:39 PM Linux kernel client Bug #115: rbdtool --list on empty pool should return correct message
Here's the output:... Wido den Hollander
12:03 PM Linux kernel client Bug #115 (Resolved): rbdtool --list on empty pool should return correct message
It currently returns some weird error.
Reported by wido.
Yehuda Sadeh
10:26 AM Bug #114 (Resolved): osd: corrupted pglog
seen by fract.
probably a reordering of write_log vs append_log, similar to commit:686192a03eb7d28c7340dcab6d96715...
Sage Weil
09:51 AM Linux kernel client Cleanup #113 (Resolved): audit mds_client locking, esp reply handler
what does mdsc->mutex protect? s_mutex? which protects requests? Sage Weil
09:39 AM Linux kernel client Bug #65 (Resolved): crash in tcp_sendpage
may have also been related to #109.
closing this one, since we haven't seen it in a while.
Sage Weil
09:38 AM Linux kernel client Bug #109: kernel bugs out with bad osd caps
osd errors weren't unregistering the request. fixed by commit:a40355b39e006459b1ffba052c53084d20d64209 Sage Weil
09:37 AM Linux kernel client Bug #109 (Resolved): kernel bugs out with bad osd caps
Sage Weil
08:22 AM Feature #112 (Resolved): osd: snap rollback object op
roll back and object's value to specified snap. should be efficient so that if it hasn't changed, only object metadat... Sage Weil
08:19 AM Linux kernel client Bug #111 (Resolved): handle EAGAIN from osd
currently we just return this to the caller, when we should retry.
not that the osd returns this very often (ever?)
Sage Weil

05/06/2010

10:30 PM Revision a290fc57 (ceph): mdsmap: lose angle braces, use tabs
Sage Weil
10:27 PM Revision f0e6af93 (ceph): osdmap: initially osds don't exist; create them as they boot
Sage Weil
09:04 PM Revision dc24ab0d (ceph): osd: resurrect CEPH_OSD_EXISTS flag
Mainly so we can track how many osds are in the cluster, even if they
don't start at 0.
Sage Weil
06:31 PM Revision 66350d2b (ceph): osd: clear pg REPLAY when we reinitiate peering
Among other things, a stray/bad REPLAY flag can trigger a false activation
like so:
osd/PG.cc: In function 'void PG:...
Sage Weil
06:30 PM Revision 1959f874 (ceph): logger: be more careful calculating variance
Sage Weil
06:30 PM Revision e9cc4a37 (ceph): librados: separate callbacks for aio ack/complete and safe
Sage Weil
06:30 PM Revision a8fa5286 (ceph): debian: use 'testing' or 'unstable' in place of 'git' in version suffix
Sage Weil
03:35 PM Bug #108 (Closed): osdmap: fix up exists, make osd count in summary accurate
commit:dc24ab0dac695c241c5828ebab250865b44a9a60 Sage Weil
10:57 AM Bug #108 (Closed): osdmap: fix up exists, make osd count in summary accurate
Sage Weil
02:40 PM Bug #110 (Rejected): utime_t should be UTC, not local time
Sage Weil
02:37 PM Linux kernel client Bug #109: kernel bugs out with bad osd caps
... Sage Weil
02:34 PM Linux kernel client Bug #109 (Resolved): kernel bugs out with bad osd caps
e.g.,... Sage Weil
02:24 PM Bug #102: Fix signal handler
Sure, assuming there are in fact safe alternatives to fprintf.
If you do fix this, considering fixing nfs-utils' t...
Sage Weil
12:30 PM Linux kernel client Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
again:... Sage Weil

05/05/2010

09:39 PM Linux kernel client Bug #78: bdi_init list bug
i suspect this was fixed by commit:5dfc589a8467470226feccdc50f1b32713318e7b Sage Weil
09:38 PM Linux kernel client Cleanup #79 (Closed): use bdi setup and register helper
no. i did rename the bdi ceph-%d, though. Sage Weil
09:37 PM Linux kernel client Bug #104 (Resolved): bdi problem on EPERM from osd
Sage Weil
09:36 PM Linux kernel client Bug #104: bdi problem on EPERM from osd
problem was use of invalid wbc in completion. fixed by commit:54ad023ba8108d0163acc931ed4b5e4a8a3a7327 Sage Weil
04:53 PM Linux kernel client Bug #104: bdi problem on EPERM from osd
the problem is when writepages_finish gets a -1 result code. Sage Weil
03:03 PM Linux kernel client Bug #104 (In Progress): bdi problem on EPERM from osd
Sage Weil
02:57 PM Linux kernel client Bug #104: bdi problem on EPERM from osd
ladder0:
mount -a
echo asdf > /c/foo
sync
<crash>
Sage Weil
09:04 PM Revision ec600db8 (ceph): osd: fix critical cap parsing bugs and documentation.
Greg Farnum
09:04 PM Revision 642be312 (ceph): rgw: admin uses --secret instead of --key (as --key is already taken)
Greg Farnum
03:56 PM Linux kernel client Bug #107: lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
hmm, s_cap_lock usage sites look okay... don't think it's a leaked spinlock Sage Weil
03:03 PM Linux kernel client Bug #107 (Resolved): lockup on __cap_is_valid (via aio_write) vs __ceph_remove_cap
on master. rsync workload.
[168476.538425] BUG: soft lockup - CPU#0 stuck for 61s! [kswapd0:318]
[168476.538430] ...
Sage Weil
11:45 AM Linux kernel client Bug #106 (Resolved): msgpool depletion?
which pool is it?
[104608.030333] ceph: msgpool_get ffff88010f1fa370 now 0/0, may fail
[104608.036614] ----------...
Sage Weil
11:40 AM Feature #105 (Resolved): mon: warn on clock drift
>2s clock drift takes down non-master monitors.. bleh. Sage Weil
12:58 AM Bug #98: reserved identifier violation
I suggest to append a kind of "UUID":http://en.wikipedia.org/wiki/Universally_Unique_Identifier as a suffix to make i... Markus Elfring
12:37 AM Bug #99: Check return codes everywhere
Would you like to reuse any class library?
I do not like "assert" for consistent error handling because "the check...
Markus Elfring

05/04/2010

08:57 PM Revision 33b5e426 (ceph): msgr: fix local_pipe peer_type
Need to re-init_local_pipe() after register_entity(). Sage Weil
07:57 PM Revision caa77126 (ceph): Merge branch 'mds_snapids' into unstable
Sage Weil
07:51 PM Revision 89d1b5d0 (ceph): msgr: existing connection may be OPEN on connection race
Yup! Sage Weil
06:56 PM Revision 9bbb545d (ceph): osd: add CMPXATTR defines
Sage Weil
06:33 PM Revision 3d7d7d1f (ceph): osdmaptool: add -h; don't assert on bad filename
Sage Weil
03:46 PM Linux kernel client Bug #104 (Resolved): bdi problem on EPERM from osd
last sysfs file: /sys/class/net/lo/operstate
CPU 1
Modules linked in: ceph [last unloaded: ceph]
Pid: 5724, comm:...
Sage Weil
03:32 PM Bug #99: Check return codes everywhere
For something like strdup, which fails with ENOMEM, we can throw the usual C++ out of memory exception.
In general...
Sage Weil
01:45 PM Bug #99: Check return codes everywhere
I suggest to avoid unchecked function calls.
Would you like to detect every error situation as early as possible?
...
Markus Elfring
10:56 AM Bug #99: Check return codes everywhere
Yeah, there are lots of these. Some hosts seem to spit out nice warnings for lots of call sites where the return val... Sage Weil
02:08 AM Bug #99 (Closed): Check return codes everywhere
This issue corresponds to my previous ""feature request":https://sourceforge.net/tracker/?func=detail&atid=763578&aid... Markus Elfring
02:20 PM Bug #98: reserved identifier violation
No need for the trailing gibberish, I think.. the trailing _H and preceding CEPH_ should be sufficient. Sage Weil
01:38 PM Bug #98: reserved identifier violation
Leading underscores can be avoided for various identifiers.
How do you think about the application of a name patte...
Markus Elfring
10:53 AM Bug #98: reserved identifier violation
Yeah, I think all the headers #ifdef guards can be changed from __FOO_H to CEPH_FOO_H or similar. Patches welcome!
...
Sage Weil
01:43 AM Bug #98 (Resolved): reserved identifier violation
This issue corresponds to my previous ""feature request":https://sourceforge.net/tracker/?func=detail&atid=763578&aid... Markus Elfring
01:21 PM CephFS Bug #52 (In Progress): mds: dentry versionlock
Sage Weil
01:15 PM CephFS Bug #44 (Closed): Make MDS use new connection-based send_message
Greg Farnum
01:06 PM Bug #62: readdir out of sync on different nodes
dup #75 Sage Weil
01:05 PM Bug #62 (Closed): readdir out of sync on different nodes
Sage Weil
01:02 PM Bug #74: make removed_snaps contiguous
commit:1c098c23cdb40513c63a723f26f0d416de696f05 Sage Weil
12:58 PM Bug #74: make removed_snaps contiguous
Sage Weil
12:57 PM Feature #71: msgr: throttle incoming messages
Sage Weil
11:32 AM Linux kernel client Bug #28: gracefully fail on fill_trace errors
this includes ENOMEM on xattr blob Sage Weil
10:57 AM Cleanup #100: #define ? enum?
Sure, these could be changed, but I think it's a pretty low priority item. Sage Weil
02:36 AM Cleanup #100 (Closed): #define ? enum?
This issue corresponds to my previous ""feature request":https://sourceforge.net/tracker/?func=detail&atid=763578&aid... Markus Elfring
04:22 AM Cleanup #103 (Closed): Introduction of namespaces
This issue corresponds to my previous "feature request":https://sourceforge.net/tracker/?func=detail&atid=763578&aid=... Markus Elfring
04:06 AM Revision 32784666 (ceph): ceph.spec.in: run configure
Reported by MarkN in #ceph on irc.oftc.net Sage Weil
04:02 AM Revision 22e9777c (ceph): Merge remote branch 'origin/msgr_throttle' into unstable
Sage Weil
03:34 AM Bug #102 (Closed): Fix signal handler
This issue corresponds to my previous ""feature request":https://sourceforge.net/tracker/?func=detail&atid=763578&aid... Markus Elfring
03:02 AM Feature #101 (Rejected): Conversion of pointer parameters into references
This issue corresponds to my previous "feature request":https://sourceforge.net/tracker/?func=detail&atid=763578&aid=... Markus Elfring

05/03/2010

11:57 PM Revision 8a37ac6f (ceph): rbd: update rbd image header format
Yehuda Sadeh
11:57 PM Revision f28decef (ceph): poolop: do op immediately, instead of waiting for timeout
Yehuda Sadeh
11:57 PM Revision cea221c6 (ceph): MPoolOp: rearrange parsing, for easier kernel implementation
Yehuda Sadeh
11:41 PM Revision 1c098c23 (ceph): osd: add removed snapid to removed_snaps for unmanaged snaps
Sage Weil
11:41 PM Revision 7ec68dcb (ceph): mds: add removal snap_seq to removed_snaps
This prevents lots of fragmentation in removed_snaps interval set. Sage Weil
11:20 PM Cleanup #67: add 'autoscan' items to configure.ac
This issue fits to my previous feature request ""Complete build configuration template":https://sourceforge.net/track... Markus Elfring
10:10 PM Linux kernel client Bug #54 (Resolved): do dentry offset assignment when dentry becomes non-null
Sage Weil
10:09 PM Linux kernel client Bug #54: do dentry offset assignment when dentry becomes non-null
added to unstable. Sage Weil
10:04 PM Revision 14dd8a9a (ceph): mds: acquire anchor_create locks in caller
We're not allowed to call acquire_locks again once we've completed locking.
Do it in the caller. Add a helper that w...
Sage Weil
09:32 PM Feature #71: msgr: throttle incoming messages
Sage Weil
09:24 PM Feature #97 (Resolved): qa: better snap tests
Sage Weil
09:23 PM Feature #96 (Resolved): msgr: close idle connections?
Sage Weil
09:22 PM Feature #95 (Resolved): mon: adjust overload based on osd disk utilization
Sage Weil
09:22 PM Feature #94 (Resolved): mon: monitor some osds
Sage Weil
09:22 PM Feature #93 (Closed): osd: how to tell osd to cleanly shut down, via monitor
Sage Weil
09:21 PM CephFS Bug #92 (Resolved): mds: journal client_map for rename importing inode
Sage Weil
09:20 PM CephFS Feature #91 (Resolved): mds: up:shadow mode
replay client while in standby, so we can take over immediately on failure. Sage Weil
09:19 PM CephFS Bug #90 (New): mds: don't sync log on every clientreplay request
Sage Weil
09:19 PM CephFS Cleanup #89 (Closed): mds: put inode dirty fields in dirty_bits_t to reduce memory footprint
Most inodes in the cache aren't dirty. Sage Weil
09:18 PM CephFS Feature #88 (Resolved): mds: change stray commit strategy to avoid rolling stray dir commits
Once we commit a temporary item to the stray dir, we have to re-commit later (after purge) to make it go away, at whi... Sage Weil
09:17 PM Feature #87 (Closed): osd: separate interface/messenger for backend (osd to osd) network
Sage Weil
09:16 PM CephFS Feature #86 (Resolved): mds: implement fsck
rebuild hierarchy after subset of pg's is lost.
and/or, traverse hierarchy and check for errors.
Sage Weil
09:15 PM RADOS Feature #85 (Resolved): osd: pg_num shrink
Sage Weil
09:15 PM RADOS Feature #84 (In Progress): mon: auto adjust pg_num as pool grows
of course, splitting needs to be more efficient first! Sage Weil
09:14 PM CephFS Feature #83 (New): mds: rename over old files should flush data or revert to old contents?
write to foo.conf.tmp
close
rename foo.conf.tmp to foo.conf
<crash before flushing new file content>
foo.conf n...
Sage Weil
09:12 PM Feature #82 (Resolved): mon: osd failure smarts
We shouldn't necessarily believe every osd failure report we get, especially when there are lots of other nodes monit... Sage Weil
09:11 PM Revision 53a5b4a5 (ceph): mdsmap: cleanup up braces
Sage Weil
09:11 PM Revision 11184b90 (ceph): objecter: fix ~Message() calls (use ->put())
Sage Weil
09:11 PM CephFS Feature #81 (Resolved): mds: do authentication checks
uid/gid vs file modes, etc. Sage Weil
09:09 PM Feature #80 (Resolved): uclient: readdir from cache
Sage Weil
09:02 PM Revision 409a54f1 (ceph): mds: fix lock->sync rdlock rule
Observed symptom was touch foo on one node, and missing from ls on another
node.
Since auth rdlock isn't allowed for...
Sage Weil
09:02 PM Revision 632b24f7 (ceph): mds: fix EXCL rdlock rule
Only the exclusive client should be able to rdlock. (if anyone!)
No observed problems, caught by inspection.
Sage Weil
05:17 PM Revision 291c0ab7 (ceph): filejournal: create if mkfs and journal is small OR dne.
Sage Weil
04:13 PM Linux kernel client Cleanup #79 (Closed): use bdi setup and register helper
See commit:e6d086d83cf7f102d48c006f58172a69ec0c15a4
This will make our /sys/kernel/debug/bdi directory pretty (cep...
Sage Weil
04:10 PM Linux kernel client Bug #78 (Resolved): bdi_init list bug
There were 2 clients mounted, here, so unclear what what was. One was behaving fine.
The other was forcefully unm...
Sage Weil
03:19 PM CephFS Bug #77 (Resolved): mds: anchor_create calls acquire_locks
fixed in commit:14dd8a9a8a653ef0142ce795454ae3e0e920d386 Sage Weil
12:57 PM CephFS Bug #77 (Resolved): mds: anchor_create calls acquire_locks
Locks need to be taken all at once, by caller. Maybe add a helper? Sage Weil
02:03 PM CephFS Bug #75 (Resolved): mds: fix rdlock semantics
fixed by commit:409a54f17515a417f72fb0e1d70733624fe4b609 Sage Weil
01:57 PM CephFS Bug #75: mds: fix rdlock semantics
broken result: rdlock succeedes during lock->sync:
10.05.03 13:56:19.316016 7fb391bb0910 mds0.locker rdlock_start...
Sage Weil
01:49 PM CephFS Bug #75: mds: fix rdlock semantics
rdlock does, actually. The problem was specifically the LOCK->SYNC transition allows rdlock on auth, when it shouldn't. Sage Weil
10:09 AM CephFS Bug #75 (Resolved): mds: fix rdlock semantics
The rdlock does not take which client is doing the locking into account.
Consequently,
clienta$ ls
clientb$ t...
Sage Weil
12:11 PM CephFS Bug #34 (Closed): mds: nonempty cap xlist on snaprealm during trim
Sage Weil
12:11 PM Bug #48 (Closed): mds: unnecessary snaprealm splits on unlink/rename
Sage Weil
12:11 PM Bug #53 (Closed): osd: weird crash in snaptrimmer
Sage Weil
12:11 PM CephFS Bug #51 (Closed): delete directories don't appear in snap
Sage Weil
12:10 PM Linux kernel client Bug #47 (Closed): gfp at ceph_update_snap_trace+0x16a/0x419
Sage Weil
12:10 PM Linux kernel client Bug #38 (Closed): rm -r failure
Sage Weil
12:10 PM Linux kernel client Bug #22 (Closed): BUG at fs/ceph/caps.c:253
Sage Weil
12:10 PM Linux kernel client Bug #4 (Closed): lockdep warning in socket code
Sage Weil
12:10 PM Linux kernel client Bug #3 (Closed): leaked dentry ref on umount
Sage Weil
12:10 PM Linux kernel client Bug #2 (Closed): BUG at fs/ceph/caps.c:2178
Sage Weil
12:09 PM Linux kernel client Bug #1 (Closed): gpf in tcp_sendpage
Sage Weil
12:09 PM Bug #68 (Closed): mds/MDCache.cc:4797: FAILED assert(con)
Sage Weil
12:08 PM Bug #70 (Closed): Interrupting rados tool crashes osd?
Sage Weil
11:13 AM Linux kernel client Bug #65: crash in tcp_sendpage
this is probably a problem with the backport.. it went away when we switch to 2.6.34-rc3 on issdm Sage Weil
11:09 AM Bug #76 (Resolved): osd: snapdir object recovery doesn't work
The snapdir object doesn't end up in the missing list. Sage Weil
03:50 AM Revision b53d0ed6 (ceph): filejournal: return error instead of asserting
Sage Weil

04/30/2010

11:24 PM Revision 60095d9d (ceph): mon: correctly update up map when taking over for creating|starting mds
Otherwise we can get crap like
in <0>
up <{0=4103}>
failed <>
stopped <>
4300: 192.168.141.103:6800/9446 '0' mds0.0 ...
Sage Weil
10:44 PM Revision 3374fbfc (ceph): Merge commit 'v0.20' into unstable
Conflicts:
src/mon/LogMonitor.cc
src/mon/MDSMonitor.cc
src/mon/Monitor.cc
src/mon/OSDMonitor.cc
src/mon/PGMonito...
Sage Weil
10:11 PM Revision 266d2076 (ceph): debian: fix up debian scripts
Sage Weil
09:22 PM Revision f244067a (ceph): ceph.spec.in: from Josef
Sage Weil
09:22 PM Revision b022a285 (ceph): ceph.spec.in: updates
Sage Weil
08:02 PM Revision aebc239e (ceph): osd: fix pg_to_acting_osds() calculation to consider pg_temp
The OSD was using pg_to_up_acting, the client uses pg_to_acting, and their
calculations of 'acting' weren't consisten...
Sage Weil
08:02 PM Revision 974f2eee (ceph): mds: fix trim_dentry on dentry under unlinked dir
We can get a dentry that is trimmable (e.g. null) under a new unlinked dir,
which has no subtree. This will only hap...
Sage Weil
07:58 PM Revision 0b0e1201 (ceph): mds: fix trim_dentry on dentry under unlinked dir
We can get a dentry that is trimmable (e.g. null) under a new unlinked dir,
which has no subtree. This will only hap...
Sage Weil
07:58 PM Revision 17a2dcbd (ceph): objecter: show osds in laggy debug output
Sage Weil
05:46 PM Revision 1b3e4fb0 (ceph): osd: fix pg_to_acting_osds() calculation to consider pg_temp
The OSD was using pg_to_up_acting, the client uses pg_to_acting, and their
calculations of 'acting' weren't consisten...
Sage Weil
05:11 PM Revision 5210d6c3 (ceph): objecter: show osds in laggy debug output
Sage Weil
03:22 PM Bug #72 (Closed): mon session segfault
fixed by commit:84c386ea0d255855791fa84d5ab44c6c3a03945a Sage Weil
03:21 PM Bug #74 (Resolved): make removed_snaps contiguous
removed_snaps [2~1,4~1,6~1,8~1,a~1,c~1,e~1,10~1,12~1,14~1,16~1,18~1,1a~1,1c~1,1e~1,20~1,22~1,24~1,26~1,28~1,2... Sage Weil
01:03 PM Bug #68 (Resolved): mds/MDCache.cc:4797: FAILED assert(con)
Sage Weil
11:30 AM Bug #68: mds/MDCache.cc:4797: FAILED assert(con)
finally found this. commit:dd9bc423a3c95d230772eb3952032f99375973e4 Sage Weil
04:43 AM Revision 0cb8ad43 (ceph): filestore: fail to mount if journal open fails
Clean up error reporting too. Sage Weil
04:43 AM Revision dc84acd6 (ceph): filestore: fail to mount if journal open fails
Clean up error reporting too. Sage Weil
04:12 AM Revision 7ad42023 (ceph): msgr: fix sigint/term signal handler to actually quit (by chaining)
Sage Weil
04:12 AM Revision 3480dc69 (ceph): msgr: fix sigint/term signal handler to actually quit (by chaining)
Sage Weil

04/29/2010

10:05 PM Revision 269d1426 (ceph): librados: fix blank pool stats
Sage Weil
10:04 PM Revision 96a90efc (ceph): librados: fix blank pool stats
Sage Weil
09:30 PM Feature #73 (Rejected): kernel: implement 'flush disk cache' ioctl
Currently the osd journal on a raw block device requires that you disable the disk write cache via hdparm. That's an... Sage Weil
09:29 PM Revision 61b4b295 (ceph): qa: make pjd clean up
Sage Weil
09:29 PM Revision ebe05ae6 (ceph): mds: clean up MIN/MAX, client_ranges
Avoid double evaluation via macros... Sage Weil
09:29 PM Revision d595ad44 (ceph): msgr: remove pid file on SIGINT, SIGTERM
Sage Weil
09:28 PM Revision 3a9439ca (ceph): mds: clean up MIN/MAX, client_ranges
Avoid double evaluation via macros... Sage Weil
09:28 PM Revision 8fcb0cc4 (ceph): qa: make pjd clean up
Sage Weil
09:28 PM Revision 30561d84 (ceph): msgr: remove pid file on SIGINT, SIGTERM
Sage Weil
06:57 PM Revision a31ea46f (ceph): config: post-process default config options too
Previously we were only doing the ones we read out of the .conf. Sage Weil
06:56 PM Revision d1441462 (ceph): config: post-process default config options too
Previously we were only doing the ones we read out of the .conf. Sage Weil
05:43 PM Revision 318dfbeb (ceph): mon: use (robust) buffer::list::write_fd() to write to disk
Sage Weil
05:43 PM Revision 63385d02 (ceph): mon: use (robust) buffer::list::write_fd() to write to disk
Sage Weil
05:27 PM Revision 1f55b175 (ceph): mon: m->get_session() may return null if session has closed
because the session close clears connection->priv. We need to check at
each site anyway, either for null session, or...
Sage Weil
05:22 PM Revision 84c386ea (ceph): mon: m->get_session() may return null if session has closed
because the session close clears connection->priv. We need to check at
each site anyway, either for null session, or...
Sage Weil
 

Also available in: Atom