Project

General

Profile

Activity

From 05/22/2011 to 06/20/2011

06/20/2011

11:41 PM Revision d5ec8b77 (ceph): MonClient: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 4615913c (ceph): hex.cc: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision f6c7343f (ceph): auth: more deglobalization
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 2b22c32b (ceph): OSDMap: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 40532049 (ceph): DoutStreambuf: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision fae8dde3 (ceph): Messenger: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 83020ad0 (ceph): assert: de-globalize
Implement a simple registration system so that assert output can still
show up non-interleaved in daemons and utiliti...
Colin Patrick McCabe
11:41 PM Revision 8147350c (ceph): MDSMap: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 6ff09ef5 (ceph): Move signal, pidfile to global
Move the global parts of signal and pidfile handling into global/
Signed-off-by: Colin McCabe <colin.mccabe@dreamhos...
Colin Patrick McCabe
11:41 PM Revision fc434386 (ceph): Create global_context.cc to house g_ceph_context
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 91065f48 (ceph): auth: AuthSupported: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision b0cb4808 (ceph): KeyRing: derr -> lderr
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision ba611910 (ceph): msg: remove globals
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision 0fc6ef2e (ceph): LogClient: remove accidental g_conf references
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision ba5b7331 (ceph): tcp.cc: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:41 PM Revision dcc64482 (ceph): ProfLogger: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:37 PM Revision 63bbaddd (ceph): move common_init to global_init
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision b44ead76 (ceph): mon/MonMap: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 02c2ee16 (ceph): mon/MonClient: use ldout instead of dout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision c95062c7 (ceph): libclient: remove client/SyntheticClient.cc
client/SyntheticClient.cc is only of interest to csyn. The other users
of libclient don't need it.
Signed-off-by: Co...
Colin Patrick McCabe
11:35 PM Revision c46814db (ceph): Makefile: put LIBGLOBAL LDFLAGS into LIBGLOBAL_LDA
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision da2d4297 (ceph): Create ProfLoggerCollection to hold ProfLoggers
Rather than having global variables, create ProfLoggerCollection to
store the current set of ProfLoggers. This collec...
Colin Patrick McCabe
11:35 PM Revision 17d22a96 (ceph): auth: CryptoKey, CryptoAES: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision ab75f2ab (ceph): auth: KeyRing, RotatingKeyRing: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 76c5f248 (ceph): auth: AuthAuthorizeHandler: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 51ffa7b1 (ceph): auth: CephxKeyServer: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 01038faf (ceph): auth: remove remaining globals
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 5d5a7116 (ceph): common_init: misc globals cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision f82611d7 (ceph): common: de-globalize LogClient, ProfLogger, Timer
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision d224876e (ceph): ThreadPool: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 8069e83c (ceph): common/DecayCounter: deglobalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:35 PM Revision 8a0d4864 (ceph): SimpleMessenger: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:34 PM Revision 7735b35f (ceph): g_clock.now -> ceph_clock_now
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision 9085597b (ceph): run_cmd: ret empty string on success;err otherwise
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision 1dbb0ac8 (ceph): auth: de-globalize TicketManager, TicketHandler
de-globalize CephXTicketManager and CephXTicketHandler. Also
de-globalize two static functions.
Signed-off-by: Colin...
Colin Patrick McCabe
11:22 PM Revision be8fe54a (ceph): auth: de-globalize AuthClientHandler and friends
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision d03c96e5 (ceph): auth: de-globalize AuthServiceHandler and pals
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision d6adc4e7 (ceph): auth: de-globlize CephXAuthorizer
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision 2452ef44 (ceph): common/Clock: add deglobalized clock stuff
Add clock functions which don't use globals. Remove Clock::recent_now.
Signed-off-by: Colin McCabe <colin.mccabe@dre...
Colin Patrick McCabe
11:22 PM Revision c0e804db (ceph): ObjectStore: clean up includes a little bit
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision 07ff69a7 (ceph): include/Completion: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision ff626f0e (ceph): bufferlist: get rid of derr usage
bufferlist doesn't need to use dout/derr. Only one function used it, and
that was only to return an error string.
Si...
Colin Patrick McCabe
11:22 PM Revision bf2c226a (ceph): include/atomic.h: whitespace cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision 03c83799 (ceph): include/atomic cleanup
* Don't allow copying of class atomic_t.
* Remove common/Spinlock.h because it's unecessary
* SimpleMessenger: use at...
Colin Patrick McCabe
11:22 PM Revision 0fa77948 (ceph): common/Finisher: use ldout
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision f2bd39fa (ceph): common/MemoryModel: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision 6c58da04 (ceph): include/Context.h: de-globalize
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:22 PM Revision d80b331f (ceph): lockdep: code cleanup and de-globalization
common/Mutex.cc: this is a duplicate of the code in common/lockdep.cc.
Delete the duplicate code.
common/lockdep.cc:...
Colin Patrick McCabe
11:17 PM Revision 823eb267 (ceph): Thread.cc: clean up
Don't implement everything in the header file. It slows down compilation
and usually runtime.
Signed-off-by: Colin M...
Colin Patrick McCabe
11:06 PM Revision 9678ad3f (ceph): lockdep: add lockdep_register_ceph_context,cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:06 PM Revision ee5502d2 (ceph): Remove cdout
cdout just duplicates the functionality of dout.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
11:06 PM Revision 5a612f95 (ceph): Create de-globalized versions of the dout macro
The de-globalized versions are available in dout.h
Versions relying on globals continue to be available in debug.h
S...
Colin Patrick McCabe
11:05 PM Revision 620d0784 (ceph): signal: const cleanup, don't use derr
In signal.cc: const cleanup, don't use derr, add a comment.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
11:01 PM Revision 128bfcfd (ceph): rgw: put_obj_data doesn't need mtime
Yehuda Sadeh
11:01 PM Revision 1e5cb068 (ceph): rgw: put_obj_data uses aio
This is just a trivial fix, making the etag calculation done while the io
is pending. Still some room for improvement.
Yehuda Sadeh
10:58 PM Revision 10171cad (ceph): filestore: fix fiemap
Yehuda Sadeh
10:58 PM Revision ca2f4e28 (ceph): rbd: fix read_iterate for sparse read
Yehuda Sadeh
09:31 PM Revision bc8cc868 (ceph): Fix bug that thought all >1 node clusters always had core dumps.
Accidentally shared the stdout between all the runs. Tommi Virtanen
09:31 PM Revision e481db13 (ceph): Archive syslog messages while the test was in progress.
Tommi Virtanen
09:06 PM Revision f78de01a (ceph): encoding: add list<shared_ptr<T>> macros
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:06 PM Revision cc644b84 (ceph): mds: avoid EMetaBlob::fullbit copying with shared_ptr
The fullbit subclass has a pointer, making it unsafe to copy around. Fix
by adding a undefined copy constructor and ...
Sage Weil
09:02 PM Revision 96ef8a67 (ceph): debian: ceph-client-tools depends on librados2, librbd
It contains rados and rbd command line tools.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:47 PM Revision 04d10d10 (ceph): qa: tiobench workunit: remove timeout
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:28 PM Revision 29e64df9 (ceph): Merge remote branch 'origin/stable'
Sage Weil
05:09 PM CephFS Bug #1206: NFS reexport file creation lags 1-3 seconds
If I reexport the ceph filesystem via fuse/nfs, there is *no* lag in creating files. Brian Chrisman
05:02 PM CephFS Bug #1206 (Closed): NFS reexport file creation lags 1-3 seconds
I'm attaching the kernel logs and mds logs for the creation of a file called scale-product/testfoo3 via a touch runni... Brian Chrisman
04:38 PM Feature #183: qa: xfstests workunit
I looked at this briefly and it appears surprisingly annoying to do since xfstests normally wants to do the mounting ... Sage Weil
04:37 PM Feature #1205 (Resolved): qa: run teuthology clients in parallel
Sage Weil
04:37 PM Feature #1204 (Resolved): qa: add existing workunits to suite
Sage Weil
04:29 PM CephFS Bug #1199: 4-MDS fsstress: remote ino lookup assert
The problem is that if the backtrace for a given inode changed, the AnchorServer didn't overwrite the old Anchor that... Greg Farnum
03:31 PM CephFS Bug #1199: 4-MDS fsstress: remote ino lookup assert
Okay, this seems to be an anchortable bug. It's getting the last trace that got committed for the inode in question, ... Greg Farnum
04:28 PM CephFS Bug #1200 (Duplicate): 4-MDS fsstress remote ino lookup cycle
Yep, this is the same as #1199. Greg Farnum
04:09 PM Feature #1196 (Resolved): qa: install kernels via teuthology
Josh Durgin
03:15 PM CephFS Bug #1110 (Resolved): mds: ls -l hangs on concurrent writer
Created a separate task for the request prioritiziation on the osd, #1203 Sage Weil
09:22 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Okay, so I found one other issue while testing this patch. So there are two reasons this is slow:
- when we revo...
Sage Weil
08:18 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
> Indeed: It was at epoch 5 which has a "modified" timestamp of 2011-06-15 22:37:48. This matches exactly the time wh... Andre Noll
03:14 PM RADOS Feature #1203 (New): osd: priority or fairness osd operations
The idea is be to prioritize MDS messages in the OSD op queue to minimize latency for metadata writes. See #1110. Sage Weil
03:08 PM Bug #1194: kclient: NFS reexport does not survive ceph fs remount
I was able to reproduce this.
ESTALE shows up in the messages log.
I don't see much in the mds log.
At this poin...
Brian Chrisman
11:26 AM Linux kernel client Bug #1140: balance_dirty_pages makes Fw cap revocation slow
commit:afc2c255f157c1ff6ecff5d9ad7979a226a19e1c is a patch that avoids the balance_dirty_pages issue. That will solv... Sage Weil

06/18/2011

03:17 PM Feature #1202 (Resolved): qa: workunits to set file layout and verify it works.
This'll only work on the kclient, of course; no ioctl for cfuse. Sage Weil
02:18 PM Bug #1201: rbd export doesn't match import
P.S. v0.29.1-4-g93623fb Paul Komkoff
02:17 PM Bug #1201 (Resolved): rbd export doesn't match import
rbd import foo bar output:
rbd import file_pos=0 extent_len=12288
reading 12288 bytes at offset 0
rbd import file_...
Paul Komkoff

06/17/2011

11:20 PM Revision 7335fbbf (ceph): librados: return value on non void function
Yehuda Sadeh
11:18 PM Revision 830f48d2 (ceph): update clitests with new usage
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:17 PM Revision 93623fba (ceph): common: fix descriptinos for -i, -n too
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
11:01 PM Revision 1b9575ea (ceph): common: fix generic usage for -D vs -d
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
11:01 PM Revision 6a32f723 (ceph): Autotests run no ceph software, so don't try to do code coverage there.
Tommi Virtanen
11:00 PM Revision b9c36729 (ceph): rgw: suspend/enable buckets through pool async api
Yehuda Sadeh
11:00 PM Revision e2150a08 (ceph): librados: add async pool op operations
Yehuda Sadeh
11:00 PM Revision 57c542b9 (ceph): Archive cores dumped during test, record test as failed if any seen.
Tommi Virtanen
10:01 PM Revision 6fb971ff (ceph): qa: organize workunits
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:24 PM Revision b2c803db (ceph): secret.c: use safe_read when appropriate
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:59 PM Revision 7b8bf550 (ceph): mds: fix use-after-free on ESession replay
Reported by: Sam Lang <samlang@gmail.com>
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:26 PM Revision 4efdc52a (ceph): rgw: remove stray semicolon
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:26 PM Revision 4ca8054c (ceph): rgw: cast RGW_SUSPENDED_USER_AUID
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:22 PM Revision 9e4adf0d (ceph): qa: make snap-rm-diff workunit clean up
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:22 PM Revision e9f308df (ceph): qa: wget -q
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:21 PM Revision 20e3ac2c (ceph): mds: CDir::log_mark_dirty needs to flush before waiting
We need to flush before waiting or else our completion (and mark_diryt())
will be out of order with respect to other ...
Sage Weil
03:30 PM CephFS Bug #1200: 4-MDS fsstress remote ino lookup cycle
Looks like this is the same root cause as #1199. Greg Farnum
03:19 PM CephFS Bug #1200 (Duplicate): 4-MDS fsstress remote ino lookup cycle
Notice how it's missing an expected ino in a dir that's marked complete, and then it just tries to do the lookup agan... Greg Farnum
03:12 PM CephFS Bug #1199 (Resolved): 4-MDS fsstress: remote ino lookup assert
... Greg Farnum
02:36 PM Subtask #1177 (Resolved): qa: post-process coverage data
Sage Weil
02:34 PM Feature #1119 (Resolved): qa: gcov/lcov html output
Sage Weil
01:45 PM Bug #1152: Mon getting killed by OOM killer
Humm, these heap dumps are never larger than 0.2MB!
Looking at the total memory allocation over the lifetime of th...
Greg Farnum
11:48 AM Bug #1152: Mon getting killed by OOM killer
I hit the bug again it seems. The OOM killer came around and killed my monitor again. I didn't touch the machine yet,... Wido den Hollander
11:36 AM rgw Feature #1198 (Resolved): rgw user suspension
This should include:
- the ability to suspend/reenable users
- the ability to disable access to a suspended user'...
Yehuda Sadeh
11:33 AM rgw Feature #767 (Resolved): rgw: incremental/large file uploads
Done, as of commit:8b4b83842cdf2e707efd6c3aded9abbf1ab352d7 Yehuda Sadeh
11:31 AM Bug #1197 (Resolved): osd: make inconsistent state durable
I'm pretty sure that if we ever get an inconsistent PG, that knowledge needs to be in permanent storage so that we do... Greg Farnum
11:30 AM Feature #900 (Resolved): qa: thrash in/out
Sage Weil
11:29 AM Feature #1196 (Resolved): qa: install kernels via teuthology
Sage Weil
11:12 AM Bug #1192: multi-client stall under heavy write load
I've caught another stall using your patch, but it is slightly different.
Maybe I waited longer, but this one even...
Jim Schutt
10:54 AM Bug #1192: multi-client stall under heavy write load
Any news? Sage Weil
11:04 AM CephFS Bug #1178: cfuse doesn't shut down -- Fw cap doesn't flush on one inode
Will be testing locally, push later. We had to make sure to check the mseq when setting a new auth_cap based on the M... Greg Farnum
09:41 AM CephFS Bug #1195: ceph mds crash on version upgrade
Oh, you have multiple MDSs.. can you dump the same object for whichever one(s) crashed in
13: (SessionMap::decode...
Sage Weil
09:38 AM CephFS Bug #1195: ceph mds crash on version upgrade
Attached log from mds crash with suggested debugging enabled. Sam Lang
09:30 AM CephFS Bug #1195: ceph mds crash on version upgrade
Attached result of above command. Sam Lang
09:25 AM CephFS Bug #1195: ceph mds crash on version upgrade
Can you 'rados -p metadata get mds0_sessionmap /tmp/mds0_sessionmap' and attach? I'm curious what is in the object t... Sage Weil
09:14 AM CephFS Bug #1195: ceph mds crash on version upgrade
Hmm, Ceph *should* be upgradable to newer versions. These backtraces don't look familiar though and i don't see anyth... Greg Farnum
08:03 AM CephFS Bug #1195 (Resolved): ceph mds crash on version upgrade
This may just not be something that's handled in Ceph yet, but while trying to upgrade from stable v0.28 to v0.29, I ... Sam Lang
08:59 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Sage Weil wrote:
> Can you confirm that it hung when the fs was full? Because that's normal (and unrelated to this ...
Andre Noll
03:02 AM Revision ad5ff230 (ceph): mount.ceph: specify full path for /sbin/modprobe
Fixes: #1188
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:01 AM Revision f5f563dd (ceph): radosgw_admin: fix cli test
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:44 AM Revision ca4e2877 (ceph): Merge branch 'stable'
Sage Weil
02:44 AM Revision 3a59579e (ceph): mds: fix transitions away from XSYN state
If we are in XSYN, we need to transition first to EXCL before moving on.
Simply going to the other unstable state isn...
Sage Weil
02:44 AM Revision 1c5f6550 (ceph): mds: when prodding head inode on snapped behalf, avoid XSYN
If we try to rdlock a snapped inode in snap->sync state, we need to prod
the head inode into SYNC to get the data flu...
Sage Weil
02:44 AM Revision 447ea1dc (ceph): osd: fix snap_trimmer obc ref
As of fc4cc39 new_repop claims the obc reference (from the clonerange
branch) but this caller didn't get fixed.
Sign...
Sage Weil
02:44 AM Revision bc1782a1 (ceph): osd: fix find_object_context debug output
Erroneously said get_object_context, confusing developers everywhere.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil

06/16/2011

11:53 PM Revision 9974b7e5 (ceph): rgw: user suspension
Yehuda Sadeh
11:51 PM Revision 2738ce01 (ceph): Better examples for kclient task.
Tommi Virtanen
11:11 PM Revision 0b28b96e (ceph): Remove done and abandoned TODO entries.
Tommi Virtanen
11:09 PM Revision 4f46e4b7 (ceph): Make cfuse and kclient default to all clients.
Tommi Virtanen
11:09 PM Revision 5b237fc5 (ceph): Support running multiple autotest tests.
Docs had this from the start, it was just buggy. Still
doesn't support using the "autotest" task more than once,
no e...
Tommi Virtanen
11:09 PM Revision b49c7390 (ceph): Archive autotest result output.
Tommi Virtanen
10:19 PM Revision 7fe89b79 (ceph): add clock task to sync clocks
Sage Weil
10:19 PM Revision 55cfd929 (ceph): fix typo in workunits example
Sage Weil
10:19 PM Revision cebf88d9 (ceph): show role in log prefix when running workunits
Sage Weil
10:11 PM Bug #1150: kclient: ERESTARTSYS from flock/fcntl locks
On 3.0.0-rc3 kernel (master branch) the same trouble. Fyodor Ustinov
09:36 PM Revision 78a3c234 (ceph): Move non-ceph logic out of the ceph task: base dir, archive transfer.
Tommi Virtanen
09:36 PM Revision 259eb350 (ceph): Refactor ceph mds daemon handling into a separate subtask.
Tommi Virtanen
09:36 PM Revision 445186b0 (ceph): Refactor ceph healthy waiting into a separate subtask.
Note, not a context manager, just a task. There's no cleanup. Tommi Virtanen
09:36 PM Revision 9353fcc9 (ceph): Move summary inside context.
This will hopefully make run_tasks usable by tasks.
The ceph task can't use it yet, because it has its
yield inside t...
Tommi Virtanen
09:36 PM Revision 594a3d3f (ceph): Refactor ceph log handling into a separate subtask.
Tommi Virtanen
09:36 PM Revision 629ad447 (ceph): Move /tmp/cephtest/data creation to where it belongs, in ceph.cluster.
Tommi Virtanen
09:36 PM Revision 301ab567 (ceph): Move non-ceph logic out of the ceph task: host in use check.
To avoid every config always listing basic tasks, we silently
add internal.* tasks in front of the task list.
Tommi Virtanen
09:20 PM Revision c35770c4 (ceph): Merge branch 'rgw-multipart'
Yehuda Sadeh
09:19 PM Revision abfa872f (ceph): Merge remote-tracking branch 'origin/master' into rgw-multipart
Conflicts:
src/os/FileStore.cc
src/rgw/rgw_admin.cc
src/rgw/rgw_fs.h
src/rgw/rgw_log.cc
src/rgw/rgw_rados.cc
sr...
Yehuda Sadeh
09:07 PM Revision 3aa6a4d6 (ceph): qa: pjd must run as root
Sage Weil
08:26 PM Revision 8b4b8384 (ceph): rgw: get multipart list marker to work
Yehuda Sadeh
08:10 PM Revision 77d38e42 (ceph): v0.29.1
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:34 PM Revision 3fc00e37 (ceph): Refactor ceph mon daemon handling into a separate subtask.
Tommi Virtanen
06:34 PM Revision 2da2d258 (ceph): Refactor ceph osd daemon handling into a separate subtask.
Tommi Virtanen
06:30 PM Revision 062ddf25 (ceph): Merge remote branch 'origin/mds_xsyn'
Sage Weil
06:17 PM Revision a379c67d (ceph): rgw: some multipart cleanups, fixes
Yehuda Sadeh
06:04 PM Revision 574cfe98 (ceph): Remove the "ceph mds set_max_mds" call, apparently it's not needed.
We can always re-add it if and when it's useful. Tommi Virtanen
05:39 PM Revision 9746e774 (ceph): Allow running of any function in a task.
Josh Durgin
05:39 PM Revision 14ddb419 (ceph): Start splitting the ceph task into smaller units.
This lets them nicely have independent error handling;
an example case where that is desirable is that the
old code f...
Tommi Virtanen
05:39 PM Revision 915a366a (ceph): Refactor ceph cluster setup into a separate subtask.
Tommi Virtanen
05:39 PM Revision eaba08fb (ceph): Add debug logging to contextutil.nested.
Tommi Virtanen
04:51 PM Bug #1194: kclient: NFS reexport does not survive ceph fs remount
I reproduced the problem several times before submitting this bug but can't reproduce it now.
I'm going to leave thi...
Brian Chrisman
02:23 PM Bug #1194 (Resolved): kclient: NFS reexport does not survive ceph fs remount
NFS doesn't survive restarts/remounts
Reproduce:
create new ceph fs
export via NFS
(on NFS client) nfs mount
(on...
Brian Chrisman
03:27 PM CephFS Bug #1178: cfuse doesn't shut down -- Fw cap doesn't flush on one inode
I'm pretty sure the client is getting the right import/export messages, there's just something going wrong in the twi... Greg Farnum
02:56 PM CephFS Bug #1178: cfuse doesn't shut down -- Fw cap doesn't flush on one inode
oh, interesting. in that case it won't get an IMPORT event because it wasn't exported.. and the client doesn't find ... Sage Weil
02:48 PM CephFS Bug #1178: cfuse doesn't shut down -- Fw cap doesn't flush on one inode
Oh, so it's an inode where the auth got changed, and it only has the cap for the new auth but it doesn't think it is ... Greg Farnum
01:48 PM CephFS Bug #1178 (In Progress): cfuse doesn't shut down -- Fw cap doesn't flush on one inode
Looking at this now. Greg Farnum
02:11 PM CephFS Bug #1114: NFS export extreme slowdown
I think I've figured out the more general symptom.
File creation via NFS-ontop-of-ceph takes 1-3 seconds per file.
...
Brian Chrisman
01:05 PM Bug #1192: multi-client stall under heavy write load
I'm compiling it now. Jim Schutt
12:28 PM Bug #1192: multi-client stall under heavy write load
Can you try branch osd_pg_active? The request you mention gets kept around on the waiting_for_active list when it sh... Sage Weil
08:11 AM Bug #1192: multi-client stall under heavy write load
I tried to upload osd.32.log.bz2, but got this:
Internal error
An error occurred on the page you were trying to...
Jim Schutt
08:06 AM Bug #1192 (Resolved): multi-client stall under heavy write load
I've been having trouble with Linux client stalls under heavy
write load. I think I've got the signature isolated. ...
Jim Schutt
11:29 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Can you confirm that it hung when the fs was full? Because that's normal (and unrelated to this issue); if the FS is... Sage Weil
01:42 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
> > Otherwise, there's nothing there. If the situation is reproducible, the best would be an mds log generated with ... Andre Noll
10:47 AM Bug #1152: Mon getting killed by OOM killer
Unfortunately, it looks like the heap dumps we have are from after it grew too large and they don't capture the growt... Greg Farnum
10:36 AM Bug #1152 (In Progress): Mon getting killed by OOM killer
Greg Farnum
09:39 AM CephFS Bug #1193 (Resolved): cmds crashes with stacktrace when using --reset-journal and an invalid rank
Here are the two stack traces I noticed that are different.... Damien Churchill
09:16 AM Bug #1188 (Resolved): modprobe not found by mount.ceph
Looks like this is done! Greg Farnum
08:36 AM Bug #998 (Resolved): qemu/librbd race conditon
Fixed by commit:ea3db17697b0d6fadf9bb26af0ed0b61773d530a and commit:515f0884d4f55efc21365ccbd94e1e5c22788051. Josh Durgin
08:17 AM Bug #1191: FAILED assert(!missing.is_missing(soid))
... ar Fred
08:02 AM Bug #1191 (Can't reproduce): FAILED assert(!missing.is_missing(soid))
In 30 seconds, that error happened on 4 out of 8 OSDs. This is with today's stable branch.
That happend after some...
ar Fred
07:56 AM CephFS Bug #1170: 2-MDS fsstress: SR fails subtree assert
I'm seeing this with the 0.29 packages, using Ubuntu Natty and Linux 3.0rc3 built from git. Only some of the crashed ... Damien Churchill
01:08 AM Revision 1dd25b47 (ceph): Add task for mounting with the kernel client.
Josh Durgin
01:06 AM Revision 9d70d048 (ceph): Add function to output a secret key to a file.
Josh Durgin
12:43 AM Revision 5e1768ee (ceph): Merge remote branch 'origin/stable'
Josh Durgin
12:32 AM Revision 515f0884 (ceph): librbd: fix block_completion race condition
If block completions finished before all of them were scheduled, the
pending_count could reach 0, and the callback co...
Josh Durgin
12:06 AM Revision e9e3feea (ceph): rgw: implement list multiparts
doesn't work completely, getting there Yehuda Sadeh

06/15/2011

10:52 PM Revision 9d603447 (ceph): Output a summary file when archiving a run.
For now, just record whether the test succeeded. Josh Durgin
10:22 PM Revision 98a80719 (ceph): Remove unused imports and variable.
Josh Durgin
09:43 PM Revision 2d63264c (ceph): librbd: add AioCompletion debugging
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
09:24 PM Revision 4a7a42b5 (ceph): uclient: path_walk should notice when it gets absolute paths.
Previously it assumed everything was relative, but we need to handle
absolute paths on occasion, and already have the...
Greg Farnum
08:06 PM CephFS Bug #1158: Unfinished freeze hangs fsstress
Unfortunately, adjust_nested_auth_pins never sees the person who actually grabbed it. The others print out the grabbi... Greg Farnum
06:51 PM CephFS Bug #1158: Unfinished freeze hangs fsstress
If you can reproduce, you can enable the auth pin set define in mdstypes.h, which tracks who the pinners are.
//#d...
Sage Weil
06:04 PM CephFS Bug #1158: Unfinished freeze hangs fsstress
Well, it's a nested auth pin.... Greg Farnum
07:32 PM Revision 1a20ba11 (ceph): Tee teuthology log into the archive directory, if archiving.
Tommi Virtanen
07:11 PM Revision e90e0411 (ceph): vstart: put .journal outside of osd data dir
This prevents it from getting cleaned up during mkfs. Then we can make it
a symlink, or at the very least avoid dele...
Sage Weil
07:10 PM Revision 2bae22f0 (ceph): Create the archive dir earlier, move out of ceph task.
Tommi Virtanen
06:59 PM Revision faa855dd (ceph): Add task for running ceph workunits, QA tests that expect cwd to be cep...
Tommi Virtanen
06:51 PM Revision e00b7fc6 (ceph): Clean up after an autotest run.
Tommi Virtanen
06:50 PM Revision cd8f31db (ceph): Write config.yaml directly to file.
Tommi Virtanen
06:49 PM Revision d157e323 (ceph): Move imports to the top.
Tommi Virtanen
06:27 PM Revision af9879de (ceph): qa: Make snaptest-git-ceph.sh clean up after itself.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
05:16 PM Revision 1aa29320 (ceph): rgw: fix abort multipart upload
Yehuda Sadeh
04:38 PM Bug #998: qemu/librbd race conditon
Can't get kvm to crash anymore during my latest tests. Two days before, with the same amount of testing, I would have... ar Fred
02:32 PM Bug #998: qemu/librbd race conditon
I updated the wip-librbd-race branch again - I managed to confuse myself into thinking the block completion patch was... Josh Durgin
01:06 PM Bug #998: qemu/librbd race conditon
also at 2510a0a31e3cc742f7d7564887fd93da5203967f... ar Fred
12:50 PM Bug #998: qemu/librbd race conditon
a crash I could reproduce with 2510a0a31e3cc742f7d7564887fd93da5203967f... ar Fred
01:38 AM Bug #998: qemu/librbd race conditon
Thank you for the new patch.
I still managed to crash 3 of my VMs, each with a different stacktrace:...
ar Fred
04:32 PM CephFS Bug #1114: NFS export extreme slowdown
Here's the time sequence with each x-axis increment being 20 seconds.
Plotted separately to avoid scaling axes and a...
Brian Chrisman
04:16 PM CephFS Bug #1114: NFS export extreme slowdown
Here's a 20-second interval time evolution of /sys/kernel/debug/ceph/*/bdi/stats
I can go ahead and graph these..
Brian Chrisman
02:41 PM Feature #1189 (Resolved): qa: independent powerc and console from ceph network
Sage Weil
12:43 PM Feature #1189 (Resolved): qa: independent powerc and console from ceph network
Sage Weil
01:09 PM Bug #1186: Cluster won't recover, OSD's go up and down again (and stay down)
Failed an assert in ReplicaActive receiving a query of type other than Query::Missing. ( assert(query.query.type == ... Samuel Just
12:58 PM Feature #1190 (Resolved): codingstyle document
http://google-styleguide.googlecode.com/svn/trunk/cppguide.xml ? Sage Weil
12:31 PM Bug #1152: Mon getting killed by OOM killer
I installed both ceph and ceph-dbg on the machine, I build the debs with dpkg-buildpackage and install those.
The ...
Wido den Hollander
09:54 AM Bug #1152: Mon getting killed by OOM killer
There are no symbols.. can you run a cmon that's build with -g (not from a stripped .deb)? Sage Weil
06:54 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Sage Weil wrote:
> I do see this:
> 2011-06-10 11:41:04.325925 7f931b987700 mds0.objecter FULL, paused modify 0x7f...
Andre Noll
06:29 AM Bug #1150: kclient: ERESTARTSYS from flock/fcntl locks
Ok, I will write later about the test results. Fyodor Ustinov
12:53 AM Revision c49f9471 (ceph): Archive teuthology config.
Josh Durgin

06/14/2011

11:34 PM Revision 27f79fc8 (ceph): auth: remove CephCryptoManager
It doesn't do anything.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
10:55 PM Revision ea3db176 (ceph): librbd: fix AioCompletion race condition
If block completions finished before all of them where scheduled, the
pending_count would reach 0, and the AioComplet...
Josh Durgin
09:47 PM Revision eced4d6b (ceph): monmaptool: implement --help
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:43 PM Revision 41d9335d (ceph): test/bufferlist: don't alloc large array on stack
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:23 PM Bug #1188: modprobe not found by mount.ceph
Does the mount.ceph is the modprobe branch work okay? It just specifies /sbin/modprobe.. which is hopefully always t... Sage Weil
04:31 PM Bug #1188 (Resolved): modprobe not found by mount.ceph
# mount cephsrv:/ /mount/point
sh: modprobe: command not found
mount.ceph: modprobe failed, exit status 127
modp...
Alexandre Oliva
08:58 PM Revision 8c97e6c6 (ceph): .gitignore man/Makefile
blech
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
07:53 PM Revision 954e0966 (ceph): qa: do not use automake for workunit makefiles
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:35 PM Revision 40f5ab96 (ceph): qa/workunits/direct_io: fit .gitignore
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:27 PM Revision 51b3651b (ceph): radosgw_admin: fix clitest help
Sage Weil
03:38 PM Bug #998: qemu/librbd race conditon
I pushed a simpler (and I think correct) fix to the wip-librbd-race branch. Does this fix the problem for you, ar Fred? Josh Durgin
10:49 AM Bug #998: qemu/librbd race conditon
I'm having a lot of similar crashes these days, they usually happen during or soon after the startup of the VM.
Al...
ar Fred
03:16 PM Bug #1186: Cluster won't recover, OSD's go up and down again (and stay down)
The monitor debugging also seems to have been a problem. Turning that down and restarting the machines has allowed 3... Samuel Just
01:33 PM Bug #1186: Cluster won't recover, OSD's go up and down again (and stay down)
atom2 with cosd daemons killed:
procs -----------memory---------- ---swap-- -----io---- -system-- ----cpu----
r b...
Samuel Just
10:05 AM Bug #1186: Cluster won't recover, OSD's go up and down again (and stay down)
A bit more information:
for i in {0..9}; do ssh root@atom$i 'uptime'; done
19:01:32 up 7:42, 0 users, load a...
Samuel Just
05:09 AM Bug #1186 (Closed): Cluster won't recover, OSD's go up and down again (and stay down)
Ok, the title might be somewhat confusing, but so is the issue :)
I'm still trying to get my 40 OSD cluster back i...
Wido den Hollander
01:21 PM Bug #1187 (Can't reproduce): OSD: OSDMap::decode
During #1186 I did notice one OSD crash, the backtrace gave me:... Wido den Hollander
12:25 PM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Oh! Sorry, I didn't scroll down. :)
I do see this:
2011-06-10 11:41:04.325925 7f931b987700 mds0.objecter FULL,...
Sage Weil
10:08 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Sage Weil wrote:
> Oh, that log is from the 8th, and doesn't include the fix I pushed on the 9th fixing that particu...
Andre Noll
09:39 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Oh, that log is from the 8th, and doesn't include the fix I pushed on the 9th fixing that particular bug. Did you at... Sage Weil
01:41 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Sage Weil wrote:
> Andre, any chance you can pass along more information about the hang you were seeing? Log, or th...
Andre Noll
05:22 AM Revision d2b7e291 (ceph): crbdnamer: man page
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:19 AM Revision e6451525 (ceph): crbdnamer: +x
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:14 AM Revision cca3da1f (ceph): qa: radosgw_admin: fix clitest
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:11 AM Revision 66229c7a (ceph): qa: direct_io: fix warnings
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:56 AM Revision 71543246 (ceph): configure: check for boost statechart header
Partially fixes: #1124
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:01 AM Revision 883991a0 (ceph): added thrashosds
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
12:00 AM Revision 398a3338 (ceph): adding radosbench.py to tasks
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just

06/13/2011

11:48 PM Revision e304fd09 (ceph): radosgw_admin: log show can dump info in xml, json
Yehuda Sadeh
11:25 PM Revision 86844671 (ceph): Merge branch 'wip-workunits'
Sage Weil
11:24 PM Revision b91c2c99 (ceph): librados: use empty(), not size()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:23 PM Revision 8ce98b22 (ceph): qa: test_sync_io: zero buffers prior to read
Be extra paranoid to ensure we read back the correct data and didn't
just find it in undefined memory.
Signed-off-by...
Sage Weil
11:23 PM Revision efe1aa71 (ceph): radosgw_admin: pool create command
Yehuda Sadeh
11:23 PM Revision 28e1a894 (ceph): radosgw_admin: fix log show (with new poolid param)
Yehuda Sadeh
11:18 PM Revision d89c9a13 (ceph): qa: test_sync_io improvements
Exit (without removing foo) on error.
Allow read- or write-only operation.
Signed-off-by: Sage Weil <sage@newdream....
Sage Weil
11:03 PM Bug #1150: kclient: ERESTARTSYS from flock/fcntl locks
which version of the kernel client are you using?
Can you try the latest for-linus branch of ceph-client.git? T...
Sage Weil
10:59 PM Feature #1174 (Resolved): crbdnamer man page
Sage Weil
10:52 PM Linux kernel client Bug #1148 (Resolved): o_direct crash in msgr
Sage Weil
10:37 PM Bug #1124 (Resolved): Depend on new enough Boost
the version dependency in the .spec file is nontrivial (something about epochs?) and i'm too lazy to figure it out pr... Sage Weil
10:30 PM Revision bc8e1209 (ceph): rgw: log pool id, and store/retrieve pool id info
Yehuda Sadeh
10:30 PM Revision 34098a39 (ceph): librados: expose pool id
Conflicts:
src/librados.cc
Yehuda Sadeh
10:21 PM CephFS Bug #1137 (Can't reproduce): MDS Crash
If this turns up again, let us know! I suspect it may be related to the rename journaling changes; I'll be testing f... Sage Weil
10:19 PM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Andre, any chance you can pass along more information about the hang you were seeing? Log, or the specific workload? Sage Weil
09:17 PM Revision 1e96f71f (ceph): qa: fix test_shorT_dio_read
No args, hardcode filename.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:57 PM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
It is working great. I haven't see this problem after applying the patches. Henry Chang
09:40 AM CephFS Bug #1084 (Resolved): blogbench won't finish: waiting for Fr cap forever
I can't reproduce this on my mds_issue_caps, so I've merged it into master, commit:0f8fbd562cc58351541ac4f60a1dc98bdd... Sage Weil
08:36 PM Revision f9e9490d (ceph): PG: clear scrub_received_maps in scrub_clear_state
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
08:31 PM Revision 9d1e8971 (ceph): qa: fix loopall, runallonce scripts
Run anything executable.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:29 PM Revision d5bba79e (ceph): qa: move compiled workunits into organized directory
No need for a separate qa/src/ dir; just put them where we want them. We
need to ignore non-executable files when do...
Sage Weil
08:22 PM Revision 953bb605 (ceph): qa: move compiled items into qa/src/
Sage Weil
08:22 PM Revision f2aef3e1 (ceph): qa: organize workunits a bit
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:22 PM Revision adc98c0e (ceph): qa: expand test_sync_io
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:22 PM Revision 8147076d (ceph): qa: add direct and sync io workunits
Just run the test programs in ../src
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:22 PM Revision 7087b2da (ceph): qa: include $basedir/src/ (qa/src/) in path
This lets the workunits find stuff build in qa/src/, in case you are
running using these (old) scripts.
Signed-off-b...
Sage Weil
08:22 PM Revision 8a12a01c (ceph): qa: add henry's direct io tests
Not sure what the second one is testing, actually, but including it anyway.
Signed-off-by: Sage Weil <sage.weil@drea...
Sage Weil
08:22 PM Revision d851deea (ceph): qa: add short directio read test
This tests for the bug fixed by linux commit:c3cd628.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:02 PM Revision f3617e51 (ceph): osd: fix head_exists fix
Needs to default to false, or else deletions don't work.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:05 PM Revision 056a0c5a (ceph): osd: don't update snapset head_exists unless full transaction succeeds
Make the update at the end of make_writeable, along with the other snapset
updates.
Signed-off-by: Sage Weil <sage@n...
Sage Weil
07:04 PM Revision 11ad7658 (ceph): osd: fix make_writeable
Look at the ObjectState to determine if the head exists, not the snapset
(which doesn't reflect the prior object stat...
Sage Weil
05:08 PM rgw Feature #1180 (Resolved): radosgw_admin log show dumps info in alternate formats
Done, commit:e304fd0970492e38fd95ec71d00a221b6ec460c9. Can dump info in xml, json. Yehuda Sadeh
10:05 AM rgw Feature #1180 (Resolved): radosgw_admin log show dumps info in alternate formats
In order to be able to add additional information without breaking compatibility. Yehuda Sadeh
05:07 PM rgw Feature #1179 (Resolved): rgw log should be able to show bucket owner
That turned to be problematic, as we don't look at the bucket owner on all operations and we don't want to do that as... Yehuda Sadeh
09:45 AM rgw Feature #1179 (Resolved): rgw log should be able to show bucket owner
Currently the log shows the object owner on the relevant operations. Yehuda Sadeh
05:04 PM Revision 35663dee (ceph): monc: protect msg source check with monc_lock
Push the locking up into ms_dispatch, so that we protect the source check
and don't race with session reopens. Also ...
Sage Weil
04:43 PM Revision 0f8fbd56 (ceph): Merge branch 'mds_issue_caps'
Sage Weil
04:20 PM Feature #1147 (Resolved): qa: tests for O_DIRECT, sync
Sage Weil
02:09 PM Bug #1185 (Can't reproduce): rados: export caught in loop on 'buck' bucket (1.5M objects)
dumped an object list, watched strace, and periodically checked the current file/object name against the list, and it... Sage Weil
01:13 PM Bug #1184: osd: snapset, pg info stats needs to be projected like obs
also ObjectState& obs -> new_obs, so it's clear in the code this is the new, projected change.
each of these conve...
Sage Weil
01:03 PM Bug #1184 (Resolved): osd: snapset, pg info stats needs to be projected like obs
We need a new_snapset to go along with new_obs in OpContext. And all the transaction code needs to be updated to mat... Sage Weil
12:00 PM CephFS Bug #1183 (Resolved): mds: snaptest-snap-rm-cmp.sh failing
osd regression, fixed by commit:056a0c5a008fef0924b17c7e24c85cf73b3b390e Sage Weil
11:03 AM CephFS Bug #1183 (Resolved): mds: snaptest-snap-rm-cmp.sh failing
this used to work :( Sage Weil
10:32 AM Tasks #1182 (Resolved): qa: lock machines
Teuthology (or a layer above it) should lock machines while tests are executed. Josh Durgin
10:13 AM Bug #1138 (Resolved): need to package rados.py in the debian .deb
Sage Weil
10:07 AM CephFS Bug #1181 (Can't reproduce): mds: old_inodes crash
... Sage Weil
10:04 AM Bug #1151 (Resolved): OSD: CephxClientHandler::handle_response
Fixed locking in mon_client, commit:35663dee2f7128df642594d0ebdf16640fe1f29a Sage Weil

06/12/2011

05:10 PM CephFS Bug #1178 (Resolved): cfuse doesn't shut down -- Fw cap doesn't flush on one inode
Logs in kai:~gregf/logs/fsstress/hung_client2. This is on my mds_rename branch with 4 MDSes in fsstress. I suspect it... Greg Farnum
04:26 PM CephFS Bug #1158: Unfinished freeze hangs fsstress
I managed to reproduce this on my mds_rename branch. Greg Farnum

06/11/2011

12:37 AM Revision 83a55f4d (ceph): Merge remote branch 'origin/stable'
Conflicts:
src/dumpjournal.cc
Sage Weil
12:35 AM Revision 78012fde (ceph): Merge remote branch 'origin/deb-pybind'
Sage Weil
12:10 AM Revision aa5c2244 (ceph): debian: Move python-ceph into section python.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
12:10 AM Revision efb82828 (ceph): debian: Strip and do -dbg magic for librgw too.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
12:09 AM Revision 061460b7 (ceph): debian: Fix copy-pasto in librgw1 control paragraph.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen

06/10/2011

11:58 PM Revision 6fc8819b (ceph): debian: Depend on python-ceph in obsync.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
11:58 PM Revision 822e7770 (ceph): debian: Depend on librgw1 in python-ceph.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
11:43 PM Revision 762c87e0 (ceph): debian: Package librgw.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
11:15 PM Revision ab9dda87 (ceph): pybind: Open shared libs by their major version.
The *.so files are only in the -dev packages, and normal
operation should not require those. The major version
number...
Tommi Virtanen
11:15 PM Revision d444d5b8 (ceph): debian: Make git ignore obsync, gceph* package dirs.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
11:15 PM Revision ddf0263a (ceph): debian: Prevent git ignore from recursing on *.substvars etc.
Should never have triggered but this is faster simpler
and correcter.
Signed-off-by: Tommi Virtanen <tommi.virtanen@...
Tommi Virtanen
11:15 PM Revision 0a0493de (ceph): debian: Make git ignore all of the *.debhelper files.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
11:15 PM Revision 5cf62c1c (ceph): debian: Dev packages no longer have soname in their names.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
11:15 PM Revision 0da30c35 (ceph): debian: Properly package the python bindings.
Build-depend on python-support. Add binary package
python-ceph, making it contain all the ceph python
packages, regar...
Tommi Virtanen
10:02 PM Revision edc6659b (ceph): rgw: list bucket display the correct objects owners
Yehuda Sadeh
09:55 PM Revision 3e284a72 (ceph): boto_tool: add put_bucket_acl
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:42 PM Revision 5cbe10c3 (ceph): added testsnaps
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
08:24 PM Revision 7fa8b973 (ceph): rgw: don't allow handing out object ownership
Yehuda Sadeh
07:26 PM Revision 52e9e5ec (ceph): heap_profiler: if log_dir is empty, don't try and log to root dir!
If log_dir was empty then the prefix would be of the form /mon.a
That isn't good, so fill it in so it writes to the c...
Greg Farnum
06:55 PM Revision 98a3b546 (ceph): librados: get reference to the io context for the pending async ops
Yehuda Sadeh
06:55 PM Revision b2826035 (ceph): librados: remove useless reference holding
Yehuda Sadeh
06:31 PM Revision f41773be (ceph): coverage.sh: use .lcov instead of .info to avoid confusion
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:28 PM Revision 08d0d0ae (ceph): Merge remote branch 'origin/wip_coverage'
Conflicts:
src/mon/Monitor.cc
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
06:24 PM Revision d09ae0b2 (ceph): With coverage, use SIGTERM to make the daemons exit(0).
Josh Durgin
06:17 PM Revision e52bb0cf (ceph): Put all coverage data in the archive dir.
Josh Durgin
06:12 PM Revision 41e19f5e (ceph): Initialize BASE before using it.
Josh Durgin
05:23 PM Revision a6afb051 (ceph): rgw: remove required indexes when modifying user info
Conflicts:
src/rgw/rgw_admin.cc
Yehuda Sadeh
05:21 PM Subtask #1177 (Resolved): qa: post-process coverage data
Combine from multiple machines per test, and from multiple tests per commit. Josh Durgin
05:17 PM Tasks #1176 (Closed): qa: html test results
Generate a page with a table of commit id, tests passed/run, and links to coverage data (aggregate and per-test). Josh Durgin
05:14 PM Tasks #1175 (Resolved): qa: multi-test runner
This will be a cron job that picks the latest valid master at e.g. midnight and runs a suite of tests via teuthology. Josh Durgin
05:13 PM Feature #1174 (Resolved): crbdnamer man page
this needs to go in the librbd1 package Sage Weil
05:12 PM Revision 9a705c74 (ceph): mon: Follow the Message rules when forwarding tells
My initial commit was naughty, fix that up here by
creating a new message for each mon and just copying
the command.
...
Greg Farnum
05:11 PM Feature #1173 (Rejected): gceph: man page
Sage Weil
05:10 PM Feature #1172 (Resolved): obsync: man page
Sage Weil
05:07 PM Revision 842f3ac6 (ceph): rgw: remove required indexes when modifying user info
Yehuda Sadeh
04:25 PM Revision c2de9e6a (ceph): mds: clear correct state bit in do_file_recover
The RECOVERING bit is set, not NEEDSRECOVER. This way we match
_recovered().
Signed-off-by: Sage Weil <sage@newdrea...
Sage Weil
03:07 PM rgw Bug #1171 (Resolved): rgw list bucket displays wrong object owner
Fixed, commit:6f02b4220e9ef925787772132c01328ccf4d07ae. Yehuda Sadeh
02:51 PM rgw Bug #1171 (Resolved): rgw list bucket displays wrong object owner
It just dumps the id of the current user. Yehuda Sadeh
02:30 PM CephFS Bug #1169: 2-MDS fsstress: Active fails adjust_subtree_auth
Wait, this wasn't on mds_rename. It was on my wip-sanity branch. Greg Farnum
02:27 PM CephFS Bug #1169 (Closed): 2-MDS fsstress: Active fails adjust_subtree_auth
This is on mds_rename branch... Greg Farnum
02:30 PM CephFS Bug #1170: 2-MDS fsstress: SR fails subtree assert
Wait, this wasn't on mds_rename. It was on my wip-sanity branch. Greg Farnum
02:28 PM CephFS Bug #1170 (Closed): 2-MDS fsstress: SR fails subtree assert
This is on mds_rename branch.... Greg Farnum
01:33 PM rgw Bug #894 (Resolved): openstack pseudo folders
I'm closing this one. Pseudo folders were implemented, the only thing that was missing is testing it using the older ... Yehuda Sadeh
01:31 PM rgw Bug #1115 (Resolved): rgw allows users to "give away" s3 objects
Fixed, commit:859462298170520d53a167c1da214e378ae2e78a. Yehuda Sadeh
01:31 PM Bug #1152: Mon getting killed by OOM killer
Attached are the heap dumps from the mon process.
The log is a big, 1.4GB, so I didn't upload it. It can be found ...
Wido den Hollander
11:42 AM Bug #1152: Mon getting killed by OOM killer
It'll dump a summary in the clog and try to dump the analysis data into a file named something like osd.1.0001.heap. ... Greg Farnum
10:22 AM Bug #1152: Mon getting killed by OOM killer
I've started my mon with memory profiling enabled, but it will take some hours before it starts eating memory.
At ...
Wido den Hollander
01:11 PM CephFS Bug #1168 (Closed): fsstress 2-mds Standby-replay adjust_subtree asserts
... Greg Farnum
11:19 AM CephFS Bug #1167 (Resolved): Fsstress: bad rstat assert
Got a bad rstat assert in fsstress. This is with 4 MDSes. Logs in kai:~gregf/logs/fsstress/bad_rstats... Greg Farnum
11:11 AM CephFS Bug #1166: mds: standby-replay bound asserts
Logs in kai:~gregf/logs/fsstress/standby_bound_asserts Greg Farnum
10:55 AM CephFS Bug #1166 (Resolved): mds: standby-replay bound asserts
Got two of them on a 4-mds run of ffsstress:... Greg Farnum
11:10 AM CephFS Bug #1114: NFS export extreme slowdown
I've verified this problem continues with 2.6.39 kernel with today's (5/26/11) build from ceph-client git repo (maste... Brian Chrisman
10:23 AM rgw Bug #1162 (Resolved): rgw_admin user modify --email
Fixed with commit:3e8da6c71de01330867ad16b8e19b903f94bb906. Yehuda Sadeh
10:21 AM Bug #1151: OSD: CephxClientHandler::handle_response
The full OSD log is a bit large to attach, so I've left it on my server at the office.
You can access it by SSH'in...
Wido den Hollander
10:11 AM Tasks #1165 (Closed): snap collection clone removal
On recovery, new pg members won't have seen the transactions removing old clones from snap collections. When they pr... Samuel Just
09:54 AM CephFS Bug #1110 (In Progress): mds: ls -l hangs on concurrent writer
Andre Noll wrote:
> Andre Noll wrote:
> > Sage Weil wrote:
> > > I pushed commit:5922de29e7414be3aaa6eb04be8bb0ae5...
Sage Weil
03:50 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Andre Noll wrote:
> Sage Weil wrote:
> > I pushed commit:5922de29e7414be3aaa6eb04be8bb0ae5c943ccf, which should fix...
Andre Noll
09:26 AM Feature #1154 (Resolved): Make tcmalloc profiler commands work with monitor
Done, and properly tested this time. Greg Farnum
05:15 AM Revision 35beabf6 (ceph): rgw: remove clutter
Yehuda Sadeh
05:05 AM Revision c863f522 (ceph): rgw: fix user anonymous permissions
Yehuda Sadeh
01:32 AM Revision dadf2a3f (ceph): Thread: remove globals. Thread create must succeed
Remove the references to global variables from Thread.h. They are really
unecessary. In every case, the printout is f...
Colin Patrick McCabe
01:04 AM Revision 4e0ddae4 (ceph): LogClient: pass in CephContext explicitly
Again, the Messenger passed to LogClient may be NULL, so pass in the
CephContext explicitly.
Signed-off-by: Colin Mc...
Colin Patrick McCabe
12:56 AM Revision 03aef0b6 (ceph): Monitor: pass in CephContext explicitly
Sometimes we create a Monitor without a Messenger. So we can't pull the
CephContext out of the Messenger, because it ...
Colin Patrick McCabe
12:43 AM Revision 2b5e0111 (ceph): mds, mon, osd: with coverage enabled, exit(0) when SIGTERM is received
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
12:43 AM Revision befe4226 (ceph): Revert "mon: add all_exit and exit commands"
This reverts commit 69092735c4b25c88ded58b17b1c198fafcf5931b. Josh Durgin
12:42 AM Revision cf129ff7 (ceph): Revert "mds: allow mds to 'exit immediately'"
This reverts commit c01961ff4b712a12e25e437adceae264273d7428. Josh Durgin
12:42 AM Revision dd9ea9c6 (ceph): Revert "osd: add command to exit cleanly"
This reverts commit 1cc78b6a864d36cbe1eda45890d7f361adb8201d. Josh Durgin
12:22 AM Revision 3a582f99 (ceph): mon: add new "tell" infrastructure.
This lets you issue commands
like "ceph mon tell 0 heap start_profiler". Hurray!
Signed-off-by: Greg Farnum <gregory...
Greg Farnum
12:22 AM Revision b6e5c080 (ceph): mds: fix up MDCache::path_is_mine to remove a totally bogus assert.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
12:17 AM Revision 43f37b8d (ceph): Add coverage support.
Josh Durgin
12:17 AM Revision 7c5eed2f (ceph): Check daemon exit codes, even when using gcov.
Tommi Virtanen
12:17 AM Revision c45488c2 (ceph): Show status message before gcov-style shutdown of daemons.
Tommi Virtanen
12:17 AM Revision 18b11739 (ceph): Add TODO notes.
Tommi Virtanen
12:17 AM Revision 2533edde (ceph): Save ceph sha1 in coverage dir.
This is needed to postprocess the coverage data. Josh Durgin
12:17 AM Revision ae77af08 (ceph): Simplify ceph sha1 saving. No need to round-trip, store N copies.
Tommi Virtanen
12:17 AM Revision 798f3cc6 (ceph): Can now run specific ceph version based on sha1.
Tommi Virtanen
12:16 AM Revision f162f07e (ceph): Support ceph tarball "favors", that is different gitbuilders.
Use that for picking the gcov gitbuilder, when told to record code
coverage. Actual coverage support comes in the nex...
Tommi Virtanen
12:12 AM Revision f3757f76 (ceph): Let ceph task config specify branch or tag to run.
Tommi Virtanen
12:10 AM Revision 4903d06a (ceph): Archive everything written to the "archive" directory.
This avoids listing "log", "profiling-logger", etc individually.
Adding new types of archived files should just be a ...
Tommi Virtanen
12:03 AM Revision 4c798752 (ceph): rgw: can specify alternative remote addr http header param
Yehuda Sadeh

06/09/2011

11:40 PM Revision cf5e70d4 (ceph): rgw: skeleton for list multipart uploads
Yehuda Sadeh
11:21 PM Revision 2c4367f9 (ceph): CephContext: initialize module_type in ctor
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:14 PM Revision 5b304bdd (ceph): mds: make pneed_issue required for eval(SimpleLock*)
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision 87ba0435 (ceph): mds: fix xlock_finish issue flag check
We were sometimes setting do_issue but not *pneed_issue. Simplify by
setting do_issue internally to the function and...
Sage Weil
11:14 PM Revision 7c0ae247 (ceph): mds: fix eval_gather callers passing pissue_gather as wrong arg
Whoops!
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:14 PM Revision 96f01b03 (ceph): mds: fix xlock_finish do_issue checks
Should default to false, and only get set to true if there are caps for
this lock. Among other things this means we ...
Sage Weil
11:14 PM Revision 20d11b0b (ceph): mds: be slightly more careful about checking if lock has caps
issued_caps_need_gather==true implies it, but we can skip the work if the
lock has no caps.
Signed-off-by: Sage Weil...
Sage Weil
11:14 PM Revision 2a45bcad (ceph): mds: issue caps from drop_locks
In drop_locks, build a set of inodes we need to issue caps on. Then do it
all at once. This does two things:
- it...
Sage Weil
11:14 PM Revision 88b22463 (ceph): mds: make issue_caps from file_update_finish smarter
We do one funky thing in file_update_finish that only issues caps on a
single cap when max_size changes. This is mor...
Sage Weil
11:14 PM Revision 3e52a33d (ceph): mds: set or issue caps on lock state changes
Set pneed_issue (or issue ourselves) whenever we jump directly to the
target lock state. Make sure we only do it if ...
Sage Weil
11:14 PM Revision f62f8deb (ceph): mds: efficiently drop_rdlocks and drop_xlocks too
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision cc398ccf (ceph): mds: efficiently issue_caps for scatterlocks unlock during refragment
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision ffeb59d5 (ceph): mds: explicitly issue_caps for slave xlock release
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision a340421a (ceph): mds: explicitly issue_caps for rdlock_finish_set
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision bc69d4e0 (ceph): mds: efficiently issue_caps for OOO or extra locks in acquire_locks
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision b06fa1ce (ceph): mds: make pneed_issue required for *lock_finish()
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision bdd94ef1 (ceph): mds: explicitly issue_caps after file_recover
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:14 PM Revision c7863911 (ceph): mds: pass pissue_caps through *lock_finish()
This allows *lock_finish() callers to handle the issue_caps themselves.
None of them do yet (this arg is still option...
Sage Weil
11:09 PM Revision 6d4b93dd (ceph): CephContext: remove silly debug code
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:08 PM Revision e6349ec2 (ceph): Dispatcher should take a CephContext argument
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
10:33 PM Revision c1ea3a26 (ceph): mds: replay: save the right dir!
We were using the wrong one and breaking replay.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
10:33 PM Revision 58250fdd (ceph): mdcache: correct error message.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
10:33 PM Revision 379dc6fa (ceph): MDiscoverReply: encode/decode wanted_ino.
This wasn't encoded previously so it was always zero on the
receiver's end, which led to nasty hangs. :(
Signed-off-...
Greg Farnum
10:33 PM Revision 63073ab8 (ceph): mds: xlock_finish should only do_issue in certain cases.
We accidentally (we think) initialized this variable to true when
we want it to be false: we should only do_issue if ...
Greg Farnum
10:33 PM Revision 4ef41c5a (ceph): mds: fix tcmalloc check.
We can run commands to do things like start the profiler even if
it's not already running!
Signed-off-by: Greg Farnu...
Greg Farnum
10:32 PM Feature #1154 (In Progress): Make tcmalloc profiler commands work with monitor
I missed some when setting up the "tell *" option, need to look at it a bit more. Initial fixes failed and something ... Greg Farnum
04:37 PM Feature #1154 (Resolved): Make tcmalloc profiler commands work with monitor
Pushed to master in commit:3a582f994998fa925a6f2b619326bfc24616306b Greg Farnum
11:12 AM Feature #1154 (Resolved): Make tcmalloc profiler commands work with monitor
This will require setting up a tell interface or similar for the monitors. Whee! Greg Farnum
10:15 PM rgw Bug #1163 (Resolved): anonymous access to objects broken
Fixed, commit:c863f52245372820ab04814a4957963fb3467032. Yehuda Sadeh
05:10 PM rgw Bug #1163 (Resolved): anonymous access to objects broken
After setting bucket and/or object as public-read, can't access them without being authenticated first. Yehuda Sadeh
09:34 PM Revision f69bc301 (ceph): debian: move gceph into a separate package
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:03 PM Revision 0b40f480 (ceph): tools/common: fix C_ObserverRefresh constructor
Fixes #1159
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
08:57 PM Revision 3308aa67 (ceph): rgw: implement abort multipart upload
(not tested yet) Yehuda Sadeh
08:25 PM Revision 802e9e5a (ceph): rgw: implement namespace, use it in complete multipart
Yehuda Sadeh
06:42 PM Subtask #1164 (Resolved): initialize g_ceph_context in common_preinit
It will be a major milestone when we can initialize g_ceph_context in common_preinit. Basically, in order to do this,... Colin McCabe
06:38 PM Subtask #844 (Resolved): g_conf should become a pointer
implemented Colin McCabe
06:31 PM Revision 01df7c66 (ceph): ReplicatedPG: make_writeable, use correct size for clone_size entry
Previously, we used obs.oi.size for the clone size, but obs refers to
new_obs. The clone's size should be the old si...
Samuel Just
06:22 PM Revision 1376a5ab (ceph): gtest: add -lpthread to AM_LDFLAGS
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:18 PM Revision e8abcf0e (ceph): readme: need libevent-dev
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:10 PM Revision b2becefa (ceph): man: update cosd man page to include info on flush-journal option.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:36 PM Revision d224b3e2 (ceph): Makefile.am: Add -lpthread to UNITTEST_LDADD
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:09 PM rgw Feature #1161 (Resolved): configurable client ip header variable
Yehuda Sadeh
05:09 PM rgw Feature #1161: configurable client ip header variable
Fixed, commit:4c798752c6d2805e74464588cae1d67da69e8a4a.
The RGW_REMOTE_ADDR_PARAM can be set now in the configurat...
Yehuda Sadeh
04:06 PM rgw Feature #1161 (Resolved): configurable client ip header variable
When going through a load balancer that rewrites the header, the client ip address (that we want to write into the lo... Yehuda Sadeh
05:02 PM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
The mds_issue_caps branch has my current work on cleaning the issue_caps stuff up. I'm pretty happy with what the MD... Sage Weil
04:38 PM Bug #1152: Mon getting killed by OOM killer
Pushed it, let me know if it doesn't work for you. Greg Farnum
11:14 AM Bug #1152: Mon getting killed by OOM killer
I've created #1154 to make those commands work and hope to get it done today, but if you're feeling dedicated you sho... Greg Farnum
10:29 AM Bug #1152 (Closed): Mon getting killed by OOM killer
I've been seeing this for the last few weeks, my single mon keeps eating more and more memory until it reaches 4G Res... Wido den Hollander
04:30 PM rgw Bug #1162 (Resolved): rgw_admin user modify --email
We need to make sure that when modifying email address, the old email object under .users.email is removed. Yehuda Sadeh
04:11 PM Revision 5922de29 (ceph): mds: only target XSYN state if we have a valid target loner
If we are in EXCL and really shouldn't be, no need to go to XSYN; go to
SYNC instead. This keeps the assert
asser...
Sage Weil
04:09 PM CephFS Bug #1158: Unfinished freeze hangs fsstress
Although actually based on how long fsstress is taking on this disk maybe nothing was blocked and it was just going s... Greg Farnum
01:21 PM CephFS Bug #1158 (Can't reproduce): Unfinished freeze hangs fsstress
I've got a freeze that doesn't finish blocking fsstress. Logs in kai:~gregf/logs/fsstress/freeze_not_finishing.
Ha...
Greg Farnum
04:02 PM Revision 7a5201e4 (ceph): mkcephfs: fix ceph.conf reference
It's $conf (which is $dir/conf).
Reported-by: biyan chen <riby.chen@gmail.com>
Signed-off-by: Sage Weil <sage@newdre...
Sage Weil
02:52 PM Subtask #1160 (Resolved): introduce CephContext to some structures in common/
Some data structures in common like Thread and Context need to use CephContext to do their logging and configuration.... Colin McCabe
02:46 PM Cleanup #1149 (Resolved): debian: separate gceph into a separate package
Sage Weil
02:24 PM Bug #1145 (Won't Fix): monitor assert fails due to ENOSPC
Hi Sam-
This is done deliberately so that the system doesn't continue thinking it wrote something when it didn't. ...
Sage Weil
02:09 PM Bug #1159 (Resolved): ceph -w segfault
fixed by commit:0b40f48073649c74f8f29148cdc84b997a1666e8 Colin McCabe
01:49 PM Bug #1159: ceph -w segfault
This is something with CephToolCtx, giving to Colin. Greg Farnum
01:28 PM Bug #1159 (Resolved): ceph -w segfault
On latest master with a few extra MDS patches, I got a segfault. Core and binary in kai:~gregf/logs/ceph_core Greg Farnum
12:41 PM CephFS Bug #1153 (Resolved): Hang on 3-MDS fsstress
Okay, this one was hanging because the wanted_ino member of MDiscoverReply wasn't encoded and decoded. Duh. Greg Farnum
11:10 AM CephFS Bug #1153 (Resolved): Hang on 3-MDS fsstress
While testing our rename changes I managed to get a hang in a 3-MDS setup of fsstress. I'm looking through the logs n... Greg Farnum
11:27 AM Bug #1144: snap_trimmer
wip_snap_trimmer has the current incarnation of the changes Samuel Just
11:25 AM Tasks #1157 (Closed): create teuthology tests for updated snap_trimmer
Samuel Just
11:24 AM Tasks #1156 (Closed): assert failure in _scrub, assert(p->second.size == snapset.clone_size[*curc...
In make_write, we set the clone size to the new object size obs.oi.size rather than the cloned size ctx->obc->obs.oi.... Samuel Just
11:15 AM Tasks #1155 (Resolved): Replica does not remove snap collections
Caused by replica not updating snap_collections.
Have fix in wip_snap_trimmer, won't push until the rest of the br...
Samuel Just
10:34 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Sage Weil wrote:
> I pushed commit:5922de29e7414be3aaa6eb04be8bb0ae5c943ccf, which should fix that crash. Can you r...
Andre Noll
09:08 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
I pushed commit:5922de29e7414be3aaa6eb04be8bb0ae5c943ccf, which should fix that crash. Can you retest?
Still work...
Sage Weil
05:01 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
First of all, sorry for the delay, I haven been distracted by other work.
Sage Weil wrote:
> Okay, there was an i...
Andre Noll
09:56 AM Bug #1151: OSD: CephxClientHandler::handle_response
Also, tell us more about this cmon memory thing! :) Sage Weil
09:01 AM Bug #1151: OSD: CephxClientHandler::handle_response
Can you attach the full osd log? It looks like there are two threads doing the auth handshake. Sage Weil
03:06 AM Bug #1151 (Resolved): OSD: CephxClientHandler::handle_response
I just saw a bunch of OSD's going down with:... Wido den Hollander

06/08/2011

11:35 PM Revision 6f6ac75d (ceph): Makefile: remove ancient comment
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:23 PM Revision 9202a4b0 (ceph): ceph tools: de-globalize CephToolContext
De-globalize CephToolContext. It's important to do this now because the
constructor for CephToolContext references th...
Colin Patrick McCabe
10:57 PM Revision 92ee0de8 (ceph): mds: drop workaround for embedded null decoding
We removed the embedded null check recently, 9b37f4f, so this workaround
isn't needed.
Signed-off-by: Sage Weil <sag...
Sage Weil
09:55 PM Revision 6069fa6a (ceph): mon: weaken pool creation caps check
Allow a pool creation if we specify an auid but are allowed to create
buckets as anybody.
Signed-off-by: Sage Weil <...
Sage Weil
09:54 PM Revision 80c039cd (ceph): rgw_admin: user info dumps auid
Yehuda Sadeh
09:53 PM Revision 825cec3c (ceph): moncaps: whitespace
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:50 PM Revision 4e58308d (ceph): mds: rename: remove illicit assert.
We actually do want witnesses who aren't auth for anything
to do journaling in some cases, so kill the assert.
That a...
Greg Farnum
09:13 PM Revision 40262471 (ceph): rgw_admin: can reset auid
Yehuda Sadeh
09:03 PM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
I just tested it with the latest ceph stable branch (0.29+) and kclient master branch (backported for 2.6.32 by me). ... Henry Chang
09:18 AM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
Looked at this some more yesterday. Turns out the problem seems to be that the client closes the inode, and all caps ... Greg Farnum
09:00 PM Revision 8cd949f6 (ceph): Merge remote branch 'origin/mds_rename'
Sage Weil
08:29 PM Revision 5b43419c (ceph): mds: try_trim_non_auth_subtree if we rename a dir away from a non-auth ...
It's possible we have non-auth metadata only because we have a subtree
nested beneath. If we rename a directory out o...
Sage Weil
08:27 PM Revision 072e80ed (ceph): mds: use CDIR_AUTH_UNDEF where possible
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:20 PM Revision e19d639f (ceph): Gevent 0.14 switches to libev, avoid it for now.
This was in requirements.txt earlier, but using this library
from elsewhere does not respect that, only setup.py.
Tommi Virtanen
08:18 PM Revision 811dcae7 (ceph): mds: remove unlinked metadata from cache on replay
If we replay a metablob that unlinks something, throw it out immediately.
Recursively. This comes up when:
- we ren...
Sage Weil
08:14 PM Revision 23302815 (ceph): rgw: set locator on all required rados calls
Yehuda Sadeh
08:10 PM Revision 5cf6d0e6 (ceph): rgw: put locator key in rgw_obj
Yehuda Sadeh
07:32 PM Revision 8bd984d9 (ceph): rgw: refactor rgw backend interface
Yehuda Sadeh
07:30 PM Revision f3688b71 (ceph): Fix build errors in unit tests
libcommon uses symbols from the crypto libraries, so they must appear on
the link line whenever libcommon appears. L...
Colin Patrick McCabe
07:05 PM Revision 24701308 (ceph): monitorstore: return error codes for consistency
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:27 PM Revision cfbcfd7c (ceph): Fix typo in usage output for --num-osds
This patch fixes the typo in the usage description to match the actual argument string.
Signed-off-by: Sam Lang <sam...
Sam Lang
06:25 PM Revision 5d56dfb7 (ceph): Fix segfault caused by invalid argument string.
This patchset includes minor fixes to the crushtool utility. If an invalid bucket type is speicifed on the command l... Sam Lang
06:22 PM Revision a1a71471 (ceph): qa: test witness subtree adjustment
This exercises the mds at least; we don't have a way to verify mds
behavior currently.
Signed-off-by: Sage Weil <sag...
Sage Weil
05:34 PM Bug #1150 (Duplicate): kclient: ERESTARTSYS from flock/fcntl locks
After upgrade from 0.28.2 to 0.29 amanda backup no longer working if amanda files placed on ceph fs and ceph fs mount... Fyodor Ustinov
02:12 PM CephFS Bug #1146 (Resolved): mds: correctly trim cache after rename witness replay
Sage Weil
11:25 AM Cleanup #1149 (Resolved): debian: separate gceph into a separate package
Sage Weil
07:25 AM Bug #1032: osd: Marked down and become zombies after killing
I'm just upgrading my cluster, I see a Zombie from May 26th, running v0.28.1 and 2.6.38.2.
I'll see if I can repro...
Wido den Hollander
06:48 AM Linux kernel client Bug #1148: o_direct crash in msgr
Hi Sage,
I refactored stripe_read a bit and fixed the calculation of page count in ceph_osdc_new_request.
See:
...
Henry Chang
05:09 AM Revision e2c808ae (ceph): qa: test_sync_io
Currently only tests the read path for O_DIRECT and sync (if the ioctl is
in place). Also currently crashes the kcli...
Sage Weil
03:49 AM Revision 21795aaf (ceph): qa: add dir rename tests
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:48 AM Revision 00ec86a2 (ceph): mds: open renamed import child frags during journal replay
Open up any child frags of the imported renamed inode that are noted in
the journal event. (Note we blindly open up ...
Sage Weil
03:46 AM Revision 44d19304 (ceph): mds: journal open srci frags on srci import (master)
If we are importing the renamed inode, and it is a directory, journal a
list of all open dirfrags (currently, this is...
Sage Weil
03:43 AM Revision a7c083ec (ceph): mds: journal renames on witnesses if we have nested subtrees
If a rename witness has any subtrees that are nested beneath the renamed
directory, we need to journal the rename eve...
Sage Weil
03:40 AM Revision 5c870c60 (ceph): mds: CDir::contains() should use projected hierarchy
The only current callers are in resolve stage, when nothing is projected.
Signed-off-by: Sage Weil <sage.weil@dreamh...
Sage Weil

06/07/2011

11:14 PM Revision 1b0fdcae (ceph): re-fix ignore return value change
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
10:45 PM Linux kernel client Bug #1148 (Resolved): o_direct crash in msgr
test_sync_io makes us crash on... Sage Weil
10:44 PM Feature #1147: qa: tests for O_DIRECT, sync
beginnings of test with commit:e2c808aea97ced6e9a55b143116b66d344f72c0b Sage Weil
10:04 PM Feature #1147 (Resolved): qa: tests for O_DIRECT, sync
Test correct results with varying buffer alignment, file offset, and either sync or O_DIRECT io.
write with buffer...
Sage Weil
10:13 PM Revision 3a2ce92b (ceph): mds: fix check for no-op rename of two links to the same inode
touch a
ln a b
mv a b
is a no-op, according to posix. Move the check earlier.
Signed-off-by: Sage Weil <sage.weil...
Sage Weil
09:51 PM Revision fcc73e0d (ceph): Class loading no longer needs a temp dir.
Tommi Virtanen
09:48 PM Revision 047669e9 (ceph): Cleanup is done, adjust TODO.
Tommi Virtanen
09:47 PM Revision e6ae6bdd (ceph): Archive logs if given --archive=PATH. Clean up after a test run.
Tommi Virtanen
09:41 PM Revision 37cbd7ab (ceph): Support PIPE for stdout and stderr, too.
Tommi Virtanen
09:41 PM Revision f225db7b (ceph): rgw: remove parts after multi-part upload
Yehuda Sadeh
09:27 PM Revision a7012f28 (ceph): cfuse: really ignore write() return value
Hopefully this will make gcc stop complaining. The void cast no longer
seems to silence unchecked return code errors,...
Colin Patrick McCabe
09:13 PM Revision 37fd3b58 (ceph): rgw: calc etag for multipart upload
Yehuda Sadeh
09:08 PM CephFS Bug #1128 (Resolved): clustered mds: failed verify_subtree_bounds
closing this, opening specific bugs for remaining issues Sage Weil
09:07 PM CephFS Bug #1146 (Resolved): mds: correctly trim cache after rename witness replay
Sage Weil
08:57 PM Revision dfa4aeb8 (ceph): Merge remote branch 'origin/wip_gcov'
Josh Durgin
08:34 PM Revision 2969161b (ceph): qa: add rename dir checks
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:29 PM Revision 1f2740a0 (ceph): Makefile.am: install gcno files
This is needed so that coverage data can be generated from the
gitbuilder tarballs.
Signed-off-by: Josh Durgin <josh...
Josh Durgin
08:29 PM Revision 0eda7195 (ceph): Makefile.am: add check-coverage to get coverage of 'make check'
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
08:29 PM Revision b32f00eb (ceph): coverage: add ceph-coverage to run something with the right GCOV_PREFIX...
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
08:28 PM Revision e4f05415 (ceph): Makefile.am: clean gcno and gcda files in "make clean"
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
08:27 PM Revision 2cb0c110 (ceph): qa: rename tests
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:24 PM Revision 3e0352d3 (ceph): mds: rename: add missing pop_and_dirty_projected_inode calls
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:24 PM Revision 569baab6 (ceph): mds: rename: use temp *in in _rename_apply
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:24 PM Revision bdfcc0df (ceph): mds: rename: move unlink out of every possible branch
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:58 PM Revision ce10d4f5 (ceph): Add missing trailing commas.
Lists of strings where last item has no comma are just
traps waiting to hurt you when you add a new item at
the end.
Tommi Virtanen
07:05 PM Revision 12bb3081 (ceph): test/bufferlist: add copy_all test
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
07:04 PM Revision 1cc78b6a (ceph): osd: add command to exit cleanly
This is required for gcov to work on daemons since the coverage data
is written atexit, and the function that writes ...
Josh Durgin
07:04 PM Revision 347dba4e (ceph): mon: ceph tell mds * is a valid command
Previously this fell through and returned -EINVAL to the user.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
07:04 PM Revision c01961ff (ceph): mds: allow mds to 'exit immediately'
This is temporary until shutting down cleans up well.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
07:04 PM Revision 69092735 (ceph): mon: add all_exit and exit commands
all_exit makes each daemon exit(0), for gcov data collection.
exit causes cmon to do this.
Signed-off-by: Josh Durgi...
Josh Durgin
07:04 PM Revision eb097649 (ceph): coverage: add helper script to get coverage for a local test
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
07:02 PM Revision 24195b9c (ceph): configure: add option for building with gcov coverage support
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:58 PM Revision 7b9fa3c3 (ceph): test/bufferlist.cc: retab for consistency
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:56 PM Revision d9d3c78e (ceph): cfuse.cc: fix warning
We genuinely don't care about the return code we get back from this call
to write(2). So cast the result to void to t...
Colin Patrick McCabe
06:40 PM Revision ea9c4fc2 (ceph): bufferlist::iterator: add copy_all, use in KeyRing
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:25 PM Revision 6b2bf234 (ceph): mds: rename: fix oldin pop
- make the pop match position with the project in prepare
- don't pop on linkmerge, since we don't project in that ca...
Sage Weil
06:24 PM Revision bf7d7fa9 (ceph): mds: rename: some cleanup
- add oldin temp
- make if conditions match between _prepare and _apply
no functional changes.
Signed-off-by: Sage ...
Sage Weil
06:23 PM Revision b794aeb3 (ceph): client: fix rename request target selection
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:11 PM Revision 084a1c74 (ceph): client: fix mds selection for unlink
It was broken when a remote link target was on another mds.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
06:08 PM Revision a2288574 (ceph): client: make debug prints for choose_target_mds more helpful
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:07 PM Revision 5e679d3a (ceph): client: allow caps to be dropped on another inode for requests
We need the ability to drop caps on another inode that isn't req->inode
or req->old_inode in the request struct.
Sig...
Sage Weil
05:59 PM Revision 0fb79737 (ceph): mds: fix unlink new snaprealm check
The bool was flipped.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
05:50 PM Revision dbe41a30 (ceph): client: fix rename debug print
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:38 PM Revision 83e49cd8 (ceph): g_conf: change to pointer
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:23 PM Revision e61f5c5c (ceph): simple_spin: fix compiler warnings
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:17 PM Revision 776d3814 (ceph): init: set cct->module_type in common_preinit
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:04 PM Revision e36a4b26 (ceph): mds: rename: fix handling of target that is remote dentry
If the target is a remote dentry, we need to consider that the destdn
and desti may have different auths.
Signed-off...
Sage Weil
04:58 PM Revision a5662293 (ceph): client: fix mds routing for link requests
We want to send link requests to the auth for the new name, not the
target inode. We also want to drop FILE_SHARED c...
Sage Weil
04:49 PM Revision 7770e1c6 (ceph): mds: rename: only add target to stray if destdn is a primary
Only add target into the stray dir if we are renaming over a primary
dentry. (Otherwise we aren't moving the target....
Sage Weil
04:41 PM Revision df7f8954 (ceph): mds: fix/clean up xlock import/export
- create xlock import/export helpers
- fix/simplify checks: we want to export/import only xlocks on the inode
that is...
Sage Weil
08:40 AM Bug #1145 (Won't Fix): monitor assert fails due to ENOSPC
I put the monitor storage on a boot disk that was already 100% full, which is a user error on my part, but I'm not su... Sam Lang
08:36 AM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
The real question here is why the earlier callers of issue_caps think that the client (a loner) can't have the Fr cap... Greg Farnum

06/06/2011

10:38 PM Revision c8ecd129 (ceph): Remove mountpoint after shutting down cfuse.
Tommi Virtanen
10:21 PM rgw Subtask #1125 (Resolved): osd: support for merging/cloning several objects into one final object
Sage Weil
10:16 PM Revision 3c6a7e19 (ceph): gitbuilder: quiet
Yehuda Sadeh
10:12 PM Revision a988083c (ceph): Add orchestra.cluster, for running commands on multiple hosts.
Tommi Virtanen
10:10 PM Revision d11f471d (ceph): rgw: silence gitbuilder (hopefully)
Yehuda Sadeh
09:32 PM Revision 26d720d3 (ceph): Add TODO note about results gathering.
Tommi Virtanen
09:22 PM Revision 2b237882 (ceph): First draft of documentation.
Tommi Virtanen
09:22 PM Revision f292fb7c (ceph): radosgw_admin: link bucket to user
Yehuda Sadeh
08:30 PM Revision b5011e21 (ceph): mds: adjust subtree roots on rename
If we replay a dir rename operation, we need to adjust the subtree map
accordingly.
This covers the case where the m...
Sage Weil
08:06 PM Revision 10750f80 (ceph): common: add module_type to CephContext; use to initialize rotating keys
This fixes rotating keys.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
07:36 PM Revision ad81fccd (ceph): Use orchestra repo on ceph.newdream.net, now that it exists.
Tommi Virtanen
07:36 PM Revision b28ba774 (ceph): osd: use osd_op.soid in call
Yehuda Sadeh
07:27 PM Revision 5cc146ea (ceph): osd: allow src_oids to be snapped
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:46 PM Revision f9af9ced (ceph): remove dumpjournal
This code has suffered a great deal of bitrot and is superceded
by the cmds --dump-journal functionality anyway, whic...
Greg Farnum
06:09 PM Revision 8e69c39f (ceph): v0.29
Sage Weil
05:58 PM Revision 73257a0d (ceph): Merge remote branch 'origin/wip_librbd_locking'
Sage Weil
05:09 PM Revision 0baa108b (ceph): osd: src oid is in OSDOp now
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:57 PM Revision 4e95ac5a (ceph): Merge remote branch 'origin/stable' into next
Sage Weil
04:56 PM Revision 6f21e41a (ceph): Merge remote branch 'origin/filestore_debugging' into next
Sage Weil
04:54 PM Revision 5a86126f (ceph): osd: don't crash on malformed clone_range (now really)
Yehuda Sadeh
04:46 PM Revision ea8917ea (ceph): osd: don't crash on invalid encoded clone_range
Yehuda Sadeh
04:42 PM Revision 4d214b1e (ceph): objecter: set src oid on clone_range properly
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:25 PM Revision e2e60147 (ceph): monclient: use default mon port if not specified in conf
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:25 PM Revision ea56a558 (ceph): monmaptool: use default mon port if not specified
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:08 PM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
Having looked at this some more, I'm pretty sure it's a different issue than Sage though.
The eval family of funct...
Greg Farnum
11:22 AM Bug #1144 (Resolved): snap_trimmer
Currently, there is a race on the replica between processing the repops removing objects from a snap collection and p... Samuel Just
11:00 AM Bug #1142 (Resolved): dumpjournal crashes without dumping the journal
Resolved via removal in the next branch. Use cmds --dump-journal instead! Greg Farnum
09:55 AM Bug #1127 (Resolved): RBD got silent after 1 month
merged by commit:6f21e41a9182c5d587dedf53e6c84c889dd7ac08 Sage Weil
09:22 AM Bug #1143 (Resolved): mon addr without port breaks a new setup
Fixed by commit:e2e60147ecf089a46e7a1c6836da1bc926a249ee and commit:ea56a558cef4cebf37a923ce3e5eb8c753d94464. thanks! Sage Weil
04:35 AM Bug #1032: osd: Marked down and become zombies after killing
I've been away for some time, so haven't got a chance to hunt this one further down. I'm back now, so I'll see if I c... Wido den Hollander
03:35 AM CephFS Bug #1137: MDS Crash
Unfortunately after adding the debug to the config the crash stopped occurring which is a nuisance. Damien Churchill
12:44 AM CephFS Bug #1137: MDS Crash
Unfortunately that was just a one off crash. I have just set debug-mds = 20 in the ceph configuration now though. I'm... Damien Churchill

06/05/2011

11:34 PM Bug #1127: RBD got silent after 1 month
Hi Josh, Fix confirmed. Please commit the fix to the upstream. Yoshi Tamura
03:16 AM Bug #1143 (Resolved): mon addr without port breaks a new setup
When you create a new cephFS with a config that doesn't specify a port on the mon addr line, it creates an unusable c... Bernard Grymonpon

06/04/2011

01:45 AM Revision 69f90874 (ceph): dumpjnl: call msgr->register_entity before start
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
01:15 AM Revision 4abdf6fd (ceph): simple_spin: use file-scope global not function
function-scoped globals are protected by a mutex, and taking a mutex
inside a spin lock implementation kind of defeat...
Colin Patrick McCabe
12:17 AM Revision b198e5ac (ceph): messages: fix missing bit
Yehuda Sadeh
12:13 AM Revision 53adde03 (ceph): fix the MonClient problems for --with-debug programs.
Still doesn't compile, though.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
12:08 AM Revision b0afaacb (ceph): messages: fix test for multi op
Yehuda Sadeh

06/03/2011

11:36 PM Revision ec18be53 (ceph): simple_spin: fix linker error
For some stupid reason the linker is pulling this in twice, resulting in
/bin/sh ../libtool --tag=CXX --mode=link ...
Sage Weil
10:19 PM Revision a082747c (ceph): osd: make CLONERANGE src oid encoding more sane
Encode the src_oid in the OSDOp data space, but put it in a separate easy
to access member. This avoids changing the...
Sage Weil
10:16 PM Revision a635a9cb (ceph): rgw: multipart complete upload
Yehuda Sadeh
10:15 PM Revision 8e55e186 (ceph): librados: remove useless reference holding
Yehuda Sadeh
09:57 PM Revision 740eea1e (ceph): Refactor MonClient, KeyRing
MonClient should contain a KeyRing and a RotatingKeyRing. All the
MonClient users, except possibly csyn, don't want t...
Colin Patrick McCabe
09:49 PM Revision 7f393379 (ceph): Prettify exception handling.
Display exception type (e.g. "RuntimeError").
Don't re-display the traceback.
Tommi Virtanen
09:48 PM Revision 08607692 (ceph): Remove dead code.
Tommi Virtanen
09:47 PM Revision b6e22436 (ceph): Prettify config debug printing.
Tommi Virtanen
09:47 PM Revision f2f2f42e (ceph): osd: src src_oids oloc check
We need to ensure that the src and dst objects are always in the same pg.
That is true if
- both oloc.keys match, or...
Sage Weil
09:47 PM Revision 57f979f1 (ceph): Refactor for modularity.
New style: run "./virtualenv/bin/teuthology -v interactive.yaml". Tommi Virtanen
09:47 PM Revision 90b53543 (ceph): dout:remove stream from dout_emerg_streams earlier
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:47 PM Revision ed41f29a (ceph): remove g_keyring
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:47 PM Revision 98226c22 (ceph): DoutStreambuf: de-globalize dout lock
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:47 PM Revision 6ed9a583 (ceph): Add simple_spin
Add simple spinlock implementation that is safe to use from anywhere.
Signed-off-by: Colin McCabe <colin.mccabe@drea...
Colin Patrick McCabe
09:47 PM Revision 5b7049c8 (ceph): DoutStreambuf: de-globalize emergency logging
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:44 PM Revision 0d975b5b (ceph): Record Remote in RemoteProcess.remote, for caller convenience.
Tommi Virtanen
09:29 PM Revision 70d77095 (ceph): Revert "cfuse.cc: use safe_write"
This reverts commit e8ac5aa2a4c4e3ce84ed553dbebfb1cccf5679a9.
This commit is just erroneous. It adds checks on a pip...
Greg Farnum
09:09 PM Revision 73ea844a (ceph): librados: get reference to the io context for the pending async ops
Yehuda Sadeh
09:09 PM Revision 1aee7f98 (ceph): rgw: use clone_range for multi upload completion
Yehuda Sadeh
08:28 PM Revision befcff02 (ceph): SimpleMessenger: Keep a disposable flag for use in reset
pipes marked disposable must not inherit the lossy policy on reconnect.
Also, in Pipe::writer, when sent.empty() && c...
Samuel Just
07:36 PM CephFS Bug #1084: blogbench won't finish: waiting for Fr cap forever
I haven't test it for a while. I'll give it a try after the holidays. Henry Chang
04:34 PM CephFS Bug #1084 (In Progress): blogbench won't finish: waiting for Fr cap forever
Henry, have you seen this lately? I get the impression you were seeing it very easily and I've not been able to repro... Greg Farnum
07:13 PM Revision 7bd016f9 (ceph): rados_bencher: re-add written objects constraint to read benchmark.
Somehow, in the last major change, the constraints that kept the
bencher from trying to read non-existent objects got...
Greg Farnum
07:04 PM Bug #1142 (Resolved): dumpjournal crashes without dumping the journal
I reproduced this on change commit:6fd694c3942a12a3730a30d059b51b37d3f7536f, before the wip-815 branch was merged in.... Colin McCabe
06:53 PM Revision b4eb5efa (ceph): rados_bencher: re-add written objects constraint to read benchmark.
Somehow, in the last major change, the constraints that kept the
bencher from trying to read non-existent objects got...
Greg Farnum
06:22 PM Revision a97451f6 (ceph): librados: support clone_range
Yehuda Sadeh
05:49 PM Revision d1d3e26c (ceph): mds: remove now-erroneous comment
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:12 PM Revision 4ef74308 (ceph): Merge branch 'next'
Conflicts:
src/mds/Server.cc
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
04:53 PM Revision 19949f6d (ceph): mds: Clean up _rename_prepare journaling
This has been broken for a while in terms of journaling
things the MDS isn't auth for. This patch should fix that, an...
Greg Farnum
04:48 PM Revision 4689073c (ceph): mds: _rename_prepaer should only journal dest if auth for it
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:48 PM Revision 97ff24c0 (ceph): Un-hardcode tasks.
Tommi Virtanen
04:40 PM Revision 3be4b482 (ceph): Make autotest show debug messages.
Otherwise it's very quiet for a long time. Tommi Virtanen
04:40 PM Revision 1634f3e4 (ceph): Move autotest running into a task.
Tommi Virtanen
04:14 PM Revision 44fe80ab (ceph): Merge branch 'osd_clonerange' into rgw-multipart
Yehuda Sadeh
02:34 PM Bug #1141 (Closed): osd: misc snap bugs
Sage Weil
01:46 PM CephFS Bug #1139 (Resolved): cfuse crashes on exit
Fixed in master by commit:e8ac5aa2a4c4e3ce84ed553dbebfb1cccf5679a9.
The bug doesn't do anything meaningful except ...
Greg Farnum
10:20 AM CephFS Bug #1139 (Resolved): cfuse crashes on exit
I believe this is a new bug. All the FUSE components exit properly but it leaves behind a core dump from the assert i... Greg Farnum
11:33 AM Bug #1118 (Resolved): Crash OSD after upgrdae from 0.28.1 to 0.28.2
If it's xfs it sounds like that was it! Sage Weil
09:31 AM Bug #1118: Crash OSD after upgrdae from 0.28.1 to 0.28.2
Yes, I use xfs.
Sorry, I cannot reproduce this error - I already reformat cluster.
Fyodor Ustinov
11:32 AM Linux kernel client Bug #1140 (Resolved): balance_dirty_pages makes Fw cap revocation slow
See comments for #1110 Sage Weil
11:31 AM CephFS Bug #1110 (Resolved): mds: ls -l hangs on concurrent writer
I'm going to open a separate kclient issue to deal with the balance_dirty_pages issue. Sage Weil
11:29 AM Bug #1121 (Resolved): rados: rados bench read aborts with an error
Pushed to master in commit:b4eb5efaf87d8213f89dee0d9bb156171fcd18e1 and stable in commit:7bd016f97691919689a84b4bd27e... Greg Farnum
10:29 AM Cleanup #726 (Closed): Make libcommon self-sufficient
we can reopen if this causes any real problems Sage Weil
10:27 AM CephFS Bug #1047 (Can't reproduce): mds: crash on anchor table query
The log doesn't have enough info. If anyone sees this again, let's reopen! Sage Weil
10:25 AM Bug #1018: error on building ceph on red hat 5.5
Does this problem still exists on v0.28+? Have you looked at the redhat info in the wiki?
http://ceph.newdream.ne...
Sage Weil
10:23 AM Bug #1032: osd: Marked down and become zombies after killing
Wanted to check in on this one. Are you still seeing this problem? When the processes are zombies, are there any btr... Sage Weil
10:17 AM CephFS Bug #1137: MDS Crash
does this happen each time you try to start cmds?
If so, can you add
debug mds = 20
debug ms = 1
to [mds] sec...
Sage Weil
04:03 AM CephFS Bug #1137 (Can't reproduce): MDS Crash
... Damien Churchill
10:17 AM CephFS Bug #1041 (Resolved): standby-replay fails on multi-mds fsstress journals
Okay, after 3 or 4 more runs I've only seen #1128. Greg Farnum
09:33 AM CephFS Bug #1041: standby-replay fails on multi-mds fsstress journals
All right, I went over _rename_prepare pretty carefully and reworked a lot of the checks on journaling and now i have... Greg Farnum
04:59 AM Bug #1138 (Resolved): need to package rados.py in the debian .deb
need to package rados.py in the debian .deb
It's a little tricky because paths to 'site-python' vary based on pyth...
Colin McCabe
04:57 AM Bug #1134 (Resolved): rados export --delete-after can't clean up after a crash
Resolved by commit:0f3224e172a077155f64897c8a3665fea6d5d892 and commit:637dfc3ed3194fdb1f5235cd48c8023c7fb1cbda Colin McCabe
01:35 AM Revision c28b749b (ceph): uclient: don't use racy check for uncommitted data.
Previously we used a check for if there were CEPH_CAP_FILE_BUFFER refs,
but that was racy if we had other threads (th...
Greg Farnum
01:35 AM Revision cd5049dc (ceph): uclient: reset flushing_caps on (mds) cap import.
Previously, we could get stuck thinking that we'd flushed caps
(that went to the original MDS, waited on freeze for e...
Greg Farnum
01:35 AM Revision 39d50c13 (ceph): mds: fail out of path_traverse if we have a null dentry.
Previously if we had a null dentry which we were not auth for,
we would go into a loop of discover lookups on that de...
Greg Farnum
01:35 AM Revision 2c6b5600 (ceph): uclient: call the right function pointer on truncate
fixes 67533e14439e9b to do what it meant to.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
01:31 AM Revision 350e6503 (ceph): mds: use XSYN state for rdlocks during EXCL
Move to XSYN state if we get an rdlock attempt from EXCL. This means that
when there is an EXCL client doing buffere...
Sage Weil
01:22 AM Revision bbaf0b57 (ceph): mds: add xsyn states
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
01:21 AM Revision 5fc6d921 (ceph): filestore: compare dentry->d_type against d_type constant
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
01:10 AM Revision ade2ccbe (ceph): osd, filestore: debug collection listing
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
01:10 AM Revision ea76ea50 (ceph): filestore: stat to test for file type if d_type is unsupported
This only affects list_collections. Previously, when using an FS that
does not support d_type, like xfs, load_pgs wou...
Josh Durgin

06/02/2011

11:59 PM Revision 637dfc3e (ceph): rados_sync: add test for temp file deletion, fix
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:58 PM Revision d4edd17c (ceph): rgw: multipart: use locator on created parts
Yehuda Sadeh
11:48 PM Revision 0f3224e1 (ceph): rados_sync: in export, download, then rename
Download files to a 'temporary' name and then rename them when they are
complete. If the download gets aborted halfwa...
Colin Patrick McCabe
10:24 PM Revision 37666185 (ceph): rgw: multipart additions and fixes
Yehuda Sadeh
10:14 PM Revision 6fd694c3 (ceph): Remove unneeded libcrush1 files
Laszlo Boszormenyi
10:13 PM Revision d6bbf3e5 (ceph): mds: journal parents of srci when srcdn is remote
If srcdn is a remote dentry, we will be journaling the src inode to update
the mtime, but we need to ensure the paren...
Sage Weil
10:04 PM Revision ce5f0e71 (ceph): Move interactive and cfuse into tasks.
Tommi Virtanen
09:51 PM Revision 806646b0 (ceph): journaler: also initialize safe_pos
on reread_head. Keep consistent across the two methods.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
09:50 PM Revision a13b6643 (ceph): journaler: fix trim crash after standby-replay -> active
The reread_head method needs to initialize trimming_pos (like read_head
does) or else we get confused later.
Signed-...
Sage Weil
09:14 PM Revision 7ca240bf (ceph): mds: cleanup rename_prepare a bit
Use *srci tmp.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:57 PM Revision 0bcd9ac7 (ceph): vstart.sh: turn down debug ms
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:45 PM Revision 4d03e159 (ceph): rgw: some more multipard upload list
Yehuda Sadeh
08:33 PM Revision 52bf3fce (ceph): rgw: extend multipart list parts response
Yehuda Sadeh
08:24 PM Revision a670b4b3 (ceph): osd: implement clonerange
Clone ranges of bytes between objects, provided
- src object locators match dest object
- src objects are not miss...
Sage Weil
08:24 PM Revision fc4cc399 (ceph): osd: give obc refs to RepGather
Just give the ref to RepGather instead of doing a get and put.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:16 PM Revision 4cf342a1 (ceph): mds: pin inode while one renamed_files list
Make sure the inode is pinned while it is on the LogSegment::renamed_files
list. Avoids a crash when an inode on tha...
Sage Weil
07:25 PM Bug #1098 (Closed): mds never coming "up:active" awaits in "up:creating"
Sage Weil
07:11 PM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Okay, there was an issue with the behavior with the MDS locks (they didn't do what I thought they did). I added a ne... Sage Weil
06:59 PM Linux kernel client Bug #1136 (Resolved): mempool_destroy failure on umount
on current master, i was doing umount on a sluggish cluster, and... Sage Weil
06:23 PM CephFS Bug #1117: mds: rename rollback broken on slaves during replay
It seems to also be broken on the master -- I've been testing cross-MDS rename ops and of course you see a lot of rol... Greg Farnum
06:15 PM Bug #1118: Crash OSD after upgrdae from 0.28.1 to 0.28.2
This looks like the same problem as #1127. If you're not using ext3/4 or btrfs, it almost certainly is. Can you try t... Josh Durgin
06:11 PM Bug #1127: RBD got silent after 1 month
Hi Yoshi, I put a fix in the filestore_debugging branch, which will generate a new package in about 1/2 an hour.
L...
Josh Durgin
05:39 PM Revision b152a93c (ceph): rgw: more cleanup
Yehuda Sadeh
05:28 PM Revision 3546cfdd (ceph): rgw: some cleanup
Yehuda Sadeh
04:49 PM Revision 50731646 (ceph): rgw: multipart upload parser test util
Yehuda Sadeh
04:39 PM Revision 2f3f36ab (ceph): rgw: fix multipart upload complete parser
Yehuda Sadeh
04:30 PM Revision 711a77cf (ceph): rgw: multipart complete fix
Yehuda Sadeh
04:19 PM Revision 0cce0a5e (ceph): filestore: allow clone_range to different offsets
The old OP_CLONERANGE would only clone a range of bytes at the same offset
in both objects. Add an OP_CLONERANGE2 op...
Sage Weil
04:17 PM Revision 502baeab (ceph): filestore: fix fallback/slow do_clone_range
We need to seek to the appropriate offsets on the src and destination
fd's for this to do the right thing.
Signed-of...
Sage Weil
04:17 PM Revision 6ca168ed (ceph): filestore: fix fallback/slow do_clone_range
We need to seek to the appropriate offsets on the src and destination
fd's for this to do the right thing.
Signed-of...
Sage Weil
04:09 PM Revision 95163e94 (ceph): Fetch ceph binary tarball independently on every node.
Avoids shuffling the bytes through the controlling node.
Use sha1 file to make sure everyone gets the same version.
Tommi Virtanen
03:58 PM Cleanup #1135 (Resolved): d_type cleanup
the codebase seems to have lots of this going on:
src/mds/CDir.cc:726: if (dn->get_linkage()->get_remote_d_ty...
Anonymous
03:21 PM Bug #1134: rados export --delete-after can't clean up after a crash
I guess I should add that manually removing that file from the exported directory makes it work again!
Also, it wo...
Jeremy Kitchen
03:16 PM Bug #1134 (Resolved): rados export --delete-after can't clean up after a crash
I was using rados export to dump out a pool and it was taking a long time so I ctrl-c'd it. Now when I do it on that ... Jeremy Kitchen
03:13 PM RADOS Bug #1129 (Won't Fix): sort out libcrush
meh, let's not worry about it until someone needs libcrush.so. Sage Weil
03:11 PM CephFS Bug #1132 (Resolved): mds: missing parent in rename metablob
commit:d6bbf3e5fbe1df26d1bfe6f695ca52cfbb3694b2 Sage Weil
01:19 PM CephFS Bug #1132 (Resolved): mds: missing parent in rename metablob
single mds, fsstress -p 30 workload... Sage Weil
03:07 PM CephFS Bug #1133 (Resolved): mds: journaler failed assertion on standby-replay -> replay
fixed by commit:a13b66436561bfe86f4907d18d2ea7762632d36d Sage Weil
02:04 PM CephFS Bug #1133 (Resolved): mds: journaler failed assertion on standby-replay -> replay
fsstress workload. kill master mds. standby crashes with:... Sage Weil
12:29 PM Bug #1131 (Resolved): OSD assert failure in update_heartbeat_peers()
Probably fixed in current stable: c5470e0f855b246cfbde6982ca90f565e7074600. Let us know if it persists! Samuel Just
12:20 PM Bug #1131 (Resolved): OSD assert failure in update_heartbeat_peers()
I'm not sure I can reproduce it, because my system state is a bit out of whack due to a previous bug (#1130), but I'v... Sam Lang
09:54 AM Linux kernel client Bug #1096 (Resolved): LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
Thanks Jeff! Sage Weil
02:51 AM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
Hi , I apply the patch ,to verify this bug,
run "./fsstress -d /mnt/ceph/fstest -l 1 -n 10000 -p 1 -v" , pass.
r...
changping Wu
04:20 AM Revision 7e2e4779 (ceph): mon: make sure osd paxos is writeable before doing timeouts
The osd paxos machine has to be writeable before we can update it.
Fixes: #1130
Signed-off-by: Sage Weil <sage.weil@...
Sage Weil
12:05 AM Revision c5470e0f (ceph): OSD: don't keep old connection over new one in update_heartbeat_peers
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just

06/01/2011

11:34 PM Revision 780322db (ceph): boto_tool: add get_bucket_acl
Signed-off-by: "Colin McCabe" <colin.mcccabe@dreamhost.com> Matthew Wodrich
11:28 PM Revision e11958b2 (ceph): Merge branch 'stable' into next
Sage Weil
11:23 PM Revision 59501e1d (ceph): Merge branch 'stable'
Sage Weil
11:04 PM Revision de0f0c72 (ceph): Refactor to use Cluster and Remote, to evaluate the new APIs.
Tommi Virtanen
10:13 PM Revision 65dc8411 (ceph): rgw: implement list multipart
still partially implemented Yehuda Sadeh
09:54 PM Bug #1130 (Resolved): monitor crash in PaxosService:propose_pending()
This should be fixed by commit:7e2e4779e4323429167af36e9a5fb9741c075e96. Thanks for the report! Sage Weil
04:36 PM Bug #1130 (Resolved): monitor crash in PaxosService:propose_pending()
While doing some failure testing, one of the ceph monitors crashed. I have 6 osds, 3 monitors, and 3 mds servers run... Sam Lang
06:42 PM Revision e340bfe1 (ceph): dout: use recursive mutex for dout
Using a recursive mutex for dout is desirable because it allows us to
survive situations like this:
> foo() { dout <...
Colin Patrick McCabe
06:21 PM Linux kernel client Tasks #1112: check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock) + igrab...
Hi ,
ceph_set_page_dirty still exist igrab ,
i merge the patch to ceph-cleint-standalone,
run fsstress, still hit...
changping Wu
06:18 PM Revision 44770df8 (ceph): lockdep: fix shadowed global, add printout
Fix a bug that was keeping lockdep from starting. Add a printout that
lets the user know that lockdep is enabled.
Si...
Colin Patrick McCabe
04:44 PM Revision 9b37f4fa (ceph): Allow embedded '\0' in bufferlists when copying to std::string.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
04:36 PM RADOS Bug #1129 (Won't Fix): sort out libcrush
librados and libceph now statically link in crush code. Should it be a .so? Should be provide a .so anyway, for thi... Sage Weil
04:16 PM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
This should be fixed by commit:85defe76f7e2a0b3d285a3be72fcffce96629b5c, pushed to the master branch. Can you test an... Sage Weil
11:35 AM Linux kernel client Bug #1096 (In Progress): LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
Scratch that, something a bit more subtle is going on. Sage Weil
11:14 AM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
The problem is a short O_DIRECT read that hits EOF. This seems to fix it for me:... Sage Weil
12:19 AM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
git ceph-client commit 98cc99822dac96710a8b64bdc2be4eccffc78956 ,
hand compiling , btrfs+ ubuntu 10.10+2.6.39+ ..
...
changping Wu
04:09 PM CephFS Bug #1128: clustered mds: failed verify_subtree_bounds
Oh right. Logs and core dump in:
kai:~gregf/logs/fstress/replay_bad_bounds
Greg Farnum
04:04 PM CephFS Bug #1128 (Resolved): clustered mds: failed verify_subtree_bounds
... Greg Farnum
09:14 AM Bug #1127: RBD got silent after 1 month
Yoshi, can you attach 'ceph osd dump -o - 26', 'ceph osd dump -o -', and 'ceph pg dump -o -' outputs? Sage Weil
08:43 AM Bug #1127: RBD got silent after 1 month
Looked into this a bit on irc yesterday. This part of the osd log looks problematic - there's only one osd, so the pg... Josh Durgin
12:03 AM Revision 7c6c6a9e (ceph): rados_sync: don't hash paths with periods
A period is not such a bad character.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe

05/31/2011

11:57 PM Revision 4870393a (ceph): test_rados_tool.sh: test hashed paths
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
11:45 PM Revision b4bc1c68 (ceph): rados export: better name mangling rules, fix test
Introduce a versioning scheme for name mangling, so that we can change
it in the future if we want to.
For names tha...
Colin Patrick McCabe
11:05 PM Revision 5dd0e122 (ceph): rgw: handle multipart completion
still wip Yehuda Sadeh
10:32 PM Revision d29b3b77 (ceph): rgw: parser for multi upload completion
Yehuda Sadeh
10:01 PM Revision 7a474b10 (ceph): Use orchesta.remote as a higher-level wrapper, stop worrying about host...
This changes just first caller in a series of many; the rest will change
once a role-based API is in place.
Tommi Virtanen
10:01 PM Revision 33c39ab5 (ceph): rados_sync: prefix user extended attributes
Start user extended attributes with USER_XATTR_PREFIX.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
09:59 PM Revision 0806e65b (ceph): rgw: some more xml reshuffling
Yehuda Sadeh
09:59 PM Revision 9970b86c (ceph): Wrap Remote._runner in staticmethod() or it gets mistaken for a method.
It used to get an extra self argument, and mistook that as client. Tommi Virtanen
09:33 PM Revision dc9aaacf (ceph): Add a pretty wrapper on top of Paramiko and run.run.
Most importantly right now, it knows its name, and can
prettyprint it.
Tommi Virtanen
09:31 PM Revision f5d6be6e (ceph): rgw: move generic xml parsing code to some shared location
Yehuda Sadeh
09:31 PM Revision 5875f796 (ceph): Remove dead code.
Tommi Virtanen
09:28 PM Revision efee7466 (ceph): objecter, osd: clonerange operation
Add a src_oids field to MOSDOp, referenced by a new CLONERANGE osd op type
that will clone data from one object to an...
Sage Weil
08:58 PM Revision 07c1989a (ceph): librados: implement aio_flush
Implement a per-ioctx flush that blocks until all previously submitted
aio operations on the ioctx are safe. Each ai...
Sage Weil
08:51 PM Revision 7d4bb120 (ceph): Initial import.
Currently hardcoded to run dbench, not modular, and the remote
execution API is clumsy.
Tommi Virtanen
08:46 PM Revision 6db2a4e2 (ceph): crushtool: error out if uniform weights vary
Fixes: #1075
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:28 PM Revision 35b19a41 (ceph): osd: fix ScrubFinalizeWQ::_clear condition
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
07:58 PM Revision 1528d2c4 (ceph): debian: depend on libboost-dev >= 1.34
for statechart. Partially fixes #1124.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:45 PM Bug #1127 (Resolved): RBD got silent after 1 month
RBD got silent after about 1 month running.
Although I restarted the daemons, the symptom doesn't go away.
Attached...
Yoshi Tamura
04:37 PM Revision 0cfa911f (ceph): osd: don't leak Connection reference
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:37 PM Revision 8aa67aa4 (ceph): osd: ignore old/stale heartbeat messages
If we get heartbeat messages from old epochs from peers that are not
current, drop them and mark the connection down....
Sage Weil
04:37 PM Revision e5c9100b (ceph): osd: fix map sharing due to heartbeats
- share the map with the cluster addr
- use the new {note,get}_peer_epoch helpers to do it sanely
- don't share if we...
Sage Weil
02:26 PM RADOS Feature #1126 (Rejected): crush: extend rule definition
The current rule command structure does not allow you to do something like:
- pick 2 racks
- pick 2 devices under...
Sage Weil
01:57 PM Feature #511 (Resolved): librados: implement flush
Sage Weil
01:44 PM RADOS Feature #1075 (Resolved): crushtool: warn if uniform item weights vary
Sage Weil
01:15 PM rgw Subtask #1125 (Resolved): osd: support for merging/cloning several objects into one final object
Sage Weil
12:34 PM Bug #1124: Depend on new enough Boost
and
3. ceph.spec.in
Anonymous
11:47 AM Bug #1124 (Resolved): Depend on new enough Boost
Ensure that we depend on a new enough libboost to build successfully. Do this in
1. debian/control
2. autoconf
...
Anonymous
11:34 AM Feature #1123 (Resolved): qa: small but completely functional suite
Sage Weil
10:57 AM Bug #906 (Can't reproduce): clustered mds: lchown not setting uid/gid
Sage Weil
10:53 AM CephFS Bug #1111 (Resolved): file lock requests in wait queue not getting cleaned up after process exit
Sage Weil

05/30/2011

09:45 PM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6

echo 'file fs/ceph/caps.c +p' > /sys/kernel/debug/dynamic_debug/control
logs attached .
changping Wu
08:29 PM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6

Hi ,i git ceph-client master branch:
commit 98cc99822dac96710a8b64bdc2be4eccffc78956
Author: Sage Weil <sage@ne...
changping Wu
09:14 PM Bug #906: clustered mds: lchown not setting uid/gid
I don't think that I ever did manage to reproduce it.
I haven't thought it through much, but it's also possible th...
Greg Farnum
08:49 PM Linux kernel client Bug #1109 (Closed): rbd: btrfs crash
this was on old code. Sage Weil
08:47 PM Bug #1122 (Resolved): kclient: async readahead
Many people now have noticed that sequential read performance is slower than writes. Is this simply a matter of adju... Sage Weil
08:46 PM Bug #1121 (Resolved): rados: rados bench read aborts with an error
Reported by multiple people now on ceph-devel. Probably easy to fix? Sage Weil
08:45 PM Feature #1120 (Resolved): qa: gcov metrics
generate total coverage statistics for the entire qa suite so we can measure overall coverage and improvements. we c... Sage Weil
08:44 PM Feature #1119 (Resolved): qa: gcov/lcov html output
generate browsable lcov pages for individual tests and/or the whole qa suite Sage Weil
07:37 PM Revision 5b7c8ae8 (ceph): osd: protect recovery_wq ops with the recovery lock
We were calling recovery_item.remove_myself() without holding the
recoveryWQ::lock. Naughty naughty!
Signed-off-by: ...
Greg Farnum
07:37 PM Revision b3fb58ea (ceph): crushtool: add -v verbose for --test mode
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
01:27 PM Bug #1118 (Resolved): Crash OSD after upgrdae from 0.28.1 to 0.28.2
I know, "after" is not always "because". :)
I stop cosd, upgrade (by aptitude from yours repository) and start aga...
Fyodor Ustinov
01:10 PM Bug #1116 (Resolved): RecoveryWQ assert failure
commit:5b7c8ae8bdc26e7593323c76527cb37912b9d833 Sage Weil

05/29/2011

10:55 PM Linux kernel client Tasks #1112: check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock) + igrab...
Hi ,
I'am verifing fsstress test with ceph-client master branch:
commit 98cc99822dac96710a8b64bdc2be4eccffc78956
...
changping Wu
10:00 PM Revision 57ea5020 (ceph): Add content to obsync package
Laszlo Boszormenyi
09:42 PM RADOS Bug #1017 (Closed): ceph 0.26 ,mkcephfs --crushmap crush.new ,wait for very long time,mds stat i...
Looks like you need 'chooseleaf' instead of 'choose' in the crush rules. Sage Weil
09:42 PM RADOS Bug #1016 (Closed): ceph 0.26,crushmap change,mount fail.
Looks like you need 'chooseleaf' instead of 'choose' in the crush rules. Sage Weil

05/28/2011

04:14 PM Revision 23242045 (ceph): v0.28.2
Sage Weil

05/27/2011

09:46 PM Revision 7e1de380 (ceph): hadoop: track Hadoop API changes
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:22 PM Revision 232cd6b3 (ceph): rgw: generate random upload id
Yehuda Sadeh
09:05 PM Revision 4ddf8df8 (ceph): SimpleMessenger: allow multiple calls to shutdown
Fixes a case where radostool crashed on an error shutdown.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
09:01 PM Revision 8490b784 (ceph): common/Thread.h: const cleanup
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
08:35 PM Revision a0d521b2 (ceph): rgw: fix signing for some requests
Yehuda Sadeh
07:50 PM Revision 818bfd15 (ceph): rgw: serve multipard init upload
still needs to generate a random hidden object, and use rados locator
for accessing it.
Yehuda Sadeh
05:59 PM Revision 7cfb3b6a (ceph): Merge branch 'wip-obsync'
Colin Patrick McCabe
04:01 PM Bug #1116: RecoveryWQ assert failure
Looks as though this patch has helped.
At least this osd has completd rebalancing.
Great! Thanks!
Fyodor Ustinov
12:17 PM Bug #1116: RecoveryWQ assert failure
Okay, checked this out. It turns out that the only function violating the locking was OSD::do_recovery. Simply adding... Greg Farnum
09:48 AM Bug #1116 (Resolved): RecoveryWQ assert failure
From Fyodor:... Greg Farnum
02:30 PM CephFS Bug #1117 (Resolved): mds: rename rollback broken on slaves during replay
Best I can tell it's just busted. The rollback object contains all the dentries and inodes, but on a slave it's entir... Greg Farnum
10:03 AM Bug #1052 (Resolved): obsync: add rados backend tests to test-obsync.py
Implemented. Colin McCabe
09:39 AM CephFS Bug #1041: standby-replay fails on multi-mds fsstress journals
Back from vacation, and I'm trying to remember what's still broken here. Looking through my logs:
1) MDS 1 gets requ...
Greg Farnum
09:24 AM Linux kernel client Tasks #1112: check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock) + igrab...
Jeff Wu wrote:
> static int ceph_set_page_dirty(struct page *page)
> {
> ...............................
> /* dir...
Sage Weil
08:01 AM Linux kernel client Tasks #1112: check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock) + igrab...

static int ceph_set_page_dirty(struct page *page)
{
...............................
/* dirty the head */
spin...
changping Wu
08:00 AM Linux kernel client Tasks #1112: check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock) + igrab...
Hi ,
I attached some of logs at bug #1096 http://tracker.newdream.net/issues/1096.
:ceph-client-fsstress log 1,2,3....
changping Wu
08:17 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
> OK, thanks. I'll try out 2.6.39 tomorrow. Will keep you informed.
Now running 2.6.39 everywhere on freshly creat...
Andre Noll
04:37 AM Revision 574b58f3 (ceph): mkcephfs: pass config to osdmaptool
This lets OSDMap::create_simple() see g_conf.osd_pool_default_size when
creating the initial data, metadata, and rbd ...
Sage Weil
04:31 AM Revision d2ab764b (ceph): drop useless cm.txt
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:20 AM Revision 1292436b (ceph): osdmap: take default pool size from config
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil

05/26/2011

10:18 PM Revision 9e8484e8 (ceph): rgw: handle POST requests for s3
Yehuda Sadeh
10:07 PM Revision 9b8daa92 (ceph): crushtool: update help
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:00 PM rgw Feature #767 (In Progress): rgw: incremental/large file uploads
Sage Weil
09:11 PM Revision 6f704e33 (ceph): obysnc: rgw target: validate all users
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
08:17 PM Revision 22082c4f (ceph): mon: remove pg_temp mappings when we delete pools
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:17 PM Revision ae5bbc7b (ceph): Merge branch 'wip-obsync'
Colin Patrick McCabe
08:15 PM Revision e0cbb131 (ceph): test-obsync: test sync directly from s3->rgw
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
08:12 PM Revision a93c86e5 (ceph): crushtool: fix --add-item weight being zero when parent bucket(s) created
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:21 PM Revision 56d5d959 (ceph): obsync: fix bucket creation through rgw target
The rgw: target can now create buckets. Add a test.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
06:04 PM Revision 9cefb56b (ceph): Merge branch 'stable'
Sage Weil
05:25 PM Revision b2c1bff8 (ceph): test-obsync: test big objects, user-defined xattr
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:22 PM Bug #1098 (In Progress): mds never coming "up:active" awaits in "up:creating"
shyamali mukherjee wrote:
> I have put OSd logfile and journal to ext3. osd data still comes from "btrfs".
>
> I ...
Sage Weil
11:26 AM Bug #1098: mds never coming "up:active" awaits in "up:creating"
I have put OSd logfile and journal to ext3. osd data still comes from "btrfs".
I have tried atleast about 50 times...
shyamali mukherjee
10:14 AM Bug #1098: mds never coming "up:active" awaits in "up:creating"
You switched everything over to ext3?
It doesn't look like a user_xattr issue; the cosd daemon will error out and ...
Sage Weil
10:01 AM Bug #1098: mds never coming "up:active" awaits in "up:creating"
The cosd has blocked on a btrfs bug; it doesn't have much to do with Ceph.
Eventually your cluster should declare ...
Greg Farnum
09:50 AM Bug #1098: mds never coming "up:active" awaits in "up:creating"
Hi Sage,
I know you have closed the issue. But I could not attach the logfile as it is too huge. I have got few li...
shyamali mukherjee
05:19 PM Revision e9eeb161 (ceph): mkcephfs: set rdir for local mon setup
Fixes: #1113
Reported-by: Bernard Grymonpon <bernard@openminds.be>
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
04:55 PM Revision 5d51b8fd (ceph): init-ceph: ssh
Another bell/whistle.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
04:31 PM rgw Bug #1115: rgw allows users to "give away" s3 objects
I added a test for this to s3-tests. To run it, use:... Colin McCabe
03:36 PM rgw Bug #1115 (Resolved): rgw allows users to "give away" s3 objects
The Rados gateway should not allow the owner of an object to be changed through a PUTACL operation. Amazon doesn't al... Colin McCabe
03:42 PM Bug #1056 (Won't Fix): obsync: add warning when source owner is not the same as dest owner (after...
The owner of the object we create is determined by the access key and secret key supplied by the user. It can never b... Colin McCabe
03:37 PM Bug #1046 (Resolved): rgw: changing object owners
Filed bug #1115 because RGW's behavior does not match Amazon's. Colin McCabe
03:33 PM Bug #1046: rgw: changing object owners
The answer to question #1 is no, Amazon does not allow users to "give away" the ownership of objects. Colin McCabe
03:29 PM CephFS Bug #1114 (Rejected): NFS export extreme slowdown
Attached is debug mds 20 output.
Below is ceph -w output for a corresponding period.
Time synchronization is < 0.1s...
Brian Chrisman
03:28 PM Bug #906: clustered mds: lchown not setting uid/gid
Greg, what did you do before to reproduce this? Sage Weil
02:29 PM Bug #1050 (Won't Fix): obsync: implement --filter to allow certain objects in the source to be sk...
The original reason we wanted this feature was to skip objects with different owners.
This was handled by the creati...
Colin McCabe
02:28 PM Bug #1051 (Resolved): obsync: create a librgw to parse binary ACLs generated by RGW
Colin McCabe
01:22 PM Bug #960 (Resolved): obsync: support rados pool "buckets"
> - sync directly to/from librados
Implmeneted in the rgw: target.
> - copy amazon acl's into same xattr name tha...
Colin McCabe
12:53 PM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Sage Weil wrote:
> Andre Noll wrote:
> > Hm that does not seem to work. I had to compile a kernel with dynamic debu...
Andre Noll
09:32 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Andre Noll wrote:
> Hm that does not seem to work. I had to compile a kernel with dynamic debug enabled,
> but noth...
Sage Weil
02:30 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
Hm that does not seem to work. I had to compile a kernel with dynamic debug enabled,
but nothing makes it to the log...
Andre Noll
11:16 AM CephFS Bug #1108: Large number of files in a directory makes things grind to a halt
Excellent thanks for the tips. It'll have to wait until Tuesday now for testing but I'll report back then. Going to u... Damien Churchill
10:10 AM CephFS Bug #1108: Large number of files in a directory makes things grind to a halt
If that turns out to be too unstable for you and you have gobs of RAM for your MDS, you could also bump up the MDS ca... Greg Farnum
10:20 AM Bug #1113 (Resolved): rdir is not set correctly for the mons
Fixed in stable branch. BTW in the future please add a Signed-off-by to your patches... see SubmittingPatches file i... Sage Weil
09:45 AM Bug #1113 (Resolved): rdir is not set correctly for the mons
In mkcephfs, rdir is used to keep the config in for remote hosts, and dir is used for localhost. However, when bootst... Bernard Grymonpon
09:53 AM Bug #1095: run "rados bench 10 seq -p data" print "error during benchmark: -5"
Actually, the write benchmark should record how many objects are left and the read benchmark isn't supposed to go pas... Greg Farnum
09:17 AM Linux kernel client Tasks #1112: check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock) + igrab...
Hi Jeff-
Are there actual cases of this that you're seeing? I've fixed several of these, but I'm not aware curren...
Sage Weil
12:48 AM Revision 05cfb4d5 (ceph): obysnc: fix content-type on RGWStore
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:36 AM Revision 6cf67a26 (ceph): test-obsync: compare_directory now compares xattrs
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe

05/25/2011

11:50 PM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
Hi ,i git ceph-client master :
commit 35b0ed997b1a49ff73a6110cbd04681467dbe217
Author: Sage Weil <sage@newdream.n...
changping Wu
07:40 AM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
I will build the kernel to verify it.thanks. changping Wu
11:45 PM Linux kernel client Tasks #1112 (Resolved): check all igrab at ceph-client,remove deadlock : spin_lock(&inode->i_lock...
Hi , at igrab function,it has existed the codes: spin_lock(&inode->i_lock);
if coding this:
spin_lock(&inode->i_...
changping Wu
10:55 PM Revision 4cae0ea8 (ceph): ceph-pybind-test: test embedded NULLs in data
Test embedded nulls in rados data. Fix a bug in rados.Object.__str__
Signed-off-by: Colin McCabe <colin.mccabe@dream...
Colin Patrick McCabe
10:49 PM Revision a2d35295 (ceph): obsync: more fixes for RgwStore
* Fix content-type handling
* add vvprint and use it in Object::equals.
* support RgwStore::prefix
* more tests
S...
Colin Patrick McCabe
10:48 PM Revision b76874f6 (ceph): pybind/rados: correctly return data with NULLs
Correctly handle returning data with embedded NULLs in it.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com>
Colin Patrick McCabe
10:21 PM Revision 970897ce (ceph): pybind/rados.py: throw NoData on ENODATA
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
09:56 PM Revision 28c30265 (ceph): mds: fix canceled lock attempt
If client tries to lock a file, has to wait, and then cancels the attempt,
the client will send an unlock request to ...
Sage Weil
09:34 PM Revision 596a3d6a (ceph): librbd: make image contexts threadsafe
Use refresh_lock to protect the needs_refresh member, and
ImageContext::lock for the header and snapshot metadata.
S...
Josh Durgin
09:22 PM Revision d38001c7 (ceph): pybind/rados.py: rados.Object.key should be string
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
07:58 PM Revision b2554823 (ceph): obysnc: RgwStore: make sure destination users exist
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
07:36 PM Revision 5d865fb6 (ceph): obsync: fix DST_OWNER
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
07:33 PM Revision 73e28f2e (ceph): rgw: return EACCES if acl xattr doesn't exist
Yehuda Sadeh
07:05 PM Revision ea76712a (ceph): obsync: Add boto_retries, remove rgw_store.prefix
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
06:23 PM Revision e3dd77d8 (ceph): librbd: const cleanup
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:06 PM Revision 2aa9151e (ceph): librbd: clean up md_oid use a bit
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:01 PM Revision 0adaa6b6 (ceph): rados python bindings: handle xattrs with NULL
Handle extended attributes that contain NULL bytes correctly, rather
than treating everything as zero-terminated C st...
Colin Patrick McCabe
05:54 PM Revision d4bfd964 (ceph): PG: fix race in _activate_committed
Previously, _activate_committed would access the osdmap epoch racing
with handle_osd_map's osdmap update. This would...
Samuel Just
05:50 PM Revision 7de7ba00 (ceph): RgwStore: fix some ACL issues
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:21 PM Revision 42f873e6 (ceph): Proper ACL support for rados targets
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:21 PM Revision 17053739 (ceph): test-obsync: refactor a little bit
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:21 PM Revision e4e098ba (ceph): Rename RadosStore to RgwStore
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
05:21 PM Revision 3f5f5620 (ceph): test-obysnc.py: support librgw testing
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
04:38 PM Revision 0aa18f32 (ceph): mds: do not shift to EXCL or MIX while rdlocked
There was an old change in file_eval() that was allowing us to switch from
SYNC to MIX or EXCL while there were rdloc...
Sage Weil
04:25 PM Messengers Bug #1107 (Resolved): msgr: old outgoing connection + mark_down leaves stale state on remote peer
Sage Weil
02:57 PM CephFS Bug #1111: file lock requests in wait queue not getting cleaned up after process exit
pushed kclient fix to ceph-client.git master branch and mds fix ceph.git master branch. Sage Weil
02:38 PM CephFS Bug #1111 (In Progress): file lock requests in wait queue not getting cleaned up after process exit
Sage Weil
12:12 PM CephFS Bug #1111: file lock requests in wait queue not getting cleaned up after process exit
Adding our test program. Brian Chrisman
11:37 AM CephFS Bug #1111: file lock requests in wait queue not getting cleaned up after process exit
Here's the notes on our bug and the related PIDs
NODE 1: 192.168.98.112 (client 4113)
- gets the lock and ho...
Brian Chrisman
11:36 AM CephFS Bug #1111 (Resolved): file lock requests in wait queue not getting cleaned up after process exit
Our interpretation of events:
1) proc1 requests lock
2) proc1 receives lock
3) proc2 requests lock
4) proc2 gets ...
Brian Chrisman
12:05 PM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
The kclient isn't responding to a cap revocation message. I'm not seeing anything since 2.6.38 that would have fixed... Sage Weil
11:39 AM CephFS Bug #1110: mds: ls -l hangs on concurrent writer
mds log created with
ceph mds tell 0 injectargs '--debug-mds 20 --debug-ms 1'
The problem occurs at 2011-05...
Andre Noll
10:49 AM CephFS Bug #1110 (Resolved): mds: ls -l hangs on concurrent writer
... Sage Weil
10:57 AM Bug #1098 (Closed): mds never coming "up:active" awaits in "up:creating"
Sage Weil
10:56 AM Bug #1012 (Rejected): Autotest: Measure RADOS IO performance under read and write loads
Sage Weil
10:56 AM Feature #948 (Rejected): autotest: graph rbd performance
Sage Weil
08:59 AM CephFS Bug #1108: Large number of files in a directory makes things grind to a halt
Enabling directory fragmention should fix this.. add
mds bal frag = true
to your [mds] section and restart the...
Sage Weil
04:18 AM CephFS Bug #1108 (Closed): Large number of files in a directory makes things grind to a halt
Whilst extracting a copy of our mail directories onto a 10 node cluster(3xmds, 3xmon, 10xosd) I found that there was ... Damien Churchill
08:55 AM Linux kernel client Bug #1109 (Closed): rbd: btrfs crash
... Sage Weil
04:14 AM Revision fe955881 (ceph): crushtool: clean up add-item a bit; don't add item to same bucket twice
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:05 AM Revision dd89ff44 (ceph): crushtool: fix remove-item
Scan all buckets instead of doing a tree traverse.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:30 AM Revision 1c334d1a (ceph): radosgw_admin: update clitest
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
01:58 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Fixed!
Pulled in the latest changes, recompiled, and works like a charm now.
Bernard Grymonpon
01:16 AM Revision ab01d74e (ceph): mkcephfs.in: print out usage if no actions given
If the user didn't specify any actions, print out a usage message rather
than silently exiting.
Signed-off-by: Colin...
Colin Patrick McCabe
12:53 AM Revision f7ea7c98 (ceph): rgw: Fix RGWAccess::init_storage_provider
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost.com> Colin Patrick McCabe
12:06 AM Revision c67dd164 (ceph): mkcephfs: error out on bad usage
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

05/24/2011

11:40 PM Revision 5532f897 (ceph): make: fix build for rgw
Yehuda Sadeh
11:33 PM Revision 203a43bf (ceph): rgw_admin: clean warning
Yehuda Sadeh
10:58 PM Messengers Bug #1107 (Resolved): msgr: old outgoing connection + mark_down leaves stale state on remote peer
Peers A and B:
- A reached epoch 10, in which A and B should exchange heartbeats
- A advances to epoch 20, and ...
Sage Weil
10:30 PM Revision 95c594f6 (ceph): Merge commit 'origin/master' into rgw-multiuser
Yehuda Sadeh
09:29 PM Revision ab278b4b (ceph): rgw_admin: add key create
Yehuda Sadeh
09:19 PM Bug #1095 (Closed): run "rados bench 10 seq -p data" print "error during benchmark: -5"
Hi Jeff-
I think the problem here is just that the read phase is running out of data to read. Let the write phase...
Sage Weil
09:17 PM Revision bd0eb9a3 (ceph): rgw_admin: subuser and key removal
Yehuda Sadeh
08:38 PM Revision 0566de49 (ceph): Let callers specify that some arguments should not be quoted.
This lets you do things such as "test -e /foo && bar" or
"cd /tmp && blah". Remember that shell pipelines do not dete...
Tommi Virtanen
08:29 PM Revision dad0a67a (ceph): Simple unit tests for shell quoting.
Tommi Virtanen
08:27 PM Revision be28e5bf (ceph): Refactor to extract shell quoting into utility function.
Tommi Virtanen
08:16 PM Revision 1a459dd7 (ceph): Depend on Paramiko 1.7.7 or newer to be able to read modern OpenSSH keys.
Tommi Virtanen
08:16 PM Revision 7330c3c4 (ceph): journaler: tolerate ENOENT when prezeroing
ENOENT is okay and expected.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:12 PM Revision 37c94af8 (ceph): Pyflakes cleanup.
Tommi Virtanen
08:07 PM Revision 5d5b1795 (ceph): Add a utility function run.wait to wait for processes to exit.
Tommi Virtanen
08:06 PM Revision 073a4bbc (ceph): Paramiko ChannelFile.close() didn't actually close the remote stdin.
Add a wrapper that does the calls shutdown on the channel itself,
to actually cause EOF. Add integration test using r...
Tommi Virtanen
08:01 PM Revision 6dd4774f (ceph): Log debug info of commands actually executed.
Tommi Virtanen
08:01 PM Revision 9c42fe6b (ceph): Cleanup dead code.
Tommi Virtanen
08:00 PM Revision f10668f5 (ceph): Allow easy writing to stdin of remote processes.
Tommi Virtanen
07:36 PM Revision bb13c92a (ceph): test_common.sh: skip rm before put
The rm before the put is unecessary and actually incorrect now.
Signed-off-by: Colin McCabe <colin.mccabe@dreamhost....
Colin Patrick McCabe
07:34 PM Revision e42736ae (ceph): radostool: rados put should use write_full
If "rados put" uses write instead of write_full, the resulting object on
the server may be a mismash of old and new o...
Colin Patrick McCabe
07:22 PM Revision cfe372ec (ceph): Merge branch 'wip_ceph_context'
Colin Patrick McCabe
07:21 PM Revision 9ff7cc7c (ceph): Create a libcommon service thread
Create a libcommon service thread. Use it to handle SIGHUP.
Handle it by means of a flag that gets set. Using a queu...
Colin Patrick McCabe
05:00 PM Revision 29702685 (ceph): librados: len should be size_t
Unsigned, and size_t because it's a buffer size.
Fixes signedness warning in testrados.
Signed-off-by: Sage Weil <s...
Sage Weil
04:47 PM Revision ce04e3db (ceph): osd: add ability to explicitly mark unfound as lost
Instead of automatically marking unfound objects lost (once we've tried
every location we can think of), do it when t...
Sage Weil
04:42 PM Revision 87309e94 (ceph): osd: make automatically marking of unfound as lost optional
We may not want to do this automatically until we have more confidense in
the recovery code. Even then, possible not...
Sage Weil
04:27 PM Revision cea7b651 (ceph): mds: clean up get_or_create_stray
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:24 PM Revision 081acc4c (ceph): mds: initialize stray_index on startup
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:17 PM Revision 754cfaac (ceph): Merge branch 'stable'
Sage Weil
01:21 PM CephFS Bug #1104: Segmentation fault when deleting a folder
I'll try it first thing tomorrow, no more access to the machines now - everything is always updated completely on all... Bernard Grymonpon
12:55 PM CephFS Bug #1104: Segmentation fault when deleting a folder
cherry-picked commit:7330c3c473aa128b1e3ecb8752278f655bc79620 to stable. i'm a bit surprised you're seeing this on t... Sage Weil
12:50 PM CephFS Bug #1104: Segmentation fault when deleting a folder
There we go:
[Switching to Thread 0x7ffff5574700 (LWP 27162)]
0x00007ffff67c1165 in raise () from /lib/libc.so....
Bernard Grymonpon
12:18 PM CephFS Bug #1104: Segmentation fault when deleting a folder
I'll have to rebuild everything, "r" it is optimized out in my build. This will take a little longer...
#6 0x0000...
Bernard Grymonpon
11:37 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Can you check with gdb to see what the value of 'r' actually is? Sage Weil
11:32 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Tried the stable branch (i'm at ce04e3dbaf2383a521b267585a860f772c4cc786), made debian packages, installed it all, st... Bernard Grymonpon
11:20 AM CephFS Bug #1104 (Resolved): Segmentation fault when deleting a folder
Yay! Thanks for your help testing. We'll do 0.28.2 in a few days. Sage Weil
11:19 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Compiled from last master sources (sorry, forgot switch to stable branch) not have this trouble. Hooray? Maybe it mak... Fyodor Ustinov
10:15 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Attached! You may have problems if your libraries don't match mine. There are also the autobuilt debian packages th... Sage Weil
09:56 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Sage Weil wrote:
> the 'stable' branch has that fix, or you can apply it manually...
Published in your repository...
Fyodor Ustinov
09:28 AM CephFS Bug #1104: Segmentation fault when deleting a folder
the 'stable' branch has that fix, or you can apply it manually... Sage Weil
09:23 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Sage Weil wrote:
> Can you try with this patch applied?
It's 0.28.1 or I should compile master branch?
Fyodor Ustinov
09:01 AM CephFS Bug #1104: Segmentation fault when deleting a folder
Can you try with this patch applied?... Sage Weil
01:40 AM CephFS Bug #1104: Segmentation fault when deleting a folder
I can not attach files to this issue.
http://blog.ufm.su/core.zip - core file
http://blog.ufm.su/mds.zip - log fi...
Fyodor Ustinov
12:50 PM Linux kernel client Feature #962: d_prune
Sage Weil
12:50 PM Linux kernel client Bug #851: make dcache readdir with I_COMPLETE work
Sage Weil
12:50 PM Linux kernel client Bug #850: make NULL lookup using I_COMPLETE work
Sage Weil
11:33 AM Bug #1099: osd: handle recovery of lost objects
For the time being I disabled automatic marking of lost objects. That makes dealing when "recovering" them less of a... Sage Weil
11:31 AM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
This is a kclient bug due to multiple threads entering flush_dirty_caps, which is not reentrant due to commit:e9964c1... Sage Weil
09:53 AM Linux kernel client Bug #1096: LTP fsstress test always hang ,ceph 0.27.1+linux-2.6.38.6
The 30k lines still doesn't have the last client_request arrival. I bumped the limit to 50mb. Can you grab a bigger ... Sage Weil
04:11 AM Revision d66c6ca1 (ceph): v0.28.1
Sage Weil
12:28 AM Revision 9a660ac9 (ceph): librads, libceph: store CephContext
Don't use the global g_ceph_context. Instead, store the CephContext in
the structures provided by the library user.
...
Colin Patrick McCabe
12:28 AM Revision 13aed89e (ceph): Add CephContext
A CephContext represents the context held by a single library user.
There can be multiple CephContexts in the same pr...
Colin Patrick McCabe
12:07 AM Revision 1c7b9821 (ceph): Split common_init_daemonize from common_init_finish
Split off common_init_daemonize from common_init_finish. cfuse is a
daemon that calls common_init_finish, but handles...
Colin Patrick McCabe

05/23/2011

11:52 PM Revision 478c6bbc (ceph): rgw_admin: make interface a bit more explicit
Yehuda Sadeh
10:12 PM Revision c167a28d (ceph): rgw: subuser permissions
Yehuda Sadeh
09:58 PM Revision 6360154d (ceph): mon: verify that crush max does not exceed osd max
- when injecting a new crushmap
- when adjusting osdmap max_osd
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:45 PM RADOS Bug #1106 (Resolved): crush/osd: inconsistent mapping values
This was because crush max_devices was osdmap.max_osd - 1. Need to add some loud warnings and checks for this. Sage Weil
03:24 PM Bug #1098: mds never coming "up:active" awaits in "up:creating"
The MDS isn't coming up because teh OSD requests aren't completing because btrfs is wedged. Which kernel are you usin... Sage Weil
03:16 PM Feature #1105 (Resolved): have multiple access keys per user in rgw
Although the radosgw_admin interface needs a bit of polishing, it's implemented as of commit:c167a28d73b665f7239f8fe7... Yehuda Sadeh

05/22/2011

11:25 PM Revision 5d982803 (ceph): crushtool: add --reweight-item <name> <weight>
Reweight and individual item via crushtool.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
 

Also available in: Atom