Project

General

Profile

Activity

From 10/02/2011 to 10/31/2011

10/31/2011

10:03 PM Revision 9ea02239 (ceph): osd: kill unused on_osd_failure() hook
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:00 PM Revision 1d9e8065 (ceph): RadosModel.h: use default conf location
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:54 PM Revision 810cae1a (ceph): testrados: specify CEPH_CONF directly
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:02 PM Revision b9a0b2b7 (ceph): Revert "PG: call set_last_peering_reset in Started contructor"
Unfortunately, the Started constructor doesn't occur until map
activation. We need to reset last_peering_reset exact...
Samuel Just
06:15 PM Revision f9b7ecdb (ceph): hadoop: Return NULL when the path does not exist.
Although unspecified in the declaration header, other file
systems return a single result when the path is a file.
T...
Noah Watkins
05:53 PM Bug #1633: osd crash in CryptoKey::decrypt
Another occurrence in teuthology:~teuthology/archive/nightly_coverage_2011-10-28/1170/remote/ubuntu@sepia50.ceph.drea... Josh Durgin
05:32 PM CephFS Bug #1666: hadoop: time-related meta-data problems
It looks like the check is equality of timestamps. So, I think Hadoop is setting an explicit timestamp, and sometime ... Noah Watkins
05:30 PM CephFS Bug #1666: hadoop: time-related meta-data problems
All of the local clocks on the nodes look good. The code is comparing timestamps (I assume since epoch), so maybe the... Noah Watkins
05:06 PM CephFS Bug #1666: hadoop: time-related meta-data problems
Neither of these errors are in code that's remotely familiar to me. So my first favorite question is:
Are your clock...
Greg Farnum
04:55 PM CephFS Bug #1666 (Resolved): hadoop: time-related meta-data problems
The following exceptions are being thrown. It looks like something related to lstat?
pre>
java.io.IOException: Th...
Noah Watkins
02:59 PM Bug #1657 (Resolved): teuthology: testrados failed to find conf
Should work now
ceph: 1d9e8065c835c343608930585c2853984cde2fa8
teuthology: 810cae1a1d03138abfa54cd31059723ec0c22ab1
Samuel Just
02:04 PM Bug #1665 (Resolved): osd: last_peering_reset incorrect on stray?
b9a0b2b7a4d3b5a7db1f942af0158712199377a8 reverted 6d123067ce1ba99522281d5c72623bd5ba3e0fc8 Samuel Just
12:09 PM Bug #1665: osd: last_peering_reset incorrect on stray?
this is why. the interval starts at 150,a nd that is when teh query is sent. on the stray, we hit it in 151:... Sage Weil
11:46 AM Bug #1665 (Resolved): osd: last_peering_reset incorrect on stray?
on alexandria,... Sage Weil
01:55 PM Bug #1588 (Can't reproduce): blogbench on kclient possibly made machine die
I think this is fixed - the nightly tests haven't hit it in the past week, since 339573406737461cfb17bebabf7ba536a302... Josh Durgin
11:35 AM CephFS Bug #1661 (Resolved): Hadoop: expected system directories not present
Apparently this was actually the result of an API mismatch. Fixed by Noah's patch in commit:f9b7ecdb5bba1439dc4c13005... Greg Farnum
11:26 AM Feature #1618: libvirt: make sure migration works
Braindump of what I did for the earlier libvirt migration demo:
- on each vm host, install kvm 0.15 (0.14 is too o...
Anonymous
09:13 AM Bug #1415 (Duplicate): cosd assertion: existing->state == STATE_CONNECTING || existing->state ==...
Sage Weil
09:11 AM rgw Feature #1664 (Resolved): rgw: pass swift tests
Sage Weil
09:06 AM Messengers Feature #1648 (Duplicate): msgr: choose ip to bind to based on network
Sage Weil
09:02 AM Messengers Feature #1648: msgr: choose ip to bind to based on network
duplicates #1487 Sage Weil
07:58 AM Bug #1529: cosd: os/FileStore.cc: 2390: FAILED assert(0 == "ENOENT on clone suggests osd bug")
Sage Weil wrote:
> Do you have the odd log preferring the restart?
Er, osd log preceeding ...
Sage Weil
07:54 AM Bug #1529: cosd: os/FileStore.cc: 2390: FAILED assert(0 == "ENOENT on clone suggests osd bug")
Do you have the odd log preferring the restart? Sage Weil
06:46 AM Bug #1529: cosd: os/FileStore.cc: 2390: FAILED assert(0 == "ENOENT on clone suggests osd bug")
I'm still seeing this one. All my 6 OSDs went down and after starting them most of them would crash:... Wido den Hollander

10/30/2011

12:42 AM Revision 5bd029ef (ceph): osdmap: fix g_ceph_context reference
Use cct.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil

10/28/2011

10:48 PM Revision 0fa86182 (ceph): ReplicatedPG: check for peering restart before share_pg_info
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:33 PM Revision 199e04ab (ceph): mkcephfs: build initial osdmap from information in ceph.conf
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:32 PM Revision 3f678931 (ceph): crush: make insert_item take float for weight
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:32 PM Revision 07c9de83 (ceph): osdmaptool: build initial map from ceph.conf
This builds the intial osd and crush maps from what is in the ceph.conf,
taking advantage of host or rack tags that a...
Sage Weil
09:25 PM Revision ef4b95c8 (ceph): ReplicatedPG: Clean up old snap links when recovering a clone
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:25 PM Revision bd3223f9 (ceph): PG: Create new snap directories independently on replica
Previously, we shipped over the collection creation as part
of the transaction. However, the snap directory on the
r...
Samuel Just
09:04 PM Revision b497b385 (ceph): rgw: canonical resource should use unencoded url
Yehuda Sadeh
08:00 PM Revision 5fe8e00a (ceph): Merge pull request #4 from vzctl/master
fix error: 'snprintf' was not declared in this scope Sage Weil
06:49 PM Revision a8450005 (ceph): rgw: cleanup, remove unused user_id
Some access methods required user_id param, but that was never really used. At
this point we should just remove them.
Yehuda Sadeh
06:42 PM Revision 7ee0747c (ceph): mkcephfs: skip non-btrfs osds even with --mkbtrfs
This lets you do a mixed btrfs and non-btrfs file systems.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:39 PM Revision 2bb283ba (ceph): Merge branch 'stable'
Sage Weil
05:38 PM Revision 3a17f023 (ceph): debian: break redundant dependencies
They confuse APT it seems.
ceph-common -> librbd1 -> librados2
radosgw -> ceph-common -> librados2
Signed-off-by:...
Sage Weil
05:05 PM Revision b8979f4d (ceph): MOSDMap: do not leave {oldest,newest}_map uninitialized when decoding o...
This leads to badness like
osd_map(295..296 src has 74308224..0) v1
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:46 PM CephFS Bug #1661: Hadoop: expected system directories not present
Blindly creating directories is definitely not the proper solution. Somebody will need to take the time to figure out... Greg Farnum
03:32 PM CephFS Bug #1661: Hadoop: expected system directories not present
In this particular instance it is a map-reduce specific directory. I suspect that MapReduce is responsible for this, ... Noah Watkins
03:22 PM CephFS Bug #1661: Hadoop: expected system directories not present
Sounds to me like CephFileSystem should just create the directory if it doesn't exist.. Sage Weil
03:13 PM CephFS Bug #1661: Hadoop: expected system directories not present
Good to know. I think at this point I need to paper over many things, but want to record all these issues. I'll just ... Noah Watkins
03:08 PM CephFS Bug #1661: Hadoop: expected system directories not present
I remember running into this issue when developing things and deciding to just paper over it at the time -- I couldn'... Greg Farnum
03:05 PM CephFS Bug #1661: Hadoop: expected system directories not present
Adding: when this directory is created by hand before map reduce starts the error is gone. Noah Watkins
03:04 PM CephFS Bug #1661 (Resolved): Hadoop: expected system directories not present
Hadoop complains that directories within the file system that are expected to be present are not present. Hadoop may ... Noah Watkins
03:24 PM CephFS Bug #1663: Hadoop: file ownership/permission not available in hadoop
Noah Watkins wrote:
> This is a very simple hack that will make hadoop ignore the permission for the time being:
...
Noah Watkins
03:23 PM CephFS Bug #1663: Hadoop: file ownership/permission not available in hadoop
This is a very simple hack that will make hadoop ignore the permission for the time being:
diff --git a/src/mapred...
Noah Watkins
03:16 PM CephFS Bug #1663 (Resolved): Hadoop: file ownership/permission not available in hadoop
Hadoop complains about incorrect file ownership. An 'ls' via Hadoop FS interface reveals no permission information, b... Noah Watkins
03:08 PM rbd Feature #1662 (Resolved): libvirt: obscure qemu/rbd secrets
Sage Weil
02:36 PM Feature #1067 (Resolved): mkcephfs: magically group osds on same host into subtrees in the genera...
commit:199e04aba1bd3d0c5a2a0e13e4500bef9cc206cf Sage Weil
01:46 PM Revision 6353d7b5 (ceph): include stdio in order to fix snprintf compilation error
Signed-off-by: Alexey Lapitsky <lex@realisticgroup.com> Alexey Lapitsky
12:08 PM rgw Bug #1645 (Resolved): rgw bucket suspended broken
Fixed, commit:6752babdfda1be0524d82b84adfa4663aded32f6. Also added a teuthology test. Yehuda Sadeh
09:30 AM rgw Feature #829 (Resolved): rgw: support swift POST
We actually support now swift POST for metadata changes. For ACL changes there's issue #830. Yehuda Sadeh
09:28 AM rgw Bug #1643: radosgw-admin log show should accept --time
The problem is that the logs are indexed by date, and not by time. Filtering by time means that we need to scan the o... Yehuda Sadeh
04:04 AM Revision 46bb82f5 (ceph): client: fix return value for _readdir_cache_cb
Return 0 for end of directory here, too.
Clarify some comments.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:28 AM Revision 943893e8 (ceph): ceph: fix snprintf warning
warning: tools/ceph.cc:146: format not a string literal and no format arguments
Signed-off-by: Sage Weil <sage.weil@...
Sage Weil
01:12 AM Revision 64992113 (ceph): auth: return unknown if no supported auth is found
If NONE is supported, it will already be in the list of supported
protocols, so there's no need to default to it here...
Josh Durgin
01:01 AM Bug #1659 (Can't reproduce): Upgrade from 0.27 -> 0.37 going wrong, OSDs miss map updates
Hi,
Like I mentioned on IRC, I had some problems with upgrading my cluster from 0.27 to 0.37.
It was a big step...
Wido den Hollander
12:24 AM Revision 1a4eec20 (ceph): uclient: fix _getdents and add some documentation.
If readdir_r_cb returns 0, that means SUCCESS, regardless of how
many entries it actually wrote.
If it returns <0, th...
Greg Farnum

10/27/2011

11:15 PM Revision 27ec04e7 (ceph): cfuse: remove unneeded loop.
The only time this was looping previously was completely unnecessary
anyway, as 1 meant the same thing as 0: there ar...
Greg Farnum
11:15 PM Revision e37ab416 (ceph): uclient: align readdirplus_r with readdir_r.
The only user of this code expects to get 1 on a successfully-filled
value, 0 on a successful non-fill, or -errno oth...
Greg Farnum
11:15 PM Revision 55aace73 (ceph): uclient: readdir_r_cb documentation, and it only returns 0 or -errno.
Returning 0 or 1 in different situations that were effectively the
same is useless and confusing.
Signed-off-by: Gre...
Greg Farnum
09:35 PM Revision 354055f8 (ceph): rgw: swift related adjustments
Yehuda Sadeh
09:26 PM Revision 713a4428 (ceph): Merge branch 'master' of github.com:NewDreamNetwork/ceph
Sage Weil
09:04 PM Revision ed839f5a (ceph): fixed graphic reference and headings
Sondra.Menthers
09:00 PM Revision 2c4eb075 (ceph): fixed image reference
Sondra.Menthers
08:54 PM Revision b42443ec (ceph): fixed architecture document
Sondra.Menthers
08:43 PM Revision c57ed06c (ceph): add images for documentation
Sondra.Menthers
07:51 PM Revision 7a022029 (ceph): rgw: handle swift PUT with incorrect etag
Sondra.Menthers
07:44 PM Revision cae7d5a0 (ceph): rgw: handle swift PUT with incorrect etag
Sondra.Menthers
07:44 PM Revision 697bba39 (ceph): rgw: handle swift PUT with incorrect etag
Sondra.Menthers
07:11 PM Revision 10c35087 (ceph): rgw: add user suspend/enable test
Yehuda Sadeh
06:32 PM Revision 86aa940f (ceph): rgw: log-to-stderr is now a binary flag
Yehuda Sadeh
06:20 PM Revision a817a38e (ceph): rgw: handle swift PUT with incorrect etag
Sondra.Menthers
06:16 PM Revision d9dfd147 (ceph): rgw: handle swift PUT with incorrect etag
Sondra.Menthers
06:02 PM Revision 87224c08 (ceph): rgw: handle swift PUT with incorrect etag
Sondra.Menthers
05:02 PM Revision e4dcbd03 (ceph): ceph: refactor for generic --admin-daemon <sock> <cmd> too
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:50 PM Revision 6979eaa0 (ceph): filejournal: journal_replay_from
Force journal replay from a point other than the op_seq recorded by the
fs. This is useful if you want to skip bad e...
Sage Weil
04:50 PM Revision 89dccc0e (ceph): ceph: --dump-perf-counters[-schema] sockpath
Quick and dirty way to dump perfcounters stats. Not documenting this until
we decide this is where it should live.
...
Sage Weil
04:26 PM Revision a9b75f21 (ceph): Merge branch 'stable'
Sage Weil
04:26 PM Revision b3e1e3e1 (ceph): rados: improve error message
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:46 AM CephFS Bug #1549 (Need More Info): mds: zeroed root CDir* vtable in scatter_writebehind_finish
bleh. need logs... i'll start this up in a loop again. Sage Weil
10:33 AM CephFS Bug #1549: mds: zeroed root CDir* vtable in scatter_writebehind_finish
This happened again today after fsstress. From teuthology:~teuthworker/archive/nightly_coverage_2011-10-27/1083/teuth... Josh Durgin
09:26 AM Feature #1658 (Resolved): osd: backfill instead of backlog
Sage Weil
08:59 AM Feature #1646: mon: catch up on committed items before attempting to join quorum
Not sure exactly what you mean, but that sounds a bit like the behavior when the encoding changes and the monitors ar... Sage Weil
03:55 AM Feature #1646: mon: catch up on committed items before attempting to join quorum
Any chance this is related with an issue I noticed last night, in which the primary mon was receiving and displaying ... Alexandre Oliva
04:20 AM Revision 11691a71 (ceph): radosgw-admin: fix key create check
Also fixes warning
warning: rgw/rgw_admin.cc:812: suggest parentheses around ‘&&’ within ‘||’
Signed-off-by: Sage W...
Sage Weil
12:24 AM Revision 921ce53d (ceph): osd: guard checks for writes
fa722de6708d3e92037df6289cc29ece12c8ea66 moved these checks, and
accidentally removed the may_write() guard. This cau...
Josh Durgin
12:20 AM Revision 0c78f0dc (ceph): rgw: handle swift PUT with incorrect etag
Yehuda Sadeh
12:00 AM Revision 213eb13d (ceph): Revert "hadoop: get hadoop bindings to build again" and fix.
It's just wrong. The Java code is still passing a String along
regardless of what you ask the C to do! Fix it by grab...
Greg Farnum

10/26/2011

11:07 PM Revision e8e10158 (ceph): rgw: rgw-admin --skip-zero-entries
Yehuda Sadeh
11:00 PM Revision 180c744b (ceph): perfcounters: fix accessor name
FreakingCamelCaps
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
11:00 PM Revision 1a0a732e (ceph): objecter: instrument with perfcounter
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:34 PM Revision e747456c (ceph): rgw: rgw-admin generate-key/access-key=false fix
Yehuda Sadeh
10:34 PM Revision 9386a7b5 (ceph): rgw: rgw-admin can show log summation
Yehuda Sadeh
09:56 PM Revision 7fbf28a9 (ceph): osd: read_log: only list the collection once
After upgrading we may need to list the collection to recover the hash
value when upgrading an old collection.
Signe...
Sage Weil
09:30 PM Revision 6752babd (ceph): rgw: fix bucket suspension
Yehuda Sadeh
05:46 PM Bug #1654 (Resolved): snaps workunit failed on cfuse
Fixed by 921ce53d6efc3f1bf7056f05467aff5c3104dcc8. Josh Durgin
03:24 PM Bug #1654: snaps workunit failed on cfuse
And the librados selfmanaged snaps tests also failed with an unexpected EINVAL when reading from a snapshot. Josh Durgin
11:39 AM Bug #1654: snaps workunit failed on cfuse
There might have been a bug introduced in snapshot contexts - two rbd tests got EINVAL when setting a snapshot, meani... Josh Durgin
11:35 AM Bug #1654 (Resolved): snaps workunit failed on cfuse
... Josh Durgin
05:31 PM Bug #1657 (Resolved): teuthology: testrados failed to find conf
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-26/1037/teuthology.log:... Josh Durgin
04:11 PM rgw Feature #773: rgw: efficient list-objects filtering
With the new osd-class index, this should be pretty straight forward. Yehuda Sadeh
04:09 PM rgw Feature #1641 (Rejected): radosgw-admn log show --bandwidth-only
commit:9386a7b5e57de4994ff3ad4987ef309cb8275392 added data aggregation, so there's no need to dump the entire log now... Yehuda Sadeh
04:06 PM rgw Feature #1642 (Resolved): radosgw-admin log show --nonzero-only
Fixed, commit:e8e101580ea04628713f51171e9af58aec1acbd2.
rgw-admin accepts --skip-zero-entries now.
Yehuda Sadeh
04:03 PM CephFS Bug #1656: Hadoop client unit test failures
Sounds good to me -- which patches we want to keep in the tree are probably a management decision but I'm happy to pu... Greg Farnum
03:55 PM CephFS Bug #1656: Hadoop client unit test failures
Alright, so I think at this point I'd like to see two patches:
1) A patch against the downloadable tarball (much e...
Noah Watkins
03:49 PM CephFS Bug #1656: Hadoop client unit test failures
I believe the patch was made against the then-current svn 0.21 branch (which is now very dead). I pushed changes to t... Greg Farnum
03:39 PM CephFS Bug #1656: Hadoop client unit test failures
This was hadoop-0.20.205.0 with the latest Ceph master branch.
It looked like the patch in src/client/hadoop was o...
Noah Watkins
03:30 PM CephFS Bug #1656: Hadoop client unit test failures
What versions of the systems were you running when these failed?
I don't remember how they're set up but they migh...
Greg Farnum
01:59 PM CephFS Bug #1656 (Won't Fix): Hadoop client unit test failures
The Ceph Hadoop File System passes nearly all its tests except a few. I've included the test log below that shows the... Noah Watkins
03:38 PM Bug #1555 (Resolved): radosgw_admin --gen-access-key=false and --gen-secret=false flags appear to...
Fixed, commit:e747456c9f6cc8cc0367bb80e757b1b24e098de1. Yehuda Sadeh
01:49 PM Feature #1655 (Resolved): gitbuilder aggregator page
single page that has 1 line per gitbuilder, with instance name and then the top line of the gitbuilder status screen ... Sage Weil
10:13 AM Bug #1590 (Duplicate): occasionally excessive mon memory footprint
Sage Weil
10:12 AM Bug #1590: occasionally excessive mon memory footprint
this will go away with #1646. Sage Weil
10:11 AM Bug #1634 (Can't reproduce): osd: crash decoding non-existent object_info_t
going to see if this comes up again after this last round of osd fixes Sage Weil
09:58 AM Feature #1653 (Resolved): librados: python binding nose tests
Sage Weil
04:34 AM Revision f197e845 (ceph): rgw: fix uninitialized variable warnings
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil

10/25/2011

11:39 PM Revision 952be11a (ceph): hadoop: bring back Java changes.
These convert the Hadoop stuff to work on the branch-0.20 API.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost....
Greg Farnum
11:29 PM Revision 71fd8302 (ceph): Merge branch 'master' of ssh://github.com/NewDreamNetwork/ceph
Conflicts:
src/rgw/rgw_rados.cc
Yehuda Sadeh
11:23 PM Revision d9f73605 (ceph): rgw: fix attr cache
Yehuda Sadeh
10:35 PM Bug #1628 (Resolved): segfault attempting to map an rbd snapshot
Sage Weil
10:33 PM Bug #1099 (Closed): osd: handle recovery of lost objects
this has been reimplemented (at least the revert case). Sage Weil
10:32 PM Cleanup #146 (Rejected): Complete build options for Pthread API
Sage Weil
10:29 PM Feature #641 (Rejected): allow logs to be piped to an external program
works for me. Sage Weil
10:28 PM Bug #250 (Resolved): mon: delete old states to avoid filling disk
Sage Weil
10:28 PM Feature #875 (Resolved): osd: clean up old osdmaps
Sage Weil
10:24 PM Feature #1649 (Resolved): osd: make replay interval a per-pool setting
Sage Weil
10:08 PM Revision 5151a8af (ceph): common/ceph_extattr.[ch] > common/xattr.[ch]
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:54 PM Revision 46f330d0 (ceph): Merge branch 'master' into wip-freebsd
Sage Weil
09:15 PM Revision ef48183a (ceph): fix osdmaptool clitests
Sage Weil
09:02 PM Revision 8ae02dab (ceph): Merge branch 'wip-pools'
Sage Weil
05:52 PM Revision 6287ccf6 (ceph): mon: reencode routed messages
The message encoding may depend on the target features. Clear the
payload so that the Message gets reencoded appropr...
Sage Weil
05:51 PM Revision 72e0ca02 (ceph): MOSDMap: reencode full map embedded in Incremental, as needed
The Incremental may have a bufferlist containing a full map; reencode
that too if we are reencoding for old clients.
...
Sage Weil
05:13 PM Revision cd6d7009 (ceph): Merge remote-tracking branch 'gh/wip-rbd-tool'
Sage Weil
04:53 PM Revision 6ca99060 (ceph): mon: parse 0 values properly
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:53 PM Revision 90f0429f (ceph): mon: fix rare races with pool updates
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:48 PM CephFS Bug #1114 (Need More Info): NFS export extreme slowdown
Need to reproduce this on the current trunk and fully characterize what is going on.
- the the nfs server in sync ...
Sage Weil
04:46 PM Bug #1194 (Resolved): kclient: NFS reexport does not survive ceph fs remount
going to assume the above fixed it until we hear otherwise :) Sage Weil
03:50 PM CephFS Bug #1585 (Can't reproduce): mds crash during shutdown
Sage Weil
03:38 PM Bug #1629 (Can't reproduce): pgs stuck degraded (only mapped to 1 osd)
pre-prior set refactor and current round of thrashing fixes. Sage Weil
03:34 PM Bug #1624 (Resolved): osd crash in HearbeatMap::_check
going to chalk these up to the infinite loop fixed in that previous patch. Sage Weil
03:33 PM Bug #1617 (Rejected): pgs stuck down and peering with only one osd down and out
non-specific, and pre-prior set refactor. Sage Weil
03:31 PM Bug #1311 (Closed): qa: TestSnaps: stuck in active
ancient and presumably covered by current thrashing tests Sage Weil
03:30 PM Bug #1292 (Closed): qa: bench & thrashosd PG won't go clean
this is ancient and presumably covered by the new thrashing tests. Sage Weil
03:29 PM Bug #1609 (Resolved): osd: failed assert(info.last_complete == info.last_update)
lots of stuff, mainly commit:03ad5a28eee2328eb2419c48a14df1a3624fc4c7 Sage Weil
10:31 AM Bug #1526 (Resolved): log bound mismatch after thrashing with bonnie
Sage Weil
05:51 AM Revision 43aa33a2 (ceph): Merge remote branch 'gh/wip-osd-queue'
Sage Weil
05:50 AM Revision 7de2f7a9 (ceph): osd: print useful debug info from choose_acting
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:50 AM Revision c30ab1e2 (ceph): osd: MOSDPGNotify: print prettier
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:50 AM Revision 12b3b2d5 (ceph): osd: fix generate_past_intervals maybe_went_rw on oldest interval
We stop working backwards when we hit last_epoch_clean, which means for the
oldest interval first_epoch may not be th...
Sage Weil
05:50 AM Revision 03ad5a28 (ceph): osd: fix last_complete adjustment after recovering an object
After we recover each object, we try to raise the last_complete value
(and matching complete_to iterator). If our lo...
Sage Weil
05:50 AM Revision e2f3c20b (ceph): osd: make proc_replica_log missing dump include useful information
I needed to see have/need to debug a weird unfound issue turned up by
thrashing.
Signed-off-by: Sage Weil <sage@newd...
Sage Weil
05:21 AM Revision f8e92896 (ceph): osd: fix/simplify op discard checks
Use a helper to determine when we should discard an op due to the client
being disconnected. Use this when the op is...
Sage Weil
05:13 AM Revision fa722de6 (ceph): osd: move queue checks into enqueue_op, kill _handle_ helpers
This simplifies things, and renames the checks to make it clear that we are
doing validation checks only, with no sid...
Sage Weil
04:59 AM Revision 3a2dc656 (ceph): osd: move op cap check into helper
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
04:54 AM Revision b17c9ca5 (ceph): osd: handle missing/degraded in op thread
The _handle_op() method (and friends) are called when an op is initially
queued and when it is requeued. In the requ...
Sage Weil
04:54 AM Revision b1de9131 (ceph): osd: drop ability to disable op queue entirely
This is pretty useless, and broken wrt requeueing anyway.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:54 AM Revision 662414d7 (ceph): osd: drop useless PG hooks
These no longer need to be exposed to the generic OSD code.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:54 AM Revision 7aa0d89b (ceph): osd: set reqid on push/pull ops
Not strictly necessary, but makes logs easier to follow.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
03:42 AM Revision e2766bd8 (ceph): mon: remove compatset cruft
The CompatSet is built on demand; it's no longer static.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil

10/24/2011

11:54 PM Revision 6f1b65c6 (ceph): ReplicatedPG: fix snapshot directory handling in snap_trimmer
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
11:54 PM Revision 024bcc4b (ceph): FileStore: ignore EEXIST on clones and collection creation !btrfs_snap
We need to ignore EEXIST on btrfs also when m_filestore_btrfs_snap is
disabled.
Signed-off-by: Samuel Just <samuel.j...
Samuel Just
11:43 PM Revision 4d884040 (ceph): rgw: fix rgw_obj compare function
Yehuda Sadeh
10:34 PM Revision df2967a6 (ceph): rgw: use a uint64_t instead of a size_t for storing the size
librados uses uint64_t so that 32-bit architectures aren't hobbled.
Signed-off-by: Greg Farnum <gregory.farnum@dream...
Greg Farnum
10:32 PM Revision 4b10cad8 (ceph): rbd: check command before opening the image
Now map/unmap won't use librbd, and commands that don't take --snap
will give an error when it's used.
Signed-off-by...
Josh Durgin
10:32 PM Revision 8c6db18d (ceph): rbd: specify which commands take --snap in usage
Maybe this will be less confusing.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
10:32 PM Revision 46bb4122 (ceph): rbd: let all commands use the pool/image@snapshot format
This way you aren't forced to use '-p' or '--snap' to specify a pool
or snapshot for some commands.
Signed-off-by: J...
Josh Durgin
10:32 PM Revision afa34794 (ceph): librbd: show correct size for snapshots
header.size is the current size of the image.
ImageCtx::get_image_size() already does the right thing for
snapshots.
...
Josh Durgin
10:32 PM Revision f4aa69a8 (ceph): workunit: check that rbd info returns the right size for snapshots
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:32 PM Revision e2296c3a (ceph): clitests: add rbd usage and invalid snap usage tests
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:32 PM Revision 93ccccd7 (ceph): rbd: remove unnecessary condition
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:32 PM Revision bfb5ceb2 (ceph): workunits: add rbd rollback and snapshot removal tests
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:32 PM Revision 315ab94e (ceph): librbd: propagate error from snap_set
Previously rbd_snap_set always returned 0, even when the snapshot did
not exist.
Signed-off-by: Josh Durgin <josh.du...
Josh Durgin
10:32 PM Revision a5a8a9cf (ceph): test_rbd: add a test for rolling back after resizing
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:32 PM Revision ae91911c (ceph): librbd: resize if necessary before rolling back
This is a partial fix for test_rbd.TestImage.test_rollback_with_resize
Signed-off-by: Josh Durgin <josh.durgin@dream...
Josh Durgin
10:32 PM Revision 2af32a41 (ceph): librados: use stored snap context for all operations
Using an empty snap context led to the failure of
test_rbd.TestImage.test_rollback_with_resize, since clones weren't
...
Josh Durgin
10:32 PM Revision b7aa57ff (ceph): rbd.py: update python bindings for new copy interface
It was changed to return 0 on success in d7f7a213546b599d2eec4c6617593d232b43a7d6
Signed-off-by: Josh Durgin <josh.d...
Josh Durgin
10:32 PM Revision e161ce15 (ceph): workunits: test rbd python bindings
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:15 PM Revision 2be3999d (ceph): Add btrfs dimension to thrash tasks
Thrash tasks will now also run with and without btrfs.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
09:30 PM Revision 2ad6545a (ceph): Add testrados based thrashing tasks
readwrite.yaml runs a read/write workload against a set of objects.
snaps.yaml adds snaps and rollback.
Signed-off-b...
Samuel Just
09:25 PM Revision 8d0a7c59 (ceph): testrados: rename testsnaps to testrados and make snap testing optional
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
08:52 PM Revision a1249d07 (ceph): workunit: set PYTHONPATH so we can test python bindings
Josh Durgin
06:46 PM Revision 88905b3a (ceph): test/osd: Add TestReadWrite
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
06:27 PM Revision 5e4e7972 (ceph): mon: allow adjustment of per-pool crash_replay_interval
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:12 PM Revision 40b7b572 (ceph): Merge branch 'rgw-dir-cleanup'
Greg Farnum
05:06 PM Revision f57c33df (ceph): rgw: fix check_disk_state; add a strip_namespace function.
Use copies of the IoCtx rather than references so that
we can set locators without breaking stuff, and make use of th...
Greg Farnum
05:04 PM Revision 0da45ca6 (ceph): rgw: rename translate_raw_obj to translate_raw_obj_to_obj_in_ns
And document it. Because the naming is so bad that neither I nor
the author noticed it wasn't doing what we wanted it...
Greg Farnum
05:04 PM Revision 927c3577 (ceph): rgw: add locators to the directory objects, and functions handling them
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
03:55 PM Linux kernel client Bug #1652 (Resolved): rbd: rollback correctly after resizing
I just fixed this bug in librbd, but it seems the kernel has it too. If you take a snapshot, resize the image, then r... Josh Durgin
11:38 AM rgw Bug #1567 (Resolved): rgw [list|delete]_bucket should clean up
Merged into master in commit:40b7b57239515bd0794ef5da2477a2c5eb7a85e4.
Passed s3tests with a greatly-reduced timeo...
Greg Farnum
10:53 AM Feature #1651 (Resolved): command line tool to interact with admin socket
Maybe something like 'ceph --socket /var/run/ceph/osd.0.asok foo'? Sage Weil
04:07 AM Revision f37b08f8 (ceph): librados: behave if shutdown is called twice
On failure, we shut ourselves down. If the caller calls shutdown again,
don't crash.
Fixes: #1650
Signed-off-by: Sa...
Sage Weil
04:05 AM Revision c15e62aa (ceph): mon: need to print pool id for output to be useful
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:40 AM Revision 8a087729 (ceph): mon: PGMap::dump: fix order in totals
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:01 AM Revision 1b941390 (ceph): osd: make osd dump slightly more concise
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:13 AM Revision 34c2f6a4 (ceph): osd: pg_pool_t: set crash_replay_interval on data pool when decoding old
We want to preserve the crash_replay_interval on old clusters being
upgraded. Kludge this by setting it to 60 (the o...
Sage Weil

10/23/2011

11:26 PM Revision 6779eb39 (ceph): osd: make osd replay interval a per-pool property
Change the config value to only control the interval set when the data
pool is first created (presumably during mkfs)...
Sage Weil
11:26 PM Revision 8bb8e85d (ceph): Merge remote-tracking branch 'gh/master' into n
Conflicts:
src/osd/OSDMap.h
Sage Weil
11:24 PM Revision f2816a1e (ceph): osd: pg_pool_t: normalize encoding
Normalize encoding to be less awkward. Use a FEATURE bit to indicate
whether the new encoding is supported, and enco...
Sage Weil
11:24 PM Revision 7cb4d25d (ceph): osd: pg_pool_t: introduce flags, crash_replay_interval
Introduce a per-pool crash_replay_interval so we can control whether
the OSD waits for replayed ACKed but not COMMITt...
Sage Weil
09:35 PM Bug #1650 (Resolved): “rados df” joins on thread never started with mons down or laggy (regressio...
fixed by commit:f37b08f821a54263847e2c5c095bba5750908f86 Sage Weil
07:56 PM Bug #1650 (Resolved): “rados df” joins on thread never started with mons down or laggy (regressio...
If rados's attempt to connect the mons time out, it prints:
# rados df
couldn't connect to cluster! error -110
c...
Alexandre Oliva
05:30 PM Revision 61cbb321 (ceph): ceph.conf: python parser doens't like ; comments
Sage Weil
05:16 AM Revision 3ed06562 (ceph): ceph.conf: more frequent osd scrubbing; remove old cruft
Sage Weil
03:44 AM Revision 54e28263 (ceph): scratchtool[pp]: fix rados_conf_set/get test of log_to_stderr
Fix this warning
warning: scratchtool.c:142: comparison with string literal results in unspecified behavior
and fli...
Sage Weil
03:41 AM Revision 9323f25a (ceph): osd: fix PG::Log::copy_after wrt backlogs (again)
Commit 68fe748fc2d703623050e8f2a448a0fd31ca8a0f fixed half of this problem,
but set this->tail incorrectly. If we re...
Sage Weil

10/22/2011

10:13 PM Bug #1530: osd crash during build_inc_scrub_map
I'm going to up the scrub frequency in the teuthology conf to help shake out these problems. There was another bug r... Sage Weil
10:07 PM Bug #1616 (Resolved): crash in is_supported_auth
Sage Weil
10:06 PM Bug #1631 (Need More Info): osd: failed assert(repop_queue.front() == repop)
need an osd log on this one Sage Weil
10:05 PM Cleanup #1644 (Resolved): osd: prior_set refactor
Sage Weil
01:01 PM Bug #1471: osd: destroy_collection on non-empty dir
I'm actually hitting the same bug with v0.37
It was time to upgrade my old (and good running!) 0.27 cluster to the...
Wido den Hollander

10/21/2011

11:36 PM Revision 1b846f43 (ceph): radosgw: drop useless/broken set_val daemonize
Not sure what the intent was here anyway... but it is broken (the func
takes a string, not a bool).
Signed-off-by: S...
Sage Weil
11:35 PM Revision 1f7cb757 (ceph): config: separate --log-to-stderr and --err-to-stderr
Instead of having magic values (1 == errors only to stderr, 2 =
everything), have two booleans.
Signed-off-by: Sage ...
Sage Weil
11:14 PM Revision e98cbc43 (ceph): rgw: fix xattrs cache
Yehuda Sadeh
10:24 PM Revision cf6a9404 (ceph): osd: eliminate CRASHED state
This was an intermediate state that indicated that replay would be needed.
It was poorly named, and not very useful. ...
Sage Weil
10:24 PM Revision 03593019 (ceph): mon: fix last_clean_interval calculation
This up_rom == first check is old and wrong. It may have been correct at
the time, when the OSD had a defined shutdo...
Sage Weil
10:24 PM Revision 600bda47 (ceph): osd: fix last_clean interval bounds
It was _first and _last, inclusive, but the epochs are really points in
time, so _last should have been non-inclusive...
Sage Weil
10:24 PM Revision 249ed569 (ceph): osd: move may_need_replay calculation out of PriorSet
Although they both depend on past intervals, they are unrelated. Factor
out the may_need_replay calculation from Pri...
Sage Weil
10:24 PM Revision 30c34ab8 (ceph): osd: trim past intervals when we complete recovery.
We weren't trimming at all, which meant these would just accumulate
indefinitely.
Signed-off-by: Sage Weil <sage@new...
Sage Weil
10:14 PM Revision d6661f93 (ceph): ReplicatedPG: Include pg version in MOSDOpReply on error
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
08:23 PM Revision f8afd8bf (ceph): rgw: reduce rados bucket stats (and getxattrs)
we didn't pass the context, and some other issue with the context map Yehuda Sadeh
05:54 PM Revision b8beff3d (ceph): ceph_manager: count active+clean+<somjething else> as active+clean
In my case, one pg was active+clean+scrubbing.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:32 PM Revision a1756c5e (ceph): rgw: object removal should remove object from index anyway
even if object doesn't exist. Index might have the wrong info. Yehuda Sadeh
04:56 PM Revision dd5087fa (ceph): osd: simplify finalizing scrub on replica
We can simply call osr.flush() (with pg lock held) to ensure that prior
writes are visible and scrubbable. This avoi...
Sage Weil
04:29 PM Feature #1649 (Resolved): osd: make replay interval a per-pool setting
Most pools don't need it. Make it a per-pool thing.
This involves a feature bit and refactor of the pg_pool_t e...
Sage Weil
04:14 PM Revision 29899de5 (ceph): osd: PriorSet: acting/up membership implies still alive
If the osd is in the acting or up sets, we can assume they are still alive,
even though we don't know that for sure, ...
Sage Weil
03:58 PM Revision a1ddec2a (ceph): Merge remote branch 'gh/master' into wip-prior
Conflicts:
src/osd/PG.cc
Sage Weil
03:37 PM Messengers Feature #1648 (Duplicate): msgr: choose ip to bind to based on network
Currently we bind to an explicit address or to any, and learn what address to advertise by looking at our first outgo... Sage Weil
03:34 PM Feature #1647 (Resolved): mon: robust bootstrap
Currently mkfs looks like:
- create initial states on each monitor independently
- start them up and they'll fo...
Sage Weil
03:29 PM Feature #1646 (Resolved): mon: catch up on committed items before attempting to join quorum
This will prevent a mon that is way behind from dragging down the mon cluster when it comes back online. Sage Weil
12:45 PM rgw Bug #1645 (Resolved): rgw bucket suspended broken
code still looks at the pool auid, which is obviously broken Yehuda Sadeh
11:07 AM Bug #213: non-idempotent transactions (clone) under ext3 may not replay correct result
I think the simplest solution would be:
- for all operations, set an xattr with the last op_seq to write to that ...
Sage Weil
10:59 AM Bug #1632 (Need More Info): osd: crash in dequeue_op
need osd logs Sage Weil
10:59 AM CephFS Bug #1640 (Need More Info): mds: failed assert(trim_to > trimming_pos)
need logs with 'debug journaler = 20' and 'debug ms = 1' on the mds for this one Sage Weil
10:58 AM Bug #1624 (Need More Info): osd crash in HearbeatMap::_check
Sage Weil
10:57 AM CephFS Bug #1509 (Need More Info): cfuse sometimes hangs after unmount
Sage Weil
10:56 AM CephFS Bug #1596 (Need More Info): mds crash during ffsb on kernel client in CInode::is_frozen
Sage Weil
10:55 AM Bug #1609 (Need More Info): osd: failed assert(info.last_complete == info.last_update)
Sage Weil
10:55 AM Bug #1598 (Resolved): osd: fix lost objects
merged, along with the teuthology tests Sage Weil
10:52 AM CephFS Bug #1603 (Need More Info): ceph-fuse crash during unmount
have this one going in a loop to catch it with logs Sage Weil
10:51 AM Bug #1530 (Need More Info): osd crash during build_inc_scrub_map
Sage Weil
10:51 AM Bug #1432 (In Progress): libvirt: fix definition for rbd params/sources/etc
Sage Weil
10:51 AM Bug #1508 (Need More Info): iozone stuck on kernel rbd mount
Sage Weil
10:10 AM Cleanup #1644 (Resolved): osd: prior_set refactor
Sage Weil
09:39 AM rgw Bug #1643 (Rejected): radosgw-admin log show should accept --time
Yehuda Sadeh
09:39 AM rgw Feature #1642 (Resolved): radosgw-admin log show --nonzero-only
Have another flag for radosgw-admin low show like --nonzero-only that only prints a log entry if it will have a nonze... Yehuda Sadeh
09:38 AM rgw Feature #1641 (Rejected): radosgw-admn log show --bandwidth-only
Have a flag for radosgw-admin log show like --bandwidth-only that reduces a log line down to {'bytes_sent':<number>, ... Yehuda Sadeh
12:20 AM Revision f94a44e6 (ceph): OSDMonitor: reweight towards average utilization
The existing reweight-by-utilization calculation did not take into
account the current weight of an OSD, and depended...
Josh Durgin

10/20/2011

11:28 PM Revision 409c5717 (ceph): coverage: don't remove ceph tarball
We want to keep it for examining core files, and we're already
fetching it here, once per suite run.
Josh Durgin
10:56 PM Revision 49b6c118 (ceph): osd: PG::PriorSet: make debug_pg arg const
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:51 PM Revision fa66e65c (ceph): osd: PgPriorSet -> PriorSet
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:50 PM Revision 7bc855a8 (ceph): osd: PgPriorSet: rename prior_set_affected -> affected_by_map
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:47 PM Revision 78236e4e (ceph): osd: PgPriorSet: remove obsolete comment
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:46 PM Revision 2a870c14 (ceph): osd: PgPriorSet: move prior_set_affected into PgPriorSet
This is really where it belongs.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:46 PM Revision c2e66fbd (ceph): osd: PgPriorSet: kill whoami; make PG arg strictly optional
It is only used for the debug output prefix. Make it so we can leave it
out entirely (e.g. for unit tests).
We don'...
Sage Weil
09:12 PM Revision 47e938c0 (ceph): Merge branch 'stable'
Sage Weil
09:12 PM Revision 2b3bdea9 (ceph): osd: fix requeue_ops
The ls argument passed to requeue_ops() is a reference, and one of the
methods we call (say, _handle_op) might want t...
Sage Weil
08:59 PM Revision 3b76f9fc (ceph): perfcounters: remove dout
We can't use this because we're part of libglobal and there is no
g_ceph_context. And i'm too lazy to use cct.
Sign...
Sage Weil
08:58 PM Revision 863e5b04 (ceph): perfcounters: fix unit test
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:48 PM Revision 1d002a1e (ceph): Merge remote branch 'gh/wip-unfound'
Sage Weil
08:16 PM Revision 28df1e91 (ceph): filestore: measure commit interval, latency, journal full count
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:45 PM Revision d2dbae97 (ceph): osd: clean up perfcounter names
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:43 PM Revision d31e78f6 (ceph): filestore: simplify perfcounter lifecycle
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:43 PM Revision b000e4d4 (ceph): filestore: simplify, clean up perfcounters
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:34 PM Revision d26488bc (ceph): perfcounters: fix addition/removal
We are not responsible for deleting removed perfcounters.
Add debugging.
Signed-off-by: Sage Weil <sage.weil@dreamh...
Sage Weil
06:33 PM Revision 7207b819 (ceph): filestore: fix perfcounter definition
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:59 PM Revision 53ad579e (ceph): filestore: fix logger start
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:54 PM Revision d3366ccc (ceph): Merge remote-tracking branch 'github/master' into wip-swift-fix
Yehuda Sadeh
05:52 PM Feature #1420 (Resolved): build release rpms
this is done the extent that i am willing to spend time on it. hopefully suse will show up at some point and improve... Sage Weil
05:45 PM Bug #1636 (Resolved): reweight-by-utilization does not choose good weights
The existing reweight-by-utilization code didn't make sense - commit:f94a44e688883f2db0971435a5333a8b60c77dec fixes t... Josh Durgin
04:11 AM Bug #1636 (Resolved): reweight-by-utilization does not choose good weights
there's a problem distributing the data evenly over all devices.
i'm using v0.36 and have a test setup with two host...
pille palle
05:22 PM CephFS Bug #1640 (Resolved): mds: failed assert(trim_to > trimming_pos)
This happened with bonnie++ on cfuse in teuthology:~teuthworker/archive/nightly_coverage_2011-10-20/729/remote/ubuntu... Josh Durgin
05:09 PM RADOS Feature #1639 (New): osd: guard against bad objects in cls map functions
Got this when I accidentally set a bad locator:... Greg Farnum
04:21 PM Revision 288ccc88 (ceph): perfcounters: clean up interface a bit
No logger_ prefix necessary.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:21 PM Revision daea03ef (ceph): perfcounters: use simple names
We don't need to uniquely identify ourselves in the global namespace with
the PerfCounter name.. only in the current ...
Sage Weil
02:15 PM Bug #1624: osd crash in HearbeatMap::_check
commit:2b3bdea9f7bcf9e9f8d4328f62d82ff43e996b3a fixes at least some of these.... Sage Weil
01:45 PM Bug #1624: osd crash in HearbeatMap::_check
running this in a loop with logs to try ot catch it Sage Weil
12:02 PM Bug #1624: osd crash in HearbeatMap::_check
And teuthology:~teuthworker/archive/nightly_coverage_2011-10-20/730/remote/ubuntu@sepia50.ceph.dreamhost.com/log/osd.... Josh Durgin
12:00 PM Bug #1624: osd crash in HearbeatMap::_check
And teuthology:~teuthworker/archive/nightly_coverage_2011-10-20/727/remote/ubuntu@sepia27.ceph.dreamhost.com/log/osd.... Josh Durgin
11:57 AM Bug #1624: osd crash in HearbeatMap::_check
Happened again in teuthology:~teuthworker/archive/nightly_coverage_2011-10-20/726/remote/ubuntu@sepia41.ceph.dreamhos... Josh Durgin
02:01 PM Feature #1630 (Resolved): Monitor journal fullness (bytes used, size) via perfcounters
Sage Weil
01:45 PM Bug #1635 (Duplicate): osd hit suicide timeout in heartbeat_map thread
Sage Weil
01:28 PM Bug #1588: blogbench on kclient possibly made machine die
Happened again today - just more transactions timing out in the logs. Josh Durgin
01:23 PM Bug #1633: osd crash in CryptoKey::decrypt
Happened again while thrashing in teuthology:~teuthworker/archive/nightly_coverage_2011-10-20/732/remote/ubuntu@sepia... Josh Durgin
11:52 AM Bug #1530: osd crash during build_inc_scrub_map
This happened again during cfuse on ffsb (teuthology:~teuthworker/archive/nightly_coverage_2011-10-20/694). Josh Durgin
09:27 AM Bug #1638 (Won't Fix): Can't create object with large xattrs in a single operation (on extN)
A single compound operation that does:
- create
- setxattr (small enough to fit but large enough to fill in the e...
Yehuda Sadeh
04:45 AM Revision f25879ac (ceph): encoding: add optional features
Update encode macros to allow a feature bitmask to be passed through
to a classes encode() method.
Signed-off-by: Sa...
Sage Weil
04:22 AM Feature #1637 (Duplicate): OSDs running full take down other OSDs
this issue has a relation to #1636.
in my test setup of v0.36 when one OSD runs full it gets taken down.
this start...
pille palle
04:14 AM Revision 0aa40ea0 (ceph): assert: no 0x before thread id
There's no 0x prefix in the log lines either. This makes it easier to
copy/paste word and search.
Signed-off-by: Sa...
Sage Weil
03:48 AM Revision 0f0c5947 (ceph): osdmap: uninline big stuff
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:46 AM Revision a71455c8 (ceph): rgw: properly handle cleaning up of listings
If a listing you get back from the OSD consists only of
non-existent entries, you still need to handle it and resume ...
Greg Farnum
12:46 AM Revision 470742d8 (ceph): cls_rgw: move stat update code after error checks in complete_op
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
12:46 AM Revision 6dc0da4c (ceph): cls_rgw: implement a dir_suggest_changes function.
This takes a bufferlist of suggested changes to the directory, trims
out any sufficiently old tags, and then applies ...
Greg Farnum
12:46 AM Revision 952ebbae (ceph): cls_rgw: add constructors to data structs; don't leak tags on races
We were leaking tags on races before, since we cut out of the function
before clearing the tag. We don't do that any ...
Greg Farnum
12:45 AM Revision 9496732d (ceph): rgw: write and use the check_disk_state function
This is used to check the actual on-disk state, and encode
suggested updates for the index.
Then cls_bucket_list send...
Greg Farnum

10/19/2011

11:47 PM Revision a5ada568 (ceph): rgw: fix bad snprintf
Yehuda Sadeh
10:35 PM Revision 5de847f3 (ceph): .gitignore: add test_filestore_idempotent
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:35 PM Revision b57e8967 (ceph): test_filestore_idempotent: initialize var
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
09:58 PM Revision 7f5f1ec1 (ceph): rgw: implement swift metadata POST
Yehuda Sadeh
04:14 PM Revision cf333152 (ceph): Merge branch 'stable'
Conflicts:
src/mon/OSDMonitor.cc
src/osd/OSD.cc
Sage Weil
03:15 PM Bug #1526: log bound mismatch after thrashing with bonnie
another occurance, running on swab. This may have led to pg version reset.... Yehuda Sadeh
03:02 PM Bug #1635 (Duplicate): osd hit suicide timeout in heartbeat_map thread
This was while thrashing with radosbench, during peering, with osds 3 and 6 marked out.
From teuthology:~teuthworker...
Josh Durgin
02:46 PM Bug #1634 (Can't reproduce): osd: crash decoding non-existent object_info_t
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-19/680/remote/ubuntu@sepia28.ceph.dreamhost.com/log/osd... Josh Durgin
12:05 PM Bug #1633 (Resolved): osd crash in CryptoKey::decrypt
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-19/682/remote/ubuntu@sepia72.ceph.dreamhost.com/log/osd... Josh Durgin
11:59 AM Bug #1632 (Can't reproduce): osd: crash in dequeue_op
During ffsb:... Josh Durgin
11:26 AM Bug #1631 (Can't reproduce): osd: failed assert(repop_queue.front() == repop)
This happened on two osds during a multiple_rsync workunit (teuthology:~teuthworker/archive/nightly_coverage_2011-10-... Josh Durgin
10:51 AM Feature #1630 (Resolved): Monitor journal fullness (bytes used, size) via perfcounters
Anonymous
05:33 AM Revision b297d1ed (ceph): osdmap: make encoding based on features
Instead of relying on the caller to decide whether encode_old_client()
is appropriate, pass in the feature set and en...
Sage Weil
05:26 AM Revision 6e2018ce (ceph): osd: normalize encoding of pg_pool_t
Instead of using a cumbersom C struct, move members into pg_pool_t and
use normal encode/decode methods.
Signed-off-...
Sage Weil
05:26 AM Revision cee1b27f (ceph): crush: clean up encoder/decoder
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:25 AM Revision 954d3f9b (ceph): use WRITE_CLASS_ENCOER macro when possible
Sage Weil
05:25 AM Revision 9d93bfce (ceph): encoding: WRITE_CLASS_ENCODER_MEMBER -> WRITE_CLASS_MEMBER_ENCODER
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

10/18/2011

11:52 PM Revision 83914d22 (ceph): test_filestore_idempotent: simple tool to generate a worklaod of non-id...
Generate a workload of operations that are non-idempotent. These are:
transaction {
clone A -> A.($n-1)
writ...
Sage Weil
11:52 PM Revision 3e92aace (ceph): filestore: tolerate EEXIST on mkcoll when not-btrfs
For non-btrfs file systems we should tolerate EEXIST because we may
replay the event more than once.
Signed-off-by: ...
Sage Weil
11:41 PM Revision ba165fec (ceph): rgw: some swift obj metadata related fixes
Yehuda Sadeh
11:18 PM Revision 13b0bbb3 (ceph): mds: handle xattrs on inode creation
Allow mknod, mkdir, symlink, create to provide xattrs for the new
inode. This will be used by the kclient to set ACL...
Sage Weil
11:04 PM Revision 7ea07832 (ceph): radosgw-admin: fix conflict with KeyType in libnss
rgw/rgw_admin.cc:459:6: error: using typedef-name 'KeyType' after 'enum'
/usr/include/nss3/keythi.h:69:3: error: 'Key...
Sage Weil
08:28 PM Revision ed5e4341 (ceph): rgw: add content-type to index dirent
Yehuda Sadeh
06:42 PM Revision da6cdfdd (ceph): osd: PgPriorSet: cur -> probe
Rename cur to probe, the set of OSDs we need to probe in order to
successfully peer.
Signed-off-by: Sage Weil <sage@...
Sage Weil
06:40 PM Revision 4e5242e0 (ceph): osd: PgPriorSet: restructure lost checks for prior set
When we add down osds to the cur set, we block peering because there
are OSDs that may have data we need and they are...
Sage Weil
06:01 PM Revision 298dbbe6 (ceph): rgw: workqueue suicide timeout is infinity
Yehuda Sadeh
04:49 PM Bug #1629 (Can't reproduce): pgs stuck degraded (only mapped to 1 osd)
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-18/636/teuthology.log:... Josh Durgin
04:42 PM Bug #1624: osd crash in HearbeatMap::_check
argh, the tarball is already gone:
# wget http://ceph.newdream.net/gitbuilder/output/sha1/e6dbd7141bd8b4403f3b931f...
Sage Weil
12:00 PM Bug #1624 (Resolved): osd crash in HearbeatMap::_check
Logs with debugging are in vit:~joshd/thrash_stuck_active4. This happened on osds 0 and 4:... Josh Durgin
03:35 PM Bug #1628: segfault attempting to map an rbd snapshot
This is a bug in the rbd command line tool - it accepts snapname but doesn't use it for map/unmap. Additionally, it d... Josh Durgin
03:18 PM Bug #1628 (Resolved): segfault attempting to map an rbd snapshot
... John Leach
03:08 PM Bug #1626: ceph-mon HA not working right; all must be up
Sorry to dribble this in: it seems with one mon down and voted out, "ceph -s" takes <1sec 66% of the time, ~3sec 33% ... Anonymous
03:07 PM Bug #1626: ceph-mon HA not working right; all must be up
Oh sorry, what I see with vstart is a 10-second timeout until the mons vote mon.c out. This is *not* what Carl report... Anonymous
03:05 PM Bug #1626: ceph-mon HA not working right; all must be up
Carl saw it originally. Easy to repro with vstart:... Anonymous
02:48 PM Bug #1626: ceph-mon HA not working right; all must be up
where did you see this? Sage Weil
02:28 PM Bug #1626 (Can't reproduce): ceph-mon HA not working right; all must be up
If mon.gamma is down, "ceph -s" hangs trying to connect to all three ceph-mon. The paxos majority rule system does no... Anonymous
02:52 PM Bug #213: non-idempotent transactions (clone) under ext3 may not replay correct result
FWIW even if we know what not to replay, we could still be screwed with ext4 (which does not commit everything in ord... Sage Weil
02:25 PM Bug #213: non-idempotent transactions (clone) under ext3 may not replay correct result
Tommi Virtanen wrote:
> Isn't the idempotency in that case "clone foo_head -> foo_2 IFF foo_2 does not exist" ?
T...
Sage Weil
02:06 PM Bug #213: non-idempotent transactions (clone) under ext3 may not replay correct result
Isn't the idempotency in that case "clone foo_head -> foo_2 IFF foo_2 does not exist" ? Anonymous
02:37 PM Bug #1627 (Can't reproduce): ceph-mon memleak if ceph-osd cluster ip is not reachable, but public...
... Anonymous
02:25 PM Feature #1625 (Rejected): changing ceph-mon ip address needs monmap change on every mon machine
Moving mon.{alpha,beta,gamma} to new IP addresses was a fairly convoluted process. This would be nice if it was simpl... Anonymous
02:15 PM Feature #641: allow logs to be piped to an external program
This is feature creep. If you want to process the logs asap in another process, just have it get ceph stdout as stdin... Anonymous
11:19 AM Bug #1620 (Resolved): rgw suicide due to heartbeat timeout
Fixed, commit:298dbbe64f8b0738ec58db43782813d0686717c7. Basically a 0 value for the rgw suicide timeout should do the... Yehuda Sadeh
11:01 AM Bug #1588: blogbench on kclient possibly made machine die
This happened again yesterday and today with different machines. Both times, the only unusual thing in kern.log was t... Josh Durgin
01:59 AM Revision 0a027599 (ceph): osd: PgPriorSet: simplify (and change) CRASHED logic
Any single OSD from a given interval surviving is sufficient to ensure
that an ACKed write during that interval was c...
Sage Weil
01:57 AM Revision f7ef94d3 (ceph): osd: PgPriorSet: update comment terms a bit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:51 AM Revision bbb06d34 (ceph): osd: do not short-cut up_thru update for new PGs
Commit e731885d2550ee985bf875ab5bb5faf28f1693eb made it possible for
a new PG to go active without forcing the OSDs u...
Sage Weil
12:44 AM Revision 57e0ab74 (ceph): osd: PgPriorSet: clean up per-interval var names
We don't actually use any_lost_now, but it makes the logic easier
to understand to have it there.
Signed-off-by: Sag...
Sage Weil
12:44 AM Revision 53381364 (ceph): osd: PgPriorSet: clean up comments a bit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:44 AM Revision 33b33f7e (ceph): osd: PgPriorSet: remove unused PG member
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
12:44 AM Revision 113b7833 (ceph): osd: PgPriorSet: revert start_since_joining check
Commit 5b78f5db8c200edcc949033e1badae70fecd2e08 added a check to
prevent some sort of badness when osds were marked l...
Sage Weil
12:44 AM Revision 3dda4465 (ceph): osd: PgPriorSet: remove up_thru crap
This was added way back in 1cf9bebc8e5063f5f311d33e7735bcc9286e98ce,
but as far as I can tell it didn't make any sens...
Sage Weil
12:44 AM Revision f89f4d9b (ceph): osd: PgPriorSet: do not include UP osds in prior.cur
The up osds are not (directly) relevant since they are not necessarily
members of the PG. We only care about acting ...
Sage Weil
12:09 AM Revision 9dfa1105 (ceph): rgw: fix swift account and containers listing limits
Yehuda Sadeh

10/17/2011

11:48 PM Revision c5638b70 (ceph): osd: PgPriorSet: any_survived -> any_is_alive_now
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:13 PM Revision e6dbd714 (ceph): doc: Change diagram to have radosgw closer to direct rados access.
Signed-off-by: Tommi Virtanen <tommi.virtanen@dreamhost.com> Tommi Virtanen
10:35 PM Revision 3c90c0d4 (ceph): add singleton lost-unfound
Sage Weil
10:32 PM Revision 4ec37b23 (ceph): add lost_unfound task
Also some misc useful bits to ceph_manager. Sage Weil
10:21 PM Revision edcd4d97 (ceph): rgw: some more swift fixes
Yehuda Sadeh
09:45 PM Revision 83cf3fef (ceph): Expect 'wrongly marked me down' messages during thrashing
Josh Durgin
09:42 PM Revision bcded7f1 (ceph): ceph: add whitelist for cluster log errors
Some messages are expected when thrashing osds or creating unfound
objects.
Fixes: #1622
Josh Durgin
09:13 PM Revision 0bad37e3 (ceph): streamtest: do mkfs
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:12 PM Revision 525a610f (ceph): streamtest: print to stdout
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:49 PM Revision 9c956049 (ceph): mkcephfs: copy ceph.conf to /etc/ceph/ceph.conf (when -a)
You can disable this with --no-copy-conf.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
05:40 PM Revision fba220ec (ceph): nuke: reset syslog configuration after rebooting
Previously we removed a file and rebooted without syncing, so the file
was never deleted.
Josh Durgin
04:56 PM Bug #1623 (Can't reproduce): ceph-osd fails to bind socket
... Yehuda Sadeh
03:51 PM Revision 9baf5ef4 (ceph): ceph.spec: don't chkconfig
This was fighting with suse insserv. Still needs some cleanup.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
03:50 PM Revision 21d941e8 (ceph): ceph.spec: work around build.opensuse.org
The redhat-rpm-config isn't installed on build.opensuse.org, which means
the processor is set to i386 instead of some...
Sage Weil
03:49 PM Revision 195a484b (ceph): ceph.spec: capitalize first letter to make rpmlint happy
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:35 PM Revision a6f3bbb7 (ceph): v0.37
Sage Weil
03:27 PM Bug #1473 (Resolved): osd assert failure: FAILED assert(0 == "oi disagrees with stat, or error c...
Samuel Just
03:26 PM Bug #1473: osd assert failure: FAILED assert(0 == "oi disagrees with stat, or error code on stat")
At least the recent instances of this were probably caused by the btrfs xattr bug (#1612). Samuel Just
03:25 PM Bug #1486 (Resolved): osd: 0-length meta/pginfo_* files
Sage Weil
10:10 AM Bug #1486: osd: 0-length meta/pginfo_* files
I saw this on alexandria, and it was caused by:
1- EMFILE (too many open files)
2- filestore wasn't assering on...
Sage Weil
03:20 PM Bug #1612 (Resolved): osd/PG.cc: 3839: FAILED assert(missing[oid].need <= v)
This was caused by a btrfs xattr bug. I got a patch back from josef and pushed it to the dho kernel. Samuel Just
03:05 PM Feature #1622 (Resolved): teuthology: whitelist ceph.log entries
Implemented in teuthology and whitelisted 'wrongly marked me down' messages for thrashing jobs in the suite. Josh Durgin
10:08 AM Feature #1622 (Resolved): teuthology: whitelist ceph.log entries
Need to be able to do this to make certain tests pass. notably thrashing and the new lost_unfound. Sage Weil
03:37 AM Revision ca8f6036 (ceph): osd: fix assemble_backlog
This was written assuming that le->prior_version wouldn't be the version
that we have locally on disk. Not always tr...
Sage Weil
03:37 AM Revision 2fdec7b8 (ceph): osd: fix add_next_event Missing::item::have
The missing set should be accurate up to the current point in the log. The
log_tail has no bearing on that, nor does...
Sage Weil

10/15/2011

05:56 AM Revision c1cabf56 (ceph): ceph: don't crash when sending message to !up osd
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:03 AM Revision 615689a9 (ceph): osd: implement lost_revert
Roll back to the last available version of an object. If there is no
available version, delete it.
Leave the door o...
Sage Weil
04:03 AM Revision 3a046774 (ceph): osd: pull old version to revert to
If we are the primary, and are doing a LOST_REVERT, pull the old version
of the object and update the version when we...
Sage Weil
04:02 AM Revision 03cd1088 (ceph): osd: adjust LOST log entry types; simplify log entry type strings
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:02 AM Revision ad39d814 (ceph): osd: all_unfound_are_queried_or_lost
The check to make isn't whether all locations are lost, but whether all
locations are either lost or have been querie...
Sage Weil
04:02 AM Revision 81f36c2d (ceph): osd: remove superfluous write_info calls
- merge_log() will write_info (and log) as needed
- Activate() will do the same
Signed-off-by: Sage Weil <sage@newdr...
Sage Weil
04:02 AM Revision c3fa0783 (ceph): messages/MOSDPG*: clean up output a bit
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:02 AM Revision a3be6651 (ceph): osd: fix share_pg_log()
We need to handle a log message in the ReplicaActive state. And set the
epoch properly when we send it.
Signed-off-...
Sage Weil
04:02 AM Revision 22684f25 (ceph): osd: pass version explicitly to pull
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:02 AM Revision efe5abfc (ceph): osd: make C_OSD_CommittedPushedObject::op optional
This lets us reuse this helper for commiting recovery ops that aren't a
result of a push.
Signed-off-by: Sage Weil <...
Sage Weil
04:02 AM Revision 51158820 (ceph): osd: factor out recover_primary_got() helper
This handles the missing set and lsat_complete adjustment when we recover
an object on the primary.
Signed-off-by: S...
Sage Weil
04:02 AM Revision 43bd49d8 (ceph): osd: fix up PG::Missing methods a bit
Pass in iterators when possible. Stack methods instead of duplicating
functionality.
Signed-off-by: Sage Weil <sage...
Sage Weil
04:02 AM Revision 7c05c1fe (ceph): osd: simplify share_pg_log
Use Log::copy_after(). Drop the useless argument. Strip out the broken
LOST logic.
Signed-off-by: Sage Weil <sage@...
Sage Weil
04:02 AM Revision a8760e50 (ceph): osd: fix up mark_all_unfound_lost so that it actually works
Well, it works given our weak definition of LOST.
- use ObjectContexts properly
- move into ReplicatedPG
- no need f...
Sage Weil
03:43 AM Revision 35dab57f (ceph): msg: add MCommand, MCommandReply message types
These are similar to MMonCommand[Ack], but aren't PaxosServiceMessage
children, don't include the command in the repl...
Sage Weil
03:43 AM Revision beaca74d (ceph): msg: entity_name_t::parse()
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:43 AM Revision a37d6d03 (ceph): cephtool: ability to send commands directly to osds
This makes commands beginning with 'tell <target>' magic in that they go
to the given target instead of to the monito...
Sage Weil
03:43 AM Revision 7f687fca (ceph): osd: handle (and reply to) direct MCommands
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:43 AM Revision f868e382 (ceph): osd: remove some pg stats debug cruft
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:43 AM Revision b35d96d5 (ceph): mon: feed MPGStats tids back through the MPGStatsAck
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:43 AM Revision 1cbcc953 (ceph): osd: process commands in a workqueue
This lets us do commands that can potentially block. For example:
- flush pg stats to osd
- request (and wait for...
Sage Weil
03:43 AM Revision 84a6f6e7 (ceph): osd: implement 'flush_pg_stats' command
This flushes the current pg stats to the monitor, and blocks until the
monitor commits it.
Signed-off-by: Sage Weil ...
Sage Weil
03:21 AM Revision 502fbba5 (ceph): paxos: trim extra state dirs
OSDMonitor, for instance, stores both an "osdmap" and "osdmap_full" for
each state. Trim them both.
Signed-off-by: ...
Sage Weil
03:20 AM Revision 6d123067 (ceph): PG: call set_last_peering_reset in Started contructor
Calling it here should cover all possible replica and primary peering
resets.
Signed-off-by: Samuel Just <samuel.jus...
Samuel Just

10/14/2011

11:49 PM Revision b5c60623 (ceph): filestore: assert on any unexpected error
Right now, the only errors we expect out of the underlying filesystem are
-ENOENT, -ENODATA, or (as a workaround for ...
Sage Weil
08:31 PM Revision ba41e6c7 (ceph): osd: send full map if we don't have sufficiently old incremental
If the peer has a really old map, send a full map instead of crashing
because we are missing the needed incremental.
...
Sage Weil
08:30 PM Revision 607043ed (ceph): osd: send full map if we don't have sufficiently old incremental
If the peer has a really old map, send a full map instead of crashing
because we are missing the needed incremental.
...
Sage Weil
08:30 PM Revision 0cc7da2f (ceph): osd: share oldest_map info with peers
This helps OSDs trim their old maps even when they don't get MOSDMap
messages directly from the monitor.
It also fee...
Sage Weil
08:30 PM Revision 818cf8c8 (ceph): mon: make number of old paxos states configurable
Currently settable on osdmaps, pgmaps, and log. Still need MDSMap and
authmap trimming.
Signed-off-by: Sage Weil <s...
Sage Weil
08:30 PM Revision 474e368d (ceph): osd: remove old osdmaps
When the monitor removes old maps, we should too.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:30 PM Revision 3acb9197 (ceph): paxos: trim extra state dirs
OSDMonitor, for instance, stores both an "osdmap" and "osdmap_full" for
each state. Trim them both.
Signed-off-by: ...
Sage Weil
08:20 PM Revision dd3282c1 (ceph): rgw: multiple swift fixes and cleanups
Yehuda Sadeh
08:03 PM Revision b3c68a51 (ceph): PG: call set_last_peering_reset in Started contructor
Calling it here should cover all possible replica and primary peering
resets.
Signed-off-by: Samuel Just <samuel.jus...
Samuel Just
07:57 PM Revision ef30e69c (ceph): PG: Fix log.empty confusion
Previously, log.empty meant that the log.head was everion_t(). However,
it was in a few places used to mean that log...
Samuel Just
05:17 PM Revision fccd28df (ceph): PG: Fix log.empty confusion
Previously, log.empty meant that the log.head was everion_t(). However,
it was in a few places used to mean that log...
Samuel Just
01:35 PM Feature #1604 (Resolved): kclient: handle osdmap discontinuity
Sage Weil
01:00 PM Bug #1449 (Resolved): osd: FAILED assert(0 == "we got a bad state machine event")
b3c68a514135318e0dfda9f929f15f26340cd664 Samuel Just
12:42 PM Bug #1620 (Resolved): rgw suicide due to heartbeat timeout
Happens around a hour after osd went down:... Yehuda Sadeh
10:14 AM Bug #1607 (Resolved): osd: failed assert(missing.is_missing(oe.soid))
a50fbe2b982e5d19040f4ae5795455dde3a9a02e Samuel Just
10:13 AM Bug #1599 (Resolved): osd assert fail (new_tail >= ondisklog.tail)
fccd28df371dceffaf6ff7a50422b6a5b1ee126c should take care of it. Samuel Just
03:03 AM Revision f658cb4a (ceph): makefile changes for interval tree
Added unit test case for interval tree to the makefile template.
Signed-off-by: Jojy George Varghese <jvarghese@scal...
Jojy George Varghese
03:02 AM Revision d516f9b5 (ceph): mds: Unit tests for interval tree
Provides usage scenarios and test cases for interval tree
implementation.
Tests include:
- testing addInterval inte...
Jojy George Varghese
03:02 AM Revision 72d50fa5 (ceph): mds: Interval tree implementation
Interval tree is an optimized data structure for representing and
querying intervals. Elementary intervals are repres...
Jojy George Varghese

10/13/2011

11:02 PM Revision 87f8389e (ceph): rgw: more swift fixes and adjustments
Yehuda Sadeh
08:28 PM Revision b6d9ed94 (ceph): auth: remove global instance of auth_supported
Wrap it in a class.
Instantiate locally, or keep a copy around if we'll need it often.
Factor out the protocol sele...
Sage Weil
04:53 PM Revision 1f3b12e0 (ceph): osd: bound generate_past_intervals() by oldest map
The oldest osdmap we maintain is a lower bound on last_epoch_clean for the
entire system (assuming the monitor is doi...
Sage Weil
04:35 PM Revision 0167e824 (ceph): cls_rgw: rewrite rgw_bucket_complete_op to use update.
Unfortunately we can't do multiple writes via the interface -- the
second one will clobber the first one. So use the ...
Greg Farnum
04:35 PM Revision 45ebaf70 (ceph): cls_rgw: remove the write_bucket_dir function.
It's no longer called anywhere. Hurray, we don't do our own
read-modify-write cycle any more (and can exploit the pow...
Greg Farnum
04:33 PM Revision 75f7e546 (ceph): cls_rgw: refactor rgw_bucket_complete_op in terms of TMAP
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:33 PM Revision 2592e41a (ceph): cls_rgw: refactor rgw_bucket_prepare_op in terms of tmap
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:33 PM Revision 83504c42 (ceph): cls_rgw: refactor rgw_bucket_init_index in terms of tmap
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:33 PM Revision 15a3df84 (ceph): cls_rgw: refactor read_bucket_dir in terms of tmap.
This function won't be called often once refactoring is done, but
its functionality will be needed for listing, if no...
Greg Farnum
04:32 PM Revision 583e16d9 (ceph): objclass: add map interfaces.
Right now, they implement the TMAP functions, plus a few obvious
extras to read/write select keys and the header. In ...
Greg Farnum
04:29 PM Feature #1619 (Resolved): libvirt: test with selinux/apparmour enabled
There are probably checks that assume the image is a file. Josh Durgin
04:29 PM Revision c98e1c57 (ceph): ReplicatedPG: remove unused tmap implementation.
If it's surrounded by an if(0), it shouldn't still be in the code.
Signed-off-by: Greg Farnum <gregory.farnum@dreamh...
Greg Farnum
04:28 PM Feature #1618 (Resolved): libvirt: make sure migration works
I think there's a small patch needed since it assumes the image is a file. Josh Durgin
04:18 PM Bug #1617: pgs stuck down and peering with only one osd down and out
Happened in run 494 as well. These were both rados bench with thrashing. Josh Durgin
03:42 PM Bug #1617 (Won't Fix): pgs stuck down and peering with only one osd down and out
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-13/491/teuthology.log:... Josh Durgin
01:35 PM Bug #1616: crash in is_supported_auth
hopefully this is resolved by commit:b6d9ed9412cb046747bb0d0713c286613757bfcf
i confess i don't see why exactly th...
Sage Weil
12:52 PM Bug #1616: crash in is_supported_auth
This happened again in run 493. Josh Durgin
11:54 AM Bug #1616 (Resolved): crash in is_supported_auth
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-13/490/remote/ubuntu@sepia29.ceph.dreamhost.com/log/osd... Josh Durgin
12:41 PM rgw Bug #1584 (Resolved): rgw: swift key management is busted
We can now hold multiple swift keys, and multiple S3 keys. There's one swift key per subuser, and we can specify key ... Yehuda Sadeh
11:15 AM rgw Bug #1568 (Rejected): rgw: add object_locator to bucket index
The only locator we use is on shadow and temporary objects, and these are located by their associated actual object. ... Greg Farnum
10:35 AM Linux kernel client Bug #1615 (Can't reproduce): null pointer dereference in ceph_msg_new
This happened during a blogbench run:... Josh Durgin
10:02 AM Bug #1599: osd assert fail (new_tail >= ondisklog.tail)
Finally reproduced this with debugging - logs and pg and osd dump will be in vit:~joshd/thrash_stuck_active3 in a bit... Josh Durgin
09:58 AM rgw Bug #1570 (Resolved): rgw: use tmap for bucket index objects
Pushed to master in commit:45ebaf705d1e37f6b0af84f27767c141496c2f1e
Passes S3 tests.
Greg Farnum
09:58 AM Feature #1569 (Resolved): osd: create a tmap class api
Pushed to master in commit:583e16d9591391c834cd17154571926bffc05abc Greg Farnum

10/12/2011

11:26 PM Revision 42c8ae77 (ceph): test_librbd: expect copy to succeed
0 is the success return code. These were accidentally changed in the
conversion to gtest.
Signed-off-by: Josh Durgin...
Josh Durgin
11:26 PM Revision d0d265bf (ceph): librbd: return errors when read_iterate fails during copy
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
10:44 PM Revision a50fbe2b (ceph): PG: merge_old_entry: merged delete might not be in missing
If the new log does not contain an entry for that oid, it might not yet
be in missing, and we would need to add it.
...
Samuel Just
10:37 PM Revision 493596a7 (ceph): radosgw-admin: test swift keys creation/removal
Yehuda Sadeh
09:46 PM Revision 42bbea89 (ceph): rgw: swift key removal
Yehuda Sadeh
06:14 PM Revision 05dae94f (ceph): Revert "config: base default libdir, sysconfdir off autoconf values"
This reverts commit 7e5dee907a8218647a88d1c7d3316cc277e1c44b. Sage Weil
06:09 PM Revision 1216eb2d (ceph): rgw: some swift api fixes
Yehuda Sadeh
04:34 PM Revision 7e5dee90 (ceph): config: base default libdir, sysconfdir off autoconf values
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
03:42 PM Bug #1449 (In Progress): osd: FAILED assert(0 == "we got a bad state machine event")
This happened again, with a log being received in GetInfo. This was during radosbench and fast thrashing.
From teuth...
Josh Durgin
12:54 PM Bug #1613 (Resolved): mon crash
Greg Farnum
11:44 AM Bug #1613: mon crash
Excellent. I got it running again using second monitor's data on first monitor.
Yes I am running kernel client on...
Hong Cho
11:11 AM Bug #1613: mon crash
The mon data dir is specified in your ceph.conf.
This backtrace though makes it look like you're running the kerne...
Greg Farnum
11:02 AM Bug #1613: mon crash
Unfortunately I didn't get a chance to record the OOPS. I'll try to get them next time. In the syslog I found this ... Hong Cho
09:25 AM Bug #1613: mon crash
Did you record the OOPS somewhere? It looks as though the monitor is pulling bad data off disk. You should be able to... Greg Farnum
11:24 AM Bug #1594 (Resolved): pgs stuck degraded or active after 3 hours
The bug in the second reproduced case was fixed by commit:af6a9f30696c900a2a8bd7ae24e8ed15fb4964bb. Josh Durgin
09:36 AM Bug #1614 (Resolved): default rados class location needs to be depend on autoconf libdir
Sage Weil
09:20 AM Bug #1614 (Duplicate): default rados class location needs to be depend on autoconf libdir
it's /usr/lib64/... on many platforms. Sage Weil
09:14 AM CephFS Bug #1435: mds: loss of layout policies upon mds restart
Greg Farnum wrote:
> I would assume this is just the IFILE lock state thing you talked about earlier?
>
> There w...
Sage Weil
09:09 AM CephFS Bug #1435: mds: loss of layout policies upon mds restart
I would assume this is just the IFILE lock state thing you talked about earlier?
There were a few other bugs that ...
Greg Farnum
09:05 AM CephFS Bug #1435 (In Progress): mds: loss of layout policies upon mds restart
Can you do a bit of legwork and help us get a process to reproduce this? Once we have that it's easy to fix.
Prob...
Sage Weil
07:54 AM CephFS Bug #1435: mds: loss of layout policies upon mds restart
I lied :-(
I had been running with a single mds for a while, and even though it restarted a number of times, it di...
Alexandre Oliva

10/11/2011

11:43 PM Bug #1613 (Resolved): mon crash
Hi,
I am currently running ceph 0.36 w/ kernel 3.0.0 on two machines each with 4 2tb HDDs. All of those HDDs are ...
Hong Cho
10:06 PM Revision e07b956b (ceph): rgw: implement some missing swift api, other swift fixes
Yehuda Sadeh
07:31 PM Revision f883e638 (ceph): rgw: multiple swift keys per user
Yehuda Sadeh
07:31 PM Revision 5790a363 (ceph): rgw: encoded swift key contains full user name
Yehuda Sadeh
06:39 PM Revision 9b18e55e (ceph): PG,OSD: clean up xlist::items on pg removal
Should take care of #1599.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
06:16 PM Revision f7ec9f79 (ceph): thread: whitespace
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:16 PM Revision b43b8645 (ceph): osd: fix race between op requeueing and _dispatch
If a message is working it's way through _dispatch, and another thread
requeues waiting messages under pg->lock (e.g....
Sage Weil
06:08 PM Revision fc3aac6b (ceph): filestore: clean up error output
No color codes in the logs!
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:08 PM Revision cd2dedd7 (ceph): filestore: tolerate missing current/
We remove it anyway. If it's missing entirely, just continue and roll
back to the latest snap_ when the user passes ...
Sage Weil
05:31 PM rgw Bug #1570 (In Progress): rgw: use tmap for bucket index objects
Pushed a first attempt to wip-tmap. Haven't tested it yet, but it does compile (and shouldn't change any behavior, wh... Greg Farnum
05:30 PM Feature #1569 (In Progress): osd: create a tmap class api
Pushed an implementation to wip-tmap. Haven't tested it yet. Greg Farnum
02:01 PM rgw Feature #1591: rgw: instrument with perfcounter
see wip-rgw-perfcounter branch Sage Weil
11:25 AM rgw Feature #1591 (In Progress): rgw: instrument with perfcounter
Sage Weil
01:41 PM Bug #1608 (Resolved): osd crash in get_authorize_handler
I'm going to cross my fingers and call this resolved, unless/until it comes up again. Sage Weil
01:41 PM Bug #1490 (Resolved): cfuse assert failure: assert(ob->last_commit_tid < tid)
Sage Weil
01:12 PM Bug #1547: client log doesn't go to stderr unless 'log file' specified
The interface is just confusing. We'll use two flags (--log-to-stderr, --err-to-stderr) instead. Yehuda Sadeh
12:40 PM Bug #1558 (Resolved): osd crash when deleting pg
Should be fixed in 9b18e55e86a18de6ff91a42984ba9ba35d3bdd4f Samuel Just
11:32 AM Bug #1473: osd assert failure: FAILED assert(0 == "oi disagrees with stat, or error code on stat")
Just saw this on osd.11 on alexandria (see #1612):... Sage Weil
08:41 AM Bug #1612: osd/PG.cc: 3839: FAILED assert(missing[oid].need <= v)
Oh.. when i marked that node out, this came up:... Sage Weil
02:11 AM Revision 032dd84c (ceph): ceph.spec.in: more updates to make build.opensuse.org happy
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:11 AM Revision 468c7dce (ceph): rgw: find fcgi headers in /usr/include or /usr/include/fastcgi
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:11 AM Revision 3f619ddb (ceph): ceph.spec.in: fix radosgw files
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
02:11 AM Revision 465f1bf1 (ceph): ceph.spec.in: add init-radosgw back
Accidentally took this out a few patches back.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil

10/10/2011

06:51 PM Revision 52bad62d (ceph): rgw: push log list|show|rm into RGWStore
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:51 PM Revision 93a88150 (ceph): radosgw-admin: push log listing into RGWRados
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:51 PM Revision 69e9f51d (ceph): rgw: chdir to dir in conf (default /)
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:51 PM Revision 930d57f8 (ceph): rgw: push intent log processing into RGWRados layer
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
06:51 PM Revision d7f7a213 (ceph): librbd: don't return >32bit value from copy().
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:04 PM Revision 9ec60b4d (ceph): librbd: fix overflow on large large image copy
Fixes: #1611
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:03 PM Revision b6c46158 (ceph): librbd: slightly cleaner
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:55 PM Revision 42ac18cd (ceph): .gitignore: test_libhadoopcephfs build
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
05:55 PM Revision 968b0909 (ceph): Merge branch 'master' of github.com:NewDreamNetwork/ceph
Greg Farnum
05:04 PM Bug #1612 (Resolved): osd/PG.cc: 3839: FAILED assert(missing[oid].need <= v)
osd.11 on cephstore6357, full log is available! Sage Weil
04:50 PM Revision 7060efa9 (ceph): librbd: fix copy progress
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:44 PM Bug #1490: cfuse assert failure: assert(ob->last_commit_tid < tid)
Reproduced. This time ordering is violated by a race between scrub completion doing take_waiters(waiting_for_active)... Sage Weil
11:06 AM Bug #1611 (Resolved): rbd cp broken?
Sage Weil
09:36 AM Bug #1611 (Resolved): rbd cp broken?
(09:34:39 AM) psomas: btw, i haven't searched if this is a known issue, but i tried "rbd cp image1 image2", and i get... Sage Weil
10:17 AM Bug #1580 (Resolved): watch expiration
Sage Weil

10/09/2011

11:28 PM Revision 61855170 (ceph): hadoop: get the right class member, and ask for it properly.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
08:45 PM rgw Feature #1375 (Rejected): Actually find pool limits
Sage Weil
08:45 PM rgw Feature #1268 (Rejected): rgw: optimistically fetch acl and first chunk
Sage Weil
08:42 PM Bug #1032 (Can't reproduce): osd: Marked down and become zombies after killing
Sage Weil

10/08/2011

09:07 PM Feature #1565 (Resolved): crowbar demo
Sage Weil
11:15 AM CephFS Bug #1435 (Resolved): mds: loss of layout policies upon mds restart
Great news, thanks. We haven't tested it. Sage Weil
01:14 AM CephFS Bug #1435: mds: loss of layout policies upon mds restart
Wasn't this fixed in 0.35? I haven't lost directory layout information any more. Alexandre Oliva
01:08 AM Bug #1590: occasionally excessive mon memory footprint
I've just run into this while only two out of the 3 mons were up: mon.0 was taking several minutes to complete a sync... Alexandre Oliva

10/07/2011

11:47 PM Revision dc40b374 (ceph): auth: move AuthAuthorizeHandler registry into class
Static classes with constructors and destructors are dangerous. Explicitly
manage these as part of the server compon...
Sage Weil
11:07 PM Revision 11a1d601 (ceph): OSD,ReplicatedPG: expire and cleanup unconnected watchers
During handle_notify_timeout or ms_handle_reset, watchers are now marked
unconnected via pg->register_unconnected_wat...
Samuel Just
09:51 PM Revision 321381d7 (ceph): teuthology-worker: remove --keep-locked-on-error
Josh Durgin
09:49 PM Revision 3d3eb0ef (ceph): Remove --keep-locked-on-error, and behave as if it were specified
This will help prevent machines with cephtest dirs still present from
being used. It's easy to unlock machines - the ...
Josh Durgin
09:37 PM Revision 9d846d33 (ceph): move ceph_mount call
Greg Farnum
09:29 PM Revision 039035b3 (ceph): osd: discard requests that from disconnected clients
If we can't reply, throw out the request; they'll need to resend it anyway.
Signed-off-by: Sage Weil <sage@newdream....
Sage Weil
09:29 PM Revision 88de6ab5 (ceph): ceph.spec.in: handle docdir properly
Pass correct path to configure (fixes SuSE builds).
Use %doc command to install sample.ceph.conf and sample.fetch_co...
Sage Weil
09:13 PM Revision 0121bcd2 (ceph): clitests: add test case for crush exhaustive search
This detects the bug that caused #1594.
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com>
Josh Durgin
08:13 PM Revision e5db7b24 (ceph): Makefile: only reset docdir if it's not already defined
This is only for old versions of automake, and won't work for ancient
SuSe.
Signed-off-by: Sage Weil <sage@newdream....
Sage Weil
08:03 PM Revision 2f877f82 (ceph): crushtool: fix --weight arg parsing
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:02 PM Revision 8ff1cbb5 (ceph): crushtool: add --force N
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:54 PM Revision af6a9f30 (ceph): crush: try _all_ bucket items when doing exhaustive search
N-1 isn't exhaustive.
Fixes: #1594
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
06:10 PM Revision 5a236942 (ceph): ceph.spec.in: build on suse
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:05 PM Bug #1609 (Resolved): osd: failed assert(info.last_complete == info.last_update)
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-07/246/remote/ubuntu@sepia67.ceph.dreamhost.com/log/osd... Josh Durgin
04:53 PM Revision 6e29c282 (ceph): mon: tolerate 50ms (instead of 10ms) of clock drift
NTP on sepia can't manage 10ms, it seems. Sigh.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
04:49 PM Bug #1608: osd crash in get_authorize_handler
if this was caused by global static class lameness, it should be fixed by commit:dc40b37403298a60cb5823c030fa94518b0c... Sage Weil
04:22 PM Bug #1608 (Resolved): osd crash in get_authorize_handler
Possibly a use-after-free. From teuthology:~teuthworker/archive/nightly_coverage_2011-10-07/251/remote/ubuntu@sepia86... Josh Durgin
04:32 PM Revision ff31d3cc (ceph): rgw: fix printf format warning
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:45 PM Bug #1607 (Resolved): osd: failed assert(missing.is_missing(oe.soid))
While thrashing with bonnie++, this ocurred:... Josh Durgin
11:29 AM Bug #1594: pgs stuck degraded or active after 3 hours
Reproduced with 2.1p3 stuck in active since the up and acting sets were different. In this case 3 osds were marked ou... Josh Durgin
10:56 AM Bug #1454 (Can't reproduce): cosd process dies and remains in defunct state
Sage Weil
10:54 AM CephFS Bug #1437 (Can't reproduce): cfuse can't change permissions of a file
Sage Weil
10:34 AM Bug #1581 (Resolved): teuthology: restarting osds sometimes allows daemon-helper to fail
Josh Durgin
10:20 AM Bug #1403 (Resolved): osd: FAILED assert(0 == "we got a bad state machine event")
Sage Weil
10:20 AM Bug #1449 (Resolved): osd: FAILED assert(0 == "we got a bad state machine event")
Sage Weil
09:38 AM Feature #1606 (Rejected): osd: bound missing set memory usage
Sage Weil
09:38 AM Feature #1583: osd: bound pg log memory usage
I don't think this is a new problem, but it is a problem! Sage Weil
09:36 AM RADOS Feature #1605 (New): osd: rewrite missing object if op is replayed
If an MOSDOp is received with a reqid that matches a missing object, and we have the prior version (or no prior versi... Sage Weil
09:33 AM Bug #1602 (Duplicate): mon crash during startup
#1415
Sage Weil
09:05 AM rgw Bug #1592 (Resolved): rgw: remove preallocated pools
This got merged into master and passed s3tests. Greg Farnum
12:39 AM Revision 8be9450b (ceph): rgw: make log object naming configurable
Use date(1) codes for object name, plus %i and %n for bucket id/name, and
make UTC vs localtime configurable.
Signed...
Sage Weil
12:18 AM Revision c56ab974 (ceph): reconnect: ignore SSHExceptions before the timeout expires
Fixes: #1587 Josh Durgin

10/06/2011

11:17 PM Revision 4fe9cca5 (ceph): objecter: cope with missing osdmaps in history
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:17 PM Revision 6e04f60e (ceph): mon: fix osdmap trimming unsigned overflow
This was trimming way to aggressively for early epochs due to the unsigned
version overflowing.
Signed-off-by: Sage ...
Sage Weil
11:17 PM Revision 64935d4c (ceph): auth: fix authorizer leak fix
We were never reusing the authorizer, so make it local. This fixes a crash
(double-free) on reset().
Signed-off-by:...
Sage Weil
11:17 PM Revision 3d2f89fa (ceph): mon: make other send_incremental variant handle map discontinuity
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
10:59 PM Revision 85bbc2f9 (ceph): .gitignore: add multi_stress_watch
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
10:45 PM Revision ad48ada8 (ceph): pg: rename warm_restart and last_warm_restart
These names make more sense, since last_warm_restart was updated
outside of the warm_restart function.
Signed-off-by...
Josh Durgin
10:45 PM Revision b344d797 (ceph): osd, pg: ignore responses to obsolete queries
This adds a query_epoch to notify and log messages, which are
sent in response to queries from the primary during pee...
Josh Durgin
09:51 PM Revision 5840ae46 (ceph): test/multi_stress_watch.cc: add watch notify tester for teuthology
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:51 PM Revision 6bfae03a (ceph): ReplicatedPG: assert *_FLUSHING unreachable for AccessMode
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:34 PM Revision 4722d468 (ceph): task/watch_notify_stress: watch_notify_stress now thrashes clients
This should exercise the watch notify timeout code.
Signed-off-by: Samuel Just <samuel.just@dreamhost.com>
Samuel Just
08:56 PM Revision 088d0dfa (ceph): qa: rados-api: try harder to make these pool names unique
Still getting -17 sometimes on sepia... :/
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
08:39 PM Revision 646ef6aa (ceph): xlist: more assertive
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:24 PM Revision d78b8c76 (ceph): xattr: use sys/types.h instead of linux/types.h
This way it compiles.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:24 PM Revision 960deb49 (ceph): Makefile: include ceph_extattr.h to dist tarball
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:24 PM Revision b1ea37fc (ceph): Merge remote branch 'stass/wip-freebsdport-rebased2' into wip-freebsd
Sage Weil
08:12 PM Revision 07c88609 (ceph): Revert "osd: simplify the session ref-counting branches"
This reverts commit f002144ef7d314e80d3d19a5cd2c4cef38851056. Sage Weil
07:58 PM Revision fce17617 (ceph): osd: boot with map of oldest-1
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:50 PM Revision 4e61e483 (ceph): rgw: keep radosgw in foreground
It defaults to a daemon now. Sage Weil
07:35 PM Revision d2721464 (ceph): radosgw-admin: UTC for time in 'log show', local time in separate field
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:35 PM Revision 77cfbfbb (ceph): radosgw-admin: for date filtering for 'log list'
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:35 PM Revision e8dd1f8b (ceph): utime: add gmtime() output function
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
05:32 PM Bug #1587 (Resolved): teuthology: handle SSHException when reconnecting
Implemented in c56ab97442bbf61ad84526580bf773b324a05066 of teuthology.git. Josh Durgin
05:30 PM Revision 6e0a3df8 (ceph): Merge branch 'wip-leaks'
Greg Farnum
04:58 PM Revision d9262014 (ceph): cephx: don't leak Authorizers on each request
It's not clear to me why this is a class member -- it's only
written to or read from in this function, which allocate...
Greg Farnum
04:44 PM Revision 5d7e4bd5 (ceph): osd: put session ref in ms_handle_reset
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:44 PM Revision f002144e (ceph): osd: simplify the session ref-counting branches
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:40 PM Revision d78286e1 (ceph): rgw: fix chdir result warning
warning: rgw/rgw_main.cc:265: ignoring return value of ‘int chdir(const char*)’, declared with attribute warn_unused_... Sage Weil
04:33 PM Feature #1604 (Resolved): kclient: handle osdmap discontinuity
Sage Weil
04:20 PM Bug #1449: osd: FAILED assert(0 == "we got a bad state machine event")
This should be fixed with commit:b344d7979b8e0aac931c9f337f367e400a932e36 Josh Durgin
03:52 PM Revision ca3662c8 (ceph): rgw: fix rgw_switch_url_previx printf arg
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
03:50 PM Revision 02be9938 (ceph): osd: same_acting_since -> same_interval_since
Interval is (currently) defined as same acting and up sets.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
01:26 PM CephFS Bug #1603 (Can't reproduce): ceph-fuse crash during unmount
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-06/165/teuthology.log:... Josh Durgin
01:24 PM Bug #1602 (Duplicate): mon crash during startup
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-06/164/remote/ubuntu@sepia51.ceph.dreamhost.com/log/mon... Josh Durgin
10:11 AM Bug #1534 (Can't reproduce): ceph tool failed assert(mon_addr.count(n))
Sage Weil
10:10 AM CephFS Bug #1601: mds crash during snaps workunit
this looks like heap corruption to me. we should run that teuthology task with valgrind on the mds. Sage Weil
10:08 AM CephFS Bug #1601 (Can't reproduce): mds crash during snaps workunit
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-06/163/teuthology.log:... Josh Durgin
04:41 AM Revision 5ab7f8fa (ceph): osd: fix same_acting_since update
same_acting_since actually means "same acting AND up set since". This is
what the other same_acting_since update doe...
Sage Weil
04:30 AM Revision 5235b970 (ceph): rgw: add conf rgw_remote_addr_param instead of using fcgi environment
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:30 AM Revision 24e05e32 (ceph): rgw: use conf for rgw_dns_name instead of fcgi environment
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:30 AM Revision 85cf0a8d (ceph): rgw: use conf for rgw_print_continue instead of fcgi environment
Also remove unused expect100cont bool.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
04:30 AM Revision 8f62c3bb (ceph): rgw: use conf for rgw_swift_url[_prefix] instead of fcgi env
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil

10/05/2011

10:16 PM Revision 1e151664 (ceph): vstart: use normal name for .asok files
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
10:16 PM Revision 9d53b3d3 (ceph): osd: fix pg stats encoding for unfound
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
08:48 PM Revision ed878715 (ceph): rgw: fix initscript default user
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:37 PM Revision 4ab2a852 (ceph): rgw: daemonize by default
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:07 PM Revision 5af012a8 (ceph): rgw: set locator key before calling get_obj_state()
get_obj_state assumes the io_ctx is ready to go. Set the locator key
earlier rather than later.
Signed-off-by: Sage...
Sage Weil
08:05 PM Revision c4b51e30 (ceph): objecter: normalize oloc.key
If oloc.key == oid, clear it. Users can set it to the oid or leave it
blank; they are equivalent.
Signed-off-by: Sa...
Sage Weil
07:02 PM Revision a3316f44 (ceph): cli: fix test contents again.
I don't know why this didn't come across properly from my wip branch...
Signed-off-by: Greg Farnum <gregory.farnum@d...
Greg Farnum
06:13 PM Revision b38cba2d (ceph): paxos: update first_committed
Otherwise this stays at 0, which isn't accurate.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
06:13 PM Revision e4a69004 (ceph): osd: fix map epoch check in handle_osd_map
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
05:47 PM Bug #1599: osd assert fail (new_tail >= ondisklog.tail)
Happened in 151 as well. Josh Durgin
01:51 PM Bug #1599: osd assert fail (new_tail >= ondisklog.tail)
I'm thrashing with osd debugging. This should turn up (as well as other bugs we've seen while thrashing). Josh Durgin
01:42 PM Bug #1599: osd assert fail (new_tail >= ondisklog.tail)
can we reproduce with osd logs on? Sage Weil
01:38 PM Bug #1599: osd assert fail (new_tail >= ondisklog.tail)
This happened again in teuthology:~teuthworker/archive/nightly_coverage_2011-10-05/150/remote/ubuntu@sepia5.ceph.drea... Josh Durgin
11:13 AM Bug #1599 (Resolved): osd assert fail (new_tail >= ondisklog.tail)
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-04/102/remote/ubuntu@sepia75.ceph.dreamhost.com/log/osd... Josh Durgin
05:45 PM Revision a76f8d42 (ceph): rgw: fix mis-merge issues.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
04:54 PM Revision 20e0d36a (ceph): Merge branch 'wip-rgw-pools'
Conflicts:
src/rgw/rgw_bucket.cc
src/rgw/rgw_main.cc
Greg Farnum
04:50 PM Revision b5c27aec (ceph): monclient: add an initialized bool to guard shutdown.
The addition of a Finisher in 9c56070bc20878e87fcb4715b0a3559dd1aaf9ff
broke shutdown in the case where MonClient::in...
Greg Farnum
04:25 PM Revision 13d48030 (ceph): rgw: remove rgw_create_bucket.
Push all its extra functionality down into RGWRados::create_bucket. Convert
callers to the different interface (there...
Greg Farnum
04:25 PM Revision 5f960f82 (ceph): rgw: remove select_bucket_placement from RGWAccess interface.
RGWRados::create_bucket is the only user now, so make it private
and make the interface a little tighter.
(We are go...
Greg Farnum
04:25 PM Revision 12472785 (ceph): rgw: move rgw_store_bucket_info into RGWRados.
create_bucket is the only user, so just make it a simple helper.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhos...
Greg Farnum
04:25 PM Revision f723f0a5 (ceph): rgw: remove the rgw_get_bucket_info functions
Push them down into the RGWAccess layers.
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
04:25 PM Revision 1c2ff9d6 (ceph): rgw: remove the rgw_buckets files
We don't need them any more. Hurrah!
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
04:25 PM Revision 98f15e64 (ceph): rgw: new command to add pools to the list of available places for storage.
Do this by adding an add_bucket_placement function to RGWAccess. Hurray
leaky abstractions! :(
Signed-off-by: Greg F...
Greg Farnum
02:26 PM Bug #1449: osd: FAILED assert(0 == "we got a bad state machine event")
Replicated this with osd debugging enabled by thrashing with teuthology. Logs are in vit:~joshd/thrash_bad_state_mach... Josh Durgin
12:47 PM Linux kernel client Bug #1600 (Can't reproduce): general protection fault during blogbench
Logs in teuthology:~teuthworker/archive/nightly_coverage_2011-10-05/129
From kern.log on sepia29:...
Josh Durgin
10:09 AM Bug #1530: osd crash during build_inc_scrub_map
A third recurrence, this time with ffsb on rbd. Logs in teuthology:~teuthworker/archive/nightly_coverage_2011-10-05/142/ Josh Durgin
03:36 AM Revision 3bb6ad9d (ceph): Merge branch 'stable'
Sage Weil
03:35 AM Revision a788bfdb (ceph): osd: fix PG::up calculation on startup
We weren't filling in pg->up, which meant we booked an incorrect past
interval in PG::warm_restart(), leading to unkn...
Sage Weil
03:30 AM Revision 36aaa774 (ceph): mon: fix maps sent after osd boot
This was a bad hunk that crept in while I was rebasing wip-oldmaps. :(
Signed-off-by: Sage Weil <sage.weil@dreamhos...
Sage Weil
02:03 AM Revision d4340bac (ceph): On FreeBSD, libedit should be linked in explicitly, because this library
is present in the system, but it is not in the pkg-config database.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org>
Stanislav Sedov
12:34 AM Revision 9fde4d94 (ceph): First cut of FreeBSD support. This patch allowes ceph to compile succe...
under FreeBSD.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org>
Stanislav Sedov
12:34 AM Revision 2b9ba0bd (ceph): Use system-independent definitions for Ceph's setxattr flags
parameter. Ceph passes Linux setxattr flags on the wire without
converting them, so use the same values for these de...
Stanislav Sedov
12:34 AM Revision 8ffbaffb (ceph): osd: Minimize amount of #ifdefs in src/os/FileStore.cc by allowing
btrfs_ioctl.h to compile and relying on the btrfs runtime checks
wherever possible.
Signed-off-by: Stanislav Se...
Stanislav Sedov
12:34 AM Revision c4f4ea96 (ceph): osd: use numeric values for OSD errnos, as ERESTART and ESHUTDOWN values
are system-dependent. In future, we'll probably want to use system
independent values as Ceph result codes, not...
Stanislav Sedov
12:34 AM Revision 8140bdb0 (ceph): Abstract out extended attribute calls in ceph_extattr.c so all
the system-dependent extattr code can be moved into a single
place. Implement FreeBSD and Darwin support.
Signed-of...
Stanislav Sedov
12:34 AM Revision 71cc72f9 (ceph): osd: prevent a potential huge memory allocation in case a error
code is returned from getxattr.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org>
Stanislav Sedov
12:34 AM Revision 3efa1603 (ceph): Fix endianess detection on FreeBSD.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org> Stanislav Sedov
12:34 AM Revision 5d75d69d (ceph): Change extattr wrapper for FreeBSD to return ERANGE error code in case
the buffer provided was too small similar to what Linux does.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org>
Stanislav Sedov
12:34 AM Revision 8f19bb0e (ceph): SimpleMessenger: getnameinfo needs the correct address size to be passe...
not just a size of the whole sockaddr_storage placeholder.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org>
Stanislav Sedov
12:34 AM Revision 07b9bc03 (ceph): SimpleMessenger: add new method addr_size() to return the correct size ...
address object. Also, do not assume ss_family is 2 bytes
on all platforms.
Signed-off-by: Stanisla...
Stanislav Sedov
12:34 AM Revision 3e0122c9 (ceph): Change PAGE_MASK/PAGE_SIZE definitions to CEPH_PAGE_MASK/CEPH_PAGE_SIZE
so they won't interfere with system definitions.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org>
Stanislav Sedov
12:34 AM Revision cb45fbcb (ceph): SimpleMessenger: pass correct address sizes to bind(2) and connect(2),
otherwise they fail on FreeBSD (and possibly on other
platforms).
Signed-off-by: Stanislav Sedov <s...
Stanislav Sedov
12:34 AM Revision 0cd3e562 (ceph): Use /bin/sh, which is more portable.
Signed-off-by: Stanislav Sedov <stas@FreeBSD.org> Stanislav Sedov
12:21 AM Revision 107db6a9 (ceph): Retry listing machines if the lock server goes down.
Josh Durgin

10/04/2011

11:09 PM Revision 39a1e760 (ceph): rgw: use normal logging mechanism
Keep capturing stdout/err, even though it should end up empty.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
11:08 PM Revision 7cbb68d8 (ceph): Merge branch 'wip-rgw-log'
Sage Weil
11:07 PM Revision 64d2b97d (ceph): osd: remove some dead code
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:05 PM Revision 167c091e (ceph): rgw: write debug acl output to the debug output (not cout)
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:02 PM Revision 3b645fb9 (ceph): common: CINIT_FLAG_NO_BANNER
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
11:02 PM Revision 4e9969e7 (ceph): rgw: more sane defaults
Notably, this gives us log_to_stderr = 1.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:40 PM Revision 09b23b42 (ceph): rgw: rgw_log -> debug_rgw
Normalize with the rest of the code base.
Also, 'rgw log' might be confused with 'log file'.
Signed-off-by: Sage We...
Sage Weil
10:36 PM Revision 37c88295 (ceph): rgw: remove useless RGWConf::log_level
Not used for anything.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
10:36 PM Revision 14338615 (ceph): rgw: set dout condvar to rgw_log
We would normally set DOUT_SUBSYS and get debug_$foo, but the setting in
this case is rgw_log.
Signed-off-by: Sage W...
Sage Weil
09:59 PM Revision 27cf8547 (ceph): rgw: RGW_LOG -> dout
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:43 PM Bug #1598 (Resolved): osd: fix lost objects
- track them properly
- consider rolling back to prior version vs strict EIO
- if a lost object had no prior_versio...
Sage Weil
09:40 PM Bug #1594: pgs stuck degraded or active after 3 hours
Found one unrelated but, commit:a788bfdb93548751cec7184b65d42702cc207508.
I see one other possible badness:
- op...
Sage Weil
04:39 PM Bug #1594: pgs stuck degraded or active after 3 hours
I reproduced this with debugging enabled. Logs are in vit:~joshd/thrash_stuck_active.
In this case there was 1 stuck...
Josh Durgin
09:08 PM Revision f24216a4 (ceph): mon: make MPoolOp*::print() readable
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
09:08 PM Revision a7abb068 (ceph): rados-api/test: don't srand(time())
This will give duplicate results if called more than once per second, which
leads to things like -EEXIST (#1558).
Si...
Sage Weil
08:46 PM Revision b8f5a35a (ceph): Merge branch 'wip-oldmaps'
Sage Weil
08:46 PM Revision 80842721 (ceph): rgw: set xmlns on ListBucketResult element
Certain S3 clients (e.g., s3fs) require the xmlns to parse the
results correctly.
Signed-off-by: Henry C Chang <henr...
Henry Chang
08:46 PM Revision b2361dc7 (ceph): radosgw: run as 'user' field in ceph.conf
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:44 PM Revision 71792e83 (ceph): objecter: fix locking on is_latest_map checks
We need to take the lock protecting these structures in the completion
context.
Signed-off-by: Sage Weil <sage@newdr...
Sage Weil
08:44 PM Revision e661bf86 (ceph): osd: use pointers for handle_advance_map
So that lastmap can be optional.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:44 PM Revision b8ada495 (ceph): osdmap: include oldest, newest known epoch in message
If non-zero, these are filled in with the oldest and newest map epochs
available. This helps the client decide wheth...
Sage Weil
08:44 PM Revision a47aedd4 (ceph): mon: limit maps send on onetime osdmap subscribe
This throttles the load put on the monitor by making the client request
osdmaps in discrete chunks. Use a feature bi...
Sage Weil
08:44 PM Revision cd93f18f (ceph): osdmap: show old addrs for down osds
This is useful for admins to find which ip/node to connect to.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:44 PM Revision ddbb2ca4 (ceph): osdmap: show weight for out osds
This info is reduncant (weight is 0 for out osds), but makes the output
cleaner.
Signed-off-by: Sage Weil <sage@newd...
Sage Weil
08:44 PM Revision 6ff57932 (ceph): osdmap: include addrs in json dump
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:44 PM Revision 4f591b55 (ceph): osd: get latest osdmaps before booting
- get the latest osdmaps before adding/marking ourselves up
- behave if there is a discontinuity in the osdmap histor...
Sage Weil
08:44 PM Revision ea087e39 (ceph): mon: implicitly mark lost_at when a fresh osd rejoins the cluster
If an osd is wiped an rejoins the cluster, record that by implicitly
setting lost_at.
Signed-off-by: Sage Weil <sage...
Sage Weil
08:44 PM Revision 906e1dad (ceph): osd: add last_epoch_clean to pg_stat_t
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:44 PM Revision 799323b1 (ceph): mon: trim old osdmaps
Trakc the oldest clean pg, and trim osdmaps that are older than that.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:44 PM Revision 2a9330fb (ceph): osd: do not request osdmap when loner if not booted
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:44 PM Revision 51fc234c (ceph): osd: is_active() instead of !is_booting() for map sharing checks
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:44 PM Revision 54ce5442 (ceph): monclient: implement get_version() to get newest _and_ oldest map versions
Reimplement is_latest_map() in terms of more general helper. Add field
to reply message.
Signed-off-by: Sage Weil <...
Sage Weil
08:44 PM Revision 04b90cc7 (ceph): monclient: clean up old version_requests on session reset
This throws out the old contexts. We assume that the caller will not wait
for them.
Signed-off-by: Sage Weil <sage@...
Sage Weil
08:44 PM Revision 9c56070b (ceph): monclient: use an async finisher
Otherwise we can deadlock if any completions call back into MonClient.
Signed-off-by: Sage Weil <sage@newdream.net>
Sage Weil
08:44 PM Revision b8aca4e2 (ceph): radosgw: run as 'user' field in ceph.conf
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:55 PM Revision 8e37140f (ceph): radosgw-admin: make 'log list' behave when .log pool dne
Signed-off-by: Sage Weil <sage.weil@dreamhost.com> Sage Weil
07:32 PM Revision 7b7ff6e8 (ceph): teuthology-worker: clean up last_in_suite jobs
There's no reason not to delete them once they start. Josh Durgin
07:17 PM Revision 3d3ba1eb (ceph): daemon-helper: detect the signal actually sent
I thought I fixed this when I implemented coverage collection, but I
guess it got lost in a rebase or something.
Josh Durgin
05:35 PM Feature #1559 (Resolved): teuthology: continue waiting if the lock server fails
Done in 107db6a913e58201ed51b097881979539723a139. Josh Durgin
04:08 PM rgw Feature #1593 (Resolved): rgw: normalize logging
Sage Weil
02:49 PM CephFS Bug #1596: mds crash during ffsb on kernel client in CInode::is_frozen
nothing obvious in the code. needs logs on this one. Sage Weil
10:35 AM CephFS Bug #1596 (Can't reproduce): mds crash during ffsb on kernel client in CInode::is_frozen
Logs in teuthology:~teuthworker/archive/nightly_coverage_2011-10-04/82:... Josh Durgin
02:46 PM Linux kernel client Bug #1597: general protection fault during mount
not sure if this is ceph related at all. google turned up
http://forum.ubuntu-it.org/index.php?topic=422944.msg33...
Sage Weil
11:28 AM Linux kernel client Bug #1597 (Can't reproduce): general protection fault during mount
Teuthology log in teuthology:teuthworker/archive/nightly_coverage_2011-10-04/84 shows:... Josh Durgin
02:40 PM Bug #1563 (Closed): OSD isn't prioritizing data with waiting ops during transfer

2011-09-26 12:36:16.150430 7f66b08b4700 -- 64.188.54.43:6800/4514 <== mds0 64.188.54.36:6800/10043 6 ==== osd_op(m...
Sage Weil
01:48 PM Bug #1562 (Resolved): osd: fix osd replacement
Sage Weil
12:32 PM Bug #1581: teuthology: restarting osds sometimes allows daemon-helper to fail
Probably fixed with 3d3ba1ebb1c9f145300e972829b73a7eeaf00faa. I'll close the issue if it doesn't recur in the next co... Josh Durgin
12:06 PM Bug #1581: teuthology: restarting osds sometimes allows daemon-helper to fail
Looking into this since it's happened again today. Josh Durgin
09:57 AM Bug #1581: teuthology: restarting osds sometimes allows daemon-helper to fail
This happened again in teuthology:~teuthworker/archive/nightly_coverage_2011-10-03/41... Josh Durgin
09:39 AM rgw Bug #1592: rgw: remove preallocated pools
Done in branch wip-rgw-pools as of commit:f458d7aac4597e7ad735853cfdfe405c80a2dacb. I'm going to try and finish clean... Greg Farnum
03:03 AM Revision 81c5f6ad (ceph): debian: don't recommend gceph
Not until it's worth installing.
Signed-off-by: Sage Weil <sage.weil@dreamhost.com>
Sage Weil
12:49 AM Revision d305d61b (ceph): ceph_manager: remove unused raw_pg_status method
Josh Durgin
12:49 AM Revision 8e031730 (ceph): ceph_manager: run ceph -s as a normal program
This allows failures from it to be detected better. Josh Durgin
12:11 AM Revision bad609e6 (ceph): teuthology-results: include passed tests in email
Josh Durgin
12:08 AM Revision 8bcd2a74 (ceph): teuthology-results: include reasons for failure in email
Josh Durgin
12:07 AM Revision 817b9504 (ceph): radosbench: get coverage and cores
Josh Durgin
12:07 AM Revision 1cad309d (ceph): Add failure_reason to summary for the first failure detected.
For now, this is the exception raised during a task, the error found
in the central log, or coredumps found. More spe...
Josh Durgin
12:07 AM Revision 030161ed (ceph): teuthology-ls: show reasons for failures with -v
Josh Durgin

10/03/2011

10:40 PM Revision 9f71bb0c (ceph): rgw: don't specify create_pool and set_marker in create_bucket.
It's wildly inappropriate for that kind of implementation detail to
leak out of the interface. For the moment, leave ...
Greg Farnum
10:23 PM Revision 3503b34a (ceph): Merge branch 'stable'
Sage Weil
10:23 PM Revision 4ae8f1c9 (ceph): radosgw: make stop succeed when not running
This fixes apt-get errors like
No /usr/bin/radosgw found running; none killed.
invoke-rc.d: initscript radosgw, acti...
Sage Weil
10:14 PM Revision 36f650b4 (ceph): rgw: move rgw_bucket_select_host_pool behind RGWAccess as select_bucket...
This doesn't really belong in front of the interface. Maybe later we
can hide it completely, but for now we can put i...
Greg Farnum
10:02 PM Revision c49c19c1 (ceph): radosgw: fix init-radosgw provides
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
09:32 PM Revision bbc644e0 (ceph): rgw: remove withdraw_pool function.
I think its use was broken anyway?
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com>
Greg Farnum
09:32 PM Revision 1018673c (ceph): rgw: remove preallocation of pools
Rename rgw_bucket_allocate_pool to rgw_bucket_select_host_pool, since
that better describes functionality we might ac...
Greg Farnum
09:31 PM Revision df9c6602 (ceph): rgw: remove preallocating pools maintenance tick
Signed-off-by: Greg Farnum <gregory.farnum@dreamhost.com> Greg Farnum
09:26 PM Revision fe1a271d (ceph): watch_notify_stress.py: add ceph flags option
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
09:26 PM Revision 28d60172 (ceph): ceph.py: add btrfs option
Signed-off-by: Samuel Just <samuel.just@dreamhost.com> Samuel Just
08:43 PM CephFS Bug #1472 (Can't reproduce): cfuse hangs with v0.34
Sage Weil
08:42 PM rgw Feature #1542 (Resolved): rgw: combine multiple buckets in a pool
Sage Weil
08:23 PM Revision 04f825ed (ceph): rgw: set xmlns on ListBucketResult element
Certain S3 clients (e.g., s3fs) require the xmlns to parse the
results correctly.
Signed-off-by: Henry C Chang <henr...
Henry Chang
08:19 PM Revision 848a1f95 (ceph): debian: add non-stripped exception for libcls_rgw rados class
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
08:18 PM Revision 5d54398c (ceph): mds: fix possible deadlock in multi-mds setup
This should fix the file_excl case on a file_max update when there
is more than one mds.
If we dont issue caps here ...
Brandon Seibel
08:18 PM Revision 2a078f89 (ceph): Makefile: more radosgw_admin -> radosgw-admin
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
07:22 PM Revision a192ee1b (ceph): basic: add rbd cli tests
Josh Durgin
07:20 PM Revision b968ff3a (ceph): workunit: clean up temporary files
Signed-off-by: Josh Durgin <josh.durgin@dreamhost.com> Josh Durgin
06:25 PM Revision 6103ee9e (ceph): more radosgw_admin -> radosgw-admin
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
06:20 PM Bug #1594 (Resolved): pgs stuck degraded or active after 3 hours
From teuthology:~teuthworker/archive/nightly_coverage_2011-10-03/42/teuthology.log... Josh Durgin
06:19 PM Revision c5b0e3ee (ceph): osd: trim ondisk log using Log::Entry::offset
The block_map is a waste of memory. It is simpler just to keep track of
what offset a Log::Entry was read/written/ap...
Sage Weil
06:13 PM Revision 12ce321b (ceph): ReplicatedPG: reset return code after find_object_context
This way the object is actually deleted when it has no snapshots,
since the transaction is not aborted. This makes
te...
Josh Durgin
05:59 PM Revision f85dfa74 (ceph): osd: combine log_op into append_log
They're both short, and log_op only ends up hiding information that
append_log will soon need (the raw Log::Entry's)....
Sage Weil
05:30 PM Feature #1557 (Resolved): teuthology: summary string
This does exceptions, the first most severe error in the central log (in order of SEC, ERR, WRN), and core dumps. Thi... Josh Durgin
04:56 PM Revision ae196028 (ceph): nuke: keep up with renaming cfuse -> ceph-fuse
Josh Durgin
04:55 PM Revision f4e61db8 (ceph): mdcache: tolerate no subtrees in create_subtree_map()
We don't really need mydir here. It is normally opened up in the
subsequent call to open_root(). That will be ident...
Sage Weil
04:55 PM Revision 8111b619 (ceph): mds: make jouranl writeable in MDLog::append()
When restarting a stopped MDS, we need to mark the Journaler read/write
before we use it. Do this in MDLog::append()...
Sage Weil
04:45 PM Revision a827f040 (ceph): Merge remote branch 'gh/stable'
Sage Weil
04:45 PM Revision b93a0077 (ceph): tasks/radosgw-admin: test radosgw-admin tool
Not yet complete... Sage Weil
04:45 PM Revision 2b601a32 (ceph): radosgw-admin: test additional keys, log list/show/rm
Signed-off-by: Sage Weil <sage@newdream.net> Sage Weil
04:44 PM Revision afc2dc08 (ceph): nuke: killall apache2 and radosgw too
Sage Weil
03:15 PM rgw Feature #1593 (Resolved): rgw: normalize logging
Instead of logging to stdout/err, use the normal log mechanism to write to wherever log_file points to. Then fix the... Sage Weil
02:22 PM Feature #1377 (Resolved): teuthology: use btrfs to back osds
Samuel Just
11:50 AM Bug #1554 (Resolved): rbd rollback does not work with a snapshot of an empty image
Fixed by commit:12ce321b3217b1f3134a7586c1f7d1e6cc384b02 Josh Durgin
11:47 AM CephFS Bug #1538 (Resolved): mds: all clients can and up becoming unresponsive, mds locker waiting for u...
I applied your patch, slightly modified.. I still pass need_issue and call issue_caps explicitly. Same end result, b... Sage Weil
10:18 AM Feature #1483 (Resolved): glance: rbd backend
Sage Weil
10:12 AM Feature #1583: osd: bound pg log memory usage
Just started looking at this, but alldown-to-cleanboth points to the Ondisklog::block_map, which is a stupid piece of... Sage Weil
10:04 AM rgw Bug #1592 (Resolved): rgw: remove preallocated pools
We don't want this since we're putting many buckets into a single pool now. Keeping it around breaks many interface l... Greg Farnum
10:01 AM rgw Feature #1544 (Resolved): rgw: explicit bucket index
We have a bucket index. Other tickets exist for doing the cleanup, etc. Greg Farnum
09:55 AM CephFS Bug #1589 (Resolved): MDS assert failure
Sage Weil

10/02/2011

10:30 PM Revision 4922757e (ceph): ceph.spec.in still packages libceph.so in 0.36
Package libcephfs.so instead of libceph.so.
Signed-off-by: Alexandre Oliva <oliva@lsd.ic.unicamp.br>
Alexandre Oliva
03:17 PM rgw Feature #1591 (Resolved): rgw: instrument with perfcounter
things like op counts, bytes, latency from receiving request to sending first byte Sage Weil
 

Also available in: Atom