Activity

From 03/26/2014 to 04/24/2014

04/24/2014

11:48 PM Revision 499adb1d (ceph): rados.h,ReplicatedPG: add CEPH_OSD_FLAG_ENFORCE_SNAPC and use on flush
We need to ensure that even with pool snaps, we use the snapc provided in order
to ensure that the clones are written...
Samuel Just
09:29 PM Revision ee69c7a4 (ceph): rgw: update idle_timeout for rgw_s3tests_multiregion.yaml
Fixes: #8194
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:49 PM rgw Fix #8210 (New): rgw: bulk deletes are slow
Ceph deletions are a bit problematic. They didn't implement bulk deletes correctly. Instead of being faster, they are... Ian Colle
08:42 PM Revision 9b37398d (ceph): Merge pull request #1717 from dachary/wip-auid
mon: add ceph osd pool set <pool> auid
Reviewed-by: Greg Farnum <greg@inktank.com>
Loïc Dachary
08:27 PM Revision e8b13f71 (ceph): Merge pull request #1724 from ceph/wip-uselocalgithubforqemu-wusui
Use new git mirror for qemu-iotests wusui
07:55 PM Revision ddf37d90 (ceph): Use new git mirror for qemu-iotests
Fixes: 8191
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Warren Usui
07:48 PM Revision 1885792c (ceph): ECBackend::continue_recovery_op: handle a source shard going down
get_min_avail_to_read_shards might return an error if there are
no longer enough sources to reconstruct the missing s...
Samuel Just
05:56 PM Bug #8193: HitSetTrim test in test/librados/tier.cc needs to be skipped if thrasher running

This particular test case is timing sensitive. It doesn't make sense to run it when the thrasher is running. This ...
David Zafman
04:27 PM rgw Bug #8194: rgw: test_region_copy_object fails with erasure coding
I pushed an update to the test suite, should be ok now. Yehuda Sadeh
02:24 PM rgw Bug #8194: rgw: test_region_copy_object fails with erasure coding
Cross region copy is just too slow due to ec backend. Apache ends up timing out. Need to increase the idle_timeout pa... Yehuda Sadeh
03:51 PM Bug #8207 (Duplicate): "[ERR] 3.6 missing primary copy.." in upgrade:dumpling-x:stress-split-fire...
This could be a duplicate of #7976
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-23_19:55:03-u...
Yuri Weinstein
03:46 PM Revision af209851 (ceph): Don't push last_in_suite jobs to paddles
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
02:44 PM Bug #8206 (Duplicate): "osd.4 ...[ERR] : 3.14 push" in upgrade:dumpling-x:stress-split-firefly-di...
This one was not reproduced on manual re-run.
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-2...
Yuri Weinstein
02:25 PM Fix #8205 (Resolved): FileStore: properly fill in XATTR_NO_SPILL_OUT tag
Right now, the only way the FileStore sets the XATTR_SPILL_OUT_NAME xattr to contain XATTR_NO_SPILL_OUT is when remov... Greg Farnum
02:21 PM Bug #8204 (Duplicate): "timed out waiting for admin_socket to appear after osd.5 restart" in upgr...
I could not reproduce it manually, but after consulting with devel I am still logging it, so we can trace it to similar race con... Yuri Weinstein
01:15 PM Feature #8203 (Resolved): Replica setting values in df output
The ability to see replica settings in ceph df. Potentially a warning when the replica is less than X in value, where... JuanJose Galvez
01:01 PM devops Bug #7617 (Fix Under Review): ceph-deploy uninstall should document why it doesn't remove all rel...
PR opened https://github.com/ceph/ceph-deploy/pull/182 Alfredo Deza
10:06 AM devops Bug #7617 (In Progress): ceph-deploy uninstall should document why it doesn't remove all relevant...
Alfredo Deza
12:50 PM Bug #8161: osd/ECBackend.cc: 475: FAILED assert(r == 0)
Samuel Just
10:16 AM Bug #8161: osd/ECBackend.cc: 475: FAILED assert(r == 0)
Samuel Just
12:29 PM rgw Bug #8202 (Resolved): rgw: failure to copy objects with chinese names
From mailing list:... Yehuda Sadeh
11:30 AM CephFS Bug #8201 (Resolved): client: (optionally) crash/exit if we are refused reconnect to the mds
Currently we hang, and there is no way for users of the fs to know that it is not going to unhang in the future. Sage Weil
11:29 AM CephFS Bug #8200: failing kclient_workunit_kclient test
teuthology-2014-04-20_23:04:17-kcephfs-master-testing-basic-plana/206396/ Greg Farnum
11:27 AM CephFS Bug #8200 (Resolved): failing kclient_workunit_kclient test
http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-22_23:05:45-kcephfs-firefly-testing-basic-plana/210650/
http:...
Greg Farnum
10:04 AM Bug #7891: osd: leaked pg refs on shutdown
ubuntu@teuthology:/a/sage-2014-04-23_18:03:07-rados-firefly-testing-basic-plana/211808 Samuel Just
10:03 AM Bug #8199 (Resolved): rados unit test failure: LibRadosTwoPoolsECPP.FlushTryFlushRaces hang
ubuntu@teuthology:/a/sage-2014-04-23_18:03:07-rados-firefly-testing-basic-plana/211806 Samuel Just
09:55 AM rbd Bug #8184 (Fix Under Review): krbd: make sure we have latest osdmap on 'rbd map'
wip-rbd-maposdmap-v2;
"rbd: make sure we have latest osdmap on 'rbd map'" on ceph-devel.
Ilya Dryomov
09:02 AM Bug #7922: osd: multi-backfill reservation does not release on reject
Kenneth Waegeman wrote:
> Is this fixed in 0.79? Or can I patch this myself? I seem to have this problem too
Yes...
Sage Weil
08:57 AM Bug #7922: osd: multi-backfill reservation does not release on reject
Is this fixed in 0.79? Or can I patch this myself? I seem to have this problem too. Kenneth Waegeman
08:29 AM Revision c0c2361b (ceph): brag : implement --verbose on client
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
08:18 AM Revision 70092110 (ceph): brag : document the zero argument behavior
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
08:18 AM Revision 2b16a818 (ceph): brag : meaningfull error messages
To help figure out problems, include the error message in the output
when a submission fails.
Signed-off-by: Loic Da...
Loïc Dachary
08:15 AM rbd Feature #2467 (New): qemu: implement bdrv_invalidate_cache
Josh Durgin
06:45 AM rbd Bug #8197 (Resolved): Cannot unlink rbd images using rbd-fuse
rbdfs_unlink is calling find_openrbd with the wrong path. The following patch fixes it.
diff --git a/src/rbd_fuse...
Shawn Edwards
05:59 AM devops Bug #7356: Kill all while loops that will never end....
This is still an issue. Alfredo Deza
01:00 AM Revision 2708c3c5 (ceph): Merge remote-tracking branch 'gh/firefly'
Sage Weil
12:58 AM rbd Bug #8178 (Resolved): 0.79: feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000...
Ilya Dryomov
12:58 AM rbd Bug #8178: 0.79: feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000000000
In terms of features, 3.13 is almost 6 months old (3.13-rc1 was
released 5 months ago). But yeah, we should definit...
Ilya Dryomov
12:55 AM Linux kernel client Feature #8196 (New): Document which features are supported by the kernel client
Document which kernel supports what and the possible pitfalls, like
#8178 and a number of "feature set mismatch" thr...
Ilya Dryomov
12:23 AM Revision d384d3a6 (ceph): Merge pull request #1720 from jdurgin/wip-list-children-test
test_rbd.py: ignore children in cache pools
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil

04/23/2014

11:07 PM Revision 5b979766 (ceph): Merge pull request #1719 from ceph/wip-8168
Wip 8168
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:11 PM Revision 39c1bfc4 (ceph): ReplicatedPG::do_op: don't return ENOENT for whiteout on snapdir read
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
09:11 PM Revision 83f89348 (ceph): ReplicatedPG::do_osd_ops: consider head whiteout in list-snaps
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:47 PM Revision 76a21389 (ceph): Merge pull request #1718 from ceph/wip-7882-wusui
Support latest qemu iotest code
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
08:28 PM Revision a83aff54 (ceph): test_rbd.py: ignore children in cache pools
This is necessary until http://tracker.ceph.com/issues/8187 is fixed.
Signed-off-by: Josh Durgin <josh.durgin@inktan...
Josh Durgin
08:20 PM Revision aae16ab3 (ceph): mon: add ceph osd pool set <pool> auid
When a pool is created with ceph osd pool create, the auid is not
inferred from the session auid and is set to zero. ...
Loïc Dachary
08:20 PM Revision 606e725e (ceph): Support latest qemu iotest code
Modified qemu-iotests workunit script to check for versions
that use the latest qemu (currently only Trusty). Limit ...
Warren Usui
06:11 PM Feature #8195: shorten window of highest risk during recovery

In the current scheme since the primary runs through the objects in a hashed order it allows new writes before or a...
David Zafman
04:53 PM Feature #8195 (New): shorten window of highest risk during recovery
Say a 3-sized PG experienced failure of two OSDs, the second one failing when the first replacement was part-way thro... Alexandre Oliva
05:13 PM Bug #8193: HitSetTrim test in test/librados/tier.cc needs to be skipped if thrasher running

2014-04-22T16:26:11.096 INFO:teuthology.task.workunit.client.0.out:[10.214.131.16]: [ RUN ] LibRadosTierECPP.H...
David Zafman
02:16 PM Bug #8193 (Resolved): HitSetTrim test in test/librados/tier.cc needs to be skipped if thrasher ru...
Command failed on 10.214.131.16 with status 1: 'mkdir -p --
/home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -...
Samuel Just
04:32 PM rgw Bug #8194 (Resolved): rgw: test_region_copy_object fails with erasure coding
It looks like this has been failing since we added erasure coding pools, but this is the most recent one:
http://q...
Josh Durgin
04:05 PM Bug #8168 (Resolved): osd: rbd_test.test_diff_iterate fails with a cache pool
Sage Weil
03:51 PM Revision 4ed25fdb (ceph): Merge pull request #1714 from ceph/wip-fs-client
two small fixes for client
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:07 PM Bug #8113 (Fix Under Review): agent_work can be continuously rescheduled during recovery while mo...
David Zafman
02:55 PM Bug #8066 (Duplicate): osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + fire...
Samuel Just
02:55 PM Bug #8066: osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + firefly)
Actually, osd.3 prematurely advanced last_backfill, was still on dumpling.
ubuntu@teuthology:/var/lib/teuthworker/...
Samuel Just
02:42 PM Bug #8066: osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + firefly)
2014-04-23 05:24:38.517382 7f7f6945d700 10 osd.0 911 dequeue_op 0x2b4d780 prio 127 cost 0 latency 0.000238 pg_backfil... Samuel Just
06:52 AM Bug #8066: osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + firefly)
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-22_19:55:02-upgrade:dumpling-x:stress-split-firefly... Sage Weil
02:25 PM Bug #8192 (Duplicate): osd.0 crashed in upgrade:dumpling-x:stress-split-firefly---basic-plana
Duplicate of 8180 Yuri Weinstein
02:11 PM Bug #8192 (Duplicate): osd.0 crashed in upgrade:dumpling-x:stress-split-firefly---basic-plana
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-22_19:55:02-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
11:10 AM Feature #8189 (New): ceph: display tier relationships visually
Currently tiering information is only available via 'ceph osd dump'. It's hard for humans to discern the tiering/cach... Josh Durgin
11:05 AM Bug #8185 (Won't Fix): ceph osd pool create does not set auid
comments I made in irc:
I don't think we want to automatically set auids; they aren't expected in a lot of the infra...
Greg Farnum
06:46 AM Bug #8185 (Fix Under Review): ceph osd pool create does not set auid
"proposed fix":https://github.com/ceph/ceph/pull/1715 Loïc Dachary
06:09 AM Bug #8185 (Won't Fix): ceph osd pool create does not set auid
When a pool is created using the command line by a user associated with *auid*, the pool is not associated (owned) by... Loïc Dachary
11:04 AM Feature #8188 (Resolved): librados: interface to inspect pool properties
Right now the only way to view pool properties about tiering (and probably a few other things) is via 'ceph osd dump'... Josh Durgin
10:59 AM rbd Bug #8187 (Resolved): librbd: list_children() reports duplicates with cache pools
list_children() and the internals of snap_unprotect() both go through all pools to check for children of a snapshot. ... Josh Durgin
09:33 AM Revision 26517504 (ceph): rbd: add libkrbd convenience library
Add libkrbd libtool convenience library to provide an interface for
mapping and unmapping rbd images programmatically...
Ilya Dryomov
09:33 AM Revision 2521e73a (ceph): mount.ceph: switch to module_load()
Implement modprobe() in terms of module_load() from common/module.h
Signed-off-by: Ilya Dryomov <ilya.dryomov@inktan...
Ilya Dryomov
09:33 AM Revision 0ba3960c (ceph): rbd: switch to libkrbd for 'rbd {map,showmapped,unmap}' operations
Thanks to libkrbd, 'rbd map' now outputs the device node it mapped to
to stdout:
$ sudo rbd map foo
/dev/rbd...
Ilya Dryomov
09:33 AM Revision 4238ffdc (ceph): doc: do not mention modprobe in rbd docs
rbd binary will load rbd.ko itself, with appropriate options. Loading
it by hand with default options is undesirable...
Ilya Dryomov
09:33 AM Revision 0c2b0fb8 (ceph): doc: 'rbd showmapped' doesn't need privileges
No need to run 'rbd showmapped' with sudo.
Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
Ilya Dryomov
09:33 AM Revision f6318545 (ceph): rbd: deprecate --no-settle option
Waiting for udev has been the default for a while now, and, after
switching to libkrbd, is no longer an option. (lib...
Ilya Dryomov
09:31 AM rbd Bug #8178: 0.79: feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000000000
Dear Ilya,
You got the right impression, but I didn't even map anything from the new erasure pool when connected RBD...
Dmitry Smirnov
12:20 AM rbd Bug #8178: 0.79: feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000000000
Hi Dmitry,
I'm assuming what you did is you created an EC pool, tried to map an
image out of the replicated pool,...
Ilya Dryomov
02:29 AM Revision bad34e90 (ceph): client: check cap ID when handling cap export message
handle following sequence of events:
- mds0 exports an inode to mds1. client receives the cap import
message from m...
Yan, Zheng
02:14 AM Revision 383d21dc (ceph): client: avoid releasing caps that are being used
To avoid releasing caps that are being used, encode_inode_release()
should send implemented caps to MDS.
Signed-off-...
Yan, Zheng
12:42 AM Revision 3a2c8886 (ceph): rados: add ec and rep lost_unfound_delete tests
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:39 AM Revision e64d8314 (ceph): task/: add tests for ec and rep mark_unfound_lost delete
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:36 AM Revision d726251f (ceph): doc: Fix hyperlink to CRUSH maps.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:36 AM Revision a06f8667 (ceph): Merge pull request #1713 from ceph/wip-7439
Wip 7439
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
12:31 AM Revision 6902e224 (ceph): doc: Added cache tiering settings to ceph osd pool set.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:30 AM Revision 0d964bc6 (ceph): doc: Added new cache tiering doc to index/TOC.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:30 AM Revision 44e4e3d5 (ceph): doc: Added new cache tiering doc to main docs.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins

04/22/2014

11:17 PM rbd Bug #8184: krbd: make sure we have latest osdmap on 'rbd map'
An attempt is in wip-rbd-maposdmap, Sage suggested the mon_get_version approach. Ilya Dryomov
11:14 PM rbd Bug #8184 (Resolved): krbd: make sure we have latest osdmap on 'rbd map'
... Ilya Dryomov
10:46 PM Revision 8350b6e4 (ceph): Bump psutil version requirement
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
10:43 PM Revision 2182815c (ceph): ReplicatedPG: handle ec pools in mark_all_unfound_lost
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
09:21 PM Revision 6769f4dc (ceph): 0.80-rc1
Jenkins
08:37 PM Revision 245923e7 (ceph): ReplicatedPG: enable mark_unfound_lost delete for ec pools
revert is tricky to implement at this time for ec pools, so
we'll instead just implement delete for ec pools.
Fixes:...
Samuel Just
07:51 PM Bug #8183 (Won't Fix): osd: In a tiered pool after successful removal request object still appear...

What works:
After a removal a second removal returns ENOENT
After a removal the creation of another object with t...
David Zafman
07:44 PM Revision 387110b1 (ceph): rados/singleton/all/cephtool: whitelist scrub vs split vs agent issue
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:41 PM Bug #8182 (Rejected): After rados bench on tiered pool can't remove objects
David Zafman
07:14 PM Bug #8182: After rados bench on tiered pool can't remove objects
My trace is after a subsequent removal attempt which wouldn't be clear from the description. Now that I look at the ... David Zafman
07:10 PM Bug #8182 (Rejected): After rados bench on tiered pool can't remove objects
I'm based on the firefly branch with changes to the tiering agent code which shouldn't affect this test.
$ ./rados...
David Zafman
07:40 PM Revision 47866fd2 (ceph): Merge pull request #1691 from ceph/wip-8139
osd_types: pg_t: allow is_split to handle checks for splits prior to the most recent
Reviewed-by: Samuel Just <sam.j...
Sage Weil
07:35 PM Revision 9078513c (ceph): Fix for #8115
Increase boot disk size per #8115 where monitors shut down due to
/ being full on vm machines.
Signed-off-by: Sandon...
Sandon Van Ness
07:16 PM CephFS Bug #8177 (Duplicate): Client: seg fault in verify_reply_trace on traceless reply
Zheng Yan
07:11 AM CephFS Bug #8177: Client: seg fault in verify_reply_trace on traceless reply
No idea if they are the same. Zheng Yan
06:52 AM CephFS Bug #8177: Client: seg fault in verify_reply_trace on traceless reply
See #5021 and wip-5021; possibly related (or same)?
The wip-5021 worked okay except that it caused a crash with sm...
Sage Weil
06:59 PM Revision 009e8746 (ceph): qa/workunits/rbd/copy.sh: skip some tests when tiering is enabled
The rados ls bit doesn't work.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@ink...
Sage Weil
06:59 PM Revision c0bff439 (ceph): qa/workunits/rbd/copy.sh: fix test
I broke this in commit 9d64ac66082bd108ec3c2a74e2e77475b5564eae.
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-...
Sage Weil
06:25 PM Revision 5daf5385 (ceph): ECBackend: use std::swap for boost::optional
Reviewed-by: Sage Weil <sage@inktank.com>
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:42 PM Feature #7439 (Resolved): EC: adapt unfound teuthology tests and add to nightly for EC
Samuel Just
04:43 PM Revision 90040490 (ceph): rbd: use stringify() in options parsing routines
Use stringify() in map_option_{uuid,ip,int}_cb() instead of essentially
open-coding it.
Signed-off-by: Ilya Dryomov ...
Ilya Dryomov
04:43 PM Revision 070a8208 (ceph): configure: check for blkid/blkid.h header
The check for the presence of blkid/blkid.h was missing.
Signed-off-by: Ilya Dryomov <ilya.dryomov@inktank.com>
Ilya Dryomov
04:43 PM Revision 944dd1c6 (ceph): Makefile: build common/secret.c with libtool
Turn common/secret.c into a libtool convenience library, libsecret.la.
Currently it is built directly, twice: for mou...
Ilya Dryomov
04:43 PM Revision ac9b461f (ceph): common: add module_{load,has_parameter}()
Add two kernel module helpers: module_{load,has_parameter}(). They
are going to live in common/module.[ch].
Signe...
Ilya Dryomov
04:43 PM Revision be081dbd (ceph): stringify: use ostringstream instead of stringstream
Use ostringstream, as we don't need both input and output of the
stringstream in stringify().
Signed-off-by: Ilya Dr...
Ilya Dryomov
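As a rough illustration of the change described in be081dbd, a stringify() helper built on std::ostringstream might look like the sketch below. The template signature and the parameter name are assumptions made for this example; the actual helper in the Ceph tree may differ.

```cpp
#include <sstream>
#include <string>

// Sketch only: signature and names are assumed, not copied from Ceph.
// std::ostringstream is output-only, which is all a "format this value
// as text" helper needs; std::stringstream would add unused input support.
template <typename T>
std::string stringify(const T& value) {
  std::ostringstream out;
  out << value;
  return out.str();
}
```

With this sketch, stringify(42) yields the string "42".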
04:02 PM Revision 6cb5ce86 (ceph): Merge pull request #1710 from ceph/wip-coverity
a couple coverity fixes
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
03:51 PM Bug #8168: osd: rbd_test.test_diff_iterate fails with a cache pool
wip-8168 didn't fix it:... Josh Durgin
02:59 PM Bug #8168: osd: rbd_test.test_diff_iterate fails with a cache pool
Here's the osd/filestore log of one of the failing list-snaps calls:... Josh Durgin
02:14 PM Bug #8168: osd: rbd_test.test_diff_iterate fails with a cache pool
So the scenario that is failing (still fails on master) is that after the objects are deleted via rbd_discard(), diff... Josh Durgin
03:37 PM Revision f244109c (ceph): Merge pull request #1711 from ceph/wip-coverity-respawn
mds: make strncpy in ::respawn safer
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:31 PM Revision cac15c7d (ceph): mds: make strncpy in ::respawn safer
Previous code assumed null terminated argv[0]
was not longer than PATH_MAX and the resulting
strncpy was not strictly...
John Spray
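The general technique behind a "safer strncpy" can be sketched as follows. This is not the code from cac15c7d: copy_bounded() is a hypothetical helper written for this example, and it only shows the usual pattern of bounding the copy and forcing null termination.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical helper (not from the Ceph tree).
// Copies src into dst, which has room for dst_size bytes, and always
// leaves dst null-terminated. Returns false if src did not fit.
inline bool copy_bounded(char* dst, std::size_t dst_size, const char* src) {
  if (dst_size == 0) {
    return false;  // nowhere to write even the terminator
  }
  // strncpy does not terminate dst when src is too long, so copy at
  // most dst_size - 1 bytes and write the terminator explicitly.
  std::strncpy(dst, src, dst_size - 1);
  dst[dst_size - 1] = '\0';
  return std::strlen(src) < dst_size;
}
```

A caller can check the return value to detect truncation instead of silently using a shortened path.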
03:29 PM Revision b4eb5025 (ceph): osd/osd_types: RWState: initialize snaptrimmer_write_marker
** CID 1204295: Uninitialized scalar field (UNINIT_CTOR)
/osd/osd_types.h: 2716 in ObjectContext::RWState::RWState(...
Sage Weil
03:28 PM Revision 4e5f4420 (ceph): osdc/Objecter: drop unused field
This was missed by 860d72770cdf092c027d50f4ee03bed76c975599.
** CID 1204296: Uninitialized scalar field (UNINIT_CTO...
Sage Weil
03:27 PM Revision 124a663a (ceph): doc/release-notes: a bit of prose about firefly
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:00 PM Bug #8180 (Duplicate): osd.3 crashed in upgrade:dumpling-x:stress-split-firefly-distro-basic-vps
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-21_20:35:06-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
01:11 PM Bug #8036 (In Progress): levedb: throws std::bad_allow on 14.04
Joao Eduardo Luis
01:10 PM Bug #7942 (Resolved): promote uses cloneid, but backend may have a different cloneid
Samuel Just
01:08 PM Bug #7398 (Resolved): osd: ERANGE from clone
7e697b1bc2ffac086b6a24f97aba755401cd8c37 Samuel Just
01:07 PM Bug #8067 (Duplicate): mon: enomem on vps, killed at ~800MB
Samuel Just
01:06 PM Bug #8082 (Duplicate): hung recovery
Samuel Just
01:05 PM Bug #7987 (Duplicate): osd: backfill/recovery makes no progress
Samuel Just
12:41 PM devops Feature #7716 (Resolved): Build debug packages for EL6
Verified the CentOS release build machine is now including the debug packages with today's build (the build I schedule... Sandon Van Ness
12:39 PM Bug #8139 (Resolved): osd/osd_types.cc: 398: FAILED assert(m_seed < old_pg_num)
Sage Weil
11:57 AM devops Bug #7356 (Need More Info): Kill all while loops that will never end....
Is this still an issue? Zack Cerza
08:07 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Tom Verdaat wrote:
> Lauri Vant wrote:
> > When can we expect this to be resolved?
>
> Based on my understanding...
Lauri Vant
06:48 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Note that the Trusty Ubuntu archive will contain 0.80 (it already contains 0.79) once it's released by Inktank. Peter Matulis
05:44 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Lauri Vant wrote:
> When can we expect this to be resolved?
Based on my understanding of Sandon's replies above: ...
Tom Verdaat
04:26 AM Revision 66170f39 (ceph): osd/osd_types: pg_interval_t: dump primary
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:26 AM Revision 931ae6b8 (ceph): osd/osd_types: pg_interval_t: include up_primary in pg_interval_t
Nothing uses this, but it triggers a new interval, which makes it confusing
when it is not recording in the interval ...
Sage Weil
04:26 AM Revision 18aded2e (ceph): osd/osd_types: pg_interval_t: include primaries in operator<<
Also make up vs acting explicit.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:26 AM Revision 000233f7 (ceph): osd: change in up set primary constitutes a peering interval change
In several places, a change in the up_primary triggers a new peering
interval, but the places that actually generate ...
Sage Weil
04:26 AM Revision 5562e26e (ceph): osd: use parent pgid (as appropriate) in generate_past_intervals()
Feed in the ancestor pg_t (if any) when we are looking at intervals for
previous maps that may have preceded a recent...
Sage Weil
03:49 AM Revision 62301462 (ceph): Merge pull request #1651 from enovance/wip-brag
Few bug fixes in ceph-brag
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:20 AM Revision 025ab9f4 (ceph): doc/release-notes: v0.80
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:48 AM rbd Bug #8178 (Resolved): 0.79: feature set mismatch, my 4a042a42 < server's 104a042a42, missing 1000...
For some weeks I had no trouble with RBD clients on Linux-3.13.10 x86_64.
Today after I created new erasure pool a...
Dmitry Smirnov

04/21/2014

11:55 PM Bug #8113: agent_work can be continuously rescheduled during recovery while most objects are missing
David Zafman
11:53 PM Revision c80f128c (ceph): Merge pull request #1707 from ceph/wip-rbd-test
rbd: fix tests for cache pools
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
11:47 PM Revision a80e66f9 (ceph): qa/workunit/rbd/import_export.sh: skip list-objects tests with tiering
Listing objects isn't reliable with cache pools; skip that part of the
test if we see that rbd has tiering enabled.
...
Sage Weil
11:26 PM Revision 9d64ac66 (ceph): qa/workunit/rbd/copy.sh: do not delete/recreate rbd pool
Among other things, it breaks when tiering is enabled.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:43 PM Revision c3833d7c (ceph): doc: Fixed syntax to include 'pool'.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:31 PM Revision 8620bd2f (ceph): PG::PriorSet: consider lost osds in up_now for pcontdec
Otherwise, the pg will remain down even as osds are marked lost.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:18 PM CephFS Bug #8177: Client: seg fault in verify_reply_trace on traceless reply
Zheng Yan
08:37 PM CephFS Bug #8177: Client: seg fault in verify_reply_trace on traceless reply
Also /a/teuthology-2014-04-14_23:00:38-fs-master-testing-basic-plana/192241 Greg Farnum
08:31 PM CephFS Bug #8177 (Resolved): Client: seg fault in verify_reply_trace on traceless reply
http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-17_23:01:49-fs-firefly-distro-basic-plana/199687/... Greg Farnum
10:13 PM Revision f44e2c82 (ceph): Merge pull request #1703 from ceph/wip-7942
Wip 7942
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
10:11 PM Revision 95394b60 (ceph): ReplicatedPG::do_op: check for blocked snapset obj
Otherwise, we might use an invalid snapset in find_object_context.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:11 PM Revision 8259d874 (ceph): ReplicatedPG: in trim, grab w locks on obc and snapset_obc
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:11 PM Revision f3df5018 (ceph): ReplicatedPG: do not create whiteout clones
First, make_writeable treats whiteout heads like snapdir for
cloning purposes. Second, to ensure that we send the co...
Samuel Just
10:11 PM Revision 0d5a5393 (ceph): ReplicatedPG: if we get ENOENT on clone, remove clone from snapset
Fixes: #7916
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:11 PM Revision caa63565 (ceph): ReplicatedPG,rados: add CEPH_OSD_[COPY_FROM]_MAP_SNAP_TO_CLONE
When promoting a clone, we want to use the provided snapid to
specify the clone id directly.
Signed-off-by: ...
Samuel Just
09:28 PM Revision bd39ecd6 (ceph): Merge pull request #1705 from ceph/wip-8124
Wip 8124
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:18 PM Revision 2cb0bac6 (ceph): qa/workunits/cephtool/test.sh: make set pg_num test non-racy
Loop while the pool is creating.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank....
Sage Weil
05:53 PM Revision e4a048c4 (ceph): ECMsgTypes::ECSubWrite: fix at_version indentation
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:53 PM Revision 7bb20115 (ceph): encoding: use unqualified name for encode/decode in boost::optional enc...
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:53 PM Revision 5821cc7e (ceph): osd/: propogate hit_set history with repop
We don't actually send the whole info on each repop, just the log
entries, updated stats, and a few other bits. For ...
Samuel Just
05:53 PM Revision 16eccdd3 (ceph): PG,PGLog: update hit_set during peering
Fixes: #8124
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:53 PM Revision f7e75880 (ceph): ReplicatedPG::agent_load_hit_sets: take ondisk_read_lock
Otherwise, the hit_set might be not yet written due to a recently
completed recovery.
Signed-off-by: Samuel Just <sa...
Samuel Just
05:53 PM Revision 506dce84 (ceph): ReplicatedPG: do not use shard for hit_set object names
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:52 PM Revision ddf1e986 (ceph): osd: track the number of hit_set archive objects in a pg
Also, use this value in agent_choose_mode instead of the max
number.
Related: #8124
Signed-off-by: Samuel Just <sam....
Samuel Just
05:46 PM Revision 1fb90c94 (ceph): ReplicatedPG::hit_set_persist: clean up degraded check
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:21 PM Bug #8168: osd: rbd_test.test_diff_iterate fails with a cache pool
The actual error is that the diff is nothing instead of a 512 byte extent being discarded. The error removing the ima... Josh Durgin
11:24 AM Bug #8168 (Resolved): osd: rbd_test.test_diff_iterate fails with a cache pool

With a cache pool set up, test_rbd.test_diff_iterate, which removes the two snapshots it creates before removing ...
Josh Durgin
04:55 PM Bug #8176 (Resolved): Change target_max_objects/target_max_bytes has no immediate effect

I would expect to be able to change these values and affect the balance of data in the cache/base tiers. The funct...
David Zafman
04:52 PM Bug #8175 (Resolved): Some values of target_max_objects for tiering will crash OSDs

ceph osd pool set cache target_max_objects 10
It looks like a value below 1024 will cause x/1024 == 0 which will...
David Zafman
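The arithmetic in the report above can be checked with a small sketch. Only the "x/1024" expression comes from the report; the names kDivisor and scaled() are hypothetical, and how the OSD uses the result is not shown here.

```cpp
#include <cstdint>

// Assumed divisor, taken from the "x/1024" expression in the report.
constexpr std::uint64_t kDivisor = 1024;

// Unsigned integer division truncates, so every x below 1024 maps to 0.
constexpr std::uint64_t scaled(std::uint64_t x) { return x / kDivisor; }

static_assert(scaled(10) == 0, "target_max_objects 10 truncates to 0");
static_assert(scaled(1023) == 0, "largest value that still truncates to 0");
static_assert(scaled(1024) == 1, "first value with a non-zero result");
```

Any later code that divides by such a result would need to guard against the zero case.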
03:48 PM CephFS Bug #8172 (Resolved): ceph_get_cap+0x2b/0x120
commit b9baf44e(ceph: pre-allocate ceph_cap struct for ceph_add_cap()) Zheng Yan
02:44 PM CephFS Bug #8172 (Resolved): ceph_get_cap+0x2b/0x120
... Sage Weil
03:11 PM Revision 1448cdf5 (ceph): Work around #8166
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
02:54 PM Bug #8174 (Resolved): rados put of a long object name crashes the OSD process

rados -p testpool put
foo.0000000000000000.0000000000000000.0000000000000000.0000000000000000.0000000000000000.00...
Sheldon Mustard
02:27 PM Bug #8124 (Resolved): too many hitset objects preventing full state from ending
Sage Weil
02:10 PM Bug #8171 (Resolved): crypto: cryptopp has precendence over libnss
We need libnss to have precedence over libcryptopp; however, at the moment it's the other way around. If both are inst... Yehuda Sadeh
01:56 PM Revision b7394efe (ceph): multimds: bump up timeout for misc.yaml
This keeps timing out after 3h.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:41 PM rgw Bug #8170 (Resolved): rgw: missing manifest response header when reading swift user manifest object
Yehuda Sadeh
12:41 PM rgw Bug #8169 (Resolved): rgw: swift user manifest does not compute etag
etag for swift user manifest objects should contain the hash of the concatenated etags for all the parts. Currently it... Yehuda Sadeh
11:02 AM Feature #7514 (Resolved): qa: add ceph_test_objectstore to rados test suite
added by Josh Durgin on 7afc277736612eb624449f10743958da37f62f9a in the qa-suite repo Joao Eduardo Luis
10:57 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Sandon Van Ness wrote:
> There was a problem with our repo generator script for release builds which was causing eve...
Lauri Vant
10:53 AM Bug #8165 (Duplicate): mon: subscribe doesn't wait for PaxosService readable
dup of #7997 Sage Weil
09:38 AM Bug #8165 (In Progress): mon: subscribe doesn't wait for PaxosService readable
Joao Eduardo Luis
07:08 AM Bug #8165 (Duplicate): mon: subscribe doesn't wait for PaxosService readable
ubuntu@teuthology:/a/teuthology-2014-04-20_19:33:18-upgrade:dumpling-x:parallel-firefly---basic-plana/205300... Sage Weil
03:30 AM Revision 476b929e (ceph): Update mkcephfs.rst
There should be no blank between mount options. xanpeng
01:59 AM Revision 95d0278d (ceph): ReplicatedPG::mark_all_unfound_lost: delete local copy if necessary
There might be a local copy for an EC pool in the DELETE case. The replica
copies should be already handled by merge...
Samuel Just

04/20/2014

03:06 PM Bug #8164 (Duplicate): "[ERR] 4.15 0 tried to pull" in upgrade:dumpling-x:stress-split-firefly---...
I could not reproduce it manually.
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-19_19:55:01-u...
Yuri Weinstein
02:42 PM Bug #8163 (Resolved): stuck recovering due to a 50 min delay in processing Push op
Based on master, 7a61cdbfd533c1092fc61acb7042053251c03f7f (actual branch wip-sam-testing-safe sha1:
ef0fb611696929c...
Samuel Just
02:32 AM CephFS Bug #8025 (Resolved): nfs-on-kclient: rm -r failed
Zheng Yan

04/19/2014

10:23 PM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
Already using. :) Thanks for the useful advice. Very helpful. Dmitry Smirnov
08:43 PM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
3.15. For now, please use readdir_max_entries mount option Zheng Yan
06:23 PM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
Makes sense, thank you for explaining. Dmitry Smirnov
09:34 PM Bug #8162 (Pending Backport): osd: dumpling advances last_backfill prematurely
this is the bug where a dumpling osd advances last_backfill prematurely Sage Weil
09:03 PM Bug #8162 (In Progress): osd: dumpling advances last_backfill prematurely
Sage Weil
11:55 AM Bug #8162 (Resolved): osd: dumpling advances last_backfill prematurely
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-18_20:35:03-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
06:50 PM Bug #8048: osd/ReplicatedPG: FAILED assert(!parent->get_log().get_missing().is_missing(soid))
Please have a look at the comments of bug #8008 -- there may be some additional information related to this issue. I ... Dmitry Smirnov
06:46 PM Bug #8008: osd/ReplicatedPG.cc: 258: FAILED assert(missing_loc.needs_recovery(hoid)) during pg re...
This may be similar to #8048, so I applied the corresponding fix commit:3d0e80ac as well as a number of other post-0.79 PG-r... Dmitry Smirnov
01:53 PM Bug #8008: osd/ReplicatedPG.cc: 258: FAILED assert(missing_loc.needs_recovery(hoid)) during pg re...
Unfortunately, revision 6ff645f5 applied on top of 0.79 did not fix the issue.
I can't repair inconsistent PG.
<...
Dmitry Smirnov
12:58 PM Revision 61b6564b (ceph): Simple mechanical cleanups
* Removed trailing and useless whitespaces
* Removed useless imports.
Signed-off-by: Christopher Glass <christopher....
Chris Glass
09:03 AM Bug #8161 (Resolved): osd/ECBackend.cc: 475: FAILED assert(r == 0)
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-18_21:29:10-rados:thrash-testing-testing-basic-plana/2022... Sage Weil
08:15 AM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-18_21:29:10-rados:thrash-testing-testing-basic-plana/202157 Sage Weil
03:04 AM devops Bug #8160 (Duplicate): multipath-tools does not co-exist with ceph
If *multipath-tools 0.4.9-3ubuntu5* is installed on a... Loïc Dachary
12:33 AM Revision 7a61cdbf (ceph): buffer: adjust #include order
The pthread.h include is somehow clobbering things, although it is not
clear how. :(
Signed-off-by: Sage Weil <sage...
Sage Weil

04/18/2014

10:12 PM Revision 74f4d573 (ceph): Merge pull request #1696 from ceph/wip-8097
buffer: use Mutex instead of Spinlock for raw crcs
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
09:32 PM Revision fbb90f6e (ceph): Merge pull request #26 from ceph/wip-rbd-cache
test rbd with cache pool
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Sage Weil
09:24 PM Revision 0234bcfc (ceph): Merge pull request #1697 from ceph/wip-num_objects_omap
osd_types::object_stat_sum_t: fix add/sub for num_objects_omap
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:09 PM Revision e087eae8 (ceph): Merge pull request #1695 from ceph/wip-8153
Revert "ReplicatedPG::get_snapset_context: assert snap obj is not missin...
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
08:59 PM Revision f9e9365f (ceph): Revert "ReplicatedPG::get_snapset_context: assert snap obj is not missing"
This breaks mark_lost_unfound_revert.
This reverts commit 0d2177a18071ad9c9581826a43751c36bab5b2db.
Samuel Just
08:57 PM Bug #7987: osd: backfill/recovery makes no progress
/a/teuthology-2014-04-18_02:30:16-rados-master-testing-basic-plana/200757 Sage Weil
08:55 PM Bug #7986: 3.1s0 scrub stat mismatch, got 2041/2044 objects, 0/0 clones, 2041/2044 dirty, 0/0
ubuntu@teuthology:/a/teuthology-2014-04-18_02:30:16-rados-master-testing-basic-plana/200799
failure_reason: '"2014...
Sage Weil
08:54 PM Revision dec77c34 (ceph): Merge pull request #1693 from ceph/wip-7997
mon: fix get_version race (more)
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
Sage Weil
08:50 PM Revision 4413670d (ceph): osd: throttle snap trimming with simple delay
This is not particularly smart, but it is *a* knob that lets you make
the snap trimmer slow down. It's a flow and a ...
Sage Weil
08:41 PM Revision 82edda23 (ceph): test: handle the create-pg delay when testing cache split syntax
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
Greg Farnum
06:47 PM Revision d07ce841 (ceph): Merge pull request #1692 from ceph/wip-7784
mon: OSDMonitor: HEALTH_WARN on 'mon osd down out interval == 0'
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
06:15 PM Revision b2112d50 (ceph): mon: OSDMonitor: HEALTH_WARN on 'mon osd down out interval == 0'
A 'status' or 'health' request will return a HEALTH_WARN whenever the
monitor handling the request has the option set...
Joao Eduardo Luis
06:12 PM Revision 09985d25 (ceph): mon: wait for PaxosService readable in handle_get_version
We were waiting for the election to finish, but we need to *also* wait for
paxos to recover. Being a peon or leader ...
Sage Weil
06:10 PM Bug #8113 (In Progress): agent_work can be continuously rescheduled during recovery while most ob...
David Zafman
04:53 PM Revision d7967b42 (ceph): rbd/thrash: factor out install + ceph
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:51 PM Revision e97b8650 (ceph): rbd: do most tests with a (small) cache pool in front
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:51 PM Revision 03a84442 (ceph): rbd/basic: factor out install + ceph
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:49 PM Revision 007d9752 (ceph): Require requests >= 1.0
Zack Cerza
03:45 PM Bug #8156 (Rejected): Crash in Thread.cc "common/Thread.cc: 110: FAILED assert(ret == 0)" in upgr...
this appears to be a simple out of memory (it's a few lines further up in the teuthology log). we need more memory o... Sage Weil
02:53 PM Bug #8156 (Rejected): Crash in Thread.cc "common/Thread.cc: 110: FAILED assert(ret == 0)" in upgr...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-17_20:35:01-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
03:11 PM Messengers Bug #8097 (Resolved): msgr: mon stuck in set_crc()
Samuel Just
02:43 PM Fix #5844: osd: snaptrimmer should throttle itself
This has been seen starving client I/O. David Zafman
02:38 PM Feature #8155 (Resolved): Disallow changing cache_mode in nonsensical ways
We currently have no limitations on when users can change cache_mode. Anybody who changes it from "writeback" to "non... Greg Farnum
02:08 PM Bug #8153 (Resolved): osd/ReplicatedPG.cc: 7221: FAILED assert(attrs || (!pg_log.get_missing().is...
Sage Weil
01:56 PM Bug #8153 (Resolved): osd/ReplicatedPG.cc: 7221: FAILED assert(attrs || (!pg_log.get_missing().is...

ceph version 0.79-247-gd07ce84 (d07ce84148edf0ee4a7271b9ee691815be91520e)
1: (ReplicatedPG::get_snapset_context(...
Samuel Just
01:59 PM Revision c623b3dd (ceph): rados/thrash: whitelist 'must scrub before tier agent can activate'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:53 PM Bug #7997 (Resolved): handle_get_version returns old map epochs
Sage Weil
09:33 AM Bug #7997 (Fix Under Review): handle_get_version returns old map epochs
Sage Weil
11:47 AM Feature #7784 (Pending Backport): mon osd down out interval = 0 should prevent ceph health from r...
Sage Weil
09:43 AM Feature #7784 (Fix Under Review): mon osd down out interval = 0 should prevent ceph health from r...
Went with the simplest approach: have the leader spit out the warning if it has the option set to zero. All other mo... Joao Eduardo Luis
11:30 AM devops Bug #8151 (Rejected): Perms on /etc/ceph/ceph.client.admin.keyring wrong on some nodes after install
... Christina Meno
09:40 AM Feature #8150 (Resolved): mon: disseminate config options throughout the mon cluster
There are some options that will affect cluster-wide behavior.
Having all the monitors in the quorum using the lea...
Joao Eduardo Luis
09:32 AM Bug #8133 (Duplicate): "Segmentation fault" in upgrade:dumpling-x:parallel-firefly---basic-plana ...
dup #7997 Sage Weil
06:59 AM Feature #8147 (Resolved): osd: make split automatically trigger scrub
i think this is probably a good idea anyway, but it's more important given the cache tiering agent. Sage Weil
06:36 AM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
When -ENOMEM happens, the kclient does not properly release (cache coherence related) resources. that's why ceph-fuse... Zheng Yan
05:04 AM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
I found the commit in ceph-client: https://github.com/ceph/ceph-client/commit/54008399dc0ce511a07b87f1af3d1f5c791982a4
...
Dmitry Smirnov
05:51 AM devops Bug #7889 (Resolved): IPv6 support with ceph-deploy
Merged into ceph-deploy master branch with hash a3a61b7 Alfredo Deza
04:21 AM Revision 7251983d (ceph): Merge pull request #1676 from ceph/wip-8092
Wip 8092
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:19 AM Revision 375e4ee8 (ceph): Merge pull request #1678 from ceph/wip-8108
osd: OSDMap: have osdmap json dump print valid boolean instead of string
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:17 AM Revision 8fb2388d (ceph): osd_types: pg_t: add get_ancestor() method
Give us the ancestor for when the pool had a past value for pg_num.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:58 AM Revision 7afc2777 (ceph): rados: include objectstore tests
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
12:53 AM Revision 2dd2b11f (ceph): Merge pull request #1683 from ceph/wip-mds-op-prio
mds: dynamically adjust priority of committing dirfrags
Reviewed-by: Greg Farnum <greg@inktank.com>
Gregory Farnum

04/17/2014

11:28 PM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
Sorry, that can't be right. First of all I can't find this commit. Could you please use correct commit ID? I'd like t... Dmitry Smirnov
11:13 PM CephFS Bug #8140: 0.79: MDS / CephFS: unable to read directory
Which kernel version contains this fix? Dmitry Smirnov
05:14 PM CephFS Bug #8140 (Resolved): 0.79: MDS / CephFS: unable to read directory
This issue should be fixed by commit 54008399 (ceph: preallocate buffer for readdir reply). For old kernel, you can a... Zheng Yan
02:11 PM CephFS Bug #8140 (Resolved): 0.79: MDS / CephFS: unable to read directory
With the kernel client I got the following error when I attempted to list files in a directory containing 1021 files:
<p...
Dmitry Smirnov
09:51 PM Revision dea70112 (ceph): Merge pull request #1689 from ceph/wip-8091
Wip 8091
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:46 PM Revision 7e697b1b (ceph): ReplicatedPG::recover_replicas: do not recover clones while snap obj is...
Otherwise, we cannot safely read the snapset for the clone.
Fixes: #8091
Signed-off-by: Samuel Just <sam.just@inktan...
Samuel Just
09:46 PM Bug #8143 (Resolved): BuildRoot is now silently ignored in .spec files
http://fedoraproject.org/wiki/Packaging:Guidelines#BuildRoot_tag
I chased this for a long time until I found that ...
Dan Mick
09:21 PM CephFS Bug #8092 (Resolved): multimds ceph-fuse hang on write waiting for max size
Sage Weil
09:18 PM Bug #8108 (Resolved): OSD json output uses strings for booleans
Sage Weil
09:06 PM Revision 0e90c69f (ceph): watch_tube() belongs to the beanstalk module
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
08:33 PM Revision e9a1c778 (ceph): Update requests version
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
08:31 PM Revision 3ad51c8e (ceph): osd_types::object_stat_sum_t: fix add/sub for num_objects_omap
Introduced in a130a4452e4fb159dc62fb417077d98dc9ebd621
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:18 PM Revision 79e7db75 (ceph): Merge pull request #1688 from ceph/wip-8048
osd/ReplicatedPG: check clones for degraded
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:18 PM Revision ac014510 (ceph): Merge pull request #1685 from ceph/wip-8132
mon: set leader commands prior to first election
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
Sage Weil
08:11 PM Revision 3d0e80ac (ceph): osd/ReplicatedPG: check clones for degraded
We check whether the head is degraded, and we check whether a clone is
unreadable, but in the case where we have a ca...
Sage Weil
08:03 PM Revision 5dbc6426 (ceph): s/wait-for-package/wait_for_package/
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
07:49 PM Revision 224a0f57 (ceph): Merge pull request #1674 from ceph/wip-8086
ReplicatedPG::agent_work: skip hitset objects before getting object cont...
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
07:40 PM Revision 5580ffb8 (ceph): Merge pull request #242 from ceph/wip-7773
Mirror beanstalkd queue in paddles Alfredo Deza
05:50 PM Revision 26f4d5b0 (ceph): Merge pull request #1687 from ceph/wip-8130
osdc/Objecter: fix osd target for newly-homeless op
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:48 PM Revision 93c0515f (ceph): osdc/Objecter: fix osd target for newly-homeless op
If we recalculate the mapping and find that there is no primary, we need
to set the 'osd' field to -1. Otherwise, th...
Sage Weil
05:35 PM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
Samuel Just
02:00 PM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
and to check that agent_work also does the right thing Samuel Just
02:00 PM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
ReplicatedPG::do_op already does the right thing as far as blocking ops which may flush. What remains is to avoid fl... Samuel Just
01:56 PM Bug #8011 (In Progress): osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >...
Samuel Just
05:27 PM Revision 03b8cdac (ceph): Refactor try_delete_jobs()
Also tweak its error message Zack Cerza
05:27 PM Revision 66a27422 (ceph): Add methods for querying and deleting jobs
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision 165f5d53 (ceph): When killing a run, delete paddles jobs
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision 1449e753 (ceph): Use shared methods to connect to beanstalkd
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision 8a4de411 (ceph): Rename teuthology.queue to teuthology.worker
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision 8fdea4d1 (ceph): Submit queued jobs to paddles
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision ee33192f (ceph): When deleting jobs, also delete them from paddles
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision 741c773b (ceph): Look for archive_base in config
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Revision d12e6f4e (ceph): Be slightly less verbose about logging
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:22 PM Bug #8044 (Duplicate): osd/ReplicatedPG.cc: 2276: FAILED assert(p != snapset.clones.end())
this looks like a dup of #8091 Sage Weil
05:07 PM Revision fe71a12d (ceph): Merge pull request #1684 from onlyjob/debian
spelling corrections
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
05:05 PM Revision b0338ca3 (ceph): Merge pull request #1671 from ceph/wip-7699
mds: Fix respawn (add path resolution)
Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inkt...
Sage Weil
05:03 PM Revision 3a794d5f (ceph): Merge pull request #1677 from ceph/wip-poolset-noblock
mon: Don't block on EAGAIN from `osd pool set`
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:33 PM Revision 881680ee (ceph): mon: set leader commands prior to first election
If we have just started and receive a command, we currently will reply with
EINVAL because the leader commands are em...
Sage Weil
04:08 PM Revision fc948794 (ceph): safe_while: Don't sleep() on the first attempt
This was causing unnecessary delays in several places
Signed-off-by: Zack Cerza <zack.cerza@inktank.com>
Zack Cerza
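The idea behind this change, sleeping only between attempts rather than before the first one, can be sketched as follows. This is a minimal illustration and not teuthology's safe_while implementation: the retry() helper, its parameters, and the injectable sleep argument are all assumptions made for the example.

```python
import time


def retry(action, tries=5, delay=1.0, sleep=time.sleep):
    """Call action() until it returns a truthy value or the tries run out.

    Hypothetical helper: it sleeps only between attempts, never before
    the first one, so a call that succeeds immediately incurs no delay.
    """
    for attempt in range(tries):
        if attempt > 0:
            # Only back off after a failed attempt.
            sleep(delay)
        result = action()
        if result:
            return result
    raise RuntimeError("gave up after %d attempts" % tries)
```

Passing the sleep function as a parameter is a design choice for the sketch; it lets a caller or test observe the delays without actually waiting.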
03:55 PM RADOS Feature #8141 (New): Nice if we had a state for when a pg can't recover because all missing objec...

I put a pg into the following state by taking down 2 OSDs at just the right time after peering but before recovery ...
David Zafman
02:50 PM Bug #8091 (Resolved): osd/SnapMapper.cc: 217: FAILED assert(r == -2)
Sage Weil
02:43 PM Revision e3233927 (ceph): Pass -D flag to teuthology report
Fixes an issue where tests run on old teuthology branches that died for
uncommon reasons were not being marked as dea...
Zack Cerza
02:28 PM Revision 40e8dbbb (ceph): mon: EBUSY instead of EAGAIN when pgs creating
In 69321bf, EAGAIN changed behaviour to block indefinitely
rather than returning to user. Change the return for
`osd...
John Spray
02:23 PM Bug #8139 (Fix Under Review): osd/osd_types.cc: 398: FAILED assert(m_seed < old_pg_num)
Sage Weil
01:20 PM Bug #8139 (Resolved): osd/osd_types.cc: 398: FAILED assert(m_seed < old_pg_num)
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-15_02:30:04-rados-firefly-distro-basic-plana/193012... Sage Weil
01:54 PM Bug #8124: too many hitset objects preventing full state from ending
Samuel Just
01:17 PM Bug #8048 (Resolved): osd/ReplicatedPG: FAILED assert(!parent->get_log().get_missing().is_missing...
Samuel Just
01:11 PM Bug #8048 (Fix Under Review): osd/ReplicatedPG: FAILED assert(!parent->get_log().get_missing().is...
Sage Weil
01:16 PM Bug #8132 (Resolved): mon: no leader commands before first election
Sage Weil
09:06 AM Bug #8132: mon: no leader commands before first election
... Sage Weil
09:06 AM Bug #8132 (Resolved): mon: no leader commands before first election
got EINVAL on pool create from leader mon who had just started and was starting its election:
ubuntu@teuthology:/a...
Sage Weil
01:10 PM Bug #8099: LibRBD.DiffIterateStress failure - extra extent in diff
Possibly related to #8091 David Zafman
12:48 PM Bug #8086 (Resolved): FDCache::clear failed assert
Sage Weil
12:13 PM Bug #8086 (Fix Under Review): FDCache::clear failed assert
Samuel Just
12:12 PM Bug #8086: FDCache::clear failed assert
Samuel Just
12:40 PM Tasks #7864: please clarify copyright and the license
Also please clarify version of CC-BY-SA license used for files in /doc and /man.
Which particular version of the l...
Dmitry Smirnov
11:55 AM Feature #7873 (Fix Under Review): pg query: dump peer_info, peer_missing in all states
David Zafman
11:53 AM Bug #8138 (Won't Fix): Make PG repair safe by requiring force flag to repair an ambiguous situation

be_select_auth_object() should have a force flag and not arbitrarily use the first shard as the authoritative objec...
David Zafman
11:19 AM Bug #8103: pool has too few PGs warning misleading when using cache pools
Mark Nelson wrote:
> It seems like there may be other situations where this is misleading too. Say if you have many ...
Sage Weil
10:53 AM Bug #8130 (Resolved): Objecter: resending Ops to wrong target
Sage Weil
10:47 AM Bug #8130 (Fix Under Review): Objecter: resending Ops to wrong target
Sage Weil
09:37 AM Bug #8130 (In Progress): Objecter: resending Ops to wrong target
this is affecting master now too:
teuthology-2014-04-16_02:30:03-rados-master-testing-basic-plana has many failures
Sage Weil
10:12 AM Bug #8133: "Segmentation fault" in upgrade:dumpling-x:parallel-firefly---basic-plana suite
FYI - Manual re-run did not produce errors. Yuri Weinstein
09:27 AM Bug #8133 (Duplicate): "Segmentation fault" in upgrade:dumpling-x:parallel-firefly---basic-plana ...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-16_19:33:25-upgrade:dumpling-x:parallel-firefly---... Yuri Weinstein
10:05 AM CephFS Bug #7966 (Resolved): ceph-mds respawn doesn't always work
Sage Weil
10:00 AM Bug #8043 (Resolved): until we fix it more better, we should disallow split on cache pools
Sage Weil
09:04 AM Feature #7784 (In Progress): mon osd down out interval = 0 should prevent ceph health from report...
Joao Eduardo Luis
08:40 AM Bug #8066: osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + firefly)
ubuntu@teuthology:/a/teuthology-2014-04-15_22:35:26-upgrade:dumpling-x:stress-split-firefly-distro-basic-vps/196331 Sage Weil
04:57 AM Revision 2e375b6f (ceph): Merge pull request #1675 from guangyy/wip-bench
Make rados/rest bench work for multiple write instances without metadata conflict.
Reviewed-by: Greg Farnum <greg@in...
Gregory Farnum
02:43 AM Revision f22e2e9a (ceph): spelling corrections
Dmitry Smirnov
01:16 AM Revision 75a5bd5d (ceph): Merge pull request #1681 from ceph/wip-8043
mon/OSDMonitor: require force argument to split a cache pool
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
01:13 AM Revision 6d58e3c9 (ceph): Merge pull request #1682 from ceph/wip-8020
OSD: split pg stats during pg split
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:10 AM Revision 18caa1cd (ceph): OSD: split pg stats during pg split
Fixes: #8020
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
01:08 AM Revision 5e4a5dc6 (ceph): osd_types::osd_stat_sum_t: fix floor for num_objects_omap
Introduced in a130a4452e4fb159dc62fb417077d98dc9ebd621
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just

04/16/2014

10:09 PM Revision a3d759eb (ceph): Merge branch 'wip-8100'
Reviewed-by: Mark Nelson <mark.nelson@inktank.com> David Zafman
10:06 PM Revision a3d452ac (ceph): common/obj_bencher: Fix error return check from read that is negative o...
Fixed read return value in d99f1d9f68db41231e0ffff4082b05d6d095c231
Fixes: #8100
Signed-off-by: David Zafman <david...
David Zafman
09:45 PM Bug #8130 (Resolved): Objecter: resending Ops to wrong target
From teuthology:/a/gregf-2014-04-16_12:06:55-rados:thrash-wip-fast-dispatch-testing-basic-plana
Note how it marks_...
Greg Farnum
09:35 PM Bug #8048 (In Progress): osd/ReplicatedPG: FAILED assert(!parent->get_log().get_missing().is_miss...
Sage Weil
08:21 PM Revision 4b9202bc (ceph): Update to use psutil 2.x API
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
08:10 PM Feature #7873 (In Progress): pg query: dump peer_info, peer_missing in all states
peer_info is already output in the "query" in ReplicatedPG::do_command()
I've added the peer_missing output right af...
David Zafman
06:49 PM Revision 24da7d0c (ceph): Merge pull request #1680 from ceph/wip-7786
civetweb: update subproject Sage Weil
06:29 PM Bug #8113: agent_work can be continuously rescheduled during recovery while most objects are missing
this is probably just a matter of subtracting num_missing from num_flushable? Sage Weil
06:12 PM Bug #8020 (Resolved): evenly split stats on split
Sage Weil
06:08 PM Revision 4db1984c (ceph): osd/ReplicatedPG: add missing whitespace in debug output
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
05:25 PM Bug #7891: osd: leaked pg refs on shutdown
ubuntu@teuthology:/a/teuthology-2014-04-15_02:30:04-rados-firefly-distro-basic-plana/193023 but no debug patch applied Sage Weil
03:10 PM Bug #8100 (Resolved): Rados Bench seq read errors on tiered configuration
a3d452acdf2fcf9ad10002c5f24c2548d12952bd David Zafman
02:14 PM Bug #8100: Rados Bench seq read errors on tiered configuration
David Zafman
01:52 PM Bug #8100: Rados Bench seq read errors on tiered configuration
Through some bisecting and a well-informed guess by Yehuda, it appears that this is being caused by d99f1d9f. Mark Nelson
02:45 PM Revision 089dda15 (ceph): Optionally use civetweb instead of apache
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
02:45 PM Revision 761d7693 (ceph): Don't run apache functions if not using apache
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
02:45 PM Revision 8b93c03f (ceph): Generate subtasks instead of copy/pasting them
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
01:06 PM Bug #8124 (Resolved): too many hitset objects preventing full state from ending
Unclear how, but the pg appears to have around 17 hitset objects which ends up causing the pg to be unable to evict e... Samuel Just
12:32 PM Bug #8123 (Can't reproduce): OSD: received operation against clone which was not backfilled (but ...
Let's start by saying I saw this on my wip-fast-dispatch branch, so it *could* be a bug there rather than in master. ... Greg Farnum
11:49 AM rgw Bug #7786 (Resolved): civetweb segfaults with file uploads larger than 2GB
Sage Weil
11:17 AM Linux kernel client Bug #8122: bogus mount error "mount: error writing /etc/mtab: Invalid argumentnothing was mounted"
This is on 0.79. Dmitry Smirnov
11:16 AM Linux kernel client Bug #8122 (Duplicate): bogus mount error "mount: error writing /etc/mtab: Invalid argumentnothing...
mount.ceph prints the following error every time I mount a Ceph file system:... Dmitry Smirnov
10:48 AM Bug #8121 (Resolved): ReplicatedBackend::build_push_op() should handle a short read or assert

I noticed that the existing code in build_push_op() may not handle all scenarios of fixing data_included interval w...
David Zafman
10:36 AM Bug #7892 (Duplicate): osd/ReplicatedPG.cc: 7881: FAILED assert((data_included.empty() && data.le...

There were 2 identical crashes. This is the trace of one of them:
object: cecc4d22/plana9117053-25/8d//3
pg: 3...
David Zafman
10:15 AM Bug #8103: pool has too few PGs warning misleading when using cache pools
It seems like there may be other situations where this is misleading too. Say if you have many mostly empty pools and... Mark Nelson
10:11 AM Bug #8036: leveldb: throws std::bad_alloc on 14.04
Have been spending a fair amount of time trying to figure out what may have gone wrong with this (and #8067, which ap... Joao Eduardo Luis
09:45 AM devops Feature #8120 (Resolved): RHEL7 GA kernel build
The new RC is out so we need to rebuild the kernel packages again... Neil Levine
09:43 AM Feature #7784: mon osd down out interval = 0 should prevent ceph health from reporting ok
Mapping a config option to a map flag is not an intuitive thing to do or to expect. What if the user injects a diffe... Joao Eduardo Luis
07:11 AM Revision 924064f8 (ceph): mds: dynamically adjust priority of committing dirfrags
Adjust priority of committing dirfrags according to number of
expiring log segments. The more expiring log segments, ...
Yan, Zheng
05:51 AM Revision 0640a085 (ceph): mds: fix cap revoke confirmation
when the _revokes list is emptied, it doesn't mean that client has
released the revoking caps. It's possible that cli...
Yan, Zheng
01:28 AM Revision 8c7a5ab8 (ceph): Use string instead of char* when saving arguments for rest-bench
Guang Yang
01:21 AM CephFS Bug #8118 (Closed): MDS crashes
Active MDS crashes (v0.79).
log file attached.
Host did not run out of memory; standby MDS took over successfully....
Kenneth Waegeman

04/15/2014

10:37 PM Revision 0d2177a1 (ceph): ReplicatedPG::get_snapset_context: assert snap obj is not missing
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:57 PM Revision 015df934 (ceph): mon/OSDMonitor: require force argument to split a cache pool
There are several perils when splitting a cache pool:
- split invalidates pg stats, which disables the agent
- a s...
Sage Weil
07:28 PM Revision 823219bb (ceph): Don't pass apache's config to radosgw
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
06:53 PM devops Feature #7716 (In Progress): Build debug packages for EL6
I believe this was caused by redhat-rpm-config not being installed on the centos gitbuilder used for release builds (... Sandon Van Ness
06:12 PM Revision 12af2abb (ceph): Rename some functions and variables
This is to make the refactoring a little smoother and easier to read.
Signed-off-by: Zack Cerza <zack.cerza@inktank....
Zack Cerza
05:32 PM Revision c2523458 (ceph): osd: OSDMap: have osdmap json dump print valid boolean instead of string
Fixes: 8108
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
03:58 PM Revision f82f6637 (ceph): Fix all but one of the PEP-8 issues
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
02:58 PM Bug #8114 (Can't reproduce): "osd/RadosModel.h: 1055: FAILED assert" in upgrade:dumpling-x:stress...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-13_22:35:20-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
02:52 PM Bug #8103: pool has too few PGs warning misleading when using cache pools
Given that this is a transient issue for a new, empty cluster, I'm not sure if it is worth making an exception for th... Sage Weil
02:20 PM Bug #8091: osd/SnapMapper.cc: 217: FAILED assert(r == -2)
recover_replicas can cause us to read the snapset from an obsolete snapdir or head object. recover_replicas should n... Samuel Just
02:08 PM devops Tasks #7678 (Resolved): f20 Jenkins slave
Ian Colle
02:07 PM devops Feature #6020 (Fix Under Review): radosgw-apache opinionated package
Ian Colle
01:56 PM Bug #8080 (Resolved): objecter: linger ops don't pay attention to cache overlay
Sage Weil
01:56 PM Bug #8043 (Fix Under Review): until we fix it more better, we should disallow split on cache pools
Sage Weil
01:46 PM Bug #8098 (Can't reproduce): ceph v0.79-125 : Random osd's are flapping too frequently : OSD wron...
Thanks for the follow-up. Please let us know if you can figure out how to reproduce the problem, or can gather more ... Sage Weil
02:24 AM Bug #8098: ceph v0.79-125 : Random osd's are flapping too frequently : OSD wrongly marked me down
Thanks for your interest, Sage. This morning the cluster looks normal and healthy. Unfortunately I do not have syste... karan singh
01:10 PM devops Fix #8109: OSD-disk fails to activate when final mount dir is not empty and shows no proper error...
the other way to solve that is just to change
elif os.listdir('/var/lib/ceph/osd/{cluster}-{osd_id}'.format(
clu...
Alphe Salas
06:59 AM devops Fix #8109 (Closed): OSD-disk fails to activate when final mount dir is not empty and shows no pro...
OSD-disk fails to activate when STATEDIR + '/osd/{cluster}-{osd_id}'.format(cluster=cluster,osd_id=osd_id) is not emp... Alphe Salas
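The behaviour requested in #8109, a clear error when the final mount directory is not empty, could look roughly like the sketch below. It is not the ceph-disk code: the function name, the error message, and the injectable listdir parameter are hypothetical, and only the idea of checking os.listdir() on the target directory comes from the report.

```python
import os


def ensure_mount_dir_empty(path, listdir=os.listdir):
    """Raise a descriptive error if the target mount directory has entries.

    Hypothetical helper for illustration; 'path' stands in for the
    STATEDIR + '/osd/{cluster}-{osd_id}' directory named in the report.
    """
    entries = listdir(path)
    if entries:
        raise RuntimeError(
            "mount directory %s is not empty (found %d entries); "
            "refusing to activate" % (path, len(entries))
        )
```

Failing early with the directory name in the message addresses the "shows no proper error" part of the report.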
01:03 PM Feature #7547: Basic docs for Cache Tiering functionality
Ian Colle
01:01 PM Feature #7940 (Resolved): add pool snaps to ceph_test_rados
Samuel Just
01:00 PM Feature #7831 (Resolved): OSD: track objects with omap entries and don't count toward caps
Samuel Just
12:58 PM Feature #8041 (Resolved): ceph uses GCC-specific strerror_r; easy to make more portable
d0a7632a31258d0963dc5d4cf7502905cc8abfe7, merged into master at ab4a35f75eac161d2509e9e34853942fcc4ed6bb Dan Mick
12:58 PM Bug #7588: OSD Seg fault in string assign ObjectOperation::C_ObjectOperation_copyget::finish()
Samuel Just
12:52 PM Bug #7588: OSD Seg fault in string assign ObjectOperation::C_ObjectOperation_copyget::finish()
It's because we reset the tid on redirected ops and the op_cancel in ReplicatedPG therefore fails to work. Samuel Just
11:43 AM Bug #7588: OSD Seg fault in string assign ObjectOperation::C_ObjectOperation_copyget::finish()
...I just hit this on 3 osds at the same time Samuel Just
10:56 AM Bug #7588: OSD Seg fault in string assign ObjectOperation::C_ObjectOperation_copyget::finish()
Samuel Just
10:50 AM Bug #7588: OSD Seg fault in string assign ObjectOperation::C_ObjectOperation_copyget::finish()
Saw this pop up once on my fast dispatch branch:
/a/gregf-2014-04-14_16:40:42-rados:thrash-wip-fast-dispatch-testing...
Greg Farnum
12:44 PM Bug #8100: Rados Bench seq read errors on tiered configuration
It's all automated, though I did try manually testing reads from the command line as well. FWIW, with debugging enab... Mark Nelson
10:10 AM Bug #8100: Rados Bench seq read errors on tiered configuration
Did you check for typos? :p Right pool name? That "-3" looks easy to get wrong. Greg Farnum
10:08 AM Bug #8100: Rados Bench seq read errors on tiered configuration
This appears to be happening on non-tiered pools as well, regardless of whether erasure coding or replication is used. Mark Nelson
11:33 AM Bug #8113 (Resolved): agent_work can be continuously rescheduled during recovery while most objec...
We probably need to detect when we've gone through the entire hash space without starting anything. Samuel Just
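The detection idea in the comment above can be sketched roughly as follows. This is only an illustration in Python, not the OSD code: `scan_once`, `try_start`, and `batch` are hypothetical names, and the "hash space" is modeled as a plain circular list.

```python
def scan_once(items, cursor, try_start, batch=2):
    """Scan a circular sequence starting at `cursor`.

    Stops after `batch` successful starts or after one full wrap.
    Returns (started, next_cursor, wrapped_idle), where wrapped_idle is
    True when the whole space was visited without starting anything.
    """
    n = len(items)
    started = 0
    pos = cursor
    for _ in range(n):
        if try_start(items[pos]):
            started += 1
        pos = (pos + 1) % n
        if started >= batch:
            break
    # The loop only ends early after a successful start, so zero starts
    # means every position was visited exactly once.
    wrapped_idle = started == 0
    return started, pos, wrapped_idle
```

A caller could use `wrapped_idle` as the signal to back off instead of rescheduling immediately.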
11:28 AM Revision aa6df59e (ceph): mds: Fix respawn (add path resolution)
Previously assumed that ceph-mds executable was in
PWD - now use /proc/self/exe to find the
executable wherever it m...
John Spray
11:19 AM rgw Documentation #8112 (Resolved): radosgw usage and manpage need updating
While working on #7933, I noticed a large gap in what flags our tests are passing to radosgw, and what flags are docu... Zack Cerza
10:54 AM devops Bug #5193 (Resolved): RHEL6 does not ship with xfsprogs
Sandon Van Ness
10:54 AM Bug #8077 (Resolved): osd/ReplicatedPG.cc: 10862: FAILED assert(r >= 0) (agent_load_hit_sets)
Samuel Just
10:54 AM Bug #8063 (Resolved): LibRadosTwoPoolsECPP.PromoteSnap got EAGAIN
Samuel Just
10:54 AM Bug #8081 (Resolved): hitset-get on missing object fails
Samuel Just
10:53 AM Bug #8085 (Resolved): osd/PG.cc: 2218: FAILED assert(!actingbackfill.empty()) from do_command
Samuel Just
10:53 AM Bug #7997 (Resolved): handle_get_version returns old map epochs
Samuel Just
10:52 AM Bug #8008 (Resolved): osd/ReplicatedPG.cc: 258: FAILED assert(missing_loc.needs_recovery(hoid)) d...
Samuel Just
10:51 AM Bug #8089 (Resolved): osd: ENOENT on cache-evict
Samuel Just
10:51 AM Bug #8086 (Resolved): FDCache::clear failed assert
Samuel Just
10:34 AM Bug #8108: OSD json output uses strings for booleans
... Joao Eduardo Luis
10:32 AM Bug #8108: OSD json output uses strings for booleans
pull request: https://github.com/ceph/ceph/pull/1678 Joao Eduardo Luis
09:54 AM Bug #8108 (Fix Under Review): OSD json output uses strings for booleans
wip-8108 Joao Eduardo Luis
06:56 AM Bug #8108 (Resolved): OSD json output uses strings for booleans
The output for osd stat using JSON format uses strings for booleans:... Alfredo Deza
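A short Python sketch of why string-typed booleans are awkward for consumers of the JSON output. The payloads and the `full` / `nearfull` keys are hypothetical stand-ins; the actual `osd stat` output is elided in the entry above.

```python
import json

# Hypothetical payloads, not the real command output.
stringly = json.loads('{"full": "false", "nearfull": "false"}')
typed = json.loads('{"full": false, "nearfull": false}')

# Any non-empty string is truthy, so the string "false" evaluates as True.
print(bool(stringly["full"]))  # True
print(bool(typed["full"]))     # False


def as_bool(value):
    """Coerce a JSON value that is either a real boolean or "true"/"false"."""
    if isinstance(value, bool):
        return value
    if isinstance(value, str) and value.lower() in ("true", "false"):
        return value.lower() == "true"
    raise ValueError(f"not a boolean: {value!r}")
```

A helper like `as_bool` lets a client tolerate both encodings while a fix is rolled out.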
09:30 AM rgw Bug #8111 (Resolved): /etc/init.d/ceph-radosgw for RHEL needs QA
During testing of the installation procedure for RHEL, I found it possible to start ceph-radosgw in user space, but I ... John Wilkins
09:11 AM rgw Tasks #8110 (Resolved): rgw: diagram for rgw notifications (zone object sync)
Yehuda Sadeh
09:08 AM rgw Documentation #7434 (Fix Under Review): rgw: doc user/group quota
Ian Colle
08:30 AM Revision f6db1bc2 (ceph): mds: share max size to client who is allowed for WR cap
WR cap is allowed for the loner client when filelock is in excl->mix
state. MDS should share max size with the loner ...
Yan, Zheng
08:02 AM Revision 358bde5d (ceph): Add clone test on store_test
Signed-off-by: Haomai Wang <haomaiwang@gmail.com> Haomai Wang
07:48 AM Revision 308758b7 (ceph): Make rados/rest bench work for multiple write instances without metadat...
Signed-off-by: Guang Yang <yguang@yahoo-inc.com> Guang Yang
05:50 AM Bug #7159: ceph status --format=json num_in_osds and num_up_osds formatting not consistent
fixed by 790dda9c
do we need to backport it?
Joao Eduardo Luis
01:43 AM Bug #7710: Multiple rados bench instance will overwrite the metadata object
A new patch was submitted, please help to review - https://github.com/ceph/ceph/pull/1675 Guang Yang
01:30 AM CephFS Bug #8092: multimds ceph-fuse hang on write waiting for max size
Zheng Yan
12:21 AM Revision 4c015136 (ceph): Improve unlock error messages.
Added messages if the hostname is invalid, and if
the user is not the owner of the lock.
Fixes: 6295
Signed-off-by: ...
Warren Usui
12:13 AM Revision 908fa5ed (ceph): Merge pull request #1666 from ceph/wip-mds
Wip mds Zheng Yan

04/14/2014

11:55 PM Revision edda6f74 (ceph): ReplicatedPG::_rollback_to: fix comment, clone certainly could be missing
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:55 PM Revision d39e0031 (ceph): ReplicatedPG::cancel_flush: requeue dup_ops even if !op
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:55 PM Revision 3f7861e2 (ceph): ReplicatedPG::agent_work: skip if head is missing
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:55 PM Revision 43b7c3a2 (ceph): ReplicatedPG::finish_promote: soid.clone may have been trimmed, fix assert
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
11:55 PM Revision 9b7fa388 (ceph): ReplicatedPG::process_copy_chunk: don't check snaps if we got head
Even if we are promoting a clone, we may be reading from head.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:12 PM Revision 9f6f7d35 (ceph): Merge pull request #1673 from ceph/wip-stress-watch
ceph_test_stress_watch: test over cache pool
Reviewed-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgi...
Samuel Just
11:11 PM Revision 895b6d4d (ceph): Merge pull request #1667 from ceph/wip-8089
osd: fix dup request handling for ENOENT and cache ops
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:10 PM Revision 898ee489 (ceph): Merge pull request #1654 from ceph/wip-7940
Wip 7940
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:09 PM Revision cab29ac1 (ceph): Merge pull request #1664 from ceph/wip-8085
osd: handle misdirected pg command
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:07 PM Revision a2323a61 (ceph): Merge pull request #1660 from ceph/wip-hitset-missing
osd: handle hitset-get on a missing hit_set object
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:07 PM Revision eef2bf67 (ceph): Fix kvm issues for Trusty
Change kvm reference to qemu-system-x86_64 and use raw format
in kvm/qemu command. Tested on both Trusty and Precise...
Warren Usui
10:57 PM Revision 37ed4b60 (ceph): ceph_test_stress_watch: test over cache pool
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:36 PM Revision 4388d876 (ceph): Merge pull request #1661 from ceph/wip-objecter
objecter: make linger watch the correct pool/object
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Reviewed-by: Josh...
Josh Durgin
08:57 PM Revision ab4a35f7 (ceph): Merge pull request #1672 from ceph/wip-strerror
Use cpp_strerror() wherever possible, and use autoconf for portability
Reviewed-by: Josh Durgin <josh.durgin@inktank...
Josh Durgin
08:07 PM Revision d0a7632a (ceph): Use cpp_strerror() wherever possible, and use autoconf for portability
strerror_r is not portable; on Gnu libc it returns char * and sometimes
does not fill in the supplied buffer. Use au...
Dan Mick
08:04 PM devops Bug #8095: [chef] No version specified, and no candidate version available for libmpich2-3
Sandon found the reason this afternoon:
(05:05:46 PM) houkouonchi-home: dmick: https://launchpad.net/ubuntu/+sour...
Dan Mick
07:06 AM devops Bug #8095 (Resolved): [chef] No version specified, and no candidate version available for libmpic...
Some issues installing this package are making ceph-deploy tests fail.... Alfredo Deza
06:44 PM Revision 29d83fef (ceph): Merge pull request #1668 from ceph/wip-librados-tests
ceph_test_rados_api_*: fix build warnings and memset ranges
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
06:33 PM CephFS Bug #8092 (Need More Info): multimds ceph-fuse hang on write waiting for max size
Zheng Yan
06:08 PM Revision 502cc614 (ceph): ReplicatedPG::agent_work: skip hitset objects before getting object con...
Otherwise, we might read the attr on a hitset object we are in the
process of deleting.
Fixes: #8086
Signed-off-by: ...
Samuel Just
05:55 PM Revision ba74f907 (ceph): Merge pull request #25 from ceph/wip-fastdumpemp-wusui
Reduce runs in the dumpling-emperor upgrade suite. Zack Cerza
05:07 PM Revision f6d1d922 (ceph): pipe: only read AuthSessionHandler under pipe_lock
session_security, the AuthSessionHandler for a Pipe, is deleted and
recreated while the pipe_lock is held. read_messa...
Josh Durgin
05:07 PM Revision 27d6032e (ceph): auth: add rwlock to AuthClientHandler to prevent races
For cephx, build_authorizer reads a bunch of state (especially the
current session_key) which can be updated by the M...
Josh Durgin
05:06 PM Bug #8104 (Resolved): OSD: changing min_size to larger than the acting set does not make the PG g...
It looks like maybe we can only change into the Inactive state when going through peering, and changing the min_size ... Greg Farnum
05:06 PM Bug #8103 (Won't Fix): pool has too few PGs warning misleading when using cache pools
When using cache pools on a fresh filesystem, quickly the number of objects in the cache pool can greatly exceed the ... Mark Nelson
05:04 PM Revision 25c32482 (ceph): osdc/ObjectCacher: back off less during flush
In cce990efc8f2a58c8d0fa11c234ddf2242b1b856 we added a limit to avoid
holding the lock for too long. However, if we ...
Sage Weil
04:54 PM Revision 2b4b00b7 (ceph): pipe: only read AuthSessionHandler under pipe_lock
session_security, the AuthSessionHandler for a Pipe, is deleted and
recreated while the pipe_lock is held. read_messa...
Josh Durgin
04:54 PM Revision c049967a (ceph): auth: add rwlock to AuthClientHandler to prevent races
For cephx, build_authorizer reads a bunch of state (especially the
current session_key) which can be updated by the M...
Josh Durgin
04:27 PM Revision 0d4aed88 (ceph): mon: fix Session ref leak in handle_get_version
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:20 PM CephFS Bug #8025: nfs-on-kclient: rm -r failed
yes, commit 85f6def97b75b840d6be97671cb8bacd2a074a24 should fix this Zheng Yan
01:37 PM CephFS Bug #8025: nfs-on-kclient: rm -r failed
This turned up again on yesterday's run: http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-13_23:01:11-knfs-mast... Greg Farnum
04:19 PM Revision d5634349 (ceph): doc: Distro specific rgw.conf example.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:19 PM Revision bd22cac9 (ceph): doc: Distro specific rgw.conf example.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:18 PM Revision 75606e98 (ceph): doc: Created standalone doc for Keystone Integration.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:18 PM Revision 6650c0e8 (ceph): doc: Added new docs to index.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:18 PM Revision 1310af23 (ceph): doc: Reworked the simple configuration guide to be more generic.
Changes include removing keystone and putting it into a separate document,
removing user config and putting it into a...
John Wilkins
04:16 PM Revision 6853d21a (ceph): doc: New admin guide for Ceph Object Gateway. Needs some clarification ...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:15 PM Revision e02b8458 (ceph): doc: Admin API usage for quotas. Needs additional clarification on syntax.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:56 PM Revision d0d162a5 (ceph): buffer: use Mutex instead of Spinlock for raw crcs
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:48 PM devops Bug #8017 (Resolved): Redhat Dependencies Unmet
All RHEL dependencies in jenkins and built. Sandon Van Ness
03:47 PM devops Bug #5193: RHEL6 does not ship with xfsprogs
And this can now be built in jenkins. Closing it out. Sandon Van Ness
03:45 PM Bug #8098: ceph v0.79-125 : Random osd's are flapping too frequently : OSD wrongly marked me down
This is usually caused by low memory leading to swapping. Can you verify the CPU and memory are not oversubscribed? Sage Weil
09:23 AM Bug #8098: ceph v0.79-125 : Random osd's are flapping too frequently : OSD wrongly marked me down
I have been searching the internet and found that similar bugs have come up in the past as well, and fixes wer... karan singh
09:19 AM Bug #8098 (Can't reproduce): ceph v0.79-125 : Random osd's are flapping too frequently : OSD wron...
Hello Developers
I have been facing a weird problem with my Ceph cluster
Problem: OSDs are flapping randomly, ...
karan singh
02:48 PM Bug #8086: FDCache::clear failed assert
wip-8086 appears to have solved this. Thanks Sam! Mark Nelson
11:08 AM Bug #8086: FDCache::clear failed assert
Samuel Just
09:27 AM Bug #8086: FDCache::clear failed assert
On further review, this seems to be happening when an erasure coded base pool (that has an associated writeback cache... Mark Nelson
02:17 PM Bug #8102 (Duplicate): OSD json output mixes ints with strings
Oh, this was already input (and fixed) as #7159 Greg Farnum
01:56 PM Bug #8102 (Duplicate): OSD json output mixes ints with strings
This is unexpected output from checking the OSD status in JSON format:... Alfredo Deza
02:14 PM Linux kernel client Feature #190 (Fix Under Review): krbd: DISCARD support
Josh Durgin
01:50 PM RADOS Feature #8101 (New): Quickly delete base pool and cache pool without removing overlay
Right now it's rather cumbersome to delete pools that have a cache tier. Simply removing them doesn't work (as we in... Mark Nelson
01:41 PM Bug #8100 (Resolved): Rados Bench seq read errors on tiered configuration
Attempting to perform the following rados bench seq test on a tiered pool setup that had data written to it via rados... Mark Nelson
11:39 AM rgw Bug #7450 (Resolved): "radosgw-admin key create" ignores specified access key when subuser specified
commit:c09f58ef81db9f6dbd528b2aa4c2f135aa6d262e Josh Durgin
11:28 AM rbd Bug #7165 (Resolved): high ping latency in vm using rbd writeback cache in qemu 1.7 with async flush
From the logs it was caused by the ObjectCacher's flusher thread holding the lock too long. Backported the fixes for ... Josh Durgin
11:25 AM rbd Bug #6480 (Resolved): librbd crashed qemu-system-x86_64
Thanks for the update Mike! Backported the fixes to dumpling (commit:2b4b00b76b245b1ac6f95e4537b1d1a4656715d5 commit:... Josh Durgin
09:19 AM rbd Bug #6480: librbd crashed qemu-system-x86_64
I've run wip-6480-0.67.7 across our infrastructure for 10 days now without any qemu crashes. Thanks Josh! Mike Dawson
10:57 AM Bug #8099 (Duplicate): LibRBD.DiffIterateStress failure - extra extent in diff
rbd's diff iterate stress test failed: http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-13_23:00:01-rbd-master-... Josh Durgin
10:36 AM rbd Bug #7310 (Can't reproduce): rbd snap unprotect test failing
This hasn't occurred again afaict. Josh Durgin
09:52 AM Revision 64cd332e (ceph): Merge pull request #1622 from dachary/wip-mailmap
mailmap updates
Reviewed-By: Christophe Courtaut <christophe.courtaut@gmail.com>
Loïc Dachary
09:39 AM CephFS Bug #8004 (Resolved): LibCephFS.HardlinkNoOriginal hang
Zheng Yan
09:31 AM Revision 7c17fc4a (ceph): mds: don't modify inode when calculating client ranges
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
09:27 AM Bug #7997 (Fix Under Review): handle_get_version returns old map epochs
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-14_02:30:16-rados-master-testing-basic-plana/190255 Sage Weil
08:39 AM Revision 3dd88006 (ceph): Merge pull request #1669 from ceph/wip-client-debug
client: print inode max_size Zheng Yan
08:38 AM Revision 65ec24e3 (ceph): client: print inode max_size
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
08:22 AM Messengers Bug #8097 (Resolved): msgr: mon stuck in set_crc()
... Sage Weil
07:54 AM Bug #8069: ceph-deploy README suggests questionable SSH parameters
Thank you! Florian Haas
07:10 AM Bug #8069 (Resolved): ceph-deploy README suggests questionable SSH parameters
Updated and pushed to master ceph-deploy c5fc7ad438cf276f4fb566eb4572b70057e7bb52 Alfredo Deza
07:18 AM CephFS Fix #8094: MDS: be accurate about stats in check_rstats
Zheng Yan
07:03 AM CephFS Fix #8094 (Resolved): MDS: be accurate about stats in check_rstats
We see occasional crashes in CDir::check_rstats function, on... Greg Farnum
07:04 AM CephFS Bug #8090: multimds: mds crash in check_rstats
I made a fix ticket for that: http://tracker.ceph.com/issues/8094 Greg Farnum
06:54 AM CephFS Bug #8090: multimds: mds crash in check_rstats
It's a CDir::check_rstats() bug rather than rstats corruption. Zheng Yan
04:59 AM Revision d6c71b76 (ceph): osd/ReplicatedPG: add missing whitespace in debug output
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:37 AM Revision 171d5c50 (ceph): ceph_test_rados_api_*: fix build warnings, memset ranges
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:33 AM Revision 8905e3e2 (ceph): osd/ReplicatedPG: handle dup ops earlier in do_op
Currently the dup op checks happen in execute_ctx, long after we handle
cache ops or get the obc and (potentially) retu...
Sage Weil

04/13/2014

10:12 PM CephFS Bug #8092 (Resolved): multimds ceph-fuse hang on write waiting for max size
... Sage Weil
10:10 PM Bug #8091 (Resolved): osd/SnapMapper.cc: 217: FAILED assert(r == -2)
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-13_09:43:35-rados:thrash-testing-testing-basic-plana/1892... Sage Weil
10:08 PM Bug #8089 (Fix Under Review): osd: ENOENT on cache-evict
Sage Weil
03:32 PM Bug #8089 (Resolved): osd: ENOENT on cache-evict
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-13_09:43:35-rados:thrash-testing-testing-basic-plana/1891... Sage Weil
09:47 PM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-13_09:43:35-rados:thrash-testing-testing-basic-plana/189348 Sage Weil
09:42 PM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-13_09:43:35-rados:thrash-testing-testing-basic-plana/189166 Sage Weil
09:45 PM CephFS Bug #8090 (Duplicate): multimds: mds crash in check_rstats
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-11_23:01:29-multimds-master-testing-basic-plana/187... Sage Weil
08:11 PM Feature #8088 (Closed): Make FileStore::_do_copy_range use fiemap
Haomai Wang
02:20 AM Feature #8088: Make FileStore::_do_copy_range use fiemap
I noticed the discussion at http://www.spinics.net/lists/ceph-devel/msg06858.html and http://tracker.ceph.com/issues/2535.
...
Haomai Wang
12:37 AM Feature #8088: Make FileStore::_do_copy_range use fiemap
It can also reduce the time spent in "_set_replay_guard", because "_set_replay_guard" issues two fsync calls which are aff... Haomai Wang
02:11 PM Bug #7892: osd/ReplicatedPG.cc: 7881: FAILED assert((data_included.empty() && data.length() == 0)...
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-13_09:43:35-rados:thrash-testing-testing-basic-plana/189080 Sage Weil
12:38 PM Revision 26659a5a (ceph): mds: don't issue/revoke caps before client has caps
If early reply is not allowed, MDS does not send reply to client immediately
after Locker::issue_new_caps adds new ca...
Yan, Zheng
12:33 PM Revision bd8aa6f4 (ceph): mds: do file recover after authpin inode
MDCache::do_file_recover may call Locker::eval_gather, which may change
filelock to stable state. So we should authpi...
Yan, Zheng
05:23 AM Revision a72bcddf (ceph): osd/ReplicatedPG: handle misdirected do_command
We can get a query on a pg we still have but are no longer primary for. If
that happens, do not reply. The client w...
Sage Weil

04/12/2014

11:07 PM Feature #8088: Make FileStore::_do_copy_range use fiemap
https://github.com/ceph/ceph/pull/1665 Haomai Wang
10:43 PM Feature #8088 (Closed): Make FileStore::_do_copy_range use fiemap
In rbd usage, a large volume with many objects will perform poorly for a long time after creating... Haomai Wang
10:29 PM Linux kernel client Bug #8087 (Resolved): libceph: null deref in osd_reset -> __reset_osd -> __remove_osd
I thought there was an open bug for this but I couldn't find it.... Sage Weil
01:40 PM rgw Bug #7799: Errors in upgrade:dumpling-x:stress-split-firefly---basic-plana suite
I don't see those errors in latest runs. Yuri Weinstein
12:21 PM Bug #8086: FDCache::clear failed assert
You'll probably have to reproduce with debug filestore = 20 and debug osd = 20. Samuel Just
12:15 PM Bug #8086: FDCache::clear failed assert
Note: of the 6 OSDs in the cache tier, 5 appeared to fail with a similar stacktrace. Mark Nelson
12:10 PM Bug #8086 (Resolved): FDCache::clear failed assert
Hit this today during tiering performance testing with EC backend. OSD is on an SSD that is part of the cache tier a... Mark Nelson
10:51 AM Bug #8085 (Resolved): osd/PG.cc: 2218: FAILED assert(!actingbackfill.empty()) from do_command
... Sage Weil
06:14 AM Revision 4ccc845c (ceph): mds: fix typo in Server::do_rename_rollback()
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
05:33 AM Revision 93b933fb (ceph): Merge pull request #1659 from ceph/wip-8054
mds: finish table servers recovery after creating newfs
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
02:06 AM Bug #8083 (Need More Info): erasure-code: fix static code analysis errors found in gf-complete
Loïc Dachary
01:47 AM Bug #8083: erasure-code: fix static code analysis errors found in gf-complete
"static overrun":https://github.com/ceph/gf-complete/blob/e1dcbfaa9bac94fd48e2c3f2fb8e133720e57359/src/gf_w8.c#L1267
...
Loïc Dachary
01:44 AM Bug #8083: erasure-code: fix static code analysis errors found in gf-complete
The reason for "the deadcode":https://github.com/ceph/gf-complete/blob/e1dcbfaa9bac94fd48e2c3f2fb8e133720e57359/src/g... Loïc Dachary
01:20 AM Bug #8083 (Resolved): erasure-code: fix static code analysis errors found in gf-complete
... Loïc Dachary
01:15 AM Bug #7914 (Resolved): osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length f...
The workload ran 30 more times without errors. The loop was interrupted and the plana machines released. Loïc Dachary
12:46 AM Revision 809d0fab (ceph): osd/ReplicatedPG: handle missing hit_set on HITSET_GET rados op
Fixes: #8081
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:33 AM Revision d72602f6 (ceph): Merge pull request #1655 from ceph/wip-8077
osd: handle missing hit_set objects in agent_load_hit_sets
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:11 AM Revision b31107b7 (ceph): test/librados/TestCase: add Param option that can set up a cache pool
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:11 AM Revision 53a4b719 (ceph): ceph_test_rados_api_watch_notify: test over cache pool
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil

04/11/2014

11:22 PM Revision 6e0496b9 (ceph): librados: Rollback interface additions
Add C interface rados_ioctx_snap_rollback() and indicate that rados_rollback()
is deprecated.
Add C++ interface IoCt...
David Zafman
11:22 PM Revision b6b0c3bd (ceph): librados: Add ObjectWriteOperation::snap_rollback() for pool snapshots
snap_rollback() is the same as selfmanaged_snap_rollback() but we want an
independent interface for pool snapshots. ...
David Zafman
11:22 PM Revision 09b5538c (ceph): test: Fix ceph_test_rados to not core dump with invalid arguments
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
11:22 PM Revision 3ce40780 (ceph): test: Add --pool-snaps option to ceph_test_rados
Fixes: #7940
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
10:50 PM Revision 3853cc90 (ceph): Merge pull request #1658 from ceph/wip-8008
osd: fix repair_object
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:50 PM Revision 2e40afcd (ceph): Merge pull request #1657 from ceph/wip-8063
ceph_test_rados_api_tier: fix scrub test
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:39 PM Revision 6ff645f5 (ceph): osd/PG: fix repair_object when missing on primary
If the object is missing on the primary, we need to fully populate the
missing_loc.needs_recovery_map. This broke wi...
Sage Weil
10:32 PM CephFS Bug #8054 (Resolved): multimds hang on fsstress
Sage Weil
10:34 AM CephFS Bug #8054 (In Progress): multimds hang on fsstress
Reverted the commit; it breaks MDS startup in the trivial single-MDS case. Sage Weil
06:55 AM CephFS Bug #8054 (Resolved): multimds hang on fsstress
Sage Weil
10:12 PM Bug #8044: osd/ReplicatedPG.cc: 2276: FAILED assert(p != snapset.clones.end())
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-11_15:47:29-rados:thrash-testing-testing-basic-plana/185859 Sage Weil
05:57 PM Bug #8044: osd/ReplicatedPG.cc: 2276: FAILED assert(p != snapset.clones.end())
Sage Weil
09:48 PM Revision 19acfebc (ceph): ceph_test_librados_tier: tolerate EAGAIN from pg scrub command
We may get EAGAIN if the osd happens to be down, for example due to
thrashing. Try a few times and then give up.
No...
Sage Weil
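The "try a few times and then give up" approach in the entry above can be sketched generically. This is a Python illustration only, not the test's C++ code: `retry_on_eagain`, `command`, and the default attempt count and delay are assumptions.

```python
import errno
import time


def retry_on_eagain(command, attempts=5, delay=1.0, sleep=time.sleep):
    """Call command() until it returns something other than -EAGAIN.

    `command` is a hypothetical callable returning 0 or a negative errno.
    The last result is returned, so the caller still sees -EAGAIN if
    every attempt failed.
    """
    result = -errno.EAGAIN
    for attempt in range(attempts):
        result = command()
        if result != -errno.EAGAIN:
            return result
        # Wait before the next attempt, but not after the final one.
        if attempt + 1 < attempts:
            sleep(delay)
    return result
```

Bounding the number of attempts keeps a test from hanging forever when the target never comes back up.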
09:44 PM Revision b1db0751 (ceph): Merge pull request #1656 from ceph/wip-osd-boot
mon: fix osd boot check
Reviewed-by: Greg Farnum <greg@inktank.com>
Gregory Farnum
09:32 PM Revision 39b9d9d8 (ceph): mon/OSDMonitor: fix osd epoch in boot check
This was introduced in 4c99e978a77a242e540cb8ccacb967d24322416c and was
incorrect; boot_epoch is the previous epoch t...
Sage Weil
08:14 PM Revision 78df66f5 (ceph): osd/ReplicatedPG: skip missing hit_sets when loading into memory
We weren't handling hit_sets that were missing.
Two changes here:
1- Load the hit_sets oldest to newest. That mean...
Sage Weil
07:23 PM Revision 7077438b (ceph): mds: finish table servers recovery after creating newfs
Fixes: #8054
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
06:12 PM Linux kernel client Bug #6809: 3.11 kernel panic: Workqueue: ceph-msgr con_work
Another instance of the crash in the last comment, on ubuntu 12.04 with xfs on top with a 3.13 kernel when osds were ... Josh Durgin
06:02 PM Bug #8082 (Duplicate): hung recovery
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-11_15:47:29-rados:thrash-testing-testing-basic-plana/1856... Sage Weil
05:57 PM Bug #8066: osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + firefly)
Sage Weil
05:56 PM Bug #8048: osd/ReplicatedPG: FAILED assert(!parent->get_log().get_missing().is_missing(soid))
Sage Weil
05:54 PM Bug #8080 (Fix Under Review): objecter: linger ops don't pay attention to cache overlay
Sage Weil
04:14 PM Bug #8080 (In Progress): objecter: linger ops don't pay attention to cache overlay
Sage Weil
04:07 PM Bug #8080 (Resolved): objecter: linger ops don't pay attention to cache overlay
See wip-objecter for an initial stab. Need to build out the tests now. Sage Weil
05:44 PM Bug #8081 (Resolved): hitset-get on missing object fails
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-11_15:47:29-rados:thrash-testing-testing-basic-plana/1857... Sage Weil
05:33 PM Revision 052519ed (ceph): Revert "mds: finish table servers recovery after creating newfs"
This reverts commit f6c20730c16a7632061639dd83be523fc6a9a44f.
This breaks single MDS startup.
Signed-off-by: Sage W...
Sage Weil
05:20 PM Revision f3745912 (ceph): Merge pull request #1650 from dachary/wip-erasure-code-doc
erasure-code: document the ruleset-root profile parameter
Reviewed-by: Mark Nelson <mark.nelson@inktank.com>
Loïc Dachary
05:08 PM Revision f12bccc9 (ceph): radosgw-admin: check params for object rewrite
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
(cherry picked from commit 28c716bccd955eaf68e9c739139c901ca4f419f7)
Yehuda Sadeh
05:08 PM Revision 9130e7d6 (ceph): radosgw-admin: new 'bucket rewrite' command
Iterates through objects and rewrites them.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
(cherry picked from com...
Yehuda Sadeh
05:08 PM Revision 6f2ee99a (ceph): radosgw-admin: add some conditions for bucket rewrite
--min-rewrite-size, --max-rewrite-size to specify object size
conditions
--start-date, --end-date to specify objec...
Yehuda Sadeh
05:08 PM Revision 82d8397a (ceph): rgw: update bucket / object rewrite
Get code up to date for firefly, need to pass bucket owner for quota
related functionality.
Signed-off-by: Yehuda Sa...
Yehuda Sadeh
05:04 PM Revision fcd94d6a (ceph): Merge pull request #1630 from ceph/wip-7450
Wip 7450
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
03:54 PM Bug #8043: until we fix it more better, we should disallow split on cache pools
Do you really expect cache pool targets to not be set based on the underlying storage capacity? (Hint: no.)
;)
Greg Farnum
03:40 PM Bug #8043: until we fix it more better, we should disallow split on cache pools
How about a flag in the pg info that prevents the agent from blocking on full until it sees the thresholds drop below... Sage Weil
03:04 PM Bug #8043: until we fix it more better, we should disallow split on cache pools
Requiring a scrub first means that any cache pool which is actually being used and gets split will fill up and block ... Greg Farnum
02:55 PM Bug #8043: until we fix it more better, we should disallow split on cache pools
Is the concern the need for scrub, or are there other issues?
We could make the (shallow) scrub triggered automati...
Sage Weil
03:48 PM Bug #7987: osd: backfill/recovery makes no progress
Those two pgs appeared to go clean from the osd log. Samuel Just
03:38 PM Bug #8008 (Fix Under Review): osd/ReplicatedPG.cc: 258: FAILED assert(missing_loc.needs_recovery(...
Sage Weil
02:58 PM Bug #8008 (In Progress): osd/ReplicatedPG.cc: 258: FAILED assert(missing_loc.needs_recovery(hoid)...
Sage Weil
03:33 PM Revision ec4a6ce0 (ceph): Merge pull request #1635 from ceph/wip-7437
Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <j...
David Zafman
03:25 PM rbd Feature #6934: Consistency check when importing RBD diff
related: http://wiki.ceph.com/Planning/Blueprints/Giant/Diff%3A_integrity___local_import Josh Durgin
02:48 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
We haven't seen this in several days; I'm ready to call it resolved! Sage Weil
05:08 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
All changes have been accepted upstream ( jerasure / gf-complete ), backported and copied into http://github.com/ceph... Loïc Dachary
03:29 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The above workload ran 82 times without crashing. The loop stopped because a DNS resolution failure made it impossibl... Loïc Dachary
02:48 PM Bug #8063 (Fix Under Review): LibRadosTwoPoolsECPP.PromoteSnap got EAGAIN
Sage Weil
01:22 PM Bug #8063 (In Progress): LibRadosTwoPoolsECPP.PromoteSnap got EAGAIN
Sage Weil
01:59 PM Revision 1ad5bdcd (ceph): Merge pull request #1641 from ceph/wip-multimds
mds: guarantee message ordering when importing non-auth caps
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:57 PM Revision 60cb2aec (ceph): Merge branch 'master' of https://github.com/enovance/ceph-brag into fir...
Babu Shanmugam
01:56 PM Revision 204b7a46 (ceph): Merge pull request #1645 from ceph/wip-8054
mds: finish table servers recovery after creating newfs
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:51 PM devops Bug #7617: ceph-deploy uninstall should document why it doesn't remove all relevant packages
Oh, I dunno. I was depressed enough about the "qemu requires librbd" that I kinda lost interest.
I'm good with cl...
Dan Mick
07:27 AM devops Bug #7617: ceph-deploy uninstall should document why it doesn't remove all relevant packages
@dmick would an acceptable workaround to this problem in the meantime be to add an extra flag to handle this?
Some...
Alfredo Deza
01:50 PM RADOS Feature #8079 (New): Warn user when there are no OSDs in the acting set during pool creation
Create a parallel crush hierarchy using the ceph cli with:
1) its own ruleset
2) no OSDs in the acting set
Wh...
Mark Nelson
01:28 PM Revision ad40356d (ceph): Included the total cluster size in components_count object
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
01:14 PM Bug #8077 (Fix Under Review): osd/ReplicatedPG.cc: 10862: FAILED assert(r >= 0) (agent_load_hit_s...
Sage Weil
01:03 PM Bug #8077 (In Progress): osd/ReplicatedPG.cc: 10862: FAILED assert(r >= 0) (agent_load_hit_sets)
It's just missing; we don't handle that yet. Sage Weil
12:53 PM Bug #8077 (Resolved): osd/ReplicatedPG.cc: 10862: FAILED assert(r >= 0) (agent_load_hit_sets)
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-11_10:40:02-rados:thrash-firefly-distro-basic-plana... Sage Weil
12:05 PM Revision 78fcb1a0 (ceph): Fetching the date from ceph osd dump as that is more reliable across ce...
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
11:59 AM Revision 99d74eef (ceph): doc: Add additional information over CloudStack and RBD
Wido den Hollander
11:51 AM Revision db3e0b51 (ceph): erasure-code: document the ruleset-root profile parameter
If unspecified it is ruleset-root=default and will translate into
take default
when a ruleset is created for an ...
Loïc Dachary
11:28 AM Revision 6d42bd9f (ceph): While generating crush_types, avoiding item parsing, and calculating ty...
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
11:20 AM Bug #7952 (Resolved): After aio_read() completes a call to return_value() doesn't return bytes read
d99f1d9f68db41231e0ffff4082b05d6d095c231
I forgot about this bug report, so it wasn't specified in the commit comm...
David Zafman
10:38 AM Feature #7940 (Fix Under Review): add pool snaps to ceph_test_rados
David Zafman
10:06 AM Bug #8049 (Resolved): ceph_test_rados can seg fault
3371a251159a6e4ecd28a29e6a53705ccb3083a3 David Zafman
09:27 AM rgw Documentation #8073: rgw: document placement_pools
See here:
[[http://comments.gmane.org/gmane.comp.file-systems.ceph.user/4992]]
Yehuda Sadeh
09:27 AM rgw Documentation #8073 (New): rgw: document placement_pools
Need to document how different users / buckets can be assigned to different placement targets. Yehuda Sadeh
09:18 AM devops Bug #7889: IPv6 support with ceph-deploy
There is a PR open with failing tests: https://github.com/ceph/ceph-deploy/pull/172 Alfredo Deza
08:45 AM Feature #7437 (Resolved): EC: add adapt unittest teuthology task and add to nightly
ec4a6ce0c2d175a3bf82059ff992f20ce18bb80e
The merged branch also includes improvements to the librados interface. F...
David Zafman
08:42 AM Revision 19878325 (ceph): Bug fix in the way crush_type is extracted from osd crush dump
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
07:11 AM Revision e46af060 (ceph): Merge pull request #1647 from ceph/wip-lockdep
a couple of lockdep fixes
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
05:11 AM Feature #8071 (Resolved): erasure-code: simplify jerasure initialization
Take advantage of the newly exposed "galois_init_default_field":https://bitbucket.org/jimplank/jerasure/commits/af8d2... Loïc Dachary
04:36 AM Revision 072d3711 (ceph): RWLock: make lockdep id mutable
This allows us to keep the lock/unlock methods const, as per commit
970d53fc0fefc89ffe7550880a4aaa36bd534955.
Signed...
Sage Weil
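The commit above makes the lockdep id mutable so that the lock and unlock methods can stay const. A minimal sketch of that C++ idiom follows. `SketchRWLock`, `Counter`, and the placeholder id value are hypothetical illustrations, not Ceph's RWLock, and a plain `std::mutex` stands in for a real reader/writer lock.

```cpp
#include <cassert>
#include <mutex>

// Hypothetical stand-in for a lock wrapper; NOT Ceph's RWLock.
class SketchRWLock {
  mutable std::mutex mtx;       // mutable: locking does not change logical state
  mutable int lockdep_id = -1;  // mutable: bookkeeping id may be assigned in const methods

  // Assign a bookkeeping id on first use; callable from const methods.
  void register_id() const {
    if (lockdep_id < 0)
      lockdep_id = 1;  // placeholder for a real registration call
  }

public:
  void get_read() const {
    register_id();
    mtx.lock();
  }
  void put_read() const { mtx.unlock(); }
  int id() const { return lockdep_id; }
};

// A const accessor can take the lock because get_read()/put_read() are const.
class Counter {
  SketchRWLock lock;
  int value = 42;

public:
  int read() const {
    lock.get_read();
    int v = value;
    lock.put_read();
    return v;
  }
};
```

Without `mutable`, `register_id()` could not write the id from a const method, and `Counter::read()` could not be declared const.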
04:34 AM Revision da0d3820 (ceph): Revert "RWLock: don't assign the lockdep id more than once"
This reverts commit 957ac3cbe394473f225ffd2b632461fcdaca99e6.
It's important to assign these for all operations for ...
Sage Weil
04:34 AM Revision 632098f2 (ceph): common_init: remove dup lockdep message
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:27 AM Bug #8069 (Resolved): ceph-deploy README suggests questionable SSH parameters
In README.rst in ceph-deploy, the following sshd options are suggested for any host to which ceph-deploy should conne... Florian Haas
03:02 AM Revision 8c38ec7a (ceph): Merge pull request #1646 from dmick/wip-erasure-doc
doc: Wordsmith the erasure-code doc a bit. John Wilkins
02:55 AM Revision 3c54a49e (ceph): Wordsmith the erasure-code doc a bit
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
01:57 AM Revision f6c20730 (ceph): mds: finish table servers recovery after creating newfs
Fixes: #8054
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
01:30 AM Revision 9fa5c5fa (ceph): big: test xfs + btrfs
Note that previously we were specifying neither, so the OSDs were running
off of /.
Signed-off-by: Sage Weil <sage@i...
Sage Weil
01:25 AM Revision 756e3626 (ceph): Merge pull request #1643 from ceph/wip-8062
mon/OSDMonitor: ignore boot message from before last up_from
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
12:39 AM Revision 3db74861 (ceph): mds: issue new caps before starting log entry
Locker::issue_new_caps() calls Locker::eval(), which may dispatch
other requests.
Signed-off-by: Yan, Zheng <zheng.z...
Yan, Zheng
12:22 AM Revision 98127202 (ceph): librados, test: Have write, append and write_full return 0 on success
Fix consistency of write, append, write_full, all return 0 on success
Include C (rados_*) variants, C++ ctx variants
...
David Zafman
12:22 AM Revision d2113814 (ceph): pybind: Check that "key" is a string
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:22 AM Revision 39bf68c3 (ceph): pybind, test: Add python binding for append and add to test
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:22 AM Revision 3d290c2f (ceph): test: Add EC unaligned append write test to ceph_test_rados_api_io
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:22 AM Revision d99f1d9f (ceph): test, librados: aio read *return_value consistency, fix ceph_test_rados...
test:
Add set_completion*PP() functions to cast arg to correct class
Add return_value checks
Add some reads wit...
David Zafman
12:22 AM Revision 69afc59b (ceph): test: Add multiple write test cases to ceph_test_rados_api_aio
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:22 AM Revision 07e8ee20 (ceph): test: Add EC testing to ceph_test_rados_api_aio
Fixes: #7437
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman

04/10/2014

11:48 PM Revision 008663aa (ceph): rgw, radosgw-admin: bucket link uses bucket instance id now
Fixes: 7499
We need to link user to a specific bucket instance. This updates both
the radosgw-admin link command, and...
Yehuda Sadeh
10:54 PM Revision 6ce7116f (ceph): civetweb: update subproject
Fixes: #7786
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
09:53 PM Revision 6dc29904 (ceph): Change status to _status everywhere (fix regression)
0550dd119150acf8f7b7da2f32b3938c910c3c99 didn't add '_' to 'status'
everywhere it was needed.
Signed-off-by: Dan Mic...
Dan Mick
08:40 PM Revision 43d837d3 (ceph): rgw: radosgw-admin object rewrite
A radosgw-admin command that copies the object into itself while
preserving mtime and attributes so that data can be ...
Yehuda Sadeh
08:34 PM Revision 4c99e978 (ceph): mon/OSDMonitor: ignore boot message from before last up_from
It is possible we will have a dup OSDBoot message queued up in the mon
and will process it again after that osd was m...
Sage Weil
08:24 PM Revision 364d0b40 (ceph): ceph.conf: longer client_mount_timeout (which also applies to librados)
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:50 PM CephFS Bug #8054: multimds hang on fsstress
Zheng Yan
06:54 AM CephFS Bug #8054 (Resolved): multimds hang on fsstress
ceph-fuse:
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-09_23:01:37-multimds-master-testing-bas...
Sage Weil
06:35 PM Bug #8067: mon: enomem on vps, killed at ~800MB
this is probably a dup of #8036 Sage Weil
06:13 PM Bug #8067 (Duplicate): mon: enomem on vps, killed at ~800MB
ubuntu@teuthology:/a/teuthology-2014-04-09_22:35:02-upgrade:dumpling-x:stress-split-firefly-distro-basic-vps/182651
...
Sage Weil
06:31 PM Bug #8036: leveldb: throws std::bad_alloc on 14.04
It would be interesting to know why the monitor's virtual mem usage got to 700MB, although a portion of it should go ... Joao Eduardo Luis
01:19 PM Bug #8036: leveldb: throws std::bad_alloc on 14.04
It's a good practice to run tests on scaled down machines, the question then is - do we fix bugs related like this on... Yuri Weinstein
11:54 AM Bug #8036: leveldb: throws std::bad_alloc on 14.04
This was run on vms, so they have much less memory than the usual physical machines. Josh Durgin
11:25 AM Bug #8036: leveldb: throws std::bad_alloc on 14.04
core is corrupted:... Joao Eduardo Luis
06:24 PM Bug #8062 (Resolved): [ERR] map e10 had wrong cluster addr (10.214.131.8:6815/1007096 != my 10....
Sage Weil
01:34 PM Bug #8062 (Fix Under Review): [ERR] map e10 had wrong cluster addr (10.214.131.8:6815/1007096 !...
Sage Weil
01:33 PM Bug #8062 (Resolved): [ERR] map e10 had wrong cluster addr (10.214.131.8:6815/1007096 != my 10....
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-10_02:30:24-rados-firefly-distro-basic-plana/183189... Sage Weil
06:18 PM Bug #8068 (Resolved): try_flush_mark_clean can end up using a snapset from the past corrupting th...
Samuel Just
06:05 PM Bug #8066: osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + firefly)
ubuntu@teuthology:/a/teuthology-2014-04-09_22:35:02-upgrade:dumpling-x:stress-split-firefly-distro-basic-vps/182696 Sage Weil
06:02 PM Bug #8066 (Duplicate): osd/PG.cc: 2826: FAILED assert(r == 0) in update_snap_map (dumpling + fire...
ubuntu@teuthology:/a/teuthology-2014-04-09_22:35:02-upgrade:dumpling-x:stress-split-firefly-distro-basic-vps/182710... Sage Weil
06:01 PM Revision 28371a24 (ceph): Merge pull request #1624 from ceph/wip-6789
mon: Monitor: suicide on start if mon has been removed from monmap
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
05:55 PM Revision a8f09539 (ceph): osd/ReplicatedPG: adjust obc + snapset_obc locking strategy
Previously we assumed that if we had snapset_obc set, !exists on the head
and if we got the snapdir lock we were good...
Sage Weil
05:53 PM Bug #7996: 0.78: OSD is not suspend-friendly (unresponsive cluster on OSD crash)
When cluster is stable output of `ceph -s` is normal, showing health status HEALTH_OK, 5 monitors and 11 OSDs up and ... Dmitry Smirnov
05:26 PM Feature #7940: add pool snaps to ceph_test_rados
David Zafman
02:14 PM Revision 86b85947 (ceph): mon: Monitor: suicide on start if mon has been removed from monmap
If the monitor has been marked as having been part of an existing quorum
and is no longer in the monmap, then it is s...
Joao Eduardo Luis
01:49 PM devops Bug #8034 (Resolved): ceph-deploy should run sudo yum clean all after installing ceph-release rpm
Merged into ceph-deploy's master branch with hash: 5d82162 Alfredo Deza
09:54 AM devops Bug #8034 (Fix Under Review): ceph-deploy should run sudo yum clean all after installing ceph-rel...
Pull request opened https://github.com/ceph/ceph-deploy/pull/177 Alfredo Deza
09:48 AM devops Bug #8034 (In Progress): ceph-deploy should run sudo yum clean all after installing ceph-release rpm
Alfredo Deza
01:41 PM Bug #8063 (Resolved): LibRadosTwoPoolsECPP.PromoteSnap got EAGAIN
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-10_02:30:24-rados-firefly-distro-basic-plana/183560... Sage Weil
01:25 PM Bug #8061: LibRadosAio.FlushAsync failure
teuthology commit 364d0b4091a9b87db9ca21238052b776d0dce27c Sage Weil
01:24 PM Bug #8061 (Resolved): LibRadosAio.FlushAsync failure
mon thrashing. increased the timeout from 5m to 10m. Sage Weil
01:20 PM Bug #8061 (Resolved): LibRadosAio.FlushAsync failure
... Sage Weil
11:46 AM Revision 02048dcc (ceph): mds: guarantee message ordering when importing non-auth caps
The current code allows importing non-auth caps while the inode is being exported.
This can break message ordering because the...
Yan, Zheng
11:01 AM Bug #6789 (Pending Backport): cannot remove the leader when there only are two monitors
Sage Weil
10:55 AM Bug #8046 (Resolved): osd/ReplicatedPG.h: 666: FAILED assert(got) in get_rw_locks()
Sage Weil
10:45 AM rgw Bug #8016: "testPrefixAndLimit (test.functional.tests.TestContainerUTF8) ... ERROR" in upgrade:du...
I see seemingly the same issue in today's run.
Logs are here - http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-...
Yuri Weinstein
10:37 AM RADOS Feature #8060 (New): Make it easier to map crush rules to pools
Right now when you want to create a new crush rule via create-simple, you do something like:... Mark Nelson
10:31 AM rgw Bug #8059 (Resolved): ACL updated in master zone not updating in slave
In Summary:
While using a setup with a master and slave zone the ACL for an object is updated on the master zone, ...
JuanJose Galvez
10:21 AM Linux kernel client Bug #8024 (Resolved): kclient: misdirected osd request
Ilya Dryomov
10:21 AM Linux kernel client Bug #7954 (Resolved): misdirected op
Ilya Dryomov
07:13 AM Linux kernel client Bug #7954 (Fix Under Review): misdirected op
Ilya Dryomov
07:05 AM Linux kernel client Bug #7954: misdirected op
probably this: ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-09_23:01:12-kcephfs-master-testing-b... Sage Weil
10:17 AM Bug #8021: osd: ENOENT on clone on dumpling
I think I see it on firefly if it's the same issue.
See logs here:
http://qa-proxy.ceph.com/teuthology/teuthology-2...
Yuri Weinstein
09:51 AM rgw Bug #7786: civetweb segfaults with file uploads larger than 2GB
Please pull down the newest civetweb that includes this fix. Ian Colle
09:50 AM rgw Bug #7450 (Fix Under Review): "radosgw-admin key create" ignores specified access key when subuse...
Josh, please review the PR for wip-7450. Ian Colle
09:30 AM Bug #8058 (Resolved): "LibRadosTierECPP.FlushWriteRaces" failed in upgrade:dumpling-x:parallel-fi...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-09_19:33:29-upgrade:dumpling-x:parallel-firefly---... Yuri Weinstein
06:58 AM CephFS Bug #8055 (Can't reproduce): knfs: NFS: nfs4_discover_server_trunking unhandled error -5. Exiting...
... Sage Weil
04:19 AM Revision cf69bdbd (ceph): Merge pull request #1639 from ceph/wip-multimds
Wip multimds
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:13 AM Revision ac51fcac (ceph): mds: include truncate_seq/truncate_size in filelock's state
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
03:13 AM Revision 808ba130 (ceph): mds: remove wrong assertion for remote frozen authpin
For a cross-authority rename, the MDS first freezes the source inode's
authpin. It happens while the source dentry isn...
Yan, Zheng
01:02 AM Revision 860d7277 (ceph): osdc/Objecter: move mapping into struct, helper
Move the common bits of Op and LingerOp into op_target_t and separate the
actual mapping calculation into calc_target...
Sage Weil
12:21 AM Revision 5df98f47 (ceph): Merge pull request #1637 from ceph/wip-8042
mon: fix election required_features checks
Reviewed-by: Greg Farnum <greg@inktank.com>
Gregory Farnum

04/09/2014

11:51 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The above workload ran 51 times without crashing. Loïc Dachary
03:16 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...

"init fix merged":https://bitbucket.org/jimplank/jerasure/pull-request/17/make-galois_init_default_field-int-w , "b...
Loïc Dachary
03:15 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The above workload ran 26 times without crashing. rados suite did not run yet on the latest master. Loïc Dachary
11:25 PM Revision 71d97f99 (ceph): Merge pull request #1636 from ceph/wip-6480
fix auth races that may have led to qemu crashes
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
11:21 PM Revision 48895a46 (ceph): osdc/ObjectCacher: back off less during flush
In cce990efc8f2a58c8d0fa11c234ddf2242b1b856 we added a limit to avoid
holding the lock for too long. However, if we ...
Sage Weil
11:21 PM Revision f3b5ba6f (ceph): osdc/ObjectCacher: limit writeback IOs generated while holding lock
While analyzing a log from Mike Dawson I saw a long stall while librbd's
objectcacher was starting lots (many hundred...
Sage Weil
11:03 PM Revision 18642ed3 (ceph): mon: tell peers missing features during probe
Use a new probe op to inform mons that they are missing features during
the earliest probe phase. This prevents them...
Sage Weil
11:01 PM Revision 39ca440b (ceph): mon: move required_features back into Monitor
This is simpler and cleaner.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:40 PM Revision c8039ab8 (ceph): mon: ignore sync clients without required_features
If we let them sync data they don't understand they will get confused
and crash.
Signed-off-by: Sage Weil <sage@inkt...
Sage Weil
09:34 PM Bug #7987: osd: backfill/recovery makes no progress
It looks like thrashosds gave up waiting as it usually does (it doesn't kill the child task so you just don't see any... Sage Weil
09:31 PM Revision 75948357 (ceph): utime: declare is_zero(), ceph_timespec(), and sleep() as const
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 9af10b2c (ceph): auth: CephxProtocol const cleanup
need_key() and build_authorizer() can be const.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:31 PM Revision 3ccef662 (ceph): auth: AuthClientHandler const cleanup
get_protocol(), build_request(), build_rotating_request(), and
build_authorizer() can all be declared const now.
Sig...
Josh Durgin
09:31 PM Revision b297689a (ceph): auth: make AuthClientHandler::validate_ticket() protected
It's just used internally. Make it private in the subclasses since
there's just one level of inheritance.
Signed-off...
Josh Durgin
09:31 PM Revision 50ed65fb (ceph): auth: remove unused get_global_id() method
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 4d3d89bf (ceph): auth: remove unused tick() method
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:31 PM Revision 957ac3cb (ceph): RWLock: don't assign the lockdep id more than once
This never does anything since lockdep_register() assigns an id >= 0
in the RWLock constructor. This also prevents me...
Josh Durgin
09:31 PM Revision 970d53fc (ceph): RWLock: make read locking methods const
This allows methods using RWLock for reading to be declared const.
There might be cases where we'd want to take a wri...
Josh Durgin
09:31 PM Revision 3119022d (ceph): auth: separate writes of build_request() into prepare_build_request()
validate_tickets() updates internal state, as does
tickets.get_handler(). Move them into a new method called before
b...
Josh Durgin
09:29 PM Revision 2cc76bcd (ceph): auth: add rwlock to AuthClientHandler to prevent races
For cephx, build_authorizer reads a bunch of state (especially the
current session_key) which can be updated by the M...
Josh Durgin
09:27 PM Bug #8042 (Resolved): mon: crash decoding incremental osdmap on split firefly/dumpling
Sage Weil
04:14 PM Bug #8042 (Fix Under Review): mon: crash decoding incremental osdmap on split firefly/dumpling
Sage Weil
11:23 AM Bug #8042: mon: crash decoding incremental osdmap on split firefly/dumpling
Greg Farnum
11:16 AM Bug #8042 (Fix Under Review): mon: crash decoding incremental osdmap on split firefly/dumpling
Sage Weil
09:35 AM Bug #8042: mon: crash decoding incremental osdmap on split firefly/dumpling
... Sage Weil
09:20 PM Revision d4f2a8ce (ceph): Reduce runs in the dumpling-emperor upgrade suite.
Add + to 2-workload, 4-workload and 6-final-workload
suites. Reduce 5-upgrade-sequence to only run two
tests and mak...
Warren Usui
08:59 PM Revision f1d63236 (ceph): Merge pull request #238 from ceph/wip-pep8
Wip pep8 GregMeno
08:56 PM Revision 0550dd11 (ceph): more statuses redefinition fixes
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com> Alfredo Deza
08:36 PM Revision 30f3b01a (ceph): rename variable to avoid collision
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com> Alfredo Deza
08:27 PM Revision b476ec29 (ceph): remove unused import
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com> Alfredo Deza
08:26 PM Revision 15a3acb7 (ceph): fix redefinition of region
Signed-off-by: Alfredo Deza <alfredo.deza@inktank.com> Alfredo Deza
06:35 PM Revision 2e8035fa (ceph): osd: Fix appending write to return any error back to caller
Also, correct double bumping of num_writes
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
06:35 PM Revision 3371a251 (ceph): test: Fix Seg fault in ceph_test_rados
Fixes: #8049
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
06:35 PM Revision edd542e4 (ceph): tools: Improve ceph_scratchtoolpp
Minor output improvements
Remove clone_range code that was asserting
Signed-off-by: David Zafman <david.zafman@inkta...
David Zafman
06:13 PM Revision 34d69cdc (ceph): mon: refresh elector required_features when they change
Currently we only refresh required_features on Elector::start(). This
does not prevent an old peer from calling an e...
Sage Weil
06:09 PM Revision b3b502f1 (ceph): mon/Elector: ignore ACK from peers without required features
If an old peer gets a PROPOSE from us, we need to be sure to ignore their
ACK. Ignoring their PROPOSEs isn't suffici...
Sage Weil
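The two Elector commits above revolve around checking whether a peer has the required features before acting on its messages. A minimal sketch of such a check, assuming features are carried as a 64-bit bitmask, follows. The function names and the bit values used in the usage note are hypothetical and are not taken from the Ceph source.

```cpp
#include <cassert>
#include <cstdint>

// True only if the peer advertises every bit set in `required`.
bool has_required_features(uint64_t peer_features, uint64_t required) {
  return (peer_features & required) == required;
}

// Hypothetical handler logic: count an ACK only when it comes from a
// peer that supports everything we require; otherwise ignore it.
bool should_count_ack(uint64_t peer_features, uint64_t required) {
  return has_required_features(peer_features, required);
}
```

For example, with `required = 0x5`, a peer advertising `0x7` passes the check, while a peer advertising `0x4` is ignored because it lacks bit `0x1`.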
05:37 PM Revision 5a567c47 (ceph): Merge pull request #1626 from ceph/wip-8031
osd: improve misdirected op checks
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:34 PM Revision 5b16650b (ceph): Merge pull request #1627 from ceph/wip-8001
osd/PG: set CREATING pg state bit until we peer for the first time
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:34 PM Revision 2a9f5fd5 (ceph): Merge pull request #1631 from ceph/wip-8045
osd: fix check_osdmap_features deadlock
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
04:58 PM devops Bug #5193 (In Progress): RHEL6 does not ship with xfsprogs
Not marking it as complete yet because the automation/insertion in jenkins is not complete but for now I built packaa... Sandon Van Ness
04:24 PM rbd Bug #6480 (Pending Backport): librbd crashed qemu-system-x86_64
Sage Weil
03:57 PM Bug #8020: evenly split stats on split
Samuel Just
03:48 PM Revision 3382fe3c (ceph): os/FileStore: reset journal state on umount
We observed a sequence like:
- replay journal
- sets JournalingObjectStore applied_op_seq
- umount
- mount
...
Sage Weil
03:47 PM Revision 06f27fc6 (ceph): os/FileStore: reset journal state on umount
We observed a sequence like:
- replay journal
- sets JournalingObjectStore applied_op_seq
- umount
- mount
...
Sage Weil
03:26 PM rgw Feature #8052 (Duplicate): Support for Keystone Identity API v3
With Havana, Identity V3.0 was implemented, with support for domains.
RGW seems to only support Identity API v2.0...
Dag Stenstad
03:14 PM Revision f1a89340 (ceph): Merge pull request #1632 from ceph/wip-5469
librbd: fix zero length request handling
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
02:18 PM Bug #7976: 4.8 missing primary copy of ..., unfound (dumpling)
4139e75d63b0503dbb7fea8036044eda5e8b7cf1 and 7a06a71e0f2023f66d003dfb0168f4fe51eaa058 are part of the story. We prob... Samuel Just
02:17 PM Bug #7976 (Pending Backport): 4.8 missing primary copy of ..., unfound (dumpling)
Actually, this is mildly tricky since the backfill logic changed a bit after dumpling to avoid some other related cat... Samuel Just
02:09 PM Bug #7976: 4.8 missing primary copy of ..., unfound (dumpling)
mm, doesn't apply cleanly, working on it Samuel Just
01:48 PM Bug #7976 (Fix Under Review): 4.8 missing primary copy of ..., unfound (dumpling)
Samuel Just
01:48 PM Bug #7976: 4.8 missing primary copy of ..., unfound (dumpling)
wip-7676-dumpling I think Samuel Just
09:54 AM Bug #7976: 4.8 missing primary copy of ..., unfound (dumpling)
This is a backfill bug in dumpling. The problem seems to be somewhere around here... the backfill for 57594e48/bench... Sage Weil
08:28 AM Bug #7976: 4.8 missing primary copy of ..., unfound (dumpling)
Sage Weil
02:12 PM Revision dc4bbfa7 (ceph): Merge pull request #1634 from ceph/wip-8028
rpm: add redhat-lsb dependency
Reviewed-by: Alfredo Deza <alfredo.deza@inktank.com>
Alfredo Deza
02:05 PM Revision f1c6b65b (ceph): ceph.spec.in: require redhat-lsb-core
We need this for /lib/lsb/init-functions.
Fixes: #8028
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:55 PM Revision 1d0c62fa (ceph): Merge pull request #1606 from ceph/wip-shrink-icache
client: try shrinking kernel inode cache when trimming session caps
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
11:15 AM Bug #8049 (Resolved): ceph_test_rados can seg fault

$ ./ceph_test_rados --pool ctrpool --op read 4 --op write 20
.
.
.
update_object_version oid 5 v 384 (ObjNum 32...
David Zafman
10:37 AM Bug #8031 (Resolved): osd/ReplicatedPG.cc: 405: FAILED assert(needs_recovery)
Sage Weil
10:35 AM Bug #8001 (Resolved): hung recovery; pg 3.f disappeared
Sage Weil
10:35 AM Bug #8045 (Resolved): osd: deadlock from osdmap feature update
Sage Weil
09:31 AM Bug #8047 (Closed): 0.79: new OSD crashed within minutes
Ian Colle
12:12 AM Bug #8047: 0.79: new OSD crashed within minutes
Never mind, root cause for this one is physical media errors in HDD. Apologies for noise -- please close this bug.
T...
Dmitry Smirnov
09:22 AM Feature #6707 (Rejected): Backport per pool iops data to 0.67
Sage Weil
08:47 AM Bug #8019 (Resolved): os/JournalingObjectStore.cc: 121: FAILED assert(op > committed_seq) on wheezy
Sage Weil
08:29 AM Bug #7891: osd: leaked pg refs on shutdown
http://qa-proxy.ceph.com/teuthology/joshd-2014-04-08_17:21:20-rados-wip-6480-testing-basic-plana/179778/ Josh Durgin
08:23 AM Bug #8048 (Resolved): osd/ReplicatedPG: FAILED assert(!parent->get_log().get_missing().is_missing...
From effectively the master branch: http://qa-proxy.ceph.com/teuthology/joshd-2014-04-08_17:21:20-rados-wip-6480-test... Josh Durgin
08:14 AM rbd Bug #5469 (Resolved): qemu-io: segfault when tried IO with invalid arguments
Sage Weil
07:19 AM Bug #7996: 0.78: OSD is not suspend-friendly (unresponsive cluster on OSD crash)
What's the output of "ceph -s" while the cluster is stable, while it's frozen, and after it's recovered? Greg Farnum
02:03 AM Bug #7996: 0.78: OSD is not suspend-friendly (unresponsive cluster on OSD crash)
I reproduced this problem on 0.79. It looks like it is no different from 0.78 in regards to this problem.
I suspen...
Dmitry Smirnov
07:13 AM Bug #8028 (Resolved): /lib/lsb/init-functions does not exist in latest firefly rc
Sage Weil
06:05 AM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
It works great except that it pulls in a lot of things other than the init-functions that we need:... Alfredo Deza
06:07 AM Revision ae09361a (ceph): mailmap: Yan, Zheng name normalization
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
06:06 AM Revision 5ee0b7b2 (ceph): mailmap: Nikola Kotur affiliation
Reviewed-by: Nikola Kotur <kotnick@gmail.com>
Signed-off-by: Loic Dachary <loic@dachary.org>
Loïc Dachary
03:48 AM Revision e5f3eb82 (ceph): Merge pull request #1633 from ceph/wip-8004
client: wake up umount waiter if receiving session open message
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:23 AM Revision 02aedbc4 (ceph): client: wake up umount waiter if receiving session open message
Wake up umount waiter if receiving session open message while
umounting. The umount waiter will re-close the session....
Yan, Zheng
02:30 AM Revision 1848a237 (ceph): librados: Fix typo for read functions documentation
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
02:30 AM Revision 9c6733e6 (ceph): librados: Allow rados_pool_list() to get NULL buffer if len == 0
Coverity wants a NULL check for buf, but caller should be able to
pass a NULL buf and 0 len in order to get byte coun...
David Zafman
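The commit above lets a caller pass a NULL buffer with a zero length in order to learn how many bytes are needed. A minimal sketch of that two-call sizing pattern follows. `list_names` is a hypothetical stand-in, not the librados function, and the NUL-separated output layout with a final extra NUL is an assumption of this sketch.

```cpp
#include <cassert>
#include <cerrno>
#include <cstddef>
#include <cstring>
#include <string>
#include <vector>

// Hypothetical stand-in, NOT rados_pool_list(). Writes each name followed
// by '\0', then one final '\0', when the buffer is large enough.
// Returns the number of bytes needed, or -EINVAL for a null buffer
// combined with a nonzero length.
int list_names(const std::vector<std::string>& names, char* buf, size_t len) {
  size_t needed = 1;  // final terminator
  for (const auto& n : names)
    needed += n.size() + 1;

  if (len == 0)
    return static_cast<int>(needed);  // size query: buf is never touched
  if (buf == nullptr)
    return -EINVAL;
  if (len < needed)
    return static_cast<int>(needed);  // too small: report the size, write nothing

  char* p = buf;
  for (const auto& n : names) {
    std::memcpy(p, n.c_str(), n.size() + 1);  // copy the name and its '\0'
    p += n.size() + 1;
  }
  *p = '\0';
  return static_cast<int>(needed);
}
```

A caller would first call `list_names(names, nullptr, 0)` to get the size, allocate that many bytes, and then call again with the real buffer.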
02:30 AM Revision 409999c3 (ceph): rbd: Prevent Seg fault by checking read result in snap_read_header()
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
01:25 AM Revision 0bed25c6 (ceph): Merge pull request #236 from ceph/wip-fix-mon-warn-corrected
changed mon data avail warn to avoid monitor storage warnings
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
01:21 AM Revision b3218eeb (ceph): changed mon data avail warn to avoid monitor storage warnings
Yuri Weinstein
12:39 AM Revision a8330f5c (ceph): librbd: fix zero length request handling
Zero-length writes would hang because the completion was never
called. Reads would hit an assert about zero length in...
Josh Durgin
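The commit above notes that zero-length writes hung because the completion was never called. A minimal sketch of the general remedy, completing an empty request immediately instead of queuing it, follows. `SketchQueue` and its methods are hypothetical and do not reflect librbd's actual interfaces or the actual fix.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical request queue; NOT librbd's API.
struct SketchQueue {
  std::vector<std::function<void()>> pending;

  // Queue a write of `len` bytes; `on_complete` receives the result code.
  void submit_write(size_t len, std::function<void(int)> on_complete) {
    if (len == 0) {
      on_complete(0);  // nothing to write: complete right away so no caller waits forever
      return;
    }
    pending.push_back([on_complete] { on_complete(0); });
  }

  // Pretend the queued I/O finished successfully and fire the callbacks.
  void run_pending() {
    for (auto& op : pending)
      op();
    pending.clear();
  }
};
```

The key property is that every submitted request, including an empty one, results in exactly one completion callback.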
12:28 AM Revision 22a0c1fd (ceph): osd: do not block when updating osdmap superblock features
We are holding osd_lock in check_osdmap_features, which means we cannot
block while waiting for filestore operations ...
Sage Weil

04/08/2014

11:51 PM Bug #8047 (Closed): 0.79: new OSD crashed within minutes
On 0.79 I added a new OSD (on btrfs). Shortly after rebalancing began, the newly added OSD crashed:... Dmitry Smirnov
10:54 PM Revision 43f0519b (ceph): doc: Made minor changes to quick start preflight for RHEL.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:53 PM Revision ab7a25ce (ceph): doc: Notes and minor modifications to gateway installation doc.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:40 PM Revision 1d74170a (ceph): pipe: only read AuthSessionHandler under pipe_lock
session_security, the AuthSessionHandler for a Pipe, is deleted and
recreated while the pipe_lock is held. read_messa...
Josh Durgin
10:12 PM Bug #8011: osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-08_14:01:14-rados:thrash-wip-7891-testing-basic-plana/178972 Sage Weil
09:47 PM Revision 26907e3d (ceph): Merge pull request #1628 from ceph/wip-5835
update package descriptions
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:34 PM Revision 084db49e (ceph): remove mkcephfs
Fixes: #7552
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:19 PM Revision 277e7ac4 (ceph): debian: update ceph description
Fixes: #5835
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:18 PM Revision 72dc7327 (ceph): ceph.spec: update ceph description
Fixes: #5835
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:49 PM Bug #8028 (Fix Under Review): /lib/lsb/init-functions does not exist in latest firefly rc
Sage Weil
08:49 PM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
The gitbuilders like this fine. Alfredo, do you want to do a final test with ceph-deploy to make sure yum install fi... Sage Weil
05:57 PM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
Sage Weil
05:56 PM Bug #8028 (Fix Under Review): /lib/lsb/init-functions does not exist in latest firefly rc
Sage Weil
05:03 PM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
This is because of a missing dependency on "lsb-base". I reckon it (or a similar package) should be available on RHEL/CentOS... Dmitry Smirnov
12:38 PM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
but those do not export the same functions... what is it that we needed from init-functions that we had to add that l... Alfredo Deza
09:34 AM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
the offending commit by the way is commit:012bb5fb5bbc76e5a2c5037dc0c6558f0b1b0a45 Sage Weil
09:34 AM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
... Sage Weil
08:16 AM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
Ok so this looks like this gets installed with the redhat-lsb package that has this file in it:... Alfredo Deza
07:04 AM Bug #8028: /lib/lsb/init-functions does not exist in latest firefly rc
Commenting out that one line allows me to start the monitors but I understand this has side-effects that are not enti... Alfredo Deza
06:07 AM Bug #8028 (Resolved): /lib/lsb/init-functions does not exist in latest firefly rc
... Alfredo Deza
08:25 PM CephFS Bug #8004: LibCephFS.HardlinkNoOriginal hang
Zheng Yan
07:45 PM Revision 4bb0628a (ceph): Merge pull request #1625 from ceph/wip-8019
osd: fix journal umount/mount weirdness
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
07:26 PM Revision 79ac2f79 (ceph): osd/PG: set CREATING pg state bit until we peer for the first time
We send PG state updates to the monitor while creating a PG before the
actual creation has been finalized and persist...
Sage Weil
06:49 PM rbd Bug #5469 (Fix Under Review): qemu-io: segfault when tried IO with invalid arguments
https://github.com/ceph/ceph/pull/1632 Josh Durgin
06:36 PM Bug #8046: osd/ReplicatedPG.h: 666: FAILED assert(got) in get_rw_locks()
Sage Weil
06:31 PM Bug #8046: osd/ReplicatedPG.h: 666: FAILED assert(got) in get_rw_locks()
The wr locks are held on both head and snapset due to a previous op (delete) that is committed but not yet applied. Sage Weil
05:35 PM Bug #8046 (Resolved): osd/ReplicatedPG.h: 666: FAILED assert(got) in get_rw_locks()
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-08_02:30:14-rados-firefly-distro-basic-plana/178780... Sage Weil
06:02 PM Revision 1cdb7381 (ceph): vstart.sh: make crush location match up with what init-ceph does
This makes it so that ./init-ceph restart osd.0 won't modify the CRUSH
tree. And in any case, the localhost/localrac...
Sage Weil
06:02 PM Revision 4de49e86 (ceph): os/FileStore: reset journal state on umount
We observed a sequence like:
- replay journal
- sets JournalingObjectStore applied_op_seq
- umount
- mount
...
Sage Weil
05:48 PM Revision b2923872 (ceph): rgw: deny writes to a secondary zone by non-system users
Fixes: #6678
We don't want to allow regular users to write to secondary zones,
otherwise we'd end up with data incons...
Yehuda Sadeh
05:43 PM Revision ddafcc37 (ceph): Merge pull request #1623 from ceph/wip-8026
mds: fix shared_ptr MDRequest bugs
Reviewed-by: Greg Farnum <greg@inktank.com>
Gregory Farnum
05:40 PM Bug #8045: osd: deadlock from osdmap feature update
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-08_02:30:14-rados-firefly-distro-basic-plana/178740 Sage Weil
05:30 PM Bug #8045 (Fix Under Review): osd: deadlock from osdmap feature update
Sage Weil
05:18 PM Bug #8045 (Resolved): osd: deadlock from osdmap feature update
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-08_02:30:14-rados-firefly-distro-basic-plana/178670... Sage Weil
05:14 PM Revision 667137cc (ceph): Merge pull request #1621 from dachary/wip-7914
erasure-code: thread-safe initialization of gf-complete
This looks like a good interim solution until gf-complete ex...
Sage Weil
05:11 PM Revision 96e17744 (ceph): changed idle_timeout to 300
Yuri Weinstein
05:01 PM Bug #8044 (Duplicate): osd/ReplicatedPG.cc: 2276: FAILED assert(p != snapset.clones.end())
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-08_02:30:14-rados-firefly-distro-basic-plana/178733... Sage Weil
04:11 PM Bug #8043 (Resolved): until we fix it more better, we should disallow split on cache pools
Samuel Just
04:06 PM Revision d3833dda (ceph): osd: make misdirected checks explicit about replicas, flags
Only allow read ops to target replicas if the necessary op flags are set.
The previous checks were very sloppy.
Fixe...
Sage Weil
04:06 PM Revision 5d611619 (ceph): osd: drop previous interval ops even if primary happens to be the same
If we have two consecutive intervals with the same primary, the client
will not resend the op and the same_primary_si...
Sage Weil
04:06 PM Revision d2edd9c1 (ceph): osd: drop unused same_for_*() helpers
These were all identical and mostly served to obscure the actual logic,
which is now captured by can_discard_op() and...
Sage Weil
03:52 PM Bug #8042 (Resolved): mon: crash decoding incremental osdmap on split firefly/dumpling
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-07_22:35:16-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
03:24 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Kevin Greenan proposes to expose a static function to reduce the amount of code required from the plugin : https://bi... Loïc Dachary
10:14 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Sage Weil
10:02 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
the above config.yaml ran 6 times without failing Loïc Dachary
05:47 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Running tests against the proposed change, using the previous config.yaml. Not really hoping that it will fail. Ju... Loïc Dachary
04:30 AM Bug #7914 (Fix Under Review): osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer ...
Loïc Dachary
03:04 PM Feature #8041 (Resolved): ceph uses GCC-specific strerror_r; easy to make more portable
GCC's strerror_r returns the string; the POSIX version returns success (the string is returned in the supplied buffer... Dan Mick
02:49 PM devops Bug #5835 (Resolved): Change text in package builds
Sage Weil
02:19 PM devops Bug #5835 (Fix Under Review): Change text in package builds
Sage Weil
02:46 PM devops Bug #6779 (Resolved): fix typo on the modfastcgi repo for fedora18
Simple typo fix fixed in ceph-qa-chef commit 1e8ba35 Sandon Van Ness
02:36 PM devops Feature #8039 (Closed): move to libgoogle-perftools4
From James on ceph-maintainers:... Sage Weil
02:35 PM devops Bug #7552 (Fix Under Review): dregs of mkcephfs still live on
Sage Weil
02:22 PM devops Feature #8037 (Closed): Test leveldb 1.12 (or newer) and package as necessary
Ian Colle
02:22 PM devops Bug #7918 (Won't Fix): Mon hangs at start after upgrading to leveldb-1.12.0-3.fc18.x86_64 from th...
we're not going to worry about fc18 at this point; let's focus on making sure the fc10 and centos/rhel stuff works. Sage Weil
02:13 PM Feature #6258 (New): ceph-disk: zap should wipefs
Ian Colle
02:08 PM Bug #8036: leveldb: throws std::bad_alloc on 14.04
... Sage Weil
02:03 PM Bug #8036 (Can't reproduce): leveldb: throws std::bad_alloc on 14.04
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-07_22:35:16-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
02:04 PM Bug #7744: osd: assert(last_e.version.version < e.version.version)
Sage Weil
01:56 PM Bug #3652 (Duplicate): split should not mess up stats
Ian Colle
01:55 PM Bug #4253 (Can't reproduce): radosgw: segfault in lockdep register
Sage Weil
01:54 PM Bug #5634: auth startup reports "ObjectNotFound" when keyring file is unreadable
Sage Weil
01:54 PM Bug #6629 (Won't Fix): fd cache and external changes to recently-modified files don't behave nicely
Samuel Just
01:53 PM Bug #6686 (Resolved): segfault in prioritized queue dequeue
commit:44028983 Sage Weil
01:52 PM Bug #6684 (Rejected): osd/PGLog.cc: 402: FAILED assert(log.head >= olog.tail && olog.head >= log....
Samuel Just
01:51 PM Bug #6826 (Duplicate): Non-equal performance of 'freshly joined' OSDs
Samuel Just
01:50 PM Bug #6826: Non-equal performance of 'freshly joined' OSDs
Probably related to snap trimming on newly clean new osds. Samuel Just
01:50 PM Bug #6944: objecter: localized read for missing objects polls replicas instead of waiting for pri...
Can you ensure this is fixed in your current objecter work? Ian Colle
01:49 PM Bug #6833 (Can't reproduce): `/etc/init.d/ceph status` occasionally exits silently
Sage Weil
01:49 PM Bug #3569 (Can't reproduce): Monitor & OSD failures when an OSD clock is wrong
Samuel Just
01:48 PM Bug #7044 (Can't reproduce): Segmentation fault rados suite on master
Sage Weil
01:48 PM Bug #6797 (Won't Fix): ceph osd out does not migrate properly
Sage Weil
01:47 PM Bug #7161 (Can't reproduce): rados api test LibRadosMisc.Exec failed on next branch
Sage Weil
01:46 PM Bug #7335 (Won't Fix): librbd does not raise "Object Not Found", instead returning NUL bytes
Sage Weil
01:45 PM Bug #7350 (Won't Fix): osd: scrub does not detect recently touched and then renamed backend files
Sage Weil
01:43 PM Bug #7593 (Resolved): Disk saturation during PG folder splitting
Sage Weil
01:41 PM Bug #7688 (Won't Fix): warn at fs/btrfs/extent-tree.c:5748 __btrfs_free_extent+0x9ce/0xa20
Sage Weil
01:41 PM Bug #7520 (Resolved): Lock contention during scrubbing which could potentially hang the OSD for a...
Sage Weil
01:41 PM Bug #7868 (Can't reproduce): "failed to recover before timeout expired" in powercycle-firefly---b...
Sage Weil
01:39 PM Bug #7936 (Can't reproduce): "failed: rados" in upgrade:dumpling-x:parallel-firefly-distro-basic-...
Sage Weil
01:37 PM Bug #7957 (Resolved): "[ERR] scrub mismatch" in upgrade:dumpling-emperor-x:parallel-firefly-testi...
Sage Weil
01:36 PM Bug #7968 (Won't Fix): ImportError occurred when run command 'ceph -v'
In order to get the Python stuff installed properly you should use a package, or install the stuff in ceph.git/src/p... Sage Weil
01:34 PM rgw Bug #8016 (Resolved): "testPrefixAndLimit (test.functional.tests.TestContainerUTF8) ... ERROR" in...
Sage Weil
09:44 AM rgw Bug #8016: "testPrefixAndLimit (test.functional.tests.TestContainerUTF8) ... ERROR" in upgrade:du...
I think the 120s time just isn't long enough. Let's make it 300s (here and in the other thrashing/upgrade test). git... Sage Weil
01:33 PM Bug #7218 (Resolved): Displaying wrong number of pools with ceph -s after removing a pool
Sage Weil
01:33 PM Bug #6689 (Resolved): osd: remove_redundant_pg_temp() can be slow on big clusters
Sage Weil
01:32 PM Bug #7549 (Won't Fix): Mon deadlock
This is most likely a bug in the older libgoogle-perftools* which is part of the Precise Ubuntu distribution. Either... David Zafman
01:21 PM rgw Bug #7815 (Can't reproduce): Test failed in upgrade:dumpling-x:parallel-firefly-testing-basic-pla...
Sage Weil
01:19 PM Bug #7068 (Can't reproduce): os/FileStore.cc: 4035: FAILED assert(omap_attrs.size() == omap_aset....
Samuel Just
01:17 PM Bug #6756: journal full hang on startup
Samuel Just
01:14 PM Bug #7398: osd: ERANGE from clone
might be dup of #7916 Sage Weil
01:14 PM Bug #6003 (Need More Info): journal Unable to read past sequence 406 ...
Sage Weil
01:13 PM Bug #7858 (Resolved): agent with snaps ceph_test_rados error
Samuel Just
01:13 PM Bug #7916 (Need More Info): ceph_test_rados got ENOENT on ec pool + thrashing
Sage Weil
01:08 PM Bug #7659 (Resolved): osd/ReplicatedPG.cc: 6751: FAILED assert(attrs || !pg_log.get_missing().is_...
David Zafman
01:08 PM Bug #8019 (Pending Backport): os/JournalingObjectStore.cc: 121: FAILED assert(op > committed_seq)...
Sage Weil
01:07 PM Bug #8019 (Resolved): os/JournalingObjectStore.cc: 121: FAILED assert(op > committed_seq) on wheezy
Sage Weil
12:13 PM Bug #8019 (Fix Under Review): os/JournalingObjectStore.cc: 121: FAILED assert(op > committed_seq)...
Sage Weil
01:06 PM Bug #7710 (In Progress): Multiple rados bench instance will overwrite the metadata object
This went through some review and is waiting on a respin. Greg Farnum
12:49 PM Bug #7891 (In Progress): osd: leaked pg refs on shutdown
Sage Weil
12:35 PM RADOS Fix #8035 (New): OSD: must guarantee we are newer than Objecter reply send epochs
Because OSDs are now normal clients of each other in some circumstances, we've broken our map synchronization guarant... Greg Farnum
12:33 PM devops Bug #6726: Official packages do not appear to be available for Saucy
There was a problem with our repo generator script for release builds which was causing even the new releases to not ... Sandon Van Ness
02:57 AM devops Bug #6726: Official packages do not appear to be available for Saucy
There should be, as saucy/trusty was built for that release; I was involved with it today. I will find out what happ... Sandon Van Ness
02:00 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Sandon Van Ness wrote:
> Since this ticket was opened our release build has been changed to include saucy/trusty so ...
Tim Bishop
01:32 AM devops Bug #6726: Official packages do not appear to be available for Saucy
There might be some confusion here. We have 'gitbuilders' that yes have been building them for some time that do nigh... Sandon Van Ness
01:24 AM devops Bug #6726: Official packages do not appear to be available for Saucy
I understand but that's not the issue. They have always been built, but the problem is that they are not being publis... Tom Verdaat
12:50 AM devops Bug #6726: Official packages do not appear to be available for Saucy
All new releases are being built for trusty/saucy. It is not super high priority at the moment to rebuild all our old... Sandon Van Ness
12:43 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Any progress on this bug, Sandon? Tom Verdaat
12:26 PM Bug #8001 (Fix Under Review): hung recovery; pg 3.f disappeared
Sage Weil
12:20 PM devops Bug #8034 (Resolved): ceph-deploy should run sudo yum clean all after installing ceph-release rpm
Otherwise you will get an unexpected old version from a previous ceph-release install after someone attempts to do a ... Sandon Van Ness
12:12 PM CephFS Bug #8026 (Resolved): shared pointer completely breaks multiple mds
Sage Weil
07:04 AM CephFS Bug #8026: shared pointer completely breaks multiple mds
Zheng Yan
12:04 AM CephFS Bug #8026 (Resolved): shared pointer completely breaks multiple mds
Zheng Yan
Zheng Yan
12:06 PM Feature #7437: EC: add adapt unittest teuthology task and add to nightly
David Zafman
12:05 PM devops Bug #5338 (Resolved): need rpm packages built for libapache-mod-fastcgi
This is complete. We have been doing these in jenkins for a while now. Sandon Van Ness
11:52 AM devops Bug #5338: need rpm packages built for libapache-mod-fastcgi
Ian Colle
12:04 PM rgw Feature #6678 (Resolved): rgw: reject writes to secondary zones
Pushed to dumpling, commit:b29238729f87c73dfdcf16dddcf293577678dea2 Yehuda Sadeh
11:56 AM devops Feature #7925: Feature: create new download.ceph.com site
I believe after a discussion with Neil/Ian it was decided this was going to be on the back-burner for a bit so unassi... Sandon Van Ness
11:55 AM Bug #8031 (Fix Under Review): osd/ReplicatedPG.cc: 405: FAILED assert(needs_recovery)
Sage Weil
09:37 AM Bug #8031: osd/ReplicatedPG.cc: 405: FAILED assert(needs_recovery)
Sage Weil
08:28 AM Bug #8031 (Resolved): osd/ReplicatedPG.cc: 405: FAILED assert(needs_recovery)
ubuntu@teuthology:/a/teuthology-2014-04-07_23:01:05-kcephfs-master-testing-basic-plana/178021... Sage Weil
11:53 AM Feature #7792: leveldb 1.12.0 for rhel
After building 1.12 we later decided to take the package down. Until I get updates on what we are doing about this, it is... Sandon Van Ness
11:50 AM devops Feature #6098: put teuthology.front.sepia.ceph.com apache configuration files under source control
Ian Colle
10:47 AM Feature #8033 (New): Epic: Kerberos/LDAP Support
Users with existing LDAP or AD systems would like to integrate them into the cephx system so authentication and autho... Neil Levine
10:34 AM Bug #5818: leveldb 1.12: hang on shutdown (mon)
observed this again on leveldb 1.12:... Sage Weil
10:32 AM Bug #8007: osd: hang on shutdown with valgrind on trusty
nevermind, the second instance is a mon shutdown and it is a leveldb 1.12 compaction vs shutdown race. Sage Weil
10:20 AM Bug #8007: osd: hang on shutdown with valgrind on trusty
saw this on precise:
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-07_23:00:18-rgw-master-test...
Sage Weil
10:09 AM Bug #6789: cannot remove the leader when there only are two monitors
Cool :-) Loïc Dachary
09:34 AM Bug #6789 (Fix Under Review): cannot remove the leader when there only are two monitors
Joao Eduardo Luis
09:32 AM Bug #6789: cannot remove the leader when there only are two monitors
https://github.com/ceph/ceph/pull/1624 Joao Eduardo Luis
09:28 AM Bug #6789: cannot remove the leader when there only are two monitors
Also, it's relevant to mention that this does not happen only with the leader. Any monitor that is removed from the ... Joao Eduardo Luis
09:23 AM Bug #6789 (In Progress): cannot remove the leader when there only are two monitors
I was wrong. This does happen on current, and emperor, and dumpling.
The monitor has this feature that allows hi...
Joao Eduardo Luis
09:49 AM Bug #8022 (Duplicate): coredumps found in librbd tests
Sage Weil
09:49 AM Bug #7995: osd shutdown: ./common/shared_cache.hpp: 93: FAILED assert(weak_refs.empty())
ubuntu@teuthology:/a/teuthology-2014-04-06_01:10:23-ceph-deploy-firefly-distro-basic-vps/173432 Sage Weil
09:40 AM Bug #8021: osd: ENOENT on clone on dumpling
I think this has been fixed on firefly but not dumpling.... Sage Weil
09:27 AM Linux kernel client Bug #7954: misdirected op
teuthology-2014-04-05_23:05:02-krbd-firefly-testing-basic-plana/173341
ubuntu-2014-04-05_23:18:18-kcephfs-master-tes...
Sage Weil
09:18 AM rbd Bug #8030 (Duplicate): krbd,kcephfs: misdirected request
Sage Weil
09:17 AM rbd Bug #8030: krbd,kcephfs: misdirected request
To be explicit: the error is -ENXIO, returned by the OSD when the write was not sent to the correct primary (and the ... Josh Durgin
08:19 AM rbd Bug #8030 (Duplicate): krbd,kcephfs: misdirected request
ubuntu@teuthology:/a/teuthology-2014-04-07_23:00:55-krbd-master-testing-basic-plana/178000... Sage Weil
09:10 AM rgw Bug #7799: Errors in upgrade:dumpling-x:stress-split-firefly---basic-plana suite
Yuri - have we seen this in a while? We believe it's fixed, but want to confirm. Ian Colle
09:06 AM devops Feature #7171 (Resolved): rbdmap should be part of ceph-common
commit:17732dc0c8878ea58813ad543c5359cb811079cc Josh Durgin
09:04 AM rbd Bug #6480: librbd crashed qemu-system-x86_64
Ian Colle
08:29 AM Revision 55cfb142 (ceph): mds: fix shared_ptr MDRequest bugs
The main change is to use shared_ptr instead of weak_ptr to define
active request map. The reason is that slave request ...
Yan, Zheng
07:55 AM devops Bug #8027 (Can't reproduce): Ceph v0.79 Firefly RC :: erasure-code-profile command set not presen...
Alfredo Deza
07:53 AM devops Bug #8027: Ceph v0.79 Firefly RC :: erasure-code-profile command set not present for CentOS RPM
This looks like a problem with an unclean upgrade of sorts because I cannot replicate this problem at all.
Package...
Alfredo Deza
03:07 AM devops Bug #8027 (Can't reproduce): Ceph v0.79 Firefly RC :: erasure-code-profile command set not presen...
I have been using 0.78 in order to test EC and TP. With 0.78 I was not able to test the erasure code profile feature ( ... karan singh
07:52 AM Bug #7991: ceph-mon crash
There is no evidence of a crash in the logs.
One of the monitors appears to be working fine.
The other monitor ...
Joao Eduardo Luis
07:47 AM Revision 2ba00bd2 (ceph): erasure-code: thread-safe initialization of gf-complete
Instead of relying on an implicit initialization happening during
encoding/decoding with galois.c:galois_init_default...
Loïc Dachary
06:59 AM CephFS Bug #3424: java: Add the correct JUnit package dependencies on supported platforms and ensure the...
Ian Colle
06:51 AM CephFS Bug #8025: nfs-on-kclient: rm -r failed
Zheng Yan
06:40 AM Revision 0022d912 (ceph): mailmap: Paul Meserve affiliation
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
06:27 AM Revision f261ad1e (ceph): mailmap: Michael Nelson affiliation
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
06:23 AM Revision 89a5ec78 (ceph): mailmap: Sandon Van Ness name normalization
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
04:27 AM Revision 2a6d962f (ceph): Merge pull request #1610 from ceph/wip-4354-shared_ptr
Use shared pointers for Mutations/OpRequests in the MDS
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:27 AM Revision 9ea31f15 (ceph): Merge pull request #1594 from ceph/wip-7958
wip 7958
Passed sage-2014-04-07_07:04:02-fs-wip-7958-testing-basic-plana.
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:55 AM Revision be616c62 (ceph): mon: wait for quorum for MMonGetVersion
We should not respond to checks for map versions when we are in the
probing or electing states or else clients will g...
Sage Weil
03:55 AM Revision 051a17eb (ceph): mon: wait for quorum for MMonGetVersion
We should not respond to checks for map versions when we are in the
probing or electing states or else clients will g...
Sage Weil
03:55 AM Revision 85802336 (ceph): rgw: fix swift range response
Fixes: #7099
Backport: dumpling
The range response header was broken in swift.
Reported-by: Julien Calvet <julien.ca...
Yehuda Sadeh
03:55 AM Revision 8f804391 (ceph): rgw: don't log system requests in usage log
Fixes: 6889
System requests should not be logged in the usage log.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
(...
Yehuda Sadeh
03:53 AM Revision 0716516d (ceph): rgw: fix swift range response
Fixes: #7099
Backport: dumpling
The range response header was broken in swift.
Reported-by: Julien Calvet <julien.ca...
Yehuda Sadeh
03:52 AM Revision 94a1deef (ceph): rgw: don't log system requests in usage log
Fixes: 6889
System requests should not be logged in the usage log.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
(...
Yehuda Sadeh
12:04 AM Revision 010dff12 (ceph): osd_types: fix pg_stat_t::encode, object_stat_sum_t::decode version
Introduced in a130a4452e4fb159dc62fb417077d98dc9ebd621
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by:...
Samuel Just
12:00 AM Revision 6c3fc3ee (ceph): Locker: use raw MutationImpl* instead of MutationRef in several places
Sadly, you can't implicitly convert non-const references to shared pointers, so avoid the atomic ops necessary when c... Greg Farnum
12:00 AM Revision 5f2ccab6 (ceph): MDCache: use raw MutationImpl* instead of MutationRef in a few places
Avoid the atomic ops necessary when copying a shared_ptr.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
12:00 AM Revision 22894c3a (ceph): SimpleLock: Switch MutationRef& for MutationRef in get_xlock()
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision 6b76998d (ceph): MDCache: use MutationRef instead of raw pointers
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision a5c7b27d (ceph): MDCache: use a null_ref instead of NULL in a few places
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision 1294e55e (ceph): MDCache: fix users of active_requests for use of shared_ptr
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision e800bfa9 (ceph): Locker: remove Mutation param from xlock_import
It's not used.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
12:00 AM Revision 099d1d6f (ceph): Locker: Use MutationRef instead of raw pointers
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision 920fd6ca (ceph): Locker: use a null_ref instead of NULL
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision 4dedab64 (ceph): MDS: switch cache object classes to use MutationRef instead of raw poin...
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
12:00 AM Revision a6a08000 (ceph): Server: use MutationRef instead of raw pointer
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum

04/07/2014

11:31 PM Revision 9a4a429e (ceph): ceph_test_rados_api_misc: print osd_max_attr_size
Very confusing results from this test in bug #8009.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:11 PM Revision 91a9d574 (ceph): Merge pull request #1612 from ceph/wip-7919
mon: MonCommands: have all 'auth' commands require 'execute' caps
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
11:09 PM Revision aa301b0e (ceph): Merge pull request #1620 from ceph/wip-8003
Wip 8003
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
11:02 PM Revision a0c1952d (ceph): ceph.conf: don't force lockdep on mds
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:59 PM Revision aff20f5e (ceph): Merge pull request #1611 from ceph/wip-7975
osd: disable agent when stats_invalid (post-split)
Reviewed-by: Samuel Just <sam.just@inktank.com>
Sage Weil
10:49 PM Revision 4b66868e (ceph): doc: Removed --stable arg and replaced with --release arg for ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:40 PM Revision fa46aec7 (ceph): osd/ReplicatedPG: kickstart the agent if scrub stats become valid
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:40 PM Revision 02d7e84d (ceph): osd/ReplicatedPG: dump agent state on pg query
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:40 PM Revision 7273d9ea (ceph): osd/ReplicatedPG: warn if invalid stats prevent us from activating agent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:21 PM Linux kernel client Bug #8024: kclient: misdirected osd request
Maybe caused by:... Zheng Yan
09:55 PM Linux kernel client Bug #8024 (Resolved): kclient: misdirected osd request
teuthology:/a/teuthology-2014-04-06_23:01:04-kcephfs-master-testing-basic-plana/175847... Greg Farnum
10:08 PM CephFS Bug #8025 (Resolved): nfs-on-kclient: rm -r failed
teuthology-2014-04-06_23:01:11-knfs-master-testing-basic-plana/175859/... Greg Farnum
10:04 PM Revision 51839a33 (ceph): Merge tag 'v0.79' into firefly
v0.79 Sage Weil
09:47 PM Revision 9870b75b (ceph): Merge pull request #1619 from ceph/wip-7659
Wip 7659
Reviewed-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just
09:34 PM Revision 67b2342a (ceph): ReplicatedPG::trim_object: account evicted prev clone for stats
If the previous clone is evicted, we shouldn't adjust the stats to
account for its new clone_overlap value.
Fixes: #...
Samuel Just
09:34 PM Revision 07099ecd (ceph): ReplicatedPG: do not evict head while clone is being promoted
Fixes: #8003
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
09:33 PM Revision 6ff6c19d (ceph): ReplicatedPG::make_writeable: check for evicted clone before adjusting ...
Fixes: #7964
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
09:26 PM CephFS Bug #7958 (Resolved): ceph-fuse+fsx umount hang on leaked inode reference
Sage Weil
09:20 PM Bug #7666 (Duplicate): librados: lock cycle on shutdown
Sage Weil
09:20 PM Bug #7376 (Resolved): mon: >10s spent in remove_redundant_pg_temp
208959a0dcacba40116730702021090a24865eb3 Sage Weil
09:17 PM CephFS Feature #7319 (Resolved): qa: multimds, no failure
Sage Weil
09:17 PM Feature #5437 (Resolved): ceph-mon performance on ARM
Sage Weil
09:17 PM Feature #2088 (Rejected): msgr: refactor 2 threads to one
Sage Weil
09:13 PM CephFS Bug #7739 (Resolved): mds: uninitialized field in message
Sage Weil
09:02 PM Revision afff0ef4 (ceph): Merge pull request #1617 from ceph/wip-7904
Wip 7904
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:01 PM Revision 008ce85d (ceph): Merge pull request #1614 from ceph/wip-7964
Wip 7964
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
08:59 PM Revision 71fc7ae3 (ceph): Merge pull request #1616 from ceph/wip-7916
ReplicatedPG: improve get_object_context debugging
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
08:55 PM Bug #7997 (Resolved): handle_get_version returns old map epochs
Sage Weil
08:14 AM Bug #7997 (Pending Backport): handle_get_version returns old map epochs
Sage Weil
08:53 PM rgw Bug #6889 (Resolved): rgw: usage log: don't log system user operations
Sage Weil
08:53 PM rgw Bug #7099 (Resolved): Strange Comportments with media files
Sage Weil
08:51 PM Revision 32862838 (ceph): ReplicatedPG::_scrub: account for clone_overlap on each clone
Otherwise, we end up subtracting off clone_overlap for evicted clones
whose sizes we did not add in.
Fixes: #7964
Si...
Samuel Just
08:51 PM Revision a8a49a03 (ceph): ReplicatedPG: use get_clone_bytes on evict/promote
Fixes: #7964
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:51 PM Revision d0e2c986 (ceph): ReplicatedPG::finish_promote: add debugging assert for clone_size
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:51 PM Revision 4d5db061 (ceph): ReplicatedPG::find_object_context: check obs.exists on clone obc before...
Fixes: #7858
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:49 PM Bug #7994 (Resolved): OSD: share map when sending subops to peers
Sage Weil
11:02 AM Bug #7994 (Pending Backport): OSD: share map when sending subops to peers
Sage Weil
10:58 AM Bug #7994: OSD: share map when sending subops to peers
The simple fix was merged into master in commit:1a9952c60570aa308410c69db0289160f44969b1. Greg Farnum
10:57 AM Bug #7994 (Resolved): OSD: share map when sending subops to peers
Sage Weil
08:49 PM Bug #7736 (Resolved): mon: can expose stale state
Sage Weil
08:49 PM Bug #7738 (Resolved): osd: journal crash on startup on wheezy
Sage Weil
08:49 PM Bug #6992 (Resolved): OSD assert fails after it found it was marked as down by monitor during hig...
Sage Weil
08:49 PM Bug #6909 (Resolved): Incomplete state should retry on Notify
Sage Weil
06:03 PM Bug #8022 (Duplicate): coredumps found in librbd tests
Coredump link http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-06_01:10:23-ceph-deploy-firefly-distro-basic-vps... Alfredo Deza
06:02 PM Revision 23fed8fc (ceph): OSD: _share_map_outgoing whenever sending a message to a peer
This ensures that they get new maps before an op which requires them (that
they would then request from the monitor)....
Greg Farnum
05:57 PM Revision 1a9952c6 (ceph): Merge pull request #1613 from ceph/wip-7994
OSD: _share_map_outgoing whenever sending a message to a peer
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
05:30 PM Revision e213ee15 (ceph): qa: workunits: mon: auth_caps.sh: test 'auth' caps requirements
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
05:17 PM Revision 85a1cf31 (ceph): mon: MonCommands: have all 'auth' commands require 'execute' caps
An earlier patch already has the entity requiring 'execute' caps for
read-only commands. This patch introduces the sam...
Joao Eduardo Luis
04:48 PM Revision 4c2d73a5 (ceph): 0.79
Jenkins
04:43 PM Bug #7996: 0.78: OSD is not suspend-friendly (unresponsive cluster on OSD crash)
Sage Weil wrote:
> I suspect that the mon on that machine is the key factor at play here. There was a fix that went...
Dmitry Smirnov
09:43 AM Bug #7996: 0.78: OSD is not suspend-friendly (unresponsive cluster on OSD crash)
I suspect that the mon on that machine is the key factor at play here. There was a fix that went in just after 0.78 ... Sage Weil
04:42 PM devops Bug #8017: Redhat Dependencies Unmet
From John's log it looks like the dumpling ceph-release rpm (http://ceph.com/rpm-dumpling/el6/noarch/ceph-release-1-0... Josh Durgin
04:39 PM devops Bug #8017: Redhat Dependencies Unmet
RH packages are dependent upon EPEL. Need to build all missing packages and include in repo. Ian Colle
02:57 PM devops Bug #8017 (Duplicate): Redhat Dependencies Unmet
Ian Colle
02:42 PM devops Bug #8017: Redhat Dependencies Unmet
This has been verified as a problem when installing dumpling too.
john@admin-host:~/rgw-validation-cluster$ ceph-...
John Wilkins
02:06 PM devops Bug #8017 (Resolved): Redhat Dependencies Unmet
When attempting to install Ceph Cuttlefish on a bare metal installation of Redhat 6.5, I encountered a series of unme... John Wilkins
04:31 PM Bug #8009 (New): librados failing tests for APILock
I'm totally confused. The test in question *only* runs if osd_max_attr_size != 0, which it is not (defaults to 0, no... Sage Weil
10:53 AM Bug #8009 (In Progress): librados failing tests for APILock
Sage Weil
07:15 AM Bug #8009 (Closed): librados failing tests for APILock
A few failures on ceph-deploy nightly tests:... Alfredo Deza
04:18 PM CephFS Bug #8005 (Rejected): fuse hang
no error in client log, looks like mds was killed by someone Zheng Yan
04:08 PM Bug #8003 (Resolved): head eviction can race with clone promotion
Sage Weil
03:59 PM Bug #7999: osd: pgs share info that hasn't been persisted
Sage Weil
01:20 PM Bug #7999: osd: pgs share info that hasn't been persisted
osd.0 starts a repop:... Sage Weil
03:58 PM Bug #7975 (Resolved): osd: handle inconsistent stats in the osd post split
Sage Weil
03:56 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
"work in progress":https://github.com/ceph/ceph/pull/1621 Loïc Dachary
01:16 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Kevin Greenan writes:
> Using the doc and a few special functions in jerasure, you can ensure that the underlying ...
Loïc Dachary
11:52 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
"gfp_array is a global variable":https://bitbucket.org/jimplank/jerasure/src/80fc5d1d95f06ea4732717b06b42177099cc93c9... Loïc Dachary
11:33 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Interesting: it is in *decode* this time, not *encode* Loïc Dachary
10:54 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
... Sage Weil
12:38 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Ran 27 times without triggering the problem. Will keep running it 100 times more. Loïc Dachary
03:42 PM Revision c45e15fd (ceph): msgr: fix rebind() race
stop the accepter and mark all pipes down before rebind to avoid race
Fixes: #6992
Signed-off-by: Xihui He xihuihe@...
Xihui He
03:41 PM Bug #8021 (Duplicate): osd: ENOENT on clone on dumpling
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-06_22:35:23-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
03:41 PM Revision 3d31cf01 (ceph): PG: retry GetLog() each time we get a notify in Incomplete
If for some reason there are no up OSDs in the history which
happen to have usable copies of the pg, it's possible th...
Samuel Just
03:39 PM Revision 1f80bbdf (ceph): os/FileJournal: return errors on make_writeable() if reopen fails
This is why #7738 is resulting in a crash instead of an error.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry p...
Sage Weil
03:38 PM Revision 62d94229 (ceph): mon/Paxos: commit only after entire quorum acks
If a subset of the quorum accepts the proposal and we commit, we will start
sharing the new state. However, the mon ...
Sage Weil
03:37 PM Revision 2160c72b (ceph): PrioritizedQueue: cap costs at max_tokens_per_subqueue
Otherwise, you can get a recovery op in the queue which has a cost
higher than the max token value. It won't get ser...
Samuel Just
03:37 PM Revision b390b314 (ceph): PrioritizedQueue: cap costs at max_tokens_per_subqueue
Otherwise, you can get a recovery op in the queue which has a cost
higher than the max token value. It won't get ser...
Samuel Just
03:36 PM Revision ede254a4 (ceph): Fix byte-order dependency in calculation of initial challenge
Fixes: #7977
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked ...
Dan Mick
03:36 PM Revision 5e2b6928 (ceph): rbd.cc: tolerate lack of NUL-termination on block_name_prefix
Fixes: #7577
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked ...
Dan Mick
03:35 PM Revision 1dd42e0f (ceph): Fix byte-order dependency in calculation of initial challenge
Fixes: #7977
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked ...
Dan Mick
03:30 PM Bug #8020 (Resolved): evenly split stats on split
At least it's better than what we currently do. Samuel Just
03:30 PM Bug #7967 (Resolved): finish_promote needs to handle the omap flag
Samuel Just
03:22 PM Revision 51da3bb0 (ceph): mds: fix uninit val in MMDSSlaveRequest
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:11 PM Revision db281bf5 (ceph): Merge pull request #1607 from ceph/wip-7997
mon: wait for quorum for MMonGetVersion
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
Sage Weil
03:03 PM Bug #8019 (Resolved): os/JournalingObjectStore.cc: 121: FAILED assert(op > committed_seq) on wheezy
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-06_22:35:23-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
02:22 PM Documentation #7886: What's the policy on URL stability for public documentation?
We could certainly have a "latest-release" link on the website and legislate that to be our 'stable' doc link... Dan Mick
02:15 PM Documentation #7886: What's the policy on URL stability for public documentation?
Getting off master and using the last named release branch name as the default docs root makes total sense. Does sphi... Neil Levine
02:01 PM Documentation #7886: What's the policy on URL stability for public documentation?
The Ceph.com site uses the master branch by default, which changes somewhat frequently. A more stable approach would ... John Wilkins
02:22 PM RADOS Fix #8018 (New): OSD: check if messages are actually handled in ms_dispatch
OSD::ms_dispatch returns "true" no matter what happens. The _dispatch() function which does the real work doesn't eve... Greg Farnum
02:17 PM Bug #7937 (Resolved): [ERR] deep-scrub 5.ds0 79d5820d/burnupi0838757-23/1f7//5 expected clone
Samuel Just
02:16 PM Bug #7985 (Rejected): 2014-04-02T20:36:41.677 INFO:teuthology.task.rados.rados.0.err:[10.214.131....
Samuel Just
02:02 PM rgw Bug #8016 (Resolved): "testPrefixAndLimit (test.functional.tests.TestContainerUTF8) ... ERROR" in...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-06_22:35:23-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
02:02 PM Bug #7904 (Resolved): osd/ReplicatedPG.cc: 10661: FAILED assert(is_active())
Sage Weil
02:01 PM Bug #7964 (Resolved): ceph_test_rados with snaps and caching stat errors
Sage Weil
12:57 PM CephFS Bug #8010: It's impossible to remove unused filesystem pools from a cluster
It's happening with
ceph version 0.78 (f6c746c314d7b87b8419b6e584c94bfe4511dbd4)
Linux access.car.dot.com 3.13.0-...
Maxym Kutsevol
10:07 AM CephFS Bug #8010 (Resolved): It's impossible to remove unused filesystem pools from a cluster
We've inadvertently made it impossible to remove a filesystem from a Ceph cluster. If there is no data in the FS, it... Greg Farnum
10:38 AM Bug #8011 (Resolved): osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= s...
osd/ReplicatedPG.cc: 5244: FAILED assert(soid < scrubber.start || soid >= scrubber.end)
ceph version 0.78-600-g19...
Samuel Just
08:37 AM Bug #7706 (Resolved): osd: PrioritizedQueue can starve
Sage Weil
08:36 AM Bug #7977 (Resolved): cephx has embedded byte-order dependency
Sage Weil
06:26 AM Bug #8008 (Resolved): osd/ReplicatedPG.cc: 258: FAILED assert(missing_loc.needs_recovery(hoid)) d...
Here is the log from crashed OSD:... Dmitry Smirnov
05:36 AM Revision 09586ec2 (ceph): client: pin parent dentry of inode who has ll_ref > 0
This prevents Client:trim_dentry() from unlinking parent dentry of
directory inode referenced by fuse kernel module.
...
Yan, Zheng
12:56 AM Revision 68e27116 (ceph): Merge pull request #1609 from ceph/wip-7739
mds: fix some uninitialized message fields
Reviewed-by: Zheng Yan <zheng.z.yan@intel.com>
Sage Weil
12:36 AM Revision 76cbd5dd (ceph): mds: fix uninit MMDSSlaveRequest lock_type
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil

04/06/2014

11:32 PM Revision c0fd3df4 (ceph): Merge pull request #1608 from ceph/wip-8002
osd: fix osd map subscribe on YOU_DIED osd_ping
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:03 PM Revision 4ea9e481 (ceph): osd: fix map subscription in YOU_DIED osd_ping handler
If we have epoch X and find out we died as of epoch Y, we still want to
request X+1. Among other things, this fixes ...
Sage Weil
09:22 PM CephFS Bug #8005: fuse hang
still looks like MDS was dead Zheng Yan
01:44 PM CephFS Bug #8005 (Rejected): fuse hang
... Sage Weil
08:19 PM Revision 2f7522c8 (ceph): msgr: add ms_dump_on_send option
This is useful only for debugging. The encoded contents of a message are
dumped to the log on message send. This is...
Sage Weil
08:18 PM Revision 87e6a62e (ceph): mds: fix uninitialized fields in MDiscover
Fixes: #7739
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:54 PM Bug #8007: osd: hang on shutdown with valgrind on trusty
... Sage Weil
05:52 PM Bug #8007 (Can't reproduce): osd: hang on shutdown with valgrind on trusty
have seen several runs hang on osd shutdown. trusty. valgrind. here is the gdb thread dump:... Sage Weil
05:40 PM CephFS Bug #8006 (Rejected): fuse hang on flush (icache branch)
The flush hang is because ceph-fuse was umounting (received signal). Umounting can't finish because MDS was dead at th... Zheng Yan
01:49 PM CephFS Bug #8006 (Rejected): fuse hang on flush (icache branch)
... Sage Weil
05:29 PM CephFS Bug #8004: LibCephFS.HardlinkNoOriginal hang
oh, and the 32-bit pointer thing is because ceph-fuse is running under valgrind. Sage Weil
05:27 PM CephFS Bug #8004: LibCephFS.HardlinkNoOriginal hang
seems easy to reproduce, just hit this again with... Sage Weil
01:41 PM CephFS Bug #8004 (Resolved): LibCephFS.HardlinkNoOriginal hang
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-04-05_15:44:13-multimds:verify-wip-ms-dump-testing-basic-pla... Sage Weil
04:32 PM Bug #8002 (Resolved): osds down, but not advancing osdmaps
Samuel Just
04:03 PM Bug #8002 (Fix Under Review): osds down, but not advancing osdmaps
Sage Weil
03:53 PM Bug #8002: osds down, but not advancing osdmaps
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-06_02:30:05-rados-master-testing-basic-plana/173853 Sage Weil
04:05 PM Bug #7986: 3.1s0 scrub stat mismatch, got 2041/2044 objects, 0/0 clones, 2041/2044 dirty, 0/0
ubuntu@teuthology:/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana
Sage Weil
04:05 PM Bug #7998 (Duplicate): 3.2s1 scrub stat mismatch, got 2000/2001 objects, 0/0 clones, 2000/2001 di...
#7986 Sage Weil
01:39 PM Bug #8003 (Resolved): head eviction can race with clone promotion
_verify_no_head_clones will check copy_ops. Samuel Just
01:22 PM CephFS Bug #7739: mds: uninitialized field in message
Sage Weil
06:39 AM rbd Bug #6480: librbd crashed qemu-system-x86_64
Josh, I am now running wip-6480-0.67.7 across our whole infrastructure. No issues yet. Because the race is rare, I th... Mike Dawson
03:03 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The workload is slightly different from the previous one:... Loïc Dachary
02:58 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The teuthology config.yaml... Loïc Dachary
02:39 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The plana08 machine has been recycled and there is no core archived. See the attachment:ceph-osd.0.log.gz for the ful... Loïc Dachary
02:36 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
SSE4 plugin selected and ECX = *029ee3ff* (this is consistent)... Loïc Dachary
02:34 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/170763/remote/ubuntu@plana0... Loïc Dachary

04/05/2014

11:58 PM Revision 67fd4218 (ceph): mon: wait for quorum for MMonGetVersion
We should not respond to checks for map versions when we are in the
probing or electing states or else clients will g...
Sage Weil
09:07 PM Bug #8002 (Resolved): osds down, but not advancing osdmaps
ubuntu@teuthology:/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/170941
...
Sage Weil
08:58 PM Bug #8001 (Resolved): hung recovery; pg 3.f disappeared
ubuntu@teuthology:/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/170892
...
Sage Weil
08:48 PM Bug #7891: osd: leaked pg refs on shutdown
ubuntu@teuthology:/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/171097 Sage Weil
08:33 PM rbd Bug #8000 (Closed): SLAB: Unable to allocate memory on node 0
I'm getting the following kernel errors with ext4 on rbd:... Dmitry Smirnov
05:09 PM Bug #7999: osd: pgs share info that hasn't been persisted
... Sage Weil
05:09 PM Bug #7999 (Resolved): osd: pgs share info that hasn't been persisted
ubuntu@teuthology:/a/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/170880 Sage Weil
04:59 PM Bug #7997 (Fix Under Review): handle_get_version returns old map epochs
Sage Weil
04:45 PM Bug #7997 (In Progress): handle_get_version returns old map epochs
Sage Weil
03:44 PM Bug #7997 (Resolved): handle_get_version returns old map epochs
ubuntu@teuthology:/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/170649... Sage Weil
03:55 PM Bug #7998 (Duplicate): 3.2s1 scrub stat mismatch, got 2000/2001 objects, 0/0 clones, 2000/2001 di...
... Sage Weil
03:45 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
ubuntu@teuthology:/var/lib/teuthworker/archive/gregf-2014-04-04_22:05:49-rados-wip-7994-testing-basic-plana/170763 Sage Weil
01:23 PM Revision a75af4c2 (ceph): client: try shrinking kernel inode cache when trimming session caps
Notify kernel to invalidate top level directory entries. As a side
effect, the kernel inode cache gets shrunk.
Sign...
Yan, Zheng
02:35 AM Revision 82015e40 (ceph): client: release clean pages if no open file want RDCACHE
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
01:15 AM Revision 9484daf9 (ceph): osd: disable agent when stats_invalid (post-split)
After a split the pg stats are approximate but not precisely correct. Any
inaccuracy can be problematic for the agen...
Sage Weil
01:07 AM Revision 6a4c50d7 (ceph): Merge pull request #1605 from ceph/wip-7993
ceph-post-file: use getopt for multiple options, add longopts to help
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil

04/04/2014

11:06 PM Revision 232ac1a5 (ceph): OSD: _share_map_outgoing whenever sending a message to a peer
This ensures that they get new maps before an op which requires them (that
they would then request from the monitor)....
Greg Farnum
10:26 PM Revision 6f40b644 (ceph): ceph-post-file: use getopt for multiple options, add longopts to help
Fixes: #7993
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
10:17 PM Revision ebb865b1 (ceph): Merge pull request #1603 from ceph/wip-7983
osd/ReplicatedPG: do not hit_set_persist while potentially backfilling hit_set_*
Reviewed-by: Samuel Just <sam.just@...
Samuel Just
09:41 PM Revision f2edd959 (ceph): Merge pull request #1604 from ceph/wip-7992
ceph-post-file: fix installation of ssh key files Dan Mick
09:39 PM Revision 2f6a62b4 (ceph): ceph-post-file: fix installation of ssh key files
Fixes: #7992
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:56 PM Revision e02b7f93 (ceph): osd/ReplicatedPG: do not hit_set_persist while potentially backfilling ...
The hit_set transactions may include both a modify of the new hit_set and
deletion of an old one, spanning the backfi...
Sage Weil
08:43 PM Bug #7996 (Won't Fix): 0.78: OSD is not suspend-friendly (unresponsive cluster on OSD crash)
One machine running MON and OSD got suspended.
Shortly after (within seconds) the whole cluster got unresponsive for...
Dmitry Smirnov
07:59 PM Revision 4aef403d (ceph): doc/release-notes: note about emperor backport of mon auth fix
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:57 PM Revision 36e9d80b (ceph): mon: MonCommands.h: have 'auth' read-only operations require 'x' cap
This reintroduces the same semantics that were in place in dumpling prior
to the refactoring of the cap/command match...
Joao Eduardo Luis
07:54 PM Bug #7975 (Fix Under Review): osd: handle inconsistent stats in the osd post split
Sage Weil
02:08 PM Bug #7975 (In Progress): osd: handle inconsistent stats in the osd post split
Sage Weil
07:51 PM Revision db266a3f (ceph): mon: MonCommands.h: have 'auth' read-only operations require 'x' cap
This reintroduces the same semantics that were in place in dumpling prior
to the refactoring of the cap/command match...
Joao Eduardo Luis
07:28 PM CephFS Bug #7980: 0.78: MDS crash (segmentation fault) on client wake-up from suspend.
Works as expected, problem solved, thank you. Dmitry Smirnov
12:18 AM CephFS Bug #7980: 0.78: MDS crash (segmentation fault) on client wake-up from suspend.
Very nice, thank you. I'll test and confirm. Dmitry Smirnov
12:01 AM CephFS Bug #7980 (Resolved): 0.78: MDS crash (segmentation fault) on client wake-up from suspend.
fixed by https://github.com/ceph/ceph/commit/fb72330fb3514be690dc60598242036aa560e023 Zheng Yan
06:07 PM Bug #7993 (Resolved): ceph-post-file can only accept one option
Sage Weil
03:26 PM Bug #7993 (Fix Under Review): ceph-post-file can only accept one option
Dan Mick
03:21 PM Bug #7993 (In Progress): ceph-post-file can only accept one option
Dan Mick
03:02 PM Bug #7993 (Resolved): ceph-post-file can only accept one option
Currently only looks once at options, uses $1 where it means $2, etc. It needs some getopts love. Dan Mick
05:40 PM Revision 9caf3dbc (ceph): Migrator: use a null ref instead of NULL when calling into path_traverse
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:40 PM Revision 3be138f5 (ceph): Mutation: rename to MutationImpl and define MutationRef
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:40 PM Revision c09878e9 (ceph): Mutation: move self_ref into MutationImpl instead of MDRequestImpl
We keep an MDRequestImpl::set_self_ref(MDRequestRef&) function so
that we don't need to do the pointer conversion els...
Greg Farnum
05:40 PM Revision 3429dc59 (ceph): SimpleLock: use MutationRef instead of raw pointers
While we're here, remove the non-const get_xlock_by() (because
we don't need it). Also note we return a full Mutation...
Greg Farnum
05:40 PM Revision 0c9af939 (ceph): Migrator: use MDRequestRef and MutationRef instead of raw pointers
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:39 PM Revision f773307e (ceph): mds: MDRequest: rename to MDRequestImpl, and declare MDRequestRef
We're switching the MDRequest to be used as a shared pointer. This is the
first step on the path to inserting an OpTr...
Greg Farnum
05:39 PM Revision 90ceb7c5 (ceph): MDS: Convert the request_start* functions and their immediate callers
Also, the active_requests mapping gets weak pointers.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
05:39 PM Revision 565b2c89 (ceph): Server: Use MDRequestRef instead of raw pointers
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:39 PM Revision 5872c2d8 (ceph): MDCache: use MDRequestRef instead of MDRequest*
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:39 PM Revision f41a2f87 (ceph): Locker: use MDRequestRef instead of MDRequest*
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:38 PM Revision fd235cdd (ceph): include/memory: add static_pointer_cast
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:34 PM Revision 82d2551c (ceph): Merge pull request #1602 from ceph/wip-cache-create-fix
ReplicatedPG: fix CEPH_OSD_OP_CREATE on cache pools
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:10 PM Bug #7995 (Can't reproduce): osd shutdown: ./common/shared_cache.hpp: 93: FAILED assert(weak_refs...
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-04_02:30:19-rados-master-testing-basic-plana/168602... Sage Weil
05:08 PM Bug #7891: osd: leaked pg refs on shutdown
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-04_02:30:19-rados-master-testing-basic-plana/168600 Sage Weil
11:07 AM Bug #7891: osd: leaked pg refs on shutdown
ubuntu@teuthology:/a/teuthology-2014-04-02_02:30:02-rados-master-testing-basic-plana/161303/remote$
Samuel Just
04:49 PM Revision f68e60ea (ceph): ObjectCacher: assert no waiter when remove buffer head
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
04:49 PM Revision d1216781 (ceph): client: fix null pointer dereference in Client::unlink
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
04:49 PM Revision 4be0b6b1 (ceph): client: cleanup Client::_invalidate_inode_cache()
drop parameter 'keep_caps'
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
04:48 PM Revision abc19dd4 (ceph): client: drop Fr cap before gettattr CEPH_STAT_CAP_SIZE
When MDS receives the getattr request, corresponding inode's filelock
can be in unstable state which waits for client...
Yan, Zheng
04:48 PM Revision 954007e6 (ceph): client: properly retain used caps
Pass properly 'retain' to Client::send_cap() because it is used to
adjust cap->issued.
Also make Client::encode_inod...
Yan, Zheng
04:47 PM Revision 1538a98a (ceph): client: hold Fcr caps during readahead
Fcr caps prevent the file from being truncated.
Fixes: #7958
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
04:47 PM Revision 2d5bd84b (ceph): client: assign implemented caps to caps field of MClientCaps
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
04:47 PM Revision 701c22a8 (ceph): client: implement RDCACHE reference tracking
Make the code able to track Fc caps used by async buffer reads
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
04:30 PM Bug #7994: OSD: share map when sending subops to peers
The quick-and-dirty fix is in wip-7994; I'll run it through the suite as soon as it builds. A (at least slightly) mor... Greg Farnum
04:28 PM Bug #7994 (Resolved): OSD: share map when sending subops to peers
Right now, the OSD doesn't preemptively share maps when sending subops. Fix it. Greg Farnum
04:23 PM Revision b219c8f9 (ceph): ReplicatedPG: fix CEPH_OSD_OP_CREATE on cache pools
The following
./ceph osd pool create data-cache 8 8
./ceph osd tier add data data-cache
./ceph osd tier cache-mode d...
Ilya Dryomov
04:22 PM Revision 2bd548e9 (ceph): Merge pull request #1600 from ceph/wip-7922
Wip 7922
Passes my manual testing and the new teuthology test case.
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:22 PM Revision 3e012252 (ceph): Merge pull request #24 from ceph/wip-7922
rados/singleton-nomsgr: add multi-backfill reservation rejection test Sage Weil
04:21 PM Bug #7984 (Duplicate): osd/ReplicatedPG.cc: 2273: FAILED assert(p != snapset.clones.end())
Samuel Just
04:06 PM Bug #7984 (In Progress): osd/ReplicatedPG.cc: 2273: FAILED assert(p != snapset.clones.end())
Samuel Just
10:29 AM Bug #7984: osd/ReplicatedPG.cc: 2273: FAILED assert(p != snapset.clones.end())
ubuntu@teuthology:/a/teuthology-2014-04-02_02:30:02-rados-master-testing-basic-plana/161266/remote Samuel Just
10:29 AM Bug #7984 (Duplicate): osd/ReplicatedPG.cc: 2273: FAILED assert(p != snapset.clones.end())

ceph version 0.78-522-gedb8a59 (edb8a5965e72b6173d3f88d1a63c8b3ca1b9235c)
1: (ReplicatedPG::trim_object(hobject_...
Samuel Just
03:16 PM Bug #7983 (Resolved): osd: erroneously present object
Samuel Just
02:07 PM Bug #7983 (Fix Under Review): osd: erroneously present object
Sage Weil
01:13 PM Bug #7983 (In Progress): osd: erroneously present object
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-04_02:30:19-rados-master-testing-basic-plana/168472 Sage Weil
08:31 AM Bug #7983 (Resolved): osd: erroneously present object
ubuntu@teuthology:/a/teuthology-2014-04-03_02:30:03-rados-firefly-distro-basic-plana/166349 Sage Weil
02:53 PM Bug #7992 (Resolved): ceph-post-file keys are wrong
Sage Weil
02:37 PM Bug #7992 (Resolved): ceph-post-file keys are wrong
Snippet of install script in Makefile.am installs the known_hosts file to all three
of known_hosts, id_dsa, and id_d...
Dan Mick
02:28 PM Bug #7991: ceph-mon crash
Logs have been uploaded via cephdrop@ceph.com in issue7991 folder. Thanks.
Cluster details:
Ubuntu 12.04 - 3 x ...
Andrei Mikhailovsky
02:11 PM Bug #7991 (Rejected): ceph-mon crash
I've had an issue with crashing ceph-mon. It happened twice over the course of the last two weeks. Attached are the ceph-... Andrei Mikhailovsky
01:26 PM rgw Feature #7990 (New): RGW: Ldap Integration
Users with existing LDAP systems would like to be able to configure RGW so that authentication and authoriz... Neil Levine
01:21 PM Feature #7988 (Resolved): Logs: Log every administrative action taken by a user
Many enterprise users have strict security policies which require that all events generated by a user are explicitly ... Neil Levine
12:59 PM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
merged to master/firefly. backported to emperor too, along with a release note. Sage Weil
12:55 PM Fix #7919 (Resolved): mon: prevent clients with a read cap from reading the full keyring
Sage Weil
11:59 AM Bug #7987 (Duplicate): osd: backfill/recovery makes no progress
ubuntu@teuthology:/a/teuthology-2014-04-02_02:30:02-rados-master-testing-basic-plana/161007/remote$
At least, tha...
Samuel Just
11:04 AM Bug #6756: journal full hang on startup
ubuntu@teuthology:/a/teuthology-2014-04-02_02:30:02-rados-master-testing-basic-plana/161291/remote Samuel Just
10:50 AM Bug #7986: 3.1s0 scrub stat mismatch, got 2041/2044 objects, 0/0 clones, 2041/2044 dirty, 0/0
Base pool, issue with copy_from? Samuel Just
10:50 AM Bug #7986 (Can't reproduce): 3.1s0 scrub stat mismatch, got 2041/2044 objects, 0/0 clones, 2041/2...
duration: 1915.8093299865723
failure_reason: '"2014-04-02 21:31:21.281062 osd.4 10.214.133.26:6812/11657 29 : [ERR]
...
Samuel Just
10:45 AM Bug #7985 (Rejected): 2014-04-02T20:36:41.677 INFO:teuthology.task.rados.rados.0.err:[10.214.131....
ubuntu@teuthology:/a/teuthology-2014-04-02_02:30:02-rados-master-testing-basic-plana/160986/remote
2014-04-02T20:3...
Samuel Just
09:47 AM devops Bug #7981 (Resolved): chef fails to install libleveldb1
This was fixed a few hours after that test ran on the 2nd. Sandon Van Ness
06:17 AM devops Bug #7981 (Resolved): chef fails to install libleveldb1
Not sure if this is because the package is just not there or because there was a network hiccup when the tests ran
...
Alfredo Deza
09:23 AM Bug #7576 (Fix Under Review): osd: large skew in pg epochs (dumpling)
Sage Weil
09:21 AM Bug #7922 (Resolved): osd: multi-backfill reservation does not release on reject
Sage Weil
08:58 AM Support #7609: http://tracker.ceph.com/account/register returns 500 Internal error
someone ran into it again Loïc Dachary
08:30 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
This didn't trigger at all in yesterday's run. :/ Sage Weil
08:15 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
To find the failed ones:... Loïc Dachary
08:29 AM Bug #7892: osd/ReplicatedPG.cc: 7881: FAILED assert((data_included.empty() && data.length() == 0)...
ubuntu@teuthology:/a/teuthology-2014-04-03_02:30:03-rados-firefly-distro-basic-plana Sage Weil
05:17 AM rbd Bug #6480: librbd crashed qemu-system-x86_64
This may help somehow - recently I've started to collect perf samples via an endless loop on selected hosts and some hos... Andrey Korolyov
05:13 AM Revision be8b2281 (ceph): osd: Send REJECT to all previously acquired reservations
When getting a REJECT from a backfill target, tell already GRANTed targets to
go back to RepNotRecovering state by se...
David Zafman
01:28 AM Revision 4dc62669 (ceph): Fix byte-order dependency in calculation of initial challenge
Fixes: #7977
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Dan Mick
01:28 AM Revision 18201efd (ceph): doc/release-notes: v0.79 release notes
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:11 AM Revision b97c3803 (ceph): rados/thrash: reject backfill reservations sometimes while thrashing
Sage Weil
01:08 AM Revision 946a9685 (ceph): rados/singleton-nomsgr: add multi-backfill reservation rejection test
This triggers #7922.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:53 AM Revision 6cb50d74 (ceph): ReplicatedPG::_delete_oid: adjust num_object_clones
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:53 AM Revision 0f2ab4dd (ceph): ReplicatedPG::agent_choose_mode: improve debugging
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:44 AM Revision 80a1ed8a (ceph): Merge pull request #1599 from ceph/wip-7978
rgw: only look at next placement rule if we're not at the last rule
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
12:36 AM rbd Bug #5876: Assertion failure in rbd_img_obj_callback() : rbd_assert(which >= img_request->next_co...
Just in case, the fix you are running with is now in 3.14. However we
are still working on a better fix, so we'll k...
Ilya Dryomov
12:18 AM rbd Bug #5876: Assertion failure in rbd_img_obj_callback() : rbd_assert(which >= img_request->next_co...
I haven't got this problem anymore, it seems really stable for me now. Thanks!
I think the issue can be marked as r...
Olivier Bonvalet

04/03/2014

11:47 PM CephFS Bug #7980 (Resolved): 0.78: MDS crash (segmentation fault) on client wake-up from suspend.
MDS crashes (segmentation fault) when I wake-up machine with CephFS (mounted using kernel client) from suspend:
...
Dmitry Smirnov
10:54 PM Bug #7922 (Fix Under Review): osd: multi-backfill reservation does not release on reject
David Zafman
06:08 PM Bug #7922: osd: multi-backfill reservation does not release on reject
... Sage Weil
03:29 PM Bug #7922: osd: multi-backfill reservation does not release on reject
I reproduced this on my first try with the patch in wip-7922.... Sage Weil
10:15 PM Revision 0552ecba (ceph): rgw: only look at next placement rule if we're not at the last rule
Fixes: #7978
We tried to move to the next placement rule, but we were already at the
last one, so we ended up looping...
Yehuda Sadeh
10:11 PM Revision 0f6c92f0 (ceph): Merge pull request #23 from ceph/wip-7771-wusui
Fixed dumpling - emperor upgrade suites Sage Weil
09:22 PM Revision 39166b2d (ceph): upgrade/dumpling-x: run dumpling version of s3tests
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:04 PM Revision eb23ac46 (ceph): ReplicatedPG::agent_choose_mode: use num_user_objects for target_max_by...
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:04 PM Revision cc9ca67a (ceph): ReplicatedPG::agent_choose_mode: exclude omap objects for ec base pool
Fixes: #7831
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:04 PM Revision a130a445 (ceph): osd/: track num_objects_omap in pg stats
Fixes: #7831
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:03 PM Revision 9894a55d (ceph): ReplicatedPG: handle FLAG_OMAP on promote and copyfrom
Fixes: #7967
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:02 PM Revision a60b50d6 (ceph): Fixed dumpling - emperor upgrade suites
Reformatted many of the yaml files.
Fixed duplication of workunit definitions.
Do not do install.upgrade on clients.
...
Warren Usui
07:53 PM Revision 78e9813c (ceph): ReplicatedPG: do not create snapdir on head eviction
Head eviction implies that no clones are present. Also, add
an exists flag to SnapSetContext in order to prevent an ssc...
Samuel Just
07:53 PM Revision a11b3e8d (ceph): ReplicatedPG::do_op: use get_object_context for list-snaps
find_object_context provides some niceties which we don't need since we know
the oid of the clones. Problematically,...
Samuel Just
07:20 PM Revision c66b61f9 (ceph): rbd: return 0 and an empty list when pool is entirely empty
rbd_list will return -ENOENT when no rbd_directory object
exists. Handle this in the cli tool and interpret it as suc...
Josh Durgin
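The behavior described above can be sketched as follows. This is an illustration in Python under stated assumptions, not the actual cli code: `list_images` and `empty_pool` are hypothetical names, and the listing call is modelled as returning a `(return_code, names)` pair. A return of `-ENOENT` is mapped to success with an empty list, while every other result passes through unchanged.

```python
import errno
from typing import Callable, List, Tuple

ListResult = Tuple[int, List[str]]


def list_images(raw_list: Callable[[], ListResult]) -> ListResult:
    """Wrap a listing call, mapping -ENOENT to success with no images."""
    ret, names = raw_list()
    if ret == -errno.ENOENT:
        # No directory object exists yet, so the pool is simply empty.
        return 0, []
    return ret, names


def empty_pool() -> ListResult:
    # Hypothetical stand-in for a listing call on a pool that has
    # no rbd_directory object.
    return -errno.ENOENT, []


ret, names = list_images(empty_pool)
print(ret, names)
```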
07:06 PM Revision 31df91e0 (ceph): osd: add 'osd debug reject backfill probability' option
This will make the OSD randomly reject backfill reservation requests. This
exercises the failure code paths but does...
Sage Weil
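A rough sketch of what a probability-driven rejection looks like, in Python and not taken from the OSD code. The function name `should_reject` and the explicit `rng` parameter are assumptions made for illustration. Passing a seeded generator keeps a test run repeatable.

```python
import random


def should_reject(probability: float, rng: random.Random) -> bool:
    """Return True with the given probability (0.0 never, 1.0 always)."""
    # rng.random() yields a float in [0.0, 1.0), so the comparison is
    # never true for 0.0 and always true for 1.0.
    return rng.random() < probability


rng = random.Random(0)  # seeded so the sequence is reproducible
rejected = sum(should_reject(0.5, rng) for _ in range(1000))
print(f"rejected {rejected} of 1000 simulated requests")
```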
06:45 PM Revision 90c4540b (ceph): Merge pull request #1598 from ceph/wip-test-alloc-hint-ec-fix
qa: test_alloc_hint: set ec ruleset-failure-domain to osd
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
06:44 PM Revision 9f41975c (ceph): Merge pull request #1581 from ceph/wip-init
a few deb changes Sage Weil
06:42 PM Revision 267307e6 (ceph): upgrade/dumpling-emperor-x: do not warn about tunables
We do the same for dumpling-x.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:42 PM Revision a5a5ba3c (ceph): upgrade/dumpling-emperor-x: ignore 'scrub mismatch' from mon
The encoded full osdmaps can differ due to a series of issues that were
fixed well after emperor. We also whitelist ...
Sage Weil
06:30 PM Bug #7977 (Pending Backport): cephx has embedded byte-order dependency
Sage Weil
01:56 PM Bug #7977 (Resolved): cephx has embedded byte-order dependency
Calculation of the original session key is byte-order-dependent; cephx_calc_client_server_challenge gets a message di... Dan Mick
05:49 PM Bug #7964: ceph_test_rados with snaps and caching stat errors
Samuel Just
01:04 PM Bug #7964 (Rejected): ceph_test_rados with snaps and caching stat errors
Samuel Just
05:43 PM rgw Bug #7978 (Resolved): rgw: infinite loop when iterating multipart object
Sage Weil
03:17 PM rgw Bug #7978 (Fix Under Review): rgw: infinite loop when iterating multipart object
Yehuda Sadeh
03:13 PM rgw Bug #7978 (Resolved): rgw: infinite loop when iterating multipart object
This happens when the object created has a final placement rule with part_size > 0, e.g.,... Yehuda Sadeh
05:16 PM Revision d3236340 (ceph): qa: test_alloc_hint: set ec ruleset-failure-domain to osd
Create a custom profile with ruleset-failure-domain=osd. (The default
ruleset-failure-domain=host won't do because t...
Ilya Dryomov
04:57 PM Revision 60d19756 (ceph): Merge pull request #1593 from dachary/wip-vstart-erasure-code-default
vstart: set a sensible default for ruleset-failure-domain
Reviewed-by: Ilya Dryomov <ilya.dryomov@inktank.com>
Sage Weil
04:57 PM Revision cdcd8368 (ceph): Merge pull request #1596 from ceph/wip-vstop-unmap
Unmap rbd images when stopping the whole cluster
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
02:14 PM Revision afc5dc53 (ceph): stop.sh: do not trace commands
Command tracing here doesn't bring any value and simply pollutes the
terminal, as the script always runs to completio...
Ilya Dryomov
02:14 PM Revision 8e46fe00 (ceph): stop.sh: unmap rbd images when stopping the whole cluster
Unmap rbd images when stopping the whole cluster. Not doing so results
in images that cannot be unmapped until the s...
Ilya Dryomov
02:03 PM Revision 0110a19b (ceph): stop.sh: indent 4 spaces universally
Currently there is a mix between tabs and 4 spaces indent. Switch to
4 spaces indent.
Signed-off-by: Ilya Dryomov <...
Ilya Dryomov
01:55 PM Bug #7976 (Duplicate): 4.8 missing primary copy of ..., unfound (dumpling)
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-04-02_22:35:24-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
01:38 PM rbd Bug #7973 (Resolved): rbd ls returns error code of 1 on empty pool
This is the same as #6693. Backported the fix to the dumpling branch in commit:c66b61f9dcad217429e4876d27881d9fb2e7666f Josh Durgin
11:55 AM rbd Bug #7973 (Resolved): rbd ls returns error code of 1 on empty pool
Using version 0.67.7 rbd ls is returning an error code of 1 on an empty pool. This is causing OpenStack Havana proble... JuanJose Galvez
01:28 PM Bug #7975 (Resolved): osd: handle inconsistent stats in the osd post split
Samuel Just
01:01 PM Bug #7967: finish_promote needs to handle the omap flag
Samuel Just
12:07 PM Revision e4a8535a (ceph): vstart: set a sensible default for ruleset-failure-domain
Set ruleset-failure-domain=osd so that
./ceph osd pool create ecpool 12 12 erasure
./rados --pool ecpool put SOM...
Loïc Dachary
10:24 AM Fix #7919 (Fix Under Review): mon: prevent clients with a read cap from reading the full keyring
pull request 1597 https://github.com/ceph/ceph/pull/1597 Joao Eduardo Luis
06:53 AM CephFS Bug #7958: ceph-fuse+fsx umount hang on leaked inode reference
Zheng Yan
05:32 AM CephFS Bug #7958: ceph-fuse+fsx umount hang on leaked inode reference
I guess it was introduced by commit f1c7b4ef0 (client: pin Inode during readahead). Readahead raced with truncate. Objec... Zheng Yan
12:43 AM Bug #7968 (Won't Fix): ImportError occurred when run command 'ceph -v'
I. How to reproduce:
1. Clone the latest ceph master code from github
2. From my laptop, install the pre-require...
HouMing Wang
12:13 AM Revision 552f9d8d (ceph): multimds: test ceph-fuse and kclient
Add client as a variable on the basic collection. Move the other stuff
(libcephfs plus a weird restart test) to libc...
Sage Weil
12:03 AM Revision 89f38c09 (ceph): Merge pull request #1592 from ceph/wip-7965
lockdep: fix when instantiated multiple times (bug 7965)
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin

04/02/2014

11:46 PM Revision c43822cd (ceph): lockdep: reset state on shutdown
If we shut down, clear out all of the lockdep state. This ensures that if
we start up again on another cct, we will ...
Sage Weil
11:46 PM Revision 7a49f3da (ceph): lockdep: do not initialize if already started
If we have already registered a cct for lockdep, do not accept another one.
We already check that the cct matches whe...
Sage Weil
11:13 PM Revision eae5a377 (ceph): Merge pull request #1591 from ceph/wip-7915
mon: bump snap_epoch when adding a tier (fixes 7915)
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
11:03 PM Revision 6bf46e23 (ceph): OSDMap: bump snap_epoch when adding a tier
When we make an existing pool a tier, we start copying the snap metadata
from the base tier. That includes removed_s...
Sage Weil
10:15 PM Revision 27e353cc (ceph): Merge pull request #1580 from ceph/wip-7937
osd: fix scrub logic for snapdir object
Reviewed-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage...
Samuel Just
10:12 PM Revision 01445d5c (ceph): ReplicatedPG::_scrub: don't bail early for snapdir
Fixes: #7937
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:10 PM Revision 0815140f (ceph): test: use older names for module setup/teardown
setUp and tearDown require nosetests 0.11, but 0.10.4 is the latest on
centos. Rename to use the older aliases, which...
Josh Durgin
10:00 PM CephFS Bug #7958: ceph-fuse+fsx umount hang on leaked inode reference
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-01_23:00:33-fs-firefly-distro-basic-plana/160589 Sage Weil
10:23 AM CephFS Bug #7958 (Resolved): ceph-fuse+fsx umount hang on leaked inode reference
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-03-31_23:00:36-fs-master-testing-basic-plana/157636
...
Sage Weil
09:56 PM Bug #7922: osd: multi-backfill reservation does not release on reject
Logs are on flab:/more2/t/7922
On osd.9, we see:...
Sage Weil
03:21 PM Bug #7922 (In Progress): osd: multi-backfill reservation does not release on reject
Sage Weil
01:19 PM Bug #7922: osd: multi-backfill reservation does not release on reject
All highly-verbose logfiles have been uploaded at Sage's request (4.3GiB): http://www.aarontc.com/logs/ceph-logs-chek... Aaron T
09:22 PM Revision 5f680f90 (ceph): ReplicatedPG::_verify_no_head_clones: missing implies that the clone ex...
Fixes: #7659
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:57 PM Revision b405bfa4 (ceph): OSD: don't clear peering_wait_for_split in advance_map()
I really don't know why I added this... Ops can be discarded from the
waiting_for_pg queue if we aren't primary simp...
Samuel Just
08:54 PM Revision 60b7aa96 (ceph): test: use older names for module setup/teardown
setUp and tearDown require nosetests 0.11, but 0.10.4 is the latest on
centos. Rename to use the older aliases, which...
Josh Durgin
08:07 PM Bug #7965 (Resolved): osd: SEGV in handle_recovery_read_complete
Ian Colle
04:47 PM Bug #7965: osd: SEGV in handle_recovery_read_complete
Reliably reproduced with lockdep enabled with ceph_test_rados_api_tier. Appears to be due to multiple initialization... Sage Weil
04:46 PM Bug #7965 (Fix Under Review): osd: SEGV in handle_recovery_read_complete
Sage Weil
04:08 PM Bug #7965 (Resolved): osd: SEGV in handle_recovery_read_complete
... Sage Weil
07:57 PM Revision a498c940 (ceph): Merge remote-tracking branch 'gh/wip-7888-dumpling' into dumpling
Sage Weil
07:34 PM Bug #4185: Python multiprocessing exhibiting odd behaviour with librados
Some notes for the next person who comes across this issue:
You can use the multiprocessing managers to push all the...
Evan Felix
06:29 PM Revision 7909262f (ceph): debian: fix control to allow upgrades
Signed-off-by: Mohammad Salehe <salehe+dev@gmail.com> Mohammad Salehe
05:52 PM Revision 250a1029 (ceph): Merge pull request #1590 from ceph/wip-7939
PG: set role for replicated even if role != shard
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
05:44 PM Revision b6c70534 (ceph): Merge pull request #1589 from ceph/wip-7805-emperor
PG: only complete replicas should count toward min_size
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:35 PM Revision d6258b63 (ceph): Merge pull request #1579 from ceph/wip-7907
osd/ReplicatedPG: mark_unrollbackable when _rollback_to head
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
05:29 PM Revision 17732dc0 (ceph): debian: move rbdmap config and sysvinit/upstart scripts into ceph-common
Fixes: #7171
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:28 PM Bug #7967 (Resolved): finish_promote needs to handle the omap flag
Samuel Just
05:22 PM Bug #7689 (Duplicate): librados: ENOENT on ioctx create
Sage Weil
05:21 PM Bug #7689: librados: ENOENT on ioctx create
Oh, I think this is a dup of #7736 Sage Weil
05:19 PM Bug #6429 (Can't reproduce): msg/Pipe.cc: 1029: FAILED assert(m)
Sage Weil
05:17 PM Bug #7776 (Resolved): client lockdep crash
Sage Weil
04:28 PM Bug #7776 (In Progress): client lockdep crash
Sage Weil
03:33 PM Bug #7776: client lockdep crash
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-01_02:30:19-rados-firefly-distro-basic-plana/158403... Sage Weil
03:32 PM Bug #7776: client lockdep crash
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-04-01_02:30:19-rados-firefly-distro-basic-plana/158409... Sage Weil
04:13 PM Bug #7915 (Resolved): ./include/interval_set.h: 385: FAILED assert(_size >= 0)
Samuel Just
04:04 PM Bug #7915 (Fix Under Review): ./include/interval_set.h: 385: FAILED assert(_size >= 0)
Sage Weil
11:55 AM Bug #7915 (Need More Info): ./include/interval_set.h: 385: FAILED assert(_size >= 0)
We've added more debug output to master/firefly Sage Weil
04:12 PM CephFS Bug #7966 (Resolved): ceph-mds respawn doesn't always work
... Sage Weil
03:55 PM rgw Bug #7450: "radosgw-admin key create" ignores specified access key when subuser specified
@Yehuda: That code change looks good. I'll try to test by monday and get back to you with confirmation of it working. Robin Johnson
07:31 AM rgw Bug #7450: "radosgw-admin key create" ignores specified access key when subuser specified
I pushed some work into wip-7450. It takes care of the default access key generation in the case of subusers, and som... Yehuda Sadeh
03:54 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Compiled TestErasureCodeJerasure.cc with m=1 https://github.com/ceph/ceph/blob/master/src/test/erasure-code/TestErasu... Loïc Dachary
02:23 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
... Loïc Dachary
10:57 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
mira087 has the first 2 failures, mira089 has the earlier one. They all seem to crash in the same place:... Sage Weil
10:19 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
... Loïc Dachary
09:38 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Ran test for both gf-complete and jerasure and they are valgrind clean ( https://bitbucket.org/jimplank/jerasure/pull... Loïc Dachary
09:34 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
/a/teuthology-2014-03-31_02:30:03-rados-master-testing-basic-plana/155875/teuthology.log is the only ec_pool test tha... Loïc Dachary
09:10 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
also happened on ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154379 on commi... Sage Weil
03:44 PM Revision 86a032f2 (ceph): Merge pull request #1586 from ceph/wip-dirfrag
mds: fix check for merging/splitting dirfrag
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:43 PM Revision 84e62e9f (ceph): Merge pull request #1587 from onlyjob/debian
init.d: correcting rbdmap LSB header / init order:
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:31 PM Bug #7964 (Resolved): ceph_test_rados with snaps and caching stat errors
Samuel Just
03:21 PM Bug #6681 (Resolved): osd recovery hung
Backported to dumpling Sage Weil
01:52 PM Bug #6681 (Pending Backport): osd recovery hung
/var/lib/teuthworker/archive/teuthology-2014-04-01_19:00:30-rados-dumpling-testing-basic-plana/159217 Samuel Just
03:15 PM Bug #7576 (In Progress): osd: large skew in pg epochs (dumpling)
wip-7576 Sage Weil
03:07 PM Bug #7917 (Resolved): "ERROR: test_rbd."* in upgrade:dumpling-x:parallel-firefly-distro-basic-vps
Just needed to cherry-pick the fix for #6368 to dumpling. Josh Durgin
02:25 PM Revision 1d42de54 (ceph): init.d: correcting rbdmap init order:
* Require "$remote_fs" since it guarantees /usr availability
(rbd executable is in /usr/bin/rbd)
* Speed-up init....
Dmitry Smirnov
01:38 PM RADOS Feature #7962 (New): allow the user to query inconsistent object status and specify repair strategy
Samuel Just
12:56 PM Messengers Bug #7888 (Resolved): msgr: keepalive is insufficient
Sage Weil
10:56 AM devops Feature #7960 (Resolved): backport rpm creation of /usr/lib64/qemu/librbd.so.1 symlink to dumpling
Backport #7293 to dumpling so it works seamlessly with rhev and rhel-osp as well. Josh Durgin
10:51 AM Bug #7939 (Resolved): pg role wrong for replicated pools
Sage Weil
10:42 AM Bug #7805 (Pending Backport): emperor can go active with < min_size non-incomplete peers since we...
Sage Weil
08:50 AM Bug #7805 (Fix Under Review): emperor can go active with < min_size non-incomplete peers since we...
Sage Weil
10:41 AM Bug #7907 (Resolved): osd: rollback to head didn't mark_unrollbackable
Sage Weil
09:45 AM Bug #7393: osd: scrub stat mismatch, got 9/9 objects, 0/0 clones, 9/4 dirty, 0/0 whiteouts, 26738...
... Anonymous
09:36 AM Bug #7949 (Duplicate): "s3tests.functional.test_s3"* errors in upgrade:dumpling-x:parallel-firefl...
Dup of 7935 Ian Colle
09:33 AM Bug #7951 (Duplicate): "test_rbd."* tests failed in upgrade:dumpling-x:parallel-firefly-distro-ba...
Ian Colle
09:07 AM Bug #7957 (Resolved): "[ERR] scrub mismatch" in upgrade:dumpling-emperor-x:parallel-firefly-testi...
Logs are in http://qa-proxy.ceph.com/teuthology/wusui-2014-04-02_01:54:27-upgrade:dumpling-emperor-x:parallel-firefly... Yuri Weinstein
07:53 AM RADOS Feature #7956 (New): osd: implement posix_fadvise/POSIX_FADV_DONTNEED to prevent data caching
Running Ceph OSDs on commodity hardware often means that servers are not used exclusively for OSDs and may have other... Dmitry Smirnov
07:32 AM Revision 771e88a4 (ceph): mds: fix check for merging/splitting dirfrag
check actual number of items instead of number of cached items
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
07:24 AM Bug #7931 (Can't reproduce): setcrushmap crashing monitor
Greg Farnum
06:36 AM Bug #7931: setcrushmap crashing monitor
Hi Greg,
I don't think that was the issue; however, it has been working since. The crushmap I've uploaded was the r...
Luis Periquito
04:47 AM Revision edb8a596 (ceph): Merge pull request #1583 from ceph/wip-largedir
Wip largedir
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:19 AM Revision 5a9b99aa (ceph): mds: initialize bloom filter according to dirfrag size
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
04:19 AM Revision 43bc39be (ceph): mds: ignore CDir::check_rstats() when debug_scatterstat is off
It uses lots of CPU when dirfrag is large
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
04:19 AM Revision 16af25fb (ceph): mds: add dentries in dirfrag to LRU in reverse order
Files in a dirfrag are usually processed in the order of readdir
results. Files at the beginning of are more likely t...
Yan, Zheng
04:01 AM Revision d351e5fb (ceph): Merge pull request #1584 from ceph/wip-multimds
Wip multimds
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:03 AM Revision 06ecb2c7 (ceph): mds: handle freeze authpin race
For a cross-authority rename, the MDS first freezes the source inode's
authpin. It happens while the source dentry isn...
Yan, Zheng
03:03 AM Revision b65a8184 (ceph): mds: don't start new segment while finishing disambiguate imports
This avoids inserting ESubtreeMap among EImportFinish events that
finish disambiguate imports. Because the ESubtreeMap...
Yan, Zheng
03:03 AM Revision d1967f32 (ceph): mds: treat cluster as degraded when there is clientreplay MDS
This forbids exporting subtrees and fragmenting dirfrags when there
is an MDS in clientreplay state. During replaying cl...
Yan, Zheng
03:03 AM Revision ff44a99a (ceph): mds: trim non-auth subtree more aggressively
When a non-auth dirfrag is pinned by uncommitted slave update,
there still can be non-auth child dirfrags that are tr...
Yan, Zheng

04/01/2014

11:02 PM Revision c09f58ef (ceph): radosgw-admin: don't always generate access key for subusers
Only generate key for subuser if one was not specified for it, and only
when creating a subuser.
Signed-off-by: Yehu...
Yehuda Sadeh
11:01 PM Revision 0d5d3d1a (ceph): PG: only complete replicas should count toward min_size
Backport: emperor,dumpling,cuttlefish
Fixes: #7805
Signed-off-by: Samuel Just <sam.just@inktank.com>
Signed-off-by: S...
Sage Weil
11:01 PM Revision 9ef5e4ab (ceph): Revert "PG: only complete replicas should count toward min_size"
This reverts commit b097a237e11b48c47d3fd5484f3449e683e95db0.
This causes us to crash, probably because whoami is no...
Sage Weil
10:43 PM Revision e1783a62 (ceph): rgw: subuser creation can be idempotent
No real reason to check for existing subuser before (re)creating it.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
10:40 PM Revision f54c6846 (ceph): radosgw-admin: fix subuser modify
Shouldn't check for empty access key when modifying swift user, also
set correct fields when updating swift keys.
Si...
Yehuda Sadeh
09:46 PM Bug #7952: After aio_read() completes a call to return_value() doesn't return bytes read
This might be the issue underlying #7822 Dan Mick
06:02 PM Bug #7952 (Resolved): After aio_read() completes a call to return_value() doesn't return bytes read

When using the rados_aio_read() interface the c.bl bufferlist stores the read data. So the code below will set rva...
David Zafman
09:42 PM Revision e095b1d4 (ceph): debian: make ceph-common own etc/ceph, var/log/ceph
Clients can make use of these directories, and ceph-common is required by
ceph, so nothing should break here.
Change...
Sage Weil
09:38 PM Bug #7922: osd: multi-backfill reservation does not release on reject
Here's a partial log from osd.3 around the problematic time: http://www.aarontc.com/logs/ceph-osd.3.log.bz2
Please...
Aaron T
09:22 PM Bug #7922 (Need More Info): osd: multi-backfill reservation does not release on reject
It would help to see the log from the primary sending the dup backfill request. It is not entirely trivial to determ... Sage Weil
09:18 PM Bug #7922: osd: multi-backfill reservation does not release on reject
We got a duplicate backfill reservation request from osd.3:... Sage Weil
09:27 PM Revision d4d39a01 (ceph): osd/ReplicatedPG: mark_unrollbackable when _rollback_to head
We fell into the case in _rollback_to where we just set ctx->modify = true
and don't explicitly mark the ctx and unro...
Sage Weil
08:58 PM Revision f0978a82 (ceph): Merge pull request #1578 from ceph/wip-7437
wip-7437
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:44 PM Linux kernel client Bug #7954 (Resolved): misdirected op
Test Run: teuthology-2014-03-31_23:02:09-kcephfs-master-testing-basic-plana
======================================...
Sage Weil
08:07 PM Revision 1acd547a (ceph): debian: move ceph-rest-api from ceph-common to ceph
ceph-rest-api is a 'server' thing that runs persistently; 'ceph' is a
better match.
Signed-off-by: Sage Weil <sage@i...
Sage Weil
07:38 PM Revision a926cd38 (ceph): Merge pull request #1577 from ceph/wip-7935
rgw: reverse logic to identify next part
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:19 PM Revision a76a129f (ceph): PG: set role for replicated even if role != shard
Fixes: #7939
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
07:02 PM Revision 26eeab43 (ceph): PGLog::rewind_divergent_log: log may not contain newhead
Due to split, there may be a hole at newhead.
Fixes: #6722
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewe...
Samuel Just
07:02 PM Revision 225fc97f (ceph): PG: fix operator<<,log_wierdness log bound warning
Split may cause holes such that head != tail and yet
log.empty().
Fixes: #6722
Signed-off-by: Samuel Just <sam.just@...
Samuel Just
06:59 PM rbd Bug #6480: librbd crashed qemu-system-x86_64
I found another source of race conditions, and hopefully fixed them in the wip-6480-0.67.7 branch. I'm running tests ... Josh Durgin
06:53 PM Revision 759878c1 (ceph): test: Add EC testing to ceph_test_rados_api_tier
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision 1871fe79 (ceph): test: Add EC testing to ceph_test_rados_api_lock
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision ef94fcd7 (ceph): test: Add EC testing to ceph_test_rados_api_snapshots
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision 2d7da1d5 (ceph): librados: Add missing rados_wait_for_latest_osdmap()
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision c106976f (ceph): librados: Add missing C variants for alignment
Add rados_ioctx_pool_requires_alignment()
Add rados_ioctx_pool_required_alignment()
Signed-off-by: David Zafman <dav...
David Zafman
06:53 PM Revision 74ed4489 (ceph): test: Cleanup pools when FlushWriteRaces is done
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision 966480b2 (ceph): test: Fix ceph_test_rados_api_aio to clean up all test pools
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision a4da24d3 (ceph): test: Fix ceph_test_rados_list_parallel to clean up test pool
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision 9bd5821a (ceph): test: Add EC test classes RadosTestEC and RadosTestECPP
Fixes: #7437
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
06:53 PM Revision ea26e13b (ceph): test: Add EC testing to ceph_test_rados_api_io
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision 7ccf4129 (ceph): test: Add EC testing to ceph_test_rados_api_list
Fixes: #7437
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
06:53 PM Revision 6cedfacd (ceph): test: Add EC testing to ceph_test_rados_api_stat
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:53 PM Revision 38b9fe7c (ceph): test: Add EC to ceph_test_rados_api_watch_notify
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:04 PM Revision dde1c916 (ceph): osd/ReplicatedPG: continue scrub logic when snapset.head_exists doesn't...
The 'continue' will cause more damage/noise than continuing because the
next_clone value won't be updated properly.
...
Sage Weil
06:04 PM Revision c2e5a428 (ceph): osd/ReplicatedPG: handle snapdir properly during scrub
Handle snapdir similarly to how head is treated when updating the
next_clone info. Also, add a warning when we have ...
Sage Weil
05:52 PM Revision ed5a5e07 (ceph): rgw: reverse logic to identify next part
Fixes: #7935
The check that identifies whether we need to move to the next part when
iterating over an object was rev...
Yehuda Sadeh
05:45 PM Revision 48fbce98 (ceph): ReplicatedPG: improve get_object_context debugging
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:40 PM Bug #7941: caching needs to be able to enforce snap context on flush even with pool snaps
Samuel Just
01:11 PM Bug #7941 (Resolved): caching needs to be able to enforce snap context on flush even with pool snaps
Samuel Just
04:40 PM Bug #7942: promote uses cloneid, but backend may have a different cloneid
Samuel Just
01:12 PM Bug #7942 (Resolved): promote uses cloneid, but backend may have a different cloneid
Samuel Just
04:20 PM Bug #7949: "s3tests.functional.test_s3"* errors in upgrade:dumpling-x:parallel-firefly-distro-bas...
A pretty good chance that this is #7935. Yehuda Sadeh
03:28 PM Bug #7949 (Duplicate): "s3tests.functional.test_s3"* errors in upgrade:dumpling-x:parallel-firefl...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-31_19:33:26-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
04:17 PM Bug #7950 (Duplicate): "FAIL: s3tests.functional.test_s3.test_multipart_upload_contents" in upgra...
Duplicate of #7935, failed due to a new test added. Yehuda Sadeh
03:46 PM Bug #7950: "FAIL: s3tests.functional.test_s3.test_multipart_upload_contents" in upgrade:dumpling-...
May be a duplicate of 7949 Yuri Weinstein
03:45 PM Bug #7950 (Duplicate): "FAIL: s3tests.functional.test_s3.test_multipart_upload_contents" in upgra...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-31_19:33:26-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
03:59 PM Bug #7805: emperor can go active with < min_size non-incomplete peers since we check acting size
wip-7805-emperor Sage Weil
03:59 PM Bug #7938 (Resolved): Coredump generated by upgrade test
This was a broken commit:b097a237e11b48c47d3fd5484f3449e683e95db0. Reverted. Sage Weil
11:23 AM Bug #7938 (Resolved): Coredump generated by upgrade test
The following yaml consistently generates coredumps.... Anonymous
03:55 PM Bug #7951 (Duplicate): "test_rbd."* tests failed in upgrade:dumpling-x:parallel-firefly-distro-ba...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-31_19:33:26-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
03:29 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
The only difference is that ext4 is used in teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154491 an... Loïc Dachary
03:15 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Running 30 more, sequentially. Loïc Dachary
03:09 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Contrary to what I thought (reading the wrong number), the plana machines do have SSE3 and the jerasure plugin including SSE4 ... Loïc Dachary
02:59 PM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Ran through 20 iterations of the above yaml and did not reproduce. Sage Weil
09:35 AM Bug #7914 (In Progress): osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer lengt...
I will run the workload many times, hoping to reproduce the crash. Loïc Dachary
08:34 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Another run with success, complete logs in attachment:run2.txt
Loïc Dachary
06:49 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
I don't see why not locking the erasure-code plugin registry could be a problem in practice but it is a problem in th... Loïc Dachary
06:26 AM Bug #7914 (Can't reproduce): osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer l...
The above teuthology workload ran on two plana machines and did not create a core dump. Loïc Dachary
06:05 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
... Loïc Dachary
05:34 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Trying to trigger the problem again with:... Loïc Dachary
05:16 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154491... Loïc Dachary
03:29 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Both errors show up with "thrashers/pggrow.yaml" (one with btrfs, the other with ext4) Loïc Dachary
02:34 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
the /a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154491 radosbench completed successfully
<pre...
Loïc Dachary
02:18 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
Trying to reproduce the problem from the source tree on the current master with... Loïc Dachary
01:49 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
cpuinfo on plana17 (the machine on which the core dump occurred) shows:... Loïc Dachary
01:43 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
*-msse3* was not used although it should have been, and this was fixed "March 30th":https://github.com/ceph/ceph/commit/1c9... Loïc Dachary
01:39 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
"latest jerasure / gf-complete":https://github.com/ceph/ceph/commit/1c92453f748aea48084e57c9c721ee8080caeeb6 submodul... Loïc Dachary
01:23 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
... Loïc Dachary
12:57 AM Bug #7914 (In Progress): osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer lengt...
Loïc Dachary
02:46 PM Bug #7937: [ERR] deep-scrub 5.ds0 79d5820d/burnupi0838757-23/1f7//5 expected clone
Sage Weil
11:05 AM Bug #7937 (Fix Under Review): [ERR] deep-scrub 5.ds0 79d5820d/burnupi0838757-23/1f7//5 expected c...
Sage Weil
10:48 AM Bug #7937: [ERR] deep-scrub 5.ds0 79d5820d/burnupi0838757-23/1f7//5 expected clone
038e94b51e4945380c4ba771c88c953b6628d0f7 (wip-sam-testing) Samuel Just
10:46 AM Bug #7937 (Resolved): [ERR] deep-scrub 5.ds0 79d5820d/burnupi0838757-23/1f7//5 expected clone
2014-03-31 17:00:27,792.792 INFO:teuthology.task.internal:Removing archive directory...
2014-03-31 17:00:27,840.840 ...
Samuel Just
02:46 PM rgw Bug #7935 (Resolved): rgw: multipart upload failure with s3cmd
commit:ed5a5e075544662a12d94e472da55aeb2f0efe5d Josh Durgin
12:31 PM rgw Bug #7935: rgw: multipart upload failure with s3cmd
This issue happens when not all parts are the same size. Extended the s3 functional test to check this. Yehuda Sadeh
10:55 AM rgw Bug #7935 (Fix Under Review): rgw: multipart upload failure with s3cmd
Yehuda Sadeh
10:05 AM rgw Bug #7935 (Resolved): rgw: multipart upload failure with s3cmd
From ceph-users mailing list:... Yehuda Sadeh
02:44 PM devops Feature #7171 (Fix Under Review): rbdmap should be part of ceph-common
https://github.com/ceph/ceph/pull/1581 Sage Weil
02:25 PM Bug #7907: osd: rollback to head didn't mark_unrollbackable
the original modify:... Sage Weil
02:14 PM devops Feature #7947 (Duplicate): Create separate ceph and ceph-common packages for EL6 and EL7 builds
No reason why rpm builds shouldn't be the same as for debs... Neil Levine
02:07 PM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
My guess is that this behavior kicked in when we started matching client caps with the expected caps on a per-command... Joao Eduardo Luis
01:27 PM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
That sounds like the best solution to me. Somebody on the mailing list reported that Dumpling is not exposing user da... Greg Farnum
01:25 PM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
My vote is to make 'auth' special and require * to access it (or an explicit grant of auth rw or something). This is ... Sage Weil
01:23 PM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
Here's the thing: obviously, allowing anyone with read permission to simply grab the whole keyring is not a good thin... Joao Eduardo Luis
01:10 PM Fix #7919 (In Progress): mon: prevent clients with a read cap from reading the full keyring
Okay, I was the one at fault here. I had not provided a keyring with the client's key, thus getting permission de... Joao Eduardo Luis
09:12 AM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
Hum, I didn't validate the issue so maybe it was user error. Should probably poke him on the mailing list to check. *... Greg Farnum
05:26 AM Fix #7919: mon: prevent clients with a read cap from reading the full keyring
Doesn't seem to happen on the latest emperor. Checking if anything changed between 0.72.2 and the latest emperor, or if I... Joao Eduardo Luis
01:26 PM Feature #7459 (Rejected): ceph-rest-api: sysvinit and upstart scripts
Sage Weil
01:06 PM Feature #7459 (Fix Under Review): ceph-rest-api: sysvinit and upstart scripts
Samuel Just
01:09 PM Feature #7940 (Resolved): add pool snaps to ceph_test_rados
Samuel Just
01:00 PM Bug #7939: pg role wrong for replicated pools
Samuel Just
12:16 PM Bug #7939 (Resolved): pg role wrong for replicated pools
Samuel Just
10:30 AM Bug #7892 (New): osd/ReplicatedPG.cc: 7881: FAILED assert((data_included.empty() && data.length()...
Sage Weil
10:23 AM Bug #7892 (Duplicate): osd/ReplicatedPG.cc: 7881: FAILED assert((data_included.empty() && data.le...
probably dups #7916 Sage Weil
10:29 AM Bug #7576: osd: large skew in pg epochs (dumpling)
We looked at this in standup today. There is a queue_null on every PG in OSD::consume_map(), so they should be gettin... Greg Farnum
10:24 AM Bug #7588 (Can't reproduce): OSD Seg fault in string assign ObjectOperation::C_ObjectOperation_co...
Sage Weil
10:23 AM Bug #7936 (Can't reproduce): "failed: rados" in upgrade:dumpling-x:parallel-firefly-distro-basic-...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-31_19:33:26-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
10:21 AM Bug #7916 (In Progress): ceph_test_rados got ENOENT on ec pool + thrashing
Sage Weil
09:43 AM Bug #7934 (Resolved): ceph_test_rados_watch_notify doesn't clean-up all pools it creates

$ ./rados lspools
data
metadata
rbd
$ ./ceph_test_rados_watch_notify
...
$ ./rados lspools
data
metadata
r...
David Zafman
09:41 AM Bug #7519 (Can't reproduce): upgrade: osd crash on cuttlefish -> v0.67.1 -> emperor
Sage Weil
09:35 AM devops Bug #7918: Mon hangs at start after upgrading to leveldb-1.12.0-3.fc18.x86_64 from the ceph-extra...
We've pulled those packages until we can figure what's causing this. Ian Colle
09:34 AM Bug #7926 (Resolved): "[ERR] scrub 45.0" in upgrade:dumpling-x:parallel-firefly-distro-basic-vps ...
commit:e672c52b4f8b945a516f2eec006e33665a08f045 Sage Weil
09:31 AM rbd Bug #6257 (Resolved): rbd: cp on sparse image allocates objects in dest
Sage Weil
09:21 AM rbd Feature #7921: Openstack: live migration for ephemeral volumes
Also need to ensure that code is backported to Icehouse-based distro (Ubuntu, RDO, RHEL-OSP) products. Neil Levine
09:19 AM rbd Feature #7921: Openstack: live migration for ephemeral volumes
Sage Weil
09:15 AM rbd Feature #7921: Openstack: live migration for ephemeral volumes
Neil Levine
09:21 AM rbd Feature #7920: Openstack: cloning for rbd ephemeral disks
Also need to ensure that code is backported to Icehouse-based distro (Ubuntu, RDO, RHEL-OSP) products. Neil Levine
09:18 AM rbd Feature #7920: Openstack: cloning for rbd ephemeral disks
Sage Weil
09:14 AM rbd Feature #7920: Openstack: cloning for rbd ephemeral disks
Neil Levine
09:15 AM rbd Feature #7924: Openstack: make long-running operations async in cinder
Neil Levine
09:15 AM rbd Feature #7923: Openstack: backup from in-use volume instead of from detached volume
Neil Levine
09:14 AM Bug #7931: setcrushmap crashing monitor
You should do this with "debug mon = 20" set, but it appears to be crashing because your crush map is somehow invalid... Greg Farnum
08:08 AM Bug #7931 (Can't reproduce): setcrushmap crashing monitor
Following the guides I've created a new crushmap. When I submit this new crushmap the monitor crashes with some infor... Luis Periquito
09:14 AM rbd Feature #7895: krbd: test cloning, discard, plus regular I/O via fsx
Neil Levine
09:14 AM Linux kernel client Feature #190: krbd: DISCARD support
Neil Levine
09:10 AM rbd Feature #7455 (Resolved): krbd,kcephfs: support primary-affinity
Sage Weil
09:05 AM rgw Feature #7932 (Resolved): Create design for object versioning, including subtasks and estimates
Ian Colle
04:12 AM Revision 2e997aa7 (ceph): Report job updated time
When reading job information from archive files, use the mtime of
teuthology.log as the job updated time
Signed-off-...
Zack Cerza
03:47 AM Revision 48c36dc3 (ceph): Merge pull request #21 from ceph/wip-thrash-affinity
krbd, kcephfs: thrash osd primary-affinity Sage Weil
03:46 AM Revision 06cab8b0 (ceph): Merge pull request #22 from ceph/wip-fuse-permission
multimds: fuse_default_permissions = 0 for kernel build test Sage Weil
02:09 AM Feature #7928 (Rejected): erasure-code : no SSE3 specific code
Is it worth compiling with *-msse3* and detecting it at runtime when no HAVE_SSE3 code is being compiled conditionall... Loïc Dachary
02:07 AM Revision 796af560 (ceph): Handle jobs with no targets
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
01:30 AM Revision 2e283ce6 (ceph): multimds: fuse_default_permissions = 0 for kernel build test
This can reduce the test time because it avoids sending a getattr
request whenever the kernel checks inode permissions.
...
Yan, Zheng
01:15 AM Revision 87b08b23 (ceph): Merge pull request #1575 from jdurgin/wip-librbd-cp-sparse
librbd: skip zeroes when copying an image
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
12:39 AM Revision 5c9b8a27 (ceph): osd/PG: debug cached_removed_snaps changes
See #7915.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

03/31/2014

10:46 PM Revision addfed2d (ceph): When ignoring a raised exception, at least log it
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
10:19 PM Revision a4dc6f2b (ceph): Merge pull request #1574 from mikenel/master
Add ceph-client-debug and jerasure shared objects to RPM spec file.
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
10:04 PM Revision 824da202 (ceph): librbd: skip zeroes when copying an image
This is the simple coarse-grained solution, but it works well in
common cases like a small base image resized with a ...
Josh Durgin
09:59 PM Revision 70d2e135 (ceph): Merge pull request #1573 from ceph/wip-7912
mon/PGMap: clear pool sum when last pg is deleted
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
09:57 PM Revision 51326991 (ceph): Merge pull request #1556 from ceph/wip-7888
msgr: add new ping/ping reply to use in place of keepalive
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
09:56 PM Revision a8eb9b5f (ceph): Merge pull request #1566 from ceph/wip-fuse-access
fuse: implement 'access' low level function
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:44 PM devops Bug #7879 (Resolved): sentry-db is down
Migration complete! Zack Cerza
09:38 PM Bug #7927 (Duplicate): Removed pools still show up in "ceph pg dump" output
Duplicate of #7912
This should be fixed as of commit 70d2e1353ecb9d31a394fdac333dbb0de93339d3.
Greg Farnum
09:32 PM Bug #7927 (Duplicate): Removed pools still show up in "ceph pg dump" output

Creating new pools 3 and 4 then removing them leaves them behind in ceph pg dump output. Minutes later it is still...
David Zafman
09:12 PM Revision e44f85ce (ceph): qa/workunits/cephtool/test.sh: test 'osd pg-temp ...'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:12 PM Revision 2d4ec6a1 (ceph): mon/OSDMonitor: clear primary_temp on osd pg_temp updates
Until the OSD and the MOSDPGTemp messages encode primary_temp updates,
assume that any pg_temp update will clear prim...
Sage Weil
09:12 PM Revision 255f8404 (ceph): OSDMonitor: add 'mon osd allow primary temp' bool option
By default, we don't send out maps with primary_temp mappings because
there is no infrastructure in place that would ...
Ilya Dryomov
09:12 PM Revision 63ac0793 (ceph): OSDMonitor: add 'osd primary-temp ...' command
ceph osd primary-temp <pgid> [<osd>]
Examples:
ceph osd primary-temp 0.2 4 # set primary_temp mapping for 0.2 to os...
Ilya Dryomov
09:11 PM Revision 07dcffa1 (ceph): OSDMonitor: add 'osd pg-temp ...' command
ceph osd pg-temp <pgid> [<osd1> [<osd2> ...]]
Examples:
ceph osd pg-temp 0.2 0 1 2 # set pg_temp mapping for 0.2 to...
Ilya Dryomov
09:05 PM Revision d3183e31 (ceph): java/test: ceph.file.layout xattr is still not there now
b8ea65694faf59f12f285a65dc21753dab20ba11 tried to fix this, but
missed a spot.
Signed-off-by: Greg Farnum <greg@inkt...
Greg Farnum
07:40 PM Revision 880e6ea3 (ceph): Merge pull request #1567 from ceph/wip-7849
ceph-conf: don't create log files
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:36 PM Revision d474332e (ceph): Merge pull request #1570 from dachary/wip-gitignore
.gitignore: add examples/librados files
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:35 PM Revision cd1a9c49 (ceph): Add ceph-client-debug and jerasure shared objects to RPM spec file.
Michael Nelson
05:42 PM Revision 81853c61 (ceph): mon/PGMap: clear pool sum when last pg is deleted
Use the x.0 pg as a sentinel for the existence of the pool. Note that we
have to clean it up in two paths: apply_incrme...
Sage Weil
05:18 PM Revision b3ffce87 (ceph): Merge pull request #1572 from ceph/wip-ec-profile-idempotent
mon: make 'ceph osd erasure-code-profile set ...' idempotent
Reviewed-by: Joao Eduardo Luis <joao.luis@inktank.com>
Sage Weil
05:01 PM Revision 8c761c47 (ceph): mon: make 'ceph osd erasure-code-profile set ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:54 PM Revision d258fcce (ceph): osd: do not send peering messages during init
Do not send any peering messages while we are still working our way
through init().
Fixes: #7093
Signed-off-by: Sage...
Sage Weil
04:34 PM Revision a536fd13 (ceph): krbd, kcephfs: thrash osd primary-affinity
libceph.ko support is in testing and should be in 3.15. This commit
effectively reverts commit 66ffaa65657b ("kcephf...
Ilya Dryomov
04:16 PM rbd Bug #6257 (Fix Under Review): rbd: cp on sparse image allocates objects in dest
Flatten ignores empty objects since commit:bfa106694dc4db97f58c623eafc3c2d0f9a8bff1, which is in dumpling and emperor... Josh Durgin
04:14 PM Revision 7d321d8a (ceph): qa/workunits/rados/test_alloc_hint: fix erasure syntax
This changed recently.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
03:55 PM Bug #7917: "ERROR: test_rbd."* in upgrade:dumpling-x:parallel-firefly-distro-basic-vps
Looks like the same issue on
os_type: rhel
os_version: '6.5'
Logs are in http://qa-proxy.ceph.com/teuthology/teuth...
Yuri Weinstein
09:15 AM Bug #7917 (Resolved): "ERROR: test_rbd."* in upgrade:dumpling-x:parallel-firefly-distro-basic-vps
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-29_19:33:24-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
03:19 PM Bug #7926 (Resolved): "[ERR] scrub 45.0" in upgrade:dumpling-x:parallel-firefly-distro-basic-vps ...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-29_19:33:24-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
02:59 PM Bug #7912 (Resolved): Wrong pool count when using "ceph -s" and "ceph -w"
Sage Weil
05:36 AM Bug #7912 (Resolved): Wrong pool count when using "ceph -s" and "ceph -w"
Hi,
running ceph version 0.72.2 (a913ded2ff138aefb8cb84d347d72164099cfd60)
I created a bunch of pools and delet...
Volker Voigt
02:58 PM devops Feature #7925 (Rejected): Feature: create new download.ceph.com site
See https://docs.google.com/a/inktank.com/document/d/1K8pUEZpN5-t1wd0t81MXgooB1Xz_Cgi4jkvo25dysiU/edit Neil Levine
02:57 PM Revision 4efb57d3 (ceph): Provide real error messages for unfound (sub)tasks
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
02:57 PM Messengers Bug #7888 (Pending Backport): msgr: keepalive is insufficient
Sage Weil
02:27 PM Revision 995e055b (ceph): Merge pull request #1571 from kotnik/docufix
Small glossary typo fix
Reviewed-by: Loic Dachary <loic@dachary.org>
Loïc Dachary
02:24 PM Revision 2826fdad (ceph): doc: fix typos in glossary
Nikola Kotur
02:21 PM Bug #7922: osd: multi-backfill reservation does not release on reject
After the additional logging directives were added and osd.0 was restarted, it quickly crashed again. Here is the log... Aaron T
01:55 PM Bug #7922 (Resolved): osd: multi-backfill reservation does not release on reject
Several OSDs are crashing quite frequently with this error, at least on version 0.77 and 0.78 (possibly earlier as we... Aaron T
02:15 PM rbd Bug #7577 (Resolved): rbd info displays extra random char in block prefix
Sage Weil
02:03 PM Bug #7915: ./include/interval_set.h: 385: FAILED assert(_size >= 0)
Looking at the first failure:
- PGPool::cached_removed_snaps is empty
- info.purged_snaps = 1~1
hence the fail...
Sage Weil
09:10 AM Bug #7915: ./include/interval_set.h: 385: FAILED assert(_size >= 0)
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154663 Sage Weil
09:07 AM Bug #7915: ./include/interval_set.h: 385: FAILED assert(_size >= 0)
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154509 Sage Weil
09:06 AM Bug #7915 (Duplicate): ./include/interval_set.h: 385: FAILED assert(_size >= 0)
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154635... Sage Weil
01:56 PM rbd Feature #7924 (Closed): Openstack: make long-running operations async in cinder
- deleting an image takes a long time, and blocks cinder-volume
from doing anything else during that time
...
Neil Levine
01:55 PM rbd Feature #7923 (Resolved): Openstack: backup from in-use volume instead of from detached volume
Currently, you can take a backup of a detached volume and send to a Swift/RBD backend.
We want to be able to take ba...
Neil Levine
01:55 PM Bug #7804 (Duplicate): backfill racing with a hitset object remove
Samuel Just
01:55 PM Bug #7893 (Duplicate): osd/ReplicatedPG.cc: 10190: FAILED assert(0 == "erroneously present object")
Samuel Just
09:16 AM Bug #7893: osd/ReplicatedPG.cc: 10190: FAILED assert(0 == "erroneously present object")
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154529 Sage Weil
01:55 PM Bug #7894 (Duplicate): osd: missing hitset object in cluster log
Samuel Just
01:51 PM rbd Feature #7921 (Resolved): Openstack: live migration for ephemeral volumes
- live migration for rbd ephemeral disks
- live migration can easily be truly live with shared storage
- easier mai...
Neil Levine
01:50 PM rbd Feature #7920 (Resolved): Openstack: cloning for rbd ephemeral disks
As per https://blueprints.launchpad.net/nova/+spec/rbd-clone-image-handler Neil Levine
12:42 PM Bug #7849 (Resolved): ceph-conf create empty log files
commit:fc1a424e837bee139726eec333c9efd65e2abb6a Josh Durgin
09:39 AM Bug #7849: ceph-conf create empty log files
Josh - please review the wip branch Ian Colle
11:58 AM Linux kernel client Feature #3837: krbd: support format 2 striping
Neil Levine
10:54 AM Bug #7875: osd: pg_pool_t hitset fields incompat
If you have a mix of OSDs tracking hitsets in the cluster, your data tracking isn't going to make any sense...what di... Greg Farnum
10:39 AM Fix #7919 (Resolved): mon: prevent clients with a read cap from reading the full keyring
From the mailing list thread "[ceph-users] Security Hole?"... Greg Farnum
09:45 AM Bug #7626 (Closed): After updating ceph from 0.75 to 0.77 one of the three monitors can't start
as far as I can tell, Sage is right. Nothing else seems off. Closing the ticket. Joao Eduardo Luis
02:11 AM Bug #7626: After updating ceph from 0.75 to 0.77 one of the three monitors can't start
Jasper Siero wrote:
> I updated all nodes to -0.88- 0.78 and removed the monitor and created a new one ;-)
Jasper Siero
02:10 AM Bug #7626: After updating ceph from 0.75 to 0.77 one of the three monitors can't start
I updated all nodes to 0.88 and removed the monitor and created a new one Jasper Siero
09:36 AM devops Bug #7918 (Won't Fix): Mon hangs at start after upgrading to leveldb-1.12.0-3.fc18.x86_64 from th...
I had a working 0.72.2 installation using the standard Fedora 18 RPMs.
I upgraded leveldb from 1.7.0-4.fc18 to 1.1...
Jens Kristian Søgaard
09:33 AM Bug #7908 (Resolved): "osd.3 ... [ERR] scrub" in upgrade:dumpling-x:parallel-firefly-distro-basic...
Sage Weil
09:30 AM Revision 7fa025e3 (ceph): .gitignore: add examples/librados files
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
09:09 AM Bug #7914: osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length from 4096 to...
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154491 Sage Weil
09:03 AM Bug #7914 (Resolved): osd: SEGV on ec write, ErasureCodeJerasure: encode adjusted buffer length f...
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154379... Sage Weil
09:09 AM Bug #7916 (Can't reproduce): ceph_test_rados got ENOENT on ec pool + thrashing
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154424 Sage Weil
08:58 AM Bug #7776: client lockdep crash
ubuntu@teuthology:/a/teuthology-2014-03-30_02:30:11-rados-master-testing-basic-plana/154676 Sage Weil

03/30/2014

04:20 PM Revision f89e992f (ceph): Merge pull request #1564 from dachary/wip-erasure-code-doc
doc: updates to the erasure code development docs
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:17 PM Revision f60ce297 (ceph): Merge pull request #1568 from dachary/wip-jerasure-warnings
erasure-code: update jerasure / gf-complete submodules
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:16 PM Revision ffcbeb46 (ceph): Merge pull request #1569 from dachary/wip-ssse3
autotools: s/ssse3/sse3/ typo
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
03:57 PM Revision 14418a29 (ceph): autotools: s/ssse3/sse3/ typo
Reported-by: Justin Erenkrantz <justin@erenkrantz.com>
Signed-off-by: Loic Dachary <loic@dachary.org>
Loïc Dachary
11:33 AM Revision 1c92453f (ceph): erasure-code: update jerasure / gf-complete submodules
For compilation warning patches.
http://tracker.ceph.com/issues/7909 Fixes #7909
Signed-off-by: Loic Dachary <loic@...
Loïc Dachary
09:17 AM Bug #7909 (Resolved): warnings during gf-complete/jerasure build
Sage Weil
04:33 AM Bug #7909: warnings during gf-complete/jerasure build
"work in progress":https://github.com/ceph/ceph/pull/1568 cherry pick of the above pending pull requests Loïc Dachary
01:25 AM Bug #7909 (In Progress): warnings during gf-complete/jerasure build
* https://bitbucket.org/jimplank/gf-complete/pull-request/12/fix-void-arithmetic-compilation-warning
* https://bitbu...
Loïc Dachary
08:31 AM Revision d5973aaf (ceph): doc: update jerasure plugin
* The parameters come from the erasure code profile
* Add a note about the upstream submodules gf-complete / jerasure...
Loïc Dachary
08:31 AM Revision ff51d3ff (ceph): doc: pgbackend dev doc outdated notice
* Warn the reader that the implementation is ahead and may differ
* Update the links to the Firefly branch
* Remove l...
Loïc Dachary
08:31 AM Revision 402e1047 (ceph): doc: erasure code developer notes updates
Update the introduction to explain erasure code profiles. Remove
obsolete explanations about partial writes etc. Remo...
Loïc Dachary
05:40 AM Revision 74fa3a78 (ceph): fuse: implement 'access' low level function
Add an empty 'access' function to fuse low level functions. This
allows us to use ceph-fuse with fuse_default_permissi...
Yan, Zheng
05:28 AM Revision e672c52b (ceph): osd/ReplicatedPG: fix cache tier scrub again
This condition was flipped from commit eb71924ea27e78d97bd45674ef5e6a7f
and the test case in c3292e48483d861148322590...
Sage Weil
05:27 AM Revision ea47b4a8 (ceph): ceph_test_rados_api_tier: improve promote+scrub test
We need to have multiple clones with some different patterns of
missing-ness.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
05:00 AM Revision 98551fc7 (ceph): global: separate first half of global_init into global_pre_init
The pre_init now captures enough to create the g_ceph_context and parse
and initialize the in-memory config. However...
Sage Weil
05:00 AM Revision fc1a424e (ceph): ceph-conf: use global_pre_init to avoid starting logging
This avoids starting up logging, which is not appropriate when we are
examining the config state and not actually sta...
Sage Weil
04:27 AM Revision 5019a151 (ceph): Merge pull request #1520 from ceph/wip-multimds
Wip multimds
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
12:28 AM Revision 90e1e719 (ceph): Merge pull request #1563 from dachary/wip-sse-i386
erasure-code: do not attempt to compile SSE4 on i386
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil

03/29/2014

11:58 PM Revision c3292e48 (ceph): ceph_test_rados_api_tier: improve cache tier + scrub test
Create lots of objects and make *some* of them be missing clones but not
all.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
11:57 PM Revision cfd6f23a (ceph): osd/ReplicatedPG: tolerate trailing missing clones on cache tiers
I missed this case in eb71924ea27e78d97bd45674ef5e6a7fce30932f.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:52 PM Bug #7849 (Fix Under Review): ceph-conf create empty log files
Sage Weil
09:39 PM Bug #7909 (Resolved): warnings during gf-complete/jerasure build
http://gitbuilder.sepia.ceph.com/gitbuilder-ceph-tarball-saucy-amd64-basic/log.cgi?log=c3292e48483d861148322590ea1f05... Sage Weil
09:27 PM Revision 040abd75 (ceph): qa/workunits/fs/misc/layout_vxattrs: ceph.file.layout is not listed
As of 08a3d6bd428c5e78dd4a10e6ee97540f66f9729c. A similar change was made
in the kernel.
Signed-off-by: Sage Weil <...
Sage Weil
09:25 PM Revision b8ea6569 (ceph): java/test: ceph.file.layout xattr is not there now
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:23 PM Revision 4f9f7f87 (ceph): qa/workunits/fs/misc/layout_vxattrs: ceph.file.layout is not listed
As of 08a3d6bd428c5e78dd4a10e6ee97540f66f9729c. A similar change was made
in the kernel.
Signed-off-by: Sage Weil <...
Sage Weil
09:12 PM Bug #7908 (Resolved): "osd.3 ... [ERR] scrub" in upgrade:dumpling-x:parallel-firefly-distro-basic...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-29_19:33:24-upgrade:dumpling-x:parallel-firefly-di... Yuri Weinstein
04:59 PM Bug #7907 (Resolved): osd: rollback to head didn't mark_unrollbackable
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-28_22:35:22-upgrade:dumpling-x:stress-split-firefl... Yuri Weinstein
04:39 PM Revision f895d16c (ceph): valgrind.supp: be less picky about library versions
...so that this works on trusty
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:20 PM Revision 74f3827f (ceph): Merge pull request #1548 from ceph/wip-7880
mds: properly propagate dirty dirstat to auth inode
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
04:18 PM Bug #7744: osd: assert(last_e.version.version < e.version.version)
Two of my osds are crashing with the same signature:
osd/PGLog.cc: 672: FAILED assert(last_e.version.version < e.v...
Jake Young
01:41 PM Revision b71e64dd (ceph): mds: find approximal bounds when adjusting subtree auth
When finishing exporting a subtree, the exporter MDS drops locks and
sends MExportDirFinish message to the importer M...
Yan, Zheng
10:25 AM Revision fd28ad55 (ceph): doc: erasure-code development complete
remove the note explaining that it is not yet available.
Signed-off-by: Loic Dachary <loic@dachary.org>
Loïc Dachary
09:34 AM Revision 399de24e (ceph): erasure-code: do not attempt to compile SSE4 on i386
SSE4 is unavailable only on older CPUs. Although the compiler could
probably generate the code, there is no point i...
Loïc Dachary
09:19 AM CephFS Bug #7880 (Resolved): multimds: directory gets rsynced twice
Sage Weil
05:28 AM Subtask #7548 (Resolved): Basic docs for Erasure Coding
Loïc Dachary
01:25 AM Revision 07ccc4e7 (ceph): mds: commit new dirfrag before splitting it
Commit 6e013cd6 (properly set COMPLETE flag when merging dirfrags)
tries solving the issue that new dirfrag's COMPLET...
Yan, Zheng
01:13 AM Revision 8f5e92dd (ceph): Merge pull request #1562 from onlyjob/debian
init: fix OSD startup issue
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:11 AM Revision 780fa91c (ceph): Revert "ceph-conf: do not log"
This reverts commit acc31e75a3e7115c00f9980609948455e3b2d49e. Sage Weil
01:11 AM Revision b5a63207 (ceph): Revert "ceph-conf: no admin_socket"
This reverts commit 72715b235a0daee7ab8e5cd3ab6e415de2939df9.
This breaks the ceph cli, which uses ceph-conf --show-...
Sage Weil
01:06 AM Revision 44afc233 (ceph): init: fix OSD startup issue
On machines with MON and OSDs (on boot) OSDs started shortly after MON startup
but MON needs time to become opration...
Dmitry Smirnov
01:01 AM Revision f5aa492a (ceph): Merge pull request #1519 from ceph/wip-6951-dumpling
rgw: reset objv tracker on bucket recreation Sage Weil
12:56 AM Revision fd76fec5 (ceph): rbd.cc: tolerate lack of NUL-termination on block_name_prefix
Fixes: #7577
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Dan Mick
12:20 AM Revision bd6e35c1 (ceph): rbd.cc: yes, cover formatted output as well. sigh.
Fixes: #7577
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
12:19 AM Revision fab374b0 (ceph): Merge pull request #1552 from ceph/wip-7902
osd/PG: fix choose_acting revert to up case
Reviewed-by: Samuel Just <sam.just@inktank.com>
Sage Weil
12:02 AM Revision fef70cbb (ceph): Merge pull request #1559 from ceph/wip-7881-dumpling
Wip 7881 dumpling
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil

03/28/2014

11:59 PM Revision 66c1ef45 (ceph): Merge pull request #1560 from ceph/wip-7903
Wip 7903
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
11:49 PM Revision 80e0a0a8 (ceph): mon/MonClient: use keepalive2 to verify the mon session is live
Verify that the mon is responding by checking the keepalive2 reply
timestamp. We cannot rely solely on TCP timing ou...
Sage Weil
11:49 PM Revision 87232183 (ceph): msgr: add KEEPALIVE2 feature
This is similar to KEEPALIVE, except a timestamp is also exchanged. It is
sent with the KEEPALIVE, and then returned...
Sage Weil
11:49 PM Revision a2f0974f (ceph): Pipe: rename keepalive->send_keepalive
Signed-off-by: Greg Farnum <greg@inktank.com>
(cherry picked from commit 38d4c71a456c1cc9a5044dbcae5378836a34484d)
Greg Farnum
11:17 PM Revision cb699f6a (ceph): rados/monthrash: shorten mon pings so that freezing triggers reconnects
This exercises (though it doesn't necessarily verify) the mon ping reconnect
code when the thrasher freezes the mons.
S...
Sage Weil
11:09 PM Revision d747d79f (ceph): msgr: add KEEPALIVE2 feature
This is similar to KEEPALIVE, except a timestamp is also exchanged. It is
sent with the KEEPALIVE, and then returned...
Sage Weil
11:09 PM Revision 056151a6 (ceph): mon/MonClient: use keepalive2 to verify the mon session is live
Verify that the mon is responding by checking the keepalive2 reply
timestamp. We cannot rely solely on TCP timing ou...
Sage Weil
11:01 PM Revision 3a66e291 (ceph): Merge pull request #1558 from ceph/wip-7837
ReplicatedPG: include pending_attrs when resetting attrs in WRITEFULL
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
10:57 PM Revision 1aa1d938 (ceph): ReplicatedPG: hit_set_setup, agent_setup, skip if !active
We call these in on_activate and on_pool_change. In the former, we are
necessarily active. In the latter, we only w...
Samuel Just
10:56 PM Revision 881bd36a (ceph): client: pin Inode during readahead
Make sure the Inode does not go away while a readahead is in progress. In
particular:
- read_async
- start a re...
Sage Weil
10:56 PM Revision dbfb993c (ceph): osdc/ObjectCacher: call read completion even when no target buffer
If we do not assemble a target bl, we still want to return a valid return
code with the number of bytes read-ahead so ...
Sage Weil
10:48 PM Revision 9bfbce30 (ceph): client: pin Inode during readahead
Make sure the Inode does not go away while a readahead is in progress. In
particular:
- read_async
- start a re...
Sage Weil
10:48 PM Revision 23244557 (ceph): osdc/ObjectCacher: call read completion even when no target buffer
If we do not assemble a target bl, we still want to return a valid return
code with the number of bytes read-ahead so ...
Sage Weil
10:34 PM Revision f443ff30 (ceph): PGLog: remove obsolete assert in merge_log
This assert assumes that if olog.head != log.head, olog contains
a log entry at log.head, which may not be true since...
Samuel Just
10:34 PM Revision 636e53c0 (ceph): PGLog: on split, leave log head alone
This way last_update doesn't go backwards.
Fixes: 6447
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry pic...
Samuel Just
10:08 PM Revision 1af95e7b (ceph): Merge pull request #1557 from ceph/wip-7867
client: fix assert(!unclean) due to readahead vs close race
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:55 PM Revision f1c7b4ef (ceph): client: pin Inode during readahead
Make sure the Inode does not go away while a readahead is in progress. In
particular:
- read_async
- start a re...
Sage Weil
09:55 PM Revision 032d4ec5 (ceph): osdc/ObjectCacher: call read completion even when no target buffer
If we do not assemble a target bl, we still want to return a valid return
code with the number of bytes read-ahead so ...
Sage Weil
09:07 PM Revision c166215d (ceph): Merge pull request #1553 from ceph/wip-7874
ReplicatedPG: disable clone subsets for cache pools
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:06 PM Revision 3c18fad4 (ceph): Merge pull request #1554 from ceph/wip-7828
ReplicatedPG:: s/_delete_head/_delete_oid, adjust head_exists iff is_hea...
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:05 PM Revision decbe2c0 (ceph): Merge pull request #1555 from ceph/wip-7835
ReplicatedPG::make_writeable: fill in ssc on clone
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
09:05 PM Revision 68dc0c6b (ceph): rgw: move max_chunk_size initialization
RGWRados::initialize() is not called when doing
RGWRados::get_raw_storage_provider(). This was the culprit for issue
...
Yehuda Sadeh
08:27 PM Revision a6be1d67 (ceph): PG: always clear agent_state in clear_primary_state
We aren't actually active between activate() and all_activated_committed().
We'd have to suspend agent_work during th...
Samuel Just
08:25 PM Revision dfd3cb51 (ceph): rgw: only look at prefetched data if we actually prefetched
Fixes: #7903
Since we didn't prefetch data then we couldn't rely on the data to
actually exist there. In that case ju...
Yehuda Sadeh
08:25 PM Revision 80c5b4ce (ceph): Merge pull request #1517 from ceph/wip-7805
PG: only complete replicas should count toward min_size
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
08:10 PM Revision d78e6788 (ceph): osd/PG: fix choose_acting revert to up case
If we decide to revert back to up, we need to
1- return false, so that we go into the NeedActingChange state, and
2-...
Sage Weil
07:11 PM rbd Bug #6480 (In Progress): librbd crashed qemu-system-x86_64
Mike provided a core dump in which structures appear to be normal, though it's hard to tell with the ceph::buffer::ra... Josh Durgin
06:31 PM Bug #7849: ceph-conf create empty log files
The socket part was introduced later -- perhaps only the socket part should be reverted?
Log-related change seems safe...
Dmitry Smirnov
06:05 PM Bug #7849 (In Progress): ceph-conf create empty log files
oops, the previous fix breaks ceph.py, which does ceph-conf --show-config-value to get admin_socket. Sage Weil
06:08 PM Revision 71fa779e (ceph): mds: don't clear scatter dirty when cache rejoin ack is received
The auth mds has received dirty scatterlock state. But it hasn't
journaled the dirty state yet. The log segment that ...
Yan, Zheng
06:08 PM Revision ce936161 (ceph): mds: fix stack overflow caused by nested dispatch
Commit bc3325b37 fixes a stack overflow bug that happens when replaying
client requests. Similar stack overflow can happen...
Yan, Zheng
06:08 PM Revision 34ef91a2 (ceph): mds: fix negative dirstat assertion
When splitting dirfrag, delta dirstat is always added to the first new
dirfrag. Before the delta dirstat is propagate...
Yan, Zheng
06:08 PM Revision 79aa26ff (ceph): mds: allow sending MMDSFindIno to MDS who is in clientreplay state
Because MDCache::kick_find_ino_peers() is called when a MDS enters
clientreplay state.
Signed-off-by: Yan, Zheng <zh...
Yan, Zheng
06:08 PM Revision 367987fa (ceph): mds: don't increase nlink when rollback stray reintegration
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
06:08 PM Revision 90b4e53c (ceph): mds: include authority of the overwritten inode in rename witnesses
Rename operation needs to adjust the overwritten inode's link count.
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
06:08 PM Revision 0bb911c6 (ceph): mds: don't trim non-auth root inode/dirfrag
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
06:08 PM Revision 6963a8f9 (ceph): mds: handle interaction between slave rollback and fragmenting
For slave rename and rmdir events, the MDS needs to preserve non-auth
dirfrag where the renamed inode originally live...
Yan, Zheng
06:08 PM Revision ffcbcdd6 (ceph): mds: handle race between cache rejoin and fragmenting
MDCache::handle_cache_expire() ignores mismatched dirfrags. this is
OK during normal operation because MDS doesn't tr...
Yan, Zheng
06:08 PM Revision e535f7f2 (ceph): mds: avoid journaling non-auth opened inode
Exporting inode has AUTH bit set while EExport event is being
journaled.
Signed-off-by: Yan, Zheng <zheng.z.yan@inte...
Yan, Zheng
06:08 PM Revision 6a548a97 (ceph): mds: fix CDir::WAIT_ANY_MASK
make sure CDir::WAIT_ANY_MASK include MDSCacheObject::WAIT_UNFREEZE
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
06:08 PM Revision e811b07e (ceph): mds: properly journal fragment rollback
If dirfrags are subtree roots, mark the dirfragtreelock as scattered
dirty, otherwise journal the dirfragtree change....
Yan, Zheng
06:08 PM Revision 3c6c7124 (ceph): mds: trim non-auth inode with remote parents
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
06:08 PM Revision 598c5f18 (ceph): mds: trim empty non-auth dirfrags
Fragmenting a non-auth dirfrag results in several smaller dirfrags. Some
of the resulting dirfrags can be empty, which a...
Yan, Zheng
06:08 PM Revision a1f5c645 (ceph): mds: don't trim ambiguous import dirfrags
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
06:08 PM Revision 1bd575e2 (ceph): mds: fix CInode::get_approx_dirfrag
return NULL if there is no opened dirfrag
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
06:08 PM Revision ed49d5ef (ceph): mds: journal EFragment::OP_COMMIT before drop locks
Dropping locks can dispatch other requests. These requests can submit
log entries.
Signed-off-by: Yan, Zheng <zheng.z.y...
Yan, Zheng
06:08 PM Revision fb19100f (ceph): mds: fix null pointer dereference in MDCache::rejoin_send_rejoins()
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
06:08 PM Revision d14ec95e (ceph): mds: skip non-opened session when flushing client sessions
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
06:08 PM Revision fbf4fbc3 (ceph): mds: explicitly set nonce for imported dirfrag
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
05:28 PM Revision 9f3a6644 (ceph): rgw_pool_type: remove accidentally added empty file
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
05:23 PM Bug #7626: After updating ceph from 0.75 to 0.77 one of the three monitors can't start
Yes, I'm pretty sure it is.. this bug affected 0.77 and was fixed for 0.78. If I remember correctly, the full osdmap... Sage Weil
05:20 PM Bug #7626: After updating ceph from 0.75 to 0.77 one of the three monitors can't start
This sounds like it could be commit:14ea8157eb2883b9f53c234044fe002153212ef8 Sage Weil
10:18 AM Bug #7626: After updating ceph from 0.75 to 0.77 one of the three monitors can't start
The store attached to the ticket shows the latest 7 full osdmaps as being unable to be decoded, which would explain t... Joao Eduardo Luis
05:18 PM Bug #7902 (Pending Backport): osd/PG.cc: 6803: FAILED assert(!pg->actingbackfill.empty())
Sage Weil
12:54 PM Bug #7902 (Resolved): osd/PG.cc: 6803: FAILED assert(!pg->actingbackfill.empty())
this is a split dumpling/firefly cluster
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-03-27_22:35:2...
Sage Weil
05:17 PM Bug #7881 (Resolved): osd/PGLog.cc: 430: FAILED assert(to != olog.log.end() || (olog.head == info...
Sage Weil
03:37 PM Bug #7881: osd/PGLog.cc: 430: FAILED assert(to != olog.log.end() || (olog.head == info.last_update))
Samuel Just
03:16 PM Bug #7881 (In Progress): osd/PGLog.cc: 430: FAILED assert(to != olog.log.end() || (olog.head == i...
Samuel Just
09:52 AM Bug #7881: osd/PGLog.cc: 430: FAILED assert(to != olog.log.end() || (olog.head == info.last_update))
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-03-27_22:35:20-upgrade:dumpling-x:stress-split-firefly... Sage Weil
04:58 PM rgw Bug #7903 (Resolved): radosgw-admin: failing to set default region
Sage Weil
01:27 PM rgw Bug #7903 (Fix Under Review): radosgw-admin: failing to set default region
Yehuda Sadeh
12:59 PM rgw Bug #7903 (Resolved): radosgw-admin: failing to set default region
http://pulpito.ceph.com/teuthology-2014-03-27_23:00:17-rgw-firefly-distro-basic-plana/149617/ Yehuda Sadeh
04:00 PM rgw Bug #7837 (Resolved): s3tests test_object_metadata_replaced_on_put fails on an erasure coded pool
Sage Weil
03:56 PM CephFS Bug #7867 (Resolved): client/Client.cc: 2087: FAILED assert(!unclean)
Sage Weil
03:47 PM CephFS Bug #7867 (Pending Backport): client/Client.cc: 2087: FAILED assert(!unclean)
Sage Weil
03:52 PM Bug #7906 (Duplicate): "adjust-ulimits ... --rgw-region zero'''" filed in rgw-firefly-distro-basi...
Duplicates #7903 Yehuda Sadeh
03:36 PM Bug #7906 (Duplicate): "adjust-ulimits ... --rgw-region zero'''" filed in rgw-firefly-distro-basi...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-27_23:00:17-rgw-firefly-distro-basic-plana/149644/... Yuri Weinstein
03:23 PM Revision 0dcb54f7 (ceph): Merge pull request #1549 from dachary/wip-doc
doc: fix typos in tiering dev doc Sage Weil
02:07 PM Bug #7874 (Resolved): clone_range from clone on recovery fails if clone has been evicted
Sage Weil
02:05 PM Bug #7828 (Resolved): osd/ReplicatedPG.cc: 4984: FAILED assert(ctx->new_obs.exists == ctx->new_sn...
Sage Weil
02:05 PM Bug #7835 (Resolved): make_writeable needs to fill in ssc on new clone
Sage Weil
01:25 PM Bug #7805 (Resolved): emperor can go active with < min_size non-incomplete peers since we check a...
Sage Weil
01:21 PM Bug #7904 (Resolved): osd/ReplicatedPG.cc: 10661: FAILED assert(is_active())
We need to dump agent_state even on primary->primary
-69> 2014-03-28 13:07:10.884747 7fdc1660b700 15 journal pr...
Samuel Just
01:02 PM Revision 72eaa5e8 (ceph): doc: fix typos in tiering dev doc
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
10:43 AM Fix #7890: erasure-code: last stripe is not truncated
At present, this is by design. Samuel Just
01:05 AM Fix #7890 (New): erasure-code: last stripe is not truncated
When encoding a 10-byte object with osd_pool_erasure_code_stripe_width = 2048 and k=2 + m=1, the last stripe should b... Loïc Dachary
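The numbers in this report can be worked through with a short sketch. It assumes that an object is padded up to a whole number of stripes and that each stripe is split evenly into k data chunks; the function name padded_layout and its return keys are hypothetical and are not taken from the Ceph source.

```python
def padded_layout(object_size, stripe_width, k, m):
    """Illustrative arithmetic only; not Ceph's implementation.

    Assumes the object is padded to a multiple of stripe_width and that
    each stripe is divided evenly across k data chunks.
    """
    chunk_size = stripe_width // k
    # Ceiling division: number of stripes needed to hold the object.
    stripes = -(-object_size // stripe_width)
    return {
        "chunk_size": chunk_size,
        "stripes": stripes,
        "padded_size": stripes * stripe_width,
        "bytes_per_shard": stripes * chunk_size,
        "shards": k + m,
    }


layout = padded_layout(object_size=10, stripe_width=2048, k=2, m=1)
```

Under these assumptions, the 10-byte object occupies one full 2048-byte stripe, so each of the three shards holds 1024 bytes rather than a truncated chunk.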
10:05 AM rbd Feature #7895 (Resolved): krbd: test cloning, discard, plus regular I/O via fsx
This could be another fork of src/test/librbd/fsx.c in ceph.git, or modifications to it to abstract out the I/O more ... Josh Durgin
09:21 AM Bug #7894 (Duplicate): osd: missing hitset object in cluster log
failure_reason: '"2014-03-27 16:02:47.849800 osd.4 10.214.131.17:6800/13622 33 : [ERR]
4.0 shard (4,255) missing 0...
Sage Weil
09:20 AM Bug #7893 (Duplicate): osd/ReplicatedPG.cc: 10190: FAILED assert(0 == "erroneously present object")
... Sage Weil
09:19 AM Bug #7892 (Duplicate): osd/ReplicatedPG.cc: 7881: FAILED assert((data_included.empty() && data.le...
... Sage Weil
09:18 AM Bug #7891 (Resolved): osd: leaked pg refs on shutdown
... Sage Weil
08:53 AM Bug #5884: negative num_objects_degraded in pool stats
Oops, replace 1.3 with 1.30 in previous message.... John Spray
08:50 AM Bug #5884: negative num_objects_degraded in pool stats

Seen on a cluster that's been running for the past 2 weeks on the firefly branch.
Potentially noteworthy things ...
John Spray
08:25 AM CephFS Bug #7780 (Resolved): When full flag is set, even MDS writes are blocked
Fix was merged at c647a03fffb2e1e997dbdb0ff128eeb6efc47deb John Spray
05:24 AM Revision 1b5e8f43 (ceph): mds: properly propagate dirty dirstat to auth inode
Propagate dirty dirstat to freezing auth inode if the inode is
already auth pinned by the Mutation. Otherwise the di...
Yan, Zheng
12:58 AM Revision 5651ee81 (ceph): upgrade/dumpling-x/parallel: restart after cuttlefish->dumpling upgrade
Fixes: #7824
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:22 AM Revision dfdeb5c3 (ceph): radosgw-agent: coerce max-entries config to a string
This is necessary to pass it as a command line argument.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
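The reason for the coercion can be shown with a minimal sketch: argument lists handed to a subprocess must contain strings, so a numeric config value has to be converted first. The helper build_agent_args is hypothetical and the child process is a stand-in Python one-liner, not the real radosgw-agent; only the option name --max-entries is inferred from the commit title.

```python
import subprocess
import sys


def build_agent_args(max_entries):
    """Build an argv list, coercing the numeric option to a string."""
    # The child command is a placeholder that echoes its own arguments.
    return [
        sys.executable, "-c", "import sys; print(sys.argv[1:])",
        "--max-entries", str(max_entries),
    ]


def run_agent(max_entries):
    """Run the placeholder child and return what it printed."""
    return subprocess.check_output(build_agent_args(max_entries)).decode().strip()
```

Passing the integer directly in the list (for example 1000 instead of "1000") would make subprocess raise a TypeError before the child is ever started.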
12:14 AM Revision 7f4be9e9 (ceph): Merge pull request #1547 from ceph/wip-cache-scrub
osd: improve scrub checks on clones; tolerate missing clones on cache pools
Fixes: #7885
Reviewed-by: Samuel Just <s...
Samuel Just

03/27/2014

11:45 PM Revision 38d4c71a (ceph): Pipe: rename keepalive->send_keepalive
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
11:39 PM Revision 7a1990b6 (ceph): Merge branch 'wip-7875'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
11:39 PM Revision c64d03d0 (ceph): mon/OSDMonitor: require OSD_CACHEPOOL feature before using tiering feat...
The OSDs need to support this feature before we allow users to turn it
on. This is similar to what the erasure pool ...
Sage Weil
11:38 PM Revision 69321bf5 (ceph): mon/OSDMonitor: prevent setting hit_set unless all OSDs support it
We are using OSD_CACHEPOOL as a proxy for the support for the tiering
OSDMap infrastructure.
Signed-off-by: Sage Wei...
Sage Weil
10:26 PM CephFS Bug #7880: multimds: directory gets rsynced twice
Zheng Yan
06:54 AM CephFS Bug #7880 (Resolved): multimds: directory gets rsynced twice
probably the mtime doesn't get set properly the first time?... Sage Weil
10:12 PM Revision eb71924e (ceph): osd/ReplicatedPG: tolerate missing clones in cache pools
A few cases:
- As we are working through the list, if we see a clone that is lower than
the next one we were expec...
Sage Weil
10:00 PM Revision 6508d5ef (ceph): osd/ReplicatedPG: improve clone vs head checking
- notice when we are missing a clone (that isn't at the end of the list)
- notice when we are missing a clone on the ...
Sage Weil
09:31 PM Messengers Bug #7888: msgr: keepalive is insufficient
wip-7888 handles this for MonClient. We can do the same with Objecter, but this is less critical because we will fin... Sage Weil
09:24 PM Messengers Bug #7888 (Fix Under Review): msgr: keepalive is insufficient
Sage Weil
06:01 PM Messengers Bug #7888 (In Progress): msgr: keepalive is insufficient
Sage Weil
04:44 PM Messengers Bug #7888 (Resolved): msgr: keepalive is insufficient
the current keepalive behavior relies on writes triggering a tcp timeout/error, which does not actually happen in many... Sage Weil
08:48 PM Revision 9e2cd5fe (ceph): osd/ReplicatedPG: do not assert on clone_size mismatch
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:28 PM Revision 7f026ba6 (ceph): ceph_test_rados_api_tier: scrub while cache tier is missing clones
Trigger a scrub to verify that we can handle a cache tier that is missing
some clones. We rely on the test harness t...
Sage Weil
08:01 PM Revision c5682e78 (ceph): Merge pull request #1546 from ceph/wip-fix-pools
fix pool ops test Dan Mick
07:57 PM Revision 7cb1d3a4 (ceph): qa/workunits/mon/pool_ops.sh: fix test
The pool create command doesn't take k/v pairs any more.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:56 PM Revision 233801c6 (ceph): qa/workunits/mon/pool_ops.sh: use expect_false
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:35 PM Revision ce59760a (ceph): Merge pull request #1545 from ceph/wip-7849-b
ceph-conf: do not log
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
07:30 PM Revision 72715b23 (ceph): ceph-conf: no admin_socket
We don't need to worry about pidfile because that is done by the fork
functions, which ceph-conf doesn't call.
Signe...
Sage Weil
07:03 PM Revision e91f5c8c (ceph): Merge pull request #1522 from themgt/patch-1
document adding dev key for custom Apache/FCGI install
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
06:51 PM Revision 68343ee1 (ceph): rados/thrash/workloads: make cache-agent-big use an ec base pool
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:41 PM Revision fb208237 (ceph): jerasure: fix up .gitignore
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:36 PM Revision acc31e75 (ceph): ceph-conf: do not log
If you are querying the conf for an osd and it has a log configured, we
should not generate any log activity.
This i...
Sage Weil
06:33 PM Revision 3f1417a8 (ceph): Merge pull request #1542 from onlyjob/debian
logrotate: do not rotate empty logs (2nd logrotate file)
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
06:15 PM Revision e21561e7 (ceph): Merge pull request #1544 from ceph/wip-7876
rgw: use s->content_length instead of s->length
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
06:14 PM Revision 9f313109 (ceph): Merge pull request #1534 from dachary/wip-sse-fix
erasure code sse optimized jerasure plugin
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
06:09 PM Revision 3bbea517 (ceph): Merge pull request #234 from ceph/wip-priority
schedule_suite: ugly hack to set priority when scheduling Zack Cerza
05:58 PM Bug #7824 (Resolved): LibRadosList.ListObjectsNS failure
ugh, the test wasn't restarting after the cuttlefish->dumpling upgrade step.
ceph-qa-suite.git commit 5651ee813170...
Sage Weil
05:53 PM Revision ffd69ab3 (ceph): rgw: use s->content_length instead of s->length
Fixes: #7876
Need to use the actual content length, not the pointer to the string.
This was probably working because ...
Yehuda Sadeh
05:40 PM devops Bug #7889: IPv6 support with ceph-deploy
socket.gethostbyname supports only ipv4 and should be replaced with socket.getaddrinfo
next the inet_aton is also ...
Miha Zidar
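The suggested replacement can be sketched as follows. This is an assumption-laden illustration, not the ceph-deploy code: the helper names is_resolvable and is_ip_literal are hypothetical, and pairing getaddrinfo with socket.inet_pton for literal checks is one possible choice rather than what the comment (which is truncated) actually proposes.

```python
import socket


def is_resolvable(hostname):
    """Return True if hostname resolves to at least one IPv4 or IPv6 address."""
    try:
        # AF_UNSPEC requests both address families, whereas
        # socket.gethostbyname only ever returns IPv4 results.
        return len(socket.getaddrinfo(hostname, None, socket.AF_UNSPEC)) > 0
    except socket.gaierror:
        return False


def is_ip_literal(value):
    """Return True if value is an IPv4 or IPv6 address literal."""
    for family in (socket.AF_INET, socket.AF_INET6):
        try:
            socket.inet_pton(family, value)
            return True
        except (OSError, ValueError):
            continue
    return False
```

Note that inet_pton is stricter than inet_aton: it accepts only full dotted-quad IPv4 literals, so shorthand forms such as "127.1" are rejected.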
05:36 PM devops Bug #7889 (Resolved): IPv6 support with ceph-deploy
In ceph-deploy/util/arg_validators.py the hostname function fails "hostname: X is not resolvable" if using ipv6 Miha Zidar
05:24 PM Revision b6ad5c6c (ceph): rgw: idle timeout config moves to the external server line
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:21 PM Revision 0935bb61 (ceph): Merge pull request #1540 from ceph/wip-7860
test: Wait for tier removal before next test starts
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
05:14 PM Bug #7885 (Resolved): scrub must not take offence if evicted/unpromoted clones are missing on a c...
Samuel Just
11:54 AM Bug #7885: scrub must not take offence if evicted/unpromoted clones are missing on a cache pool
Samuel Just
11:41 AM Bug #7885 (Resolved): scrub must not take offence if evicted/unpromoted clones are missing on a c...
Samuel Just
05:13 PM Revision fd6056be (ceph): schedule_suite: ugly hack to set priority when scheduling
I don't want to add another arg at the end of the very long list of
optional args, so I'll use an environment variabl...
Sage Weil
04:42 PM Revision 501e31d9 (ceph): logrotate: do not rotate empty logs (2nd logrotate file)
Signed-off-by: Dmitry Smirnov <onlyjob@member.fsf.org> Dmitry Smirnov
04:41 PM Bug #7887 (Resolved): W: shlib-with-executable-stack
Lintian produced the following warnings after building Debian packages of ceph-0.78:... Dmitry Smirnov
04:35 PM Revision 158f9ba1 (ceph): Revert "Lines formerly of the form '(remote,) = ctx.cluster.only(role)....
This reverts commit d693b3f8950ffd1f2492a4db0f8234fee31f00f0. Zack Cerza
04:35 PM Revision fac452ae (ceph): Revert "Lines formerly of the form '(remote,) = ctx.cluster.only(role)....
This reverts commit d693b3f8950ffd1f2492a4db0f8234fee31f00f0. Zack Cerza
04:29 PM Bug #7875 (Resolved): osd: pg_pool_t hitset fields incompat
Sage Weil
10:40 AM Bug #7875 (In Progress): osd: pg_pool_t hitset fields incompat
Sage Weil
02:48 PM Documentation #7886 (New): What's the policy on URL stability for public documentation?
We're trying to decide whether to include links to documentation on ceph.com within the UI of the Calamari product. W... Yan-Fa Li
02:02 PM Revision 2d553161 (ceph): Merge pull request #1541 from onlyjob/debian
logrotate improvement: do not rotate empty logs
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:27 PM Revision 91176f14 (ceph): erasure-code: test encode/decode of SSE optimized jerasure plugins
If the machine running make check has the required CPU features
available, load the SSE optimized plugin and check th...
Loïc Dachary
01:27 PM Revision cc0cc152 (ceph): erasure-code: gf-complete / jerasure modules updates
To avoid confusion, the jerasure v1 branch that contains commits pending
review upstream is named v2-ceph and the gf-...
Loïc Dachary
01:27 PM Revision c07aedb6 (ceph): autotools: intel cpu features detection
Rename SIMD to INTEL for clarity.
Instead of aggregating all flags in INTEL_FLAGS, create individual flags
for each f...
Loïc Dachary
01:27 PM Revision e9878db2 (ceph): arch: add SSE3, SSSE3, SSSE41 and PCLMUL intel features
And add a note about valgrind forcing a fake cpuid.
Signed-off-by: Loic Dachary <loic@dachary.org>
Loïc Dachary
01:27 PM Revision 10fd6b31 (ceph): erasure-code: SSE optimized jerasure plugins
The jerasure plugin is compiled with three sets of flags:
* jerasure_generic with no SSE optimization
* jerasure_sse...
Loïc Dachary
01:27 PM Revision 30e71405 (ceph): osd: increase osd verbosity during functional tests
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
01:27 PM Revision b76ad972 (ceph): erasure-code: test jerasure SSE optimized plugins selection
Test the selection of the plugin depending on the CPU features. The
prefix of the plugin is "jerasure" by default (je...
Loïc Dachary
01:26 PM Revision 12d4f382 (ceph): erasure-code: allow loading a plugin from factory()
The Mutex scope is restricted to only protect the load() method and not
the factory() method. This allows a plugin to...
Loïc Dachary
01:24 PM Bug #7805 (Fix Under Review): emperor can go active with < min_size non-incomplete peers since we...
Samuel Just
01:24 PM Bug #7858: agent with snaps ceph_test_rados error
Samuel Just
12:37 PM Bug #7849 (Resolved): ceph-conf create empty log files
commit:acc31e75a3e7115c00f9980609948455e3b2d49e Josh Durgin
11:36 AM Bug #7849 (Fix Under Review): ceph-conf create empty log files
Sage Weil
11:16 AM Bug #7849: ceph-conf create empty log files
This should be just a matter of passing the right flags to common_init or global_init Sage Weil
11:34 AM Feature #7884 (New): investigate having the messenger (or dispatch q?) in the osd limit the numbe...
Samuel Just
11:14 AM rgw Bug #7876 (Resolved): rgw: > on char* in rgw_rest_user
Sage Weil
11:13 AM rgw Bug #7876 (Fix Under Review): rgw: > on char* in rgw_rest_user
Yehuda Sadeh
11:13 AM Bug #7826 (Resolved): osd: illegal instruction in jerasure
Sage Weil
03:23 AM Bug #7826 (Fix Under Review): osd: illegal instruction in jerasure
Loïc Dachary
10:40 AM CephFS Bug #7867: client/Client.cc: 2087: FAILED assert(!unclean)
Sage Weil
10:37 AM Fix #7560 (Closed): mon: add compat set feature to mark an upgraded pg format in order to disallo...
The feature was introduced for Dumpling. The only version that does not support it is Cuttlefish, and it will assert... Joao Eduardo Luis
10:20 AM Bug #7860 (Resolved): LibRadosTwoPoolsPP.PromoteSnap failed at line 311
Sage Weil
08:54 AM Bug #7881 (Resolved): osd/PGLog.cc: 430: FAILED assert(to != olog.log.end() || (olog.head == info...
mixed dumpling/firefly cluster:... Sage Weil
08:54 AM rbd Bug #6628: krbd: BUG during ceph_osdc_stop() sometimes when rbd_add() fails
Ian Colle
07:18 AM devops Bug #7879: sentry-db is down
Let's get these services moved over today. Zack Cerza
06:48 AM devops Bug #7879 (Resolved): sentry-db is down
Zack Cerza
04:44 AM Revision d9a2dea7 (ceph): Merge remote-tracking branch 'gh/firefly'
Sage Weil
03:01 AM Revision b454bd6c (ceph): rgw: add erasure coded data pool variant
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
01:43 AM Revision d693b3f8 (ceph): Lines formerly of the form '(remote,) = ctx.cluster.only(role).remotes....
and '(remote,) = ctx.cluster.only(role).remotes.iterkeys()' would fail with
ValueError and no message if there were l...
Warren Usui
01:43 AM Revision e98b1073 (ceph): Lines formerly of the form '(remote,) = ctx.cluster.only(role).remotes....
and '(remote,) = ctx.cluster.only(role).remotes.iterkeys()' would fail with
ValueError and no message if there were l...
Warren Usui
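The failure mode described in this commit message comes from single-element tuple unpacking, which raises ValueError whenever the iterable does not hold exactly one item. A minimal sketch of a more descriptive check follows; only_remote is a hypothetical helper, and plain lists stand in for the teuthology cluster objects.

```python
def only_remote(remotes, role):
    """Return the single remote for role, or raise with a descriptive message."""
    matches = list(remotes)
    if len(matches) != 1:
        raise ValueError(
            "expected exactly one remote for role %r, found %d"
            % (role, len(matches))
        )
    # Safe now: the unpacking below cannot fail.
    (remote,) = matches
    return remote
```

Checking the count first lets the error name the role and the number of matches, which bare unpacking cannot do.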
01:12 AM Revision 506d2bba (ceph): logrotate improvement: do not rotate empty logs
Signed-off-by: Dmitry Smirnov <onlyjob@member.fsf.org> Dmitry Smirnov
12:47 AM Revision dc3ce58a (ceph): osd: do not make pg_pool_t incompat when hit_sets are enabled
If we enable HitSet tracking, the OSD needs to know this, but clients do
not care. Setting the compat version is too...
Sage Weil
12:43 AM Revision 92859ed6 (ceph): ReplicatedPG: include pending_attrs when resetting attrs in WRITEFULL
This is necessary for ops like:
2014-03-25 11:48:20.354168 7f0df0ff9700 1 -- 192.168.106.226:0/1016231
--> 192.168....
Samuel Just
12:24 AM Revision b6a431b1 (ceph): ReplicatedPG: disable clone subsets for cache pools
To do this properly, we need to take appropriate read
locks on clone sources and handle evicted clones.
Fixes: #7874...
Samuel Just
12:18 AM Revision e84ac3aa (ceph): Merge pull request #1538 from ceph/wip-6910-emperor
PG: don't query unfound on empty pgs Sage Weil
12:18 AM Revision a0a560a9 (ceph): Merge pull request #1539 from ceph/wip-6910-dumpling
PG: don't query unfound on empty pgs Sage Weil
12:16 AM Revision b5702640 (ceph): Merge pull request #1537 from ceph/wip-7871
RadosModel: allow --no-omap to be specified separately from --ec-pool
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
12:14 AM Revision ec40196f (ceph): Merge pull request #1536 from ceph/wip-7870
ReplicatedPG::do_osd_ops: only return ENOTSUP on OMAP write ops
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil

03/26/2014

11:04 PM Revision 56974b91 (ceph): test: Wait for tier removal before next test starts
Fixes: #7860
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
10:51 PM Revision 7999bc9e (ceph): ReplicatedPG:: s/_delete_head/_delete_oid, adjust head_exists iff is_head
Fixes: #7872
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:49 PM Revision 98654092 (ceph): rgw: configurable chunk size
Fixes: #7589
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
09:55 PM Bug #7824 (In Progress): LibRadosList.ListObjectsNS failure
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2014-03-25_19:33:06-upgrade:dumpling-x:parallel-firefly---b... Sage Weil
08:24 PM Revision 41d5e9ab (ceph): PG: don't query unfound on empty pgs
When the replica responds, it responds with a notify
rather than a log, which the primary then ignores since
it is al...
Samuel Just
08:23 PM Revision c6f754c5 (ceph): PG: don't query unfound on empty pgs
When the replica responds, it responds with a notify
rather than a log, which the primary then ignores since
it is al...
Samuel Just
07:38 PM Feature #7437 (In Progress): EC: add adapt unittest teuthology task and add to nightly
David Zafman
07:00 PM CephFS Bug #5787 (Duplicate): client/Client.cc: 2081: FAILED assert(!unclean) in put_inode
Zheng Yan
07:00 PM CephFS Bug #5787: client/Client.cc: 2081: FAILED assert(!unclean) in put_inode
dup #7867 Zheng Yan
06:48 PM Revision f171c93f (ceph): Merge pull request #1535 from ceph/wip-7823
osd: trim copy-get backend read to object size
Reviewed-by: Samuel Just <sam.just@inktank.com>
Samuel Just
06:44 PM Revision 832c007d (ceph): RadosModel: allow --no-omap to be specified separately from --ec-pool
Fixes: #7871
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
06:42 PM Revision 555ae125 (ceph): ReplicatedPG::do_osd_ops: only return ENOTSUP on OMAP write ops
This way, redirected omap reads will work correctly by returning
empty maps and buffers.
Fixes: #7870
Signed-off-by:...
Samuel Just
06:39 PM Revision 6cb85955 (ceph): ReplicatedPG::make_writeable: fill in ssc on clone
ssc should always be filled in on live ObjectContexts.
Fixes: #7835
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
06:37 PM Revision 21fc535e (ceph): osd: trim copy-get backend read to object size
We are passing a big number to the backend to read and it is trimming it
to the stripe boundary, and then setting the...
Sage Weil
06:09 PM rgw Bug #7837: s3tests test_object_metadata_replaced_on_put fails on an erasure coded pool
Samuel Just
05:44 PM rgw Bug #7837: s3tests test_object_metadata_replaced_on_put fails on an erasure coded pool
wip-7837 Samuel Just
05:28 PM rgw Bug #7837: s3tests test_object_metadata_replaced_on_put fails on an erasure coded pool
looking Samuel Just
06:05 PM rgw Bug #7876 (Resolved): rgw: > on char* in rgw_rest_user
... Sage Weil
05:57 PM CephFS Bug #7867: client/Client.cc: 2087: FAILED assert(!unclean)
pushed a simpler version, wip-7867-b, that just pins the Inode* for the duration of the readahead. This avoids a pos... Sage Weil
09:13 AM CephFS Bug #7867 (Resolved): client/Client.cc: 2087: FAILED assert(!unclean)
here is the last bit of log, starting with the incomplete readahead that caused the crash... Sage Weil
05:54 PM Bug #7875 (Resolved): osd: pg_pool_t hitset fields incompat
If the hitset fields get used in pg_pool_t the encoding is marked as incompatible. This breaks old clients that don'... Sage Weil
05:33 PM Revision 5aa5566b (ceph): Don't explode when finding an empty job
Signed-off-by: Zack Cerza <zack.cerza@inktank.com> Zack Cerza
05:27 PM Bug #7874: clone_range from clone on recovery fails if clone has been evicted
wip-7874 disables clone subsets for cache pools. Testing Samuel Just
05:26 PM Bug #7874: clone_range from clone on recovery fails if clone has been evicted
Samuel Just
05:13 PM Bug #7874 (Resolved): clone_range from clone on recovery fails if clone has been evicted
Options
1) don't allow clone_range from clone on recovery if cache pool
2) detect when the clone has been evicted a...
Samuel Just
05:26 PM Bug #7828: osd/ReplicatedPG.cc: 4984: FAILED assert(ctx->new_obs.exists == ctx->new_snapset.head_...
Samuel Just
05:25 PM Feature #7831: OSD: track objects with omap entries and don't count toward caps
Samuel Just
05:18 PM Bug #6910 (Resolved): don't query empty osds for unfound
Sage Weil
05:16 PM Revision 859e66b3 (ceph): Merge pull request #232 from ceph/wip-suite-priority
suite: allow priority to be set Zack Cerza
05:15 PM Feature #7871 (Resolved): ceph_test_rados: allow no-omap to be specified separately from ec-pool
Sage Weil
11:43 AM Feature #7871 (Resolved): ceph_test_rados: allow no-omap to be specified separately from ec-pool
Samuel Just
05:13 PM Bug #7870 (Resolved): only return ENOTSUP on omap write ops for EC pools
Sage Weil
02:04 PM Bug #7870 (Fix Under Review): only return ENOTSUP on omap write ops for EC pools
Samuel Just
11:39 AM Bug #7870 (Resolved): only return ENOTSUP on omap write ops for EC pools
This way redirected ops on objects without omap entries will work. Samuel Just
05:09 PM Revision 67844e0e (ceph): suite: allow priority to be specified when scheduling
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:59 PM rgw Feature #7589 (Resolved): rgw: configurable chunk size
commit:98654092fc5a18ef542d294d5696cac86d96229f Josh Durgin
04:10 PM Bug #7860: LibRadosTwoPoolsPP.PromoteSnap failed at line 311
David Zafman
03:47 PM Revision ee217952 (ceph): Merge pull request #1518 from dachary/wip-mailmap
mailmap updates
Reviewed-By: Christophe Courtaut <christophe.courtaut@gmail.com>
Loïc Dachary
03:33 PM Bug #7826: osd: illegal instruction in jerasure
For the record, the illegal instruction was on an AVX instruction. It was part of the flags being set at compile time... Loïc Dachary
09:53 AM Bug #7826: osd: illegal instruction in jerasure
Loïc Dachary
03:32 AM Bug #7826 (In Progress): osd: illegal instruction in jerasure
Work in progress: https://github.com/ceph/ceph/pull/1534 Loïc Dachary
02:17 PM Feature #7873 (Resolved): pg query: dump peer_info, peer_missing in all states
This debug info is useful for recovering from incomplete situations. Samuel Just
01:21 PM Bug #7872 (Duplicate): PG: all_unfound_are_queried_or_lost must skip dne replicas
Samuel Just
01:21 PM Bug #7872: PG: all_unfound_are_queried_or_lost must skip dne replicas
actually, 6910 Samuel Just
01:17 PM Bug #7872 (Duplicate): PG: all_unfound_are_queried_or_lost must skip dne replicas
Samuel Just
11:48 AM Bug #7823 (Resolved): osd: copy-get from ec pool returns wrong size
Samuel Just
11:22 AM Cleanup #7869 (Resolved): arch: use cpuid.h when possible
Milosz Tanski <milosz@adfin.com>:
Instead of doing cpuid manually you can use builtins provided in gcc
(and in...
Loïc Dachary
09:47 AM Bug #7868 (Can't reproduce): "failed to recover before timeout expired" in powercycle-firefly---b...
Logs are in http://qa-proxy.ceph.com/teuthology/teuthology-2014-03-23_23:55:02-powercycle-firefly---basic-plana/14419... Yuri Weinstein
05:39 AM Revision f1a1ac82 (ceph): Merge remote-tracking branch 'gh/firefly'
Sage Weil
05:38 AM Revision 29f74202 (ceph): Revert "osd: create the erasure-code ruleset in OSDMap::build_simple"
This reverts commit b85330eb1d62632a1c3f2025b1f309816d54bd5a.
We can't have this in the OSDMap by default or else al...
Sage Weil
05:38 AM Revision 18c3e9ec (ceph): osd: fix tests due to no default erasure-code ruleset
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:44 AM Revision fbbdbf0a (ceph): Merge pull request #1530 from onlyjob/fix-defaultweight
fix bug in 'defaultweight' calculation on OSD start.
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:32 AM rbd Bug #7282: Unresponsive rbd-backed Qemu domain causes libvirtd to stall on all connections
Thanks! Florian Haas
01:04 AM Revision b2cf052a (ceph): rgw: allow overriding options
This is mainly useful for toggling ec/replicated pools right now.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
12:15 AM devops Bug #6592: 3.8 kernel + /dev/cciss/c0d1 + precise : fail to show in /dev/disk/by-partuuid
We did not get to the bottom of this and the hardware is still available. It's cold but not dead ;-) Loïc Dachary
 
