Activity

From 12/23/2013 to 01/21/2014

01/21/2014

09:05 PM devops Feature #6986 (In Progress): Build packages containing RBD and CephFS kernel modules for installa...
I'm building against kernel-3.10.0-54.0.1.el7.x86_64, and I get the following error when using df5f7b6abd0dacf6fa60ac... Ken Dreyer
12:39 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
Neil Levine
03:44 PM devops Bug #7157: ceph-disk list fails in encrypted disk setup
I'm having this problem, with a fresh install of Ubuntu 12.04 and the latest (as of yesterday) Ceph release (ceph ver... Stuart Longland
01:41 PM Feature #5999 (In Progress): EC: [link] OSD internals must work in terms of cpg_t
Ian Colle
01:38 PM Feature #6437 (In Progress): EC: implement ECBackend recovery
Ian Colle
01:37 PM Feature #7167 (Resolved): Add op_process_latency in perf counters
Samuel Just
01:31 PM Feature #5990 (Fix Under Review): EC: [link] Factor out the ReplicatedPG object replication and c...
Samuel Just
01:28 PM Feature #6832 (Fix Under Review): EC: Adapt pg log to include information necessary for rollback
Ian Colle
01:28 PM Feature #6831 (Fix Under Review): EC: Adapt ReplicatedPG read path to handle async reads
Ian Colle
01:27 PM Feature #5996 (Fix Under Review): EC: [link] PG::calc_acting and friends should always choose the...
Ian Colle
01:27 PM Feature #6438 (Fix Under Review): EC: implement EC client IO
Ian Colle
12:39 PM devops Feature #7047: rhel7: build process for rbd.ko, ceph.ko kernel modules
Neil Levine
12:39 PM devops Feature #7046: rhel7: gitbuilder for rbd.ko, ceph.ko kernel modules
Neil Levine
12:20 PM Feature #7196 (New): qa: test encoding semantics, not just being able to decode/encode different ...
I have a branch with at least slightly better detection by comparing the json dumps. Need to generate new object corp... Josh Durgin
12:16 PM Feature #7195 (Resolved): librados: complete C write object operation bindings
They're missing individual return values, exec, omap (covered by #7194), and possibly a few other features from the c... Josh Durgin
12:14 PM Feature #7194 (Resolved): librados: omap in the c api
It doesn't exist, but should. Add it to the read/write object operations bindings. Josh Durgin
12:13 PM Feature #7193 (Resolved): librados: object operation bindings for reads
Writes were added recently. It would be useful to have these available for binding to other languages. Josh Durgin
11:21 AM devops Documentation #2974 (Resolved): doc: update chef docs for mon key distribution
John Wilkins
09:45 AM Feature #7192 (Resolved): An easier-to-process health report

Currently, the "ceph health" is great for human consumption, but a bit awkward to feed into a monitoring app, becau...
John Spray

01/20/2014

10:20 PM Feature #7191 (Closed): Replace Mutex to RWLock with fdcache_lock in FileStore
done by https://github.com/somnathr/ceph/commits/wip-queueing-sd1. Haomai Wang
10:09 PM Feature #7191 (Closed): Replace Mutex to RWLock with fdcache_lock in FileStore
Currently FileStore can't make full use of an SSD because of poor parallelism; 4 OSDs on the same SSD are recommended to achiev... Haomai Wang
09:45 PM Bug #7184 (Fix Under Review): osd: scrub complains about num_objects_dirty for old pools
Sage Weil
08:09 PM Feature #7167: Add op_process_latency in perf counters
Squashed commits to a new pull request: https://github.com/ceph/ceph/pull/1120 Xiangyu Lv
04:53 AM Feature #7167: Add op_process_latency in perf counters
Seems I'm spamming the backlog of pull requests. Here is a new pull request including changes after your comments: h... Xiangyu Lv
12:11 PM Fix #7187 (Resolved): In JSON "ceph health" output, 'summary' list only contains one item
Sage Weil
03:42 AM Fix #7187: In JSON "ceph health" output, 'summary' list only contains one item
https://github.com/ceph/ceph/pull/1117 John Spray
03:07 AM Fix #7187 (Resolved): In JSON "ceph health" output, 'summary' list only contains one item

The human readable output gives you a list of items, but the JSON output for some reason just picks the first one o...
John Spray
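For a monitoring script, the JSON report discussed above could be consumed roughly as follows. This is a minimal sketch, not the actual Ceph output format: only the 'summary' list is named in the ticket, while the "overall_status" key, the per-item "severity" and "summary" keys, and the sample messages are assumptions made for illustration.

```python
import json


def summarize_health(raw_json):
    """Return (overall_status, [message, ...]) from a JSON health report.

    Assumed layout (hypothetical): a top-level "overall_status" string and
    a "summary" list whose items each carry a "summary" message.
    """
    report = json.loads(raw_json)
    items = report.get("summary", [])
    # Collect every item, not just the first one.
    messages = [item.get("summary", "") for item in items]
    return report.get("overall_status", "UNKNOWN"), messages


# Made-up sample input; the messages are placeholders, not real Ceph output.
sample = json.dumps({
    "overall_status": "HEALTH_WARN",
    "summary": [
        {"severity": "HEALTH_WARN", "summary": "first example message"},
        {"severity": "HEALTH_WARN", "summary": "second example message"},
    ],
})

status, messages = summarize_health(sample)
```

Iterating over the whole list matters here because the point of the ticket is that the list may hold more than one item.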
11:07 AM Bug #7190 (Resolved): osd/OSD.cc: 2338: FAILED assert(oldmap->have_pg_pool(pgid.pool()))
... Sage Weil
08:51 AM Bug #7122 (Resolved): push 0/hit_set_... v 0'0 failed because local copy is 818'7131
Sage Weil
08:51 AM Bug #7117 (Resolved): centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_comm...
Sage Weil
08:24 AM Bug #7188 (Won't Fix): Admin socket files are lost on log rotation calling initctl reload (ubuntu...
h3. Workaround
Using *restart* instead of *reload* restarts the daemons instead of sending them a signal that grac...
John Spray
07:15 AM rgw Bug #6951: rgw: bucket re-create fails when cache is disabled
I have the same problem (kind of) as described in the mailing list.
But it's not stopped by the cache, AFAICT, the c...
Sylvain Munaut

01/19/2014

07:50 PM Bug #7180 (Resolved): 500 error when get mds help via ceph-rest-api
Landed to master 63a77f465654261e13cf7fbd5ffc6ed0c9f2eca0 Ian Colle
08:22 AM Feature #7104: rest-api: support commands requiring 'w' cap without 'rw' cap
Just noticed there is a rest-api category (I was searching for ceph-rest-api..). Could someone flag this properly ? I... David Moreau Simard
08:21 AM Bug #7186: ceph-rest-api: /tell/<osdid-or-pgid>/query doesn't work
Just noticed there is a rest-api category (I was searching for ceph-rest-api..). Could someone flag this properly ? I... David Moreau Simard
08:18 AM Bug #7186 (Won't Fix): ceph-rest-api: /tell/<osdid-or-pgid>/query doesn't work
tell/<osdid-or-pgid>/query is shown twice in show_human_help (probably part of the problem?)
For instance, /tell/1...
David Moreau Simard
08:21 AM Bug #7185: ceph-rest-api inconsistent routing with show_human_help
Just noticed there is a rest-api category (I was searching for ceph-rest-api..). Could someone flag this properly ? I... David Moreau Simard
07:58 AM Bug #7185 (Won't Fix): ceph-rest-api inconsistent routing with show_human_help
This works:
http://apiserver:5000/api/v0.1/osd <-- Returns help for /osd calls
http://apiserver:5000/api/v0.1/osd/...
David Moreau Simard
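When comparing which routes respond, it can help to build the request URLs in one place. The sketch below assumes the base URL shown in the report (http://apiserver:5000/api/v0.1/); the helper name and the OSD id "0" are hypothetical, and no request is actually sent.

```python
from urllib.parse import urljoin

# Base URL taken from the report above; "apiserver" is the reporter's host name.
BASE = "http://apiserver:5000/api/v0.1/"


def endpoint(path):
    """Join a route such as 'osd' or '/tell/0/query' onto the API base URL."""
    # Strip a leading slash so urljoin keeps the /api/v0.1/ prefix.
    return urljoin(BASE, path.lstrip("/"))


osd_url = endpoint("osd")
query_url = endpoint("/tell/0/query")  # "0" is a placeholder OSD id
```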

01/18/2014

09:51 PM rgw Bug #7183 (Pending Backport): RGW CORS POST fix for subusers and cleanups
Yehuda Sadeh
09:31 PM rgw Bug #7183: RGW CORS POST fix for subusers and cleanups
Just to be clear, the main issue that is fixed is POST request might not authenticate when user has multiple access k... Yehuda Sadeh
09:08 PM rgw Bug #7183 (Resolved): RGW CORS POST fix for subusers and cleanups
Patches already emailed to ceph-devel list at vger.
Patch #3 is the important one: looking up the correct secret k...
Robin Johnson
09:22 PM Bug #7184 (Resolved): osd: scrub complains about num_objects_dirty for old pools
Sage Weil
09:21 PM Bug #7177 (Resolved): osd crashed at OSD::advance_pg on master branch
Sage Weil
12:52 PM Bug #7180: 500 error when get mds help via ceph-rest-api
Need review: https://github.com/ceph/ceph/pull/1112 Kai Zhang
12:13 PM Bug #7180 (Resolved): 500 error when get mds help via ceph-rest-api
I was playing with ceph-rest-api, when I try to access http://localhost:5000/api/v0.1/mds, it's supposed to return th... Kai Zhang
12:51 PM devops Documentation #7182 (Resolved): documentation miscategorizes debian as upstart based. debian use...
The Ceph documentation miscategorizes Debian as an upstart-based distro. Debian uses sysvinit. An example of suc... Christ Schlacta
12:47 PM devops Bug #7181 (Rejected): debian 7 wheezy init.d script will not start OSDs not corresponding to a mo...
I'm in the midst of testing a ceph deployment, and am not using individual disks for ceph. They're on my main OS dri... Christ Schlacta
12:08 AM CephFS Documentation #3672: doc: how to mount ceph-fuse from fstab
On "Debian":http://debian.org only the following (sample) works with *ceph-fuse*-0.72.2-1:
/etc/fstab:
moun...
Dmitry Smirnov

01/17/2014

11:57 PM Bug #7178 (Resolved): sample.ceph.conf needs update
At the moment "sample.ceph.conf" is incomplete (some essential options are missing);
it uses old deprecated options ...
Dmitry Smirnov
05:31 PM Bug #7177: osd crashed at OSD::advance_pg on master branch
Unfortunately I can't make much sense of the core dump.
Fortunately just by looking at the code involved I'm pretty ...
Greg Farnum
04:50 PM Bug #7177 (Resolved): osd crashed at OSD::advance_pg on master branch
logs: ubuntu@mira057.front.sepia.ceph.com:/home/ubuntu/bug_osd_crash
hit this issue when trying to run the rest/te...
Tamilarasi muthamizhan
09:31 AM Feature #7173 (Resolved): Pre-populate exceptions
Pre-populate exceptions on the monitor after a large map change to avoid message storms in large clusters. Kyle Bader
08:49 AM Subtask #7146: implement osd crush rule create-erasure
Hi Sage
What if **osd crush rule create-pyramid** was something like:
* get the properties of a designated eras...
Loïc Dachary
02:58 AM Subtask #7146: implement osd crush rule create-erasure
... Loïc Dachary
08:37 AM rgw Feature #6747: PowerDNS backend for RGW bucket directing
You can fetch the region map with GET /admin/config Josh Durgin
01:08 AM rgw Feature #6747: PowerDNS backend for RGW bucket directing
This is now in development in the wip-rgw-powerdns branch.
Currently lacking support from the RGW Admin API to fet...
Wido den Hollander
08:23 AM Feature #6835 (Resolved): EC: ec pgs will need to be able to specify temp primaries other than ac...
Merged in commit:3b696f28d6cee1fd3a2b78911cc8377445158ba1 Greg Farnum
08:22 AM Linux kernel client Feature #6991: support CEPH_FEATURE_OSDMAP_ENC
We did not coalesce the feature flags. Greg Farnum
05:30 AM Feature #7167: Add op_process_latency in perf counters
I have sent another clean pull request: https://github.com/ceph/ceph/pull/1108
Please help to check and merge it.
Xiangyu Lv
05:26 AM Linux kernel client Bug #7139 (Fix Under Review): dead tcp connections during connection negotiation may lead to hangs
wip-7139; https://github.com/ceph/ceph-client/commits/wip-7139 Ilya Dryomov
05:24 AM Linux kernel client Bug #7139: dead tcp connections during connection negotiation may lead to hangs
Reproduced it a couple days ago, kernel client had a half-open socket:... Ilya Dryomov
05:09 AM Bug #5804: mon: binds to 0.0.0.0:6800something port
I saw this happening multiple times during a training class.
During the CEPH 100 training we use the machine "dais...
Wido den Hollander
04:48 AM Bug #7172: osd: OSD failed to start with osd_leveldb_cache_size > 0
I have sent a pull request to fix this bug: https://github.com/ceph/ceph/pull/1107
Please help to review it.
Xiangyu Lv
04:40 AM Bug #7172: osd: OSD failed to start with osd_leveldb_cache_size > 0
As shown in the following stack trace, the OSD process crashed while trying to
umount filestore after upgrading file...
Xiangyu Lv
04:38 AM Bug #7172 (Resolved): osd: OSD failed to start with osd_leveldb_cache_size > 0
After applying osd_leveldb_cache_size = 52428800 on an OSD host, 2 ~ 3 OSDs on
the host failed to start due to the f...
Xiangyu Lv
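As a quick sanity check on the setting quoted above, the value 52428800 works out to 50 MiB, assuming the option is expressed in bytes (the report does not state the unit). The helper name below is hypothetical.

```python
MIB = 1024 * 1024  # bytes per mebibyte


def mib_to_bytes(mib):
    """Convert a size in MiB to bytes."""
    return mib * MIB


# Value used in the report: osd_leveldb_cache_size = 52428800
cache_size_bytes = mib_to_bytes(50)
```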
01:02 AM rgw Feature #6748 (Resolved): Return bucket name in response header
Merged into master as of yesterday, resolved. Wido den Hollander

01/16/2014

08:36 PM rgw Bug #7073 (Resolved): "rgw gc max objs" should have a prime number as default value
commit:5ed7865ae8880f47c4715d7db4f069f60544128f Josh Durgin
05:23 PM Bug #7160 (Duplicate): rados tests failed on next branch with scrub mismatch failures
Tamilarasi muthamizhan
04:57 PM Documentation #6682 (Resolved): Adjustments to the Ceph Quick installation guide
John Wilkins
04:46 PM devops Feature #7171 (Resolved): rbdmap should be part of ceph-common
Currently, the rbdmap sysvinit/upstart script and /etc/ceph/rbdmap are included only in the ceph package.
Machines...
Alexandre Marangone
04:20 PM Bug #7170 (Can't reproduce): rados api test LibRadosMisc.CopyPP failed on next branch
logs are copied to mira057.front.sepia.ceph.com:/home/ubuntu/testing_jan16_2... Tamilarasi muthamizhan
04:17 PM rgw Bug #7169 (Resolved): rgw: list multipart parts broken (> 1000 parts)
Yehuda Sadeh
04:11 PM Documentation #6959 (Resolved): Diagrams showing even numbers of Monitors
John Wilkins
12:35 PM Documentation #6959 (In Progress): Diagrams showing even numbers of Monitors
John Wilkins
02:26 PM rgw Bug #7168 (Fix Under Review): 404 Errors When save immediately follows a delete
Yehuda Sadeh
08:45 AM rgw Bug #7168: 404 Errors When save immediately follows a delete
Yehuda Sadeh
05:14 AM rgw Bug #7168 (Resolved): 404 Errors When save immediately follows a delete
The delete operation clashed with the save, and the save returned a 404. Is there an option to make save successf... Xiangyu Lv
12:41 PM Bug #7071 (Resolved): osd/ReplicatedPG.cc: 5953: FAILED assert(attrs->count("_"))
This was probably fixed by 0c81849c4f1661ce89b8623edaf2873bb6611f62. Samuel Just
11:36 AM Bug #6692 (Resolved): Documentation: Command line incorrect.
http://ceph.com/docs/master/rados/operations/authentication/ John Wilkins
10:45 AM Bug #6692 (In Progress): Documentation: Command line incorrect.
John Wilkins
10:47 AM Subtask #7146 (In Progress): implement osd crush rule create-erasure
... Loïc Dachary
10:39 AM devops Documentation #5265 (Closed): node-name is confusing. hostname is probably more accurate
looks good Yan-Fa Li
10:31 AM devops Documentation #5265 (Resolved): node-name is confusing. hostname is probably more accurate
http://ceph.com/docs/master/start/quick-ceph-deploy/ John Wilkins
10:35 AM Documentation #3674 (Resolved): Deployment documentation is confusing
Documentation on deployment has been completely rewritten, and mkcephfs is no longer part of it. John Wilkins
09:04 AM Feature #7154: Allow reweight of crush subtree via command line tools
Greg Farnum wrote:
> In what scenarios would this actually be a good idea? You trying to do incremental bring-up or ...
Kyle Bader
05:23 AM Bug #7166 (Resolved): mon: Error ENOENT: unrecognized pool 'data' on a new cluster
Loïc Dachary
04:53 AM Bug #7166 (Fix Under Review): mon: Error ENOENT: unrecognized pool 'data' on a new cluster
wip-7166, pr https://github.com/ceph/ceph/pull/1095
Waiting for the gitbuilders to pass the make checks.
Joao Eduardo Luis
02:16 AM Bug #7166 (In Progress): mon: Error ENOENT: unrecognized pool 'data' on a new cluster
This breaks current master (3b696f28d6cee1fd3a2b78911cc8377445158ba1). Might have something to do with the changes in... Joao Eduardo Luis
02:02 AM Bug #7166 (Resolved): mon: Error ENOENT: unrecognized pool 'data' on a new cluster
After compiling today's "master from sources":https://github.com/ceph/ceph/commit/3b696f28d6cee1fd3a2b78911cc837744515... Loïc Dachary
05:00 AM Feature #7167: Add op_process_latency in perf counters
The code change has been done in pull request: https://github.com/ceph/ceph/pull/1096
Please help to review it. Thanks.
Xiangyu Lv
04:59 AM Feature #7167 (Resolved): Add op_process_latency in perf counters
We found that there is a need to get latency of op threads because op threads get slow during accessing LevelDB or ge... Xiangyu Lv
03:49 AM Feature #6507: librados shouldn't block indefinitely when cluster doesn't respond
I am trying to write Perl bindings for librados, so it would be great to have that
managed inside librados.
Dietmar Maurer

01/15/2014

11:05 PM rbd Bug #7165 (Resolved): high ping latency in vm using rbd writeback cache in qemu 1.7 with async flush
See thread on qemu-devel: http://lists.nongnu.org/archive/html/qemu-devel/2014-01/msg01993.html
It happens only wh...
Josh Durgin
10:51 PM rbd Bug #6851 (Resolved): rbd snap unprotect test failing
commit:e91fb910653a672560867d4a81aa30f9d5dc0af8 Josh Durgin
10:50 PM rbd Bug #6693 (Resolved): "rbd ls" returns error if the pool empty
commit:ac547a5b7dc94282f079aef78e66348d99d9d5e9 Josh Durgin
07:06 PM Bug #7117 (Fix Under Review): centOS: init script silently dies due to missing '/usr/lib64/ceph/c...
I've verified that with this patch, the RPM packaging properly captures the @ceph_common.sh@ file in @/usr/lib64/ceph... Ken Dreyer
05:17 AM Bug #7117: centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_common.sh'
This is now affecting every test that runs on a system with that init script.
For example, all OSDs deployed in th...
Alfredo Deza
04:49 PM rgw Bug #7054 (Fix Under Review): rgw: replace mongoose with civetweb
Yehuda Sadeh
04:47 PM rgw Bug #7073 (Fix Under Review): "rgw gc max objs" should have a prime number as default value
Yehuda Sadeh
04:33 PM Bug #7161 (Can't reproduce): rados api test LibRadosMisc.Exec failed on next branch
logs: ubuntu@teuthology:/a/teuthology-2014-01-14_23:00:02-rados-next-distro-basic-plana/39381... Tamilarasi muthamizhan
04:29 PM Feature #6835 (Fix Under Review): EC: ec pgs will need to be able to specify temp primaries other...
https://github.com/ceph/ceph/pull/1086 Greg Farnum
04:26 PM Bug #7160 (Duplicate): rados tests failed on next branch with scrub mismatch failures
logs: ubuntu@teuthology:/a/teuthology-2014-01-14_23:00:02-rados-next-distro-basic-plana/39382... Tamilarasi muthamizhan
04:14 PM Bug #7159 (Resolved): ceph status --format=json num_in_osds and num_up_osds formatting not consis...
From support ticket #932
root@burnupi02:/home/ubuntu# ceph status --format=json-pretty
"osdmap": { "osdmap": ...
Tyler Brekke
04:00 PM rgw Feature #3454 (Resolved): Support temp URLs for Swift API
merged upstream, commit:c60ae09b389c808735415553e31c871cfa42f776. I opened a new task (#7156) for creating a test for... Yehuda Sadeh
03:07 PM Subtask #7158 (Resolved): EC: flesh out how the ceph tool should be used to manage ec pools and c...
Not sure how much of this has been handled already. Samuel Just
02:45 PM devops Bug #7157 (Duplicate): ceph-disk list fails in encrypted disk setup
In a setup (Dumpling 0.67.5) that is using dm-crypt "ceph-disk" list fails with:
# ceph-disk list
Traceback (most...
Andreas Kurz
02:37 PM rgw Feature #7156 (New): rgw: create tests for temp url functionality
Yehuda Sadeh
02:33 PM rgw Feature #6874 (Fix Under Review): rgw: user level quota: rest api
Yehuda Sadeh
02:33 PM rgw Feature #6873 (Fix Under Review): rgw: user level quota: radosgw-admin interface
Yehuda Sadeh
02:33 PM rgw Feature #6872 (Fix Under Review): rgw: user level quota: async update of bucket info in user objc...
Yehuda Sadeh
02:33 PM rgw Feature #6871 (Fix Under Review): rgw: user level quota: core functionality
Yehuda Sadeh
02:19 PM Feature #7154: Allow reweight of crush subtree via command line tools
In what scenarios would this actually be a good idea? You trying to do incremental bring-up or something? Greg Farnum
02:08 PM Feature #7154 (Resolved): Allow reweight of crush subtree via command line tools
Instead of having to individually adjust weights for all osds in a rack or downloading the crushmap, modifying and enti... Kyle Bader
11:27 AM Bug #7116: mon: pg_temp left behind if pool deleted while pg_temp exist

The first cut at fixing this problem is in wip-7116. Sage had comments on the pull request for this branch https:/...
David Zafman
10:12 AM Documentation #4117 (In Progress): Document various language bindings better
http://ceph.com/docs/wip-doc-librados-intro/rados/api/librados-intro/
I've covered the basics for C, C++, Java and...
John Wilkins
12:17 AM Feature #7150 (Duplicate): mon: hunt for multiple mons in parallel
I use a few helper scripts to automate things, and those scripts
call the ‘ceph’ command multiple times, like
------...
Dietmar Maurer

01/14/2014

06:52 PM Feature #5997 (In Progress): EC: [link] Refactor scrub to use PGBackend methods
David Zafman
06:15 PM Bug #7116 (Fix Under Review): mon: pg_temp left behind if pool deleted while pg_temp exist
David Zafman
05:03 PM Bug #7117: centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_common.sh'
I've pushed an experimental patch to https://github.com/ceph/ceph/tree/wip-7117-redhat
I need to test and see what...
Ken Dreyer
04:20 PM Feature #7149 (Resolved): EC: missing_loc should be maintained until the object is no longer degr...
Samuel Just
03:57 PM devops Bug #7144 (Resolved): Ceph build fails on fedora20 gitbuilder
Ok this gitbuilder is working correctly now. rescheduled a bunch of the failed jobs to run again. Sandon Van Ness
02:54 PM devops Bug #7144: Ceph build fails on fedora20 gitbuilder
So it looks like it was going OOM. I didn't expect this as 12G is usually enough RAM for our gitbuilders. I went ahea... Sandon Van Ness
10:17 AM devops Bug #7144 (Resolved): Ceph build fails on fedora20 gitbuilder
All builds are failing in the middle of the build when hitting mds:
http://gitbuilder-ceph-rpm-fedora20-amd64-basi...
Sandon Van Ness
03:00 PM Subtask #7146 (Resolved): implement osd crush rule create-erasure
Currently, what we need to do is make sure the EC plugin maps onto a
linear array of devices the same way that CRUS...
Loïc Dachary
02:56 PM Bug #4934: ceph-deploy: librbd1 missing as a dependency
Interesting thing here. I think that this has been biting us recently. I will open another ticket on this because it... Anonymous
11:23 AM Bug #7145: PG::update_snap_map assert
ubuntu@teuthology:/a/teuthology-2014-01-13_23:00:02-rados-master-testing-basic-plana/37582/remote Samuel Just
11:23 AM Bug #7145 (Can't reproduce): PG::update_snap_map assert
0> 2014-01-14 01:35:32.305544 7f7bb3db0700 -1 osd/PG.cc: In function 'void PG::update_snap_map(std::vector<pg_lo... Samuel Just
11:21 AM Bug #7141 (Resolved): osd: segv in DBObjectMap::clear_keys_header()
09ae4bc2aa2530ea0d20c7487a4e1c11cb7eb488 Samuel Just
10:15 AM Linux kernel client Bug #6809: 3.11 kernel panic: Workqueue: ceph-msgr con_work
I am seeing a similar issue with Kernel 3.12.1 on Ubuntu 13.10
Jan 14 03:09:27 gw2 kernel: [204578.112175] libceph:...
Eric Eastman
08:38 AM devops Bug #6726: Official packages do not appear to be available for Saucy
Since this has been brought to Mark and Peter's attention, is this going to be remedied any time soon? In the meantim... Tom Voss
02:00 AM rbd Bug #7125: Assertion failure in rbd_img_obj_callback()
Thanks Eric, I'll try to reproduce it here on a smaller scale this week. Ilya Dryomov
01:56 AM rbd Bug #5425 (Resolved): krbd: xfstest 89 hang, 'read_partial_message skipping long message'
As noted in the ceph-devel post referenced in the above comment, this
bug is sort of twofold. The first problem ...
Ilya Dryomov

01/13/2014

07:34 PM devops Bug #6746: ceph-release rpm not playing well with yum-plugin-priorities
Ken - hard to say...it's a fairly important repo so I'd say 2 or 3 on our 10 point scale, but I suspect this varies f... John Kinsella
06:11 PM rbd Bug #7142 (Resolved): list_lockers() never returns after cluster restart and health_ok (librbdpy)
The cluster was shut off and then all nodes were restarted (Mons came up first, then OSDs all at once). While the OS... JuanJose Galvez
05:31 PM Bug #7117: centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_common.sh'
I noticed the following warning today while building v0.75.
http://jenkins.ceph.com/job/ceph-build/Arch=x86_64,Dis...
Ken Dreyer
04:50 PM Feature #6452 (Resolved): cachepool: snapshots
Sage Weil
04:23 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
Adding these links for reference.
http://people.redhat.com/jcm/el6/dup/docs/old/pre-release/whitepaper.pdf
http...
Ken Dreyer
03:47 PM Subtask #5858 (Resolved): Backfill should be able to handle multiple backfill peers
c0d92b6744d793f85c1ce2f7c5c67b9e9c18a135 David Zafman
03:46 PM Feature #7039 (Resolved): Check other OSD CEPH_FEATURE_OSD_ERASURE_CODES to enable acting set cha...
f9a11ef2f267591a38feea543a7903936702c76a David Zafman
03:45 PM Feature #5994 (Resolved): EC: [link] Backfill should be able to handle multiple backfill peers
c0d92b6744d793f85c1ce2f7c5c67b9e9c18a135 David Zafman
11:05 AM Bug #7141: osd: segv in DBObjectMap::clear_keys_header()
It would help to be holding the lock when accessing that map. Testing fix now. Samuel Just
09:37 AM Bug #7141 (Resolved): osd: segv in DBObjectMap::clear_keys_header()
... Sage Weil
03:20 AM rbd Bug #5425 (Fix Under Review): krbd: xfstest 89 hang, 'read_partial_message skipping long message'
For the record, this turned out to be not related to xfstest 89 at all,
it was just a coincidence. See ceph-devel (...
Ilya Dryomov
01:20 AM Linux kernel client Bug #7139 (In Progress): dead tcp connections during connection negotiation may lead to hangs
Nothing too suspicious in the dmesg:... Ilya Dryomov

01/12/2014

10:39 PM rbd Bug #7125: Assertion failure in rbd_img_obj_callback()
Hi Ilya,
So far I have not reproduced the problem.
Ceph cluster info:
ceph --version
ceph version 0.72.2 (a...
Eric Eastman

01/11/2014

01:57 AM Feature #7114: Hinted recovery
David Zafman is working on "multiple backfills":https://github.com/ceph/ceph/pull/931 to address the case where the p... Loïc Dachary

01/10/2014

02:29 PM Bug #7065 (Resolved): omap_clear() also removes all of object's xattrs
0c81849c4f1661ce89b8623edaf2873bb6611f62 Samuel Just
02:29 PM Bug #7135 (Resolved): OMAP_CLEAR removes xattr entries as well
0c81849c4f1661ce89b8623edaf2873bb6611f62 Samuel Just
01:26 PM Bug #7135: OMAP_CLEAR removes xattr entries as well
wip-7135 Samuel Just
10:39 AM Bug #7135: OMAP_CLEAR removes xattr entries as well
Dup #7065? Greg Farnum
10:34 AM Bug #7135 (Resolved): OMAP_CLEAR removes xattr entries as well
The issue is in FileStore::_omap_clear, which calls ObjectMap::clear() rather than some form of ObjectMap::clear_omap... Samuel Just
02:15 PM Documentation #7007: Missing page for "OS Recommendations"
Trying to update... Yuri Weinstein
11:35 AM Feature #6452 (Fix Under Review): cachepool: snapshots
Sage Weil
11:35 AM Feature #7057 (Resolved): crush location
Sage Weil
11:21 AM Bug #7121 (Resolved): nightlies: failed to become clean before timeout expired due to degraded pg
Sage Weil
10:59 AM Bug #7121: nightlies: failed to become clean before timeout expired due to degraded pg
Yeah; our tests run a lot closer to the edges than users really do or should, so I'd just adjust the thrasher's min_in. Greg Farnum
10:55 AM Bug #7121: nightlies: failed to become clean before timeout expired due to degraded pg
Sage Weil
10:55 AM Bug #7121: nightlies: failed to become clean before timeout expired due to degraded pg
the behavior is the same as on dumpling, so this isn't a recent regression.
old tunables work for this pg, but ten...
Sage Weil
10:32 AM Bug #7121: nightlies: failed to become clean before timeout expired due to degraded pg
Sam suggests that maybe the retry behavior got changed erroneously with the recent refactor, and that that's what cau... Greg Farnum
10:39 AM rgw Bug #7110 (Resolved): rgw: cannot set acls on pre-dumpling buckets
Landed to dumpling - ee9fc590a556354b54a174a4790c4142949d7409
Landed to emperor - 72168f717b2632f2942fcabf69798ff59a...
Ian Colle
09:56 AM rgw Bug #7110: rgw: cannot set acls on pre-dumpling buckets
Landed to next 744895326c718efca720bf71e164d65e4eb29d32 Ian Colle
09:54 AM rgw Bug #7110 (Pending Backport): rgw: cannot set acls on pre-dumpling buckets
Yehuda Sadeh
09:54 AM rgw Bug #7110: rgw: cannot set acls on pre-dumpling buckets
still need to backport it to dumpling and emperor Yehuda Sadeh
09:51 AM rgw Bug #7110 (Resolved): rgw: cannot set acls on pre-dumpling buckets
Sage Weil
10:36 AM rbd Bug #7125 (Need More Info): Assertion failure in rbd_img_obj_callback()
Hi Eric,
Is it reproducible?
What kind of stress testing were you doing? Can you share a script or
at least d...
Ilya Dryomov
10:26 AM Feature #6173 (Fix Under Review): Add LevelDB support to ceph cluster backend store
https://github.com/ceph/ceph/pull/1049.patch Ian Colle
09:57 AM rgw Bug #7084 (Can't reproduce): segv in RGWDataChangesLog::RGWDataChangesLog
reopen if this ever comes up again... but looks like a bad build or something :/ Sage Weil
09:44 AM Bug #6097 (Resolved): btrfs locking regression on async snap ioctl
Sage Weil
09:44 AM rgw Bug #7083 (Resolved): rgw: public-read acl test failing on master
Landed to master 6fac2acc5e6f77651ffcd7dc7aa833713517d8a6 Ian Colle
09:42 AM Bug #7103 (Resolved): config: variable substitution happen only once
Sage Weil
09:42 AM Bug #7117: centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_common.sh'
Something with the make file? Ian Colle
08:36 AM Bug #7117: centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_common.sh'
AFAICT, the reason is that the init-script is not even starting any daemon due to failing on... Joao Eduardo Luis
09:40 AM Bug #7126 (Resolved): mon: returns wrong version from mon_commands
Sage Weil
09:38 AM Bug #7126: mon: returns wrong version from mon_commands
Actually, let's not backport it. I don't think it affects anything except the new tiering commands. Sage Weil

01/09/2014

04:05 PM Bug #7122: push 0/hit_set_... v 0'0 failed because local copy is 818'7131
Sage Weil
12:47 AM Bug #7122: push 0/hit_set_... v 0'0 failed because local copy is 818'7131
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2014-01-08_22:39:23-rados:thrash-wip-cache-snap-testing-basic-pla... Sage Weil
12:42 AM Bug #7122 (Resolved): push 0/hit_set_... v 0'0 failed because local copy is 818'7131
... Sage Weil
03:39 PM Bug #7126 (Pending Backport): mon: returns wrong version from mon_commands
Sage Weil
02:30 PM Bug #7126 (Resolved): mon: returns wrong version from mon_commands
Manifested as a broken LibRadosTier.Promote test, but I suspect this has been breaking things for ages.
ubuntu@teu...
Sage Weil
02:03 PM rbd Bug #6548 (Resolved): tgt: Kernel panic putting zpool on iSCSI LUN using bs_rbd
Josh Durgin
12:36 PM rbd Bug #6548: tgt: Kernel panic putting zpool on iSCSI LUN using bs_rbd
I have not seen a repeat of this issue with the new code base. I would recommend that we close this ticket. Eric Eastman
01:14 PM rbd Bug #7125 (Resolved): Assertion failure in rbd_img_obj_callback()
My system hung while stress testing an RBD backed XFS file system. After power cycling the system the error message ... Eric Eastman
10:59 AM Bug #7121: nightlies: failed to become clean before timeout expired due to degraded pg
crush map rule
# rules
rule replicated_ruleset {
ruleset 0
type replicated
min_size 1
...
David Zafman
10:45 AM Documentation #6234 (Resolved): all our new-user paths need to document name-resolution restrictions
http://ceph.com/docs/master/start/quick-start-preflight/ John Wilkins
12:44 AM rgw Bug #6462: Cannot update object HTTP headers using swift API
I noticed that Content-Encoding has the same problem. Aaron Schulz
12:37 AM rgw Bug #6462: Cannot update object HTTP headers using swift API
Using Emperor, I'm still having a hard time getting POST (as well as copy via PUT) to update or remove Content-Dispos... Aaron Schulz

01/08/2014

09:47 PM Bug #7121 (Resolved): nightlies: failed to become clean before timeout expired due to degraded pg

This issue has been seen before on the master branch as well as in testing of wip-5858-rebase.
teuthology:/a/te...
David Zafman
09:45 PM Bug #7120 (Duplicate): osd: EEXIST on mkcoll on dumpling
... Sage Weil
07:41 PM Bug #7093 (Resolved): osd: peering can send messages prior to auth
Sage Weil
08:02 AM Bug #7093: osd: peering can send messages prior to auth
Patches look good to me. Samuel Just
05:57 PM CephFS Bug #6914 (Resolved): Accessing specific file on CephFS causes client hang
fixed by f292992b82d042a9f738e4698802e3b205b82c83 Zheng Yan
02:54 PM rgw Bug #6830: S3 CompleteMultipartUploadResult has empty ETag element
Still happens on RHEL 6.5 with ceph-0.72.2-0.el6.x86_64. Benjamin Gilbert
09:40 AM Bug #7117: centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_common.sh'
I replicated this problem locally using the most current `master` branch. The issue is not really `gatherkeys` but th... Alfredo Deza
05:54 AM Bug #7117 (Resolved): centOS: init script silently dies due to missing '/usr/lib64/ceph/ceph_comm...

One of the failing tests: http://qa-proxy.ceph.com/teuthology/teuthology-2014-01-05_01:10:06-ceph-deploy-master-t...
Alfredo Deza
09:36 AM rgw Bug #7110: rgw: cannot set acls on pre-dumpling buckets
Ian Colle
09:26 AM Feature #7114: Hinted recovery
Instead of "a defined bucket in the CRUSH hierarchy", it probably makes more sense to say uses the nearest common anc... Kyle Bader

01/07/2014

05:28 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
Try the for-stable-3.10.24 branch of https://github.com/ceph/ceph-client.git Josh Durgin
05:11 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
Where can I find the code to build against kernel-devel-3.10.0-54.0.1.el7? Ken Dreyer
05:10 PM Bug #7116 (Resolved): mon: pg_temp left behind if pool deleted while pg_temp exist

I bumped the pg_num/pgp_num for pool 3 and then deleted pool 3 quickly afterwards:
./ceph osd dump
....
pg_tem...
David Zafman
03:44 PM Subtask #6895 (Fix Under Review): omap needs to be disablable on a per-pool basis, this needs to ...
"work in progress":https://github.com/ceph/ceph/pull/1053 Loïc Dachary
10:18 AM Subtask #6895: omap needs to be disablable on a per-pool basis, this needs to be required to cach...
* add a can_omap() predicate in "pg_pool_t":https://github.com/ceph/ceph/blob/5300ed942809c963ccbe4fc1f90e7767f7b74b5... Loïc Dachary
07:07 AM Subtask #6895 (In Progress): omap needs to be disablable on a per-pool basis, this needs to be re...
Loïc Dachary
03:04 PM Feature #7114 (New): Hinted recovery
In the case where you are constructing multi-site RADOS object stores, where inter-site connectivity is a premium, it... Kyle Bader
01:29 PM rgw Bug #7110: rgw: cannot set acls on pre-dumpling buckets
The fix in wip-7110-dumpling looks good to me. Josh Durgin
10:26 AM Subtask #6888: EC/Tiering: Disallow omap writes on pools which are backed by EC pools
It should be able to; this will be stuff in the caching and tiering code to look at the properties on the backing poo... Greg Farnum
06:54 AM Subtask #6888 (Need More Info): EC/Tiering: Disallow omap writes on pools which are backed by EC ...
Is it possible to implement this before implementing ErasureCodePG? Loïc Dachary
07:04 AM Subtask #7048 (Resolved): mon: erasure crush rule vs pool process
Loïc Dachary
07:01 AM Subtask #6027 (Resolved): ensure that erasure coded pools don't work until the osds can handle it
Loïc Dachary
06:59 AM Subtask #6119 (Won't Fix): replace PG::object_contexts with SharedPtrRegistry
This is obsoleted by the work done on PGBackend. Loïc Dachary
06:54 AM Documentation #6465 (In Progress): admin/build-doc should have some kind of build check for broke...
I pushed some changes to a wip-6465 branch but it is failing with what looks like unrelated problems.
This is a li...
Alfredo Deza
06:52 AM Subtask #6434 (Need More Info): review and test PGBackend
Loïc Dachary

01/06/2014

10:30 PM rgw Bug #7110: rgw: cannot set acls on pre-dumpling buckets
Buckets created on versions prior to Dumpling cannot have ACLs set, even after upgrading to Dumpling. Ian Colle
08:28 PM rgw Bug #7110 (Won't Fix): rgw: cannot set acls on pre-dumpling buckets
Ian Colle
06:28 PM rgw Bug #7110 (Resolved): rgw: cannot set acls on pre-dumpling buckets
Yehuda Sadeh
09:05 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
RPM kmods are a special kind of pain. Red Hat has a kABI, and in theory if you rely on that, you can re-use your kern... Ken Dreyer
08:07 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
Ken Dreyer wrote:
> Do we have some form of kernel module packages available already? Or will these be developed fro...
Sage Weil
08:00 PM devops Feature #6986: Build packages containing RBD and CephFS kernel modules for installation on RHEL7-...
Do we have some form of kernel module packages available already? Or will these be developed from scratch? Ken Dreyer
08:18 PM devops Bug #6746: ceph-release rpm not playing well with yum-plugin-priorities
Hi John, What priority value would work for you? Ken Dreyer
07:35 PM Feature #7108: build: error out on inappropriate use of dout()
alternatively, do not define dout or g_ceph_context at all for library code. Sage Weil
10:38 AM Feature #7108 (New): build: error out on inappropriate use of dout()
Right now, if you use dout() instead of ldout() in a library context, the compilation phase goes fine and then you ge... Greg Farnum
06:31 PM Feature #7039 (Fix Under Review): Check other OSD CEPH_FEATURE_OSD_ERASURE_CODES to enable acting...
Part of review of wip-5858-rebase David Zafman
06:28 PM Bug #7014 (Can't reproduce): rados: stuck degraded, possibly related to acting_backfill changes
This might have been fixed by the fix for #6905, which is to increase the timeout in suites/rados/thrash/thrashers/mapgap.... David Zafman
06:18 PM rgw Bug #7073: "rgw gc max objs" should have a prime number as default value
Hi Yehuda,
Thanks for the update.
Should I provide a pull request for this, or will you fix it on your side?
Th...
Guang Yang
01:13 PM rgw Bug #7073: "rgw gc max objs" should have a prime number as default value
Right. We can just do a mod of some other larger prime, and then mod 32 (or whatever is configured) so that we don't ... Yehuda Sadeh
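The two-step reduction described in the comment above for Bug #7073 can be illustrated with a minimal sketch. The prime 7877, the function names, and the synthetic hash values are assumptions chosen for illustration; none of them come from the rgw code.

```python
# Illustrative only: LARGE_PRIME and both helper names are hypothetical and
# are not taken from the rgw source.
LARGE_PRIME = 7877   # any prime well above the shard count (assumed value)
NUM_SHARDS = 32      # the shard count mentioned in the comment


def shard_plain(hash_value, num_shards=NUM_SHARDS):
    """Map a hash straight onto a shard index."""
    return hash_value % num_shards


def shard_via_prime(hash_value, num_shards=NUM_SHARDS, prime=LARGE_PRIME):
    """Reduce modulo a larger prime first, then modulo the shard count."""
    return (hash_value % prime) % num_shards


# Synthetic worst case: every hash is a multiple of the shard count.
hashes = [32 * i for i in range(100_000)]
plain_shards = {shard_plain(h) for h in hashes}
prime_shards = {shard_via_prime(h) for h in hashes}
print(len(plain_shards), len(prime_shards))
```

With these synthetic inputs the plain mapping lands on a single shard, while the two-step mapping reaches all 32.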
03:05 PM Feature #7104: rest-api: support commands requiring 'w' cap without 'rw' cap
There was previously no "w-only" permission supported, indeed. I'm not sure it makes sense even for allow new snaps.... Dan Mick
10:00 AM Feature #7104: rest-api: support commands requiring 'w' cap without 'rw' cap
Ah, I did the allow_new_snaps command. I'm not very familiar with the REST api, but I think it needs to be able to ha... Greg Farnum
02:18 PM Subtask #6888 (In Progress): EC/Tiering: Disallow omap writes on pools which are backed by EC pools
Loïc Dachary
12:59 PM rgw Bug #7083 (Fix Under Review): rgw: public-read acl test failing on master
Yehuda Sadeh
12:47 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
First tested the kernel rebuild (-j 8 for the make command) twice, this went fine. (Except for performance: 1 kernel ... Rolf Fokkens
11:24 AM CephFS Fix #7109 (Resolved): disallow removing the first data pool
Our lookup-by-ino code requires that we be able to look for all existing inodes in a well-known location — the defaul... Greg Farnum
10:35 AM Documentation #6234: all our new-user paths need to document name-resolution restrictions
This is also missing from the new pre-flight checklist: http://ceph.com/docs/master/start/quick-start-preflight/
T...
Josh Durgin
09:36 AM Bug #6824: Removal of an OSD that is not down should set non-successful status code
I missed this when it was going by, but it looks like that patch makes things like "osd down" and "osd out" no longer... Greg Farnum
09:26 AM CephFS Feature #3866: mds: Add lazily-updated backtraces to hard links
If we create a hard link that references the "real" location of the file (as a path), and then the "real" location mov... Greg Farnum
06:43 AM devops Bug #7107 (Resolved): bootstrap script should not install requirements
Opened pull request: https://github.com/ceph/ceph-deploy/pull/148
Merged to master with hash: 07f903c
Alfredo Deza
06:40 AM devops Bug #7107 (Resolved): bootstrap script should not install requirements
There is no need to install from requirements file anymore. All libraries defined there are only for
development onl...
Alfredo Deza

01/05/2014

03:25 PM Bug #7072: logrotate not sending HUP signal
Looking at the logrotate source code it would seem that logrotate depends on whatever is defined in postrotate to per... Matt Hook
01:06 PM Bug #7072: logrotate not sending HUP signal
I can confirm this on Ubuntu 13.04.
If you check with lsof, you can see that the radosgw + ceph-osd files are move...
Robin Johnson
12:52 PM Bug #7072: logrotate not sending HUP signal
Distributor ID: Ubuntu
Description: Ubuntu 12.04 LTS
Release: 12.04
Codename: precise
logrotate version: 3.7.8
...
Matt Hook
09:56 AM Bug #7103 (Fix Under Review): config: variable substitution happen only once
"proposed fix":https://github.com/ceph/ceph/pull/1046 Loïc Dachary

01/04/2014

10:46 PM Bug #7093 (Fix Under Review): osd: peering can send messages prior to auth
Sage Weil
09:03 AM Bug #7093: osd: peering can send messages prior to auth
several instances in teuthology-2014-01-02_19:40:02-upgrade:parallel-master-testing-basic-plana Sage Weil
07:18 AM Cleanup #7105: There are three different ways to retrieve an authentication key
Ahh.. forgot the ... David Moreau Simard
07:16 AM Cleanup #7105 (Closed): There are three different ways to retrieve an authentication key
Is it necessary to have three different commands that do the exact same thing ?
Reference:
https://github.com/cep...
David Moreau Simard
06:50 AM Feature #7104: rest-api: support commands requiring 'w' cap without 'rw' cap
Okay, I'm starting to understand how it works..
This is how the mapping is done between the API and the commands a...
David Moreau Simard
06:32 AM Feature #7104: rest-api: support commands requiring 'w' cap without 'rw' cap
I left out a line at the end of the error, the complete error is:... David Moreau Simard
06:13 AM Feature #7104 (New): rest-api: support commands requiring 'w' cap without 'rw' cap
Right now the commands must be one of "r", "rw", "rwx". We should support each of those as capabilities, not as named... David Moreau Simard
02:05 AM Bug #7103 (In Progress): config: variable substitution happen only once
Loïc Dachary
02:04 AM Bug #7103: config: variable substitution happen only once
The "test demonstrating the problem":https://github.com/dachary/ceph/commit/84374f3d60b3be37f74bbf65b4c5568f301e2832 ... Loïc Dachary
02:00 AM Bug #7103 (Resolved): config: variable substitution happen only once
In ceph.conf if... Loïc Dachary

01/03/2014

04:13 PM rgw Bug #7099: Strange Comportments with media files
MP3 Test : http://p1.13h.com/swift/v1/test/touslesmemes.mp3
Julien Calvet
03:54 PM rgw Bug #7099: Strange Comportments with media files
I'm using Ubuntu 12.04 LTS Julien Calvet
03:07 PM rgw Bug #7099 (Resolved): Strange Comportments with media files
Hello
I'm using ceph 0.72.2 and I have a problem getting media files (mp3 or mp4 video) with RadosGw and swift emu...
Julien Calvet
04:06 PM Bug #7052 (Resolved): saucy make check doesn't clean up
Sage Weil
03:53 PM Bug #7052 (Fix Under Review): saucy make check doesn't clean up
Looks like an Automake 1.13 feature with parallel tests. As discussed in the Google+ hangout today, I've added these t... Ken Dreyer
01:06 PM rgw Feature #6513 (Fix Under Review): rgw: dr: Service scripts for meta/data sync agents
Sage Weil
12:53 PM rgw Feature #6513: rgw: dr: Service scripts for meta/data sync agents
Work in progress is wip-6513 branch in the radosgw-agent tree. Anonymous
01:03 PM devops Fix #7095 (Resolved): fix f19 gitbuilder
Sage Weil
01:03 PM devops Feature #7094 (Resolved): f20 gitbuilder
Sage Weil
01:00 PM devops Feature #6020: radosgw-apache opinionated package
Build job for the package is radosgw-apache on jenkins. Anonymous
01:00 PM devops Feature #6020: radosgw-apache opinionated package
Work in progress is in the master branch in the radosgw-apache tree. Anonymous
11:45 AM Bug #7093 (Resolved): osd: peering can send messages prior to auth
we are still authenticating:... Sage Weil
10:56 AM rgw Bug #7084: segv in RGWDataChangesLog::RGWDataChangesLog
recent log: ubuntu@teuthology:/a/sage-2014-01-02_12:58:22-upgrade:parallel-next-testing-basic-plana/22332 Tamilarasi muthamizhan
07:38 AM Bug #7085 (Resolved): ceph-disk cli test fails if /var/lib/ceph is missing
The corresponding test file has been moved to "ceph-disk.sh":https://github.com/dachary/ceph/commit/de8522fa6c82768da1d61... Loïc Dachary

01/02/2014

04:52 PM devops Bug #6591 (Resolved): init script does not pass alternate configuration file to osd crush create-...
Sage Weil
02:56 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
The RPMs built fine now. I upgraded all nodes in the ceph cluster. Rolf Fokkens
07:29 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
error: Installed (but unpackaged) file(s) found:
/usr/lib64/erasure-code/libec_example.so
/usr/lib64/erasure-...
Rolf Fokkens
02:10 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
I'm currently building the RPMs using the mentioned ceph.spec which includes the patch. Rolf Fokkens
01:22 PM Bug #7085 (Resolved): ceph-disk cli test fails if /var/lib/ceph is missing
if the ceph package isn't installed locally, the ceph-disk cli test fails.
probably need to pass ceph-disk an al...
Sage Weil
01:09 PM rgw Bug #7084 (Can't reproduce): segv in RGWDataChangesLog::RGWDataChangesLog
... Sage Weil
12:40 PM rgw Bug #7083 (Resolved): rgw: public-read acl test failing on master
This test fails:... Josh Durgin
10:28 AM Bug #6810 (Can't reproduce): very high monitor memory usage after upgrade dumpling -> emperor
Sage Weil
10:24 AM Subtask #5858 (Fix Under Review): Backfill should be able to handle multiple backfill peers
David Zafman
09:35 AM Bug #7061 (Resolved): valgrind: mon and osd leaks in libnss on master
Sage Weil
09:33 AM Bug #7072 (Need More Info): logrotate not sending HUP signal
what distribution is this? Sage Weil
09:17 AM Bug #7060 (In Progress): mon: osdmap scrub crc mismatch
Joao Eduardo Luis

01/01/2014

05:05 PM Bug #6988 (Resolved): pg query doesn't show current 'problem' stats
Loïc Dachary
04:45 PM Bug #6988: pg query doesn't show current 'problem' stats
"work in progress":https://github.com/ceph/ceph/pull/950 Loïc Dachary

12/31/2013

07:06 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
previous patch doesn't work. please try the new one, thank you. Zheng Yan
05:52 PM rbd Bug #6693 (Fix Under Review): "rbd ls" returns error if the pool empty
https://github.com/ceph/ceph/pull/1028 Josh Durgin
05:42 PM rbd Bug #4045 (Fix Under Review): snap unprotect on a snapshot that is already unprotected throws ina...
https://github.com/ceph/ceph/pull/1027 Josh Durgin
05:42 PM rbd Bug #6851 (Fix Under Review): rbd snap unprotect test failing
https://github.com/ceph/ceph/pull/1027 Josh Durgin
05:40 PM rbd Bug #2689 (Resolved): qemu iozone test hangs
Retested with qemu 1.5 from the havana cloud archive for ubuntu, and ceph 0.67.4. It worked fine. I'm not sure exactl... Josh Durgin
12:34 PM rbd Bug #7067 (Resolved): rbd read-flags.sh test is failing
commit:2b0a435920daab8de27402e00b58c6b00ccb5663 Josh Durgin
11:10 AM rbd Bug #7076 (Resolved): rbd: check for watchers before removing an image
~/ceph/src$ ./rbd showmapped
id pool image snap device
0 rbd img2 - /dev/rbd0
~/ceph/src$ ./rbd rm img...
Ilya Dryomov
08:18 AM Bug #6992 (Pending Backport): OSD assert fails after it found it was marked as down by monitor du...
Sage Weil
07:30 AM devops Bug #6503 (Rejected): ceph-deploy: nosetests fail
Actually, I thought there was some mechanism in ceph-deploy that was using nose, but there isn't. Closing this as we ... Alfredo Deza
06:17 AM devops Bug #6503: ceph-deploy: nosetests fail
We no longer use nose to test anything in ceph-deploy. The correct way is to use tox:... Alfredo Deza

12/30/2013

11:28 PM rgw Bug #7073 (Resolved): "rgw gc max objs" should have a prime number as default value
Recently, while troubleshooting increasing latency on our ceph cluster, we observed that a couple of gc objects were hotspo... Guang Yang
09:27 PM rbd Bug #7067 (Fix Under Review): rbd read-flags.sh test is failing
Good thing the test caught this before we were very confused by logs in the future missing some flags. Bisected and e... Josh Durgin
09:20 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
I think I know what was going on. please try the attached patch, it should solve your issue. Thank you for your detai... Zheng Yan
09:19 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
When building the kernel, try increasing the number of jobs that run simultaneously. Zheng Yan
02:40 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
We plan to do a clean reinstall of the cluster next week (January 6). Of course we're more than happy to provide any ... Rolf Fokkens
01:50 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Zheng Yan wrote:
> are cpu usages of ceph-mds and ceph-osd high?
Hardly any CPU usage; Idle is > 95% on all cluster...
Rolf Fokkens
01:47 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Zheng Yan wrote:
> Rolf Fokkens wrote:
> > After the mds is in state "replay" for a while an impressive list of "fa...
Rolf Fokkens
05:09 PM rbd Bug #5426: librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::complete()
Fix confirmed with the packages from http://gitbuilder.ceph.com/ceph-deb-precise-x86_64-basic/ref/dumpling-5426/ on M... Dmitry Borodaenko
10:13 AM rbd Bug #5426 (Resolved): librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::comple...
Sage Weil
04:13 PM Bug #7072: logrotate not sending HUP signal
It happened on another monitor and this time I was able to capture the output of... Matt Hook
04:05 PM Bug #7072: logrotate not sending HUP signal
Can you identify which files are not being closed by looking at the open files in /proc/`pidof ceph-mon`/fd and seein... Sage Weil
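The check suggested above for Bug #7072 can be sketched in Python. This assumes a Linux /proc layout, where the symlink target of a descriptor whose file has been unlinked ends in " (deleted)"; the function name and the `proc_root` parameter are hypothetical, added only so the sketch can be pointed at a test directory.

```python
import os


def deleted_open_files(pid, proc_root="/proc"):
    """Return (fd, target) pairs for descriptors that point at unlinked files."""
    fd_dir = os.path.join(proc_root, str(pid), "fd")
    results = []
    for name in sorted(os.listdir(fd_dir), key=int):
        try:
            target = os.readlink(os.path.join(fd_dir, name))
        except OSError:
            continue  # descriptor closed between listdir() and readlink()
        # Linux appends " (deleted)" to the link target of an unlinked file.
        if target.endswith(" (deleted)"):
            results.append((int(name), target))
    return results
```

Calling `deleted_open_files(pid)` with the PID reported by `pidof ceph-mon` would list rotated log files that the daemon still holds open.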
03:52 PM Bug #7072 (Resolved): logrotate not sending HUP signal
I have a cluster where the Ceph Monitors are actually Virtual Machines.
They have relatively small 20GB root drives....
Matt Hook
02:55 PM rbd Bug #7066 (Resolved): Improper bench-write implementation in do_bench_write@rbd.cc:924
commit:fd57d99b6bbd2ec7d5e4c1f4f7cfcf4ba351050b Josh Durgin
01:07 PM Bug #6824 (Resolved): Removal of an OSD that is not down should set non-successful status code
Sage Weil
12:43 PM Bug #6824: Removal of an OSD that is not down should set non-successful status code
Request to backport fix to Dumpling. Ian Colle
12:42 PM Bug #6824 (Pending Backport): Removal of an OSD that is not down should set non-successful status...
Ian Colle
10:40 AM Bug #6725 (Resolved): objecter: kick_requests() resends ops that should be paused
commit:98ab7d64a191371fa39d840c5f8e91cbaaa1d7b7 and commit:5fe3dc647bf936df8e1eb2892b53f44f68f19821 are related, alon... Josh Durgin
10:32 AM Feature #6414 (New): filestore: make read() use a bufferlist method
Josh Durgin
10:32 AM Messengers Feature #6411 (New): msgr: move iov + sendmsg into bufferlist::sendmsg method
Josh Durgin
10:14 AM Linux kernel client Bug #6984 (Resolved): RBD volume not mountable after creating 8 or more snapshots
This has been fixed in 3.12-rc1, commit
03507db631c94a48e316c7f638ffb2991544d617.
Ilya Dryomov
08:26 AM Bug #7071 (Resolved): osd/ReplicatedPG.cc: 5953: FAILED assert(attrs->count("_"))
... Sage Weil

12/29/2013

11:18 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Rolf Fokkens wrote:
> FYI one other thing: the performance is really bad when building a kernel on cephfs. "ceph sta...
Zheng Yan
11:09 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Rolf Fokkens wrote:
> I managed to get a whole new list of "failed to reconnect caps for missing inodes" by doing...
Zheng Yan
08:26 PM Bug #6992: OSD assert fails after it found it was marked as down by monitor during high system lo...
https://github.com/ceph/ceph/pull/1016/
From the log the rebind was at 2013-12-10 10:51:51.909718
2013-12-10 10:5...
Xihui He
05:03 PM Bug #7065: omap_clear() also removes all of object's xatts
I think we should update test/osd/RadosModel (ceph_test_rados) so that it stores separate data in xattrs and omap (ri... Sage Weil
12:20 PM Bug #6003: journal Unable to read past sequence 406 ...
ubuntu@teuthology:/a/teuthology-2013-12-28_23:00:01-rados-next-distro-basic-plana/18074... Sage Weil
03:17 AM Linux kernel client Bug #7069 (Resolved): CephFS hang when using fscache - several "blocked for more than 120 seconds...
Wanting to accelerate CephFS access, I added fscache. This resulted in a CephFS hang to the point that processes acce... Rolf Fokkens

12/28/2013

10:16 AM Bug #7068 (Can't reproduce): os/FileStore.cc: 4035: FAILED assert(omap_attrs.size() == omap_aset....
... Sage Weil
08:55 AM rbd Bug #7067 (Resolved): rbd read-flags.sh test is failing
teuthology-2013-12-27_23:00:59-rbd-master-testing-basic-plana has 3 instances Sage Weil
01:27 AM rbd Bug #7066 (Resolved): Improper bench-write implementation in do_bench_write@rbd.cc:924
for (off = 0; off < io_bytes; off += io_size) {
b.wait_for(io_threads - 1);
uint64_t i = 0;
uint64_t...
Haomai Wang

12/27/2013

10:39 PM Bug #7065: omap_clear() also removes all of object's xatts
It's amazing that this was only noticed now.
I think it's a little related to feature(http://tracker.ceph.com/issues/7059...
Haomai Wang
10:16 PM Bug #7065 (Resolved): omap_clear() also removes all of object's xatts
Zheng Yan
01:19 PM Bug #7061: valgrind: mon and osd leaks in libnss on master
teuthology commit:9a17bb5134ae7072089f780cd975761e61c708db Sage Weil
06:48 AM Bug #7061: valgrind: mon and osd leaks in libnss on master
Is there a branch with the patch? Looked on github but nothing obvious caught my eye. Joao Eduardo Luis
12:42 PM rbd Feature #6264 (Resolved): rbd: expose all options available to rbd map
commit:9b7364d2450c7b51043bd39a0f586cf1c6041f4d Josh Durgin
10:20 AM Bug #7062 (Resolved): Dead lock happend if daemonize
Sage Weil
06:32 AM rgw Bug #7064 (Resolved): S3 request to non-existent object in non-existent bucket results in 30 seco...
A request to:
@curl -v http://192.168.2.16:8080/doesnotexist/missingkey@
Results in a 30 second timeout from Apac...
Michael Kidd
05:18 AM Feature #7063: Replace lfn_find call in omap* functions with more lighter implematation
It seems that "IndexManager::get_index" protects concurrent write operations on the same object. So if we remove it, mul... Haomai Wang

12/26/2013

09:34 PM Bug #7061 (Pending Backport): valgrind: mon and osd leaks in libnss on master
Sage Weil
09:34 PM Bug #7061: valgrind: mon and osd leaks in libnss on master
Sage Weil
06:35 AM Bug #7061: valgrind: mon and osd leaks in libnss on master
recent changes touching both the osd and the mon on master as of late, maybe the erasure coding stuff? Joao Eduardo Luis
09:29 PM rgw Bug #6940 (Pending Backport): rgw: reading bucket policy through bucket restful api fails
Sage Weil
07:38 PM Feature #7063: Replace lfn_find call in omap* functions with more lighter implematation
Maybe we can maintain a hashmap<object, state>, where "state" can be an int: 0 means unknown, 1 means existing. If unknown, i... Haomai Wang
06:54 AM Feature #7063 (Closed): Replace lfn_find call in omap* functions with more lighter implematation
find_index plays an important role in handling transactions with FileStore. Although it's not a heavy implementation, ... Haomai Wang
06:57 PM rbd Bug #5426 (Pending Backport): librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion...
I cherry-picked the fix to dumpling-5426 and emperor-5426 branches based on dumpling and emperor, respectively. The f... Josh Durgin
06:40 PM rbd Bug #5426 (Fix Under Review): librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion...
https://github.com/ceph/ceph/pull/1000 Josh Durgin
08:58 AM rbd Feature #6264 (Fix Under Review): rbd: expose all options available to rbd map
wip-rbd-map-options; https://github.com/ceph/ceph/pull/999 Ilya Dryomov

12/25/2013

06:47 PM Feature #7059: filestore: avoid leveldb check for xattr when possible
I don't understand the first approach clearly. A flag on the filestore at creation time? Is the flag associated with the FileStore?
Haomai Wang
04:59 AM Feature #7059: filestore: avoid leveldb check for xattr when possible
May I ask about the idea or method?
I also want to reduce redundant calls such as lfn_find() in omap_* functions ...
Haomai Wang
12:12 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
FYI one other thing: the performance is really bad when building a kernel on cephfs. "ceph status" shows in general <... Rolf Fokkens
12:04 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
I managed to get a whole new list of "failed to reconnect caps for missing inodes" by doing the following:
- re...
Rolf Fokkens

12/24/2013

11:30 PM CephFS Feature #3866: mds: Add lazily-updated backtraces to hard links
What does "lazily-updated" mean? Zheng Yan
11:24 PM CephFS Bug #3601 (Resolved): client: With multiple clients, file remove doesn't free up space
Zheng Yan
09:41 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Rolf Fokkens wrote:
> The Ceph cluster was built using Ceph 0.67. On Nov 13 it was upgraded to Ceph 0.72
looks li...
Zheng Yan
03:14 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
We'll try to access the repaired FS and see if it still hangs. Rolf Fokkens
03:13 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
I mounted the broken CephFS and did this (because it's easier than locating the specific inodes):
find -type d -ex...
Rolf Fokkens
02:45 AM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
The Ceph cluster was built using Ceph 0.67. On Nov 13 it was upgraded to Ceph 0.72 Rolf Fokkens
07:27 PM Bug #7062: Dead lock happend if daemonize
https://github.com/ceph/ceph/pull/994 Haomai Wang
07:12 PM Bug #7062 (Resolved): Dead lock happend if daemonize
New ceph_osd.cc code did ObjectStore init work before global_init_daemonize(), and WBThrottle thread is created when ... Haomai Wang
08:22 AM Bug #7061 (Resolved): valgrind: mon and osd leaks in libnss on master
not on next. unclear what changed... Sage Weil

12/23/2013

11:46 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Rolf Fokkens wrote:
> Actually waiting for this to be sorted out takes a while. So to be able to make use of cephfs ...
Zheng Yan
11:29 PM CephFS Bug #6914: Accessing specific file on CephFS causes client hang
Jasper Siero wrote:
> Some similarity with http://tracker.ceph.com/issues/6087? We use the Emperor release so that s...
Zheng Yan
10:12 PM Bug #7060 (Resolved): mon: osdmap scrub crc mismatch
... Sage Weil
10:03 PM Bug #7051 (Resolved): mon: features aren't passed along for forwarded requests
Sage Weil
01:06 AM Bug #7051: mon: features aren't passed along for forwarded requests
Thanks for explaining, crystal clear now :-) Loïc Dachary
03:54 PM Bug #7056 (Resolved): osd: copy-get doesn't copy omap header
Sage Weil
10:19 AM Bug #7056 (Resolved): osd: copy-get doesn't copy omap header
this is why my new small-objects tests are failing: they include setattr/rmattr. i'll add them into the other worklo... Sage Weil
11:17 AM Feature #7059 (Resolved): filestore: avoid leveldb check for xattr when possible
The current FileStore code calls into DBObjectMap for xattrs in many cases unconditionally. This means that even for... Sage Weil
11:16 AM Feature #6354 (Resolved): ObjectStore: allow xattrs as large as the underlying FS allows
Sage Weil
11:16 AM Feature #2505 (Resolved): Automatically set filestore xattr use omap
Sage Weil
11:15 AM Fix #3188: osd: close read hole
Sage Weil
11:15 AM Feature #7058 (Resolved): osd primary affinity
Sage Weil
11:14 AM Feature #7057 (Resolved): crush location
Sage Weil
09:21 AM Feature #7033 (In Progress): cachepool: agent: trivial flushing
Sage Weil
09:21 AM Feature #6643 (Resolved): cachepool: flush
Sage Weil
09:21 AM Feature #6644 (Resolved): cachepool: evict
Sage Weil
09:21 AM Feature #6188 (Resolved): cachepool: osd: promote on write and mark object dirty
Sage Weil
09:21 AM Feature #6189 (Resolved): cachepool: osd: promote on read
Sage Weil
09:17 AM rbd Bug #5426: librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::complete()
This problem is easily reproducible with Mirantis OpenStack 4.0:
https://bugs.launchpad.net/fuel/+bug/1260911
Aft...
Dmitry Borodaenko
 
