Activity

From 07/26/2013 to 08/24/2013

08/24/2013

11:34 PM Feature #5511: rados.py support for object locking
Is there any progress on this?
May I lend a hand, if needed?
Haomai Wang
10:05 PM Fix #6116 (Resolved): osd: incomplete pg from thrashing on next
... u'overall_status': u'HEALTH_WARN', u'summary': [{u'severity': u'HEALTH_WARN', u'summary': u'1 pgs incomplete'}]} ... Sage Weil
03:44 PM Bug #6115 (Resolved): doc: asphyxiate does not support class
"asphyxiate":https://github.com/ceph/asphyxiate is "unable to handle classes":http://comments.gmane.org/gmane.comp.fi... Loïc Dachary
01:36 PM Bug #6003: journal Unable to read past sequence 406 ...
teuthology-2013-08-23_01:00:10-rados-master-testing-basic-plana/1275
Sage Weil

08/23/2013

09:26 PM RADOS Feature #6114 (New): Complete python binding interfaces for librados
Currently the python binding for librados only supports basic operations like read and write. A lot of interfaces librados.h imple... Haomai Wang
06:40 PM CephFS Bug #2218 (Resolved): CephFS "mismatch between child accounted_rstats and my rstats!"
I have run MDS with "mds verify_scatter = 1" for months, didn't hit this. Zheng Yan
04:29 PM Feature #6001 (Fix Under Review): EC: [link] jerasure plugin
Loïc Dachary
04:27 PM Feature #6000 (Fix Under Review): EC: [link] erasure plugin mechanism and abstract API
Loïc Dachary
04:24 PM Subtask #6113 (Resolved): add ceph osd pool create [name] [key=value]
"work in progress":https://github.com/ceph/ceph/pull/578
* add *ceph osd pool create [name] [key=value]* where *ke...
Loïc Dachary
04:23 PM Bug #6112 (Resolved): rgw test failed in the nightly during upgrade from dumpling to next
logs: ubuntu@teuthology:/a/teuthology-2013-08-23_01:35:04-upgrade-parallel-next-testing-basic-vps/1798... Tamilarasi muthamizhan
04:12 PM Bug #6083: fedora18 rpm packages for ceph should be built with proper naming convention for ceph-dbg
we may need to check this for centos as well. I am not able to check this right now as v0.67.1 is missing on gitbuild... Tamilarasi muthamizhan
04:09 PM Subtask #5878 (Fix Under Review): erasure plugin mechanism and abstract API
Loïc Dachary
04:08 PM Subtask #5879 (Fix Under Review): jerasure plugin
Loïc Dachary
03:54 PM Documentation #5690: ceph "global options" should be documented somewhere
Here's some notes I scribbled for myself:
early_options:
--version: show version to stdout, exit
--conf/-c: se...
Dan Mick
03:35 PM Bug #6099 (Resolved): ceph-rest-api: default log file doesn't work because not daemon
Dan Mick
03:15 PM Bug #6099 (Fix Under Review): ceph-rest-api: default log file doesn't work because not daemon
Dan Mick
03:22 PM CephFS Feature #3426: ceph-fuse: build/run on os x
Giving this to Noah since he's actually done it already in a branch. Greg Farnum
03:21 PM Bug #2901 (Resolved): librados-config should not read ceph.conf
Sage Weil
03:18 PM Bug #3163 (Won't Fix): doc: explain meaning of pg dump output
Sage Weil
03:18 PM CephFS Bug #3544 (Won't Fix): ./configure checks CFLAGS for jni.h if --with-hadoop is specified but also...
Sage Weil
03:17 PM Bug #3030 (Won't Fix): config/option parser: Avoid needing to list command line options in a glob...
Sage Weil
03:17 PM Bug #3029 (Won't Fix): config/option parser: Avoid needing to list obscure one-use options in glo...
Sage Weil
03:16 PM Bug #2520 (Duplicate): iozone random read/write with 4k block size hangs
Sage Weil
03:14 PM Bug #3662 (Won't Fix): mkcephfs --mkfs is not inserting any default settings
Sage Weil
03:13 PM Bug #2690 (Won't Fix): mon: persist quorum features
we should use compatset features in cases where this is unsafe. Sage Weil
03:12 PM Bug #2207 (Resolved): osd: crash when op length is greater than op input data
Sage Weil
03:12 PM Bug #5291 (Can't reproduce): Bug with client naming for Cinder-Volume usage
Ian Colle
03:11 PM Bug #2618 (Can't reproduce): error: unable to open OSD superblock
Samuel Just
03:10 PM Bug #5891 (Won't Fix): rados bench displaying wrong unit
we want to use base-2, but MiB etc is not pretty Sage Weil
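The base-2 versus base-10 distinction behind this comment can be illustrated with a small sketch. The function name `format_bytes` and its suffix tables are hypothetical and are not taken from rados bench; the sketch only shows how the same byte count reads under each convention.

```python
def format_bytes(num_bytes, base=1024):
    """Format a byte count using base-2 (IEC) or base-10 (SI) suffixes.

    Hypothetical helper for illustration; not code from rados bench.
    """
    suffixes = {
        1024: ["B", "KiB", "MiB", "GiB", "TiB", "PiB"],
        1000: ["B", "kB", "MB", "GB", "TB", "PB"],
    }[base]
    value = float(num_bytes)
    for suffix in suffixes:
        # Stop at the first unit where the value fits, or at the largest unit.
        if value < base or suffix == suffixes[-1]:
            return f"{value:.1f} {suffix}"
        value /= base


print(format_bytes(4194304))             # 4.0 MiB
print(format_bytes(4194304, base=1000))  # 4.2 MB
```

The same 4194304 bytes is exactly 4 MiB but roughly 4.2 MB, which is why a label of "MB" on a base-2 figure can be reported as a wrong unit.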
03:09 PM Bug #3660 (Resolved): osd: marking objects lost invalidates pg stats
Samuel Just
03:09 PM Bug #2354 (Resolved): osd: make watch timeout configurable
Samuel Just
03:08 PM Bug #2902 (Resolved): common lib tries to open literal ~/.ceph/ceph.conf
Sage Weil
03:08 PM Bug #2507 (Resolved): auth: "ceph auth get-or-create-key" argument validation is lacking
Sage Weil
03:08 PM Bug #2205 (Won't Fix): mkcephfs throws "No such file or directory" errors when the pwd the script...
Sage Weil
03:08 PM Bug #1036 (Won't Fix): obsync: handle LFN for file://
Sage Weil
03:07 PM Bug #2551 (Rejected): leveldb broke "make distcheck"
Sage Weil
03:07 PM rgw Bug #3896 (Can't reproduce): rest-bench common/WorkQueue.cc: 54: FAILED assert(_threads.empty())
Sage Weil
03:07 PM Bug #5078 (Won't Fix): Debian missing sudo results in unclear error
Sage Weil
03:07 PM Bug #5471 (Resolved): mon: do not join a quorum if quorum's version is lower than ours
Sage Weil
03:06 PM CephFS Bug #3551 (Can't reproduce): mds: journaler hang
Sage Weil
02:57 PM Bug #2914 (Resolved): librados set_complete_callback, set_safe_callback clobber each other's argu...
Sage Weil
02:51 PM Bug #3526 (Resolved): Commands mentioned in documentation are incomplete ?
Ian Colle
02:50 PM Bug #3584 (Resolved): Ranlib fails from 64-bit client on a file in 32-bit based Ceph cluster.
i believe this has been fixed. Sage Weil
02:49 PM rgw Bug #6111 (Resolved): rgw: multipart upload fails when last chunk < 512k
Yehuda Sadeh
02:49 PM CephFS Bug #3598 (Resolved): MDS should shut down cleanly on EBLACKLIST
Sage Weil
02:48 PM Bug #3780 (Won't Fix): pg_num inappropriately low on new pools
Sage Weil
02:48 PM Bug #2828 (Resolved): osd: assign_bid was allowed to mutate and return data
Sage Weil
02:48 PM Bug #2653 (Resolved): Web docs point to obsolete "fusermount" page
Ian Colle
02:47 PM Bug #3300 (Resolved): ceph::buffer::end_of_buffer isn't caught
Sage Weil
02:46 PM Bug #3899 (Won't Fix): osd: failed to decode object_info_t
Sage Weil
02:46 PM Bug #3268 (Rejected): osd: localize reads handling is incorrect
Sage Weil
02:45 PM Bug #3894 (Closed): monclient: --keyring failed despite presence of file
Sage Weil
02:45 PM Bug #2890 (Resolved): monitor: "recognize" heap commands
Sage Weil
02:44 PM Bug #3972 (Resolved): new boost dependency: libboost-program-options
Sage Weil
02:42 PM Messengers Bug #1674 (Can't reproduce): daemons crash when sent random data
Sage Weil
02:41 PM Bug #3903 (Resolved): OSDMap::raw_pg_to_pps causes pools to have similar mappings
Sage Weil
02:37 PM Bug #4780 (Resolved): RBD-Enabling Discard Trim
Sage Weil
02:37 PM CephFS Bug #5021: ceph-fuse: crash on traceless reply
What's the status of wip-5021? Greg Farnum
02:36 PM Bug #3434: Unknown variables in test_xattr_support
I'm curious why this is set to Won't Fix - the bug still exists in master. Steve Hoeksema
02:32 PM Bug #3434 (Won't Fix): Unknown variables in test_xattr_support
Sage Weil
02:35 PM Bug #4344 (Can't reproduce): osd/ReplicatedPG.cc: 5378: FAILED assert(pi.recovery_info.soid.snap ...
Sage Weil
02:34 PM CephFS Bug #6087 (Resolved): mds: do not loop on old dirs missing backpointer xattrs
I put this into Dumpling. The issue didn't exist in Cuttlefish since Yan hadn't written the open-by-ino code at that ... Greg Farnum
02:33 PM Bug #4109 (Duplicate): incorrect degraded count
same as negative degraded Sage Weil
02:33 PM Fix #4205: librados: Improve Watch-notify semantics
sam, figure out what this means. Sage Weil
02:29 PM Bug #6110 (Resolved): v0.67.1 branch is missing on gitbuilder for debian precise
There is no v0.67.1 branch on http://gitbuilder.ceph.com/ceph-deb-precise-x86_64-basic/ref/
and even confusing is ...
Tamilarasi muthamizhan
02:28 PM Bug #2891 (Can't reproduce): heap profiler hangs when trying to start it up on the mon
Sage Weil
02:27 PM Bug #5251 (Can't reproduce): wrong node messages in mds log
Sage Weil
02:27 PM Bug #5449 (Can't reproduce): osd crash immediately after booting up
Samuel Just
02:26 PM Bug #5459 (Resolved): ceph-mon failure using wip-mon-pgmap on ARM
Sage Weil
02:26 PM Bug #5500 (Resolved): ceph CLI should validate, reject bad daemon commands
Sage Weil
02:24 PM Bug #5733 (Won't Fix): monitor: validate pg_temp entries from OSDs
remove_down_pg_temp() cleans up Sage Weil
02:22 PM Bug #5946 (Resolved): lockstatus.get_status called although check-locks: false
Has been merged already, closing. Loïc Dachary
02:17 PM Bug #5946: lockstatus.get_status called although check-locks: false
pull request? Samuel Just
02:17 PM Bug #5788 (Resolved): ceph: try new, fallback to old can race with daemon upgrade
Samuel Just
02:15 PM Bug #5972 (Won't Fix): Permissions on /var/run/ceph changed causing permission error messages
Sage Weil
02:14 PM Fix #5989: librados: document that bufferlist usage model is inconsistent
Sage Weil
02:13 PM Bug #6043 (Need More Info): upstart does not reflect running ceph-osd daemons (ubuntu 13.04 only)
is this still a problem? unless we can figure out the sequence to reproduce this i'm not sure what to do here. upst... Sage Weil
02:08 PM Bug #5932 (Won't Fix): osdmaptool --create-from-conf ignore "osd pool default pg[p] num"
pg split now works; let's rely on that instead.
also nothing really uses osdmaptool to create the initial osdmap...
Sage Weil
02:05 PM Bug #5395: arm: osd: big performance differential between read/write
is this still present? Sage Weil
02:04 PM Bug #5445 (Can't reproduce): random osd EPERM on journal
Sage Weil
02:04 PM Bug #5641 (Resolved): occasional crush_ops.sh failure
Sage Weil
02:03 PM Bug #5776 (Can't reproduce): ceph: passing -1 osd id
Sage Weil
02:02 PM Bug #5700 (Can't reproduce): very high memory usage after update
don't see anything strange from the core. i suspect this is just lots of pgs... Sage Weil
02:01 PM Bug #5823: cpu load on cluster node is very high, client can't get data on pg from primary node ...
what kernel are you running? Samuel Just
02:00 PM rbd Bug #5890 (Need More Info): can't remove rbd image from pool
does 'ceph health' say OK? Sage Weil
01:59 PM Bug #5925 (Can't reproduce): hung ceph_test_rados_delete_pools_parallel
Sage Weil
01:57 PM Bug #5981: osd: journal didn't preallocate
the problem is that ceph-disk creates the journal but does not allocate it Sage Weil
01:51 PM Documentation #6107 (Resolved): Broken link on upgrade doc
See http://ceph.com/docs/master/install/upgrading-ceph/ John Wilkins
12:39 PM Documentation #6107 (Resolved): Broken link on upgrade doc
There is a broken link in the doc at http://ceph.com/docs/next/install/upgrading-ceph/
The broken link is: http://...
Ross Turk
01:51 PM Bug #5985 (Resolved): very slow recovery for some objects
Sage Weil
01:51 PM Bug #5923 (Duplicate): osd: 6 up, 5 in; 91 active+clean, 1 remapped
Sage Weil
01:51 PM Bug #5901 (Duplicate): stuck incomplete immediately after clean
Sage Weil
01:50 PM Bug #5922 (In Progress): osd: unfound objects on next
Sage Weil
01:30 PM rbd Bug #5615 (Duplicate): lock ops are not re-sent when cluster gets marked un-full
this is the linger resend on unfull bug #6070 Sage Weil
01:27 PM rbd Bug #5812 (Need More Info): qemu-kvm guest hangs on disk write with rbd storage
Josh Durgin
01:13 PM rbd Bug #5812: qemu-kvm guest hangs on disk write with rbd storage
Since this mostly goes away with caching enabled, I'm guessing this is the same as #5919 - does it still occur with 0... Josh Durgin
01:26 PM rbd Bug #3619 (Resolved): librbd: read_iterate sparse behavior broken
Sage Weil
01:25 PM rbd Bug #5184 (Resolved): libceph: create_singlethread_workqueue() error handling
dan carpenter fixed this Sage Weil
01:24 PM rbd Bug #5955 (In Progress): qemu deadlock when librbd caching enabled (writethru or writeback).
Sage Weil
01:20 PM rgw Bug #5362 (Resolved): rgw: failure when listing objects with prefix that starts with underscore
Sage Weil
01:20 PM rgw Bug #4410 (Can't reproduce): rgw: exits uncleanly on fastcgi socket error
Sage Weil
01:18 PM rgw Bug #5374 (Fix Under Review): Avoid relying on keystone's admin token
Ian Colle
01:18 PM rgw Bug #5374: Avoid relying on keystone's admin token
wip-5374 Ian Colle
01:16 PM rgw Bug #5820 (Resolved): radosgw-admin should fail on non-valid flags
Pull request merged. Ian Colle
01:16 PM rgw Bug #5885 (Resolved): Valgrind issue found while running s3 and swift tests
looks like leaks Sage Weil
01:13 PM rgw Bug #5192 (Won't Fix): RGW: radosgw-admin user rm --access-key not working on bobtail
Sage Weil
01:12 PM rgw Bug #5953 (Resolved): rgw: drain requests when going down
Sage Weil
01:09 PM rgw Bug #6046 (Resolved): rgw: empty pool created for control objects
Sage Weil
01:06 PM RADOS Fix #6109 (New): pg <pgid> mark_unfound_lost fails if a completely-gone OSD still in map
cluster on mira045 et al. had bad disk on osd.25; marked out, much data extracted, but for some
reason one pgid (2....
Dan Mick
12:48 PM Bug #6108 (Resolved): broken readdir_r usage
the buffer needs to be ~ sizeof(struct dirent) + PATH_MAX; it can't be a struct dirent or else the filename itself wi... Sage Weil
12:44 PM CephFS Bug #5649 (Can't reproduce): smbtorture test gets ebusy on kclient umount
hasn't come up in a few weeks. Sage Weil
12:43 PM CephFS Bug #5927: kcephfs: ENOTEMPTY on rm -r
Sage Weil
10:39 AM Bug #6090 (Pending Backport): mon/OSDMonitor.cc: 186: FAILED assert(err == 0)
Sage Weil
10:17 AM rgw Bug #6056 (Resolved): rgw: sync agent is not propagating bucket delete
Fixed, commit:2632846e24e3c26139e982e0a569951d25e1589b Yehuda Sadeh
09:58 AM rbd Bug #5426: librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::complete()
ubuntu@teuthology:/a/teuthology-2013-08-23_00:30:06-ceph-deploy-master---basic-saya/1087
slightly different, thoug...
Sage Weil
09:40 AM Feature #5909 (In Progress): mon: keep track of monitor store size estimate vs 'du $mon_data'
Ian Colle
08:49 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Ooh, I think I know what this is. This is probably cuttlefish v0.61.7 or older, right? There is a fix in dumpling (... Sage Weil
03:29 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Sage,
sorry for being late on this, other tasks kept me busy,
but here is the info you were asking for:
> > ceph...
bernhard glomm
08:29 AM Bug #6049: pgmap json output shows bytes_* values quadrupled
oops, sorry about that. pushed to dumpling branch. commit:a0ac88272511d670b5c3756dda2d02c93c2e9776 Sage Weil
02:52 AM Bug #6049: pgmap json output shows bytes_* values quadrupled
Maybe this warrants a backport to dumpling? The fix is tiny and risk-free. Faidon Liambotis
08:16 AM Bug #6085 (Resolved): specify filetype flag (-t) when calling mount
Sage Weil
06:03 AM Bug #6085 (Fix Under Review): specify filetype flag (-t) when calling mount
Opened pull request: https://github.com/ceph/ceph/pull/534 Alfredo Deza
07:24 AM devops Bug #6104 (Resolved): ceph-deploy should workaround pseudo-tty in SSH
When connecting to a host that does not allow `sudo` over SSH, returning an error similar to:... Alfredo Deza
05:27 AM devops Bug #6102 (Resolved): if EPEL has been added skip adding it again
ceph-deploy as of 1.2.2 adds the EPEL repo to CentOS and Scientific, it should not try to add this again if it alread... Alfredo Deza
03:11 AM rgw Bug #5931: radosgw crashes when deleting object
Stumbled upon the same problem using bobtail binaries for ubuntu precise (from http://ceph.com/debian-bobtail repo), ... Moritz Krinke

08/22/2013

08:53 PM Bug #6101 (Can't reproduce): ceph-osd crash on corrupted store
I see a problem with one of my dumpling OSD under debian, on a compressed btrfs. I think my BTRFS is corrupted or som... Mikaël Cluseau
06:40 PM Bug #5951: osd: next: EEXIST on mkcoll
Nothing useful from the last one.
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951...
Samuel Just
06:01 PM Bug #5239 (Can't reproduce): osd: Segmentation fault in ceph-osd / tcmalloc
Let us know if this is still happening for you. Thanks! Sage Weil
06:00 PM Bug #5695 (Resolved): Debian packaging fails when removed but not purged
Sage Weil
05:58 PM CephFS Bug #5883 (Resolved): mds: broken locking, ref count in handle_accept
Sage Weil
05:53 PM Bug #6090 (Fix Under Review): mon/OSDMonitor.cc: 186: FAILED assert(err == 0)
Sage Weil
03:56 PM Bug #6090: mon/OSDMonitor.cc: 186: FAILED assert(err == 0)
Sage Weil
10:41 AM Bug #6090 (Resolved): mon/OSDMonitor.cc: 186: FAILED assert(err == 0)
... Sage Weil
05:18 PM Bug #6099 (Resolved): ceph-rest-api: default log file doesn't work because not daemon
ceph-rest-api assumes that it can either get the user's choice or the default log file from
rados_conf_get("log_file...
Dan Mick
04:42 PM devops Feature #6098 (Rejected): put teuthology.front.sepia.ceph.com apache configuration files under so...
There are several configuration files that are on teuthology.front.sepia.ceph.com that should probably be saved in gi... Anonymous
04:32 PM Bug #6097 (Resolved): btrfs locking regression on async snap ioctl
... Sage Weil
01:40 PM rgw Bug #6078 (Fix Under Review): rgw: CORS not working
Pushed a bunch of changes to wip-6078. Yehuda Sadeh
12:15 PM devops Bug #5499 (Resolved): ceph-deploy --cluster clustername osd prepare fails
Merged into ceph-deploy master branch: 9605cefd71770118097a11f99a9fc27c1e30b1f5 Alfredo Deza
09:57 AM devops Bug #5499 (In Progress): ceph-deploy --cluster clustername osd prepare fails
Alfredo Deza
08:30 AM devops Bug #5499: ceph-deploy --cluster clustername osd prepare fails
Thanks for the update, this should get fixed today with a release before the end of the week. Alfredo Deza
01:36 AM devops Bug #5499: ceph-deploy --cluster clustername osd prepare fails
Bah, redmine formatting sucks... Patch attached. Samir Ibradzic
01:32 AM devops Bug #5499: ceph-deploy --cluster clustername osd prepare fails
This should fix it:
--- /usr/share/pyshared/ceph_deploy/osd.py
+++ /usr/share/pyshared/ceph_deploy/osd.py
@@ -113,...
Samir Ibradzic
11:06 AM devops Bug #6091 (Won't Fix): centos build should use redhat-rpm-config for debuginfo packages
The specfile currently directly invokes the debuginfo macro for centos builds. It should instead use the redhat-rpm-... Anonymous
10:16 AM rgw Bug #6088: rgw: When uploading via POST specifying text instead of file formdata input field, a s...
Version used: 0.56.6-15-g8c6a912 Brian Andrus
10:11 AM rgw Bug #6088 (Resolved): rgw: When uploading via POST specifying text instead of file formdata input...
Specific text from customer:
??When uploading via POST if user specifies the "file" formdata input field as "text"...
Brian Andrus
09:55 AM devops Bug #6086 (Resolved): ceph-deploy needs to handle ClientInitExceptions from pushy
Merged into ceph-deploy master branch with hash: dfaa9d3274b3c8c0dcfce94062532649bf212fb9
Basically, a simple try/...
Alfredo Deza
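A rough sketch of the try/except idea mentioned in this entry, not the merged ceph-deploy change. `ClientInitException` is defined locally as a stand-in for the exception pushy raises, and `fake_connect` and `get_connection` are hypothetical names used only so the example runs on its own.

```python
class ClientInitException(Exception):
    """Stand-in for the exception raised by pushy; defined here for the sketch."""


def fake_connect(hostname):
    """Hypothetical connector that always fails, to exercise the error path."""
    raise ClientInitException(f"could not start remote client on {hostname}")


def get_connection(hostname, connect=fake_connect):
    """Return a connection, or raise a short RuntimeError instead of a raw traceback."""
    try:
        return connect(hostname)
    except ClientInitException as err:
        # Re-raise with a message that names the host the user asked for.
        raise RuntimeError(f"connecting to host {hostname} failed: {err}") from err


try:
    get_connection("node1")
except RuntimeError as err:
    print(err)
```

Wrapping the connection call in one place means every subcommand that opens a remote connection gets the same readable error.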
09:49 AM devops Bug #6086 (Fix Under Review): ceph-deploy needs to handle ClientInitExceptions from pushy
Opened pull request: https://github.com/ceph/ceph-deploy/pull/48 Alfredo Deza
07:34 AM devops Bug #6086 (Resolved): ceph-deploy needs to handle ClientInitExceptions from pushy
When pushy can't connect to a remote host it errors out and ceph-deploy does not handle the exception
resulting in a...
Alfredo Deza
09:53 AM devops Bug #6077 (Resolved): lsb_release should not be required for purging CentOS/Scientific
Merged into ceph-deploy master branch with hash: 5d7304cabcca4bde7dd439d4578300777b28575c
Moved the helpers for ls...
Alfredo Deza
07:34 AM devops Bug #6077 (Fix Under Review): lsb_release should not be required for purging CentOS/Scientific
Pull request opened: https://github.com/ceph/ceph-deploy/pull/47 Alfredo Deza
09:42 AM Bug #6003: journal Unable to read past sequence 406 ...
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-08-22_01:00:13-rados-next-testing-basic-plana/804 Sage Weil
09:36 AM Bug #6081 (Duplicate): osd crashed during upgrade tests from dumpling to next in the nightlies
dup of #6082 Sage Weil
09:05 AM rbd Bug #5636: krbd: crash in image refresh
again on ubuntu@teuthology:/a/teuthology-2013-08-22_01:01:30-krbd-next-testing-basic-plana/1020... Sage Weil
08:57 AM rbd Bug #5426: librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::complete()
ubuntu@teuthology:/a/teuthology-2013-08-22_01:01:01-rbd-next-testing-basic-plana/888 Sage Weil
08:50 AM Bug #6047 (In Progress): mon: Assert and monitor-crash when attemting to create pool-snapshots wh...
Joao Eduardo Luis
08:24 AM Feature #5909 (Fix Under Review): mon: keep track of monitor store size estimate vs 'du $mon_data'
wip-5909 / pr: #526 Joao Eduardo Luis
08:15 AM CephFS Bug #6087 (Resolved): mds: do not loop on old dirs missing backpointer xattrs
... Sage Weil
07:16 AM Bug #6085: specify filetype flag (-t) when calling mount
This is the actual patch that fixes this problem:... Alfredo Deza
07:11 AM Bug #6085 (Resolved): specify filetype flag (-t) when calling mount
When not specifying the filetype `mount` will refuse to mount the filesystem created as seen in this thread from ceph... Alfredo Deza

08/21/2013

05:10 PM Bug #6083 (Resolved): fedora18 rpm packages for ceph should be built with proper naming conventio...
currently, in the nightlies, ceph-debug package is looked for in a format "ceph-debug-0.67.1-11.gf6fe74f.fc18" but th... Tamilarasi muthamizhan
04:48 PM Feature #5984: mon: probe monitors to check on their status regardless of quorum
Dan Mick
04:44 PM Support #6070 (Resolved): list_lockers() never returns after disk full (librbdpy)
commit:38a0ca66a79af4b541e6322467ae3a8a4483cc72 in master, next, dumpling and cuttlefish Josh Durgin
03:06 PM Support #6070 (In Progress): list_lockers() never returns after disk full (librbdpy)
There's a fix waiting for review in the wip-6070-cuttlefish branch. It applies cleanly to the next branch as well. Josh Durgin
04:24 PM Bug #6081 (Duplicate): osd crashed during upgrade tests from dumpling to next in the nightlies
logs: ubuntu@teuthology:/a/teuthology-2013-08-21_01:35:03-upgrade-parallel-next-testing-basic-vps/5197... Tamilarasi muthamizhan
03:21 PM rbd Fix #6079 (Resolved): libceph: osd_client does not handle PAUSERD or PAUSEWR or FULL flags in osdmap
When these flags are present, reads and or writes should not be sent. When these flags are removed, requests that wer... Josh Durgin
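The behaviour described here can be sketched as a client that holds requests while a blocking flag is set. This is an illustration only, not the libceph implementation: the bit values, the `SketchClient` class, and the choice that PAUSERD blocks reads while PAUSEWR and FULL block writes are assumptions made for the example, as is the assumption that held requests are sent once the flags clear.

```python
# Hypothetical bit values, chosen for illustration only.
PAUSERD = 0x1
PAUSEWR = 0x2
FULL = 0x4


class SketchClient:
    """Toy client that queues requests while the map flags forbid sending them."""

    def __init__(self):
        self.flags = 0
        self.pending = []  # requests held back by a flag
        self.sent = []     # requests that were actually sent

    def _blocked(self, op):
        # Assumption: PAUSERD blocks reads; PAUSEWR and FULL block writes.
        if op == "read":
            return bool(self.flags & PAUSERD)
        return bool(self.flags & (PAUSEWR | FULL))

    def submit(self, op, name):
        if self._blocked(op):
            self.pending.append((op, name))
        else:
            self.sent.append((op, name))

    def update_flags(self, flags):
        """Apply new map flags and retry everything that was held back."""
        self.flags = flags
        waiting, self.pending = self.pending, []
        for op, name in waiting:
            self.submit(op, name)


client = SketchClient()
client.update_flags(FULL)
client.submit("write", "a")  # held: FULL is set
client.submit("read", "b")   # sent: reads are not blocked by FULL here
client.update_flags(0)       # flags cleared, the held write goes out
print(client.sent)
```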
02:21 PM rgw Bug #6078 (Resolved): rgw: CORS not working
Yehuda Sadeh
01:36 PM devops Feature #5847: Build own versions of most recent leveldb for all supported platforms.
Both the native leveldb-1.12 and our locally compiled version work correctly for mon create. So the issue encountere... Anonymous
11:47 AM devops Feature #5847 (In Progress): Build own versions of most recent leveldb for all supported platforms.
Re-opening since the leveldb-1.12 backported from fedora19 hangs during monitor create on centos/rhel 6.3 & 6.4 as de... Anonymous
11:51 AM Bug #6022 (Resolved): monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
Deleting leveldb-1.12 from the ceph-extras repo, and from the local mirror used by teuthology falls back to the level... Anonymous
11:37 AM Bug #5951: osd: next: EEXIST on mkcoll
No failures again. Switched yaml to xfs, more failures so far with xfs than with ext4.
~/teuthology [mine?] » ./v...
Samuel Just
11:17 AM Bug #6071 (Resolved): rados api test LibRadosMisc.BigAttrPP failed on the arm set up

test passed....
Tamilarasi muthamizhan
10:58 AM Bug #6071: rados api test LibRadosMisc.BigAttrPP failed on the arm set up
yes, the test set up had different versions for ceph-test[defaults to master branch by install task as no branch was ... Tamilarasi muthamizhan
11:13 AM rgw Bug #6046 (Pending Backport): rgw: empty pool created for control objects
Sage Weil
11:05 AM devops Bug #6077 (Resolved): lsb_release should not be required for purging CentOS/Scientific
lsb_release seems to still be required for some ceph-deploy actions, purge/purgedata use it and should really attempt... Alfredo Deza
11:04 AM Bug #5412 (Resolved): doc bug: incorrect reference to monitor quorum requirements
http://ceph.com/docs/master/rados/deployment/ceph-deploy-mon/ John Wilkins
10:57 AM devops Documentation #5968 (Resolved): typo in monmap
http://ceph.com/docs/master/man/8/monmaptool/ John Wilkins
10:53 AM rgw Documentation #5525 (Resolved): Radosgw 'add the ceph keyring entries' section should be updated ...
http://ceph.com/docs/master/radosgw/config/#add-to-ceph-keyring-entries
http://ceph.com/docs/master/start/quick-rgw/...
John Wilkins
10:49 AM Fix #6075 (Rejected): ceph.client.admin.keyring doesn't allow read to non-root users
The ceph-create-keys script creates the file with 0600 and this causes issues when deploying. Our docs have to add th... John Wilkins
10:47 AM rgw Bug #6056 (In Progress): rgw: sync agent is not propagating bucket delete
You've got my Reviewed-by: with the comment change we discussed, assuming you've tested it. Greg Farnum
10:44 AM Documentation #5926 (Resolved): 5 minute quick start should deploy cluster using ceph-deploy and ...
mkcephfs references removed from documentation. John Wilkins
10:44 AM devops Documentation #5688 (Resolved): ceph-deploy: upgrade procedure has to be documented
http://ceph.com/docs/master/install/upgrading-ceph/ John Wilkins
10:43 AM Bug #6074 (Duplicate): [ERR] scrub mismatch
This looks like #5754. It's a bug in leveldb on precise, but harmless. Sage Weil
10:29 AM Bug #6074 (Duplicate): [ERR] scrub mismatch
While running the rados suite with... Loïc Dachary
10:33 AM rgw Feature #5604 (Resolved): rgw: teuthology tests to check various user creation issues on multi re...
These tests were added via commit #a39e7f1b095d3cb07f15ed065b4841d8730ed584
Anonymous
10:29 AM rgw Feature #5603 (Resolved): rgw: teuthology test to check secondary region creation
This test case is a subset of 5604. Closing it as resolved since 5604 has been resolved. Anonymous
10:24 AM rgw Feature #5602 (Resolved): rgw: teuthology task to test default region as master region
This test case was checked into ceph-qa-suite as suites/rgw/singleton/all/rados-convert-to-region.yaml
commit #c37fa...
Anonymous
09:56 AM rgw Bug #6051 (Resolved): rgw: 404 during readwrite test
The issue was that the tests were not specifying the 'domain root pool' and the pool name generated by the rgw.py tas... Anonymous
08:52 AM Bug #6073 (Can't reproduce): osd: mark_me_down sequence is racy
... Sage Weil
03:06 AM Subtask #5879 (In Progress): jerasure plugin
Loïc Dachary
02:06 AM rbd Bug #6072 (Resolved): librbd image rename breaks child backwards reference
Renaming an rbd image, that has clones, with a large name will break Image().parent_info() i.e. reverse lookup and th... Edward Hope-Morley

08/20/2013

10:58 PM rgw Bug #6056: rgw: sync agent is not propagating bucket delete
Comment on github. Greg Farnum
09:48 AM rgw Bug #6056: rgw: sync agent is not propagating bucket delete
Tag, Greg, you're it. Ian Colle
09:40 AM rgw Bug #6056: rgw: sync agent is not propagating bucket delete
Josh - can you please review? Ian Colle
10:40 PM devops Bug #5599 (Pending Backport): ceph-disk: prepare should issue a partprobe on the journal device too
Sage Weil
05:51 PM Bug #6071: rados api test LibRadosMisc.BigAttrPP failed on the arm set up
it is sending a 40MB xattr and failing. it should be sending 64K. note that 2 lines down from osd max attr in confi... Sage Weil
05:40 PM Bug #6071: rados api test LibRadosMisc.BigAttrPP failed on the arm set up
logs with debug on is in: mira025: /home/ubuntu/bug_6071_latest Tamilarasi muthamizhan
05:15 PM Bug #6071: rados api test LibRadosMisc.BigAttrPP failed on the arm set up
xfs Tamilarasi muthamizhan
05:12 PM Bug #6071: rados api test LibRadosMisc.BigAttrPP failed on the arm set up
logs are copied to ubuntu@mira025:/home/ubuntu/bug_6071 Tamilarasi muthamizhan
05:09 PM Bug #6071: rados api test LibRadosMisc.BigAttrPP failed on the arm set up
the test is... Sage Weil
05:04 PM Bug #6071 (Resolved): rados api test LibRadosMisc.BigAttrPP failed on the arm set up
The rados api test failed on the arm test setup when using the ceph-deploy task, while it still worked with the install task.
...
Tamilarasi muthamizhan
05:27 PM Bug #5951: osd: next: EEXIST on mkcoll
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951-7" -n 50 --owner samuelj@slider test... Samuel Just
03:47 PM Bug #5951: osd: next: EEXIST on mkcoll
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951-6" -n 50 --owner samuelj@slider test... Samuel Just
01:10 PM Bug #5951: osd: next: EEXIST on mkcoll
ubuntu@teuthology:/a/teuthology-2013-08-20_01:00:13-rados-next-testing-basic-plana/2690 Sage Weil
09:18 AM Bug #5951: osd: next: EEXIST on mkcoll
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-08-19_20:00:16-rados-dumpling-testing-basic-plana/2087 Sage Weil
05:23 PM Bug #5922: osd: unfound objects on next
After examining the running process of one of these, it really looks like either the replica ignored the message for ... Samuel Just
04:58 PM CephFS Bug #6004 (Pending Backport): osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(lof...
Sage Weil
04:40 PM Support #6070 (Resolved): list_lockers() never returns after disk full (librbdpy)
In the situation where an OSD disk fills up and then we attempt to unmount a resource, we have a process which, among... JuanJose Galvez
03:51 PM rbd Bug #5220: test_ls_snaps segfaults on the arm test setup
teuthology logs are copied to ubuntu@mira025:/home/ubuntu/rbd_api_old Tamilarasi muthamizhan
03:39 PM Bug #6040 (Resolved): Significant slowdown of osds since v0.67 Dumpling
From ceph-users:
Hey Samuel,
I picked up 0.67.1-10-g47c8949 from the GIT-builder and the osd from
that seems t...
Samuel Just
12:53 PM devops Bug #6035: ceph-deploy: ceph-create-keys stuck on fedora 18 VMs
ceph version 0.67.1 (e23b817ad0cf1ea19c0a7b7c9999b30bed37d533)
looks like the mon create command doesn't hang anym...
Tamilarasi muthamizhan
12:04 PM devops Bug #6035: ceph-deploy: ceph-create-keys stuck on fedora 18 VMs
Tamil, I attempted to start manually the mon on one of those servers and got errors:... Alfredo Deza
12:26 PM Bug #6022: monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
Rebuilding leveldb-1.12 without the Basho patch seems to work ok. This patch is described as:
# Cherry-picked fro...
Anonymous
12:06 PM Bug #6022: monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
Looks like the old leveldb-1.7.0 package works ok with dumpling on centos6.3, but the new leveldb-1.12 package does n... Anonymous
08:02 AM Bug #6022: monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
6.3 fails in the same way:... Alfredo Deza
12:20 PM Bug #6049 (Resolved): pgmap json output shows bytes_* values quadrupled
Dan Mick
11:50 AM devops Feature #4954 (New): ceph-deploy: help and document need to be updated for osd create
Alfredo Deza
11:28 AM Bug #6045 (Pending Backport): mon/OSDMonitor.cc: 1609: FAILED assert(err == 0)
Sage Weil
10:14 AM devops Feature #6067 (Resolved): ceph-deploy: make mon create catch common errors
a few ideas:
- add a --add argument that is needed to expand the mon cluster. if not present, we will only procee...
Sage Weil
09:52 AM devops Bug #6019 (Resolved): ceph-deploy needs to better detect yum/apt for bootstraping
Merged into ceph-deploy master: 252c21dec59ba1ff407362a6b21f043b7b8947ef
We are now making sure we are adding the ...
Alfredo Deza
08:53 AM devops Bug #6019 (Fix Under Review): ceph-deploy needs to better detect yum/apt for bootstraping
Opened pull request: https://github.com/ceph/ceph-deploy/pull/46 Alfredo Deza
08:07 AM rbd Bug #5955: qemu deadlock when librbd caching enabled (writethru or writeback).
This hang occurred frequently with qemu 1.4.0, but after a week of trying, I cannot reproduce this bug under qemu 1.5... Mike Dawson
04:27 AM Subtask #6064 (Rejected): erasure code : convenience functions to code / decode
It would be useful to have convenience functions that "work in terms of offset+length instead of chunks":http://arti... Loïc Dachary

08/19/2013

10:54 PM Bug #6057 (Resolved): osd: log bound mismatch after bobtail -> dumpling -> next upgrade
yay, tested ok for me too. merged and backported Sage Weil
10:42 PM Bug #6057: osd: log bound mismatch after bobtail -> dumpling -> next upgrade
Ran the above yaml on wip-6057, seems to work. Samuel Just
05:59 PM Bug #6057: osd: log bound mismatch after bobtail -> dumpling -> next upgrade
wip-6057 Samuel Just
04:55 PM Bug #6057: osd: log bound mismatch after bobtail -> dumpling -> next upgrade
fatty:/home/sage/tmp/6057/ceph-osd.0.log for the full log Sage Weil
04:55 PM Bug #6057: osd: log bound mismatch after bobtail -> dumpling -> next upgrade
... Sage Weil
02:11 PM Bug #6057 (Resolved): osd: log bound mismatch after bobtail -> dumpling -> next upgrade
2013-08-19 06:24:27.814763 osd.2 0.0.0.0:6808/24619 1 : [ERR] 3.4 log bound mismatch, info (0''0,30''164] actual [21'... Sage Weil
10:47 PM Bug #5951: osd: next: EEXIST on mkcoll
no failures in previous run
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951-5" -n 5...
Samuel Just
03:43 PM Bug #5951: osd: next: EEXIST on mkcoll
no failures in the last run
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951-4" -n 5...
Samuel Just
11:08 AM Bug #5951: osd: next: EEXIST on mkcoll
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951-3" -n 20 --owner samuelj@slider test... Samuel Just
08:58 AM Bug #5951: osd: next: EEXIST on mkcoll
ubuntu@teuthology:/a/teuthology-2013-08-19_01:00:13-rados-master-testing-basic-plana/969 Sage Weil
10:40 PM Bug #6040: Significant slowdown of osds since v0.67 Dumpling
merged wip-dumpling-pglog-undirty with the config set to false into next and dumpling. Samuel Just
12:08 AM Bug #6040: Significant slowdown of osds since v0.67 Dumpling
wip-dumpling-pglog-undirty may help with this. Samuel Just
06:09 PM devops Bug #5599: ceph-disk: prepare should issue a partprobe on the journal device too
Actually this patch (attached) is probably more in keeping with the code style already used in ceph-disk, uses partpr... Mark Kirkwood
05:13 PM devops Bug #5599: ceph-disk: prepare should issue a partprobe on the journal device too
Fyi a tentative patch has been suggested, using partx rather than partprobe (no idea which might be best mind you):
...
Mark Kirkwood
05:06 PM rgw Bug #6056 (Fix Under Review): rgw: sync agent is not propagating bucket delete
We end up not removing the bucket entry point, although the bucket is unlinked from the user. Yehuda Sadeh
10:04 AM rgw Bug #6056: rgw: sync agent is not propagating bucket delete
I should have noted that I would expect that info for the bucket should not be found on either the source or the dest... Anonymous
10:02 AM rgw Bug #6056 (Resolved): rgw: sync agent is not propagating bucket delete
A new test that deletes an existing bucket on the source, then does a sync, then tries to get info for that bucket on... Anonymous
04:10 PM Bug #5902 (Resolved): s3tests failure during parallel upgrade test
backported Sage Weil
03:42 PM Bug #6003: journal Unable to read past sequence 406 ...
200 runs later and no luck reproducing this with logs.
/var/lib/teuthworker/archive/sage-bug-6003-a
200 passes
Sage Weil
09:38 AM Bug #6003: journal Unable to read past sequence 406 ...
Run this task repeatedly with logging Ian Colle
03:24 PM Fix #6059 (Resolved): osd: block reads while repgather is writing across replicas
Currently we use the ondisk_write/read locks to do mutual exclusion over the local filestore which avoids reading dat... Sage Weil
03:21 PM Feature #5905 (Resolved): hello world librados program (with explanatory comments!)
Merged into master, commit:823435ce650a2be0523eba0d91dc9feb28b795f7 Greg Farnum
02:42 PM Bug #6058 (Duplicate): upgrading from bobtail to dumpling to next: log bound mismatch and wrong n...
Tamilarasi muthamizhan
02:39 PM Bug #6058 (Duplicate): upgrading from bobtail to dumpling to next: log bound mismatch and wrong n...
These failures are seen when running the rgw upgrade tests from bobtail to dumpling to the next branch.
logs from the n...
Tamilarasi muthamizhan
01:37 PM CephFS Bug #5039 (Resolved): client: unlinking files leaves the cached entry behind
Sage Weil
01:04 PM Bug #6043: upstart does not reflect running ceph-osd daemons (ubuntu 13.04 only)
getting somewhere. but it still can't find it.... Zoltan Arnold Nagy
12:31 PM Bug #6043 (In Progress): upstart does not reflect running ceph-osd daemons (ubuntu 13.04 only)
is the ceph package still installed? some older versions didn't stop the jobs before they uninstalled, which might e... Sage Weil
11:54 AM Bug #6043: upstart does not reflect running ceph-osd daemons (ubuntu 13.04 only)
well...... Zoltan Arnold Nagy
09:33 AM Bug #6043 (Rejected): upstart does not reflect running ceph-osd daemons (ubuntu 13.04 only)
stop ceph-osd id=0
or
stop ceph-osd-all
Sage Weil
12:50 PM Bug #6052 (Resolved): ceph cli doesn't respect CEPH_ARGS
Sage Weil
09:42 AM Bug #6052 (Fix Under Review): ceph cli doesn't respect CEPH_ARGS
Sage Weil
09:01 AM Bug #6052 (Resolved): ceph cli doesn't respect CEPH_ARGS
... Sage Weil
12:29 PM Feature #6036 (Fix Under Review): cachepool: osd: add objecter
Sage Weil
11:20 AM Bug #5988 (Resolved): librados: synchronous IO generally returns on ack instead of commit
Merged into master, thanks Sage. Greg Farnum
11:16 AM Bug #5988 (Fix Under Review): librados: synchronous IO generally returns on ack instead of commit
wip-5988, commit:4e86be9232602ed595d885fcaeda5e47ad9a2a6a, pull request 512. Greg Farnum
11:19 AM Bug #5979 (Resolved): librados: imposes internal tooling expectations on users
Merged into master. Greg Farnum
11:15 AM Bug #5979 (Fix Under Review): librados: imposes internal tooling expectations on users
wip-5988, commit:f5636be742bffb19f16fdb832891fd1a43679ccf. Pull request 512. Greg Farnum
09:53 AM Fix #5844: osd: snaptrimmer should throttle itself
(09:57:41 AM) sjust: xdeller: right, the only way to fix that would be to increase OSDMap propagation speed
Regard...
Igor Lukyanov
09:42 AM rgw Bug #6051: rgw: 404 during readwrite test
Missed adding the traceback entry to an error dict in one codepath. I've pushed a one-line fix to the s3-tests branch... Anonymous
09:00 AM rgw Bug #6051 (Resolved): rgw: 404 during readwrite test
... Sage Weil
09:39 AM rgw Bug #6046 (Fix Under Review): rgw: empty pool created for control objects
Yehuda Sadeh
09:36 AM Bug #6041 (Resolved): Failing to add 3rd monitor
Please upgrade to 0.67(.1) dumpling; 0.64 is an interim development release that doesn't get backported fixes (as 0.6... Sage Weil
08:53 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
bernhard glomm wrote:
> Sage Weil wrote:
> > bernhard: i think the problem in your case is that you have old keyrin...
Sage Weil
12:40 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Sage Weil wrote:
> bernhard: i think the problem in your case is that you have old keyrings in /etc/ceph from prior ...
bernhard glomm
07:34 AM Subtask #5862: FileStore must work with ghobjects rather than hobjects
Here is how I understand "stripe":https://www.usenix.org/legacy/events/fast09/tech/full_papers/plank/plank.pdf / shar... Loïc Dachary
07:11 AM devops Bug #6019 (In Progress): ceph-deploy needs to better detect yum/apt for bootstraping
Alfredo Deza
06:44 AM Subtask #5878: erasure plugin mechanism and abstract API
Wido den Hollander wrote:
> Will this be case sensitive? I would suggest not, since that will confuse users. I pers...
Loïc Dachary
06:32 AM Subtask #5878: erasure plugin mechanism and abstract API
I haven't looked at it in-depth, but one thing I noticed is that Reed-Solomon is always spelled with the first two le... Wido den Hollander
02:44 AM Bug #6047: mon: Assert and monitor-crash when attemting to create pool-snapshots while rbd-snapsh...
This is pretty much the same as #5959, which was reported on cuttlefish and which we believed to have fixed on commit... Joao Eduardo Luis

08/18/2013

10:55 PM Bug #6022 (New): monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
verified that i can reproduce this. my first guess is a problem with the leveldb package on 6.4. 6.3 passes? Sage Weil
10:55 PM Bug #6022: monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
Sage Weil
10:32 PM Bug #6049 (Resolved): pgmap json output shows bytes_* values quadrupled
I was looking at the "ceph --format=json status" pgmap bytes_{used,avail,total} values with the goal of using them to... Faidon Liambotis
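For anyone consuming these fields from a script, here is a minimal Python sketch of reading the pgmap byte counters from `ceph --format=json status` output. The sample document and its numbers are made up for illustration. The assumption that `bytes_used`, `bytes_avail`, and `bytes_total` sit directly under a top-level `pgmap` key is inferred from the field names in the report, not confirmed by it.

```python
import json

# Hypothetical sample; real status output has many more fields and other values.
SAMPLE_STATUS = '''
{"pgmap": {"bytes_used": 1024, "bytes_avail": 3072, "bytes_total": 4096}}
'''


def pgmap_bytes(status_json):
    """Return (used, avail, total) from a status JSON string.

    Assumes the three counters live under a top-level "pgmap" key.
    """
    pgmap = json.loads(status_json)["pgmap"]
    return pgmap["bytes_used"], pgmap["bytes_avail"], pgmap["bytes_total"]


def used_fraction(status_json):
    """Return used/total as a float, or 0.0 when total is zero."""
    used, _avail, total = pgmap_bytes(status_json)
    return used / total if total else 0.0


if __name__ == "__main__":
    print(pgmap_bytes(SAMPLE_STATUS))
    print(used_fraction(SAMPLE_STATUS))
```

Comparing these numbers against an independent source such as `df` on the OSD hosts is one way to notice a scaling problem like the one reported here.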
09:08 PM Bug #5897 (Resolved): ceph_test_rados_api_watch_notify hang on LibRadosWatchNotify.WatchNotifyTim...
fix preceded dumpling; backported to cuttlefish branch Sage Weil
09:06 PM rgw Bug #5949 (Resolved): radosgw: leaks
Sage Weil
11:42 AM devops Bug #4924 (Resolved): ceph-deploy: gatherkeys fails on raring (cuttlefish)
closing out this bug. i think i captured everything we learned in http://pad.ceph.com/p/quorum_pitfalls along with a... Sage Weil
11:36 AM rbd Bug #5919 (Resolved): qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to ke...
Sage Weil
11:08 AM Bug #6047 (Resolved): mon: Assert and monitor-crash when attemting to create pool-snapshots while...
While playing around on my test-cluster, I ran into a problem that I've seen before, but have never been able to repr... Oliver Daudey

08/17/2013

09:24 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Hi Sage, sorted it just before your reply. Was idly scrolling back through the thread when I spotted the word 'iptabl... Michael Potter
09:09 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Michael Potter wrote:
> Hi Sage, took everything out of the host for except for #ipaddr# #subdomain-identifier#.#res...
Sage Weil
07:45 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Hi Sage, took everything out of the host for except for #ipaddr# #subdomain-identifier#.#resolveable-domain#
Cleaned...
Michael Potter
06:14 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Ah, I think the problem is
{ "rank": 0,
"name": "#subdomain-identifier#",
...
Sage Weil
05:54 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Sage Weil wrote:
> Michael Potter wrote:
> > Getting the same thing on a fresh install of CentOS 6.4
> >
> > [.....
Michael Potter
06:35 PM rgw Bug #6046 (Resolved): rgw: empty pool created for control objects
Yehuda Sadeh
10:48 AM devops Feature #6017: ceph-deploy mon create: create on all mons in ceph.conf + then do gatherkeys if no...
how about
ceph-deploy mon create-initial
which will
1. do mon create on each mon in the mon_initial_quorum ...
Sage Weil
10:07 AM Bug #6045 (Fix Under Review): mon/OSDMonitor.cc: 1609: FAILED assert(err == 0)
wip-6045
Sage Weil
09:57 AM Bug #6045 (In Progress): mon/OSDMonitor.cc: 1609: FAILED assert(err == 0)
we need to refresh any time we apply committed states to disk Sage Weil
09:47 AM Bug #6045 (Resolved): mon/OSDMonitor.cc: 1609: FAILED assert(err == 0)
... Sage Weil
10:01 AM CephFS Documentation #5797: Document unstable nature of CephFS
My PR is still open? Scott Devoid
09:10 AM Bug #5923 (Need More Info): osd: 6 up, 5 in; 91 active+clean, 1 remapped
Sage Weil
09:10 AM Bug #5902 (Pending Backport): s3tests failure during parallel upgrade test
Sage Weil
09:10 AM Bug #5901 (In Progress): stuck incomplete immediately after clean
Sage Weil
09:09 AM Bug #6003 (Need More Info): journal Unable to read past sequence 406 ...
Sage Weil
09:07 AM Bug #5959 (Resolved): Quorum is crashing on 'osd pool mksnap'
backported by commit:64bef4ae4bab28b0b82a1481381b0c68a22fe1a4 Sage Weil
09:00 AM Bug #5986: mon: FAILED assert(snaps.count(s)) when removing pool snap on 0.61.7
backported in commit:411871f6bcc9a4b81140c2e98d13dc123860f6f7 Sage Weil
09:00 AM Bug #5986 (Resolved): mon: FAILED assert(snaps.count(s)) when removing pool snap on 0.61.7
Sage Weil
08:58 AM Bug #5985 (Pending Backport): very slow recovery for some objects
Sage Weil
03:54 AM Bug #6043 (Won't Fix): upstart does not reflect running ceph-osd daemons (ubuntu 13.04 only)
h3. Workaround
Using *restart* instead of *reload* restarts the daemons instead of sending them a signal that grac...
Zoltan Arnold Nagy
12:36 AM CephFS Bug #6004: osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(loff_t)length-opos)
the fix looks good Zheng Yan
12:00 AM Bug #6041 (Resolved): Failing to add 3rd monitor
After adding an additional (3rd) monitor, that new monitor will crash during first sync.
Ceph version: 0.64
201...
Bram Pieters

08/16/2013

11:42 PM Feature #5964 (Resolved): ceph-post-file (to replace/supplement cephdrop)
Sage Weil
11:35 PM Feature #6036 (In Progress): cachepool: osd: add objecter
Sage Weil
03:17 PM Feature #6036 (Resolved): cachepool: osd: add objecter
Sage Weil
10:08 PM CephFS Bug #4894 (Resolved): mds: standby shut itself down due to not having any data
Sage Weil
10:56 AM CephFS Bug #4894 (Fix Under Review): mds: standby shut itself down due to not having any data
wip-4894
saw this again in ubuntu@teuthology:/a/teuthology-2013-08-15_20:01:04-fs-cuttlefish-testing-basic-plana/1...
Sage Weil
10:03 PM rbd Bug #5955 (Need More Info): qemu deadlock when librbd caching enabled (writethru or writeback).
Sage Weil
09:09 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Diego Woitasen wrote:
> What do you think? https://github.com/ceph/ceph/pull/510
aha, i bet this is what is tripp...
Sage Weil
06:50 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
What do you think? https://github.com/ceph/ceph/pull/510 Diego Woitasen
05:55 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
I think the documentation is a little confusing. I had the same problems minutes ago and fixed them. In my scenario I ha... Diego Woitasen
05:43 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Michael Potter wrote:
> Getting the same thing on a fresh install of CentOS 6.4
>
> [...]
Can you post the out...
Sage Weil
03:42 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Getting the same thing on a fresh install of CentOS 6.4... Michael Potter
09:58 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Hi,
I'm hitting the same bug on red hat 6.3 (Santiago), purging /var/lib/ceph and /etc/ceph doesn't help.
Bartlomiej Palmowski
09:26 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
bernhard: i think the problem in your case is that you have old keyrings in /etc/ceph from prior cluster instances. ... Sage Weil
06:59 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
... bernhard glomm
06:33 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
> running ceph-create-keys manually gives:
>
> INFO:ceph-create-keys:ceph-mon is not in quorum: u'probing'
> repea...
bernhard glomm
09:03 PM Bug #6040: Significant slowdown of osds since v0.67 Dumpling
For completeness, the relevant part of "ceph.conf", the rest of which just defines a standard 3-node cluster, with mo... Oliver Daudey
06:20 PM Bug #6040: Significant slowdown of osds since v0.67 Dumpling
Kernel: SMP Debian 3.2.46-1~bpo60+1 x86_64 GNU/Linux on Debian Squeeze.
QEMU/KVM: 1:1.1.2+dfsg-2~bpo60+1, recompiled...
Oliver Daudey
06:10 PM Bug #6040 (Resolved): Significant slowdown of osds since v0.67 Dumpling
I'm running a Ceph-cluster with 3 nodes, each of which runs a mon, osd and mds. I'm using RBD on this cluster as sto... Oliver Daudey
06:02 PM rgw Bug #5953 (Pending Backport): rgw: drain requests when going down
Sage Weil
05:23 PM rbd Bug #5220 (In Progress): test_ls_snaps segfaults on the arm test setup
hitting this again on dumpling but when tried with rbd old format. hence, reopening the bug... Tamilarasi muthamizhan
04:21 PM CephFS Bug #6004: osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(loff_t)length-opos)
Zheng Yan wrote:
> Sage Weil wrote:
> > looks like a read vs truncate race...
> > [...]
>
> looks like client r...
Sage Weil
03:31 PM Feature #6000 (In Progress): EC: [link] erasure plugin mechanism and abstract API
Loïc Dachary
03:26 PM Feature #6038 (Resolved): cachepool: filestore/osd: infrastructure for large object COPY atomic r...
Sage Weil
03:24 PM Feature #6037 (Resolved): cachepool: osd: whiteout state
Sage Weil
03:13 PM devops Bug #6035 (Closed): ceph-deploy: ceph-create-keys stuck on fedora 18 VMs
logs: ubuntu@teuthology:/a/teuthology-2013-08-16_01:10:04-ceph-deploy-master-testing-basic-vps/109387... Tamilarasi muthamizhan
02:39 PM Subtask #5862: FileStore must work with ghobjects rather than hobjects

Use a "generation" number (gen_t?) instead of version_t.
Use a "shard" or "slice" number (shard_t or slice_t) i...
David Zafman
02:20 PM Feature #6033 (Resolved): cachepool: osd: basic io decision: read/write from/to cache pool or EAG...
Sage Weil
02:18 PM Feature #6032 (Resolved): cachepool: objecter: send requests to cache pool
Sage Weil
02:17 PM Feature #6029: cachepool: osd: separate object version from pg version
librados visible version separate from PG version. There must also be an Objecter interface usable (in the future) f... Samuel Just
02:14 PM Feature #6029 (Resolved): cachepool: osd: separate object version from pg version
Sage Weil
02:17 PM Feature #6031 (Resolved): cachepool: osd: COPY from another pool; small objects only
Sage Weil
02:15 PM Feature #5703 (Duplicate): Allow ceph-deploy to work with non-root account
Duplicate of #3347 Neil Levine
02:15 PM Feature #6030 (Resolved): cachepool: osd: pg_pool_t cache_pool property
Sage Weil
02:08 PM Feature #5908 (Rejected): mon: formatted output sections should be consistent across services and...
Sage Weil
02:06 PM Feature #5904 (Resolved): hello world osd class (with explanatory documentation/comments!)
Sage Weil
02:04 PM Feature #6028 (Resolved): EC: [link] ensure that erasure coded pools don't work until the osds ca...
Samuel Just
02:02 PM Subtask #6027 (Resolved): ensure that erasure coded pools don't work until the osds can handle it
"work in progress":https://github.com/ceph/ceph/pull/941
Perhaps the OSDMap includes a lower bound set of feature ...
Samuel Just
01:37 PM devops Feature #5775: create qemu rbd package for rhel 6.5 - qemu-rbd
It's actually a symlink from /usr/lib64/qemu/librbd.so.1 to the librbd.so installed by the librbd package Josh Durgin
01:31 PM devops Feature #5775: create qemu rbd package for rhel 6.5 - qemu-rbd
depends on librbd
installs a symlink in /usr/lib/qemu to librbd.so installed by the librbd package
Sage Weil
01:26 PM devops Feature #5775: create qemu rbd package for rhel 6.5 - qemu-rbd
Sage Weil
01:28 PM rbd Cleanup #5757 (Resolved): remove any fiemap reference from rbd.cc
Sage Weil
01:25 PM rbd Feature #5774 (Resolved): test libvirt + qemu on rhel
Josh Durgin
01:22 PM rbd Feature #5774 (Need More Info): test libvirt + qemu on rhel
? Neil Levine
01:24 PM rgw Feature #2460 (Rejected): rgw: support multiple ceph backends
Sage Weil
01:23 PM rgw Cleanup #3154 (Rejected): rgw: configurable auid when creating pools
Sage Weil
01:20 PM devops Fix #5900: Create a Python package for ceph Python bindings
I've made all the bindings individual packages and updated the `ceph-rest-api` script to use the right imports.
Instal...
Alfredo Deza
01:12 PM rgw Documentation #5669: Default site in Apache interferes with Gateway
This has been in the documentation for some time.
http://ceph.com/docs/master/radosgw/config/#enable-the-configurati...
John Wilkins
01:09 PM Bug #6022: monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
... Tamilarasi muthamizhan
01:09 PM Bug #6022 (Resolved): monitor crashed during ceph-deploy mon create on centos 6.4 and 6.3
logs: ubuntu@teuthology:/a/teuthology-2013-08-16_01:10:04-ceph-deploy-master-testing-basic-vps/109409... Tamilarasi muthamizhan
01:07 PM rgw Feature #5605 (In Progress): rgw: teuthology tests to check bucket issues in multi region env
Ian Colle
11:59 AM devops Feature #6017: ceph-deploy mon create: create on all mons in ceph.conf + then do gatherkeys if no...
Neil Levine
11:50 AM devops Feature #6017 (Resolved): ceph-deploy mon create: create on all mons in ceph.conf + then do gathe...
For mon status, use ... Sage Weil
11:59 AM devops Feature #6020: radosgw-apache opinionated package
Neil Levine
11:56 AM devops Feature #6020 (Rejected): radosgw-apache opinionated package
Sage Weil
11:54 AM devops Bug #6019 (Resolved): ceph-deploy needs to better detect yum/apt for bootstraping
ceph-deploy will fail horribly installing in CentOS because the EPEL repo does not exist.
It needs that because it...
Alfredo Deza
11:52 AM devops Feature #6018 (Resolved): Build ceph via jenkins
Set up a jenkins instance to build Ceph and push to repos.
- it should pull from a private repo, not github, so th...
Ian Colle
11:33 AM devops Feature #5845 (Rejected): Automate ceph-deploy push to ceph-extras.
Ian Colle
11:10 AM CephFS Documentation #5797 (Resolved): Document unstable nature of CephFS
Sage Weil
10:18 AM Fix #4635 (In Progress): mon: many ops expose uncommitted state
Sage Weil
10:10 AM Bug #6005 (Resolved): config stringification bug
Sage Weil
09:17 AM Bug #6005 (Resolved): config stringification bug
use of std::copy leaves separator at the end. fix in next, needs backport to dumpling Sage Weil
09:57 AM Bug #5981: osd: journal didn't preallocate
ext4, mounted with noatime,nodiratime,discard. Zoltan Arnold Nagy
09:39 AM Bug #5981 (Need More Info): osd: journal didn't preallocate
Sage Weil
09:39 AM Bug #5981: osd: journal didn't preallocate
strange, it is doing an fallocate on the journal when it creates it, which should ensure there is sufficient disk spa... Sage Weil
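As an illustration of the preallocation idea mentioned here, the following Python sketch reserves space for a file at creation time. It is not the ceph-osd code path, only a stand-in that assumes a Unix platform where `os.posix_fallocate` is available. The file name `journal` and the 1 MiB size are hypothetical.

```python
import os
import tempfile


def preallocate(path, size):
    """Create `path` and reserve `size` bytes for it up front.

    Returns the resulting file size as reported by fstat.
    """
    fd = os.open(path, os.O_CREAT | os.O_RDWR, 0o600)
    try:
        # Raises OSError (for example ENOSPC) if the space cannot be reserved,
        # so a full or undersized filesystem is detected at creation time.
        os.posix_fallocate(fd, 0, size)
        return os.fstat(fd).st_size
    finally:
        os.close(fd)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        journal = os.path.join(tmp, "journal")  # hypothetical file name
        print(preallocate(journal, 1 << 20))
```

If the reservation succeeds, later writes within the reserved range should not fail for lack of space, which is why a journal running out of room after such a call looks strange.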
03:59 AM Bug #5981: osd: journal didn't preallocate
OK, the issue was that the journal mountpoint filled up because it was undersized.
Would it be poss...
Zoltan Arnold Nagy
09:40 AM Bug #5988: librados: synchronous IO generally returns on ack instead of commit
Sage Weil
09:37 AM Bug #5979: librados: imposes internal tooling expectations on users
on second thought, i think we should just drop the mention of the -m and just say 'no monitors specified' or somethin... Sage Weil
12:49 AM devops Feature #5847 (Resolved): Build own versions of most recent leveldb for all supported platforms.
Latest leveldb has been added to ceph-extras repo. Anonymous
12:29 AM Bug #5492: scripts installing into /usr/usr/sbin (with --prefix=/usr)
Gary Lowell wrote:
> Thanks Danny. I tested $(exec_prefix)$(sbindir) on rpm and debian builds and it looks like do...
Yixun Lan

08/15/2013

11:13 PM CephFS Bug #6004: osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(loff_t)length-opos)
Sage Weil wrote:
> looks like a read vs truncate race...
> [...]
looks like client releases Fr cap too early or ...
Zheng Yan
10:24 PM CephFS Bug #6004 (Fix Under Review): osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(lof...
Sage Weil
09:28 PM CephFS Bug #6004: osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(loff_t)length-opos)
looks like a read vs truncate race...... Sage Weil
09:20 PM CephFS Bug #6004 (Resolved): osdc/ObjectCacher.cc: 738: FAILED assert(bh->length() <= start+(loff_t)leng...
... Sage Weil
10:53 PM CephFS Bug #5021: ceph-fuse: crash on traceless reply
hit this again,... Sage Weil
09:18 PM CephFS Bug #5927: kcephfs: ENOTEMPTY on rm -r
Sage Weil
09:14 PM Bug #6003 (Resolved): journal Unable to read past sequence 406 ...
* fix : commit:bae1f3eaa09c4747b8bfc6fb5dc673aa6989b695... Sage Weil
07:25 PM Feature #6002 (Resolved): EC: [link] erasure coding library plugin API documentation, including a...
Samuel Just
07:24 PM Feature #6001 (Resolved): EC: [link] jerasure plugin
Samuel Just
07:23 PM Feature #6000 (Resolved): EC: [link] erasure plugin mechanism and abstract API
Samuel Just
07:22 PM Feature #5999 (Resolved): EC: [link] OSD internals must work in terms of cpg_t
Samuel Just
07:21 PM Feature #5998 (Resolved): EC: [link] FileStore must work with ghobjects rather than hobjects
Samuel Just
07:20 PM Feature #5997 (Resolved): EC: [link] Refactor scrub to use PGBackend methods
Samuel Just
07:19 PM Feature #5996 (Resolved): EC: [link] PG::calc_acting and friends should always choose the shortes...
Samuel Just
07:18 PM Feature #5995 (Resolved): EC: [link] Getinfo should use PGBackend methods to determine when peeri...
Samuel Just
07:17 PM Feature #5994 (Resolved): EC: [link] Backfill should be able to handle multiple backfill peers
Samuel Just
07:13 PM Feature #5993 (Resolved): EC: [link] Refactor recovery to use PGBackend methods
Samuel Just
07:12 PM Feature #5992 (Resolved): EC: [link] Refactor Backfill to use PGBackend methods
Samuel Just
07:12 PM Feature #5991 (Resolved): EC: [link] Backfill peers should not be included in the acting set
Samuel Just
07:11 PM Feature #5990 (Resolved): EC: [link] Factor out the ReplicatedPG object replication and client wr...
Samuel Just
07:09 PM Subtask #5433: Factor out the ReplicatedPG object replication and client IO logic as a PGBackend ...
Samuel Just
07:09 PM Subtask #5046 (Resolved): Factor out PG logs, PG missing
Samuel Just
06:35 PM Tasks #5848 (Resolved): add perf counter for each RecoveryMachine state
David Zafman
01:57 PM Tasks #5848 (Fix Under Review): add perf counter for each RecoveryMachine state
David Zafman
10:45 AM Tasks #5848 (In Progress): add perf counter for each RecoveryMachine state
David Zafman
05:45 PM Feature #5964 (Fix Under Review): ceph-post-file (to replace/supplement cephdrop)
Sage Weil
01:45 PM Feature #5964 (In Progress): ceph-post-file (to replace/supplement cephdrop)
Sage Weil
05:44 PM Feature #5904 (Fix Under Review): hello world osd class (with explanatory documentation/comments!)
Sage Weil
05:27 PM Bug #5973 (Resolved): ceph --admin-daemon return code broken
Sage Weil
04:51 PM Feature #5905 (Fix Under Review): hello world librados program (with explanatory comments!)
wip-5905 and pull request https://github.com/ceph/ceph/pull/508 Greg Farnum
01:46 PM Feature #5905 (In Progress): hello world librados program (with explanatory comments!)
Sage Weil
04:36 PM Fix #5989 (Resolved): librados: document that bufferlist usage model is inconsistent
I discussed this on irc and it's not clear if we want to provide any guarantees or not, but it's certainly unpleasant... Greg Farnum
04:33 PM Bug #5988 (Resolved): librados: synchronous IO generally returns on ack instead of commit
This is not defaulting to data safety, and the synchronous functions don't provide any interface for doing something ... Greg Farnum
04:32 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
This has just happened to me, so a log with 'debug mon = 20' and 'debug ms = 1' and 'debug monc = 20' is attached.
...
Rob Taylor
12:32 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
So after reinstalling the server, this went away. Next time I run into this, I'll update. Zoltan Arnold Nagy
04:13 PM Feature #5909 (In Progress): mon: keep track of monitor store size estimate vs 'du $mon_data'
Joao Eduardo Luis
04:10 PM Fix #4635: mon: many ops expose uncommitted state
We've fixed a couple of cases on the OSDMonitor and merged them into master. I'll keep this open for a while longer ... Joao Eduardo Luis
04:05 PM Bug #5959 (Pending Backport): Quorum is crashing on 'osd pool mksnap'
I'm pretty sure this is fixed by d1501938f5d07c067d908501fc5cfe3c857d7281 on next. Joao Eduardo Luis
03:21 PM Bug #5959 (In Progress): Quorum is crashing on 'osd pool mksnap'
I haven't been able to reproduce this either on 0.61.7 or -earlier- *latest* versions.
I was however able to trigg...
Joao Eduardo Luis
04:03 PM Bug #5986 (Pending Backport): mon: FAILED assert(snaps.count(s)) when removing pool snap on 0.61.7
Well, duh.
This was fixed by Sage (and reviewed by me) on d90683fdeda15b726dcf0a7cab7006c31e99f146
Joao Eduardo Luis
03:37 PM Bug #5986: mon: FAILED assert(snaps.count(s)) when removing pool snap on 0.61.7
I can now confirm this is also really easy to trigger on cuttlefish HEAD. Joao Eduardo Luis
03:20 PM Bug #5986 (Resolved): mon: FAILED assert(snaps.count(s)) when removing pool snap on 0.61.7
While attempting to reproduce #5959, I managed to trigger this crash. It doesn't trigger on next, but I'm able to tr... Joao Eduardo Luis
03:53 PM Documentation #5987 (Resolved): document requirement for monitor host time sync
We used to say words about keeping monitor hosts in timesync with NTP or the like, but I can no longer find any menti... Dan Mick
03:14 PM Bug #5985 (Resolved): very slow recovery for some objects
The snap cloning recovery logic can cause a push transaction to generate dozens of tiny writes followed by dozens of ... Samuel Just
03:11 PM Bug #5951: osd: next: EEXIST on mkcoll
2 failures in previous set, 1 mon clock, 1 stuck wait_backfill. Didn't have logging. D'oh. Rerunning.
~/teuthol...
Samuel Just
02:32 PM rgw Feature #5611: rgw: swift GET request for object with custom metadata should show custom metadata
Neil Levine
01:45 PM Feature #5906 (Resolved): mon: better ceph -s output
Sage Weil
01:29 PM Feature #5984 (Resolved): mon: probe monitors to check on their status regardless of quorum
This could be used to figure out whether a monitor is up and, if it is, what its excuse is for not being in the quorum.
W...
Joao Eduardo Luis
01:09 PM Bug #5982 (Rejected): injectargs seems to be broken for bools
Sage Weil
01:08 PM Bug #5982: injectargs seems to be broken for bools
Stefan Priebe wrote:
> at least to me this doesn't change anything:
> ceph osd tell \* injectargs -- "--osd_recover...
Sage Weil
12:51 PM Bug #5982: injectargs seems to be broken for bools
at least to me this doesn't change anything:
ceph osd tell \* injectargs -- "--osd_recover_clone_overlap false"
ok
...
Stefan Priebe
12:49 PM Bug #5982: injectargs seems to be broken for bools
you need a -- to make the cli stop parsing the option. or a space in there.
ceph tell osd.0 injectargs -- --some-...
Sage Weil
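The `--` convention described above can be sketched with Python's standard `argparse` module. This is a generic illustration, not the ceph CLI's own parser. The `demo` program name, the `--verbose` flag, and the `words` positional are all hypothetical.

```python
import argparse


def build_parser():
    """Build a small stand-in parser with one flag and free-form words."""
    parser = argparse.ArgumentParser(prog="demo")
    parser.add_argument("--verbose", action="store_true")  # hypothetical flag
    parser.add_argument("words", nargs="+")
    return parser


def parse(argv):
    """Parse argv with the stand-in parser and return the namespace."""
    return build_parser().parse_args(argv)


if __name__ == "__main__":
    # Everything after the bare "--" is treated as a positional word, even
    # though it starts with dashes, so it can be passed through untouched.
    ns = parse(["injectargs", "--", "--osd_recover_clone_overlap", "false"])
    print(ns.words)
```

With the separator in place, the dash-prefixed text reaches the program as plain data instead of being interpreted as one of the wrapper's own options.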
12:33 PM Bug #5982: injectargs seems to be broken for bools
also for admin socket Samuel Just
12:30 PM Bug #5982 (Rejected): injectargs seems to be broken for bools
ceph2/src [wip-5910] » ./ceph tell osd.\* injectargs '--osd_recover_clone_overlap=false'
*** DEVELOPER MODE: setting...
Samuel Just
12:22 PM Bug #5981: osd: journal didn't preallocate
Oh, I forgot. This is on Ubuntu 13.04, with the following packages:
zoltan@signina:~$ dpkg -l | grep 0.61.
ii ce...
Zoltan Arnold Nagy
12:16 PM Bug #5981 (Resolved): osd: journal didn't preallocate
I had a node deployed using ceph-deploy. 7 disks in total, the journals are
on files on an SSD.
After rebooting t...
Zoltan Arnold Nagy
11:33 AM rgw Bug #5192: RGW: radosgw-admin user rm --access-key not working on bobtail
Further updates on this issue have come in from the customer, details can be found here: https://inktank.zendesk.com/... JuanJose Galvez
10:42 AM Bug #5979 (Resolved): librados: imposes internal tooling expectations on users
... Greg Farnum
10:18 AM Feature #5978 (Rejected): ceph.conf: create hierarchy
Currently we have a semi-flat hierarchy: a global section, a section per entity type, and a section per entity. The va... Yehuda Sadeh
10:10 AM rbd Bug #5977 (Resolved): librbd: python bindings need docstrings to show up in online docs
Some methods don't have docstrings, which means they don't show up in http://ceph.com/docs/master/rbd/librbdpy/ at al... Josh Durgin
09:36 AM Bug #5972: Permissions on /var/run/ceph changed causing permission error messages
this is teuthology? it does the 777 on /var/run/ceph so that we can use the asok for non-root processes. normal ins... Sage Weil
08:00 AM devops Bug #5895 (Resolved): ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish bran...
Alfredo Deza
07:48 AM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
Opened #5975; merged https://github.com/ceph/ceph-deploy/pull/44 Zack Cerza
07:41 AM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
I'll merge this pull request but I really want a ticket to stay open reminding us that this needs to be *fixed* and n... Zack Cerza
07:38 AM devops Bug #5895 (Fix Under Review): ceph-deploy: mon create command hung on ceph-create-keys in cuttlef...
I have opened a new pull request with some tested changes that fix this problem: https://github.com/ceph/ceph-deploy/... Alfredo Deza
05:34 AM devops Bug #5895 (In Progress): ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish b...
Alfredo Deza
07:46 AM devops Bug #5975 (Resolved): Find a real fix for the pushy issue of hanging/deadlocking during long-runn...
Issue #5895 caused us to have to implement a really disappointing workaround to deal with a pushy problem: https://gi... Zack Cerza
05:34 AM devops Bug #5971 (Duplicate): ceph-deploy: ceph-create-keys hung during mon create in dumpling release o...
Alfredo Deza
12:16 AM devops Bug #5947 (Resolved): ceph-deploy RPM release is pointing to the wrong repo
ceph-deploy-release packages have been rebuilt to point to http://ceph.com/packages/ceph-extras/rpm/${dist}/noarch
T...
Anonymous

08/14/2013

10:34 PM Linux kernel client Bug #5429: libceph: rcu stall, null deref in osd_reset->__reset_osd->__remove_osd
hit this again, ubuntu@teuthology:/a/teuthology-2013-08-14_01:01:26-kcephfs-next-testing-basic-plana/106215
it was...
Sage Weil
10:19 PM Bug #5973 (Resolved): ceph --admin-daemon return code broken
'ceph daemon path bad' returns error
'ceph --admin-daemon path bad' does not
see wip
Sage Weil
07:48 PM Bug #5922: osd: unfound objects on next
in this case acting set was [5,3], marking 0 down caused it to finish recovery. Samuel Just
07:47 PM Bug #5922: osd: unfound objects on next
"last_epoch_started": 145},
"recovery_state": [
{ "name": "Started\/Primary\/Active",
"e...
Samuel Just
06:23 PM Bug #5922: osd: unfound objects on next
ubuntu@plana31:~$ sudo ceph pg dump | grep recovering
dumped all in format plain
0.22 187 37 82 37...
Samuel Just
07:08 PM Bug #5972 (Won't Fix): Permissions on /var/run/ceph changed causing permission error messages
ubuntu@mira044:~$ cat /etc/issue
Ubuntu 12.04.2 LTS \n \l
ubuntu@mira044:~$ ls -ld /var/run/ceph
drwxrwxrwx 2 ro...
David Zafman
06:19 PM Bug #5951: osd: next: EEXIST on mkcoll
failed due to github:
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5951-1" -n 50 --ow...
Samuel Just
05:30 PM Bug #5958: os/DBObjectMap.cc: 393: FAILED assert(!valid || cur_iter->valid())
Uploaded osd7_omap_bug5958.tar.bz2 to cephdrop as requested, it's a tar of the current/omap directory of osd.7 that k... Eric Renfro
05:21 PM devops Bug #5895 (Duplicate): ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish bra...
Tamilarasi muthamizhan
03:01 PM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
The only way I see around this (other than pushy fixing this problem) is to avoid capturing the stdout/stderr of the ... Alfredo Deza
01:41 PM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
I opened a bug in Pushy: https://github.com/axw/pushy/issues/45
Still investigating and trying all kinds of things...
Alfredo Deza
12:56 PM devops Bug #5895 (In Progress): ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish b...
hitting this issue now on centos 6.4 - burnupi05, burnupi21.
Alfredo is already on it.
Tamilarasi muthamizhan
05:19 PM devops Bug #5971 (Duplicate): ceph-deploy: ceph-create-keys hung during mon create in dumpling release o...
A user from the community [ishkabob] has been reporting this problem on IRC and I tried reproducing this locally, i... Tamilarasi muthamizhan
04:27 PM Bug #5970 (Resolved): librados: python bindings Rados() backwards compatibility issue with 0.67.0
commit:34da9cbc33205623cf64aee1989f53dfb2c5bddd in next and commit:dd0df583e2661444287a36acc240a8ac0ec381e8 in dumpling Josh Durgin
03:53 PM Bug #5970 (Resolved): librados: python bindings Rados() backwards compatibility issue with 0.67.0
As described on ceph-users:... Josh Durgin
03:48 PM devops Feature #5523 (Resolved): libcurl 7.28+ packages
Anonymous
03:47 PM devops Bug #5947 (In Progress): ceph-deploy RPM release is pointing to the wrong repo
Anonymous
03:46 PM devops Bug #5960 (Resolved): Missing deb packages for ceph-deploy 1.2
First push only picked up the sources for some reason. I reran reprepro and confirmed that debs were included this ... Anonymous
05:58 AM devops Bug #5960 (Resolved): Missing deb packages for ceph-deploy 1.2
There are currently no ceph-deploy 1.2 deb packages in the repo http://ceph.com/packages/ceph-extras/debian/pool/main... Alfredo Deza
03:40 PM Feature #5907: permanently log all administrative actions
I choose to interpret this as follows:
1) effectful mon commands should end up in the central log
2) the osd should...
Samuel Just
03:35 PM Feature #5910 (Resolved): osd: report filestore op latency (apply and commit) to monitor
df17d021a6b517588d04cda7430ac11b2a83ef9a Samuel Just
02:37 PM rbd Feature #4550 (Resolved): Create Qemu+RBD rpm package for RHEL+CentOS 6.3 on ceph.com
Ian Colle
02:36 PM rbd Feature #4013: rbd: openstack: extend nova boot api to support going from image to volume
Ian Colle
02:36 PM rbd Feature #4017: rbd: openstack: simplify volume booting with new api
Ian Colle
02:04 PM devops Documentation #5968 (Resolved): typo in monmap
In the following page http://ceph.com/docs/master/man/8/monmaptool/, there is a typo for monmap, it is instead mentio... Tamilarasi muthamizhan
01:40 PM devops Bug #5962 (Resolved): ceph-deploy: install should default to dumpling and not cuttlefish
Merged to ceph-deploy master with hash: c9fa0912dfc75660bc3e207f9918d3f95464d71c Alfredo Deza
11:52 AM devops Bug #5962 (Fix Under Review): ceph-deploy: install should default to dumpling and not cuttlefish
Opened pull request: https://github.com/ceph/ceph-deploy/pull/43 Alfredo Deza
11:02 AM devops Bug #5962 (Resolved): ceph-deploy: install should default to dumpling and not cuttlefish
ceph-deploy install should now default to picking dumpling, instead of cuttlefish.
I tested this on ubuntu and it ...
Tamilarasi muthamizhan
01:22 PM Bug #5967 (Resolved): monitor caps parser should accept '.' as a legal unquoted character
caps mention 'entities' (type.id) a lot, and to have to quote that is annoying and unnecessary; if the parser accepte... Dan Mick
11:08 AM Feature #5964 (Resolved): ceph-post-file (to replace/supplement cephdrop)
Sage Weil
11:04 AM Bug #5963 (Won't Fix): ceph pg dump {,sum,pgs,pgs_brief} is all the same for format=plain
20:56 < paravoid> "ceph pg dump pgs_brief" seems to be the same as "ceph pg dump" to me
20:57 < paravoid> same as "c...
Faidon Liambotis
11:03 AM Feature #5905: hello world librados program (with explanatory comments!)
Sam's asked that this include omap and xattr ops, so it needs to demonstrate some semi-advanced uses of librados, not... Greg Farnum
10:23 AM Bug #5226: Some PG stay in "incomplete" state
For the record: today I replaced all OSDs with new models, and it fixed the problem: it seems that incomplete data were ... Olivier Bonvalet
09:49 AM rbd Bug #5919 (Pending Backport): qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O lea...
Oliver Francke wrote:
> Hi Josh, Sage,
>
> it looks very good right now, could not get it to break, even with 4 m...
Sage Weil
04:19 AM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
Hi Josh, Sage,
it looks very good right now, could not get it to break, even with 4 more VM's producing I/O, deep-...
Oliver Francke
09:39 AM Feature #5904 (In Progress): hello world osd class (with explanatory documentation/comments!)
Sage Weil
09:30 AM Bug #5920: mon daemon crashes
Dmitry Panov wrote:
> Hi Sage!
>
> I've cleaned up everything and have re-created the cluster. This time I replac...
Sage Weil
01:27 AM Bug #5920: mon daemon crashes
Hi Sage!
I've cleaned up everything and have re-created the cluster. This time I replaced node sn2 by node sn4.
...
Dmitry Panov
07:09 AM Subtask #5878 (In Progress): erasure plugin mechanism and abstract API
Loïc Dachary
07:09 AM Subtask #5877 (In Progress): Pluggable erasure code library
Loïc Dachary
06:18 AM Bug #5959: Quorum is crashing on 'osd pool mksnap'
s/entire pool/entire quorum/g in prev comment Andrey Korolyov
06:17 AM Bug #5959: Quorum is crashing on 'osd pool mksnap'
There is some update:
- if pool contains volumes with their own snapshots, it's more likely for entire pool to die...
Andrey Korolyov
04:09 AM Bug #5959 (Resolved): Quorum is crashing on 'osd pool mksnap'
Full backtrace is attached. Crashed mons running 0.61.7,
ceph osd pool mksnap --keyfile admin dev-rack0 snap2
201...
Andrey Korolyov
05:59 AM Subtask #5527 (Resolved): unit tests for common/sharedptr_registry.hpp
Loïc Dachary

08/13/2013

08:12 PM Bug #5958: os/DBObjectMap.cc: 393: FAILED assert(!valid || cur_iter->valid())
From irc, looks like disk/leveldb issue. Hoping to get current/omap subdir of osd in question (osd 7). Seems somewh... Samuel Just
08:07 PM Bug #5958 (Can't reproduce): os/DBObjectMap.cc: 393: FAILED assert(!valid || cur_iter->valid())

0> 2013-08-13 22:48:26.503696 7fab63ec4700 -1 os/DBObjectMap.cc: In function 'virtual bool DBObjectMap::...
Samuel Just
03:31 PM Bug #5792 (Resolved): ceph-deploy mon create hangs on fedora18 vm's
Sage Weil
07:03 AM Bug #5792: ceph-deploy mon create hangs on fedora18 vm's
Sandon, it seems that this issue is gone, are you able to reproduce this?
#5895 was very similar (if not the same)...
Alfredo Deza
02:53 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Looks good to me. Will see if I can accept pull request. Mark Kirkwood
06:20 AM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
I created a branch and applied your patch with one (minor) change which was to use `distro.init` for the RuntimeError... Alfredo Deza
02:32 PM rbd Bug #5955: qemu deadlock when librbd caching enabled (writethru or writeback).
normally completions follow this pattern... Sage Weil
01:32 PM rbd Bug #5955 (Resolved): qemu deadlock when librbd caching enabled (writethru or writeback).
From Mike Dawson on ML:
Logs are uploaded to cephdrop with the file name mikedawson-rbd-qemu-deadlock.
- At abo...
Sage Weil
02:27 PM rgw Feature #418: rgw: object versioning
There's no one working on it right now AFAIK. I'd be more than happy to discuss this feature with you. Yehuda Sadeh
02:05 PM rgw Feature #418: rgw: object versioning
Versioning is quite a common feature nowadays, both in the S3 API as well as the Google Storage API. I really like to... Roald van Loon
02:06 PM Bug #5951: osd: next: EEXIST on mkcoll
./virtualenv/bin/teuthology-schedule --name "samuelj-5951-0" -n 50 --owner samuelj@slider testruns/5951.yaml | tee 59... Samuel Just
09:57 AM Bug #5951 (Resolved): osd: next: EEXIST on mkcoll
... Sage Weil
01:53 PM devops Bug #5895 (Resolved): ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish bran...
Merged into ceph-deploy master branch: 40c8088e011670d881432a94d8f159355d57d1e5
The problem here is that, by not r...
Alfredo Deza
12:48 PM devops Bug #5895 (Fix Under Review): ceph-deploy: mon create command hung on ceph-create-keys in cuttlef...
With Sage's suggestion to execute directly on those nodes, I was able to replicate the problem and find a fix.
A p...
Alfredo Deza
11:34 AM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
Ugh, I keep failing to replicate :(
The error on the second one is the famous ulimit configuration but that is not...
Alfredo Deza
09:39 AM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
Alfredo, I mentioned that the issue is not reproducible on a single node but the problem still exists with 2 nodes. Tamilarasi muthamizhan
01:43 PM devops Bug #5887 (Resolved): create a `-b` flag to use a ceph-deploy branch
Merged into teuthology's master branch
Hash: 13edfe6bdf48abb22a20598c6da09207529f8cc4
Alfredo Deza
08:46 AM devops Bug #5887 (Fix Under Review): create a `-b` flag to use a ceph-deploy branch
Pull request opened: https://github.com/ceph/teuthology/pull/28 Alfredo Deza
08:21 AM devops Bug #5887 (In Progress): create a `-b` flag to use a ceph-deploy branch
Alfredo Deza
01:42 PM devops Bug #5934 (Resolved): include a CHANGELOG and a --version flag
Merged into the master branch.
Hash: caf43ffd3e8c4b5fa35932b5b6784e725f5d998b
Alfredo Deza
01:40 PM devops Feature #5956 (Rejected): Implement a radosgw command in ceph-deploy
As a proof of concept, to see if ansible could work for ceph-deploy remote commands, implement a radosgw deployment w... Alfredo Deza
01:16 PM rgw Bug #5953 (Fix Under Review): rgw: drain requests when going down
Yehuda Sadeh
01:11 PM rgw Bug #5953 (Resolved): rgw: drain requests when going down
If receiving either SIGUSR1 or SIGTERM, wait (for a period of time) for requests to complete. This changes the SIGTER... Yehuda Sadeh
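The draining behaviour described here (stop taking new requests, then wait a bounded time for in-flight ones to complete) can be sketched as follows. This is an illustrative Python sketch under assumptions, not radosgw's C++ implementation; the `RequestDrainer` class and its method names are hypothetical.

```python
import threading


class RequestDrainer:
    """Track in-flight requests and wait for them to finish on shutdown."""

    def __init__(self):
        self._cond = threading.Condition()
        self._in_flight = 0
        self._stopping = False

    def begin(self):
        """Register a new request; refuse it once shutdown has started."""
        with self._cond:
            if self._stopping:
                return False
            self._in_flight += 1
            return True

    def end(self):
        """Mark one request as complete and wake any waiting drain()."""
        with self._cond:
            self._in_flight -= 1
            self._cond.notify_all()

    def drain(self, timeout):
        """Stop accepting work; return True if all requests finished in time."""
        with self._cond:
            self._stopping = True
            return self._cond.wait_for(lambda: self._in_flight == 0, timeout)
```

In a real daemon, a SIGTERM or SIGUSR1 handler would typically only set a flag, and the main loop would then call `drain(timeout)` before exiting, so that the handler itself never blocks on the lock.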
01:15 PM rgw Bug #5949 (Pending Backport): radosgw: leaks
Sage Weil
11:17 AM rgw Bug #5949 (Fix Under Review): radosgw: leaks
Sage Weil
08:40 AM rgw Bug #5949 (Resolved): radosgw: leaks
ubuntu@teuthology:/a/teuthology-2013-08-13_01:02:14-rgw-next-testing-basic-plana/104857... Sage Weil
01:02 PM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
Hi Oliver,
Okay, I think we spotted the (other) bug. I pushed another patch to the same branch; can you try again...
Sage Weil
02:01 AM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
Hi Josh,
update/upgrade our cluster with ceph version 0.56.6-28-gf07447f (f07447f28c6e468c1dc76149f51d7958b10482c2...
Oliver Francke
11:49 AM rgw Bug #5893 (Resolved): failures in s3 tests but teuthology task still passes
Anonymous
10:27 AM rgw Bug #5893: failures in s3 tests but teuthology task still passes
Commit 9799858e90d9055f387f7c991cfcbf50f48cf42d in ceph/s3-tests allows s3tests to propagate exceptions up to the cal... Anonymous
11:14 AM Bug #5920: mon daemon crashes
>Hi Dmitry,
Hi Sage, please see my replies embedded.
>This looks like fallout from a rare paxos bug (#5750), fixe...
Dmitry Panov
10:38 AM Bug #5920 (Duplicate): mon daemon crashes
Hi Dmitry,
This looks like fallout from a rare paxos bug (#5750), fixed in commit:17aa2d6d16c77028bae1d2a77903cdfd...
Sage Weil
11:12 AM Bug #5922: osd: unfound objects on next
2013-08-13 11:08:20.438338 osd.3 [WRN] slow request 25666.244730 seconds old, received at 2013-08-13 04:00:34.193533:... Samuel Just
09:56 AM Bug #5922: osd: unfound objects on next
/var/lib/teuthworker/archive/teuthology-2013-08-13_01:00:13-rados-next-testing-basic-plana/104754 Sage Weil
11:11 AM Bug #5804: mon: binds to 0.0.0.0:6800something port
Just noticed that the port is set as 6801 -- that's used by the osds iirc -- and we've seen reports of the messenger ... Joao Eduardo Luis
10:41 AM Feature #5906 (In Progress): mon: better ceph -s output
Sage Weil
10:01 AM rgw Feature #5506 (Fix Under Review): rgw: use Keystone to authenticate S3 requests
Yehuda Sadeh
09:59 AM devops Bug #5930 (Resolved): ceph-deploy: osds disappear after reboot
this was commit:10cf3088ceeb70970a1ffb8b34e5a4871115face. backported to cuttlefish too. Sage Weil
09:16 AM CephFS Bug #5950 (Rejected): kcephfs: cephfs set_layout -p 4 gets EINVAL
from ML Georg Höllrigl <georg.hoellrigl@xidras.com>:
> By the way setting the layout seems to have a bug:
>
> #...
Sage Weil
08:31 AM Fix #5844: osd: snaptrimmer should throttle itself
We've scrutinized influence of snapshotting on disk IO and got some curious results. From our point of view, snapshot... Igor Lukyanov
05:44 AM devops Bug #5947 (Resolved): ceph-deploy RPM release is pointing to the wrong repo
The new release of ceph-deploy (1.2.1) has a problem with dependencies and it is not possible to install.
From IRC...
Alfredo Deza
04:55 AM rgw Bug #5931: radosgw crashes when deleting object
I just wanted to confirm that the issue is gone. Thanks for a quick fix! Artem Salpagarov
03:47 AM Bug #5946 (Resolved): lockstatus.get_status called although check-locks: false
"work in progress":https://github.com/dachary/teuthology/tree/wip-5946
lockstatus.get_status "is called":https://git...
Loïc Dachary

08/12/2013

07:35 PM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
I found one bug - there's a fix on top of the current bobtail branch in wip-librados-aio-flush. Could you see if that... Josh Durgin
06:15 PM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
Thanks Oliver, that's good to hear. If you use cache=writethrough instead of no cache, it will have the same write pr... Josh Durgin
05:38 AM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
Hi Josh,
even under high ceph-load I was not able to break things down with rbd_cache=true after 2 hours, whereas ...
Oliver Francke
06:35 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Ok here's my revised patch that hopefully incorporates what you were suggesting. Obviously feel free to editorialize ... Mark Kirkwood
03:36 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Lol - I think opening a new ticket is fine. You can mention that you have discussed the proposed change with me.
...
Mark Kirkwood
08:34 AM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Mark, you are spot on.
I will create a new ticket so I can fix this and apply your branch, unless you want to subm...
Alfredo Deza
05:14 PM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
Tamil, you mentioned on IRC that you could not reproduce this anymore, can you confirm that is the case to resolve th... Alfredo Deza
03:47 PM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
this issue seems to be on ubuntu systems as well.
we need this issue to be resolved so we can test the upgrades fr...
Tamilarasi muthamizhan
04:21 PM rgw Bug #5931 (Resolved): radosgw crashes when deleting object
Yehuda Sadeh
02:34 PM rgw Bug #5931: radosgw crashes when deleting object
Yehuda Sadeh wrote:
> Do you have log for the entire request? Looks like an issue with multi-delete.
I hope that'...
Artem Salpagarov
09:48 AM rgw Bug #5931 (In Progress): radosgw crashes when deleting object
Ian Colle
03:05 PM devops Bug #5944 (Closed): ceph-deploy osd needs to be moved to use the new remote helpers
Debugging remote issues with ceph-deploy is very complicated if we are not logging the remote end.
`ceph-deploy os...
Alfredo Deza
01:40 PM rbd Feature #5938 (Rejected): openstack: nova: allow live-migration without detach/reattach for rbd
Currently nova requires shared local storage for live-migration. Even when it is shared, volumes are detached and re-... Josh Durgin
12:00 PM Bug #5901: stuck incomplete immediately after clean
The bug is that the primary can report that the pg is clean before the replica sees the OP_BACKFILL_FINISH message. ... Samuel Just
09:28 AM rgw Bug #5820: radosgw-admin should fail on non-valid flags
The dashed options are what I meant. I'd prefer Yehuda to review this; I'll try and ping him on it today. Anonymous
04:58 AM rgw Bug #5820: radosgw-admin should fail on non-valid flags
What do you mean by "flag"?
are the "--dashed-option" flags?
If so, here comes a pull request to fix this issue : h...
Christophe Courtaut
09:22 AM devops Bug #5489 (Resolved): ceph-deploy: mon destroy throws inappropriate message
With the addition of more logging and better output from the remote node this should no longer be an issue.
Tamil,...
Alfredo Deza
09:20 AM devops Bug #5813 (Closed): Normalize ceph-deploy Exceptions for better error reporting
With the combination of the new decorator to catch exceptions I noticed most of the exceptions are inherited from exc... Alfredo Deza
09:18 AM devops Bug #5887: create a `-b` flag to use a ceph-deploy branch
Actually, better than a flag is a `YAML` key.
There is some work being done with this, but the naming will be odd ...
Alfredo Deza
09:15 AM devops Bug #5928 (Closed): Catch KeyboardInterrupt and RuntimeError for nicer reporting
Merged into master with hash: a8dcfbbf7268cc0bfb15ac986650e37962e4d9d6 Alfredo Deza
09:13 AM devops Bug #5934 (Fix Under Review): include a CHANGELOG and a --version flag
Opened pull request: https://github.com/ceph/ceph-deploy/pull/39 Alfredo Deza
08:42 AM devops Bug #5934 (In Progress): include a CHANGELOG and a --version flag
Alfredo Deza
06:53 AM devops Bug #5934 (Resolved): include a CHANGELOG and a --version flag
ceph-deploy doesn't have a changelog where a list of changes can be seen for each release. Since the release cycle fo... Alfredo Deza
08:13 AM rgw Feature #5935 (New): rgw: implement IP-based ACL capabilities
radosgw could possibly implement access control based on source IP or header, the latter being beneficial for those b... Brian Andrus

08/11/2013

11:12 PM Bug #5920: mon daemon crashes
Hi Sage,
Sure. You can download it here: [[https://dl.dropboxusercontent.com/u/22489421/mon.tgz]]
Dmitry Panov
09:43 PM Bug #5923: osd: 6 up, 5 in; 91 active+clean, 1 remapped
ubuntu@teuthology:/a/samuelj-5901-1/101594 may be an example. (others in the same suite) Samuel Just
09:42 PM Bug #5922: osd: unfound objects on next
ubuntu@teuthology:/a/samuelj-5901-1/101594 may be an example. (others in the same suite) Samuel Just
08:53 PM rgw Bug #5931: radosgw crashes when deleting object
Yehuda Sadeh
05:27 PM rgw Bug #5931: radosgw crashes when deleting object
Do you have log for the entire request? Looks like an issue with multi-delete. Yehuda Sadeh
03:23 AM rgw Bug #5931 (Resolved): radosgw crashes when deleting object
I use 3hub mac app (S3 client) for managing files in rados. Rados is installed on two physical Ubuntu precise machine... Artem Salpagarov
12:31 PM Bug #5932 (Won't Fix): osdmaptool --create-from-conf ignores "osd pool default pg[p] num"
osdmaptool --create-from-conf ignores "osd pool default pg num" & "osd pool default pgp num" values from config, I fix... Denis kaganovich

08/10/2013

05:48 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
With your commit 1d2d9881f93df6383d99720c84d6e7397a04bba0 the problem is solved. Absolutely. I was merely thinking th... Mark Kirkwood
05:14 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Mark,
Are you sure you are able to replicate this with the latest release of ceph-deploy?
This commit fixed t...
Alfredo Deza

08/09/2013

11:42 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Sorry - was a bit keen with the 'submit' button! That mon create patch is not really doing what it should, espec... Mark Kirkwood
11:12 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
I notice I should have used "from ceph_deploy import lsb" in the above. Having said that - there is another place whe... Mark Kirkwood
09:15 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
In fact like this seems to work:... Mark Kirkwood
03:56 PM devops Bug #5911: ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Well that was a fast turnaround!
A minor niggle - I note the osd code does:...
Mark Kirkwood
03:32 PM devops Bug #5911 (Resolved): ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Sage Weil
03:27 PM devops Bug #5911 (Fix Under Review): ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart...
Opened pull request: https://github.com/ceph/ceph-deploy/pull/36 Alfredo Deza
06:42 PM Bug #5925: hung ceph_test_rados_delete_pools_parallel
The pool create process died after getting a write error:
@
2013-08-09T02:08:27.288 INFO:teuthology.task.workunit...
David Zafman
02:18 PM Bug #5925: hung ceph_test_rados_delete_pools_parallel
This looks like a test case race.
2 processes hung while mon thrasher keeps running:
ubuntu 7800 1 0 02:0...
David Zafman
10:28 AM Bug #5925 (Can't reproduce): hung ceph_test_rados_delete_pools_parallel
... Sage Weil
06:36 PM rbd Bug #5919: qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to kernel_hung_t...
The log shows one aio_flush (that started at 2013-08-08 14:19:36.425555) never completing. This log is without cachin... Josh Durgin
02:07 AM rbd Bug #5919 (Resolved): qemu-1.4.0 and onwards, linux kernel 3.2.x, ceph-RBD, heavy I/O leads to ke...
Hi,
we had a number of tickets raising, where users reported problems with latest debian-7.[01] and kernel 3.2.x/ ...
Oliver Francke
06:00 PM CephFS Bug #5927: kcephfs: ENOTEMPTY on rm -r
my patch "ceph: rework trim caps code" can explain this issue Zheng Yan
02:24 PM CephFS Bug #5927 (Resolved): kcephfs: ENOTEMPTY on rm -r
... Sage Weil
05:40 PM devops Bug #5930 (Resolved): ceph-deploy: osds disappear after reboot
While using ceph-deploy to deploy a cluster, when the test system is rebooted, the osds and mount point disappear.
th...
Tamilarasi muthamizhan
04:14 PM devops Fix #5900: Create a Python package for ceph Python bindings
Josh also raised the question of licensing.

I am not sure how to proceed here but the Python files say GPLv2
...
Alfredo Deza
04:11 PM devops Fix #5900: Create a Python package for ceph Python bindings
There are some references to the python files that will need to be updated for the new paths. The first one is for bu... Josh Durgin
03:54 PM Bug #5901: stuck incomplete immediately after clean
./virtualenv/bin/teuthology-schedule --name "samuelj-5901-1" -n 50 --owner samuelj@slider testruns/5799.yaml | tee 59... Samuel Just
02:49 PM Bug #5901: stuck incomplete immediately after clean
~/teuthology [mine?] » ./virtualenv/bin/teuthology-schedule --name "samuelj-5901-0" -n 10 --owner samuelj@slider test... Samuel Just
03:48 PM Feature #5910 (In Progress): osd: report filestore op latency (apply and commit) to monitor
Samuel Just
03:48 PM devops Bug #5928 (Fix Under Review): Catch KeyboardInterrupt and RuntimeError for nicer reporting
Pull request opened: https://github.com/ceph/ceph-deploy/pull/37 Alfredo Deza
03:29 PM devops Bug #5928 (Closed): Catch KeyboardInterrupt and RuntimeError for nicer reporting
ceph-deploy should not show tracebacks when KeyboardInterrupt or RuntimeError are raised, just displaying the actual erro... Alfredo Deza
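One way to get this kind of reporting, where only the error message is displayed instead of a full traceback, is a decorator around the command entry point. The sketch below is an assumption-laden illustration, not ceph-deploy's actual decorator; the name `catches`, its parameters, and the return value of `1` are all hypothetical.

```python
import functools
import sys


def catches(handled=(KeyboardInterrupt, RuntimeError), stream=None):
    """Decorator: report handled exceptions as a one-line error, no traceback."""
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            try:
                return func(*args, **kwargs)
            except handled as exc:
                out = stream if stream is not None else sys.stderr
                out.write(f"{type(exc).__name__}: {exc}\n")
                return 1  # non-zero status the caller can hand to sys.exit()
        return wrapper
    return decorator
```

A command function decorated with `@catches()` would then print something like `RuntimeError: <message>` and return a non-zero status, while any other exception type still propagates with its traceback.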
03:43 PM devops Bug #5929 (Resolved): debian: python-ceph should depend on libcephfs1
There are libcephfs python bindings in it as well. This dependency is already in the spec file. Josh Durgin
03:40 PM Bug #5788: ceph: try new, fallback to old can race with daemon upgrade
fixed the main path where this happens. dan still wants to clean this up and fix other similar cases. Sage Weil
02:31 PM Tasks #5585 (Resolved): test large scale expansion and contraction
Done using Mark's burnupi nodes. Resulted in the TPHandle patches. Samuel Just
02:26 PM Bug #5902: s3tests failure during parallel upgrade test
another example: ubuntu@teuthology:/a/teuthology-2013-08-09_01:30:23-upgrade-parallel-next-testing-basic-plana Sage Weil
12:58 PM rgw Bug #5921 (Resolved): rgw: failure in token revocation
Fixed by commits 1563613d83aa6937529fda8e09a0f59ff939712d and 71177fe3c7a164ad4b0c7d507949b2f07a792f9a. Added commit ... Yehuda Sadeh
09:47 AM rgw Bug #5921 (Fix Under Review): rgw: failure in token revocation
Ian Colle
07:44 AM rgw Bug #5921: rgw: failure in token revocation
can't read message (callback was renamed, need to update). Yehuda Sadeh
07:44 AM rgw Bug #5921 (Resolved): rgw: failure in token revocation
Yehuda Sadeh
12:56 PM rgw Bug #5882 (Resolved): rgw: failure to lock shard should result in another error
Done, merged at commit:068baae719b6a06487110e30b3cdcafddda0e579. Yehuda Sadeh
11:56 AM rgw Bug #5882 (Fix Under Review): rgw: failure to lock shard should result in another error
pushed change to wip-5882. Yehuda Sadeh
12:42 PM Documentation #5926 (Resolved): 5 minute quick start should deploy cluster using ceph-deploy and ...
5 minute quick start still uses mkcephfs to deploy ceph cluster and this should not be the case anymore. It should us... Tamilarasi muthamizhan
11:37 AM devops Feature #5847 (In Progress): Build own versions of most recent leveldb for all supported platforms.
Anonymous
10:46 AM Bug #5922: osd: unfound objects on next
I attached gdb to the process for too long and osd.1 got briefly marked down. This was enough to cause it to clea... Samuel Just
08:40 AM Bug #5922 (Duplicate): osd: unfound objects on next
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-08-09_01:00:22-rados-next-testing-basic-plana/100980
...
Sage Weil
10:40 AM devops Bug #5917 (Resolved): ceph-deploy: f18 install broken
Sage Weil
10:19 AM devops Bug #5917 (Fix Under Review): ceph-deploy: f18 install broken
There is an open pull request ready to be reviewed: https://github.com/ceph/ceph-deploy/pull/35
Alfredo Deza
09:39 AM devops Bug #5917 (In Progress): ceph-deploy: f18 install broken
Alfredo Deza
10:25 AM Bug #5785 (Resolved): librados: deadlock during shutdown
Sage Weil
09:57 AM Fix #5844: osd: snaptrimmer should throttle itself
(09:50:05 AM) sjust: xdeller: you mean snapshot trimming?
(09:50:58 AM) xdeller: if i may call it so, it`s about #58...
Samuel Just
09:42 AM Feature #5648: mon should use 'module'/'perm' string in MonCommands.h for caps checking
merged post-dumpling; we can backport later if we feel like it. Sage Weil
09:42 AM Feature #5648 (Resolved): mon should use 'module'/'perm' string in MonCommands.h for caps checking
Sage Weil
09:11 AM Bug #5923 (Duplicate): osd: 6 up, 5 in; 91 active+clean, 1 remapped
I suspect the same root cause as #5922.... Sage Weil
08:44 AM Bug #5920 (Need More Info): mon daemon crashes
Hi Dmitry,
Can you create a tar.gz of your mon data directory (/var/lib/ceph/mon/*) and post it somewhere? (Or co...
Sage Weil
05:47 AM Bug #5920: mon daemon crashes
OS - Ubuntu 12.04.2 x64.
Ceph packages were downloaded from the Ceph repository.
Dmitry Panov
05:43 AM Bug #5920 (Duplicate): mon daemon crashes
Hello!
I'd like to report a problem. Maybe it will help you to improve Ceph.
I've created Ceph cluster...
Dmitry Panov

08/08/2013

10:53 PM Feature #5918 (Rejected): osd: friendly error message about missing xattr support
nevermind, it was the use omap option that was missing. Sage Weil
10:39 PM Feature #5918 (Rejected): osd: friendly error message about missing xattr support
a user in irc hit this:... Sage Weil
10:51 PM devops Bug #5208 (Resolved): Debian Wheezy Needs the 'ca-certificates' package before you can wget the p...
Sage Weil
10:51 PM Bug #5897 (Pending Backport): ceph_test_rados_api_watch_notify hang on LibRadosWatchNotify.WatchN...
Sage Weil
10:48 PM devops Bug #5903 (Resolved): ceph-deploy ignores defined xfs mount options
Sage Weil
06:06 PM devops Bug #5903 (Pending Backport): ceph-deploy ignores defined xfs mount options
Sage Weil
04:13 PM devops Bug #5903 (Fix Under Review): ceph-deploy ignores defined xfs mount options
There is a pull request that needs some reviewing that addresses the problem: https://github.com/ceph/ceph/pull/489 Alfredo Deza
01:27 PM devops Bug #5903 (Resolved): ceph-deploy ignores defined xfs mount options
When specifying specific XFS mount options within ceph.conf, ceph-deploy ignores defined mount options and mounts wit... Brian Andrus
10:10 PM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
It looks to me like the last command that mon create ran is finished, but pushy still has its connection open. that ... Sage Weil
09:48 PM devops Bug #5917 (Resolved): ceph-deploy: f18 install broken
2013-08-08T01:21:46.541 INFO:teuthology.orchestra.run.err:[10.214.138.154]: [vpm099][INFO ... Sage Weil
09:30 PM Messengers Feature #5916 (Resolved): msgr: allow connect to multi-addr peer
add a multi-addr get_connection() method that lets you specify an ordered list of peer addrs. try them in succession... Sage Weil
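The ordered-fallback idea in #5916 can be sketched as follows. This is only an illustration in Python, not the messenger code: `connect_first` and the `connect` callable are hypothetical names, and the real `get_connection()` interface is not shown in this log.

```python
def connect_first(addrs, connect):
    """Try each peer address in order and return (addr, connection)
    for the first one that succeeds.

    `addrs` is an ordered iterable of addresses; `connect` is any
    callable that returns a connection or raises OSError on failure.
    Both are assumptions made for this sketch.
    """
    last_error = None
    for addr in addrs:
        try:
            return addr, connect(addr)
        except OSError as exc:
            # Remember why this address failed and move on to the next one.
            last_error = exc
    raise ConnectionError(f"no address reachable: {last_error}")
```

Passing the connect function in keeps the ordering logic independent of the transport, so the same loop would apply whether an address is a plain socket or an rsockets one.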
09:27 PM Messengers Feature #5915 (Resolved): msgr: allow binding to multiple addresses, address types
Currently we either bind to ipv4 or ipv6. With the rsockets code, we may also bind to an rsockets v4 or v6 addr. Al... Sage Weil
09:25 PM Messengers Feature #5914 (Rejected): msgr: use either socket or rsocket calls
Use either the normal socket api or the rsockets one, depending on the peer entity_addr_t type. This will affect Pip... Sage Weil
09:24 PM Messengers Feature #5913 (Rejected): msgr: add wrappers for remaining socket calls in Pipe, Accepter
Almost all socket calls are confined to Accepter.cc (which is small) and Pipe.cc (which is not). Most actual socket ... Sage Weil
09:23 PM Messengers Feature #5912 (Rejected): msgr: create rsockets entity_addr_t type
use the (currently unused) entity_addr_t::type to have an 'rsockets' flavor of IP addresses so that we can disambigua... Sage Weil
07:23 PM Bug #5823: cpu load on cluster node is very high, client can't get data on pg from primary node ...
ceph-osd process is causing the high load, ...
When I used htop to monitor CPU load, I saw that %cpu spend fo...
Khanh Nguyen Dang Quoc
01:22 PM Bug #5823: cpu load on cluster node is very high, client can't get data on pg from primary node ...
Which process is causing the load? Samuel Just
06:30 PM Bug #5746 (Resolved): ceph cli interactive mode completely broken
Sage Weil
06:20 PM Bug #5872 (Resolved): ceph daemon <name> badcommand does not return error
Sage Weil
08:32 AM Bug #5872: ceph daemon <name> badcommand does not return error
Sage Weil
06:11 PM devops Bug #5911 (Resolved): ceph-deploy to Ubuntu targets touches sysvinit for mons and upstart for osds
Testing ceph-deploy from git master (checked out 6 Aug ad1f3f3689e8b060a13f715e7ac3735f91e9c546) for Ubuntu 12.04 tar... Mark Kirkwood
04:03 PM Feature #5910 (Resolved): osd: report filestore op latency (apply and commit) to monitor
ceph osd dump perf_info (maybe?)
Could also dump other potentially useful perf information.
Samuel Just
04:01 PM Feature #5909 (Resolved): mon: keep track of monitor store size estimate vs 'du $mon_data'
Allows us to figure out whether the monitor is starting to suffer from leveldb's compaction shenanigans Joao Eduardo Luis
03:37 PM Feature #5908 (Rejected): mon: formatted output sections should be consistent across services and...
With the rest api, almost *everything* in the monitor now outputs to a formatter. However, someone parsing this outp... Joao Eduardo Luis
03:33 PM Feature #5907 (Resolved): permanently log all administrative actions
Samuel Just
03:28 PM Feature #5906 (Resolved): mon: better ceph -s output
The current output is hard to scan.
Each status on a different line.
active/non-active
recovering/backfilling/re...
Samuel Just
03:22 PM Feature #5905: hello world librados program (with explanatory comments!)
This should probably be in a separate repo? Samuel Just
03:19 PM Feature #5905 (Resolved): hello world librados program (with explanatory comments!)
Samuel Just
03:18 PM Feature #5904 (Resolved): hello world osd class (with explanatory documentation/comments!)
Samuel Just
01:20 PM Bug #5749 (Duplicate): osd: unfound objects on cuttlefish
This is a dup of #5799, backported the fix. Samuel Just
11:22 AM Bug #5822 (Resolved): ceph_argparse.py: parsing of required arguments broken
Dan Mick
09:17 AM Bug #5902 (Resolved): s3tests failure during parallel upgrade test
... Sage Weil
08:31 AM rgw Bug #5874 (Resolved): rgw: cuttlefish cls_rgw tests fails against next
backported the preload osd class patches to cuttlefish and enabled in teuthology so we can avoid this problem in test... Sage Weil
01:38 AM Documentation #5899 (Closed): transcript of Emperor CDS discussion about erasure coding
Published "in the wiki":http://wiki.ceph.com/01Planning/CDS/Emperor/Transcript_:_Erasure_coded_storage_backend_%28ste... Loïc Dachary
01:34 AM Subtask #5510: ObjectContext : replace ref with shared_ptr
rebased against master Loïc Dachary

08/07/2013

02:15 PM devops Fix #5900 (Fix Under Review): Create a Python package for ceph Python bindings
Branch `wip-5900` was pushed and it needs a few things addressed:
* Versioning -> (it currently is defaulting to 0...
Alfredo Deza
12:20 PM devops Fix #5900 (In Progress): Create a Python package for ceph Python bindings
The `src/pybind` directory needs to be converted to a proper Python package structure so we can create and publish Py... Alfredo Deza
12:40 PM Bug #5901 (Duplicate): stuck incomplete immediately after clean
2013-08-06T19:19:00.289 INFO:teuthology.orchestra.run.err:[10.214.133.31]: dumped all in format json
2013-08-06T19:1...
Samuel Just
09:37 AM Feature #4929: Erasure encoded placement group
Archive of the erasure code session "PAD":http://pad.ceph.com/p/osd-erasure-coding of the Emperor session summit.
...
Loïc Dachary
08:45 AM Bug #5898 (Resolved): ceph cli lost "osd set/unset noscrub/nodeep-scrub"
Sage Weil
02:31 AM Documentation #5899 (Closed): transcript of Emperor CDS discussion about erasure coding
as found at http://youtu.be/-K8bSHx7zJ0?t=30m26s Loïc Dachary

08/06/2013

10:15 PM rgw Bug #5893: failures in s3 tests but teuthology task still passes
So the error I'm seeing is when the s3readwrite test goes to read an object and doesn't find it (due to a YAML issue ... Anonymous
02:20 PM rgw Bug #5893 (Resolved): failures in s3 tests but teuthology task still passes
In a dev branch, I had the read tests for s3 fail but the overall teuthology task passed. This seems like a pretty bi... Anonymous
06:39 PM rgw Bug #5874: rgw: cuttlefish cls_rgw tests fails against next
So basically this is #5752. We can try working around it by running the objclass unitest before the upgrade (which wi... Yehuda Sadeh
04:21 PM rgw Bug #5874: rgw: cuttlefish cls_rgw tests fails against next
The osd hasn't been restarted at this point. Yehuda Sadeh
04:20 PM rgw Bug #5874: rgw: cuttlefish cls_rgw tests fails against next
We get this, which looks like #5752:... Yehuda Sadeh
01:56 PM rgw Bug #5874: rgw: cuttlefish cls_rgw tests fails against next
It looks like some osd crashed here:... Yehuda Sadeh
06:32 PM Bug #5898 (Fix Under Review): ceph cli lost "osd set/unset noscrub/nodeep-scrub"
Dan Mick
06:27 PM Bug #5898 (Resolved): ceph cli lost "osd set/unset noscrub/nodeep-scrub"
These flags were added at a time while the Ceph CLI restructuring was in progress, and
got lost in the transition. ...
Dan Mick
06:15 PM Bug #5897 (Resolved): ceph_test_rados_api_watch_notify hang on LibRadosWatchNotify.WatchNotifyTim...
2013-08-06T04:50:26.730 INFO:teuthology.task.thrashosds.thrasher:Added osd 1
2013-08-06T04:50:27.345 INFO:teuthology...
Samuel Just
06:00 PM Bug #5896 (Resolved): mon: MonmapMonitor: 'ceph mon add' always returns 'mon already exists'
on next (ceph version 0.67-rc3-100-ge8d72ef (e8d72ef0402a42c8aa8ea624497d13db81d5e361))... Joao Eduardo Luis
06:00 PM devops Bug #5895: ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish branch on RHEL 6.3
oops, it is not centos 6.3, it is RHEL 6.3 Tamilarasi muthamizhan
06:00 PM devops Bug #5895 (Resolved): ceph-deploy: mon create command hung on ceph-create-keys in cuttlefish bran...
on centos 6.3, when trying to deploy ceph using --stable=cuttlefish branch, the mon create command hung on ceph-creat... Tamilarasi muthamizhan
05:03 PM Bug #5894 (Can't reproduce): test_daemon teuthology failure
Dan Mick
05:02 PM Bug #5894: test_daemon teuthology failure
Yeah, this was fixed with 9466a0b152beeab894be66c6528495ca01a1974a I believe. If it happens again with a version >= ... Dan Mick
04:54 PM Bug #5894: test_daemon teuthology failure
ubuntu@teuthology:/a/teuthology-2013-08-05_01:00:16-rados-next-testing-basic-plana/96941 Samuel Just
04:31 PM Bug #5894 (Can't reproduce): test_daemon teuthology failure
2013-08-06T02:37:04.714 INFO:teuthology.task.workunit.client.0.out:[10.214.132.22]: crush_ruleset: 2
2013-08-06T02:3...
Samuel Just
04:58 PM Bug #5503 (Resolved): osd: ceph --admin-daemon interface doesn't handle spaces in names
Dan Mick
04:17 PM Bug #5746 (Fix Under Review): ceph cli interactive mode completely broken
Dan Mick
02:14 PM Bug #5749: osd: unfound objects on cuttlefish
ubuntu@teuthology:/a/teuthology-2013-08-04_01:30:24-upgrade-parallel-next-testing-basic-plana/96526
Another instance?
Samuel Just
01:50 PM rgw Bug #5885: Valgrind issue found while running s3 and swift tests
And now it shows up again with my wip-56040-ysw-5602-buck teuthology branch. I wonder if we're introducing some kind ... Anonymous
12:14 PM rgw Bug #5885: Valgrind issue found while running s3 and swift tests
This test is now passing against next. Uploading the archive from the run. Guessing we can close this and attribute i... Anonymous
10:18 AM rgw Feature #4342: rgw: dr: data sync agent: update sync processing state
    - mark bucket index log progress on local (target) zone
Sage Weil
10:18 AM rgw Feature #4341: rgw: dr: data sync agent: implement delta changes sync
    - watch (sharded) data log: which buckets were modified
      - lock/mark each shard
      - processes it
    ...
Sage Weil
10:17 AM rgw Feature #4340: rgw: dr: data sync agent: implement full sync
- use special admin user
- use admin metadata api to enumerate all buckets
- use normal api to enumerat...
Sage Weil
07:39 AM devops Bug #5499: ceph-deploy --cluster clustername osd prepare fails
Amit Vijairania wrote:
> Hello! I'm running into similar issue..
>
> ***
> root@svl-swift-1:/home/ceph# /usr/lo...
Amit Vijairania
07:33 AM devops Bug #5499: ceph-deploy --cluster clustername osd prepare fails
Hello! I'm running into a similar issue..... Amit Vijairania
05:22 AM Bug #5891 (Won't Fix): rados bench displaying wrong unit
When using rados bench via a gigabit Ethernet [1] I can reach up to 111.7 MB/s [2]. With a different tool I can reach... Niklas Goerke
04:01 AM rbd Bug #5890: can't remove rbd image from pool
ceph version 0.61.7 (8f010aff684e820ecc837c25ac77c7a05d7191ff) Khanh Nguyen Dang Quoc
04:01 AM rbd Bug #5890 (Closed): can't remove rbd image from pool
When I run rbd -p newpool ls, I see 2 images (bk_cache, bk_cache1) in there.
Then I delete the image bk_cache ,bu...
Khanh Nguyen Dang Quoc

08/05/2013

03:43 PM devops Feature #5730 (Resolved): Show what command is being executed
There was not a specific pull request for this since it was included as part of a bigger refactoring.
The new modu...
Alfredo Deza
02:45 PM devops Bug #5887 (Resolved): create a `-b` flag to use a ceph-deploy branch
Teuthology does not currently allow passing in a flag to point to a `ceph-deploy` branch, and it will
use...
Alfredo Deza
01:13 PM devops Bug #5208 (In Progress): Debian Wheezy Needs the 'ca-certificates' package before you can wget th...
Alfredo Deza
01:09 PM CephFS Bug #5883: mds: broken locking, ref count in handle_accept

warning: Can't read pathname for load map: Input/output error.
[Thread debugging using libthread_db enabled]
Usin...
Sage Weil
12:52 PM CephFS Bug #5883 (Resolved): mds: broken locking, ref count in handle_accept
Sage Weil
12:56 PM rgw Bug #5885 (Resolved): Valgrind issue found while running s3 and swift tests
While validating some teuthology changes, a test run found a Valgrind issue with RGW. I'm attaching the archive outpu... Anonymous
12:52 PM Bug #5884 (Resolved): negative num_objects_degraded in pool stats
On `mira103` I'm seeing a negative value for `num_objects_degraded` for `pool 5`. I run `ceph pg dump pools` and see ... Noah Watkins
12:30 PM rgw Bug #5875 (Resolved): rgw: usage logging should only account object store operations
Fixed and merged, commit:9e88b1b48c1f990ae202085d0a809f8ce6b18743. Yehuda Sadeh
12:06 PM Bug #5872: ceph daemon <name> badcommand does not return error
commit:9afdea21aa072f9262155c178d6ee0da9a7b1815
Dan Mick
12:05 PM Bug #5872 (Resolved): ceph daemon <name> badcommand does not return error
Dan Mick
11:34 AM Bug #5873 (Duplicate): osd: unfound object from thrashing when all osds are up
This was probably caused by the same bug as 5799. We'll see if it comes up again with that patch in next. Samuel Just
11:33 AM devops Bug #5849 (Resolved): ceph-deploy install does not create ceph.conf in /etc/ (CentOS and Ubuntu)
Confirmed the lack of writing to the configuration file when `mon create` was being called.
Fixed and merged to ce...
Alfredo Deza
11:28 AM devops Documentation #5881 (Closed): [ceph-deploy] Naming cluster example command shows wrong example
Just merged into ceph's master branch.
Hash: e8d72ef0402a42c8aa8ea624497d13db81d5e361
Alfredo Deza
08:00 AM devops Documentation #5881 (Closed): [ceph-deploy] Naming cluster example command shows wrong example
In the ceph-deploy documentation, in the Naming Cluster section (http://ceph.com/docs/next/rados/deployment/ceph-depl... Daniele Stroppa
11:02 AM rgw Bug #5882 (Resolved): rgw: failure to lock shard should result in another error
currently returning 500, should find some appropriate 4xx error code. Yehuda Sadeh
09:25 AM Bug #4254: osd: failure to recover before timeout on rados bench and thrashing; negative stats
I think I'm seeing the negative stats again on mira103
`sudo ceph -f json-pretty pg dump pools | grep "\-"`
Noah Watkins
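The `grep "\-"` check above also matches any hyphen in names or dates. A slightly stricter variant, sketched here under the assumption that the JSON from `ceph -f json-pretty pg dump pools` has been saved or piped in as text, walks the parsed structure and reports only negative numbers. The sample document below is made up for illustration; only `num_objects_degraded` is a field name mentioned in this log.

```python
import json


def find_negative(node, path=""):
    """Yield (path, value) for every negative number in parsed JSON."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from find_negative(value, f"{path}/{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from find_negative(value, f"{path}/{index}")
    elif isinstance(node, (int, float)) and not isinstance(node, bool) and node < 0:
        yield path, node


# Hypothetical sample; real output has many more fields.
sample = '[{"poolid": 5, "stat_sum": {"num_objects": 12, "num_objects_degraded": -3}}]'
for path, value in find_negative(json.loads(sample)):
    print(path, value)
```

Reporting the path alongside the value makes it easier to see which pool and which counter went negative.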
07:16 AM RADOS Documentation #5880 (New): erasure-code: library plugin API documentation, including a simple exa...
Loïc Dachary
07:12 AM Subtask #5879 (Resolved): jerasure plugin
"work in progress":https://github.com/ceph/ceph/pull/538
Create a standalone jerasure plugin, outside of the Ceph ...
Loïc Dachary
07:10 AM Subtask #5878 (Resolved): erasure plugin mechanism and abstract API
"work in progress":https://github.com/ceph/ceph/pull/518
The "abstract API":https://github.com/dachary/ceph/blob/w...
Loïc Dachary
07:08 AM Subtask #5877 (Resolved): Pluggable erasure code library
The "pluggable erasure code library":https://github.com/dachary/ceph/blob/wip-4929/doc/dev/osd_internals/erasure-code.... Loïc Dachary
03:17 AM rbd Bug #5876 (Resolved): Assertion failure in rbd_img_obj_callback() : rbd_assert(which >= img_reque...
Hi,
with CuttleFish with RBD kernel client (from Linux 3.9.11), I have this kernel BUG :...
Olivier Bonvalet

08/04/2013

10:25 PM Bug #5869 (Fix Under Review): osd/OSD.h: 447: FAILED assert(ret) in get_map
wip-5869 Sage Weil
03:05 PM Bug #5869: osd/OSD.h: 447: FAILED assert(ret) in get_map
it's osd.2 on this run:... Sage Weil
09:14 PM rgw Bug #5875 (Resolved): rgw: usage logging should only account object store operations
Don't aggregate admin stuff, etc. The problem is that it currently does that, and moreover it sets a 'bucket' name wh... Yehuda Sadeh
09:13 PM CephFS Bug #4685: BUG: unable to handle kernel NULL pointer dereference at
proposed fix in testing branch, 'ceph: fix null pointer dereference' Sage Weil
08:59 PM rbd Bug #5760: libceph: osdc_build_request(): BUG_ON(p > msg->front.iov_base + msg->front.iov_len);
another user hit this: http://marc.info/?l=ceph-devel&m=137566541229652&w=2 Sage Weil
03:55 PM rgw Bug #5874 (Resolved): rgw: cuttlefish cls_rgw tests fails against next
... Sage Weil
03:18 PM Bug #5873: osd: unfound object from thrashing when all osds are up
also ubuntu@teuthology:/a/teuthology-2013-08-02_01:00:11-rados-next-testing-basic-plana/93547. still running too Sage Weil
03:17 PM Bug #5873 (Duplicate): osd: unfound object from thrashing when all osds are up
... Sage Weil
03:16 PM Bug #5872 (Resolved): ceph daemon <name> badcommand does not return error
... Sage Weil
03:15 PM rgw Feature #5602: rgw: teuthology task to test default region as master region
I have code out for review. Branches are:
s3-tests: wip-buck-next-2
teuthology: wip-buck-5602
This work is predi...
Anonymous
03:02 PM Fix #4635: mon: many ops expose uncommitted state
teuthology-2013-08-02_01:00:11-rados-next-testing-basic-plana/93344 for a recent occurrence Sage Weil
02:48 PM CephFS Bug #5871 (Resolved): client.4145 released lease on dn 1000000061b/f108 which dne
failure_reason: '"2013-08-03 12:43:47.073985 mds.0 10.214.133.35:6801/3070 2 : [WRN]
client.4145 released lease on...
Sage Weil
10:42 AM devops Bug #5849: ceph-deploy install does not create ceph.conf in /etc/ (CentOS and Ubuntu)
install should install the deb/rpm which will mkdir /etc/ceph. mon create and osd create will put ceph.conf there. ... Sage Weil
08:34 AM devops Bug #5849 (Need More Info): ceph-deploy install does not create ceph.conf in /etc/ (CentOS and Ub...
Whose responsibility is it to create `/etc/ceph.conf`? ceph-deploy?
Because I can't find a place where `ceph-deploy...
Alfredo Deza

08/03/2013

11:28 AM Bug #5869 (Resolved): osd/OSD.h: 447: FAILED assert(ret) in get_map
note this was from yesterday... maybe fixed now?... Sage Weil

08/02/2013

09:34 PM Bug #5822 (Fix Under Review): ceph_argparse.py: parsing of required arguments broken
wip-5822 Dan Mick
09:03 PM Bug #5799 (Resolved): SIGABRT in build_push_op -> object_info_t::decode
Samuel Just
02:10 PM Bug #5799 (Fix Under Review): SIGABRT in build_push_op -> object_info_t::decode
Sage Weil
01:36 PM Bug #5799: SIGABRT in build_push_op -> object_info_t::decode
Sage Weil
12:03 PM Bug #5799: SIGABRT in build_push_op -> object_info_t::decode
Just kidding, ext4 just happens to be slow enough to trigger it. wip-5799. ubuntu@teuthology:/a/samuelj-5799-1/92938 Samuel Just
08:01 PM CephFS Bug #5864: cfuse_workunit_suites_ffsb suite on Centos hangs with *** Got Signal Interrupt ***
I believe that this is indicative of a problem with ceph-fuse as opposed to a teuthology issue (I could also be wrong). Anonymous
08:00 PM CephFS Bug #5864 (Closed): cfuse_workunit_suites_ffsb suite on Centos hangs with *** Got Signal Interrup...
The cfuse_workunit_suites_ffsb suite seems to get stuck with the message *** Got Signal Interrupt *** After several ... Anonymous
07:15 PM Subtask #5863 (Rejected): OSD internals must work in terms of cpg_t
A cpg_t type should be created (basically pair<pg_t, chunk_id_t>). All osd internal maps and PG<->PG messages should... Samuel Just
07:13 PM Subtask #5862 (Resolved): FileStore must work with ghobjects rather than hobjects
"pull request":https://github.com/ceph/ceph/pull/546
vhobjects should be basically tuple<hobject_t, version_t, chu...
Samuel Just
06:58 PM Subtask #5861 (Resolved): Refactor scrub to use PGBackend methods
ScrubMap should be adapted to allow for PGBackend specific metadata Samuel Just
06:48 PM Subtask #5860 (Rejected): PG::calc_acting and friends should use PGBackend to select the acting s...
Samuel Just
06:46 PM Subtask #5859 (Rejected): GetInfo should use PGBackend methods to determine when peering can cont...
Samuel Just
06:46 PM Subtask #5858 (Resolved): Backfill should be able to handle multiple backfill peers
Samuel Just
06:45 PM Subtask #5857 (Resolved): Refactor recovery to use PGBackend methods
"work in progress":https://github.com/athanatos/ceph/tree/wip-5857
"work in progress (2)":https://github.com/athanat...
Samuel Just
06:45 PM Subtask #5856 (Resolved): Refactor Backfill to use PGBackend methods
Samuel Just
06:44 PM Subtask #5855 (Resolved): Backfill peers should not be included in the acting set
Backfill peers should be included in the IO path as they currently are, but should not be included in the acting set.... Samuel Just
05:06 PM Documentation #5854 (Closed): ceph-deploy new needs to be clearer that "the names here are mon na...
Many people assume they should name all machines in the cluster in ceph-deploy new. We
need to make the documentati...
Dan Mick
05:05 PM devops Bug #5849: ceph-deploy install does not create ceph.conf in /etc/ (CentOS and Ubuntu)
and this was on ceph branch: next Tamilarasi muthamizhan
04:12 PM devops Bug #5849: ceph-deploy install does not create ceph.conf in /etc/ (CentOS and Ubuntu)
This problem also appears on Ubuntu... Anonymous
02:57 PM devops Bug #5849 (Resolved): ceph-deploy install does not create ceph.conf in /etc/ (CentOS and Ubuntu)
For some reason, `ceph-deploy install {node}` is failing to create the config files in `/etc/ceph/`
Alfredo Deza
05:02 PM Documentation #5853 (Resolved): quickstart is freaking people out about mons and osds on same mac...
http://ceph.com/docs/master/start/quick-ceph-deploy/#add-a-monitor says
Tip
In production environments, we rec...
Dan Mick
03:48 PM Feature #5648: mon should use 'module'/'perm' string in MonCommands.h for caps checking
pushed wip-5648-b with some changes! Sage Weil
02:27 PM Feature #2009 (Resolved): osd: report performance to monitor
we did this! Sage Weil
02:26 PM Tasks #5848 (Resolved): add perf counter for each RecoveryMachine state
Samuel Just
02:23 PM Tasks #5585: test large scale expansion and contraction
Use plana nodes to expand Ian Colle
02:13 PM devops Feature #5847: Build own versions of most recent leveldb for all supported platforms.
Related to http://tracker.ceph.com/issues/5818 Ian Colle
02:11 PM devops Feature #5847 (Resolved): Build own versions of most recent leveldb for all supported platforms.
Ian Colle
02:11 PM Bug #5754: mon: occasional osd full map not trimmed
Sage Weil
02:10 PM rbd Bug #5812: qemu-kvm guest hangs on disk write with rbd storage
The cluster is healthy and i could not find slow requests in osd logs:
ceph -s
health HEALTH_OK
monmap e1:...
Kevin Fibich
01:53 PM rbd Bug #5812: qemu-kvm guest hangs on disk write with rbd storage
Was your cluster healthy and showing no slow requests? It looks like the vm didn't have rbd caching enabled, so unles... Josh Durgin
02:07 PM Bug #5818: leveldb 1.12: hang on shutdown (mon)
Sage Weil
11:34 AM Bug #5818: leveldb 1.12: hang on shutdown (mon)
looks like this: https://code.google.com/p/leveldb/issues/detail?id=125 Sage Weil
11:29 AM Bug #5818: leveldb 1.12: hang on shutdown (mon)
(&*#@ leveldb:... Sage Weil
01:07 PM rgw Feature #4339 (Resolved): rgw: multisite: metadata sync agent: update sync processing state
Ian Colle
01:07 PM rgw Feature #4338 (Resolved): rgw: multisite: metadata sync agent: implement delta changes sync
Ian Colle
01:07 PM rgw Feature #4337 (Resolved): rgw: multisite: metadata sync agent: implement full sync
Ian Colle
01:05 PM devops Bug #5763 (Resolved): ceph-deploy new [IP] should error out
Merged into ceph-deploy master
Hash: a334df3
Alfredo Deza
01:03 PM rgw Feature #5603 (In Progress): rgw: teuthology test to check secondary region creation
Ian Colle
01:03 PM rgw Feature #5602 (In Progress): rgw: teuthology task to test default region as master region
Ian Colle
01:01 PM devops Feature #4766 (Rejected): ceph-deploy: commands should continue to execute the next argument in c...
Given the whole system is idempotent, I think it is better to fail as soon as there is an error so the user can look ... Neil Levine
12:56 PM devops Bug #5839 (Resolved): Move `mon` to the new structure
Merged into ceph-deploy master
Hash: d7d6e0638f07c3693327d68ce8283b2c8fa5c444
Alfredo Deza
07:03 AM devops Bug #5839 (Resolved): Move `mon` to the new structure
We keep seeing issues with `ceph-deploy mon` commands but they are extremely difficult to investigate because we are ... Alfredo Deza
12:07 PM rgw Bug #5415 (Resolved): rgw: failing valgrind leak checks
Sage Weil
11:54 AM devops Feature #5845 (Rejected): Automate ceph-deploy push to ceph-extras.
Ian Colle
10:55 AM Fix #5844 (Resolved): osd: snaptrimmer should throttle itself
Cuttlefish still has some problems when operating with large snapshots on a cluster with a large amount of objects (~100G... Andrey Korolyov
10:14 AM rgw Bug #5843 (Won't Fix): swift api: x-container-meta-{key} should not be allowed on an object
Using 'x-container-meta-{key}' to update object custom metadata should be disallowed, since it's supposed to be used ... Alexandre Marangone
10:04 AM rgw Feature #5842 (Resolved): rgw: integrate multi-region s3tests into teuthology task
Yehuda Sadeh
09:26 AM Bug #5832 (Duplicate): mon scrub mismatch
looks like it.. scrub picked up extra osdmap keys, which is how this is manifesting Sage Weil
12:49 AM rgw Bug #5831 (Resolved): rgw: CORS info read for every request
Wip-5831 landed to next Ian Colle

08/01/2013

06:39 PM Bug #5503 (New): osd: ceph --admin-daemon interface doesn't handle spaces in names
Manually tested creating keys with spaces in key and val. David suggested maybe testing objnames with space too. Pr... Dan Mick
03:48 PM devops Bug #5835 (Resolved): Change text in package builds
Currently contains the following:
Ceph is a distributed network file system designed to provide excellent performa...
Neil Levine
03:45 PM Documentation #5834 (Resolved): Change Availability text in all of the man pages
The current man pages for ceph, rbd etc contain this paragraph:
AVAILABILITY
<COMMAND> is part of the C...
Neil Levine
03:34 PM Bug #5320 (Resolved): osd/ReplicatedPG.cc: 4753: FAILED assert(!pg_log.get_missing().is_missing(s...
...because it didn't need to be. Samuel Just
03:23 PM Bug #5320 (Pending Backport): osd/ReplicatedPG.cc: 4753: FAILED assert(!pg_log.get_missing().is_m...
This never actually got backported. Samuel Just
12:30 PM Bug #5084 (Resolved): osd: slow peering after osd restart (bobtail)
278c7b59228f614addf830cb0afff4988c9bc8cb probably took care of this? Samuel Just
12:28 PM Bug #5297 (Resolved): Slow requests after restarting an OSD (post peering)
This got merged. Samuel Just
12:25 PM Bug #5832 (Duplicate): mon scrub mismatch
/a/teuthology-2013-08-01_01:00:21-rados-next-testing-basic-plana/92213/remote
Related to #5754?
Samuel Just
12:10 PM rgw Bug #5831 (Resolved): rgw: CORS info read for every request
This is really not needed, only need to do it in the CORS related requests. Yehuda Sadeh
11:42 AM devops Bug #5617 (Resolved): ceph-deploy: log ceph version to ceph.log
Merged to ceph-deploy master branch
Hash: fc7a8e542b97d8f14fa46a80404d433075cd6ef5
Alfredo Deza
09:10 AM devops Bug #5617 (Fix Under Review): ceph-deploy: log ceph version to ceph.log
A pull request is ready to be reviewed that addresses this problem: https://github.com/ceph/ceph-deploy/pull/28 Alfredo Deza
11:02 AM rgw Bug #5789 (Resolved): rgw: can't authenticate cors requests
Tested cuttlefish, that's indeed a post-cuttlefish regression, no need to backport. Yehuda Sadeh
10:55 AM Bug #5799: SIGABRT in build_push_op -> object_info_t::decode
Scheduled run: samuelj-5799-0
Job scheduled with ID 92924
Job scheduled with ID 92925
Job scheduled with ID 9292...
Samuel Just
10:14 AM devops Bug #5499: ceph-deploy --cluster clustername osd prepare fails
Alfredo Deza wrote:
> Can you confirm the actual output in the node that is failing
I am sorry but we just rebuil...
Robert Sander
09:47 AM devops Bug #5499 (Need More Info): ceph-deploy --cluster clustername osd prepare fails
Can you confirm the actual output in the node that is failing when you run:... Alfredo Deza
09:58 AM Bug #5784 (Resolved): ceph_test_rados_watch_notify segfault
wip-5784 merged into next Ian Colle
09:48 AM Bug #5815 (Resolved): rest/test.py failure: FAILURE: url http://localhost:5000/api/v0.1/mds/setm...
wip-5815 merged into next Ian Colle
09:26 AM CephFS Documentation #5797: Document unstable nature of CephFS
Pull request submitted: https://github.com/ceph/ceph/pull/479 Scott Devoid
05:53 AM devops Bug #5738 (Resolved): Do not `rm -rf` the mon directory when calling destroy
Merged into `ceph-deploy` master
Hash: 19144079c6592840328d3695578d6c3a7d52fe37
Alfredo Deza
01:37 AM Bug #5823 (Can't reproduce): cpu load on cluster node is very high, client can't get data on pg ...
ceph version 0.61.7 (8f010aff684e820ecc837c25ac77c7a05d7191ff)
env: 3 cluster nodes (10 osds/node), use dedicated...
Khanh Nguyen Dang Quoc

07/31/2013

11:39 PM Bug #5822 (Resolved): ceph_argparse.py: parsing of required arguments broken
Relatively-recent changes I made to ceph_argparse.py were not caught by existing (lacking) tests;
commands can be ap...
Dan Mick
06:12 PM Bug #5815 (Fix Under Review): rest/test.py failure: FAILURE: url http://localhost:5000/api/v0.1/...
Fix test failure by adding 10x retry to "get epoch, set epoch+1".
wip-5815
Dan Mick
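The "get epoch, set epoch+1" retry described for #5815 can be sketched like this. It is a minimal illustration, not the code in wip-5815: `get_epoch` and `set_epoch` are hypothetical callables standing in for the REST calls, and `set_epoch` is assumed to return False when another update won the race.

```python
def bump_epoch_with_retry(get_epoch, set_epoch, attempts=10):
    """Read the current epoch and try to set epoch+1.

    If the epoch moved between the read and the write, re-read and
    try again, up to `attempts` times.
    """
    for _ in range(attempts):
        target = get_epoch() + 1
        if set_epoch(target):
            return target
    raise RuntimeError(f"epoch update still racing after {attempts} attempts")
```

Re-reading the epoch on every attempt is the important part; retrying with a stale value would fail the same way each time.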
02:09 PM Bug #5815: rest/test.py failure: FAILURE: url http://localhost:5000/api/v0.1/mds/setmap?epoch=11
Also, as Sage points out, "unrecognized command" isn't really useful here.
Dan Mick
02:09 PM Bug #5815: rest/test.py failure: FAILURE: url http://localhost:5000/api/v0.1/mds/setmap?epoch=11
OK, this is mds setmap racing against the epoch number we got from mds dump. Possible fixes: retry N times or just s... Dan Mick
10:58 AM Bug #5815: rest/test.py failure: FAILURE: url http://localhost:5000/api/v0.1/mds/setmap?epoch=11
ubuntu@teuthology:/a/teuthology-2013-07-31_01:00:23-rados-next-testing-basic-plana/91048 Samuel Just
10:57 AM Bug #5815 (Resolved): rest/test.py failure: FAILURE: url http://localhost:5000/api/v0.1/mds/setm...
... Samuel Just
04:09 PM CephFS Documentation #5797: Document unstable nature of CephFS
the docs are under <ceph-sources>/doc/cephfs/ Zheng Yan
04:03 PM CephFS Bug #5649: smbtorture test gets ebusy on kclient umount
kclient bug or test script bug ? Zheng Yan
04:01 PM CephFS Bug #5649 (In Progress): smbtorture test gets ebusy on kclient umount
ubuntu@teuthology:/a/teuthology-2013-07-31_01:01:24-fs-next-testing-basic-plana/91313 Tamilarasi muthamizhan
02:44 PM rgw Feature #5821 (Resolved): Alias --secret-key to --secret for radosgw-admin
It seems like 'secret key' and '--secret-key' are the predominant way to refer to the secret for an RGW user but the ... Anonymous
02:41 PM rgw Bug #5820 (Resolved): radosgw-admin should fail on non-valid flags
A bug in the RGW teuthology task was encountered today where the --secret flag for user creation was being passed in ... Anonymous
02:17 PM rbd Feature #5167: openstack: cinder: differential backups
https://review.openstack.org/#/c/35216/ Josh Durgin
02:17 PM rbd Feature #5168: openstack: cinder: rbd as a backup target
https://review.openstack.org/#/c/34346/ Josh Durgin
02:15 PM rbd Feature #5465: openstack: cinder: support resize with rbd
https://review.openstack.org/#/c/36415/ Josh Durgin
01:47 PM Bug #5814 (Resolved): test_daemon.sh - [Errno 13] Permission denied
Sage Weil
01:38 PM Bug #5814: test_daemon.sh - [Errno 13] Permission denied
Dan Mick wrote:
> teuthworker can't passwordless sudo, although ubuntu can; that might complicate things. Should we...
Sage Weil
01:22 PM Bug #5814: test_daemon.sh - [Errno 13] Permission denied
teuthworker can't passwordless sudo, although ubuntu can; that might complicate things. Should we make teuthworker b... Dan Mick
12:54 PM Bug #5814: test_daemon.sh - [Errno 13] Permission denied
Script just needs to sudo I think Sage Weil
10:58 AM Bug #5814: test_daemon.sh - [Errno 13] Permission denied
ubuntu@teuthology:/a/teuthology-2013-07-31_01:00:23-rados-next-testing-basic-plana/91029 Samuel Just
10:55 AM Bug #5814 (Resolved): test_daemon.sh - [Errno 13] Permission denied
2013-07-31T01:56:50.625 INFO:teuthology.task.workunit.client.0.err:[10.214.132.26]: will thrash map for 10 epochs
20...
Samuel Just
01:32 PM devops Bug #5819 (Won't Fix): redhat-rpm-config package needed for debuginfo packages
Just a note to fix up the spec file, or otherwise document the requirement. On rhel, the debug packages are built by... Anonymous
01:09 PM Bug #5784 (Fix Under Review): ceph_test_rados_watch_notify segfault
David Zafman
12:06 PM rgw Feature #5607 (Resolved): s3-tests: new tests for multi region
Yehuda Sadeh
12:06 PM rgw Feature #5606 (Resolved): s3-tests: run in multi region setup
Yehuda Sadeh
12:00 PM rgw Bug #5810 (Rejected): rgw: update_type does not work correctly when meta object does not exist (o...
Ok, after discussing that with Josh, he reminded me that we don't really replay the log. What we do is always fetchin... Yehuda Sadeh
11:11 AM rgw Bug #5810: rgw: update_type does not work correctly when meta object does not exist (or removed)
Issue #2 (not checking versions for removal operations) can be fixed easily.
Was thinking about issue #2. Note tha...
Yehuda Sadeh
11:33 AM Bug #5818 (Won't Fix): leveldb 1.12: hang on shutdown (mon)
ubuntu@teuthology:/a/teuthology-2013-07-31_01:00:23-rados-next-testing-basic-plana/91208
The processes are still r...
Samuel Just
10:23 AM Bug #5799: SIGABRT in build_push_op -> object_info_t::decode
I think this may have been an ext4 xattr error. Either way, we'll have to reproduce it. Samuel Just
09:34 AM devops Bug #5813 (Closed): Normalize ceph-deploy Exceptions for better error reporting
ceph-deploy raises many different types of exceptions which makes it difficult for better reporting of the tool's err... Alfredo Deza
09:21 AM devops Bug #5763: ceph-deploy new [IP] should error out
Added a validator to `ceph-deploy new` so that we prevent this from happening.
Pull request was opened: https://gi...
Alfredo Deza
09:04 AM Bug #5503 (Resolved): osd: ceph --admin-daemon interface doesn't handle spaces in names
Ian Colle
08:14 AM devops Bug #4936 (Resolved): ceph-deploy fails to report errors
I am closing this as the new logging features will log verbosely to the terminal by default.
Errors should be more...
Alfredo Deza
08:02 AM rbd Bug #5812 (Closed): qemu-kvm guest hangs on disk write with rbd storage
qemu-kvm VM with rbd backed storage works until guest tries to write data, then kernel_hung task messages appear.
...
Kevin Fibich
05:45 AM devops Bug #5728 (Resolved): Capture std(err|out) and exit code for better error reporting
Merged into master ceph-deploy
Hash: a451465438174f68e3d6b216c13387fdaa954fe2
Alfredo Deza
12:13 AM devops Bug #5811 (Resolved): gperftools-{devel,libs}-2.0.11.el6 appears to be broken (centos6/rhel6)
Configure complains that tcmalloc can not be found. The log shows the configure test program failed with sem_init() ... Anonymous

07/30/2013

11:17 PM rgw Feature #5604 (In Progress): rgw: teuthology tests to check various user creation issues on multi...
*sigh*
wip-region-sync contains a teuth task that sets up the sync agent (apparently successfully). Unfortunately I'...
Greg Farnum
11:11 PM Bug #5800 (Resolved): admin socket arg validation broken
Sage Weil
06:25 PM Bug #5800 (In Progress): admin socket arg validation broken
Just figuring out test/admin_socket.cc Dan Mick
10:17 PM devops Bug #5777: ceph-deploy: gatherkeys fails on ubuntu precise
does /var/run/ceph/ceph-mon.node01.asok exist? what happens if you do 'ceph --admin-daemon /var/run/ceph/ceph-mon.no... Sage Weil
10:08 PM devops Bug #5777: ceph-deploy: gatherkeys fails on ubuntu precise

root@node01:~# ceph-create-keys -i node01
admin_socket: exception getting command descriptions: [Errno 2] No such ...
vlad karm
07:15 AM devops Bug #5777: ceph-deploy: gatherkeys fails on ubuntu precise
what is the output on the node01 from 'ceph-create-keys -i `hostname`' ? Sage Weil
12:53 AM devops Bug #5777: ceph-deploy: gatherkeys fails on ubuntu precise
Same again on
Ubuntu 13.04 (GNU/Linux 3.8.0-19-generic x86_64)
root@manage:~/cluster# ceph-deploy install --dev ...
vlad karm
06:12 PM rbd Feature #5465 (Resolved): openstack: cinder: support resize with rbd
Josh Durgin
06:10 PM rbd Feature #5167 (Resolved): openstack: cinder: differential backups
Josh Durgin
06:10 PM rbd Feature #5168 (Resolved): openstack: cinder: rbd as a backup target
Josh Durgin
05:55 PM rgw Bug #5810 (Rejected): rgw: update_type does not work correctly when meta object does not exist (o...
Two related but separate issues:
- when creating a new object, update-type=update-by-version doesn't work at all (t...
Yehuda Sadeh
04:05 PM rgw Bug #5808 (Resolved): rgw: can't suspend user
Yehuda Sadeh
04:05 PM rgw Bug #5808: rgw: can't suspend user
Fixed, commit:736d6a1bde71919f7469218684c12793a9532aeb Yehuda Sadeh
01:53 PM rgw Bug #5808 (Resolved): rgw: can't suspend user
teuthology-2013-07-30_01:01:13-rgw-next-testing-basic-plana, 89678 Yehuda Sadeh
03:55 PM Bug #5798 (Resolved): librados: mutex assertion in SimpleMessenger::submit_message
Samuel Just
02:53 PM Bug #5798: librados: mutex assertion in SimpleMessenger::submit_message
(gdb) print *this
$5 = {<RefCountedObject> = {_vptr.RefCountedObject = 0xd66500, nref = {val = 140461764831096}}, lo...
Samuel Just
02:45 PM Bug #5798: librados: mutex assertion in SimpleMessenger::submit_message
2013-07-29T10:59:31.528 INFO:teuthology.task.workunit.client.0.out:[10.214.133.35]: [ RUN ] LibRadosCmd.PGCmd
Samuel Just
02:43 PM Bug #5798: librados: mutex assertion in SimpleMessenger::submit_message
binary was ceph_test_rados_api_cmd Samuel Just
12:50 PM Bug #5798: librados: mutex assertion in SimpleMessenger::submit_message
There is a chance that this may be related to #5784. Samuel Just
03:40 PM Bug #5804: mon: binds to 0.0.0.0:6800something port
... Joao Eduardo Luis
02:02 PM Bug #5804: mon: binds to 0.0.0.0:6800something port
Just saw it is; my connection to that machine is however painfully slow. Will update as soon as I figure something else. Joao Eduardo Luis
01:56 PM Bug #5804: mon: binds to 0.0.0.0:6800something port
Is the monitor being specified in 'mon initial members'? Joao Eduardo Luis
03:11 PM Bug #5754: mon: occasional osd full map not trimmed
pretty sure this is https://code.google.com/p/leveldb/issues/detail?id=178. fixed https://code.google.com/p/leveldb/... Sage Weil
02:47 PM CephFS Bug #5367 (Resolved): multiclient tests: kernel mount gets EPERM
We can reopen if we see this again, but the tests are now checking (and chef is setting) the uid for the ubuntu (test... Greg Farnum
02:47 PM CephFS Bug #5761: teuthology: MPI test sometimes fails with a permission denied
Essentially a duplicate of #5637: mismatched UIDs on different nodes. Greg Farnum
02:30 PM CephFS Bug #5761 (Resolved): teuthology: MPI test sometimes fails with a permission denied
Sage Weil
02:38 PM Bug #5616 (Resolved): repeated mon elections
Sage Weil
10:50 AM Bug #5806 (Resolved): osd: unfound objects on next!
Hopefully fixed with teuthology hack on next: e1cd49bd4558f7491e11b2bffe9355001661f81f Samuel Just
10:38 AM Bug #5806: osd: unfound objects on next!
This is a problem with the test: when we mark an osd up, there is lag before the pgs transition from active+clean. ... Samuel Just
10:27 AM Bug #5806 (In Progress): osd: unfound objects on next!
Samuel Just
09:28 AM Bug #5806 (Resolved): osd: unfound objects on next!
ubuntu@teuthology:/a/teuthology-2013-07-30_01:00:15-rados-next-testing-basic-plana/89584
cluster is hung in recovery
Sage Weil
07:58 AM CephFS Bug #5805 (Need More Info): mds: crash loading sessionmap
log shows the missing object is snaptable instead of sessionmap. The log does not give us any clue how this happened ... Zheng Yan
07:16 AM CephFS Bug #5805 (Can't reproduce): mds: crash loading sessionmap
ftp://ftp.ts.fujitsu.com/outgoing/mds.0_crash-logs.tar.gz Sage Weil
06:10 AM devops Bug #5728: Capture std(err|out) and exit code for better error reporting
There is a pull request waiting for testing that enables full logging and actual remote actions in the remote host.
...
Alfredo Deza

07/29/2013

09:31 PM Bug #5804 (Resolved): mon: binds to 0.0.0.0:6800something port
it keeps binding to 0.0.0.0:6801 instead of the mon addr.. why? Sage Weil
04:01 PM Bug #5800 (Resolved): admin socket arg validation broken
flab:src 04:00 PM $ ./ceph daemon mon.a config set debug_ms 1
*** DEVELOPER MODE: setting PYTHONPATH and LD_LIBRARY_...
Sage Weil
03:52 PM rgw Bug #5789: rgw: can't authenticate cors requests
I think it only broke post-cuttlefish, so probably not. Yehuda Sadeh
03:50 PM rgw Bug #5789 (Pending Backport): rgw: can't authenticate cors requests
In next, commit:f10c2e746ef5b9921b808450e75f2a90070e05bc.
Does this need backports? An earlier iteration of this b...
Greg Farnum
10:35 AM rgw Bug #5789 (Fix Under Review): rgw: can't authenticate cors requests
Yehuda Sadeh
10:26 AM rgw Bug #5789 (Resolved): rgw: can't authenticate cors requests
Yehuda Sadeh
03:49 PM rgw Bug #5793 (Resolved): rgw: cross region copy requests not always going to the correct region
Fixed in next commit:eb6e6da1a90d3d94d623650556d78e56d04ab3a1 Greg Farnum
10:40 AM rgw Bug #5793 (Fix Under Review): rgw: cross region copy requests not always going to the correct region
Yehuda Sadeh
10:36 AM rgw Bug #5793 (Resolved): rgw: cross region copy requests not always going to the correct region
Because instead of a region connection map, we only keep a single connection that points at the master region. Yehuda Sadeh
03:48 PM rgw Bug #5791 (Resolved): rgw: bucket set_attr operations are logged in wrong metadata section
Fixed in next commit:b5e74485d94d033478f791535423a8b8bef88326 Greg Farnum
10:45 AM rgw Bug #5791 (Fix Under Review): rgw: bucket set_attr operations are logged in wrong metadata section
Yehuda Sadeh
10:32 AM rgw Bug #5791: rgw: bucket set_attr operations are logged in wrong metadata section
these operations are done on the bucket instance, however, they're being logged to the bucket section (and not to buc... Yehuda Sadeh
10:29 AM rgw Bug #5791 (Resolved): rgw: bucket set_attr operations are logged in wrong metadata section
Yehuda Sadeh
03:47 PM rgw Bug #5790 (Resolved): rgw: access correct bucket instance object
Fixed in next, commit:68730d80c8aa393a583c92052ef5ffd11efe17a6 Greg Farnum
10:45 AM rgw Bug #5790 (Fix Under Review): rgw: access correct bucket instance object
Yehuda Sadeh
10:29 AM rgw Bug #5790 (Resolved): rgw: access correct bucket instance object
For buckets that were created before the bucket entrypoint - instance split, the bucket entry point holds the bucket ... Yehuda Sadeh
03:46 PM rgw Bug #5770 (Resolved): rgw: cors, bucket attrs info kept on bucket entry point instead of bucket i...
Fixed in next, commit:acd16d1aed0f6146c84a377be9fef1d7f54c6bed Greg Farnum
10:45 AM rgw Bug #5770 (Fix Under Review): rgw: cors, bucket attrs info kept on bucket entry point instead of ...
Yehuda Sadeh
03:06 PM devops Bug #5738 (Fix Under Review): Do not `rm -rf` the mon directory when calling destroy
wip-5738 Sage Weil
02:50 PM devops Bug #5777 (Resolved): ceph-deploy: gatherkeys fails on ubuntu precise
seems to work fine on ceph version 0.67-rc2-184-g1cb3216-1precise, ie the latest next branch. Tamilarasi muthamizhan
02:15 PM Bug #5798: librados: mutex assertion in SimpleMessenger::submit_message
Samuel Just
02:13 PM Bug #5798 (In Progress): librados: mutex assertion in SimpleMessenger::submit_message
Samuel Just
12:54 PM Bug #5798 (Resolved): librados: mutex assertion in SimpleMessenger::submit_message
... Sage Weil
02:15 PM Bug #5799 (In Progress): SIGABRT in build_push_op -> object_info_t::decode
Samuel Just
01:29 PM Bug #5799 (Resolved): SIGABRT in build_push_op -> object_info_t::decode
... Sage Weil
02:11 PM Bug #5749: osd: unfound objects on cuttlefish
oops wrong bug Sage Weil
01:45 PM Bug #5749 (In Progress): osd: unfound objects on cuttlefish
Sage Weil
02:10 PM Bug #5748 (Resolved): osd: unfound object test is failing to produce unfound objects
teuthology.git bug, commit:99a79c6c1f2927b73003f13e9b08be892fa7cd50 Sage Weil
01:59 PM Bug #5748 (In Progress): osd: unfound object test is failing to produce unfound objects
Samuel Just
02:01 PM Bug #5780 (Resolved): the size of the old acting set should be compared with the min_size of the ...
Loïc Dachary
01:53 PM Bug #5784: ceph_test_rados_watch_notify segfault
Samuel Just
01:52 PM Bug #5784 (Resolved): ceph_test_rados_watch_notify segfault
Samuel Just
01:50 PM Bug #5783 (Resolved): osd: recovery timeout
The first appeared not to be hung. The second should be fixed with teuthology commit 8284e19226a2f15435974af99bc3d80... Samuel Just
01:29 PM Bug #5783: osd: recovery timeout
ubuntu@teuthology:/a/teuthology-2013-07-29_09:44:34-rados-next-testing-basic-plana/88630 Sage Weil
01:33 PM CephFS Bug #5649 (Resolved): smbtorture test gets ebusy on kclient umount
Sage Weil
01:33 PM Bug #5779 (Resolved): ceph-rest-api: missing requests dependency onpackage?
Sage Weil
01:14 PM Bug #5786 (Resolved): mon/rest-api: pg dump test gives mon bad command?
Sage Weil
01:11 PM rbd Bug #5426: librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::complete()
nevermind, the PerfCounters is invalid too, with m_lower_bound = 22777856, m_upper_bound = 0 Josh Durgin
01:06 PM rbd Bug #5426: librbd: mutex assert in perfcounters::tinc in librbd::AioCompletion::complete()
Reproduced after 22 tries with no debugging on. Couldn't reproduce without debugging, but this is a start.
The Per...
Josh Durgin
12:59 PM rbd Bug #5488: librbd: deadlock in image refresh
Another potential deadlock popped up: md_lock vs librados' lock ordering. Still trying to find which callback takes t... Josh Durgin
12:46 PM CephFS Documentation #5797 (Resolved): Document unstable nature of CephFS
Apparently CephFS is not production-ready, especially not in multi-mds configurations?
>nhm: We should probably pu...
Scott Devoid
12:24 PM Feature #5796 (New): formatter: send output to ostream
Instead of accumulating the result in memory and doing a final f->flush(outputbuffer), set the output/sink (ostream!)... Sage Weil
12:12 PM Bug #5782 (Resolved): osd: off-by-one on map gap handling?
Sage Weil
10:36 AM Bug #5792 (Resolved): ceph-deploy mon create hangs on fedora18 vm's
All the nightlies are hanging on fedora when running ceph-deploy mon create. Example:
2013-07-28T01:18:11.681 DEB...
Sandon Van Ness
10:10 AM devops Bug #5405 (Resolved): ceph-deploy: transient pushy exception on install
Sage Weil
09:58 AM devops Feature #5016 (Resolved): ceph-deploy: gitbuilders for release packages
All ceph-deploy gitbuilders are happy now. Anonymous
09:34 AM devops Bug #5778: gitbuilders use cryptopp instead of nss libraries
if the configure fix isn't quick, let's add the --without-cryptopp Sage Weil
09:11 AM CephFS Bug #5787: client/Client.cc: 2081: FAILED assert(!unclean) in put_inode
Greg Farnum wrote:
> If the inode is >4MB, shouldn't the oset have more than one object? Sounds like maybe we lost t...
Sage Weil
09:08 AM CephFS Bug #5787: client/Client.cc: 2081: FAILED assert(!unclean) in put_inode
If the inode is >4MB, shouldn't the oset have more than one object? Sounds like maybe we lost track of an in-flight w... Greg Farnum

07/28/2013

10:24 PM Bug #5780: the size of the old acting set should be compared with the min_size of the old osdmap
Sam - please review Ian Colle
05:11 AM Bug #5780 (Fix Under Review): the size of the old acting set should be compared with the min_size...
Loïc Dachary
04:47 AM Bug #5780 (Resolved): the size of the old acting set should be compared with the min_size of the ...
"work in progress":https://github.com/dachary/ceph/tree/wip-5780
"pg_interval_t::check_new_interval":https://githu...
Loïc Dachary
10:19 PM rgw Bug #5416 (Resolved): --help output needs --rgw-zone option
Ian Colle
04:55 PM rgw Bug #5416: --help output needs --rgw-zone option
Issue should be closed as https://github.com/ceph/ceph/commit/7a7fca46fb1e8e3a121279b45fa9ce687d29b788 fixed it. Christophe Courtaut
08:52 PM Bug #5788 (Resolved): ceph: try new, fallback to old can race with daemon upgrade
- cli tries new style, gets EINVAL
- mon upgrades from old -> new
- cli tries old style, gets EINVAL
it should p...
Sage Weil
03:24 PM CephFS Bug #5787 (Need More Info): client/Client.cc: 2081: FAILED assert(!unclean) in put_inode
Sage Weil
03:24 PM CephFS Bug #5787 (Duplicate): client/Client.cc: 2081: FAILED assert(!unclean) in put_inode
... Sage Weil
09:28 AM Bug #5786 (Resolved): mon/rest-api: pg dump test gives mon bad command?
... Sage Weil
09:17 AM Bug #5785 (Resolved): librados: deadlock during shutdown
the monc locking for librados shutdown looks broken. although the whole thing appears a bit wonky.... Sage Weil
09:10 AM Bug #5749: osd: unfound objects on cuttlefish
HEALTH_WARN 2 pgs recovering; 2 pgs stuck unclean; recovery 101/1942 degraded (5.201%); 24/971 unfound (2.472%)
root...
Sage Weil
09:05 AM Bug #5782 (Fix Under Review): osd: off-by-one on map gap handling?
wip-5782 Sage Weil
08:21 AM Bug #5782 (Resolved): osd: off-by-one on map gap handling?
off by one on map gap... Sage Weil
08:28 AM Bug #5784 (Resolved): ceph_test_rados_watch_notify segfault
... Sage Weil
08:24 AM Bug #5783 (Resolved): osd: recovery timeout
no obvious crashes.... Sage Weil
07:59 AM CephFS Bug #5367: multiclient tests: kernel mount gets EPERM
ah, a few do! the ones this test failed on don't, but maybe they were reimaged since then. Sage Weil
02:09 AM CephFS Bug #5367: multiclient tests: kernel mount gets EPERM
can't reproduce locally. Is it possible your clients have different 'username -> uid' mapping? Zheng Yan

07/27/2013

06:40 PM Bug #5779: ceph-rest-api: missing requests dependency onpackage?
please review commit:6881ab3b39259adcd7aed7026ccae1cba1db7aa1 Sage Weil
05:49 PM Bug #5779 (Fix Under Review): ceph-rest-api: missing requests dependency onpackage?
Sage Weil
07:35 AM Bug #5779 (Resolved): ceph-rest-api: missing requests dependency onpackage?
in qa i see... Sage Weil
05:51 PM Bug #5754: mon: occasional osd full map not trimmed
a mon transaction dump clearly shows the key being removed, and it is then visible later.
other evidence that this...
Sage Weil
12:47 PM devops Bug #5777: ceph-deploy: gatherkeys fails on ubuntu precise
Same here
Ubuntu 12.04.2 LTS (GNU/Linux 3.2.0-49-generic x86_64)
# ceph version 0.61.7 (8f010aff684e820ecc837c25...
vlad karm
07:54 AM devops Bug #4924 (Need More Info): ceph-deploy: gatherkeys fails on raring (cuttlefish)
Can you add 'debug mon = 20' and 'debug ms = 1' and 'debug monc = 20' to your ceph.conf, restart ceph-mon, and attach... Sage Weil
07:18 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
I'm still seeing this with the latest cuttlefish on ubuntu 13.04.
after doing ceph-deploy install ceph-dmon1 and t...
Zoltan Arnold Nagy

07/26/2013

11:25 PM rgw Bug #5415 (Fix Under Review): rgw: failing valgrind leak checks
Sage Weil
10:18 PM CephFS Bug #5724 (Resolved): ceph-fuse crashed during fs tests using ceph-deploy
this was broken getgroups stuff, disabled now. Sage Weil
09:55 PM Bug #5239 (Need More Info): osd: Segmentation fault in ceph-osd / tcmalloc
Sage Weil
05:47 PM devops Bug #5778 (Resolved): gitbuilders use cryptopp instead of nss libraries
The configure script checks for the libcryptopp package being installed before it checks for nss. This means that ev... Anonymous
05:37 PM devops Bug #5777 (Resolved): ceph-deploy: gatherkeys fails on ubuntu precise
ceph-deploy gatherkeys fails because ceph-create-keys gets stuck when trying to start the monitor.
this is on ceph -ne...
Tamilarasi muthamizhan
05:31 PM Bug #5776 (Can't reproduce): ceph: passing -1 osd id
i fixed librados to not assert, but still:... Sage Weil
05:17 PM devops Feature #5775 (Resolved): create qemu rbd package for rhel 6.5 - qemu-rbd
Josh Durgin
05:15 PM rbd Feature #5774 (Resolved): test libvirt + qemu on rhel
Josh Durgin
04:58 PM Bug #5767 (Resolved): ceph-deploy + ceph-fuse tests failing on next
Sage Weil
03:27 PM Bug #5767 (In Progress): ceph-deploy + ceph-fuse tests failing on next
oops Sage Weil
03:27 PM Bug #5767 (Resolved): ceph-deploy + ceph-fuse tests failing on next
Sage Weil
01:26 PM Bug #5767 (Resolved): ceph-deploy + ceph-fuse tests failing on next
appear to hang. no admin socket in var/run :(
reproducing with logs
Sage Weil
04:48 PM devops Feature #5773 (Resolved): ceph-deploy: should add more tests to ceph-deploy task
The ceph-deploy task in teuthology should be able to run more thorough tests of osd create, such as
picking different...
Tamilarasi muthamizhan
04:44 PM rgw Feature #5607 (In Progress): s3-tests: new tests for multi region
Yehuda Sadeh
04:30 PM rgw Feature #5606: s3-tests: run in multi region setup
Yehuda Sadeh
03:59 PM rgw Bug #5770 (Resolved): rgw: cors, bucket attrs info kept on bucket entry point instead of bucket i...
Yehuda Sadeh
03:25 PM Bug #5764 (Resolved): mon: problem with pgmap upgrade_format, then sync
Sage Weil
10:36 AM Bug #5764: mon: problem with pgmap upgrade_format, then sync
Sage Weil
09:12 AM Bug #5764 (Resolved): mon: problem with pgmap upgrade_format, then sync
2/3 mons upgrade and form quorum and are happy
last one starts up, loads latest (old format) pgmap
does a sync
doe...
Sage Weil
03:22 PM Bug #5616: repeated mon elections
logs on slider:~samuelj/big_cluster/mons[2] Samuel Just
02:35 PM rbd Bug #5768 (Resolved): rbd-fuse: leak in enumerate_images()
enumerate_images() allocates ibuf and never frees it. Problem is that it returns a list with pointers into it. This n... Yehuda Sadeh
02:07 PM Bug #5752 (Resolved): osd: delayed loading of classes can be problematic if use races with upgrade
Sage Weil
01:57 PM Bug #5766 (Resolved): osd: replay not closing fds? too many open fds on upgrade+restart
leak in _check_global_replay_guard. backported to cuttlefish Samuel Just
09:38 AM Bug #5766 (Resolved): osd: replay not closing fds? too many open fds on upgrade+restart
... Sage Weil
01:24 PM devops Bug #5727 (Resolved): Discover and set the proper OS bindings to use
Pull request https://github.com/ceph/ceph-deploy/pull/22 merged.
Hash: 4ae2768d63531a7458133b632fec4d92959c6e06
Alfredo Deza
10:51 AM Bug #5677 (Resolved): osd/OSD.cc: 5517: FAILED assert(_get_map_bl(epoch, bl))
Samuel Just
09:17 AM CephFS Bug #5765 (New): kclient: High CPU due to raw_spin_lock in ceph_cap_string
During CephFS testing, very high CPU usage was noticed during writes across lots of clients. perf showed that a sign... Mark Nelson
09:12 AM rgw Feature #5604: rgw: teuthology tests to check various user creation issues on multi region enviro...
Getting started on this after discussing how to break up the tasks with Joe. Greg Farnum
06:43 AM devops Bug #5745: ceph-deploy: mds create command errors out EACCES on next branch
Fixed in `next` branch of ceph
Hash: ebb9aceb7971f13ae4032f0943854b2b2c0af28a
And in `master` branch of ceph-de...
Alfredo Deza
05:46 AM devops Bug #5763 (Resolved): ceph-deploy new [IP] should error out
When calling `new` with ceph-deploy, sometimes it is possible a user may use an IP instead of a hostname. The tool sh... Alfredo Deza
 
