Project

General

Profile

Activity

From 04/18/2013 to 05/17/2013

05/17/2013

11:59 PM Revision 863d6d78 (ceph): Merge pull request #253 from Elbandi/wip-getlayout
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
10:59 PM Revision feec1b46 (ceph): doc: Added more glossary-compliant terms and indexing.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:58 PM Revision 5c4b4f0f (ceph): doc: Added another instance term to the glossary.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:56 PM Revision decf342c (ceph): doc: Minor improvements to Ceph FS landing page.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:07 PM Revision 0f4c67f1 (ceph): rgw: store region in bucket info
only handle requests that come to buckets stored in correct
region.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:50 PM Revision 3255f115 (ceph): libcephfs: get the pool name of a file path
Signed-off-by: Andras Elso <elso.andras@gmail.com> Andras Elso
07:45 PM Revision 3562c8c5 (ceph): libcephfs: get replication factor of a file handle/path
Signed-off-by: Andras Elso <elso.andras@gmail.com> Andras Elso
07:43 PM Revision 877fcf0b (ceph): libcephfs: get file handle/path layout info
Signed-off-by: Andras Elso <elso.andras@gmail.com> Andras Elso
07:39 PM Revision 42c74fde (ceph): libcephfs: get stripe_unit/stripe_count/object_size/pool_id by file han...
Signed-off-by: Andras Elso <elso.andras@gmail.com> Andras Elso
07:10 PM Revision ee3d50e6 (ceph): Client: get describe_layout by file handle/path
Signed-off-by: Andras Elso <elso.andras@gmail.com> Andras Elso
07:10 PM Revision 10496a84 (ceph): libcephfs: fix typos
Signed-off-by: Andras Elso <elso.andras@gmail.com> Andras Elso
07:08 PM Revision 5a274c8e (ceph): client config will be done only after the cluster is operational.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
05:47 PM devops Bug #4865 (In Progress): ceph-disk: activate fails on debian wheezy due to missing udev by-partuu...
tested with cuttlefish branch on burnupi26, still seeing the same issue Tamilarasi muthamizhan
05:23 PM devops Bug #5086 (Resolved): ceph-deploy: osd create command fails sometimes on centos 6.3
commit:bae521159deb0ca58c05ec14fa8362c4cc334fc2 Dan Mick
05:06 PM devops Bug #5107: ceph-deploy: on centos 6.3, osd create command should be cleaned up
... Tamilarasi muthamizhan
04:57 PM devops Bug #5107 (Duplicate): ceph-deploy: on centos 6.3, osd create command should be cleaned up
on centos 6.3, [burnupi05, burnupi21]
while osd create command when used with zapdisk option, does create osds suc...
Tamilarasi muthamizhan
04:44 PM devops Bug #4925 (Resolved): Incorrect yum conf for Cuttlefish and el6
The ceph-release rpms in the rpm-bobtail repo have been respun to reference ceph.com/rpm-bobtail instead of ceph.com/... Anonymous
04:26 PM Revision d0a5d3a7 (ceph): Merge pull request #295 from ceph/wip-5077
Reviewed-by: Joao Luis <joao.luis@inktank.com> Sage Weil
04:17 PM Revision 69e2cbef (ceph): mon: add 'compact' command
As in, 'ceph mon tell \* compact'
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Joao Luis <joao.luis@inkt...
Sage Weil
03:35 PM Revision b238f356 (ceph): Merge pull request #296 from dalgaaf/wip-da-CID-1021213
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
03:26 PM devops Cleanup #5106 (Resolved): ceph_deploy: install/compile error on wheezy
Incompatible syntax with python 2.6 compiler:
administrator@ceph-admin:~$ sudo aptitude install ceph-deploy
The f...
Anonymous
03:10 PM CephFS Bug #5105 (Duplicate): mds/CInode.cc: 1996: FAILED assert(auth_pins >= 0)
While trying to reproduce #4999, I collected this in an MDS.
I was running next branch (commit c80c6a032c) merged ...
Jim Schutt
02:34 PM Subtask #5046: Factor out PG logs, PG missing
Move code to PGLog.cc PGLog.h so that missing, log and ondisklog are protected. Fix what breaks. Loïc Dachary
02:22 PM CephFS Bug #5104 (Can't reproduce): MDS crashed in Objecter::handle_osd_op_reply
While trying to reproduce #4999, I collected this in an MDS.
I was running next branch (commit c80c6a032c) merged ...
Jim Schutt
02:14 PM CephFS Bug #5103 (Rejected): mds: hung getattrs after restart
this was an osd issue. Sage Weil
11:06 AM CephFS Bug #5103 (Rejected): mds: hung getattrs after restart
logs on cephdrop ceph-mds.1.log
hung requests are...
Sage Weil
01:21 PM Documentation #2271: FAQ: BTRFS vs XFS
Correction: btrfs is not a journaled fs. The xfs is better than ext4 for subtle desSigns reasons that are probably n... Sage Weil
12:49 PM Documentation #2271 (Resolved): FAQ: BTRFS vs XFS
This is covered here: http://ceph.com/docs/master/rados/configuration/filesystem-recommendations/ and has a link from... John Wilkins
12:30 PM rbd Bug #4661: xfstest 139 hung
Note that both the original crash and this latest one
involve (probably) some corruption found in a path
involving ...
Alex Elder
12:28 PM rbd Bug #4661: xfstest 139 hung
Finally getting back to this.
Here is the end of the log:...
Alex Elder
11:58 AM Revision a130cd50 (ceph): kv_flat_btree_async.cc: release AioCompletion before leave the loop
CID 727982 (#1 of 1): Resource leak (RESOURCE_LEAK)
leaked_storage: Variable "aioc" going out of scope leaks the st...
Danny Al-Gaaf
11:54 AM Revision 4ba70f8f (ceph): librbd/internal.cc: fix resource leak
Call release() on librados::AioCompletion to free storage before
leave the loop or call new again.
CID 1021213 (#1 o...
Danny Al-Gaaf
11:52 AM Bug #5084: osd: slow peering after osd restart (bobtail)
One interesting observation is that when I tried restarting an OSD a few minutes after it has been restarted and clus... Faidon Liambotis
10:29 AM Bug #5084: osd: slow peering after osd restart (bobtail)
Here's the same with osd.6, but with --debug-ms 1 as requested. Besides peering being slow, the recovery_wait phase s... Faidon Liambotis
11:43 AM Linux kernel client Bug #5043 (Resolved): Oops in remove_osd
The following has been committed to the "testing" branch
of the ceph-client git repository:
14d2f38 libceph: must...
Alex Elder
11:42 AM rbd Bug #4559 (Need More Info): krbd: kernel BUG when mapping unexisting rbd device
I have committed the following to the "testing" branch of
the ceph-client git respository:
7262cfc rbd: don't des...
Alex Elder
04:20 AM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
I am going to test the patch. I will let you know about results probably next week. Maciej Galkiewicz
10:59 AM Bug #5102: mon: assert(is_active()) on propose_pending()
wip-5102 has a proposed fix. Joao Eduardo Luis
10:56 AM Bug #5102 (Resolved): mon: assert(is_active()) on propose_pending()
This bug popped up while Jim Schutt was trying to reproduce #4999
This was my reply to the assert:
The issue ...
Joao Eduardo Luis
10:14 AM Documentation #3388 (Resolved): doc: create documentation for juju installation
Patrick McGarry completed this on the wiki. Since it's third party, it should be in the wiki rather than the mainline... John Wilkins
09:26 AM Bug #5077 (Resolved): nightlies: single node cluster hung waiting for ceph_health to be OK
Sage Weil
08:55 AM rgw Feature #5101 (New): teuthology: make rgw.py test multiple instances
even running 2 instances is a start, but ideally we also make an haproxy task that balances between them? Sage Weil
08:47 AM devops Feature #5091: google-perftools for arm
- Enable google-perftools on armhf: TODO
+ Enable google-perftools on armhf: DONE
i guess this means its in the u...
Sage Weil
06:39 AM Bug #5100 (Can't reproduce): teuthology kclient (?): fails to unmount after tiobench
If I run the "suites/tiobench.sh" suite with kclient I sometimes
(or maybe always) get a failure, *after* the suite ...
Alex Elder
06:10 AM Revision 7494e4eb (ceph): doc: Omitted literal syntax from toc.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:10 AM Revision 381ad24d (ceph): doc: Added fuse syntax to the fstab doc.
fixes: #3672
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
03:39 AM Revision 231a2aa8 (ceph): Merge branch 'next'
Sage Weil
03:39 AM Revision c80c6a03 (ceph): sysvinit: fix enumeration of local daemons when specifying type only
- prepend $local to the $allconf list at the top
- remove $local special case for all case
- fix the type prefix chec...
Sage Weil
03:26 AM Bug #4999: monitor sync failure
Yeah, the assert guarantees an invariable is always met (we must be active to propose a new value through Paxos).
...
Joao Eduardo Luis
02:22 AM rbd Bug #5099 (Resolved): io performance / ceph block device
ceph version 0.61.2,
ceph -s
health HEALTH_OK
monmap e1: 2 mons at {a=ip1:6789/0,b=ip2:6789/0}, election epoch...
Khanh Nguyen Dang Quoc
01:40 AM Revision 7bc7c9d4 (ceph): udev: install disk/by-partuuid rules
Wheezy's udev (175-7.2) has broken rules for the /dev/disk/by-partuuid/
symlinks that ceph-disk relies on. Install p...
Sage Weil
01:40 AM Revision d8d7113c (ceph): udev: install disk/by-partuuid rules
Wheezy's udev (175-7.2) has broken rules for the /dev/disk/by-partuuid/
symlinks that ceph-disk relies on. Install p...
Sage Weil
12:58 AM Revision 65072f2e (ceph): mon: clear pg delta after some period
If we have not pg_map updates, the delta doesn't update, and can get stuck
with the velocity right before activity st...
Sage Weil

05/16/2013

11:46 PM CephFS Documentation #3672 (Resolved): doc: how to mount ceph-fuse from fstab
John Wilkins
11:14 PM Documentation #4933 (Resolved): ceph-deploy. Partition usage should be disk usage.
John Wilkins
11:12 PM Revision acf6b8f9 (ceph): os/FileStore: fix replay guard error msgs (again)
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Documentation #3394 (Resolved): doc: create documentation for ceph-deploy
We have full docs for ceph-deploy now. John Wilkins
10:46 PM Documentation #3321 (Resolved): doc: Explain monitor HA better
Added quite a bit of content on high availability, and index entries for it too. John Wilkins
10:46 PM Revision 9b9d322c (ceph): test_filestore_idempotent_sequence: unmount prior to deleting store
FileStoreDiff umounts the stores in its destructor.
Also, DeterministicOpSequence handles deletes its passed
object ...
Samuel Just
10:45 PM Documentation #3247 (Resolved): doc: Move content out of wiki, kill it with fire
New wiki has been up for awhile, and the old wiki is de-linked from the main site. John Wilkins
10:45 PM Revision 5a27e85c (ceph): Revert "test_filejournal.cc: cleanup memory in destructor"
The finish() method for Contexts calls delete this.
This reverts commit 36028916c4630ea66007760efed8fc6c441e7af5.
F...
Samuel Just
10:37 PM Revision 49c04c62 (ceph): librbd: make image creation defaults configurable
Programs using older versions of the image creation functions can't
set newer parameters like image format and fancie...
Josh Durgin
10:37 PM Revision 4d7058fe (ceph): rbd.py: fix stripe_unit() and stripe_count()
These matched older versions of the functions, but would segfault
using the current versions.
backport: cuttlefish, ...
Josh Durgin
10:36 PM Revision 82a16c32 (ceph): cls_rbd: make sure stripe_unit is not larger than object size
Test a few other cases too.
backport: cuttlefish, bobtail
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
(cher...
Josh Durgin
10:28 PM Revision aacc9adc (ceph): librbd: make image creation defaults configurable
Programs using older versions of the image creation functions can't
set newer parameters like image format and fancie...
Josh Durgin
10:28 PM Revision c49ba750 (ceph): os/FileStore: print error code to log on replay guard failure
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:21 PM Revision 53ee6f96 (ceph): rbd.py: fix stripe_unit() and stripe_count()
These matched older versions of the functions, but would segfault
using the current versions.
backport: cuttlefish, ...
Josh Durgin
10:19 PM Revision 810306a2 (ceph): cls_rbd: make sure stripe_unit is not larger than object size
Test a few other cases too.
backport: cuttlefish, bobtail
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
08:59 PM Revision 8fa3039e (ceph): doc: Added index reference.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:59 PM Revision 74a73f2f (ceph): doc: Added glossary references and index references.
fixes: #3321
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
08:57 PM Revision 5737d032 (ceph): doc: Added cluster map and CRUSH definitions.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:57 PM Revision 58a880bd (ceph): doc: Fixing index references.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:56 PM Bug #5082: OSD wrongly marked as down
Since I use XFS on OSD and It does massive RAM caching, and linux VFS in general, don't you think it could be the rea... Ivan Kudryavtsev
07:58 PM Bug #5082: OSD wrongly marked as down
-loses a moves-
moves
Ivan Kudryavtsev
07:57 PM Bug #5082: OSD wrongly marked as down
I wonder if next scenario could be realized in my case. I have a lot of data on OSD and change weight such as OSD los... Ivan Kudryavtsev
11:15 AM Bug #5082: OSD wrongly marked as down
As I see It could be that During the process a lot of IO placed on backing device and OSD just waits in 'D' state and... Ivan Kudryavtsev
10:54 AM Bug #5082: OSD wrongly marked as down
BTW, I got a lot of
2013-05-17 00:52:47.278462 osd.23 [WRN] slow request 30.313363 seconds old, received at 2013-...
Ivan Kudryavtsev
09:36 AM Bug #5082 (Need More Info): OSD wrongly marked as down
hmm, ok, this is going to need more in the way of logs to diagnose. can you capture 'debug mon = 10', 'debug ms = 1'... Sage Weil
08:56 PM Revision 46f5f585 (ceph): doc: Added latency comment.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:18 PM Revision c2acecbd (ceph): debian: make radosgw require matching version of librados2
...indirectly via ceph-common. We get bad behavior when they diverge, I
think because of libcommon.la being linked b...
Sage Weil
08:17 PM Revision 604c83ff (ceph): debian: make radosgw require matching version of librados2
...indirectly via ceph-common. We get bad behavior when they diverge, I
think because of libcommon.la being linked b...
Sage Weil
08:14 PM Revision bc9f502c (ceph): set permission for config file
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
08:10 PM Revision 2df6e376 (ceph): Merge pull request #291 from dalgaaf/wip-da-CID-1019548
client/Client.cc: fix/silence "logically dead code" CID-Error
Even money that this satisfies the coverity gods...
R...
Sage Weil
06:41 PM devops Bug #4865 (Resolved): ceph-disk: activate fails on debian wheezy due to missing udev by-partuuid ...
commit:d8d7113c35b59902902d487738888567e3a6b933 Sage Weil
01:08 PM devops Bug #4865: ceph-disk: activate fails on debian wheezy due to missing udev by-partuuid rules
http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=681809 Sage Weil
01:08 PM devops Bug #4865: ceph-disk: activate fails on debian wheezy due to missing udev by-partuuid rules
wip-4865
paravoid says it's specific to wheezy.. the udev rules are different than upstream.
moving the by-part...
Sage Weil
06:30 PM Revision 1df344fe (ceph): schedule_suite.sh: put sha1 in install: overrides, not ceph:
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:14 PM Revision eaf3abf3 (ceph): FileJournal: adjust write_pos prior to unlocking write_lock
In committed_thru, we use write_pos to reset the header.start value in cases
where seq is past the end of our journal...
Samuel Just
05:49 PM Revision 541396fa (ceph): client/Client.cc: fix/silence "logically dead code" CID-Error
Fix handling of 'safe' and the conditions after calling file_flush().
CID 1019548 (#1 of 1): Logically dead code (DE...
Danny Al-Gaaf
05:48 PM Bug #5077 (Fix Under Review): nightlies: single node cluster hung waiting for ceph_health to be OK
wip-5077 Sage Weil
05:19 PM devops Bug #4925 (In Progress): Incorrect yum conf for Cuttlefish and el6
The rpm symbolic link was moved from pointing at bobtail to point at cuttlefish. The solution is rebuild the ceph-re... Anonymous
09:50 AM devops Bug #4925 (New): Incorrect yum conf for Cuttlefish and el6
This was fix last week, but it is happening again.
From http://ceph.com/rpm-cuttlefish/el6/x86_64/ceph-release-1-0...
Alexandre Marangone
04:49 PM Revision 7cb59d3b (ceph): added UserKnownHostsfile to ssh config
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
04:42 PM Revision 64871e09 (ceph): mds: avoid assert after suicide()
Fixes: #5079
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:40 PM CephFS Bug #5021: ceph-fuse: crash on traceless reply
Sage Weil
04:39 PM CephFS Bug #5021: ceph-fuse: crash on traceless reply
500 passes of the job on commit:1f65594c23309b527d74afe648c888c69a3c2acd wip-5021 Sage Weil
04:24 PM devops Bug #5086: ceph-deploy: osd create command fails sometimes on centos 6.3
Grr. Yes, this is a pre-Python-2.7 thing. "{}" in format strings was added in 2.7.
Yargh; I hate using field numbe...
Dan Mick
04:05 PM devops Bug #5086: ceph-deploy: osd create command fails sometimes on centos 6.3
this seems to work fine, when tried manually.
Tamilarasi muthamizhan
12:51 PM devops Bug #5086 (Resolved): ceph-deploy: osd create command fails sometimes on centos 6.3
test set up: burnupi05, burnupi20.
while running ceph-deploy tests from teuthology, found that "osd create" comman...
Tamilarasi muthamizhan
04:14 PM Revision 707ad738 (ceph): Merge pull request #290 from dalgaaf/wip-da-SCA-cppcheck-v2
Reviewed-by: Sage Weil <sage@inktank.com>
rgw bits Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Sage Weil
04:07 PM rbd Feature #5067 (Fix Under Review): librbd: configuration options to override default image creatio...
wip-librbd-config-create Josh Durgin
03:46 PM Bug #5075 (Resolved): filejournal tests failing in nightlies
Samuel Just
03:46 PM Bug #5076 (Resolved): nightlies:segfault in ceph_test_filestore_idempotent_sequence
Samuel Just
03:17 PM Bug #4999: monitor sync failure
Hmmm, this bit seems odd:... Jim Schutt
11:18 AM Bug #4999: monitor sync failure
Hmm, got the following during mon startup:... Jim Schutt
09:56 AM Bug #4999: monitor sync failure
OK, I'll take it for a spin. next branch is fine, except we might run into other issues that confound debugging this... Jim Schutt
09:52 AM Bug #4999: monitor sync failure
Jim, pushed wip-4999 with a patch to output what I believe to be the relevant information on 'debug mon = 9' (some me... Joao Eduardo Luis
09:16 AM Bug #4999: monitor sync failure
Yeah. Will push something shortly and let you know. Joao Eduardo Luis
08:15 AM Bug #4999: monitor sync failure
Joao, maybe it would be best if you pushed a branch with the debugging you'd like to see, since you know the code muc... Jim Schutt
08:03 AM Bug #4999: monitor sync failure
I'm afraid the debug ms = 1 is responsible for whatever is keeping me from reproducing, so I was
leaning towards the...
Jim Schutt
07:35 AM Bug #4999: monitor sync failure
huh, although debug ms = 1 would still be useful to see what's triggering what. That could also be surgically debugge... Joao Eduardo Luis
07:31 AM Bug #4999: monitor sync failure
That's pretty much what I was going to suggest if you were unable to trigger this.
Trying to output only the relev...
Joao Eduardo Luis
07:15 AM Bug #4999: monitor sync failure
Well, unfortunately it didn't reproduce overnight, either.
I was reproducing this every day at lower debug levels,...
Jim Schutt
02:55 PM devops Feature #5092: libatomic-ops for arm; or use gcc atomics instead
i wonder if it would actually be less effort to make our include/atomic.h use the gcc atomic types if they are availa... Sage Weil
02:27 PM devops Feature #5092 (Closed): libatomic-ops for arm; or use gcc atomics instead
Customer reported an issue with ceph and libatomic-ops on quantal. We are currently building with version "7.2~alpha... Anonymous
02:54 PM devops Feature #5091: google-perftools for arm
Blueprint changed by James Page:
Work items changed:
Work items for ubuntu-13.06:
+ Enable google-perftools on...
Sage Weil
02:23 PM devops Feature #5091 (Resolved): google-perftools for arm
Need google-perftools package for arm Anonymous
02:30 PM rbd Bug #4559 (Fix Under Review): krbd: kernel BUG when mapping unexisting rbd device
The following has been posted for review:
[PATCH] rbd: don't destroy ceph_opts in rbd_add()
Alex Elder
01:26 PM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
I don't have enough information nor do I have a Xen setup so
it isn't easy for me to try to reproduce the problem re...
Alex Elder
01:12 PM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
I believe I have found the problem.
In rbd_add(), if rbd_client_create() failed the error path
would call ceph_de...
Alex Elder
09:23 AM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
I asked Maciej to report more specifically the platform he
was using (VM, and kernel/user space rbd). Answer:
Op...
Alex Elder
02:20 PM devops Feature #5015 (In Progress): ceph-deploy: push packages to all ceph repos
Currently pushing ceph-deploy to debian and to centos6. The others are in progress. Anonymous
02:10 PM devops Feature #5014 (In Progress): arm: Build ARM packages

Packages available at:
http://gitbuilder.ceph.com/ceph-deb-quantal-armv7l-basic/ref/master/
Anonymous
01:56 PM Bug #5084: osd: slow peering after osd restart (bobtail)
Did all that, attached the logs & peering.txt. Peering took two minutes, with recovery_wait taking another two, so I ... Faidon Liambotis
01:03 PM Bug #5084: osd: slow peering after osd restart (bobtail)
two theories:
- deep scrub is slowing things down. can you try 'ceph osd set nodeepscrub' and/or 'ceph osd set no...
Sage Weil
11:26 AM Bug #5084: osd: slow peering after osd restart (bobtail)
Thanks for opening this. Attached are osd dump, osd tree and the ceph.log right after I did "restart ceph-osd id=0". ... Faidon Liambotis
11:12 AM Bug #5084 (Resolved): osd: slow peering after osd restart (bobtail)
Sage Weil
01:47 PM devops Feature #5090: ceph-build: Need to support arm in the repos.
commit 8dcf3f991bfffef5ea19453e39c65366b2e496fe
Author: Gary Lowell <glowell@inktank.com>
Date: Thu May 16 13:42:...
Anonymous
01:35 PM devops Feature #5090 (Resolved): ceph-build: Need to support arm in the repos.
This may be as simple as adding armhf to the repo config. Anonymous
01:32 PM devops Feature #5089 (Resolved): ceph-deploy install fails on arm
Need to update the install function ot work with arm packages. Anonymous
01:30 PM devops Feature #5088 (Resolved): ceph-deploy packages need to install on arm
Need to add arm into the package indexes. Currently get the following error message:
Unable to find expected entr...
Anonymous
01:20 PM rgw Bug #4997 (Resolved): Seg Fault on rgw 0.61.1 with cluster in 0.61
commit:604c83ff18f9a40c4f44bc8483ef22ff41efc8ad Sage Weil
01:05 PM CephFS Bug #4965 (Resolved): libcephfs-java test failure
Sage Weil
12:20 PM Revision 56f8c364 (ceph): test/system/st_rados_create_pool.cc_ reduce scope of 'ret' in run()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 6147df48 (ceph): test/system/st_rados_list_objects.cc: reduce scope of 'ret' in run()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision a634a13d (ceph): test/system/systest_runnable.cc: reduce scope of 'ret' in join()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 297b573d (ceph): tools/ceph-filestore-dump.cc: reduce scope of 'r' in export_files()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 49033b69 (ceph): objclass/class_debug.cc: reduce scope of 'n' in cls_log()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision b9fe22b3 (ceph): rbd_fuse/rbd-fuse.c: reduce scope of some variables in open_rbd_image()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision a3eeb9f8 (ceph): rgw/rgw_acl_s3.cc: remove local variable 'ret' from create_from_headers()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 9d6e0866 (ceph): rgw/rgw_admin.cc: reduce scope of 'ret'
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 682f1076 (ceph): rgw/rgw_bucket.cc: reduce scope of 'max' in rgw_remove_bucket()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 492553b3 (ceph): rgw/rgw_common.cc: reduce scope of 'end' in two cases
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 95bf066b (ceph): rgw/rgw_tools.cc: reduce scope of 'ret' in rgw_get_obj()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision 8f486f0b (ceph): test/librbd/test_librbd.cc: reduce scope of several variables
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:20 PM Revision d226d9c3 (ceph): test/system/rados_list_parallel.cc: reduce scope of 'ret'
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision 393de325 (ceph): osdc/Objecter.cc: reduce scope of skipped_map
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision 84ce4e9b (ceph): os/chain_xattr.cc: reduce scope of local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision 829fdd41 (ceph): src/os/LFNIndex.cc: reduce scope of suffix_len
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision 8254072b (ceph): os/HashIndex.cc: reduce scope of a local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision ea7d8a4d (ceph): os/FileStore.cc: reduce scope of a local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision 2c60bc1e (ceph): src/os/FlatIndex.cc: reduce scope of suffix_len
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:16 PM Revision 44275282 (ceph): src/os/DBObjectMap.cc: reduce scope of some variables
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:15 PM Revision 14871d05 (ceph): mount/mount.ceph.c: reduce scope of 'skip' in parse_options()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:15 PM Revision eb808cff (ceph): src/mds/flock.cc: reduce scope of old_lock_to_end in two cases
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:15 PM Revision ce84a226 (ceph): mds/Locker.cc: reduce scope of forced_change_max
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:15 PM Revision e7d47827 (ceph): src/crush/mapper.c: reduce scope of some local variables
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:11 PM Revision e0f10845 (ceph): auth/Crypto.cc: reduce scope of local variable in_buf
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
12:11 PM Revision aa11c68f (ceph): rgw/rgw_rados.cc: remove not needed code
Fix for cppcheck warning:
[src/rgw/rgw_rados.cc:2390]: (warning) Assignment of function
parameter has no effect out...
Danny Al-Gaaf
12:10 PM Revision cd48f570 (ceph): rgw/rgw_gc.cc: fix possible NULL pointer dereference
Fix/silence cppcheck warning:
[src/rgw/rgw_gc.cc:185] -> [src/rgw/rgw_gc.cc:181]: (error) Possible
null pointer der...
Danny Al-Gaaf
12:04 PM Revision 403bfa43 (ceph): osd/OSD.cc: remove unused variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
11:34 AM devops Feature #4947 (Resolved): Chef: Support for custom repositories
Alexandre Marangone
11:17 AM Bug #5020: osd: 2.5 deep-scrub stat mismatch, got 710/665 objects, 0/0 clones, 2908160/2723840 bytes
eaf3abf3f9a7b13b81736aa558c9084a8f07fdbe and
72bf5f4813c273210b5ced7f7793bc1bf813690c are both required.
Samuel Just
11:16 AM Bug #5057 (Duplicate): assertion in DeleteOp::_begin
Samuel Just
11:14 AM Subtask #5085 (Rejected): PG::merge_log should not have side effects other than on the log & miss...
The modifications on info around
* https://github.com/ceph/ceph/blob/master/src/osd/PG.cc#L678
* https://github.com...
Loïc Dachary
10:47 AM Subtask #5046: Factor out PG logs, PG missing
pg_info_t ( including pg_stat_t ) is modified during the log merging phase but it should not be the case. When factor... Loïc Dachary
10:45 AM Subtask #5046: Factor out PG logs, PG missing
read PG::merge_log PG::merge_old_entry PG::rewind_divergent_log PG::proc_replica_log
pg_log_t + pg_missing_t
Read ...
Loïc Dachary
10:40 AM Bug #4895: leveldb: mon workload makes store.db grow without bound
If anybody who sees this can generate a leveldb trace file (procedure is described above) i think that will help.
...
Sage Weil
04:56 AM Bug #4895: leveldb: mon workload makes store.db grow without bound
When this state occurs, leveldb compacts on trim as expected, but the store either too large or growing fast enough t... Mike Dawson
04:06 AM Bug #4895: leveldb: mon workload makes store.db grow without bound
On IRC, Florian Wiessner have been mentioning some strange behaviors that may be related to leveldb growth/compaction... Joao Eduardo Luis
12:51 AM Bug #4895: leveldb: mon workload makes store.db grow without bound
I'm on 0.61.2 as well and I can report that I struck this issue too. tnt suggested the MON restart and that fixed it.... Nigel Williams
12:06 AM Bug #4895: leveldb: mon workload makes store.db grow without bound
I'd like to report that I also have this happenning ...
All the mons have grown to > 9G this night (from 200 M usu...
Sylvain Munaut
10:26 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
Done. Reinstalling ceph and repaired from backups - I have troubles with monitor reinit as I do before. Wiped new mon... Denis kaganovich
09:43 AM CephFS Bug #5079 (Resolved): assert in MDCache::_recovered()
thanks, this one was easy to fix.
commit:64871e093159ad06d84fb2a84c7808a81800dfc4
Sage Weil
09:38 AM Bug #5081: Data migration and recover slow after changed OSD weight
btw, simpler to do 'ceph osd crush reweight osd.8 .25'
it is normal to have a bit of a long tail. also note that ...
Sage Weil
08:17 AM rbd Subtask #5028 (Resolved): rbd: treat clones with zero parent overlap as non-layered
The following was committed to the ceph-client "testing" branch.
70cf49cf rbd: ignore zero-overlap parent
Alex Elder
08:16 AM Bug #5027 (Resolved): rbd: support reading parent page data for writes
The following was committed to the ceph-client "testing" branch:
b91f09f1 rbd: support reading parent page data fo...
Alex Elder
08:15 AM Bug #5038 (Resolved): krbd: fix parent request size assumption
The following was committed to the ceph-client "testing" branch:
ebda6408 rbd: fix parent request size assumption
Alex Elder
08:14 AM Bug #5026 (Resolved): libceph: allow osd requests to be reused
The following was committed to the ceph-client "testing" branch:
c10ebbf5 libceph: init sent and completed when st...
Alex Elder
06:02 AM Revision 17d8ee9d (ceph): Fix some little/big endian issues
Ceph uses little endian, this patch fixes some endian issues
while Ceph running on big endian machine.
Signed-off-by...
Li Wang
06:02 AM Revision 769a16d6 (ceph): Makefle: force char to be signed
On an armv7l build, we see errors like
warning: rgw/rgw_common.cc:626:16: comparison is always false due to limited...
Sage Weil
04:58 AM Linux kernel client Bug #5043 (Fix Under Review): Oops in remove_osd
The following patch has been posted for review.
[PATCH] libceph: must hold mutex for reset_changed_osds()
Alex Elder
03:25 AM Bug #5069: monitor crashed during mon thrash in nightlies
I have some more logs - unfortunatelly, the mon is unable to start up and rejoin the cluster after this assert. Florian Wiessner
12:18 AM Revision 1d6ed811 (ceph): Merge branch 'wip-4783'
Reviewed-by: Sam Just <sam.just@inktank.com> David Zafman
12:18 AM Revision c0378b60 (ceph): OSD: Repair with 0 fixed doesn't complete properly
Queue DoRecovery() event on any repair
Signed-off-by: David Zafman <david.zafman@inktank.com>
David Zafman
12:18 AM Revision 3759daa9 (ceph): OSD: After repairs finish a new deep-scrub should be avoided
When errors fixed, clear them so pg not inconsistent and no deep-scrub needed
In the rare case of incomplete repair, ...
David Zafman

05/15/2013

11:26 PM Bug #5059: PGs can get stuck degraded if OSD removed before being out
CORRECTION: A lost OSD can be marked out and crush will recalculate replica locations. But administrator accidentall... David Zafman
11:23 PM Bug #5082: OSD wrongly marked as down
reweighted OSD.9 -> OSD.9, OSD.21 down ... Ivan Kudryavtsev
11:11 PM Bug #5082: OSD wrongly marked as down
When it reports them down, they're down in tree also, after some seconds they're up again.
ceph osd crush reweight...
Ivan Kudryavtsev
11:00 PM Bug #5082: OSD wrongly marked as down
can you attach 'ceph osd tree' output before and after the command? it's not clear to me what is going on.. you shou... Sage Weil
10:32 PM Bug #5082: OSD wrongly marked as down
Could it be because I'm using the command above instead of
ceph osd crush reweight?
Ivan Kudryavtsev
10:25 PM Bug #5082: OSD wrongly marked as down
ceph version 0.56.4 (63b0f854d1cef490624de5d6cf9039735c7de5ca)
Ivan Kudryavtsev
10:24 PM Bug #5082 (Can't reproduce): OSD wrongly marked as down
During ceph crush manipulation
ceph osd crush set 17 osd.17 0.8 pool=default host=ceph-osd-2-1
I see messages ...
Ivan Kudryavtsev
10:32 PM Bug #4783 (Resolved): After repairs finish a new deep-scrub should be avoided
commit:3759daa9c41f274f2834ed57f8c58f9ab6a725d7 David Zafman
09:06 PM Bug #5081 (Can't reproduce): Data migration and recover slow after changed OSD weight
I'm using ceph version 0.56.4 (63b0f854d1cef490624de5d6cf9039735c7de5ca)
Now, I'm trying to make some osds with sm...
Ivan Kudryavtsev
06:08 PM devops Bug #4865 (In Progress): ceph-disk: activate fails on debian wheezy due to missing udev by-partuu...
Anonymous
05:57 PM devops Bug #4865: ceph-disk: activate fails on debian wheezy due to missing udev by-partuuid rules

(01:59:54 PM) sagelap [~sage@2607:f298:a:607:598c:d480:4af:b6ce] entered the room.
(02:00:44 PM) paravoid: http://...
Sage Weil
01:53 PM devops Bug #4865: ceph-disk: activate fails on debian wheezy due to missing udev by-partuuid rules
http://git.kernel.org/cgit/linux/hotplug/udev.git/commit/?id=693b6344e193f5aeca21df5f1c98fd32148006ac
paravoid sug...
Sage Weil
03:46 PM Revision e34a56f8 (ceph): doc: fix mkcephfs production use, deprecated note
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:55 PM rbd Bug #4559 (In Progress): krbd: kernel BUG when mapping unexisting rbd device
Finally starting to look at this again. I'm going to
start by trying to reproduce it in the newly-reported
context.
Alex Elder
02:15 PM Linux kernel client Bug #5043: Oops in remove_osd
I think I may have just found something.
It appears as though access to the osd client's osd red-black
tree is su...
Alex Elder
01:16 PM Linux kernel client Bug #5043: Oops in remove_osd
The ceph_osd_client in question is osdc @ 0xffff8802244a4950
The ceph_osd is osd @ 0xffff88020a285000
(struct rb_...
Alex Elder
02:14 PM Bug #4999: monitor sync failure
Hmmm, I've been running all day, but I haven't reproduced yet.
I'm seeing a mon repeatedly drop out of and then re...
Jim Schutt
07:10 AM Bug #4999: monitor sync failure
I'll be attempting to reproduce this morning with the debug mon = 10, debug ms = 1. Jim Schutt
06:43 AM Bug #4999: monitor sync failure
Yeah, I'd say so.
Anyway, this ticket has two (independent, I hope) bugs: the sync bug and the leveldb bug. I've ...
Joao Eduardo Luis
01:52 PM Bug #5077: nightlies: single node cluster hung waiting for ceph_health to be OK
looks like one of the monitors on the single node cluster went down
Tamilarasi muthamizhan
10:37 AM Bug #5077: nightlies: single node cluster hung waiting for ceph_health to be OK
also,
ubuntu@teuthology:/a/teuthology-2013-05-15_01:00:04-rados-master-testing-basic/13515$ cat orig.config.yaml ...
Tamilarasi muthamizhan
10:36 AM Bug #5077: nightlies: single node cluster hung waiting for ceph_health to be OK
... Tamilarasi muthamizhan
10:35 AM Bug #5077 (Resolved): nightlies: single node cluster hung waiting for ceph_health to be OK
logs: ubuntu@teuthology:/a/teuthology-2013-05-15_01:00:04-rados-master-testing-basic/13506... Tamilarasi muthamizhan
11:37 AM CephFS Bug #5079 (Resolved): assert in MDCache::_recovered()
While trying to reproduce 4999 with the requested logging, I got this MDS assert.
I'm running cuttlefish branch @ ...
Jim Schutt
11:30 AM devops Bug #5065 (Duplicate): ceph-deploy: osd prepared but not activated on debian-wheezy
#4865 Sage Weil
10:59 AM Bug #5078 (Won't Fix): Debian missing sudo results in unclear error
Due to the missing 'sudo' command in debian the install.py fails while resolving the hostnames.
Perhaps a 'you need ...
Rens Reinders
10:54 AM CephFS Bug #5021: ceph-fuse: crash on traceless reply
... Sage Weil
10:32 AM CephFS Bug #5021: ceph-fuse: crash on traceless reply
... Sage Weil
10:28 AM rgw Bug #4497: rgw: FAIL: testSlashInName (test.functional.tests.TestContainer)
The test still generates a too short bucket name. The swift tree on github has that fixed but the test pulls from cep... Yehuda Sadeh
09:32 AM rgw Bug #4497: rgw: FAIL: testSlashInName (test.functional.tests.TestContainer)
ubuntu@teuthology:/a/teuthology-2013-05-15_01:30:03-upgrade-master-testing-basic/13769 Tamilarasi muthamizhan
10:25 AM Bug #5076: nightlies:segfault in ceph_test_filestore_idempotent_sequence
ubuntu@teuthology:/a/teuthology-2013-05-15_01:00:04-rados-master-testing-basic/13530 Tamilarasi muthamizhan
10:24 AM Bug #5076 (Resolved): nightlies:segfault in ceph_test_filestore_idempotent_sequence
logs:... Tamilarasi muthamizhan
10:00 AM Bug #5075 (Resolved): filejournal tests failing in nightlies
logs: ubuntu@teuthology:/a/teuthology-2013-05-15_01:00:04-rados-master-testing-basic/13528
2013-05-15T01:20:47.123...
Tamilarasi muthamizhan
09:55 AM Bug #5074 (Can't reproduce): nightlies: timed out waiting for admin socket of restarted osd
logs:ubuntu@teuthology:/a/teuthology-2013-05-15_01:00:04-rados-master-testing-basic/13494
2013-05-15T05:05:50.579 ...
Tamilarasi muthamizhan
09:08 AM Bug #5062: mon: 0.61.2 asserts on AuthMonitor during monitor start
By finishing a sync before the current proposal is fully committed (and now that I think of it, we might still have t... Joao Eduardo Luis
08:56 AM Bug #5062: mon: 0.61.2 asserts on AuthMonitor during monitor start
Any idea how sync could have missed a version? Greg Farnum
07:06 AM Bug #5062 (Need More Info): mon: 0.61.2 asserts on AuthMonitor during monitor start
Waiting on Florian to provide me a copy of a healthy monitor's data dir to assess whether the "corrupted" state is pa... Joao Eduardo Luis
08:57 AM rgw Feature #5073: rgw: create tenant namespace
How would this interact with stuff like bucket URLs? Greg Farnum
07:06 AM rgw Feature #5073 (New): rgw: create tenant namespace
Currently rgw has a single global namespace. It is possible to provide different namespaces for different tenants as ... Yehuda Sadeh
07:40 AM Bug #5069: monitor crashed during mon thrash in nightlies
Logs are gone. I'll try to reproduce. In the future, it would be nice to grab the logs and data dirs before nuking/a... Joao Eduardo Luis
07:01 AM rgw Feature #4098 (In Progress): rgw: multi-site: Global Bucket Namespace
Yehuda Sadeh
06:39 AM Bug #5072 (Can't reproduce): mon: segfault on leveldb::Table::Open() during monitor start
Jim Schutt is hitting this after triggering #4999... Joao Eduardo Luis
05:05 AM Revision 3ac7fb8a (ceph): rgw: parse location constraint on bucket creation
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:04 AM Revision a7e80e2b (ceph): rgw: a function to read all request input
Factor out this useful function. Also make sure that
we never read more than a specified (large enough) max.
Signed-...
Yehuda Sadeh
04:35 AM Revision 84c17b68 (ceph): rgw: update json encode/decode for new bucket info
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
01:52 AM Revision b6464076 (ceph): Modified PutMetadata::get_data() to handle chunked transfers
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
01:52 AM Revision c8ac2879 (ceph): rgw: add region to bucket info
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
01:52 AM Revision 648c3bc2 (ceph): With admin metadata rest API implementation and unit test cases for it
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
01:52 AM Revision bf612f04 (ceph): rgw: modify metadata RESTful implementation
REST handler should derive from RGWHandler_Auth_S3,
other changes.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
01:52 AM Revision efaa33f3 (ceph): Fixed certain bugs on rest admin APIs
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
01:52 AM Revision 02197744 (ceph): Removed the check for parameter validation in op_get()
Signed-off-by: Babu Shanmugam <anbu@enovance.com> Babu Shanmugam
01:52 AM Revision 3c8ef2b9 (ceph): ceph_json: don't try to parse NULL buffer
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
12:06 AM Revision 2a441aa2 (ceph): Merge pull request #279 from ceph/wip-libcephfs-env
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
12:06 AM Revision 8f3fb972 (ceph): Added OSD to glossary, removed parenthetical.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:05 AM Revision f36ec02f (ceph): doc: Updated architecture document.
fixes: #2968
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins

05/14/2013

11:38 PM Revision 624c8cc3 (ceph): Merge branch 'wip-5049'
Reviewed-by: Sam Just <sam.just@inktank.com> David Zafman
11:28 PM Revision 48e89b51 (ceph): OSD: scrub interval checking
Do arithmetic so large intervals don't wrap
Fix log messages to reflect the change and improve output
Add message whe...
David Zafman
11:28 PM Revision 1f4e7a5a (ceph): OSD: Don't scrub newly created PGs until min interval
Set initial values for last_scrub_stamp, last_deep_scrub_stamp
fixes: #5050, #5051
Signed-off-by: David Zafman <dav...
David Zafman
11:24 PM Revision e582e15c (ceph): Fix scrub_test.py permission error
Add description of yaml file including log-whitelist
Add sudo to dd that corrupts data
Signed-off-by: David Zafman <...
David Zafman
11:02 PM Revision 7b93d287 (ceph): doc/release-notes: v0.62
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:38 PM Revision 2ba167be (ceph): Merge branch 'next'
Gary Lowell
10:28 PM Revision 5ff703d6 (ceph): Merge pull request #283 from dachary/wip-5058
internal documentation proofreading
Reviewed-by: Sam Just <sam.just@inktank.com>
athanatos
10:05 PM CephFS Bug #5021 (In Progress): ceph-fuse: crash on traceless reply
Sage Weil
09:51 PM Bug #5060 (Can't reproduce): osd: decode failure in load_pgs on 0.56.4
If you see this again, please capture the stacktrace of the original before recovering, and if you can, generate a co... Sage Weil
05:14 PM Bug #5060: osd: decode failure in load_pgs on 0.56.4
Yes, it was in 0.56.4 and before.
I can not reproduce because already formatted.
Ivan Kudryavtsev
09:41 AM Bug #5060 (Need More Info): osd: decode failure in load_pgs on 0.56.4
Was the ceph-osd process that originally crashed under load also 0.56.4? Or an earlier version? (Do you have the lo... Sage Weil
05:20 AM Bug #5060 (Can't reproduce): osd: decode failure in load_pgs on 0.56.4
On of my osd hosts crashed on high load and after rebooted it is unable to start some osds.
Error log for osd.3 is ...
Ivan Kudryavtsev
09:02 PM Revision 52b0438c (ceph): doc/rados/configuration: fix [mon] osd min down report* config docs
Fix other osd -> mon section name, and note the old config value name prior
to v0.62.
Fixes: #5044.
Signed-off-by: S...
Sage Weil
08:46 PM Revision 1c53991e (ceph): fix typos and add hyperlink to peering
s/;/:/
s/up_acting_affected/acting_up_affected/
Add relative link to ../../peering
http://tracker.ceph.com/issues/50...
Loïc Dachary
08:46 PM Revision 2a4425af (ceph): reflect recent changes in the pg deletion logic
No need to wait on DeletingStateRef for flush https://github.com/ceph/ceph/commit/d3dd99b725afaa026fe6f700ddc14a7f657... Loïc Dachary
08:46 PM Revision b7d4012c (ceph): typo s/come/some/
http://tracker.ceph.com/issues/5058 refs #5058
Signed-off-by: Loic Dachary <loic@dachary.org>
Loïc Dachary
08:44 PM Revision dbddffef (ceph): update op added to a waiting queue or discarded
The decision to discard an op happens either in OSD or in PG.
The operation queue goes to a single OpWQ object if wai...
Loïc Dachary
07:58 PM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
1) Just one mo symptom: "assert(soid.snap == *curclone);" (IMHO there are too similar to others, include "clone witho... Denis kaganovich
07:44 PM Revision e9935f2c (ceph): ceph_json: fix bool decoding
"false" means false.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:26 PM Bug #4783 (Fix Under Review): After repairs finish a new deep-scrub should be avoided
David Zafman
07:25 PM Revision 67ecd75c (ceph): rgw: json_encode json a bit differently
Encode map as a list, it's a more friendly representation.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:05 PM devops Feature #5071 (Duplicate): ceph-deploy osd list
There is a disk list that shows all disks (and, if they map to osds, some info about them).
ceph-deploy osd list s...
Sage Weil
06:04 PM devops Bug #4919: ceph-deploy: disk list doesn't properly display all the disks on a VM
I got as far as verifying that the VM didn't list all devices in /dev/disk/by-path.. no idea why. it seemed to list ... Sage Weil
05:31 PM Revision afeb8f2d (ceph): md/Sever.cc: fix straydn assert
From fb222a0a1c98a4141b6d0e79eac7a41c208f7147, we only know straydn is
non-null if oldin is non-null.
Signed-off-by:...
Sage Weil
05:30 PM Revision 29d8ec4e (ceph): Merge pull request #285 from dalgaaf/wip-da-CID-fixes-2-v3
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
05:20 PM rbd Bug #5070: rbd map failed and stalled in "D"
probably connected with #4522 Ivan Kudryavtsev
05:15 PM rbd Bug #5070 (Can't reproduce): rbd map failed and stalled in "D"
It happened with one device while others are mapped well.
[6765922.875713] ------------[ cut here ]------------
[...
Ivan Kudryavtsev
05:20 PM Revision e69257ea (ceph): rgw/rgw_user.cc: fix possible NULL pointer dereference
CID 1019559 (#1 of 1): Dereference after null check (FORWARD_NULL)
var_deref_model: Passing null pointer "usr" to f...
Danny Al-Gaaf
05:15 PM Revision d6929021 (ceph): mds/Server.cc: fix possible NULL pointer dereference
Assert if straydn is NULL.
CID 1019554 (#2 of 2): Dereference after null check (FORWARD_NULL)
var_deref_model: Pas...
Danny Al-Gaaf
05:15 PM CephFS Bug #4832 (Need More Info): mds: failed auth_unpin assert
cranked up mds logs in qa.. should get useful info next time we hit this. Sage Weil
05:07 PM CephFS Bug #4832: mds: failed auth_unpin assert
recent logs: ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:46-kernel-next-testing-basic/13128 Tamilarasi muthamizhan
05:10 PM rgw Bug #4497: rgw: FAIL: testSlashInName (test.functional.tests.TestContainer)
recent logs: ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:34-rgw-next-testing-basic/13055 Tamilarasi muthamizhan
05:10 PM Documentation #2968 (Resolved): doc: complete architecture section
This architecture document has enough information that we can close this bug. We'll, of course, continue to update it. John Wilkins
05:07 PM Revision fb222a0a (ceph): mds/Server.cc: fix possible NULL pointer dereference
Assert of straydn is NULL here.
CID 1019558 (#1 of 1): Dereference after null check (FORWARD_NULL)
var_deref_model...
Danny Al-Gaaf
05:05 PM rbd Bug #5032: xfstest 269 failure
logs: ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:46-kernel-next-testing-basic/13125 Tamilarasi muthamizhan
05:02 PM Revision c87788b6 (ceph): mds/Server.cc: fix possible NULL pointer dereference
Assert if destdn == NULL.
CID 1019557 (#1 of 1): Dereference after null check (FORWARD_NULL)
var_deref_model: Pass...
Danny Al-Gaaf
04:59 PM Bug #5069 (Resolved): monitor crashed during mon thrash in nightlies
logs: ubuntu@teuthology:/a/teuthology-2013-05-14_01:00:05-rados-next-testing-basic/12938... Tamilarasi muthamizhan
04:57 PM Bug #4967: Misbehaving OSD sets over half of the cluster as down despite "osd min down reporters ...
Nodes were not rebooted and those OSDs that were marked as down weren't restarted (ps shows them as started on Apr 3r... Faidon Liambotis
02:06 PM Bug #4967 (Can't reproduce): Misbehaving OSD sets over half of the cluster as down despite "osd m...
We are not sure about why logging didn't show OSDs getting marked down. It is possible that OSDs were restarted, or ... David Zafman
04:52 PM Fix #4567: mon: refactor mon caps; allow restriction of key/value storage by prefix
Sage Weil
04:50 PM Revision 088455f8 (ceph): librados/AioCompletionImpl.h: add missing Lock
Add missing Lock around code changing AioCompletionImpl::rval/ack and safe
in C_AioCompleteAndSafe::finish().
CID 10...
Danny Al-Gaaf
04:44 PM Revision 8a52350d (ceph): src/dupstore.cc: check return value of list_collections()
CID 1019545 (#1 of 1): Unchecked return value (CHECKED_RETURN)
check_return: Calling function "ObjectStore::list_co...
Danny Al-Gaaf
04:43 PM Revision 70a4a971 (ceph): mds/Server.cc: fix possible NULL pointer dereference
CID 1019555 (#1 of 1): Dereference after null check (FORWARD_NULL)
var_deref_model: Passing null pointer "in" to fu...
Danny Al-Gaaf
04:42 PM Bug #5049 (Resolved): scrub interval checking
commit:48e89b5171b912eba3521d918c437978107fc298 David Zafman
04:41 PM Bug #5050 (Resolved): initial scrub timestamp is 0.000000
commit:1f4e7a5aafdace9fb82d311ec4ff0a1a6c7c9a31 David Zafman
10:23 AM Bug #5050 (In Progress): initial scrub timestamp is 0.000000
David Zafman
04:41 PM Bug #5051 (Resolved): initial deep scrub timestamp is 0.0000000
commit:1f4e7a5aafdace9fb82d311ec4ff0a1a6c7c9a31 David Zafman
10:19 AM Bug #5051 (In Progress): initial deep scrub timestamp is 0.0000000
David Zafman
04:39 PM Revision 21489acf (ceph): src/rbd.cc: use 64-bits to shift 'order'
CID 1019568 (#1 of 1): Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN)
overflow_before_widen: Potentially ov...
Danny Al-Gaaf
04:39 PM Revision 043ea2ce (ceph): tools/ceph.cc: close file descriptor in error case
CID 717121 (#1 of 1): Resource leak (RESOURCE_LEAK)
leaked_handle: Handle variable "fd" going out of scope leaks th...
Danny Al-Gaaf
04:39 PM Revision c3c140b3 (ceph): tools/ceph.cc: close file descriptor in error case
CID 717122 (#1 of 1): Resource leak (RESOURCE_LEAK)
leaked_handle: Handle variable "fd" going out of scope leaks
...
Danny Al-Gaaf
04:39 PM Revision eac545e1 (ceph): tools/ceph.cc: cleanup memory allocated for 'buf'
CID 717123 (#1-2 of 2): Resource leak (RESOURCE_LEAK)
leaked_storage: Variable "buf" going out of scope leaks the s...
Danny Al-Gaaf
04:39 PM Revision 8df55e0a (ceph): test/test_cors.cc: initialize key_type in constructor
CID 1019635 (#1 of 1): Uninitialized pointer field (UNINIT_CTOR)
uninit_member: Non-static class member "kt" is not...
Danny Al-Gaaf
04:39 PM Revision 98836309 (ceph): mon/QuorumService.h: remove unused QuorumService::flags
CID 1019626 (#1 of 1): Uninitialized scalar field (UNINIT_CTOR)
uninit_member: Non-static class member "flags" is n...
Danny Al-Gaaf
04:39 PM Revision 528ec353 (ceph): mon/Monitor.h: init 'crc' in constructor with '0'
CID 1019624 (#1 of 1): Uninitialized scalar field (UNINIT_CTOR)
uninit_member: Non-static class member "crc" is not...
Danny Al-Gaaf
04:39 PM Revision 3e446825 (ceph): mon/Monitor.cc: init 'timecheck_acks' with '0' in constructor
CID 1019623 (#1 of 1): Uninitialized scalar field (UNINIT_CTOR)
uninit_member: Non-static class member "timecheck_a...
Danny Al-Gaaf
04:39 PM Revision df4c099a (ceph): ceph-filestore-dump.cc: cleanup resource in error case
CID 1019590 (#1 of 1): Resource leak (RESOURCE_LEAK):
leaked_storage: Variable "rmt" going out of scope leaks the
s...
Danny Al-Gaaf
04:39 PM Revision 349cfb41 (ceph): ceph-filestore-dump.cc: cleanup on error case
CID 1019589 (#1 of 1): Resource leak (RESOURCE_LEAK)
leaked_storage: Variable "t" going out of scope leaks the
st...
Danny Al-Gaaf
04:39 PM Revision d8cb7dfc (ceph): filestore/test_idempotent_sequence.cc: fix FileStore leaks
CID 717107 (#1 of 1): Resource leak (RESOURCE_LEAK)
leaked_storage: Variable "store" going out of scope leaks the
...
Danny Al-Gaaf
04:39 PM Revision cab8e9bf (ceph): test/kv_store_bench.cc: fix resource leak
CID 727984 (#5 of 5): Resource leak (RESOURCE_LEAK)
leaked_storage: Variable "cb_args" going out of scope leaks the...
Danny Al-Gaaf
04:39 PM Revision 3c285c44 (ceph): scratchtool.c: cleanup rados_t on error
Make sure rados_shutdown() get called also in error case.
CID 717106 (#1 of 1): Resource leak (RESOURCE_LEAK)
leak...
Danny Al-Gaaf
04:39 PM Revision 7ea44ee0 (ceph): librbd/test_librbd.cc: free memory in test_list_children()
CID 719581 (#7 of 7): Resource leak (RESOURCE_LEAK)
CID 719581 (#6 of 7): Resource leak (RESOURCE_LEAK)
leaked_stor...
Danny Al-Gaaf
04:39 PM Revision 36028916 (ceph): test_filejournal.cc: cleanup memory in destructor
CID 716885 (#1 of 1): Resource leak in object (CTOR_DTOR_LEAK)
alloc_new: Allocating memory by calling "new C_SafeC...
Danny Al-Gaaf
04:19 PM Revision e0de0089 (ceph): mon: fix validatation of mds ids in mon commands
Fixes: #4996
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 5c305d63043762027323052b4bb3ae306...
Sage Weil
04:14 PM Revision 4c0d3eb7 (ceph): mon: fix validatation of mds ids in mon commands
Fixes: #4996
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 5c305d63043762027323052b4bb3ae306...
Sage Weil
04:13 PM Revision 9382379c (ceph): v0.62
Gary Lowell
02:50 PM Bug #5068 (Won't Fix): ceph_test_rados gets SIGFPE when run with no args
Probably dividing by zero; didn't look closely:... Dan Mick
02:08 PM rbd Feature #5067 (Resolved): librbd: configuration options to override default image creation parame...
From http://permalink.gmane.org/gmane.comp.file-systems.ceph.devel/15064:
For example, there could be just:
rbd...
Josh Durgin
02:03 PM Cleanup #5044 (Resolved): osd_min_down_reporters/osd_min_down_reports are incorrectly documented
commit:52b0438c66b23c5eec4eed62a489143f995f6c94 Sage Weil
01:58 PM Documentation #5058: docs/master/dev/osd_internals updates
Sam Just says : Some of the details of the PG op queueing have changed again in next. See "fd90105":https://github.co... Loïc Dachary
11:56 AM devops Bug #5066 (Resolved): Problems with ceph-deploy debs
* Used Distribution:
Ubuntu 12.04.2 (amd64)
* Installed packages from external sources:
- kernel 3.8.11 ...
Peter Wienemann
11:27 AM devops Bug #5065 (Duplicate): ceph-deploy: osd prepared but not activated on debian-wheezy
tried this on two different debian machines [burnupi24 and burnupi26].
on burnupi24:
hit ioerror and osd was no...
Tamilarasi muthamizhan
11:14 AM Bug #5064 (Won't Fix): mon/monclient: subscribe protocol does not allow cancellation
This only really comes up with the librados rados_monitor_log(); no other users (might potentially) need this. Sage Weil
11:13 AM Bug #4999: monitor sync failure
This failure is the first type, in sync_start_reply_timeout().
It looks just like the previous one, and I don't have...
Jim Schutt
10:39 AM Bug #4999: monitor sync failure
I've got a monitor that has just failed, so I'll see what gdb has to say
about that one.
Am I correct in assuming...
Jim Schutt
10:27 AM Bug #4999: monitor sync failure
After talking with Sage, this bug is being postponed until we get a log with higher debug levels to catch intermediat... Joao Eduardo Luis
10:23 AM Bug #4999 (Need More Info): monitor sync failure
Sage Weil
10:19 AM Bug #4999 (In Progress): monitor sync failure
Jim, still unable to restart the monitor? If so, could you by any chance run the monitor with gdb and check out what... Joao Eduardo Luis
11:05 AM Linux kernel client Bug #5043 (In Progress): Oops in remove_osd
I'm taking this for the time being.
This is in rb_erase().
Which means that maybe the osd client's red-black tr...
Alex Elder
10:51 AM Bug #5055 (Rejected): osd: crash in FileJournal::wrap_read_bl
wrap_read_bl returned EIO. Samuel Just
10:27 AM Bug #5062 (In Progress): mon: 0.61.2 asserts on AuthMonitor during monitor start
Joao Eduardo Luis
09:01 AM Bug #5062 (Can't reproduce): mon: 0.61.2 asserts on AuthMonitor during monitor start
Florian from Smart Weblications bumped into this crash on one of his monitors *roughly two hours after* upgrading fro... Joao Eduardo Luis
10:15 AM Bug #5054: deep scrub reports 1 inconsistent object
i think this is a dup of the bug sam is still worknig on Sage Weil
10:03 AM Feature #4839 (In Progress): api: make new CLI send old version of commands to old monitors durin...
Dan Mick
09:47 AM Bug #5056 (Won't Fix): rados_mon_workunits failed in the nightlies
this is a known problem with idempotency of the mon commands.. see fix #4635. we aren't going to fix it for bobtail/... Sage Weil
09:44 AM Bug #5059: PGs can get stuck degraded if OSD removed before being out
what does 'ceph osd tree' say? usually stuck degraded happens bc there aren't enough up/in osds Sage Weil
09:29 AM Bug #4698: osd suicide timed out after 150
ubuntu@teuthology:/a/teuthology-2013-05-14_01:30:05-upgrade-master-testing-basic/13144 Tamilarasi muthamizhan
09:20 AM Bug #4996 (Resolved): mon: bogus mds tell can crash monitors
Sage Weil
09:11 AM devops Bug #5063 (Rejected): Unexpected build warning
This issue is really just a reminder to myself to check this out:
glowell@pudgy:~/build/ceph-0.62/ceph$ dch -v 0.6...
Anonymous
08:34 AM Bug #5061 (Duplicate): Monitor crash with 0.61.2
Only now did I notice this is a duplicate of #4999
Marking it as such.
Thanks again Matthew!
Joao Eduardo Luis
06:43 AM Bug #5061 (Duplicate): Monitor crash with 0.61.2
https://pastee.org/sznze is the log at time of failure. I haven't been able to reproduce it yet. Matthew Via
03:48 AM Revision c5deb5db (ceph): doc/release-notes: v0.61.2
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:01 AM rbd Bug #4897: qemu rbd driver should allow manipulation of format 2, striped images
I'm running into this as well.
Wouldn't it be a start to at least start creating image with rbd_create2|3 instead ...
Wido den Hollander
01:45 AM CephFS Bug #5037: Ceph-MDS asserts after upgrade 0.56.2 -> 0.56.6
Our ceph is productive, yeah. We are only using rbd, not CephFS or RadosGW, though. SJust and Sage are familiar with ... Christopher Kunz
01:11 AM CephFS Bug #5036: `ls` hangs on random folder
By turning on the debug mode of MDS:... Quan Tong Anh
12:17 AM Revision 45e19510 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil

05/13/2013

11:58 PM Revision 97a73091 (ceph): rgw: tie bucket/user removal to mdlog differently
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
11:52 PM Bug #5059 (Won't Fix): PGs can get stuck degraded if OSD removed before being out

If an OSD goes down and the user marks it lost and/or removes before it is marked out, then PGs are left degraded a...
David Zafman
11:45 PM Documentation #5058 (Resolved): docs/master/dev/osd_internals updates
"work in progress":https://github.com/dachary/ceph/tree/wip-5058
* See OSD::handle_pg_(notify|info|log|query) rela...
Loïc Dachary
11:07 PM Revision 393140e7 (ceph): Merge pull request #281 from ceph/wip-rbd-rm-enoent
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
11:06 PM Revision 4bb40633 (ceph): ceph_test_libcephfs: parse environment
Lets you use CEPH_ARGS to get output from the tester.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:56 PM Revision f24b8fb9 (ceph): PG: fix some brace styling
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just
10:55 PM Revision 72bf5f48 (ceph): PG: subset_last_update must be at least log.tail
Fixes: 5020
Backport: bobtail, cuttlefish
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman...
Samuel Just
09:49 PM Revision 395a775a (ceph): SimpleThrottle: fix -ENOENT checking
The condition was reversed. Rewrite it so it's clear that we're
ignoring -ENOENT only when m_ignore_enoent is set.
S...
Josh Durgin
09:26 PM Revision d06d0c3b (ceph): rgw: slightly simplify metadata abstraction
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
09:19 PM Revision 58836d9d (ceph): qemu: load the kvm module before trying to use it
It should be loaded before this, but in some cases it is not for some reason.
Signed-off-by: Josh Durgin <josh.durgi...
Josh Durgin
08:29 PM Revision bb6d1f07 (ceph): rgw: read bucket metadata before writing it
In order to keep track of version.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
08:24 PM Revision 88af2b0f (ceph): Replace mis-named mon config variables using mon_osd_min_down_reports/m...
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
David Zafman
07:35 PM Revision 225fefe5 (ceph): ceph-disk: add '[un]suppress-activate <dev>' command
It is often useful to prepare but not activate a device, for example when
preparing a bunch of spare disks. This mar...
Sage Weil
07:31 PM Revision de4678fa (ceph): Merge pull request #280 from ceph/wip-4996
Reviewed-by: Joao Luis <joao.luis@inktank.com> Sage Weil
06:58 PM Cleanup #5044: osd_min_down_reporters/osd_min_down_reports are incorrectly documented
88af2b0f7b951367e670869db76e57f0d970aa38
Update to master branch for the next release renaming these these values to...
David Zafman
01:01 PM Cleanup #5044: osd_min_down_reporters/osd_min_down_reports are incorrectly documented

I pushed the change to rename these configuration variables to wip-5044.
David Zafman
11:52 AM Cleanup #5044: osd_min_down_reporters/osd_min_down_reports are incorrectly documented
we should rename these config fields in master branch and add an item to /PendingReleaseNotes documenting the change.... Sage Weil
11:49 AM Cleanup #5044 (Resolved): osd_min_down_reporters/osd_min_down_reports are incorrectly documented

These 2 configuration variables do NOT follow standard convention that a mon variable begin mon_. Everywhere I've ...
David Zafman
06:58 PM Revision fea78254 (ceph): v0.61.2
Gary Lowell
06:49 PM Revision 5c305d63 (ceph): mon: fix validatation of mds ids in mon commands
Fixes: #4996
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:40 PM Revision 8464c064 (ceph): mon: Monitor: tolerate GV duplicates during conversion
Fixes: #4974
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
(cherry picked from commit ba05b16ee2b6e25141f...
Joao Eduardo Luis
06:39 PM Revision 11041163 (ceph): Merge pull request #278 from ceph/wip-4974
Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
06:37 PM Revision ba05b16e (ceph): mon: Monitor: tolerate GV duplicates during conversion
Fixes: #4974
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
05:16 PM Revision 6db072d4 (ceph): libcephfs: add ceph_conf_parse_env()
This exists in the librados API.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:16 PM Revision 9ec77ebb (ceph): ceph_test_libcephfs: fix xattr test
This broke in 0c70e44630734760fd36e0c770a33fb0e74b42a4.
Fixes: #5030
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:15 PM Bug #5057 (Duplicate): assertion in DeleteOp::_begin
log: ubuntu@teuthology:/a/teuthology-2013-05-13_01:00:04-rados-master-testing-basic/12278... Tamilarasi muthamizhan
05:11 PM Bug #5056 (Won't Fix): rados_mon_workunits failed in the nightlies
log: ubuntu@teuthology:/a/teuthology-2013-05-13_01:00:04-rados-master-testing-basic/12200... Tamilarasi muthamizhan
04:58 PM Bug #5055 (Rejected): osd: crash in FileJournal::wrap_read_bl
logs: ubuntu@teuthology:/a/teuthology-2013-05-12_01:00:05-rados-master-testing-basic/11664... Tamilarasi muthamizhan
04:54 PM Bug #5054 (Resolved): deep scrub reports 1 inconsistent object
logs: ubuntu@teuthology:/a/teuthology-2013-05-12_01:00:05-rados-master-testing-basic/11672... Tamilarasi muthamizhan
04:52 PM Revision 9bb58b2a (ceph): OSD: We need to wait on CLEARING_DIR, not DELETED_DIR
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
04:42 PM Revision c24ffc16 (ceph): Merge pull request #274 from dalgaaf/wip-da-fix-osd_h
osd/OSD.h: fix try_stop_deletion
Reviewed-by: Sam Just <sam.just@inktank.com>
athanatos
04:38 PM rbd Bug #5053 (Duplicate): qemu xfs test failed in the nightlies
Josh Durgin
04:35 PM rbd Bug #5053 (Duplicate): qemu xfs test failed in the nightlies
... Tamilarasi muthamizhan
04:30 PM Bug #5052 (Duplicate): kclient_workunit_misc test failed in the nightlies
the test failed, as rsync was unable to copy a few files due to permission denied errors,
logs: ubuntu@teuthology:...
Tamilarasi muthamizhan
04:18 PM Bug #5020 (Pending Backport): osd: 2.5 deep-scrub stat mismatch, got 710/665 objects, 0/0 clones,...
Samuel Just
02:24 PM Bug #5020: osd: 2.5 deep-scrub stat mismatch, got 710/665 objects, 0/0 clones, 2908160/2723840 bytes
I think I found the issue. wip_5020. Testing. Samuel Just
01:18 PM Bug #5020: osd: 2.5 deep-scrub stat mismatch, got 710/665 objects, 0/0 clones, 2908160/2723840 bytes
Got it reproduced with logs. Samuel Just
03:33 PM Bug #4873 (Can't reproduce): osd: scrub found missing object on primary
David Zafman
02:54 PM Bug #5051 (Resolved): initial deep scrub timestamp is 0.0000000
The initial deep scrub timestamp is 0.000000 and it should be the current time. This causes the OSD to do a deep scr... Mark Nelson
02:52 PM Bug #5050 (Resolved): initial scrub timestamp is 0.000000
When PGs are created, the initial scrub timestamp is 0.000000 where it should be the current time. With the current ... Mark Nelson
02:39 PM Bug #5049 (In Progress): scrub interval checking
David Zafman
02:38 PM Bug #5049 (Resolved): scrub interval checking
In OSD.cc right now we set:
utime_t max = ceph_clock_now(g_ceph_context);
utime_t min = max;
min -= g_conf->osd_...
Mark Nelson
02:39 PM Bug #4698: osd suicide timed out after 150
... Tamilarasi muthamizhan
02:37 PM Bug #4698: osd suicide timed out after 150
logs: ubuntu@teuthology:/a/teuthology-2013-05-13_01:30:03-upgrade-master-testing-basic/12483... Tamilarasi muthamizhan
02:33 PM Bug #4967 (In Progress): Misbehaving OSD sets over half of the cluster as down despite "osd min d...
David Zafman
01:54 PM rbd Bug #4959: xfstest 17 failure
The same test failed again in the same way. This time rbd caching was enabled, while the first time it was disabled.
...
Josh Durgin
01:09 PM rbd Fix #5048 (Resolved): krbd: limit of ~230 mapped images at once
this is just because we are using our major/minor device ids in a stupid way, iirc. Sage Weil
12:55 PM Bug #4996 (Pending Backport): mon: bogus mds tell can crash monitors
Sage Weil
12:50 PM Bug #4996: mon: bogus mds tell can crash monitors
Is this already backported to where it needs to be? Greg Farnum
12:36 PM Bug #4996 (Resolved): mon: bogus mds tell can crash monitors
Sage Weil
12:40 PM Bug #5024 (Resolved): mon_debug_dump_transactions should default to False
Sage Weil
08:59 AM Bug #5024 (Fix Under Review): mon_debug_dump_transactions should default to False
Sage Weil
12:40 PM Bug #4974 (Resolved): nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Sage Weil
11:28 AM Bug #4974 (In Progress): nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Comments on github. Greg Farnum
07:38 AM Bug #4974 (Fix Under Review): nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Joao Eduardo Luis
07:38 AM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Pushed wip-4974 to gh. It fixes Via's store, but haven't dumped the transactions yet to make sure the correct orderi... Joao Eduardo Luis
04:21 AM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Cause:
We had a bug on bobtail that would create duplicate GV versions, so there's a fair chance that at some poin...
Joao Eduardo Luis
04:07 AM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
plana's status is now gone, no logs on teuthology, but I'll update the ticket next with a description on what's happe... Joao Eduardo Luis
12:37 PM rbd Feature #3763 (Resolved): krbd: handle flattening of mapped image
Sage Weil
12:37 PM CephFS Feature #4326 (Resolved): qa: add samba + (kclient|ceph-fuse) to suite
Sage Weil
12:16 PM devops Bug #5047 (Closed): ceph build needs libboost 1.50 for debian sid
The build on sid needs a boost library newer than the default due to a conflict in header files. This is clamied to ... Anonymous
12:09 PM Bug #5038 (Fix Under Review): krbd: fix parent request size assumption
The following has been posted for review:
[PATCH] rbd: fix parent request size assumption
Alex Elder
07:53 AM Bug #5038 (Resolved): krbd: fix parent request size assumption
When Josh was reviewing a recent kernel rbd patch he pointed
out that a variable named "obj_size" was misleading bec...
Alex Elder
12:02 PM Subtask #5046 (Resolved): Factor out PG logs, PG missing
PG logs, PG missing: The logic for merging an authoritative PG log with another PG log while filling in the missing s... Loïc Dachary
12:00 PM Subtask #4928 (Rejected): PG/ReplicatedPG API
... Loïc Dachary
11:28 AM Linux kernel client Bug #5043 (Resolved): Oops in remove_osd
Stack output:
Stack traceback for pid 29892
0xffff88022140bf20 29892 2 1 6 R 0xffff88022140c3a8 ...
Sandon Van Ness
11:07 AM CephFS Bug #5021: ceph-fuse: crash on traceless reply
Never mind that comment, I was just looking at the job it happened on, not the actual failure... Greg Farnum
10:20 AM CephFS Bug #5021: ceph-fuse: crash on traceless reply
Will come back for another pass and verify, but I assume this is the disconnected inode error. Greg Farnum
10:54 AM CephFS Bug #5033: oops in ceph_put_wrbuffer_cap_refs
plana47 died with:
[0]kdb> bt
Stack traceback for pid 25102
0xffff88001c499f90 25102 23405 1 0 R 0x...
Sandon Van Ness
10:39 AM Feature #5042 (New): Backport option to disable deep scrub to bobtail
Mark Nelson
10:32 AM Feature #5041 (New): Deep scrub CPU limit behavior
Determine if deep scrub is properly using the CPU utilization limits (rather than just scrub). If not, should it? M... Mark Nelson
10:17 AM CephFS Bug #5030 (Resolved): libcephfs xattr test failure
Sage Weil
10:12 AM devops Feature #4954 (In Progress): ceph-deploy: help and document need to be updated for osd create
John Wilkins
10:11 AM CephFS Bug #5037: Ceph-MDS asserts after upgrade 0.56.2 -> 0.56.6
It couldn't find the actual table object in RADOS. We've seen this pop up a few times, but I believe it's always been... Greg Farnum
01:43 AM CephFS Bug #5037 (Can't reproduce): Ceph-MDS asserts after upgrade 0.56.2 -> 0.56.6
After upgrading our Ceph setup to 0.56.6 from 0.56.2, the MDS processes assert() on start and will not work.
This i...
Christopher Kunz
09:24 AM rbd Bug #5040 (Resolved): krbd: record that an parent info refresh has failed
In order to manage resized clone images (including the flattening
of a clone image) the kernel rbd client needs to g...
Alex Elder
09:01 AM CephFS Bug #5039 (Resolved): client: unlinking files leaves the cached entry behind
http://comments.gmane.org/gmane.comp.file-systems.ceph.user/1277
When unlinking a file, the client should make an ...
Mike Bryant
08:10 AM rgw Bug #4997: Seg Fault on rgw 0.61.1 with cluster in 0.61
gary, do you see a problem with matching up the versions like this? i think in radosgw's case it may be more importa... Sage Weil
07:34 AM rgw Bug #4997: Seg Fault on rgw 0.61.1 with cluster in 0.61
This is the issue I reported on the ML and is tracked in http://tracker.ceph.com/issues/4944 Sylvain Munaut
02:59 AM rgw Bug #4997: Seg Fault on rgw 0.61.1 with cluster in 0.61
, it was a problem with ceph-common and librados2 packages who wasn't up-to-date.
It's not the first time that thi...
Yann ROBIN
06:28 AM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
Do you suggest any workaround? I dont think that it is possible to downgrade to bobtail. Maciej Galkiewicz
05:59 AM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
I am affected by the same bug but only with ceph 0.61.1. Every time I try to map any volume (with proper keyring, id ... Maciej Galkiewicz
06:01 AM Revision db29f49f (ceph): Merge pull request #275 from ceph/wip-rbd-read-from-replica
Reviewed-by: Sage Weil <sage.weil@inktank.com> Josh Durgin
04:11 AM CephFS Bug #5036: `ls` hangs on random folder
As you can see, the @ls@ process is stuck in D state:
*@/proc/10297/status@*...
Quan Tong Anh
12:16 AM CephFS Bug #5036 (Resolved): `ls` hangs on random folder
strace hangs at "getdents(3,": https://clbin.com/LktUw
The informations when dumping via SysRq:...
Quan Tong Anh
02:31 AM Revision d5193460 (ceph): Objecter, librados: use only ObjectOperation form of sparse_read intern...
This will be used when exposing an ObjectOperation version of sparse_read()
to the librados user, and there's no reas...
Josh Durgin
02:31 AM Revision 442f0588 (ceph): librados: add sparse_read() to the C++ bindings for an ObjectOperation
This will allow it to be used with general aio_operate() so we don't have
to add new versions of each operation when ...
Josh Durgin
02:31 AM Revision 4ddaea70 (ceph): librados: add per-ObjectOperation flags for balanced and localized reads
These need to apply to the entire ObjectOperation, not just a subop,
so use a new enum and a new aio_operate() call t...
Josh Durgin
02:31 AM Revision 0c7414b1 (ceph): ReplicatedPG: send -EAGAIN for both balanced and localized reads
This logic for localized reads applies to balanced reads too.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
02:31 AM Revision 13ae13a9 (ceph): librbd: add options to enable balanced or localized reads for snapshots
Since snapshots never change, it's safe to read from replicas for them.
A common use for this would be reading from a...
Josh Durgin
02:26 AM Revision ed76824c (ceph): Objecter: fix error handling for decoding stat
r is just a local variable, changing it has no effect.
Set the per-operation return value if provided when a decoding...
Josh Durgin
01:31 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
FYI: I have code that finds the missing inode by using backtrace. The code is under test, will send out soon. Zheng Yan
01:11 AM CephFS Bug #5031: mds/MDCache.cc: 5221: FAILED assert(reconnected_snaprealms.empty())
The items left in reconnected_snaprealms should be other MDS's mdsdir. I comment out that line when running test Zheng Yan

05/12/2013

11:09 PM rbd Feature #3064 (Resolved): librbd: A way to read from nearby replicas
Flags to do this for reads of snapshots are added by commit:13ae13a9068afcd4eb4b3574c46875cad8c91ab6.
Making the i...
Josh Durgin
11:08 PM Feature #5035 (Resolved): rados: smarter localized reads
Currently localized reads just match based on client and osd ip matching, which was originally implemented for hadoop... Josh Durgin
12:36 AM Revision 82211f21 (ceph): qa: rsync test: exclude /usr/local
Some plana have non-world-readable crap in /usr/local/samba. Avoid
/usr/local entirely for that and any similar land...
Sage Weil
12:07 AM Revision 62eb49f6 (ceph): schedule_suite.sh: bump suite timeout from 6->8 hours
This captures the current slow rbd tasks.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

05/11/2013

06:15 PM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
we hit it again! leaving the machines locked and task hung.
ubuntu@teuthology:/a/teuthology-2013-05-11_01:30:03-u...
Sage Weil
05:58 PM Revision 459c7311 (ceph): osd/OSD.h: fix try_stop_deletion
Fix try_stop_deletion(): The comment above the while loop says "If we are
in DELETING_DIR or DELETED_DIR", but the wh...
Danny Al-Gaaf
05:41 PM CephFS Bug #5033 (Can't reproduce): oops in ceph_put_wrbuffer_cap_refs
... Sage Weil
05:38 PM rbd Bug #5032 (Closed): xfstest 269 failure
... Sage Weil
05:27 PM CephFS Bug #5031: mds/MDCache.cc: 5221: FAILED assert(reconnected_snaprealms.empty())
logs copied to logs/ subdir Sage Weil
05:27 PM CephFS Bug #5031 (Resolved): mds/MDCache.cc: 5221: FAILED assert(reconnected_snaprealms.empty())
... Sage Weil
05:25 PM Bug #4579: kclient + ffsb workload makes osds mark themselves down
ubuntu@teuthology:/a/teuthology-2013-05-11_01:00:38-fs-next-testing-basic/11289 Sage Weil
05:22 PM CephFS Bug #5030 (Resolved): libcephfs xattr test failure
2013-05-11T02:29:39.882 INFO:teuthology.task.workunit.client.0.out:[ RUN ] LibCephFS.Xattrs
2013-05-11T02:29:39...
Sage Weil
05:04 PM Bug #4813: pgs stuck creating
... Sage Weil
05:02 PM Bug #4976: osd powercycle triggers object corruption on xfs
... Sage Weil
10:58 AM rbd Feature #3763: krbd: handle flattening of mapped image
The following patches have been posted for review. They
are available in the "review/wip-flatten" branch of the
ce...
Alex Elder
10:57 AM rbd Feature #3763 (Fix Under Review): krbd: handle flattening of mapped image
The following patches have been posted for review. They
are available in the "review/wip-flatten" branch of the
ce...
Alex Elder
10:21 AM rbd Feature #3763: krbd: handle flattening of mapped image
I've got over 2800 iterations on UML and over 5300 iterations
on "normal" Linux running flattens while writing 16 co...
Alex Elder
10:56 AM rbd Subtask #5028 (Fix Under Review): rbd: treat clones with zero parent overlap as non-layered
The following patch has been posted for review. It is
available in the "review/wip-flatten" branch of the
ceph-cli...
Alex Elder
10:31 AM rbd Subtask #5028 (Resolved): rbd: treat clones with zero parent overlap as non-layered
When the overlap of a clone with its parent is 0, there
is no need to consult the parent for any image data any
mor...
Alex Elder
10:56 AM Bug #5027 (Fix Under Review): rbd: support reading parent page data for writes
The following patch has been posted for review. It is
available in the "review/wip-flatten" branch of the
ceph-cli...
Alex Elder
10:28 AM Bug #5027 (Resolved): rbd: support reading parent page data for writes
Currently, rbd_img_obj_parent_read_full() assumes the incoming
object request contains bio data. But if a layered i...
Alex Elder
10:55 AM Bug #5026 (Fix Under Review): libceph: allow osd requests to be reused
The following patch has been posted for review. It is
available in the "review/wip-flatten" branch of the
ceph-cli...
Alex Elder
10:27 AM Bug #5026 (Resolved): libceph: allow osd requests to be reused
Because certain fields in an osd request structure are never
cleared, any attempt to reuse a request leads to a fail...
Alex Elder
06:14 AM Revision f330d038 (ceph): fs/samba: disable smbtorture lock test
Until we fix #5025.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:02 AM Revision 89257384 (ceph): fs/samba: fix noceph mount point
We need to clean this dir up at the end of the test or else teuthology
will be unhappy with the dirty testdir. Use t...
Sage Weil
06:01 AM Revision 63203c6e (ceph): localdir: create/cleanup mnt.foo dir on local fs
This creates and cleans up a local mnt dir that can be consumed
by other tasks (like workunit, samba, etc), just like...
Sage Weil
05:02 AM Revision a0d238c3 (ceph): rgw: cache obj version
Also keep bucket objv_tracker on the request state.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
04:55 AM Revision 391f3abb (ceph): fs/samba: smbtorture: disable base.bench-hold* tests
These seem to hang, even when run on samba + local fs (no ceph). Sage Weil
04:44 AM Revision 857279b8 (ceph): fs/samba: add noceph.yaml baseline
Run samba against the a local directory to isolate issues not specific to
the ceph backend.
Sage Weil
04:44 AM Revision 464e5e3c (ceph): fs/samba: disable kernel build
bus error, bad file handle errors... maybe an issue with cifs.ko?
2013-05-10T19:58:02.736 INFO:teuthology.task.worku...
Sage Weil
03:57 AM Bug #4967: Misbehaving OSD sets over half of the cluster as down despite "osd min down reporters ...
Docs say "You can change the minimum number of osd down reports by adding an osd min down reports setting under the [... Faidon Liambotis
03:10 AM Revision 703bc2fd (ceph): config_opts: default mon_debug_dump_transactions to 'false'
otherwise, it chews mon log space at an alarming rate.
Fixes: #5024
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
02:00 AM Revision a095075f (ceph): ceph-qa: update Hadoop tests overrides
Changes to the install teuthology task have caused the
Hadoop tasks to fail. This patch fixes the test specification
...
Joe Buck
01:03 AM Revision 6205c3da (ceph): rados/osd-powrcycle: turn up mds logging
To catch #4832
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
01:03 AM Revision d0e9a19e (ceph): fs/samba: restructure and expand test collection
All workloads on samba, samba+fuse, samba+kernel. Workloads include
torture and cifs + {various workunits}
Sage Weil
12:33 AM Revision a12464f7 (ceph): Do not scan for vm locks when listing all machines.
Fixes: #4830
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Warren Usui
12:13 AM Revision b5e9b56f (ceph): Merge pull request #272 from ceph/wip-rbd-parallel
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil

05/10/2013

11:45 PM Revision ea0e0c7e (ceph): Merge branch 'wip-4273'
Reviewed-by: Sam Just <sam.just@inktank.com> David Zafman
11:32 PM Revision 996f1edc (ceph): task modified to include a '-' before the test script
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
11:17 PM Revision 93f27942 (ceph): Throttle: move start_op() to C_SimpleThrottle constructor
This is done by all callers right before constructing this.
Since C_SimpleThrottle is already responsible for calling...
Josh Durgin
11:17 PM Revision bfa10669 (ceph): librbd: only send non-zero copyup data
If the parent image is logically zero for the range of a child object,
it's equivalent to the object not existing. Sa...
Josh Durgin
11:17 PM Revision a6d0a254 (ceph): librbd: parallelize and simplify flatten
Flattening reads the logical child object from the parent image, and
then does a copyup operation if the data is non-...
Josh Durgin
11:17 PM Revision fb299d38 (ceph): librbd: move completion release into rbd_ctx_cb()
All the users of rbd_ctx_cb() do this separately right now, but
there's no reason to keep the completion around after...
Josh Durgin
11:17 PM Revision 613d7471 (ceph): librbd: run copy in parallel
Instead of using read_iterate(), loop over each period of objects in
the source, read from them asynchronously, and t...
Josh Durgin
11:17 PM Revision cfece23d (ceph): librbd: parallelize rollback
Use a SimpleThrottle like trim_image() to limit the number of
requests in flight.
Signed-off-by: Josh Durgin <josh.d...
Josh Durgin
11:13 PM CephFS Bug #5025 (Resolved): samba smbtorture lock test fails on kclient
... Sage Weil
11:12 PM Revision 922df6c5 (ceph): rgw: op->PutACLs uses the correct set_attr for buckets
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
10:55 PM Revision 1f0b947d (ceph): rgw: rados->set_attr() just calls rados->set_attrs()
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
10:19 PM CephFS Bug #5022 (Resolved): samba: smbtorture failures
Sage Weil
05:27 PM CephFS Bug #5022 (Resolved): samba: smbtorture failures
logs: ubuntu@teuthology:/a/teuthology-2013-05-10_01:00:36-fs-master-testing-basic/10437... Tamilarasi muthamizhan
10:14 PM Bug #4996 (Fix Under Review): mon: bogus mds tell can crash monitors
wip-4996
this will be soon obsoleted by the new cli work, but this fix should go to bobtail and cuttlefish.
Sage Weil
02:42 AM Bug #4996 (Resolved): mon: bogus mds tell can crash monitors
I was trying to run this command at the time:... Mike Bryant
10:08 PM Revision 7b408537 (ceph): Merge pull request #273 from dalgaaf/wip-da-CID-fixes-v2
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
10:02 PM Revision 4b3a7dcb (ceph): client/SyntheticClient.cc: fix another memory leak
Fix memory leak in read_random: call delete[] on buf before
call new[] again in the for-loop.
CID 717071 Resource le...
Danny Al-Gaaf
10:02 PM Revision 5af2cbfe (ceph): rados.cc: fix leaking of Formatter*
Make sure Formatter* is deleted in error case.
717096 Resource leak (CWE-404) (25 of 25 cases)
Signed-off-by: Danny...
Danny Al-Gaaf
10:02 PM Revision cb91f0fd (ceph): client/Client.cc: fix possible NULL pointer dereference
CID 751332 Dereference null return value (CWE-476)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
10:02 PM Revision f6635cb3 (ceph): client/SyntheticClient.cc: check return value of describe_layout()
Fix SyntheticClient::chunk_file(): check return value of
describe_layout() and handle the error.
CID 966615 Unchecke...
Danny Al-Gaaf
10:02 PM Revision d1e0fc64 (ceph): kv_flat_btree_async.cc: fix resource leak
Call AioCompletion::release() if the completion is no longer
needed to free the resources.
CID 727976 Resource leak ...
Danny Al-Gaaf
10:02 PM Revision c006151c (ceph): ceph-monstore-tool.cc: check if open() was successful
Should fix: "fd" is passed to a parameter that cannot be negative.
CID 1019566 Improper use of negative value (NEGAT...
Danny Al-Gaaf
10:02 PM Revision 437d69ef (ceph): mds/CDir.cc: fix possible dereference after NULL check
CID 1019553 Dereference after null check (FORWARD_NULL, CWE-476)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
10:02 PM Revision 4908079c (ceph): rados_sync.cc: remove dead and not needed code
The first if handles all chars < 32, the last 2 if's check for
'\n' (10) and '\r' (13). This code will never be reach...
Danny Al-Gaaf
10:02 PM Revision 5babc816 (ceph): rbd.cc: fix error handling
Fix undead code. Get error code from write_fd() before check
the result against < 0.
CID 1019550 Logically dead code...
Danny Al-Gaaf
10:02 PM Revision b097f656 (ceph): rgw/rgw_rest.cc: remove dead and unneeded code
Since origin and meth are already checked to be true there is
no need to check again in s->cio->print() after the ini...
Danny Al-Gaaf
10:02 PM Revision f56cb984 (ceph): osd/OSD.h: add missing unlock of osd_lock
CID 1019560 Missing unlock (CWE-667)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
10:02 PM Revision 5392e970 (ceph): mds/Server.cc: remove dead default in switch
The default switch can't get executed since the switch value
can only have the two values already checked.
CID 71689...
Danny Al-Gaaf
10:02 PM Revision d258df4d (ceph): mds/MDCache.cc: add NULL pointer check
Check for result of get_inode() for NULL before use the pointer.
716990 Dereference null return value (CWE-476)
Sig...
Danny Al-Gaaf
10:02 PM Revision 447f3186 (ceph): mds/MDCache.cc: fix dereference NULL pointer
Fix possible NULL pointer dereference. Change return value of
CInode::get_dirfrag() to return NULL instead of 0 since...
Danny Al-Gaaf
10:02 PM Revision b9fbc821 (ceph): client/SyntheticClient.cc: fix memory leak
Fix memory leak in read_random: call delete[] on buf before
call new[] again in the for-loop.
CID 717070 Resource le...
Danny Al-Gaaf
10:02 PM Revision 6e241b97 (ceph): ObjectStore.cc: add missing break in switch
Fix switch handling for case OP_SPLIT_COLLECTION2, add break after
the case to prevent fall through into default case...
Danny Al-Gaaf
10:02 PM Revision d9c5b5b7 (ceph): client/SyntheticClient.cc: add missing break in switch
Fix switch handling for case SYNCLIENT_MODE_OVERLOAD_OSD_0, add break
after the case to prevent fall through into nex...
Danny Al-Gaaf
10:02 PM Revision 8d614665 (ceph): rgw/rgw_user.cc: add missing break in switch
Fix switch handling for case KEY_TYPE_SWIFT, add break after the
case to prevent fall through into KEY_TYPE_S3 case.
...
Danny Al-Gaaf
10:02 PM Revision 0c70e446 (ceph): libcephfs/test.cc: add assert for result of ceph_getxattr()
Check result of ceph_getxattr() before pass it as parameter to
strncmp(). Make sure it's not negative.
CID 739411 Ar...
Danny Al-Gaaf
10:02 PM Revision 077cdb04 (ceph): test/omap_bench.cc: remove dead code
CID 716900 Logically dead code (CWE-561)
CID 716901 Logically dead code (CWE-561)
CID 727968 Logically dead code (CWE...
Danny Al-Gaaf
09:42 PM Revision 92db7a01 (ceph): rgw: metadata handler for bucket set_attr operations
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
09:39 PM Revision 27fb38bb (ceph): doc: Fixed typos. Somehow got a merge error.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:30 PM Revision 65768a68 (ceph): mds: be more explicit about path_traverse completion
Coverity turned up
CID 717085 (#1 of 1): Resource leak (RESOURCE_LEAK)
10. leaked_storage: Variable "c" going out of...
Sage Weil
08:53 PM Revision 8eaa15f2 (ceph): doc: fix broken link to ceph-deploy in release notes
Signed-off-by: Ross Turk <ross@inktank.com> Ross Turk
08:43 PM Bug #4967: Misbehaving OSD sets over half of the cluster as down despite "osd min down reporters ...

You need "osd min down reporters = 14" in the [mon] section not the [osd] section of ceph.conf.
David Zafman
08:23 PM Bug #5024: mon_debug_dump_transactions should default to False
I pushed a fix to the cuttlefish branch to get the gitbuilders chewing on it. Dan Mick
08:07 PM Bug #5024 (Resolved): mon_debug_dump_transactions should default to False
797089ef082b99910eebfd9454c03d1f027c93bb added mon_debug_dump_transactions to dump to a
separate log file (mon_debu...
Dan Mick
08:21 PM CephFS Bug #4965: libcephfs-java test failure
Commit a095075fe4dcdac817895dac316100e733ab4698 has a patch that I believe fixes this issue. If it resolves things in... Anonymous
07:00 PM Revision 537386d9 (ceph): Throttle: add a simpler throttle that just blocks above a threshold
This is convenient to use to turn synchronous calls into asynchronous
calls with a limited number of operations in fl...
Josh Durgin
07:00 PM Revision 40956410 (ceph): librbd: delete more than one object at once
Speed up deletions when resizing down or removing an image by keeping
up 10 operations in flight by default.
Refs: #...
Josh Durgin
07:00 PM Revision 3b2c5fb8 (ceph): librados: add selfmanaged_snap_rollback as an ObjectOperation
This allows it to be done asynchronously, or in conjunction with
other operations.
Signed-off-by: Josh Durgin <josh....
Josh Durgin
06:30 PM Revision 261aaba1 (ceph): doc: Added entry for the RGW Admin Ops API.
fixes: #5002
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
06:06 PM rgw Bug #4975 (Resolved): nightlies: swift test failed after upgrading from bobtail to next
Sage Weil
09:46 AM rgw Bug #4975: nightlies: swift test failed after upgrading from bobtail to next
I want to start moving us towards zero-tolerance for nightly failures.
If this is a bug in the test suite, we shou...
Anonymous
06:06 PM rgw Bug #4957 (Resolved): 400 Bad Request after bucket rm ... on radosgw-admin test
commit:1b0f241dcb7f4a03c3c5095464e740e5dd75831a in teuthology.git reverts the bad test. Sage Weil
09:51 AM rgw Bug #4957: 400 Bad Request after bucket rm ... on radosgw-admin test
Made this urgent because I want to start driving us towards zero-tolerance for nightly failures. If the problem is i... Anonymous
05:31 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
... Tamilarasi muthamizhan
05:31 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
log: ubuntu@teuthology:/a/teuthology-2013-05-10_01:00:36-fs-master-testing-basic/10442... Tamilarasi muthamizhan
05:26 PM Revision 49d22aa6 (ceph): Merge pull request #271 from dalgaaf/wip-da-sca-cppcheck-v2.1
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
05:23 PM CephFS Bug #5021 (Resolved): ceph-fuse: crash on traceless reply
logs: ubuntu@teuthology:/a/teuthology-2013-05-10_01:00:36-fs-master-testing-basic/10448... Tamilarasi muthamizhan
05:21 PM rbd Bug #4959: xfstest 17 failure
Whoops, meant to update this this morning. There's only one set of tests running, so no concurrency. Josh Durgin
11:29 AM rbd Bug #4959: xfstest 17 failure
That's a really weird output.
The "*** test 3" (and 3 and 4) are shown at the top of a
loop that:
- runs fsstres...
Alex Elder
05:19 PM Revision ad2990c4 (ceph): include/addr_parsing.c: reduce scope of port_str in safe_cat()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision cd0f461f (ceph): include/ceph_hash.cc: reduce scope of a var in ceph_str_hash_rjenkins()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision 54e7a6ff (ceph): libcephfs_jni.cc: reduce scope of ret variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision 45ffb36b (ceph): ceph-filestore-dump.cc: use empty() instead of size()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision cdfc4a7e (ceph): rgw/rgw_op.cc: use empty() instead of size()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision e7d11140 (ceph): cls/rbd/cls_rbd.cc: reduce scope of variable rc
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision b66b8ddf (ceph): common/admin_socket.cc: remove scope of ret variable in do_accept()
Reduce scope of ret variable and remove usage in one case.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:19 PM Revision 6256d3ed (ceph): common/ceph_argparse.cc: remove scope of some variables
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision 8c97e77e (ceph): common/obj_bencher.cc: reduce scope of avg_bandwidth
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision e064e67c (ceph): common/safe_io.c: reduce scope of some ssize_t variables
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision 73a10ac9 (ceph): crush/builder.c: reduce scope of oldsize in crush_add_bucket()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:19 PM Revision cc3376cf (ceph): global/global_init.cc: reduce scope of ret in global_init_daemonize()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
05:18 PM Revision 1b0f241d (ceph): Revert "radosgw-admin: Test bucket list for bucket starting with unders...
This reverts commit fa70eb8f67371568f47ae237606be63024164214.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:14 PM Feature #4273 (Resolved): osd: prioritize recovery for degraded pgs
commit:ea0e0c7e34f6b99ecb123d0ee761f653e7e6bc04 Sage Weil
05:13 PM rbd Feature #2256: rbd: parallelize deletions
commit:b5e9b56fc93dd4896c802aff1096430b523ad84c Sage Weil
05:13 PM rbd Feature #2256 (Resolved): rbd: parallelize deletions
Sage Weil
04:58 PM CephFS Bug #4832: mds: failed auth_unpin assert
ubuntu@teuthology:/a/teuthology-2013-05-10_01:00:06-rados-master-testing-basic/10278... Tamilarasi muthamizhan
04:53 PM CephFS Feature #3243 (Resolved): qa: test samba reexport via libcephfs vfs plugin in teuthology
Sage Weil
04:52 PM Bug #4976: osd powercycle triggers object corruption on xfs
... Tamilarasi muthamizhan
04:51 PM Bug #4976: osd powercycle triggers object corruption on xfs
ubuntu@teuthology:/a/teuthology-2013-05-10_01:00:06-rados-master-testing-basic/10288 Tamilarasi muthamizhan
04:52 PM Fix #4840: mon: transition from old-style allow command to new command descriptions
the underlying bits are all now in place in wip-mon-cap.
i think all that's needed is a translate of old strings t...
Sage Weil
04:50 PM Fix #4567 (Fix Under Review): mon: refactor mon caps; allow restriction of key/value storage by p...
Sage Weil
04:48 PM Bug #5020 (Resolved): osd: 2.5 deep-scrub stat mismatch, got 710/665 objects, 0/0 clones, 2908160...
ubuntu@teuthology:/a/teuthology-2013-05-10_01:00:06-rados-master-testing-basic/10305... Tamilarasi muthamizhan
04:37 PM Revision 723062bb (ceph): doc: Updated usage syntax. Added links to hardware and manual OSD remove.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:04 PM devops Feature #5019 (Resolved): arm: gitbuilder for ARM
Ian Colle
03:03 PM devops Feature #5018 (Resolved): arm: ceph-deploy: push packages to ARM
Ian Colle
03:00 PM devops Feature #5016: ceph-deploy: gitbuilders for release packages
Sage Weil
02:57 PM devops Feature #5016 (Resolved): ceph-deploy: gitbuilders for release packages
Sage Weil
03:00 PM devops Feature #5015: ceph-deploy: push packages to all ceph repos
Sage Weil
02:57 PM devops Feature #5015 (Resolved): ceph-deploy: push packages to all ceph repos
Sage Weil
02:56 PM devops Feature #5014 (Resolved): arm: Build ARM packages
Anonymous
02:55 PM devops Feature #5013 (Rejected): build internal openstack + ceph cluster out of some burnupi
Sage Weil
02:45 PM Bug #5010 (Resolved): doc: Trasitioning to ceph-deploy doc missing
Sorry for the delay. Someone caused a merge conflict. The link syntax was wrong, and there was a reference in two pla... John Wilkins
01:49 PM Bug #5010 (Resolved): doc: Trasitioning to ceph-deploy doc missing
http://ceph.com/docs/master/release-notes/#v0-61-cuttlefish has a link called "Transitioning to ceph-deploy" that res... Ian Colle
01:36 PM rgw Feature #3366: rgw: dr: define management api
Extend mgt API to implement new zone commands Ian Colle
01:32 PM rgw Feature #3366: rgw: dr: define management api
Extend the management API to allow it execute commands related to establishing multi-site or DR Neil Levine
01:31 PM rgw Feature #3366: rgw: dr: define management api
Neil Levine
01:31 PM rgw Feature #3366 (In Progress): rgw: dr: define management api
Neil Levine
01:31 PM rgw Feature #3366: rgw: dr: define management api
Neil Levine
01:30 PM rgw Feature #4309: rgw: multisite: metadata objects versioning
Ian Colle
01:30 PM rgw Feature #4312: rgw: multisite: log metadata changes
Ian Colle
01:28 PM rgw Feature #4331: rgw: multisite: metadata-changes log: create internal API
Sage Weil
01:28 PM rgw Feature #4311: rgw: dr: radosgw changes: internal bucket changes tracker
Sage Weil
01:28 PM rgw Feature #4347: rgw: dr: bucket index objclass: fetch changes log
Sage Weil
01:28 PM rgw Feature #4346: rgw: dr: bucket index objclass: changes
Sage Weil
01:28 PM rgw Feature #4332: rgw: multisite: metadata-changes log: tie into metadata update operations
Sage Weil
01:25 PM rbd Documentation #5009 (Resolved): doc: explain how to get qemu packages for each distro
Once #4550 and #4834 are done, we'll have qemu packages with async flush for several distros. This is pretty importan... Josh Durgin
01:20 PM rbd Feature #5003: cinder/nova: don't require ceph.conf on a compute host / support multiple clusters
The nova blueprint is: https://blueprints.launchpad.net/nova/+spec/better-libvirt-network-volume-support Josh Durgin
12:04 PM rbd Feature #5003 (Rejected): cinder/nova: don't require ceph.conf on a compute host / support multip...
https://bugs.launchpad.net/cinder/+bug/1077817 Josh Durgin
01:13 PM rgw Documentation #3217 (Closed): rgw: document RESTful usage api
Landed via da271f7f78fee9e349860695b1913210a44018cc Ian Colle
01:11 PM rgw Feature #5008 (In Progress): rgw: bucket metadata changes should be reflected in mdlog
Ian Colle
01:11 PM rgw Feature #5008 (Resolved): rgw: bucket metadata changes should be reflected in mdlog
Yehuda Sadeh
01:11 PM RADOS Feature #5007 (New): librados: expose snap_set_diff interfaces?
Just looking at source organization, I see snap_set_diff.cc in librados, but it's only
linked into librbd; perhaps i...
Dan Mick
01:09 PM rgw Feature #4334 (In Progress): rgw: dr: bucket index log API: implement RESTful API
Ian Colle
01:09 PM rgw Feature #4333 (In Progress): rgw: multisite: metadata-changes log: implement RESTful API
Ian Colle
01:09 PM rgw Feature #4329 (In Progress): rgw: dr: updated buckets log: RESTful API
Ian Colle
01:08 PM rgw Feature #4330 (Fix Under Review): rgw: dr: updated buckets log: radosgw-admin changes
Please review wip-rgw-geo Ian Colle
01:07 PM rgw Feature #4328 (Fix Under Review): rgw: dr: updated buckets log: tie into internal bucket changes ...
Please review wip-rgw-geo Ian Colle
01:07 PM rgw Feature #4327 (Fix Under Review): rgw: dr: updated buckets log: create internal API
Please review wip-rgw-geo Ian Colle
01:05 PM rgw Feature #4745: rgw: radosgw-admin command to stat object
wip-rgw-geo Ian Colle
01:05 PM rgw Feature #4745: rgw: radosgw-admin command to stat object
Greg, can you please review this? Ian Colle
12:29 PM rbd Documentation #5006 (Resolved): doc: openstack configuration changes for havana
In Havana, the cinder and nova configuration should be a bit simpler (see #5003, #5004, and #5005). Update (or add a ... Josh Durgin
12:28 PM Bug #4813: pgs stuck creating
this happened again on latest master: ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-05-10_01:00:06-r... Sage Weil
12:14 PM rbd Feature #5004: cinder: make rbd configuration easier to use
Moved the librbd part to #5005. Josh Durgin
12:08 PM rbd Feature #5004 (Resolved): cinder: make rbd configuration easier to use
Get the user out of a config option instead of an environment variable, and switch to librbd from cli rbd so
the opt...
Josh Durgin
12:14 PM rbd Feature #5005 (Resolved): cinder: switch rbd driver to use librbd instead of the cli tool
This will give us better error handling and easier backwards compatibility. It will also make it easer to match the c... Josh Durgin
11:35 AM Bug #5002 (Resolved): doc: API Page missing link to RGW User Admin API
Added link: http://ceph.com/docs/master/api/ John Wilkins
11:10 AM Bug #5002 (Resolved): doc: API Page missing link to RGW User Admin API
http://ceph.com/docs/master/api/ should have an entry for the new RGW User Admin API located at http://ceph.com/docs/... Ian Colle
11:32 AM rbd Feature #3064 (In Progress): librbd: A way to read from nearby replicas
Ian Colle
10:53 AM rbd Feature #3763: krbd: handle flattening of mapped image
I found that in the case that an existence check callback
(for a write) if the image had been flattened, I was
resu...
Alex Elder
06:27 AM rbd Feature #3763: krbd: handle flattening of mapped image
I have evidence that handling a flatten of an image
works correctly when a read parent is underway, as
well as when...
Alex Elder
10:35 AM rbd Feature #4804: tgt: switch to aio
Neil Levine
10:35 AM rbd Feature #4917: iSCSI: Package tgt
Neil Levine
10:19 AM rgw Bug #4958: rgw test failing when trying to delete a bucket
I reverted the relevant teuthology test. Yehuda Sadeh
10:07 AM Bug #4999: monitor sync failure
OK, here's what happens when I try to restart:... Jim Schutt
09:37 AM Bug #4999: monitor sync failure
Or, I guess, maybe my browser was getting in the way of a big cut-n-paste?
Anyway, it all seems to be there now.
Jim Schutt
09:33 AM Bug #4999: monitor sync failure
Evidently it's Redmine's issue. Here's the rest, really!... Jim Schutt
09:31 AM Bug #4999: monitor sync failure
Oops, sorry; too-small cut-n-paste buffer clipped off the trace.
I'll try the restart here in a bit -- I had to sh...
Jim Schutt
09:21 AM Bug #4999: monitor sync failure
Hi Jim!
Can you generate a 'debug ms = 1' 'debug mon = 20' 'debug paxos = 20' log for the restart after the crash?...
Sage Weil
09:16 AM Bug #4999 (Can't reproduce): monitor sync failure

I've been testing v0.61 and v0.61.1 with 262,144 PGs in
a single pool, 576 OSDs, 3 mon, 3 MDS (1 active + 2 standb...
Jim Schutt
09:25 AM devops Bug #4916 (In Progress): ceph-deploy: mon create fails on bobtail branch in centos 6.3
Anonymous
08:53 AM devops Feature #4998 (Resolved): ceph-deploy should allow user specified installation sources
ceph-deploy currently uses the package repos at ceph.com or gitbuilder.ceph.com for installation sources, and additio... Anonymous
08:43 AM Bug #4974 (In Progress): nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Forgot to update status Joao Eduardo Luis
08:43 AM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Matthew provided a store for me to play with.
Doing some analysis, it looks like the issue is confined to the mdsm...
Joao Eduardo Luis
05:17 AM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
We lost the logs for these runs after being nuked. I left a teuthology job running overnight to reproduce it, but fo... Joao Eduardo Luis
08:19 AM rgw Bug #4997 (Resolved): Seg Fault on rgw 0.61.1 with cluster in 0.61
Hi,
I've tried to update the rgw to 0.61.1 and I had a segfault while connecting to the 0.61 cluster.
I have anot...
Yann ROBIN
07:16 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
PS This can differ in details from natural failure: I copy another copy of problem files, so CAN (or not), for exampl... Denis kaganovich
06:59 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
OK.
I have xfs on OSDs and reiserfs on system & monitors. Now I have more this objects. I understand next: IMHO ther...
Denis kaganovich
05:37 AM Revision fd901056 (ceph): Merge branch 'wip_4955' into next
Reviewed-by: Sage Weil <sage@inktank.com> Samuel Just
05:24 AM Revision b353da6f (ceph): Merge branch 'wip_pg_res'
Reviewed-by: Sage Weil <sage@inktank.com> Samuel Just
05:23 AM Revision 01a07c1e (ceph): OSD: rename clear_temp to recursive_remove_collection()
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:23 AM Revision f5a60ca2 (ceph): osd: remove_dir use collection_list_partial
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:08 AM Revision 8b3cd6e7 (ceph): rgw: don't handle ECANCELLED anymore
Simplify, remove obsolete logic.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
01:21 AM Revision d7fe5c0a (ceph): nuke: don't require noipmi in ctx
This is called from run.py too, which won't have ctx.noipmi.
The default of using impmi is fine for now for run.py.
...
Josh Durgin
12:30 AM Revision 7a8d6fd4 (ceph): PG,OSD: delay ops for map prior to queueing in the OpWQ
Previously, we simply queued ops in the OpWQ without checking. The PG
would then check in do_request whether the mes...
Samuel Just
12:28 AM Revision b274c8a0 (ceph): common/sharedptr_registry.hpp: add remove
remove() can be used to clear an entry before all of its
references are removed.
Signed-off-by: Samuel Just <sam.jus...
Samuel Just
12:28 AM Revision 90f50c48 (ceph): OSD: add pg deletion cancelation
DeletingState now allows _create_lock_pg() to attempt to cancel
pg deletion.
PG::init() must mark the PG as backfill...
Samuel Just
12:28 AM Revision 0ef9b1e0 (ceph): osd_internals/pg_removal.rst: update for pg resurrection
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:28 AM Revision d3dd99b7 (ceph): PG: no need to wait on DeletingStateRef for flush
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just

05/09/2013

10:46 PM Fix #3884 (Resolved): osd: resurrect partially deleted PGs
b353da6f682d223ba14812da0fe814eca72ad6f5 Samuel Just
01:06 PM Fix #3884 (Fix Under Review): osd: resurrect partially deleted PGs
wip-pg-res Ian Colle
10:45 PM Bug #4955 (Resolved): osd/ReplicatedPG.cc: 1078: FAILED assert(0 == "out of order op")
fd901056831586e8135e28c8f4ba9c2ec44dfcf6 Samuel Just
04:52 PM Bug #4955: osd/ReplicatedPG.cc: 1078: FAILED assert(0 == "out of order op")
wip_4955 Samuel Just
09:13 PM Revision 0557e6c1 (ceph): rgw: bucket metadata operations go through metadata handler
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
08:43 PM Bug #4967: Misbehaving OSD sets over half of the cluster as down despite "osd min down reporters ...
I didn't do this and in retrospect it makes a lot of sense. However, I had my mons restarted because of the 0.56.6 up... Faidon Liambotis
04:02 PM Bug #4967: Misbehaving OSD sets over half of the cluster as down despite "osd min down reporters ...
Item #2 the following command should have worked. Telling MONs instead of OSDs to require 14 reporters.
ceph mon ...
David Zafman
02:43 AM Bug #4967 (Resolved): Misbehaving OSD sets over half of the cluster as down despite "osd min down...
My Ceph 0.56.4 (mons @ 0.56.6) cluster had a few misbehaving OSDs yesterday, which escalated into a full blown outage... Faidon Liambotis
07:49 PM Revision c55c6abb (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
07:48 PM Revision 270ca623 (ceph): doc: Updated doc for connectivity. Updated text with glossary terms.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:48 PM Revision e4173123 (ceph): doc: Updated disk syntax. Updated text with glossary terms.
fixes: #4933
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
07:47 PM Revision c1914616 (ceph): Merge pull request #267 from ceph/wip-coverity
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Sage Weil
07:46 PM Revision af919287 (ceph): doc: Added connectivity section. Updated doc with glossary terms.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:22 PM Revision f1b13a17 (ceph): doc: Added the non-implemented bit for the gateway to the dev/radosgw TOC.
fixes: #4978
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
07:20 PM Revision 69b64826 (ceph): OSD: don't rename pg collections, handle PGs in RemoveWQ
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
07:09 PM Revision 827fd108 (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
07:08 PM Revision fe164e44 (ceph): doc: Republishing the admin operations API for the gateway.
fixes: #4978
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
07:07 PM Revision d4732e85 (ceph): doc: Republishing the admin operations API for the gateway.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:00 PM Revision 09163a3b (ceph): Add priority option to AsyncReserver class
Add priority to request_reservation()
Change to map of lists by prioriry
Add priority to queue_pointers mappped type
...
David Zafman
07:00 PM Revision 00e90316 (ceph): osd: prioritize recovery for degraded pgs
Three Reservation priorities from RECOVERY, BACKFILL_HIGH, BACKFILL_LOW
fixes: #4273
Signed-off-by: David Zafman <d...
David Zafman
07:00 PM Revision df049c1c (ceph): AsyncReserver: Remove assert in set_max() for max > 0
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:15 PM Revision da271f7f (ceph): doc: Document admin api web interface.
Signed-off-by: caleb miles <caleb.miles@inktank.com> caleb miles
06:07 PM Revision 619a68a3 (ceph): armor: don't break lines by default
Added a new function that breaks the lines, but by default
don't do it.
Signed-off-by: Yehuda Sadeh <yehuda@inktank....
Yehuda Sadeh
06:07 PM Revision 770d94d3 (ceph): rgw: implement metadata hander for buckets data
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:49 PM Revision 4a90af8d (ceph): ceph-filestore-dump: fix uninit fields in ctor
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:49 PM Revision 2bc08830 (ceph): librbd: fix possible use-after-free
(of the pointer)
CID 966634 (#1 of 1): Use after free (USE_AFTER_FREE)
2. use_after_free: Using freed pointer "ictx"...
Sage Weil
05:49 PM Revision 72b5629a (ceph): rbd: fix buffer leak in do_import
CID 1019580 (#2 of 2): Resource leak (RESOURCE_LEAK)
10. leaked_storage: Variable "p" going out of scope leaks the st...
Sage Weil
05:49 PM Revision e30a0321 (ceph): osd: init test_ops_hook
CID 1019628 (#1 of 1): Uninitialized pointer field (UNINIT_CTOR)
2. uninit_member: Non-static class member "test_ops_...
Sage Weil
05:49 PM Revision 499edd8b (ceph): osd: initialize OSDService::next_notif_id
CID 1019627 (#1 of 1): Uninitialized scalar field (UNINIT_CTOR)
2. uninit_member: Non-static class member "next_notif...
Sage Weil
05:48 PM Revision 86327076 (ceph): mon: fix Formatter leak
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:48 PM Revision 4087e425 (ceph): os/Filestore: fix fd leak in error path
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:48 PM Revision 3dc7c329 (ceph): rados: fix buffer leak
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:48 PM Revision 110a823f (ceph): rados: fix fd leak
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:48 PM Revision ad073c2b (ceph): radosgw-admin: fix fd leak in read_input()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:48 PM Revision 99958e20 (ceph): rgw: fix various uninit class fields
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:48 PM Revision 76b90240 (ceph): mds: fix fd leak
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:35 PM Feature #4986: create and automate scenario based testing
This includes,
creating a test setup, manual testing, test plan and automation.
Tamilarasi muthamizhan
05:34 PM Feature #4986 (New): create and automate scenario based testing
create and automate scenario based testing for openstack Tamilarasi muthamizhan
05:04 PM Revision 5433462d (ceph): doc/release-notes: v0.61.1 release notes
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:05 PM devops Bug #4984 (Resolved): ceph_deploy: osd create succeeds with an error message (partprobe returns e...
this is not consistent though, recently hit this on rhel 6.3,
tamil@ubuntu:~/ceph-deploy-latest/rhel1/ceph-deploy$...
Tamilarasi muthamizhan
04:04 PM CephFS Bug #4965 (In Progress): libcephfs-java test failure
Anonymous
02:44 PM CephFS Bug #4965: libcephfs-java test failure
Using the YAML file posted above, this test is passing for me. I ran it 4 times on 2 different sets of plana nodes an... Anonymous
01:53 PM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
Hi, I also encountered this error after updating from bobtail to cuttlefish .61.1 .
I started by restarting all th...
Matthew Via
09:51 AM Bug #4974: nightlies: failed assert at Monitor::StoreConverter::_convert_machines
also,
logs: ubuntu@teuthology:/a/teuthology-2013-05-09_01:30:04-upgrade-master-testing-basic/9574
ubuntu@teuth...
Tamilarasi muthamizhan
09:48 AM Bug #4974 (Resolved): nightlies: failed assert at Monitor::StoreConverter::_convert_machines
this seems to happen after upgrading from bobtail to next branch, when the daemons are restarted. all the monitors se... Tamilarasi muthamizhan
01:25 PM Feature #4983 (Resolved): OSD: namespaces pt 2 (caps)
- Cap syntax should include namespace
- OSD must verify namespace portion of cap
Ian Colle
01:24 PM Feature #4982 (Resolved): OSD: namespaces pt 1 (librados/osd, not caps)
- Namespace defaults to ""
- aio_operate and operate gain a namespace argument in librados
- MOSDOp needs to includ...
Ian Colle
01:16 PM Feature #2309 (Duplicate): rados namespaces
Ian Colle
01:16 PM Tasks #4980 (Duplicate): OSD: namespaces pt 2 (caps)
Ian Colle
12:43 PM Tasks #4980 (Duplicate): OSD: namespaces pt 2 (caps)
- Cap syntax should include namespace
- OSD must verify namespace portion of cap
Samuel Just
01:16 PM Tasks #4979 (Duplicate): OSD: namespaces pt 1 (librados/osd, not caps)
Ian Colle
12:42 PM Tasks #4979 (Duplicate): OSD: namespaces pt 1 (librados/osd, not caps)
- Namespace defaults to ""
- aio_operate and operate gain a namespace argument in librados
- MOSDOp needs to includ...
Samuel Just
12:47 PM Documentation #4933 (In Progress): ceph-deploy. Partition usage should be disk usage.
John Wilkins
12:25 PM rgw Bug #4978 (Resolved): doc: new rgw user admin documentation is hidden
http://ceph.com/docs/master/radosgw/ has an Admin Ops API TOC entry. Page appears here: http://ceph.com/docs/master... John Wilkins
11:43 AM rgw Bug #4978 (In Progress): doc: new rgw user admin documentation is hidden
John Wilkins
11:35 AM rgw Bug #4978 (Resolved): doc: new rgw user admin documentation is hidden
The documentation for the new RGW user admin API can be manually found at http://ceph.com/docs/master/dev/radosgw/adm... Ian Colle
11:57 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
Can you reproduce with
debug osd = 20
debug filestore = 20
debug ms = 1
and post the logs?
It appears that...
Samuel Just
11:53 AM RADOS Feature #4930: OSD: replica to primary back pressure
This can probably be done in a few steps:
1 Add OSDService::estimate_delay() which returns an estimate of latency fo...
Samuel Just
11:44 AM Bug #4960 (Rejected): Config value osd_max_backfills shouldn't be allowed to be set to 0
David Zafman
11:04 AM rgw Feature #4716 (Resolved): rgw: ability to restrict user to specific operations
done, commit:3846451548e1161e721cfcca9bc6732c5109df69 Yehuda Sadeh
11:03 AM Subtask #4973 (Duplicate): nightlies: Failed assert at MonitorStore::write_bl_ss in bobtail
Tamilarasi muthamizhan
09:38 AM Subtask #4973 (Duplicate): nightlies: Failed assert at MonitorStore::write_bl_ss in bobtail
logs: ubuntu@teuthology:/a/teuthology-2013-05-09_01:30:04-upgrade-master-testing-basic/9552... Tamilarasi muthamizhan
10:36 AM Bug #4976: osd powercycle triggers object corruption on xfs
Looks like there were 4 inconsistent objects in pg 3.0
2013-05-09 01:37:39.771684 7fdc077f2700 0 log [INF] : 2.6 ...
Samuel Just
10:14 AM Bug #4976 (Resolved): osd powercycle triggers object corruption on xfs
logs: ubuntu@teuthology:/a/teuthology-2013-05-09_01:00:05-rados-next-testing-basic/9325... Tamilarasi muthamizhan
10:28 AM Bug #4895 (Need More Info): leveldb: mon workload makes store.db grow without bound
Sage Weil
09:54 AM rgw Bug #4975 (Resolved): nightlies: swift test failed after upgrading from bobtail to next
logs: ubuntu@teuthology:/a/teuthology-2013-05-09_01:30:04-upgrade-master-testing-basic/9571
ubuntu@teuthology:/a/t...
Tamilarasi muthamizhan
09:33 AM Bug #4949 (Resolved): osd: assertion on CEPH_OSD_OP_OMAPGETVALS
commit:36ec6f9bce63641f4fc2e4ab04d03d3ec1638ea0 Sage Weil
07:27 AM rbd Bug #4661: xfstest 139 hung
I spent a *little* time looking at the crash I've got, and
I'm getting fairly convinced this is an XFS problem. It
...
Alex Elder
06:27 AM rbd Feature #3763: krbd: handle flattening of mapped image
This work is mostly done, but I need to put it through some
more thorough tests before I'll post it for review. If...
Alex Elder
02:47 AM Revision e5b2ca88 (ceph): PG: rename must_delay_request to op_must_wait_for_map, make static
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:45 AM Revision 5177fcb6 (ceph): Merge remote-tracking branch 'gh/next'
Conflicts:
src/mon/MonitorDBStore.h
Sage Weil
12:23 AM Revision 56c4847b (ceph): v0.61.1
Gary Lowell

05/08/2013

11:55 PM Revision 6c1e4791 (ceph): mon: dump MonitorDBStore transactions to file
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 797089ef082b99910eebfd9454c03d1f027c93bb)
Samuel Just
11:55 PM Revision 5a631b85 (ceph): osd: optionally enable leveldb logging
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 0b4c5c1a3349670d11cc3c4fb3c4b3c1a80b2502)
Sage Weil
11:55 PM Revision bb4f65ae (ceph): mon: allow leveldb logging
'mon leveldb log = filename'
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit c1d5f815546b731e...
Sage Weil
11:53 PM Revision 3b94f03e (ceph): mon: dump MonitorDBStore transactions to file
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 797089ef082b99910eebfd9454c03d1f027c93bb)
Samuel Just
11:52 PM Revision 9143d6d0 (ceph): osd: optionally enable leveldb logging
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 0b4c5c1a3349670d11cc3c4fb3c4b3c1a80b2502)
Sage Weil
11:52 PM Revision 8f456e89 (ceph): mon: allow leveldb logging
'mon leveldb log = filename'
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit c1d5f815546b731e...
Sage Weil
11:42 PM Revision a284c9ec (ceph): common/Preforker: fix warnings
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:36 PM Revision 306ebc6a (ceph): debian/control: squeeze requres cryptsetup package
Squeeze requires the cryptsetup package which has been renamed
cryptsetup-bin in later versions. Allow either packag...
Gary Lowell
11:36 PM Revision 3ebddf17 (ceph): debian/control: squeeze requres cryptsetup package
Squeeze requires the cryptsetup package which has been renamed
cryptsetup-bin in later versions. Allow either packag...
Gary Lowell
11:33 PM Revision 83bbae41 (ceph): debian/control: squeeze requres cryptsetup package
Squeeze requires the cryptsetup package which has been renamed
cryptsetup-bin in later versions. Allow either packag...
Gary Lowell
11:26 PM Revision 46c3e48e (ceph): ceph_json: dump timestamp in utc
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
11:26 PM Revision 551571ca (ceph): rgw: datalog trim
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
11:05 PM Revision 5caa2bde (ceph): default project to ceph and extra_pkgs to none
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
10:50 PM Bug #4963 (Rejected): osd/SnapMapper.cc: 299: FAILED assert(check(oid))
<facepalm> good catch, thanks. i reused a teuth job name and had old runs mixed in with the new. Sage Weil
09:22 PM Bug #4963: osd/SnapMapper.cc: 299: FAILED assert(check(oid))
"2013-04-25" but opened today? Ian Colle
05:40 PM Bug #4963 (Rejected): osd/SnapMapper.cc: 299: FAILED assert(check(oid))
... Sage Weil
10:47 PM Revision 52666dc5 (ceph): PG: reassert_lock_with_map_lock_held() is dead
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:46 PM Revision 17705d72 (ceph): OSD,PG: lock_with_map_lock_held() is the same as lock()
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
09:56 PM Revision e0c0a5c1 (ceph): osd: don't assert if get_omap_iterator() returns NULL
Fixes: #4949
This can happen if the object does not exist and it's
a write operation. Just return -ENOENT.
Signed-of...
Yehuda Sadeh
09:56 PM Revision 82b92995 (ceph): ceph-create-keys: gracefully handle no data from admin socket
Old ceph-mon (prior to 393c9372f82ef37fc6497dd46fc453507a463d42) would
return an empty string and success if the comm...
Sage Weil
09:54 PM Revision e2528ae4 (ceph): ceph-create-keys: gracefully handle no data from admin socket
Old ceph-mon (prior to 393c9372f82ef37fc6497dd46fc453507a463d42) would
return an empty string and success if the comm...
Sage Weil
09:36 PM Revision ee3da880 (ceph): init-ceph: fix osd_data location when checking df utilization
Do not assume default osd data location.
Fixes: #4951
Backport: cuttlefish, bobtail
Signed-off-by: Sage Weil <sage@i...
Sage Weil
09:35 PM Revision f2a54cc9 (ceph): init-ceph: fix osd_data location when checking df utilization
Do not assume default osd data location.
Fixes: #4951
Backport: cuttlefish, bobtail
Signed-off-by: Sage Weil <sage@i...
Sage Weil
09:22 PM Revision ea809f7d (ceph): rgw: bucket index log trim
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
08:57 PM CephFS Bug #4965 (Resolved): libcephfs-java test failure
... Sage Weil
07:20 PM Revision 546ed917 (ceph): osd: don't assert if get_omap_iterator() returns NULL
Fixes: #4949
This can happen if the object does not exist and it's
a write operation. Just return -ENOENT.
Signed-of...
Yehuda Sadeh
07:18 PM Revision 36ec6f9b (ceph): osd: don't assert if get_omap_iterator() returns NULL
Fixes: #4949
This can happen if the object does not exist and it's
a write operation. Just return -ENOENT.
Signed-of...
Yehuda Sadeh
07:18 PM Revision 76b736b3 (ceph): rgw: metadata log trim
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
07:15 PM Revision 0d8f7a09 (ceph): Merge pull request #266 from ceph/wip-4949
osd: don't assert if get_omap_iterator() returns NULL
Reviewed-by: Sam Just <sam.just@inktank.com>
athanatos
06:44 PM Revision a89d889b (ceph): Merge pull request #165 from dachary/wip-4321
unit tests for FileStore::_detect_fs when running on ext4 Sage Weil
06:35 PM Revision 38464515 (ceph): rgw: user operation mask
Fixes: #4716
add user operation mask for controlling user permissions.
Also add admin controls for it.
Signed-off-by...
Yehuda Sadeh
06:26 PM Revision cdb3d32e (ceph): Merge pull request #180 from ceph/wip-rados-clone
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
06:22 PM Revision 3385c2c5 (ceph): rgw: more data changes log implementation
Remove a bunch of hard coded stuff. Make renew thread updates
expiration. Only renew if there was more than one reque...
Yehuda Sadeh
06:22 PM Revision ef82ad7c (ceph): rgw: resend data log entry if took too long
If took too long, we want to resend. Also, fix issue with
renew.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:22 PM Revision 988dab3e (ceph): rgw: decouple bucket data pool from bucket index pool
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:22 PM Revision 171b0bf2 (ceph): rgw: data changes log, naive implementation
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:22 PM Revision 6659b2b6 (ceph): rgw: data changes log, don't always send new requests
We may piggy back on older entries that hasn't expired yet.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:22 PM Revision 5e642fae (ceph): lru_map: infrastructure for a bounded map
Useful for cache-like maps, where we want to control
the max number of entries in the map.
Signed-off-by: Yehuda Sad...
Yehuda Sadeh
06:22 PM Revision d5da1525 (ceph): rgw: use shared_ptr instead of RefCountedObject
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:22 PM Revision 39b258c2 (ceph): rgw: limit num of buckets in data changes log
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:22 PM Revision f28df17b (ceph): rgw: changed data log renew thread
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:22 PM Revision a37092f9 (ceph): RefCounteCond: keep return val, wait() returns it
It is necessary in some cases to notify waiters of the
actual return value for the action they were waiting on.
Sign...
Yehuda Sadeh
06:21 PM Bug #4938 (Resolved): rbd: fix an incorrect assertion condition
The following has been committed to the ceph-client
"testing-next" branch.
91c6feb rbd: fix an incorrect assertio...
Alex Elder
03:34 PM Bug #4938 (Fix Under Review): rbd: fix an incorrect assertion condition
The following has been posted for review. It is one of
the patches available in the "review/wip-rbd-cleanup-2"
bra...
Alex Elder
06:25 AM Bug #4938 (Resolved): rbd: fix an incorrect assertion condition
In rbd_img_obj_parent_read_full_callback() there is an assertion
intended to verify the size of the image request fo...
Alex Elder
06:20 PM Bug #4939: rbd: support reading parent page data
The following has been committed to the ceph-client
"testing-next" branch.
91c6feb rbd: fix an incorrect assertio...
Alex Elder
06:19 PM Bug #4939 (Resolved): rbd: support reading parent page data
The following has been committed to the ceph-client
"testing-next" branch.
5b2ab72 rbd: support reading parent pa...
Alex Elder
03:33 PM Bug #4939 (Fix Under Review): rbd: support reading parent page data
The following has been posted for review. It is one of
the patches available in the "review/wip-rbd-cleanup-2"
bra...
Alex Elder
06:26 AM Bug #4939 (Resolved): rbd: support reading parent page data
Currently, rbd_img_parent_read() assumes the incoming object request
contains bio data. But if a layered image is p...
Alex Elder
06:19 PM Bug #4940 (Resolved): rbd: set mapping read-only flag in rbd_add()
The following has been committed to the ceph-client
"testing-next" branch.
7ce4eef rbd: set mapping read-only fla...
Alex Elder
03:33 PM Bug #4940 (Fix Under Review): rbd: set mapping read-only flag in rbd_add()
The following has been posted for review. It is one of
the patches available in the "review/wip-rbd-cleanup-2"
bra...
Alex Elder
06:28 AM Bug #4940 (Resolved): rbd: set mapping read-only flag in rbd_add()
The rbd_dev->mapping field for a parent image is not meaningful.
Since rbd_image_probe() is used both for images bei...
Alex Elder
06:18 PM Bug #4941 (Resolved): rbd: only set up watch for mapped images
The following has been committed to the ceph-client
"testing-next" branch.
1f3ef78 rbd: only set up watch for map...
Alex Elder
03:32 PM Bug #4941 (Fix Under Review): rbd: only set up watch for mapped images
The following has been posted for review. It is one of
the patches available in the "review/wip-rbd-cleanup-2"
bra...
Alex Elder
06:31 AM Bug #4941: rbd: only set up watch for mapped images
I added this to the patch description:
In fact, a watch request is a write operation, and we may only
have read a...
Alex Elder
06:29 AM Bug #4941 (Resolved): rbd: only set up watch for mapped images
Any changes to parent images are immaterial to any mapped clone.
So there is no need to have a watch event registere...
Alex Elder
06:15 PM Revision 4848fac2 (ceph): OSD: handle stray snap collections from upgrade bug
Previously, we failed to clear snap_collections, which causes split to
spawn a bunch of snap collections. In load_pg...
Samuel Just
06:15 PM Revision dc6b9e6b (ceph): PG: clear snap_collections on upgrade
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked from commi...
Samuel Just
06:15 PM Revision b514941b (ceph): OSD: snap collections can be ignored on split
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked from commi...
Samuel Just
06:14 PM Revision 61354b21 (ceph): Merge branch 'wip_split_upgrade' into next
Fixes: #4927 Samuel Just
06:12 PM Revision 8e89db89 (ceph): OSD: handle stray snap collections from upgrade bug
Previously, we failed to clear snap_collections, which causes split to
spawn a bunch of snap collections. In load_pg...
Samuel Just
06:12 PM Revision 252d71a8 (ceph): PG: clear snap_collections on upgrade
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
06:11 PM Revision 438d9aa1 (ceph): OSD: snap collections can be ignored on split
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
06:09 PM Revision 30ffca77 (ceph): ceph: return error code when failing to get result from admin socket
Make sure we return a non-zero result code when we fail to read something
from the admin socket.
Backport: cuttlefis...
Sage Weil
06:08 PM Revision 6789b020 (ceph): Merge pull request #265 from ceph/wip-mon-trace
Sage Weil
06:08 PM Revision 1870516e (ceph): mon: set MonitorDBStore options on open
So both ctors set the options.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:06 PM Revision 2a8fa9ee (ceph): Merge pull request #254 from ceph/wip-crush-rules
Reviewed-by: Joao Luis <joao.luis@inktank.com> Sage Weil
06:05 PM Revision 393c9372 (ceph): ceph: return error code when failing to get result from admin socket
Make sure we return a non-zero result code when we fail to read something
from the admin socket.
Backport: cuttlefis...
Sage Weil
06:03 PM Revision 797089ef (ceph): mon: dump MonitorDBStore transactions to file
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:03 PM Revision aa94f5b3 (ceph): ceph-monstore-tool: add MonitorDBStore trace dumper
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:03 PM Revision dbcd738a (ceph): ceph-monstore-tool: added replay
Samuel Just
06:01 PM Revision 9e11065f (ceph): Merge pull request #261 from ceph/wip-leveldb
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
06:01 PM Revision 4b142a12 (ceph): rgw: copy_obj uses req_id as tag
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:01 PM Revision 49b3d2e0 (ceph): rgw-admin: bucket list also specifies object namespace
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:01 PM Revision 7132e6e0 (ceph): rgw-admin: add object stat command
for retrieving object metadata.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:01 PM Revision 2983d987 (ceph): Makefile.am: add missing header file
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:01 PM Revision d05d05a5 (ceph): rgw: update cli test for radosgw-admin
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:01 PM Revision abef2b24 (ceph): rgw-admin: object stat also decodes policy
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
06:01 PM Revision 0b526d95 (ceph): rgw: don't set shadow obj attr in object metadata
This is incorrect, was only right for pre-manifest objects.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:01 PM Revision 4fbf9a75 (ceph): rgw: fix get_obj() with zero sized obj
Now that even zero sized objs have manifest a
test had to be modified.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision abd0ab3e (ceph): cls_rgw, rgw: bucket index logs modifications
Add a log to the bucket index.
This commit also ties the "epoch" version that is kept
per index entry to the relevan...
Yehuda Sadeh
05:57 PM Revision d857896b (ceph): rgw: fix broken radosgw-admin user * commands
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision 5e196283 (ceph): cls_rgw: bucket index versioning
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision b1578ba7 (ceph): radosgw-admin, cls_rgw: list bucket index log
a new radosgw-admin command to list bucket index log.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision 871b4013 (ceph): radosgw-admin: bilog list gets marker and max-entries params
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision 2a16bafd (ceph): rgw: bucket index log fixes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision 7e08c57d (ceph): rgw: share object tag and index tag
object tag is now being written to the index, so that both
object and index hold the same tag. This is needed so that...
Yehuda Sadeh
05:57 PM Revision fa23b3e7 (ceph): rgw-admin: fix user_id initialization
broken due to rebase
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision 85e4ea91 (ceph): rgw-admin: fix some more merge issues
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision 3d7b8390 (ceph): rgw: call rgw_store_user_info() with objv_tracker
another rebase casualty
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision b295c649 (ceph): rgw: radosgw-admin bucket list --bucket lists bucket objects
Also lists tag for each object.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision cb6d4de4 (ceph): rgw: log user operations
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision 8444db6d (ceph): cls_log: adjust listing api
Listing api now also gets end time and a marker.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision 75ada775 (ceph): rgw: use new cls_log listing interface
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision 288645db (ceph): rgw: show metadata log through radosgw-admin
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:57 PM Revision fe63d44b (ceph): rgw: cls_log_trim boundary change
end_time is not inclusive
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:57 PM Revision 478fe5ee (ceph): rgw: metadata rm
Still needs to fix the way we remove user entries; need to take
different path, similar to put_entry())
Signed-off-b...
Yehuda Sadeh
05:57 PM Revision 7ca91922 (ceph): rgw: track object versions between reads and writes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision b77422f8 (ceph): cls_log: trim works in chunks, returns -ENODATA when done
Also created a higher level interface that iterates until
done.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:55 PM Revision 4feac81d (ceph): osd: keep track of current osd subop num
This can be used later to generate unique subop versions
in compound write operations.
Signed-off-by: Yehuda Sadeh <...
Yehuda Sadeh
05:55 PM Revision fee51dd9 (ceph): objclass: provide new api for unique subop versioning
We need to be able to generate a unique identifier for
each subop. This can be useful e.g., if we want to keep multip...
Yehuda Sadeh
05:55 PM Revision 5313b991 (ceph): cls_log: fixes, other adjustments
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 815e0ac4 (ceph): cls_log: unitest
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision c28f8647 (ceph): cls_log: more fixes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 0cdce74d (ceph): cls_log: extend unitest
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision e98ca568 (ceph): test_cls_log: remove warning
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 72220bfa (ceph): rgw: initialize meta_mgr earlier
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision fe96600e (ceph): rgw: rgw_get_system_obj() can return obj_version
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 7cfa89dc (ceph): rgw: metadata set/get infrastructure
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 1269986d (ceph): rgw: metadata manager, api to list keys
Also, implement key listing for user metadata.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:55 PM Revision dcee3a11 (ceph): rgw: put metadata, other cleanups
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision ff2d3c95 (ceph): rgw: add top level metadata handler
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 272635f3 (ceph): rgw: user metadata updates also key version
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:55 PM Revision 0cccd7c6 (ceph): rgw: metadata list user, only show uids
don't show unrelated object names
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:55 PM Revision 7c19f966 (ceph): cls_log: a class to handle info sorted by timestamp
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 8ef4f397 (ceph): rgw: get/set region map
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 8bb5e0a2 (ceph): rgw: regionmap update
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 8bab4fe0 (ceph): rgw: fix json decoding of rgw_bucket
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 6a5966c3 (ceph): cls_version: create a new objclass
New objclass to track and modify objects versions.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:54 PM Revision 8d7d436d (ceph): cls_version: various fixes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 913ecd49 (ceph): cls_version: unitest
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 5f1b897c (ceph): rgw: start tying metadata objs to version objclass
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 1563ad17 (ceph): librados: add two more ObjectOperation::exec()
- one that also gets out bufferlist and ret value pointer
- one that gets a callback context
Signed-off-by: Yehuda ...
Yehuda Sadeh
05:54 PM Revision 2223d998 (ceph): cls_version: add cls_version_read(ObjectReadOpeation&)
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 64aa4e45 (ceph): rgw: define region/zone data structures
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 960fa0d3 (ceph): rgw: region management encoding/decoding changes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 488a20de (ceph): rgw: region creation
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 6fa54800 (ceph): rgw: derr -> lderr
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision a12357cd (ceph): rgw: admin command to show region info
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 708339a0 (ceph): rgw: some region/zone related cleanups/fixes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 5cc34ff0 (ceph): rgw: can list regions, show default region info
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision 67db8a6b (ceph): rgw: set region info, default region
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:54 PM Revision e27f889d (ceph): common/ceph_parser.cc: cleanup
remove extra logging to stdout
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
05:54 PM Revision ea4d033b (ceph): rgw: zone list, setup changes
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
05:53 PM rbd Bug #4959: xfstest 17 failure
A single xfstest failed:... Josh Durgin
04:32 PM rbd Bug #4959 (Closed): xfstest 17 failure
logs: ubuntu@teuthology:/a/teuthology-2013-05-08_01:00:26-rbd-master-testing-basic/8696
Traceback (most recent cal...
Tamilarasi muthamizhan
05:32 PM Bug #4943 (Resolved): cryptsetup-bin not available on squeeze
Sage Weil
08:38 AM Bug #4943 (Resolved): cryptsetup-bin not available on squeeze

From ceph-devel mailing list:
Dear ceph developers,
I just tried to upgrade my ceph installation from bobtail...
Anonymous
05:00 PM Revision 3ff0fffd (ceph): fs/samba: Add tests for samba/cifs tasks
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
04:50 PM rgw Bug #4957: 400 Bad Request after bucket rm ... on radosgw-admin test
The test fails with bucket that starts with underscore. Not sure why we should allow that. In any case, there's a con... Yehuda Sadeh
04:15 PM rgw Bug #4957: 400 Bad Request after bucket rm ... on radosgw-admin test
also ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-05-08_01:00:07-rados-master-testing-basic/8632 Sage Weil
04:14 PM rgw Bug #4957 (Resolved): 400 Bad Request after bucket rm ... on radosgw-admin test
... Sage Weil
04:47 PM rgw Bug #4958 (Duplicate): rgw test failing when trying to delete a bucket
Tamilarasi muthamizhan
04:22 PM rgw Bug #4958 (Duplicate): rgw test failing when trying to delete a bucket
logs - ubuntu@teuthology:/a/teuthology-2013-05-08_01:00:07-rados-master-testing-basic/8631
This is the error, we a...
Tamilarasi muthamizhan
04:32 PM Bug #4960 (Rejected): Config value osd_max_backfills shouldn't be allowed to be set to 0

ceph osd tell '*' injectargs '--osd_max_backfills 0' causes all OSDs to crash. Also, if ceph.conf had a value of 0...
David Zafman
04:26 PM devops Feature #4667 (Rejected): ceph-deploy update
Dan Mick
04:24 PM CephFS Bug #4832: mds: failed auth_unpin assert
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-05-08_01:00:07-rados-master-testing-basic/8600 Sage Weil
03:20 PM Bug #4955 (Resolved): osd/ReplicatedPG.cc: 1078: FAILED assert(0 == "out of order op")
-1222> 2013-05-07 16:42:41.796117 7fcb871c1700 15 filestore(/var/lib/ceph/osd/ceph-3) _omap_setkeys meta/16ef7597/in... Samuel Just
03:19 PM devops Feature #4947 (Fix Under Review): Chef: Support for custom repositories
https://github.com/ceph/ceph-cookbooks/pull/31 Alexandre Marangone
10:49 AM devops Feature #4947 (Resolved): Chef: Support for custom repositories
It would be nice to have environment variables to overwrite the default repo configuration.
Env variables might look...
Alexandre Marangone
02:54 PM Bug #4952 (Resolved): ceph-create-keys talked to mon, got back non-JSON
commit:e2528ae42c455c522154c9f68b5032a3362fca8e Sage Weil
02:44 PM Bug #4952 (In Progress): ceph-create-keys talked to mon, got back non-JSON
Sage Weil
01:21 PM Bug #4952: ceph-create-keys talked to mon, got back non-JSON
Sage allows as how this is probably avoided by 30ffca77df006a244044604074779af538721f14 in
the cuttlefish branch, so...
Dan Mick
01:14 PM Bug #4952 (Resolved): ceph-create-keys talked to mon, got back non-JSON
netmass in IRC today reported a failure to create the client.admin key with ceph-deploy; he found
a crash from ceph-...
Dan Mick
02:37 PM Bug #4951 (Resolved): Ceph init script makes assumptions about osd data location
thanks for the report! committed to next, backported to cuttlefish
commit:f2a54cc9c98a9f31aef049c74ea932b2d9000d3c
Sage Weil
01:13 PM Bug #4951 (Resolved): Ceph init script makes assumptions about osd data location
_Marked major, as this will result in bad weighting in CRUSH maps, which could have odd carry-on effects._
In 0.61...
Dan Reif
02:10 PM devops Feature #4954 (Resolved): ceph-deploy: help and document need to be updated for osd create
we need to update the ceph help and documentation for ceph-deploy osd create, to also contain the usage and command s... Tamilarasi muthamizhan
02:08 PM devops Fix #4953 (Resolved): ceph-deploy: dns mismatches can cause gatherkeys to fail
Our cluster has two networks, a fast network and a control network. The servers names are nodeX for the fast network ... Anonymous
12:32 PM Bug #4922 (Won't Fix): Adding OSD to CRUSH leads to scrubbing while disabled in config

In this release to disable scrubbing you'd want much larger values for the scrub intervals. Also, you need to spec...
David Zafman
12:21 PM Bug #4895 (In Progress): leveldb: mon workload makes store.db grow without bound
Sage Weil
12:21 PM devops Bug #4924 (Resolved): ceph-deploy: gatherkeys fails on raring (cuttlefish)
commit:393c9372f82ef37fc6497dd46fc453507a463d42 Sage Weil
11:23 AM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Tamilarasi muthamizhan
11:03 AM devops Bug #4924 (Resolved): ceph-deploy: gatherkeys fails on raring (cuttlefish)
tested the fix on wip-ceph-tool. works fine. Tamilarasi muthamizhan
12:10 PM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
Yes. All time, starting in this bugzilla, I have 3 nodes with ~same (+-day) git snapshot. Now it just same.
Source...
Denis kaganovich
11:18 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
Can you confirm that all of your osds are running Cuttlefish (it should work anyway if some are still on Bobtail, but... Samuel Just
09:25 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
I trying to remove this lost part of snapshots by:
rados rm rb.0.1ee4.238e1f29.000000001300 -p rbd
& remove this fi...
Denis kaganovich
06:06 AM Bug #4937: osd/ReplicatedPG.cc: 1379: FAILED assert(0)
It can be related to trying to resolve next:
2013-05-07 07:16:03.399196 7f65257fa700 0 log [ERR] : scrub 2.81 e20...
Denis kaganovich
05:48 AM Bug #4937 (Can't reproduce): osd/ReplicatedPG.cc: 1379: FAILED assert(0)
Sure down 2 osd with next messages (this is one osd info):
0> 2013-05-08 15:20:25.634740 7fe69d736700 -1 osd/Re...
Denis kaganovich
11:59 AM Bug #4949 (Resolved): osd: assertion on CEPH_OSD_OP_OMAPGETVALS
Happens when this op is called on non existing object. This fixes it:... Yehuda Sadeh
11:23 AM rbd Bug #4446 (Need More Info): librbd: crash from opensolaris vm
Josh Durgin
11:16 AM Bug #4927 (Resolved): OSD: pg upgrade does not clear snap_collections and split still iterates ov...
Samuel Just
10:26 AM Bug #4927: OSD: pg upgrade does not clear snap_collections and split still iterates over snap_col...
OSD's are able to start with wip_split_upgrade Mike Lowe
10:19 AM Bug #4927: OSD: pg upgrade does not clear snap_collections and split still iterates over snap_col...
Sage Weil
11:09 AM Bug #4934 (Can't reproduce): ceph-deploy: librbd1 missing as a dependency
Sage was right, turns out there was an old version of librados2 still installed that was gumming up the works. Closin... Anonymous
10:58 AM Documentation #4948 (Resolved): Document how 'ceph -s' arrives at space numbers
ceph -s sums storage used and available, would be great to document how those numbers are calculated as a reference.
...
Patrick McGarry
10:52 AM rgw Feature #4715: rgw: Add support for OPTIONS HTTP method
Yehuda, can we close this? Neil Levine
10:45 AM rgw Feature #4108 (Duplicate): rgw: optionally put bucket index data in separate pool
Duplicate of 4613 Neil Levine
09:43 AM devops Feature #4945 (Rejected): build script for ceph-release rpm
Currently the ceph-release rpm is generated during the build by the ceph-build scripts. It should be broken out into... Anonymous
08:54 AM devops Cleanup #4944 (Rejected): Ensure that ceph upgrades are consistent.
We need to ensure that there are no surprises with package versions when ceph is upgraded via apt-get.
Email from ...
Anonymous
08:03 AM Fix #4942 (Resolved): librados: do not hang on auth failure on start
If a "rbd --id foo -s 1 -p bar create test" is run, it hangs instead of failing with permission denied if client.foo ... Jeff Bachtel
06:23 AM rbd Bug #4912 (Resolved): rbd: fix leak of format 2 snapshot context
Alex Elder
06:23 AM rbd Bug #4912: rbd: fix leak of format 2 snapshot context
The following has been committed to the "testing-next"
branch of the ceph-client git repository. I am holding
off ...
Alex Elder
06:18 AM rbd Bug #4911 (Resolved): rbd: revalidate only for mapping size changes
The following has been committed to the "testing-next"
branch of the ceph-client git repository. I am holding
off ...
Alex Elder
03:13 AM CephFS Bug #4241: SELinux fails because it can't set xattrs
Are you sure about that? ceph_file_iops hasn't been changed since 2009, and the methods are there. The problem still ... Carl-Johan Schenström
02:27 AM Revision f1dfcba5 (ceph): Merge branch 'wip-teuth4768a-wusui'
Conflicts:
teuthology/task/install.py
Warren Usui
02:27 AM Revision 2c34d197 (ceph): Merge branch 'wip-teuth4768a-wusui'
Conflicts:
teuthology/task/install.py
Warren Usui
01:26 AM Revision f0c0997c (ceph): doc/install/os-recs: reverse order of releases
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:17 AM Revision 9477c17a (ceph): Merge pull request #263 from wido/config-get
Add "config get <var>" to the admin
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
01:05 AM Revision f8ae2bd9 (ceph): doc: Fixed typos.
fixes: #4932
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
12:36 AM devops Bug #4936 (Resolved): ceph-deploy fails to report errors
Hi,
I like the ceph-deploy script, but it can be very confusing for new users when things go wrong. I spend my first...
hakan ardo

05/07/2013

11:11 PM Revision 452fb529 (ceph): doc: Fixed typo.
fixes: #4422
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
11:08 PM Revision a0cb5e5e (ceph): doc: Removed "and" as suggested.
fixes: #3686
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
11:00 PM Revision 783b92fe (ceph): install: default to ceph project throughout
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:51 PM Revision 5741228f (ceph): ceph_manager: add timeout option to revive, increase for power_cycle
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:48 PM Revision ad75582c (ceph): doc: Fixed hyperlink.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:43 PM Revision 87160c4d (ceph): doc: Fixed path typo.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:40 PM Bug #4934: ceph-deploy: librbd1 missing as a dependency
if you run the command inside the vm independent of ceph-deploy, ti's usually possible to figure out what apt's probl... Sage Weil
08:10 PM Bug #4934: ceph-deploy: librbd1 missing as a dependency
In a clean VM, this works properly. I'm going to re-install the host that exhibited the failure and see if it's maybe... Anonymous
06:15 PM Bug #4934 (Can't reproduce): ceph-deploy: librbd1 missing as a dependency
After running:
./ceph-deploy new MON
I ran
./ceph-deploy install MON
and it error out with this message:
cep...
Anonymous
06:49 PM Revision 2d6e4d2b (ceph): doc: Updated OS support for Cuttlefish.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:32 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
this looks like a timing thing.. the ceph-create-keys is racing with ceph-mon startup and ceph is wrongly returning s... Sage Weil
05:26 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
Although, now that I look at it, my logs do have this line:
2013-05-07 17:21:18.687516 7f7ced5ee700 0 -- 192.168.14...
Anonymous
05:24 PM devops Bug #4924: ceph-deploy: gatherkeys fails on raring (cuttlefish)
I saw a similar behavior on a 12.04 (precise) installation. Here are my logs.
2013-05-07 17:21:17.546091 7f7ced601...
Anonymous
01:28 PM devops Bug #4924 (Resolved): ceph-deploy: gatherkeys fails on raring (cuttlefish)
ceph version 0.61 (237f3f1e8d8c3b85666529860285dcdffdeda4c5)
trying to test ceph-deploy on raring and it fails at ...
Tamilarasi muthamizhan
06:10 PM Documentation #4933 (Resolved): ceph-deploy. Partition usage should be disk usage.
/dev/sdb1 should be /dev/sdb, etc. John Wilkins
06:06 PM Documentation #4932 (Resolved): Typos in Preflight
John Wilkins
06:04 PM Documentation #4932 (Resolved): Typos in Preflight
Sentence refers to chef instead of chef. John Wilkins
05:20 PM Feature #4931 (Rejected): OSD: clone from journal for btrfs
Samuel Just
05:19 PM RADOS Feature #4930 (New): OSD: replica to primary back pressure
Currently, replicas cannot throttle sub-ops. This means that a slower OSD can have OSDOps starved by sub-ops. Samuel Just
05:09 PM Revision 67b60b96 (ceph): doc: Minor tweak to the definition list style.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:57 PM Bug #4927: OSD: pg upgrade does not clear snap_collections and split still iterates over snap_col...
9303/8148) [] r=0 lpr=0 pi=8144-9302/4 lcod 0'0 mlcod 0'0 inactive] read_log done
0> 2013-05-07 18:58:40.324141...
Samuel Just
04:56 PM Bug #4927: OSD: pg upgrade does not clear snap_collections and split still iterates over snap_col...
(7812'4613,7812'5260] local-les=8406 n=421 ec=1 les/c 8406/8406 8217/8405/8405) [3,12] r=0 lpr=9302 lcod 0'0 mlcod 0'... Samuel Just
04:42 PM Bug #4927 (Resolved): OSD: pg upgrade does not clear snap_collections and split still iterates ov...
Samuel Just
04:50 PM Feature #4929 (Resolved): Erasure encoded placement group
This is the _home page_ of erasure coding implementation in Ceph. The description should contain links to all the bac... Loïc Dachary
04:47 PM Subtask #4928 (Rejected): PG/ReplicatedPG API
"work in progress":https://github.com/dachary/ceph/commits/wip-4928
* The goal is not to factor out a base class f...
Loïc Dachary
04:12 PM CephFS Documentation #4422 (Resolved): Typo on Release Process webpage
John Wilkins
04:08 PM devops Documentation #3686 (Resolved): install prerequisites (Debian)
John Wilkins
03:52 PM Bug #4918 (Resolved): teuthology: larger timeout for power cycle tests
teuthology pushed to next, 5741228f605d30640881a0dc3aa885e94061bbe1 Samuel Just
03:24 PM Bug #4895 (Need More Info): leveldb: mon workload makes store.db grow without bound
please reproduce with wip-mon-trace:
- restart ceph-mon with --mon-compact-on-start
- stop ceph-mon
- verify it ...
Sage Weil
03:17 PM Bug #4895 (In Progress): leveldb: mon workload makes store.db grow without bound
mikedawson reports that with higher load this continues to happen even with 'mon compact on trim = true'. Sage Weil
03:16 PM Revision 7b22cfb2 (ceph): PG,OSD: mark info as backfilling in _remove_pg()
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
03:16 PM Revision d7cd9574 (ceph): OSD::clear_temp should clear snap mapper entries as well
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
03:16 PM Revision 4b548b5f (ceph): OSD: removal collections will be removed inline and not queued
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
03:16 PM Revision fd63f8aa (ceph): WorkQueue: Allow WorkQueueVal to be specified with 1 type
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
02:58 PM Revision c2075169 (ceph): doc: Added glossary to TOC.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:57 PM Revision 473aae96 (ceph): doc: Added glossary.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:57 PM Revision 4e99dca9 (ceph): doc: Fixed usage typo.
fixes: #4923
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
02:53 PM devops Bug #4925 (Resolved): Incorrect yum conf for Cuttlefish and el6
Fixed.
The repo package location was moved to match the development builds, and what ceph-deploy uses. That pat...
Anonymous
01:30 PM devops Bug #4925 (Resolved): Incorrect yum conf for Cuttlefish and el6
The current ceph-release package for Cuttlefish points to a Bobtail repository.
Package: http://ceph.com/rpm-cuttl...
Alexandre Marangone
02:30 PM Revision 78518f62 (ceph): Merge pull request #262 from gollub/typos
fixed common typo in error messages Sage Weil
01:43 PM devops Bug #4877 (Resolved): ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
Anonymous
01:16 PM devops Bug #4877: ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
Resolved with the following commit:
commit 51dfee667fbce36bb888dc8af75243941f33e2b9
Author: Gary Lowell <glowell@...
Anonymous
01:18 PM devops Bug #4862: ceph-deploy: install occassionally throws exceptions though installation is successful
4877 is a duplicate of this bug. Resolved with the following commit:
commit 51dfee667fbce36bb888dc8af75243941f33e...
Anonymous
12:59 PM Feature #4846 (Resolved): builds scripts need to include raring
Anonymous
12:41 PM Feature #4846: builds scripts need to include raring
Packages for raring were successfully built for ceph v0.61. Anonymous
12:26 PM Revision 85867380 (ceph): Fix whitespace indentation
Signed-off-by: Wido den Hollander <wido@42on.com> Wido den Hollander
11:56 AM Revision ad504e94 (ceph): Implement 'config get <var>' for the admin socket
Signed-off-by: Wido den Hollander <wido@42on.com> Wido den Hollander
09:55 AM Revision 27d86bd1 (ceph): fixed common typo in error messages
Signed-off-by: Daniel Gollub <d.gollub@telekom.de> Daniel Gollub
08:02 AM Bug #4923 (Resolved): Typo in monitor bootstrap
See http://ceph.com/docs/master/dev/mon-bootstrap/#secret-keys John Wilkins
07:53 AM Bug #4923 (In Progress): Typo in monitor bootstrap
John Wilkins
04:58 AM Bug #4923: Typo in monitor bootstrap
Typo in Monitor bootstrap page (http://ceph.com/docs/master/dev/mon-bootstrap/), command to create new keyring:
<pre...
Kenneth Østrup
04:56 AM Bug #4923 (Resolved): Typo in monitor bootstrap
Kenneth Østrup
04:35 AM Revision f3963b2e (ceph): suites/marginal: Add backtrace restart test
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
04:34 AM Revision 39f99ca2 (ceph): Merge remote-tracking branch 'gh/last'
Conflicts:
suites/fs/singleton/all/cfuse_workunit_suites_blogbench_ceph_deploy.yaml
Sage Weil
04:31 AM Revision f7c8c27c (ceph): Merge branch 'next'
Sage Weil
04:31 AM Revision 61dba20d (ceph): Merge branch 'next'
Sage Weil
03:16 AM Revision 2bd27312 (ceph): doc/install/{debian,rpm}: update for cuttlefish
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:11 AM Revision 7c3a0e8b (ceph): doc/start/get-involved: fix links
ERROR: /srv/autobuild-ceph/gitbuilder.git/build/doc/start/get-involved.rst:33: Unknown target name: "tracker".
ERROR:...
Sage Weil
03:07 AM Revision b107081f (ceph): doc/release-notes: I missed rgw rest api in the release notes
Mostly from here dd19d693e6528c70167958ebc57e075200a08803
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
02:51 AM Revision e5b25bd3 (ceph): Merge branch 'next'
Gary Lowell
01:04 AM Revision f1be93f9 (ceph): install: only remove ceph data of project is ceph
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:03 AM Revision 0b7cd6a8 (ceph): task/cifs-mount.py: Task for mounting cifs
The cifs-mount task mounts a smb endpoint from the
first available smbd server (the samba.0 role). This
task is simi...
Sam Lang
12:37 AM Revision 980973dc (ceph): task/install.py: Allow installation of non-ceph
Generalizes the install task to specify a "project" which defaults to
'ceph', but can be configured to install differ...
Sam Lang
12:37 AM Revision c0f1ef73 (ceph): task/daemon-helper: Add nostdin option
Some daemons (smbd) will try to read from stdin and check if its a
socket, using that for sending/receiving messages....
Sam Lang
12:37 AM Revision 4899fa1a (ceph): task/samba.py: Samba task to setup/start smbd
The samba task sets up samba on all 'samba' roles
with ceph as the backend storage module. The task
creates a smb.co...
Sam Lang
12:15 AM Revision 9d85d67e (ceph): os/ObjectStore: add missing break in dump()
CID 751331 (#1 of 1): Missing break in switch (MISSING_BREAK)
unterminated_case: This case (value 35) is not terminat...
Sage Weil

05/06/2013

11:50 PM Bug #4922 (Won't Fix): Adding OSD to CRUSH leads to scrubbing while disabled in config
Hello,
I'm using ceph version:
ceph version 0.56.4 (63b0f854d1cef490624de5d6cf9039735c7de5ca)
I have next lines ...
Ivan Kudryavtsev
11:45 PM Revision c693ba57 (ceph): rados: add whole-object 'clonedata' command
Clone the data stream from one object to another.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:41 PM Revision 298ebdb0 (ceph): doc: Deleted redundant "so that" phrase.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
11:38 PM Revision 277f2de9 (ceph): doc: Corrected typo.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
11:38 PM Revision 359bd6df (ceph): doc: Corrected typo.
John Wilkins
11:37 PM Revision 2d4b5bd8 (ceph): Removed comment out of header, and added "coming soon."
John Wilkins
11:37 PM Revision 1cfc6e3b (ceph): doc: Updated usage for push | pull.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
11:34 PM Revision 048e0499 (ceph): Clean up defer_recovery() functions
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
11:27 PM Revision 2bbbd816 (ceph): add fs collection ceph-deploy blogbench test in new singleton suite
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
11:27 PM Revision 3219a87a (ceph): add fs collection ceph-deploy blogbench test in new singleton suite
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:34 PM Revision bd36e78f (ceph): osd: make class load errors louder
Fixes: #4639
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:21 PM Revision 0b4c5c1a (ceph): osd: optionally enable leveldb logging
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:13 PM Revision c1d5f815 (ceph): mon: allow leveldb logging
'mon leveldb log = filename'
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:18 PM Revision 237f3f1e (ceph): v0.61
Gary Lowell
08:05 PM Revision eb69c7df (ceph): os/: default to dio for non-block journals
Workaround: #4910
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
07:12 PM Revision 60603d01 (ceph): ceph-disk: use separate lock files for prepare, activate
Use a separate lock file for prepare and activate to avoid deadlock. This
didn't seem to trigger on all machines, bu...
Sage Weil
07:02 PM rbd Bug #4912 (Fix Under Review): rbd: fix leak of format 2 snapshot context
The following has been posted for review:
[PATCH] rbd: fix leak of format 2 snapshot context
Alex Elder
06:32 AM rbd Bug #4912 (Resolved): rbd: fix leak of format 2 snapshot context
When rbd_dev_v2_refresh() is called, the rbd device already has a
snapshot context associated with it. But that nev...
Alex Elder
07:01 PM rbd Bug #4911 (Fix Under Review): rbd: revalidate only for mapping size changes
The following has been posted for review:
[PATCH] rbd: revalidate only for mapping size changes
Alex Elder
04:37 AM rbd Bug #4911 (Resolved): rbd: revalidate only for mapping size changes
This commit:
d98df63e rbd: revalidate_disk upon rbd resize
instituted a call to revalidate_disk() to notify int...
Alex Elder
06:44 PM rbd Feature #3763 (In Progress): krbd: handle flattening of mapped image
I thought I'd already marked it such, but I've been working on this
for a few days. At this point I have some funct...
Alex Elder
06:30 PM Revision e662b614 (ceph): ceph-test.install: add ceph-monstore-tool and ceph-osdomap-tool
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
06:30 PM Revision eae02fd3 (ceph): ceph.spec.in: remove twice listed ceph-coverage
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
06:30 PM Revision 71cef086 (ceph): ceph.spec: add some files to ceph
Add installed, but not packaged files to ceph-test (ceph-monstore-tool,
ceph-osdomap-tool) rpm file section.
Signed-...
Danny Al-Gaaf
06:22 PM Revision 33c154ce (ceph): Fix teuthology installations on physical Centos machines.
Yum installs of packages specify a pacakge number. Initial
install of yum source changed to not fail if already done...
Warren Usui
06:19 PM Revision c5d43024 (ceph): doc: Update the usage to reflect optional directory name.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:19 PM Revision 35acb152 (ceph): doc: Rearranged to show zapping multiple disks and creating multiple OSDs.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:50 PM Revision 8add78ca (ceph): doc: Moved install to the second step, from the first step.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:28 PM Revision c3ba53d2 (ceph): Merge pull request #256 from dalgaaf/wip-da-spec-update
Fix ceph.spec.in Gary Lowell
05:28 PM Revision 0200c903 (ceph): Merge pull request #257 from dalgaaf/wip-da-fix-debian
ceph-test.install: add ceph-monstore-tool and ceph-osdomap-tool Gary Lowell
05:26 PM Revision 495438f7 (ceph): apparently this should never work on our current configs
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:26 PM Revision 7628adb6 (ceph): apparently this should never work on our current configs
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
05:08 PM Revision 6abbe680 (ceph): doc: Autonumbering syntax correction.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:04 PM Revision efa460c3 (ceph): doc: Added troubleshooting PGs to the index.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:44 PM Bug #3552: After ceph-deploy installation a reboot breaks OSDs
So, AFAICT, this is supposed to happen by virtue of ceph-disk-activate being called from udev, which will ferret out ... Dan Mick
04:44 PM Revision cddf3b53 (ceph): doc: Commented out osd list for now.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:44 PM Revision 0c0fc03e (ceph): doc: Commented out remove a mds for now.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:43 PM Revision 41eecf4f (ceph): doc: Forwarding link. FAQ migrated to new Ceph wiki.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:39 PM Fix #4840: mon: transition from old-style allow command to new command descriptions
Interested in the design/plan here; anything to share? Dan Mick
11:31 AM Fix #4840 (In Progress): mon: transition from old-style allow command to new command descriptions
Joao Eduardo Luis
04:16 PM Bug #4810 (Won't Fix): mon: forwarded messages have weird priorities
current behavior is ok. for now clients can get equal priority if they want it by setting to HIGH; currently all are... Sage Weil
04:02 PM rbd Feature #2256 (In Progress): rbd: parallelize deletions
Josh Durgin
02:57 PM CephFS Bug #4920 (Resolved): client: does not respect O_NOFOLLOW
It looks like doing an open() always implicitly follows symlinks, because we call path_walk() with followsym set to t... Greg Farnum
02:49 PM devops Bug #4919 (Resolved): ceph-deploy: disk list doesn't properly display all the disks on a VM
on a VM, the disk list command doesn't list out all the disks, it seems to list only the latest disk, that was used.
...
Tamilarasi muthamizhan
02:49 PM devops Bug #4877: ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
I think you've fixed this now, right, Gary? Dan Mick
02:46 PM rbd Feature #4917: iSCSI: Package tgt
At minimum we want:
(1) a package people can download and install (vs they must build it)
(2) something that is...
Anonymous
01:40 PM rbd Feature #4917 (Resolved): iSCSI: Package tgt
Ian Colle
02:42 PM Bug #4845 (Resolved): mon (ms): deadloop and possible assert(sync_state == SYNC_STATE_CHUNKS)
Sage Weil
02:38 PM rbd Bug #2654 (Won't Fix): Stale rbd volume cannot be unmaped
the problem here is just that you are on a 3.2 kernel. many many bugs were fixed but they were only backported as fa... Sage Weil
02:36 PM Bug #4639 (Resolved): OSD class load failure log should be on by default and as noticeable as pos...
commit:bd36e78f726bee6042359ed6dbd6ef65e1d843a2 Sage Weil
02:22 PM Bug #4639: OSD class load failure log should be on by default and as noticeable as possible
Just had another IRC user get caught in this trap today. Dan Mick
02:19 PM rgw Feature #4716 (In Progress): rgw: ability to restrict user to specific operations
Some small comments on Github. Greg Farnum
02:03 PM Bug #4918 (Resolved): teuthology: larger timeout for power cycle tests
2013-05-06T12:38:21.228 INFO:teuthology.task.thrashosds.ceph_manager:waiting on admin_socket for 2, ['version']
2013...
Samuel Just
01:42 PM Revision 3540d905 (ceph): ceph-test.install: add ceph-monstore-tool and ceph-osdomap-tool
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:21 PM Revision 6f338851 (ceph): ceph.spec.in: remove twice listed ceph-coverage
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:15 PM Bug #4910: journal Unable to read past sequence 337 but header indicates the journal has committe...
wip-4910 Ian Colle
01:09 PM Revision acb60e58 (ceph): ceph.spec: add some files to ceph
Add installed, but not packaged files to ceph-test (ceph-monstore-tool,
ceph-osdomap-tool) rpm file section.
Signed-...
Danny Al-Gaaf
12:37 PM devops Bug #4916 (Resolved): ceph-deploy: mon create fails on bobtail branch in centos 6.3
in followup with a user from the list, tried this on my local machines, and this did not work
tamil@ubuntu:~/ceph...
Tamilarasi muthamizhan
09:22 AM rgw Bug #4902: Issuess handling very large files
Added ticket 4914 to add ability to read xattr from file / stdin Ian Colle
09:19 AM Feature #4914 (Resolved): rados tool: read xattr from file / stdin
Currently xattr can only be provided as a command line parameter. We'd like to be able to provide it from a file or f... Yehuda Sadeh
09:16 AM Cleanup #4507: mon: drop atomic_t
wip-4507
The patch only drops the atomic_t and uses instead a boolean.
Joao Eduardo Luis
08:53 AM Bug #4228 (Fix Under Review): mon uses pick_addresses if invoked with mkfs or without mon addr; f...
pushed patch to wip-4228 Joao Eduardo Luis
03:56 AM Revision 496fd60e (ceph): upgrade: fix up rgw tests a bit
Sage Weil

05/05/2013

06:36 AM CephFS Bug #4909: mds: stalled/stuck directory (standby)
Directory accessed only after reboot one of node (with stalled mount's) - not after only ceph daemons restarting. Denis kaganovich
04:08 AM Revision 2d59b8a1 (ceph): nfs: debug mds
I've seen a run hang on rmdir on shutdown, and want to see why the MDS didn't
reply.
Sage Weil

05/04/2013

07:34 PM Bug #4910: journal Unable to read past sequence 337 but header indicates the journal has committe...
This also stops happening if I disable aio. Samuel Just
05:41 PM Bug #4910: journal Unable to read past sequence 337 but header indicates the journal has committe...
This reproduces very quickly without journal logging, but doesn't reproduce at all with. Samuel Just
12:46 PM Bug #4910: journal Unable to read past sequence 337 but header indicates the journal has committe...
not sure about the priority of this bug. Tamilarasi muthamizhan
12:46 PM Bug #4910: journal Unable to read past sequence 337 but header indicates the journal has committe...
ubuntu@teuthology:/a/teuthology-2013-05-04_01:00:03-rados-next-testing-basic/7008
Tamilarasi muthamizhan
12:30 PM Bug #4910: journal Unable to read past sequence 337 but header indicates the journal has committe...
... Tamilarasi muthamizhan
12:27 PM Bug #4910 (Duplicate): journal Unable to read past sequence 337 but header indicates the journal ...
logs: ubuntu@teuthology:/a/teuthology-2013-05-04_01:00:03-rados-next-testing-basic/6997... Tamilarasi muthamizhan
03:56 AM CephFS Bug #4909: mds: stalled/stuck directory (standby)
Sorry, comment 1 is about ctdbd (IMHO), forget. Only main issue. Denis kaganovich
03:51 AM CephFS Bug #4909: mds: stalled/stuck directory (standby)
& (without debug 10) now log flooding on other node (mds.4):
2013-05-04 13:47:27.648019 7fe8c59ca700 0 mds.0.serv...
Denis kaganovich
12:58 AM rgw Bug #4902: Issuess handling very large files
Yes, the underlying filesystem is XFS.
I enabled ...
Jiri Brunclik

05/03/2013

11:20 PM Revision 1a67f7b3 (ceph): mon: fix init sequence when not daemonizing
We made the common_init_finish and chdir conditional on daemonize in commit
2e0dd5ae6c8751e33d456b2b06c1204b63db959a,...
Sage Weil
11:08 PM Revision a763569e (ceph): ceph: add 'osd crush rule ...' to usage
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:04 PM Revision 3f0b8ec2 (ceph): mon: avoid null deref in Monitor::_mon_status()
mikedawson reports:
*** Caught signal (Segmentation fault) **
in thread 7f40ce270700
ceph version 0.60-801-g7ec01...
Sage Weil
10:54 PM Revision a9686922 (ceph): mon: generate useful error msgs for 'osd crush rule create-simple ...'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:53 PM Revision 8894c504 (ceph): crush: return -1 on error from get_type_id
So we can distinguish between a bad type name and type 0.
Fix both callers, too.
Signed-off-by: Sage Weil <sage@ink...
Sage Weil
08:28 PM Revision b2501e91 (ceph): ceph.spec: require xfsprogs
This is needed when creating new OSDs (via ceph-disk). At least for most
people. Eventually we'll want to include b...
Sage Weil
07:53 PM Revision 95a0bda7 (ceph): v0.56.6
Gary Lowell
07:47 PM Revision 05af17e6 (ceph): rgw: don't send tail to gc if copying object to itself
Fixes: #4776
Backport: bobtail
Need to make sure that when copying an object into itself we don't
send the tail to th...
Yehuda Sadeh
07:45 PM Revision 6dbdcf5a (ceph): ceph.spec.in: Fix platform dependecies
Picked up an incorrect dependency merging the rbd udev rules update.
Signed-off-by: Gary Lowell <gary.lowell@inktan...
Gary Lowell
07:24 PM Revision f0eb20a7 (ceph): ceph_common.sh: re-sync get_name_list with master
We backported various items but didn't catch all the changes! :(
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:18 PM CephFS Bug #4909 (Can't reproduce): mds: stalled/stuck directory (standby)
I many times break actions (debug mysql replication script, just multiple dump redirections) directly to directory, m... Denis kaganovich
07:05 PM Bug #4845: mon (ms): deadloop and possible assert(sync_state == SYNC_STATE_CHUNKS)
Or commit after few hours of this bug (something about Paxos leaks) fix it, or it is disabling tcmalloc, but now whil... Denis kaganovich
06:36 PM Revision 378eb328 (ceph): Merge branch 'next'
Sage Weil
06:36 PM Revision a0988be6 (ceph): doc/release-notes: warn about sysvinit crush map update
See c189d855e67baadf977d8ca14509dcacd579af7a.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:33 PM Revision c189d855 (ceph): init-ceph: update osd crush map position on start
This is what the upstart ceph-osd.conf does; we need to do the same so that
new OSDs (e.g., that ceph-deploy creates)...
Sage Weil
06:29 PM Revision 2e0dd5ae (ceph): mon: fork early to avoid leveldb static env state
leveldb has static state that prevents it from recreating its worker thread
after our fork(), even when we close and ...
Sage Weil
06:04 PM Revision 6f8c1e9c (ceph): doc/release-notes: add/link complete changelogs
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:04 PM Revision 4fa2c497 (ceph): doc/release-notes: v0.56.5
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:14 PM rgw Bug #4902: Issuess handling very large files
On a second thought, the 'rados cp' stuff isn't going to work as it won't be able to list the xattrs. Yehuda Sadeh
02:04 PM rgw Bug #4902: Issuess handling very large files
Are you using xfs? What happens is that we split the extended attribute data over multiple xattrs, as it became too b... Yehuda Sadeh
12:40 PM rgw Bug #4902: Issuess handling very large files
... Jiri Brunclik
11:47 AM rgw Bug #4902: Issuess handling very large files
Can you try running:... Yehuda Sadeh
11:03 AM rgw Bug #4902 (In Progress): Issuess handling very large files
Ian Colle
03:37 AM rgw Bug #4902 (Resolved): Issuess handling very large files
Hi,
I am new to Ceph. I am running version 0.56 on Debian Squeeze. I would like to use it to store very large file...
Jiri Brunclik
02:53 PM CephFS Bug #4894: mds: standby shut itself down due to not having any data
MDS::boot_create() first starts a new log segment (its ESubtreemap is empty), then use MDCache::create_empty_hierarch... Zheng Yan
10:40 AM CephFS Bug #4894: mds: standby shut itself down due to not having any data
You must be racing ahead of me here, Yan — what's your theory? Just that the first active MDS failed to write any log... Greg Farnum
01:57 PM Bug #4907 (Resolved): rados python bindings: get_xattr() uses a fixed 4k buffer
get_xattr() uses a fixed 4k buffer and does not handle ERANGE. Yehuda Sadeh
12:24 PM CephFS Feature #4906 (Resolved): ceph-fuse: use the Preforker class
Sage wrote a Preforker class for the Monitor. We should switch to using that instead of our own band-aided daemonizat... Greg Farnum
12:13 PM rgw Bug #4905 (Resolved): rgw: log formatter for ops socket not protected
Missing a lock. Yehuda Sadeh
11:40 AM Bug #4895: leveldb: mon workload makes store.db grow without bound
we're hoping the #4851 forking thing will fix this too.. mikedawson is testing! Sage Weil
11:37 AM Bug #4896 (Resolved): leveldb: stuck on leveldb::DBImpl::CompactRange()
Sage Weil
11:37 AM devops Bug #4901 (Resolved): ceph-deploy: cluster is not operational on centos 6.3
Sage Weil
11:29 AM devops Bug #4901 (Fix Under Review): ceph-deploy: cluster is not operational on centos 6.3
Sage Weil
11:37 AM Bug #4851 (Resolved): leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil
02:25 AM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Ok, my conclusion was a bit pre-mature. mon1 just went down (while the other took over) and a bt showed it was hangin... Wido den Hollander
01:48 AM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil wrote:
> wip-leveldb-reopen
>
> needs testing and review
I build this morning with that branch and i...
Wido den Hollander
11:26 AM Feature #4903 (New): OpTracker: support hierarchies of request classes
Right now, a TrackedOp is associated with an OpTracker, and all our statistics come out of that. It would be nice if ... Greg Farnum
11:07 AM rgw Feature #4716 (Fix Under Review): rgw: ability to restrict user to specific operations
Greg, Can you please review this? Ian Colle
10:20 AM Feature #4846 (In Progress): builds scripts need to include raring
pbuilder environment has been updated for raring. Test build mostly worked. Will continue to work on this. Anonymous
10:18 AM Tasks #4889 (Resolved): Cut v0.56.5 release
Bobtail (v0.56.5) pushed out. Anonymous
09:51 AM devops Bug #4877: ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
I pushed a wip-4862 ceph-deploy branch. This puts a pushy close at the end of each connection. There is also an upd... Anonymous
01:08 AM Revision df884bb7 (ceph): v0.56.5
Gary Lowell
01:05 AM Revision b38cbabb (ceph): ceph.spec.in: fix udev rules.d files handling
Move 50-rbd.rules into the ceph base package since the related
ceph-rbdnamer binary is part of this package. Use corr...
Danny Al-Gaaf

05/02/2013

10:32 PM devops Bug #4900: ceph-deploy: on RHEL 6.3, the cluster is not UP yet
this was tested on wip-sysvinit branch. Tamilarasi muthamizhan
10:25 PM devops Bug #4900 (Resolved): ceph-deploy: on RHEL 6.3, the cluster is not UP yet
tested and this works fine on burnupi27, burnupi28 Tamilarasi muthamizhan
04:55 PM devops Bug #4900 (Resolved): ceph-deploy: on RHEL 6.3, the cluster is not UP yet
build ceph cluster on burnupi27, burnupi28 using ceph-deploy.
while all the commands worked fine, the cluster is s...
Tamilarasi muthamizhan
10:32 PM Revision 72fc6eb2 (ceph): doc: Fixed typos.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:31 PM devops Bug #4901 (Resolved): ceph-deploy: cluster is not operational on centos 6.3
while the wip-sysvinit branch works fine for rhel 6.3, am still not able to get the cluster operational on my centos ... Tamilarasi muthamizhan
10:22 PM Bug #4851 (Fix Under Review): leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
wip-leveldb-reopen
needs testing and review
Sage Weil
06:17 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
http://pastebin.com/raw.php?i=mmVeZ4ik
notice that hte thread doing all the work (the background thread) is the lo...
Sage Weil
06:04 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)

(05:45:49 PM) mikedawson: sagewk: working -> http://pastebin.com/raw.php?i=SQzePEn2 hung -> http://pastebin.com/r...
Sage Weil
05:01 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
leveldb log with successful quorum: http://pastebin.com/raw.php?i=SQzePEn2 Sage Weil
02:41 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
procedure for generating a leveldb transaction log that we can (hopefully) use to reproduce this:
- install wip_mo...
Sage Weil
12:57 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
so far, --mon-leveldb-paranoid seems to work around this. Sage Weil
10:56 AM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
So, back to the three monitors which aren't working yet.
What I noticed is this:...
Wido den Hollander
04:13 AM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil wrote:
> can you try wip-paranoid with --mon-leveldb-paranoid when starting ceph-mon, and see what happens...
Wido den Hollander
09:26 PM rgw Feature #4716 (In Progress): rgw: ability to restrict user to specific operations
Yehuda Sadeh
08:47 PM Revision e3b0e1e8 (ceph): s3tests: add force-branch with higher precdence than 'branch'
This way we can force a branch despite something in overrides.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:47 PM Revision a9f3eb63 (ceph): s3tests: add force-branch with higher precdence than 'branch'
This way we can force a branch despite something in overrides.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:33 PM Revision bae62bf3 (ceph): Merge branch 'next'
Sage Weil
08:33 PM Revision 5cdd7317 (ceph): Revert "mon: fix Monitor::pick_random_mon()"
This reverts commit 741f46852380c8e75669f6d7bf1202adad0358fb.
This is fixed in next; revert this to avoid a conflict...
Sage Weil
08:32 PM Revision 4f49565b (ceph): Merge remote-tracking branch 'gh/wip-mon-rank' into next
Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
08:19 PM Revision b4e73cc6 (ceph): doc/install/upgrading...: note that argonaut->bobtail->cuttlefish must ...
Which will be released shortly.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
07:54 PM Revision 039a3a97 (ceph): tools/: add paranoid option to ceph-osdomap-tool
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
07:47 PM Revision 26105280 (ceph): osd: default 'osd leveldb paranoid = false'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:32 PM Revision 444660ed (ceph): librados,client: bump mount timeout to 5 min
30 seconds is pretty short.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktan...
Sage Weil
07:30 PM Revision debbc79e (ceph): Merge pull request #251 from bkerensa/patch-1
Improve verbiage
Signed-off-by: Benjamin Kerensa <bkerensa@ubuntu.com>
Sage Weil
07:29 PM CephFS Bug #4894: mds: standby shut itself down due to not having any data
I think MDS::boot_create() should start a new log segment after creating the fs hierarchy. Zheng Yan
10:56 AM CephFS Bug #4894 (Resolved): mds: standby shut itself down due to not having any data
... Greg Farnum
07:21 PM Revision 6a612687 (ceph): OSD: also walk maps individually for start_split in consume_map()
We need to go map-by-map to get the parents right in consume_map()
just as we must in load_pgs().
Fixes: 4884
Signed...
Samuel Just
06:06 PM Revision c659dd76 (ceph): rgw: increase startup timeout to 5 min
30s is too short.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:18 PM Revision 65d61f7a (ceph): Merge branch 'wip-paranoid' into next
Sage Weil
04:38 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
simple workaround
run this command by cron, every 5 minutes:...
Vladislav Gorbunov
04:26 PM Revision d0678a06 (ceph): debian: only start/stop upstart jobs if upstart is present
This avoids errors on non-upstart distros (like wheezy).
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked ...
Sage Weil
04:25 PM Revision 209ce34a (ceph): debian: stop ceph-mds before uninstalling ceph-mds
Fixes: #4384
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 9eb0d91b867ab980135d7c6ff6347d69d...
Sage Weil
03:19 PM Revision 366781e8 (ceph): upgrade/rgw: run first s3tests pass using bobtail tests
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:18 PM devops Bug #4862: ceph-deploy: install occassionally throws exceptions though installation is successful
Clarifications: all hosts were running centos. Tamil guesses it happens 25% of the time.
See also #4877
Dan Mick
03:05 PM CephFS Feature #4326 (Fix Under Review): qa: add samba + (kclient|ceph-fuse) to suite
These changes were part of the samba.py task changes in wip-samba-tasks. An example of use is in ceph-qa-suite:suite... Sam Lang
02:31 PM Bug #4898 (Resolved): ceph-deploy osd create hostname fails with traceback (should catch error)
Dan Mick
02:28 PM Bug #4898 (Fix Under Review): ceph-deploy osd create hostname fails with traceback (should catch ...
Dan Mick
01:32 PM Bug #4898: ceph-deploy osd create hostname fails with traceback (should catch error)
Dan Mick
01:31 PM Bug #4898 (Resolved): ceph-deploy osd create hostname fails with traceback (should catch error)
Dan Mick
02:30 PM Revision 45c9e24f (ceph): doc/install/upgrading...: note about transitioning to ceph-deploy
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:30 PM Revision a8d46473 (ceph): doc/release-notes: note about ceph-deploy
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:05 PM rbd Feature #3763: krbd: handle flattening of mapped image
I'm trying to decide what to do with this issue.
In my mind, it has always had something to do
with dealing with ...
Alex Elder
01:36 PM Bug #4731 (Fix Under Review): PG: don't write out pg epoch on every map activation
Sage Weil
01:31 PM rbd Bug #4897 (Duplicate): qemu rbd driver should allow manipulation of format 2, striped images
From irc, someone points out that qemu-img convert can't create anything but format 1 images. We should update the b... Dan Mick
01:20 PM Documentation #4874 (Resolved): upgrading from bobtail to cuttlefish section
Sage Weil
09:28 AM Documentation #4874 (Fix Under Review): upgrading from bobtail to cuttlefish section
Sage Weil
01:10 PM devops Bug #4877 (In Progress): ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
Dan Mick
01:10 PM devops Bug #4877: ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
Dan Mick
01:09 PM devops Bug #4877: ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
The opinion is that this is after the work is done, so it just an annoyance, and perhaps we can cleanly close the pus... Dan Mick
12:57 PM Bug #4896: leveldb: stuck on leveldb::DBImpl::CompactRange()
--mon-leveldb-paranoid works around this, just like #4851 Sage Weil
12:56 PM Bug #4896 (Resolved): leveldb: stuck on leveldb::DBImpl::CompactRange()
... Sage Weil
12:53 PM Bug #4895 (Resolved): leveldb: mon workload makes store.db grow without bound
this is confirmed to happen to mikedawson with 32KB block size when the explicit compaction is disabled (mon compact ... Sage Weil
12:38 PM rbd Bug #4833 (Resolved): krbd: fix a bug in resizing a mapping
I forgot to close this.
This has been committed to the "testing" branch:
e28626a rbd: fix a bug in resizing a m...
Alex Elder
12:36 PM rbd Bug #4880 (Resolved): krbd: clear EXISTS flag if mapped snapshot disappears
The following have been committed to the "testing" branch
of the ceph-client git repository:
15228ed rbd: clear E...
Alex Elder
12:35 PM rbd Feature #3926 (Resolved): krbd: use slab allocation for common data structures
The following have been committed to the "testing" branch
of the ceph-client git repository:
1c2a9df rbd: allocat...
Alex Elder
12:33 PM rbd Bug #4803 (Resolved): krbd: memory leaks while testing layered images
The following has been committed to the ceph-client
"testing" branch:
b5b09be rbd: fix image request leak on pare...
Alex Elder
05:43 AM rbd Bug #4803 (Fix Under Review): krbd: memory leaks while testing layered images
The following has been posted for review:
[PATCH] rbd: fix image request leak on parent read
It is available on...
Alex Elder
12:23 PM Bug #4884 (Resolved): osd/OSD.cc: 217: FAILED assert(piter != rev_pending_splits.end())
Samuel Just
10:55 AM Bug #4884: osd/OSD.cc: 217: FAILED assert(piter != rev_pending_splits.end())
Had logs this time. wip_4884. Samuel Just
11:42 AM Bug #4892 (Resolved): ceph cluster hung on centos setup when run using teuthology ceph task
works fine after we stop iptables service on centos machines. Tamilarasi muthamizhan
09:29 AM Bug #4892: ceph cluster hung on centos setup when run using teuthology ceph task
until we know more Sage Weil
11:27 AM devops Bug #4890: ceph-deploy: install fails on RHEL 6.3
It looks like it possible to clean a specific repo cache by:
yum clean all --enablerepo=<repo> --disablerepo="*"
...
Anonymous
10:04 AM devops Bug #4890 (Resolved): ceph-deploy: install fails on RHEL 6.3
after some discussion we think we should not do anything drastic. customers are unlikely to see this, and the root is... Sage Weil
09:02 AM devops Bug #4890: ceph-deploy: install fails on RHEL 6.3
whoops, i thought this was the teuthology side.
Gary, what do you think ceph-deploy should do? Install the releas...
Sage Weil
10:53 AM rbd Bug #4661: xfstest 139 hung
A little more info... I think this machine may need some repair
so I'm going to reboot it--I can't get anything mor...
Alex Elder
09:18 AM rbd Bug #4661: xfstest 139 hung
The kernel did not drop into kdb, but it's still alive.
(Machine is mira064, by the way.)
Well, not fully alive. ...
Alex Elder
09:04 AM rbd Bug #4661: xfstest 139 hung
OK, I hit something.
Things were definitely running concurrently this time--I saw
interleaved output from all thr...
Alex Elder
08:17 AM rbd Bug #4661: xfstest 139 hung
what might be more effective is taking the full sequence of tests run by the node when we do the full run and do that... Sage Weil
05:44 AM rbd Bug #4661: xfstest 139 hung
Still no hangs.
HOWEVER I screwed up. I put the iteration count
at 1000 for running xfstests. However I only di...
Alex Elder
10:21 AM Bug #4871 (Can't reproduce): clone_range api test failure with thrashing
yeah, marking can't reproduce for now Sage Weil
09:26 AM Revision f95a053b (ceph): Update debian.rst
"complete list of distributions" should be complete list of releases since we already know what distributions are sup... Benjamin Kerensa
08:36 AM devops Bug #4767 (Resolved): ceph-deploy: install should default to picking cuttlefish when cuttlefish i...
commit:5fcdcbfb2d5a291316b1e1989e68f5a65613448c
until the cuttlefish repo appears users can explicitly specify --s...
Sage Weil
08:35 AM Bug #4891 (Resolved): Performance regression with KRBD reads and random reads with lots of concur...
Sage Weil
08:28 AM CephFS Bug #4832: mds: failed auth_unpin assert
hit this again:... Sage Weil
06:11 AM rbd Bug #3871 (Resolved): krbd: initial header read may be out of date
I guess I neglected to mark this resolved.
The following was committed to the "testing" branch
of the ceph-client...
Alex Elder
06:09 AM rbd Bug #4559: krbd: kernel BUG when mapping unexisting rbd device
I haven't checked, but I presume this one is still
happening. It's pretty important, considering it
produces a cra...
Alex Elder
06:08 AM rbd Bug #4802 (Resolved): krbd: walk through error paths and fix them
This code walk-through process is and will be ongoing.
I have committed all the patches mentioned above, and
I'm ba...
Alex Elder
05:30 AM Bug #4893 (Duplicate): Moniter hang caused by LevelDB
this is a dup of #4851 Zheng Yan
05:06 AM Bug #4893 (Duplicate): Moniter hang caused by LevelDB
When running osd, mds and monitor on the same machine, the monitor hangs quite often.
https://code.google.com/p/le...
Zheng Yan
04:46 AM Revision 6a91ecb1 (ceph): Merge branch 'next'
Sage Weil
12:24 AM Revision 17c14b25 (ceph): Merge remote-tracking branch 'gh/wip-doc-cuttlefish' into next
Sage Weil
12:18 AM Revision 9e6f7b12 (ceph): nuke.py: Allow ipmi power cycling to be skipped
Some nodes don't have ipmi setup. Allow nuke to
skip the ipmi checking if -i (--noipmi) is specified.
Signed-off-by...
Sam Lang
12:10 AM Revision d230fb88 (ceph): upgrade: fix client ids
Sage Weil

05/01/2013

11:36 PM Revision a488d610 (ceph): upgrade rgw: increase client mount timeout
This should let radosgw restart and connect once the cluster is finished upgrading
Signed-off-by: Josh Durgin <josh....
Josh Durgin
11:11 PM Revision c194151a (ceph): Merge remote-tracking branch 'upstream/wip_4884' into next
Fixes: #4884
Reviewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
10:43 PM Revision d0d93a74 (ceph): tools: ceph-osdomap-tool.cc
Add tool for dumping info from osd omap.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:43 PM Revision 628e2320 (ceph): Makefile: put ceph_monstore_tool in bin_DEBUGPROGRAMS
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:43 PM Revision 615b84b1 (ceph): Makefile,gitignore: ceph-monstore-tool, not ceph_monstore_tool
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:18 PM Revision 01d2e159 (ceph): For vms, fix some bad default configuration settings.
Fixes: #4881
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Warren Usui
09:59 PM Revision f4982268 (ceph): OSD: load_pgs() should fill in start_split honestly
In load_pgs(), we previously called assigned children starting
at the loaded pg created between its stored epoch and ...
Samuel Just
09:56 PM Revision 3e0ca62b (ceph): OSD: cancel_pending_splits needs to cancel all descendants
expand_pg_num() and load_pgs() may result in a pg with children
in pending_splits which also have children in pending...
Samuel Just
09:40 PM Revision d9441808 (ceph): osd: add --osd-leveldb-paranoid flag
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:40 PM Revision 7cc0a352 (ceph): mon: add --mon-leveldb-paranoid flag
This is sort of equivalent to an fsck.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:10 PM Revision dfacd1bd (ceph): dumper: fix Objecter locking
Locking expectations changed at some point, and the Dumper wasn't
updated to comply:
1) We need to take the lock for ...
Greg Farnum
09:08 PM Revision 0c91becf (ceph): Makefile.am: Add -lpthread to fix build on newer ld in Raring Ringtail
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 9...
Dan Mick
09:03 PM Revision fdbab85f (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
09:03 PM Revision 7bb145b2 (ceph): doc/rados/deploy: note that osd delete does not work yet
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:02 PM Revision 771f4529 (ceph): doc/rados/deploy: misc edits
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
08:52 PM Revision c838e1f4 (ceph): Revert "Revert "Specify xfs for osd powercycle testing""
Pushed a fix to teuthology, should work now.
This reverts commit 853e8fdb731e863e4703d86d8852ecdb4160275f.
Samuel Just
08:40 PM devops Bug #4890 (In Progress): ceph-deploy: install fails on RHEL 6.3
Warren, sounds like we should do 'yum clean all' in three places:
- right after we install the yum source rpm (in ...
Sage Weil
06:38 PM devops Bug #4890 (Resolved): ceph-deploy: install fails on RHEL 6.3
BTW, I've left burnupi27/28 with ceph installed. There is a ~ubuntu/remove.sh script on both systems that does the c... Anonymous
06:34 PM devops Bug #4890: ceph-deploy: install fails on RHEL 6.3
This error was due to cached metadata on the target systems. ceph-deploy is attempting to install version 0.60-851, ... Anonymous
05:22 PM devops Bug #4890 (Resolved): ceph-deploy: install fails on RHEL 6.3
this is the error seen, when trying to install ceph on RHEL machines
tamil@tamil-VirtualBox:~/rhel/ceph-deploy$ ./...
Tamilarasi muthamizhan
08:14 PM Revision b124e8ea (ceph): ceph_manager: mount_osd_data expects osd as a str
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:14 PM Revision b948406a (ceph): ceph.py: set up ctx.disk_config outside of the loop
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
08:13 PM Revision 0382aa60 (ceph): ceph.py: the journal component does not current work with restart
Removing for the time being.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
07:59 PM rbd Bug #4803: krbd: memory leaks while testing layered images
I believe I have found the leak. I have done UML testing
and am about to initiate teuthology testing. I'll post
t...
Alex Elder
07:57 PM Revision 15e6544f (ceph): Merge remote-tracking branch 'gh/bobtail-deploy' into bobtail-next
Sage Weil
07:52 PM rbd Bug #4661: xfstest 139 hung
135 iterations completed without hanging, running two clients
each running tests 137-141.
Quite honestly though I...
Alex Elder
10:47 AM rbd Bug #4661: xfstest 139 hung
I have been trying to reproduce this problem by having just
one client run tests 137-141, repeatedly. At this point...
Alex Elder
07:41 PM Feature #4273 (In Progress): osd: prioritize recovery for degraded pgs
David Zafman
06:05 PM Revision a21ea018 (ceph): Revert "PaxosService: use get and put for version_t"
This reverts commit e725c3e210b244e090d70c77d937c94f4f63a2be.
These inadvertantely got rid of the prefix portion of ...
Sage Weil
05:57 PM Revision 88c030fc (ceph): mon/Paxos: update first_committed when we trim
The Paxos::trim() -> ::trim_to() path trims old states but does not
update first_committed. This misinforms later pa...
Sage Weil
05:57 PM Revision 3a6138b2 (ceph): mon/Paxos: don't ignore peer first_committed
We go to the effort of keeping a map of the peer's first/last committed
so that we can send the right commits during ...
Sage Weil
05:45 PM Revision bb270f86 (ceph): mon: Monitor: fix bug on _pick_random_mon() that would choose an invali...
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
05:45 PM Revision 7f48fd06 (ceph): mon: Monitor: use rank instead of name when randomly picking monitors
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
05:43 PM Revision 8a8ae159 (ceph): OSD: clean up in progress split state on pg removal
There are two cases: 1) The parent pg has not yet initiated the split 2) The
parent pg has initiated the split.
Prev...
Samuel Just
05:39 PM Bug #4892 (Resolved): ceph cluster hung on centos setup when run using teuthology ceph task
test setup: burnupi50, burnupi51, burnupi52 [running centos 6.3]
when running ceph task using teuthology, it hung ...
Tamilarasi muthamizhan
05:23 PM Bug #4891 (Resolved): Performance regression with KRBD reads and random reads with lots of concur...
Seeing performance regressions with KRBD reads and random reads. Primarily affecting EXT4, also hitting XFS with lot... Mark Nelson
05:13 PM Tasks #4889: Cut v0.56.5 release
updated bobtail branch with bobtail-deploy.. i think its everything we need. Sage Weil
05:08 PM Tasks #4889 (Resolved): Cut v0.56.5 release
As discussed verbally; consult with Sage. Greg Farnum
04:52 PM Linux kernel client Feature #4888 (New): krbd: support boot from root file system on an rbd image
Somebody asked about it on IRC, and I've been thinking we
should figure out what it would take. Sage just got asked...
Alex Elder
04:52 PM Revision 8f76d2ee (ceph): Merge remote branch 'origin/next'
Josh Durgin
04:52 PM Revision b366ad33 (ceph): Merge remote branch 'origin/next'
Josh Durgin
04:45 PM Bug #3904 (Resolved): FAILED assert(want_acting.empty())
Samuel Just
04:45 PM Bug #4805 (Resolved): ReplicatedPG: pull bug
Samuel Just
04:39 PM Bug #4871: clone_range api test failure with thrashing
Per Sam, possible ext4 issue? Ian Colle
10:42 AM Bug #4871 (In Progress): clone_range api test failure with thrashing
trying to reproduce with osd logs Sage Weil
04:37 PM Bug #4884 (Resolved): osd/OSD.cc: 217: FAILED assert(piter != rev_pending_splits.end())
Samuel Just
03:28 PM Bug #4884: osd/OSD.cc: 217: FAILED assert(piter != rev_pending_splits.end())
Reviewed-by Greg Farnum
03:01 PM Bug #4884: osd/OSD.cc: 217: FAILED assert(piter != rev_pending_splits.end())
I think it's a bug in load_pgs(). Testing wip_4884. Samuel Just
01:55 PM Bug #4884 (Resolved): osd/OSD.cc: 217: FAILED assert(piter != rev_pending_splits.end())
0> 2013-05-01 13:23:44.951665 7f7315d41700 -1 osd/OSD.cc: In function 'void OSDService::mark_split_in_progress(p... David Zafman
03:36 PM Bug #4873: osd: scrub found missing object on primary
First thing is we see that osd.0 goes down while pgs are creating (actually, splitting).
2013-04-30 21:55:31.27002...
Samuel Just
11:06 AM Bug #4873 (In Progress): osd: scrub found missing object on primary
Sage Weil
03:06 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
This is happening with the argonaut ceph-fuse daemon, not a cuttlefish one. Going to turn this down to High again and... Greg Farnum
02:19 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
steps to reproduce:
bring up a cluster of 2 nodes running argonaut, run blogbench workload on it from client.
upg...
Tamilarasi muthamizhan
12:46 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
Are you still running old clients when you hit this? Greg Farnum
12:04 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
hitting this pretty consistently when upgrading mds from argonaut to cuttlefish.
have this reproduced on burnupi39...
Tamilarasi muthamizhan
03:02 PM devops Bug #4887 (Resolved): ceph-disk list: Doesn't list properly devices using dm-mapper
... Alexandre Marangone
02:57 PM Revision 7f99b46a (ceph): Merge pull request #250 from wido/docs
docs: Various updates for the documentation Sage Weil
02:47 PM rbd Feature #3926 (Fix Under Review): krbd: use slab allocation for common data structures
In addition to those in rbd, the following patches
for libceph have been posted for review. They are
also availabl...
Alex Elder
02:46 PM rbd Feature #3926: krbd: use slab allocation for common data structures
The following patches have been posted for review.
They are available in the "review/wip-slabs" branch
of the ceph-...
Alex Elder
02:45 PM rbd Bug #4880 (Fix Under Review): krbd: clear EXISTS flag if mapped snapshot disappears
The following patches have been posted for review.
They are available in the "review/wip-slabs" branch
of the ceph-...
Alex Elder
05:05 AM rbd Bug #4880 (Resolved): krbd: clear EXISTS flag if mapped snapshot disappears
This commit removed the rbd device snapshot list:
17e4749 rbd: kill off the snapshot list
It inadvertently re...
Alex Elder
02:42 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
can you try wip-paranoid with --mon-leveldb-paranoid when starting ceph-mon, and see what happens? that enables leve... Sage Weil
02:29 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil wrote:
> it looks like leveldb is doing a compaction. is the processing using disk? how big is store.db?...
Wido den Hollander
02:20 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
it looks like leveldb is doing a compaction. is the processing using disk? how big is store.db? Sage Weil
02:18 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil wrote:
> can you attach with gdb and do 'thread apply all bt'?
Done! I've added the output.
Wido den Hollander
01:40 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
can you attach with gdb and do 'thread apply all bt'? Sage Weil
01:30 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil wrote:
> full mon log starting from a ceph-mon restart would be ideal. thanks!
I added logs with this s...
Wido den Hollander
01:08 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
full mon log starting from a ceph-mon restart would be ideal. thanks! Sage Weil
01:06 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
I'm now also seeing this on a single-monitor cluster running the next branch (fe68afe9d10bc5d49a05a8bafa644d57783447c... Wido den Hollander
02:27 PM CephFS Bug #4105 (Resolved): mds: fix up the Dumper
Merged into next in commit:dfacd1bd805ebb730b5206c9830b28f47cc7f9cf. Hurray! Greg Farnum
02:20 PM CephFS Bug #4105 (Fix Under Review): mds: fix up the Dumper
wip-4105-mds-dumper
Wasn't actually that complicated; it's just the locking expectations around the Objecter chang...
Greg Farnum
02:23 PM CephFS Feature #4886 (Resolved): teuthology: add tests that use the MDS dumper
We want to prevent the Dumper from bitrotting like it has been. Figure out a simple and effective way to test the dum... Greg Farnum
02:20 PM CephFS Feature #4885 (Resolved): dumper: do an incremental log dump
Right now we read it all into memory and then dump it out into a file. So far that's been okay, but we probably want ... Greg Farnum
01:43 PM Fix #3884 (In Progress): osd: resurrect partially deleted PGs
Sage Weil
12:52 PM Tasks #4842 (Rejected): blueprint: erasure coded pg infrastructure
Sage Weil
12:48 PM Tasks #4843 (Resolved): blueprint: crush library, language extensions
Sage Weil
12:36 PM Tasks #4844 (Resolved): blueprint: stats infrastructure (collectd, statsd, graphite, ...)
Sage Weil
12:28 PM Tasks #4841 (Resolved): blueprint: rados namespaces
Sage Weil
12:13 PM Messengers Bug #2569 (Resolved): msgr: connect_rank crash
this is a known problem in argonaut that we aren't going to backport the fix for. Sage Weil
12:11 PM Messengers Bug #2569: msgr: connect_rank crash
re-pasting it... Tamilarasi muthamizhan
12:11 PM Messengers Bug #2569: msgr: connect_rank crash
2013-05-01 11:39:20.698027 7fefeaa9f700 -1 msg/SimpleMessenger.cc: In function 'void SimpleMessenger::Pipe::register_... Tamilarasi muthamizhan
12:10 PM Messengers Bug #2569 (In Progress): msgr: connect_rank crash
hit this on burnupi39 on argonaut branch, when trying to run upgrade test from argonaut to cuttlefish.
Tamilarasi muthamizhan
11:28 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
Hmm, I thought we handled renames properly since they involve changing the caps state. But maybe we don't propagate t... Greg Farnum
11:13 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-05-01_01:00:37-fs-next-testing-basic/4534 Sage Weil
04:41 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
I think this is a general issue. When handling MClientReconnect, if an inode is not in the cache, the MDS tries fetch... Zheng Yan
11:05 AM Bug #4872 (Resolved): mds crash decoding OSDMap
commit:a21ea0186d9a7ef136ccadf96c02ba683bc5e533 Sage Weil
10:58 AM Bug #4879 (Resolved): mon/Paxos.cc: 557: FAILED assert(begin->last_committed == last_committed)
commit:88c030fc05dcc5227ec1b3e32e9169312d640ac1 Sage Weil
10:50 AM Bug #4813 (Resolved): pgs stuck creating
Samuel Just
10:50 AM Bug #4849 (Resolved): pg stuck peering
Samuel Just
09:56 AM Revision 2f6eb39d (ceph): docs: Update links to Github and the Tracker
Wido den Hollander
09:52 AM Revision 81b06be3 (ceph): docs: Update the ceph-users join and leave addresses
These were pointing to vger, where only -devel lives. Wido den Hollander
09:09 AM Revision b3f37ea4 (ceph): docs: Update CloudStack RBD documentation
Wido den Hollander
05:00 AM rbd Bug #4868 (Resolved): rbd: kill off the snapshot list
The following have been committed to the "testing" branch
of the ceph-client git repository:
1fbd6ca rbd: look up...
Alex Elder
04:59 AM rbd Bug #3952 (Resolved): krbd: no need for object header version
The following have been committed to the "testing"
branch of the ceph-client git repository:
7178711 rbd: stop tr...
Alex Elder
04:58 AM rbd Bug #4867 (Resolved): rbd: a few small issues
The following have been committed to the "testing" branch
of the ceph-client git repository:
451411f rbd: fix up ...
Alex Elder
04:57 AM rbd Bug #4857 (Resolved): libceph: define snap context creation function
The following have been committed to the "testing" branch
of the ceph-client git repository:
6b6e51a libceph: cre...
Alex Elder
04:56 AM rbd Bug #4774 (Resolved): krbd: don't create /dev entries for backing devices
The following have been committed to the "testing" branch
of the ceph-client git repository:
452a982 rbd: drop mo...
Alex Elder
04:26 AM Revision fdc05346 (ceph): init-ceph: use remote config when starting daemons on remote nodes (-a)
If you use -a to start a remote daemon, assume the remote config is present
instead of pushing the local config. Thi...
Sage Weil
02:56 AM rbd Bug #2654: Stale rbd volume cannot be unmaped
Hi, any news on this? Leon Keijser
02:48 AM Revision 55c87e82 (ceph): PG: call check_recovery_sources in remove_down_peer_info
If we transition out of peering due to affected
prior set, we won't trigger start_peering_interval
and check_recovery...
Samuel Just
02:48 AM Revision a28c2f55 (ceph): PG: clear want_acting when we leave Primary
This is somewhat annoying actually. Intuitively we want to
clear_primary_state when we leave primary, but when we re...
Samuel Just
01:38 AM Revision 853e8fdb (ceph): Revert "Specify xfs for osd powercycle testing"
This is currently broken.
This reverts commit 79abc44205f53d4b3a12e9a0cc9430e91225a564.
Samuel Just
01:12 AM Revision 849ed598 (ceph): mon: communicate the quorum_features properly when declaring victory.
Fixes #4747.
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry picked ...
Greg Farnum
01:12 AM Revision fe68afe9 (ceph): mon: communicate the quorum_features properly when declaring victory.
Fixes #4747.
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Greg Farnum
01:04 AM Revision b17e8424 (ceph): doc: Incorporating Tamil's feedback.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:48 AM Revision bd6ea8d0 (ceph): doc: Reordered header levels for visual clarity.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:39 AM Revision bb93ebaa (ceph): doc: Fixed a few typos.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:32 AM Revision 14ce0ad1 (ceph): doc: Updated the upgrade guide for Aronaut and Bobtail to Cuttlefish.
fixes: #4874
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
12:09 AM Revision 52742fb0 (ceph): fix some errors found by pyflakes
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
12:09 AM Revision 5a7267f8 (ceph): fix some errors found by pyflakes
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
12:02 AM Revision 7df72f26 (ceph): s3tests: revert useless portion of 1c50db6a4630d07e72144dafd985c397f8a4...
Perhaps it was attempting to debug something, but it shouldn't have been committed.
Signed-off-by: Josh Durgin <josh...
Josh Durgin
12:02 AM Revision f866037f (ceph): s3tests: revert useless portion of 1c50db6a4630d07e72144dafd985c397f8a4...
Perhaps it was attempting to debug something, but it shouldn't have been committed.
Signed-off-by: Josh Durgin <josh...
Josh Durgin

04/30/2013

11:59 PM Revision 809814b6 (ceph): rgw: restart radosgw too
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
11:49 PM Revision 5a6e5607 (ceph): rgw tests: remove users after each test
These should all be cleanup up at some point. They're
almost all the same code.
Signed-off-by: Josh Durgin <josh.dur...
Josh Durgin
11:49 PM Revision 2dcce57c (ceph): rgw tests: remove users after each test
These should all be cleanup up at some point. They're
almost all the same code.
Signed-off-by: Josh Durgin <josh.dur...
Josh Durgin
11:47 PM Revision 6aba6d2c (ceph): rgw tests: clean up immediately after the test
There's no need for an explicit cleanup function, so move it back
to where it came from (except in s3roundtrip, which...
Josh Durgin
11:47 PM Revision 3c604251 (ceph): rgw tests: clean up immediately after the test
There's no need for an explicit cleanup function, so move it back
to where it came from (except in s3roundtrip, which...
Josh Durgin
11:42 PM Revision 7de29dd8 (ceph): doc/release-notes: update cuttlefish release notes to include bobtail
Collapse changes from bobtail -> cuttlefish.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:39 PM Revision 935e8685 (ceph): ceph: allow restarting radosgw
Only split once, since radosgw will have client.X after it.
Monitors and MDSs may have names with more .s as well.
S...
Josh Durgin
11:37 PM Revision 55b16c79 (ceph): rgw: add to ctx.daemons so it can be stopped/started dynamically
Name the daemon after the client it runs on, since only
one per host is supported anyway.
Signed-off-by: Josh Durgin...
Josh Durgin
11:35 PM Revision 4979df32 (ceph): misc: move daemon stopping function to a generic place
This will be useful for other daemons, like radosgw, in the future.
Signed-off-by: Josh Durgin <josh.durgin@inktank....
Josh Durgin
10:52 PM Bug #4873: osd: scrub found missing object on primary
ubuntu@teuthology:/a/sage-2013-04-30_21:21:02-rados-wip-mds-testing-basic/4319
not that the regression from #4872 ...
Sage Weil
08:21 AM Bug #4873 (Can't reproduce): osd: scrub found missing object on primary
... Sage Weil
10:50 PM Bug #4872 (Fix Under Review): mds crash decoding OSDMap
wip-4872
broke this when cleaning up the get/put helpers.
Sage Weil
10:46 PM Bug #4872: mds crash decoding OSDMap
the mds gets this:... Sage Weil
09:23 PM Bug #4872: mds crash decoding OSDMap
it's in the full osdmap decode path. the mds gets a few maps right after mount, gets nothing (basically idle) for a ... Sage Weil
03:58 PM Bug #4872: mds crash decoding OSDMap
Maybe the message is getting corrupted somehow? Greg Farnum
03:55 PM Bug #4872: mds crash decoding OSDMap
interestingly, crash is always preceeded by... Sage Weil
03:54 PM Bug #4872: mds crash decoding OSDMap
seeing this when i run the whole rados suite. cranked up mds debugging. Sage Weil
11:43 AM Bug #4872: mds crash decoding OSDMap
this looks liek a ghost.. i can't reproduce with the same commits. and the error doesn't make any sense. Sage Weil
10:07 AM Bug #4872 (In Progress): mds crash decoding OSDMap
Sage Weil
08:19 AM Bug #4872 (Resolved): mds crash decoding OSDMap
... Sage Weil
10:39 PM Revision 3cf5824f (ceph): Merge branch 'wip-4837-election-syncing' into next
Reviewed-by: Sage Weil <sage@inktank.com> Greg Farnum
10:37 PM Bug #4806 (Resolved): os/FileStore.cc: In function 'void FileStore::_set_replay_guard() failure
non-trivial to backport; split ppl should upgrade to cuttlefish. Sage Weil
09:32 PM Revision 79abc442 (ceph): Specify xfs for osd powercycle testing
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
09:19 PM Bug #4879 (Fix Under Review): mon/Paxos.cc: 557: FAILED assert(begin->last_committed == last_comm...
wip-4879 Sage Weil
08:49 PM Bug #4879 (Resolved): mon/Paxos.cc: 557: FAILED assert(begin->last_committed == last_committed)
... Sage Weil
09:16 PM Revision cd1d6fb3 (ceph): ceph-disk: tolerate /sbin/service or /usr/sbin/service
CentOS/RH has it in /sbin, others in /usr/sbin.
Backport: bobtail
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
08:50 PM Revision cedcb193 (ceph): Revert "mon: when electing, be sure acked leaders have new enough store...
This was somehow broken -- out-of-date leaders were being elected -- and
we've decided smaller band-aids are more app...
Greg Farnum
08:50 PM Revision d00b4cd7 (ceph): Revert "mon: update assert for looser requirements"
We reverted the gating by paxos sequences, so now we don't
need to look at them at all.
This reverts commit 1e6f02b3...
Greg Farnum
08:50 PM Revision a39bbdf3 (ceph): mon: if we get our own sync_start back, drop it on the floor.
We have timeouts that will clean everything up, and this can happen
in some cases that we've decided are legitimate. ...
Greg Farnum
08:50 PM Revision a97eccad (ceph): mon: Monitor: disregard paxos_max_join_drift when deciding whether to sync
We should only rely on whether our paxos version is overlap with whatever
they have -- we'll catch up later with them...
Joao Eduardo Luis
08:47 PM Revision c2bcc2a6 (ceph): ObjectCacher: wait for all reads when stopping flusher
Stopping the flusher is essentially the shutdown step for the
ObjectCacher - the next thing is actually destroying it...
Josh Durgin
08:04 PM Revision 08bf1610 (ceph): Verbose output on ceph-qa-chef.
Signed-off-by: Sandon Van Ness <sandon@inktank.com> Sandon Van Ness
06:49 PM Revision 17612a40 (ceph): Merge branch 'wip-mon-compact' into next
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
06:38 PM Bug #4747 (Resolved): Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has n...
I tested it with vstart upgrades and all looks good. Pushed the fix to "next" and backported to "bobtail". Greg Farnum
05:55 PM Bug #4747 (Fix Under Review): Upgrade monitors from argonaut->bobtail->next fails w/"Existing sto...
I've managed to reproduce this locally just using vstart. It appears that we haven't actually been setting the MMonEl... Greg Farnum
04:36 PM Bug #4747 (In Progress): Upgrade monitors from argonaut->bobtail->next fails w/"Existing store ha...
reopening the bug.
hit this again on burnupi39, burnupi45. Greg is already looking into it.
Tamilarasi muthamizhan
06:16 PM Revision ea9c76b8 (ceph): elector: trigger a mon reset whenever we bump the epoch
We need to call reset during every election cycle; luckily we
can call it more than once. bump_epoch is (by definitio...
Greg Farnum
06:01 PM Revision 6ae9bbb5 (ceph): elector: trigger a mon reset whenever we bump the epoch
We need to call reset during every election cycle; luckily we
can call it more than once. bump_epoch is (by definitio...
Greg Farnum
05:55 PM Revision 53a2c64f (ceph): Merge branch 'wip-2209' into next
Reviewed-by: Samuel Just <sam.just@inktank.com> David Zafman
05:31 PM Documentation #4874 (In Progress): upgrading from bobtail to cuttlefish section
John Wilkins
09:03 AM Documentation #4874 (Resolved): upgrading from bobtail to cuttlefish section
http://ceph.com/docs/next/install/upgrading-ceph/
on that page
Sage Weil
05:26 PM Revision 0acede3b (ceph): mon: change leveldb block size to 64K
#leveldb on freenode says > 2MB is nonsense (it might explain the weird
behavior we saw). Riak tuning guide suggests...
Sage Weil
04:15 PM Revision 4f70c898 (ceph): misc: default base_test_dir to /home/ubuntu/cephtest
This matches what the teuthworker is currently doing. Sage Weil
04:11 PM Fix #4567 (In Progress): mon: refactor mon caps; allow restriction of key/value storage by prefix
Sage Weil
04:07 PM devops Bug #4859 (Resolved): ceph-deploy: install fails on RHEL 6.3
Anonymous
04:07 PM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
Install on redhat works when I run ceph-deploy on a system with python 2.6.6. It looks like the issue is in the conn... Anonymous
03:02 PM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
Ian Colle wrote:
> Are RHEL customers really going to want to disable their subscription manager service?
I can't...
Anonymous
02:56 PM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
One of the problems testing rhel is that since we don't have a license we can't access the rhel repository for automa... Anonymous
02:39 PM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
Are RHEL customers really going to want to disable their subscription manager service? Ian Colle
02:11 PM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
The message "Unable to read consumer identity" is due to a known bug in rhel, RHEL KB # 165803,
The solution given...
Anonymous
11:40 AM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
tried with next branch, and hit this
tamil@tamil-VirtualBox:~/rhel/ceph-deploy$ ./ceph-deploy install --dev=next b...
Tamilarasi muthamizhan
09:39 AM devops Bug #4859: ceph-deploy: install fails on RHEL 6.3
Fix checked in. Waiting for confirmation that it works. Anonymous
04:03 PM devops Bug #4877 (Resolved): ceph-deploy pushy library compatibility issue between python 2.7 and 2.6.6
The is in issue with connection close when the going from ceph-deploy running on a system with python 2.7 to a target... Anonymous
03:54 PM Bug #4837 (Resolved): mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Merged into next in commit:3cf5824f60b15cdf4db4e895b4da0d6c964b9ed4. Forgot the "Fixes" tag, whoops. Greg Farnum
02:47 PM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Sage signed off on this; waiting for results from the rados multimon and monthrash suites before merging into next an... Greg Farnum
01:52 PM Bug #4837 (Fix Under Review): mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, wip-4837-election-syncing should address the assert properly. The other issue with monitors not forming a quoru... Greg Farnum
02:44 PM rbd Bug #4661 (In Progress): xfstest 139 hung
Once again, we have the same evidence in these last two
crashes as we did before, which is to say, not enough to
re...
Alex Elder
02:17 PM devops Bug #4876 (Resolved): ceph-deploy: osd create command fails to start the osds on centos 6.3
commit:cd1d6fb3f9b906f13cf281294d9272e1e92a0243 Sage Weil
01:51 PM devops Bug #4876 (Resolved): ceph-deploy: osd create command fails to start the osds on centos 6.3
while osd create command mounts the disk, it fails to start the osd daemon
tamil@tamil-VirtualBox:~/centos/ceph-de...
Tamilarasi muthamizhan
02:06 PM Revision 57404b6a (ceph): swift, s3readwrite: add missing yield
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
02:06 PM Revision 022bd4aa (ceph): swift, s3readwrite: add missing yield
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
01:48 PM rbd Bug #4827 (Resolved): librbd: use after free of ceph context or something in it
commit:c2bcc2a60c2c1f66c757c01ed6bcc6778821f81d Sage Weil
11:22 AM rbd Bug #4827: librbd: use after free of ceph context or something in it
The fix might work after all. The test was still running against the next branch since I had specified it in the ceph... Josh Durgin
01:26 PM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
The attached files include the complete client log, along with the mds logs that include 10000000004 (one of the indo... Sam Lang
09:39 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
We can't revoke on unlink because the file might still be held open with something accessing it. :) Greg Farnum
09:38 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
This looks like the client creates a file, then unlinks it, but it never removes it from its cache, because it still ... Sam Lang
05:41 AM CephFS Bug #4850 (In Progress): ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
Sam Lang
11:50 AM Bug #4815 (Resolved): mon: leveldb grows quickly and without bound
Sage Weil
11:46 AM Bug #4815: mon: leveldb grows quickly and without bound
wip-mon-compact looks good Samuel Just
11:19 AM Bug #4858 (Resolved): mon: doesn't necessarily call reset() during an election cycle
Ran tests against the rados and monitor regression suites in bobtail and in next.
Merged into next in commit:6ae9b...
Greg Farnum
11:13 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
Try setting 'rgw list buckets max chunk' to a smaller value (e.g., 100 or even 10). My guess is that certain buckets ... Yehuda Sadeh
11:03 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
I have no indications that the cluster is unhealthy (including but not limited to "ceph health"). I tried stating the... Faidon Liambotis
11:07 AM devops Feature #3255: ceph-disk: allow prepare without activate (for spares)
I was reading the new ceph-deploy documentation today. On the 'prepare' action it says:
"The prepare command only pr...
Faidon Liambotis
09:21 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
iirc the fundamental problem is that we create sockets, which allocate an inode via create_inode(), and we can't pass... Sage Weil
09:18 AM Linux kernel client Bug #147: lockdep: possible irq lock inversion dependency w/ osdc->request_mutex and con->mutex
It's possible this problem is due to rbd possibly starting
requests while in irq context. It may be further complic...
Alex Elder
08:54 AM devops Bug #4756 (Resolved): mkcephfs doesn't set up same keys as ceph-deploy
Added http://ceph.com/docs/master/rados/deployment/ceph-deploy-transition/ John Wilkins
08:38 AM rbd Feature #3893 (Rejected): krbd: document the new request code
Almost a month and nobody seems to care that I think
this should just go away.
It's going away.
Alex Elder
08:32 AM rbd Bug #3925: krbd: sysfs write lockdep warnings
I was working on trying to argue that this problem goes
away when the sysfs files for snapshots go away:
http:/...
Alex Elder
08:17 AM Bug #4871 (Can't reproduce): clone_range api test failure with thrashing
... Sage Weil
07:22 AM rbd Bug #4870 (Resolved): rbd: watch request error handling bugs
These are error conditions that may not be likely, but...
In rbd_dev_header_watch_sync():
1) if we are initiatin...
Alex Elder
07:22 AM rgw Feature #3671 (Resolved): Request for x-amz-grant-full-control support
Yehuda Sadeh
07:17 AM rgw Feature #4328 (In Progress): rgw: dr: updated buckets log: tie into internal bucket changes tracker
Yehuda Sadeh
07:15 AM rgw Feature #4330 (In Progress): rgw: dr: updated buckets log: radosgw-admin changes
Yehuda Sadeh
07:10 AM Linux kernel client Bug #4869 (New): libceph: osd_client: get_reply() generalize for more ops
Currently, the osd client's get_reply() method assumes
that there is only a single op (the first one) that
will des...
Alex Elder
06:57 AM rbd Bug #4796 (Resolved): krbd: don't create sysfs entries for snapshots of mapped images
The following has been committed to the ceph-client
"testing" branch:
f03a167 rbd: don't create sysfs entries for...
Alex Elder
06:54 AM rbd Bug #4803 (In Progress): krbd: memory leaks while testing layered images
The following have been committed to the ceph-client
"testing" branch. I am not closing this issue though,
there i...
Alex Elder
06:50 AM rbd Bug #4800 (Resolved): krbd: avoid dropping extra reference in rbd_free_disk()
This has been committed to the ceph-client "testing" branch:
b1557a5 rbd: avoid dropping extra reference in rbd_fr...
Alex Elder
05:52 AM rbd Bug #4868 (Fix Under Review): rbd: kill off the snapshot list
The following have been posted for review. They are
available in the "review/wip-rbd-cleanup-6" branch of
the ceph-...
Alex Elder
05:09 AM rbd Bug #4868 (Resolved): rbd: kill off the snapshot list
We no longer use the snapshot list for anything. When we need to
look up a snapshot name, id, size, or feature mask...
Alex Elder
05:51 AM rbd Bug #3952 (Fix Under Review): krbd: no need for object header version
The following have been posted for review. They are
available in the "review/wip-rbd-cleanup-6" branch of
the ceph-...
Alex Elder
05:48 AM rbd Bug #4867 (Fix Under Review): rbd: a few small issues
The following have been posted for review. They are
available in the "review/wip-rbd-cleanup-6" branch of
the ceph-...
Alex Elder
05:12 AM rbd Bug #4867: rbd: a few small issues
rbd: don't revalidate so much

Whenever a header object event causes a mapped rbd image to refresh
...
Alex Elder
05:11 AM rbd Bug #4867: rbd: a few small issues
rbd: snap names are pointer to constant data
Make explicit that snapshot names don't change by making functions
r...
Alex Elder
05:07 AM rbd Bug #4867: rbd: a few small issues
rbd: fix up the layering warning message
A warning gets spewed for any image being probed, including parent
image...
Alex Elder
05:05 AM rbd Bug #4867 (Resolved): rbd: a few small issues
I found and fixed a few small issues in the rbd code and I'm
just documenting them here.
Alex Elder
05:47 AM rbd Bug #4857 (Fix Under Review): libceph: define snap context creation function
The following have been posted for review. They are
available in the "review/wip-rbd-cleanup-6" branch of
the ceph...
Alex Elder
03:58 AM Revision 7d4c0dcf (ceph): Merge branch 'next'
Sage Weil
01:57 AM Revision 6f2a7df4 (ceph): doc: Fix typo.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:54 AM Revision 35a98234 (ceph): doc: Added reference to transition from mkcephfs to ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:53 AM Revision de31b618 (ceph): doc: Updated index for new pages. Added inner table.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:53 AM Revision fa9f17c5 (ceph): doc: Added transition from mkcephfs to ceph-deploy page.
fixes: #4756
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
01:52 AM Revision 02853c5e (ceph): doc: Added purge page to ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:51 AM Revision 45d12f12 (ceph): doc: Added OSD page to ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:51 AM Revision 0b912f46 (ceph): doc: Added mds page for ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:51 AM Revision 3c46c519 (ceph): doc: Added admin tasks page for ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:41 AM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
No, I checked the load already with Joao yesterday, but they weren't loaded at all.
The load was about 0.5 at the ...
Wido den Hollander
12:46 AM Revision b5e24610 (ceph): osd: Rename members and methods related to stat publish
pg_stats_lock to pg_stats_publish_lock
pg_stats_valid to pg_stats_publish_valid
pg_stats_stable to pg_stats_publish
u...
David Zafman
12:46 AM Revision adb7c8a0 (ceph): osd: read kb stats not tracked?
In read cases track stats in PG::unstable_stats
Include unstable_stats in write_info() and publish_stats_to_osd()
For...
David Zafman
12:46 AM Revision 1c15636b (ceph): Set num_rd, num_wr_kb and num_wr in various places that needed it
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:20 AM Revision bd68b82b (ceph): mon: enable 'mon compact on trim' by default; trim in larger increments
This resolves the leveldb growth-without-bound problem observed by
mikedawson, and all the badness that stems from it...
Sage Weil
12:11 AM Revision 70ce4db4 (ceph): Disable quiet mode wget output on wget for ceph-qa-chef
So maybe I can get a better idea of what is causing it to fail.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Sandon Van Ness
12:09 AM Revision 95ece012 (ceph): Merge pull request #249 from ceph/wip-cuttle-man
man page updates
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
12:08 AM Revision 929a9944 (ceph): mon: share extra probe peers with debug log, mon_status
This is useful when debugging initial quorum formation.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:01 AM Revision 030bf8aa (ceph): debian: only start/stop upstart jobs if upstart is present
This avoids errors on non-upstart distros (like wheezy).
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil

04/29/2013

11:57 PM Revision 5d20c39c (ceph): Merge remote-tracking branch 'gh/wip-up' into next
Reviewed-by: Sam Lang <sam.lang@inktank.com> Sage Weil
11:46 PM Revision 4b9325b2 (ceph): Merge pull request #248 from ctrlaltdel/next
Fix a README typo Sage Weil
11:20 PM Revision 23c591ed (ceph): Merge pull request #244 from dalgaaf/wip-da-pylint-2
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
11:01 PM Revision 825a4317 (ceph): man: update remaining copyright notices
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
11:01 PM Revision 4abf0814 (ceph): man: refresh content from rst
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
10:57 PM Revision 2b5dda0e (ceph): Merge branch 'wip_4860' into next
Reviewed-by: Sage Weil <sage@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just
10:56 PM Revision 1bd011a1 (ceph): PG,OSD: _remove_pg must remove pg keys
Instead of doing this in OSD::_remove_pg, pass a transaction
to on_removal and do it in PG.
Signed-off-by: Samuel Ju...
Samuel Just
10:56 PM Revision 71460126 (ceph): OSD: no need to remove snapdirs on _remove_pg()
The snapmapper patches removed snapdirs altogether.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:45 PM Revision 8f6a1b8f (ceph): mon/Paxos: compact on trim
Compact the paxos keys when we trim old paxos states.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:45 PM Revision 3cb4f678 (ceph): mon: compact PaxosService prefix on trim
Each time we trim a PaxosService, have leveldb compact so that the
space from removed states is reclaimed.
This is p...
Sage Weil
10:45 PM Revision a2f7d1d1 (ceph): leveldb: add compact_prefix method
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:45 PM Revision e8c98241 (ceph): mon: add compact_prefix transaction operation
Add a prefix compaction opteration to the transaction that will be
performed after the transaction applies.
Signed-o...
Sage Weil
10:45 PM Revision 90b6b6df (ceph): mon: compact leveldb on bootstrap
This is an opportunistic time to optimize our local data since we are
out of quorum. It serves as a safety net for c...
Sage Weil
10:45 PM Revision ee3cdaa8 (ceph): mon: compact leveldb on bootstrap
This is an opportunistic time to optimize our local data since we are
out of quorum. It serves as a safety net for c...
Sage Weil
10:44 PM Revision 5fa0f048 (ceph): mon: --compact argument, config option to compact the store on start
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:43 PM Revision 6a00f332 (ceph): leveldb: add compact() method
This will compact the entire store; it will be slow!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:37 PM Revision ffc8557a (ceph): doc: update rbd man page for new options
--no-progress and --allow-shrink were added recently.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
10:05 PM Revision 8b2a1475 (ceph): gitignore: add ceph_monstore_tool
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
09:50 PM Revision 29831f96 (ceph): Makefile: fix java build warning
This is a workaround that makes the warning go away. Not certain there
isn't something we should be changing...
Sig...
Sage Weil
08:53 PM Revision 418cff58 (ceph): Fix journal partition creation
With OSD sharing data and journal, the previous code created the
journal partiton from the end of the device. A uint3...
Alexandre Marangone
08:07 PM Bug #4860: OSD::_remove_pg removes info oid, but not the info keys
2b5dda0e6a31adf952ca486a53b899ef8d1ebfa1 Samuel Just
05:14 PM Bug #4860 (Resolved): OSD::_remove_pg removes info oid, but not the info keys
Samuel Just
01:49 PM Bug #4860 (Resolved): OSD::_remove_pg removes info oid, but not the info keys
Samuel Just
08:02 PM RADOS Feature #4866 (New): read kb stats should be occasionally persisted
After the fix for 2209 we still need to periodically create a transaction to persist the read stats. This can be ski... David Zafman
07:58 PM Bug #2209 (Resolved): osd: read kb stats not tracked?
adb7c8a0608659e339836b3f769d96a19841b6fb David Zafman
12:15 PM Bug #2209 (In Progress): osd: read kb stats not tracked?
David Zafman
07:19 PM Bug #4521: mon: starting a new osd crashes all mon's
A final copy would be great, you can then go ahead and wipe. Thanks! Samuel Just
04:07 PM Bug #4521: mon: starting a new osd crashes all mon's
This monitor (a) is in a state now that i cannot even start it up. I was planning on removing it and wiping the dire... Evan Felix
06:27 PM Revision 6a5be251 (ceph): Merge branch 'wip-mon-pg' into next
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
06:24 PM Revision c8ec76ee (ceph): s3tests, s3readwrite, swift: cleanup explicitly
Cleaning up test dir explicitly after run, so that
consecutive runs don't fail.
Signed-off-by: Yehuda Sadeh <yehuda@...
Yehuda Sadeh
06:24 PM Revision 820c72b8 (ceph): s3tests, s3readwrite, swift: cleanup explicitly
Cleaning up test dir explicitly after run, so that
consecutive runs don't fail.
Signed-off-by: Yehuda Sadeh <yehuda@...
Yehuda Sadeh
06:11 PM Revision a2fe0137 (ceph): mon: remap creating pgs on startup
After Monitor::init_paxos() has loaded all of the PaxosService state,
we should then map creating pgs to osds. This ...
Sage Weil
06:11 PM Revision 278186d7 (ceph): mon: only map/send pg creations if osdmap is defined
This avoids calculating new pg creation mappings if the osdmap isn't
loaded yet, which currently happens when during ...
Sage Weil
06:07 PM Revision 28d495a3 (ceph): mon: factor map_pg_creates() out of send_pg_creates()
Factor out the portion of the function that remaps creating pgs to osds
from the part that sends those pending create...
Sage Weil
05:46 PM Revision 896b2777 (ceph): client: make dup reply a louder error
If we get a dup reply something is probably wrong! We should make sure
it appears more loudly in the log. In partic...
Sage Weil
05:46 PM Revision ee553ac2 (ceph): client: fix session open vs mdsmap race with request kicking
A sequence like:
- ceph-fuse starts, make_request on getattr
- waits for mds to be active
- tries to open a sessi...
Sage Weil
05:45 PM Revision f8f762a2 (ceph): Merge branch 'wip_4836' into next
Fixes: #4836
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
05:45 PM rbd Bug #4661: xfstest 139 hung
and again!
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-04-29_01:01:03-kernel-next-testing-basic...
Sage Weil
05:44 PM rbd Bug #4661: xfstest 139 hung
happened again,
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-04-28_21:32:47-kernel-next-testing-...
Sage Weil
05:28 PM devops Bug #4865 (Resolved): ceph-disk: activate fails on debian wheezy due to missing udev by-partuuid ...
it fails because wheezy has no /dev/disk/by-partuuid. we either need to install our own rules for that, or work arou... Sage Weil
05:27 PM devops Bug #4825 (Resolved): ceph-deploy: install failed on debian-wheezy
Sage Weil
05:03 PM devops Bug #4825: ceph-deploy: install failed on debian-wheezy
pushed fix to next for the start/stop errors.
did lsb-release get installed manually? i don't see that in the o...
Sage Weil
05:21 PM Bug #4815 (Fix Under Review): mon: leveldb grows quickly and without bound
wip-mon-compact Sage Weil
05:18 PM devops Bug #4864 (Resolved): ceph-deploy: mon create command seems to output info about the first node only
tamil@ubuntu:~/ceph-deploy-latest/centos/ceph-deploy$ ./ceph-deploy mon create burnupi05 burnupi21
ceph-mon: mon.non...
Tamilarasi muthamizhan
05:01 PM rbd Bug #4827 (In Progress): librbd: use after free of ceph context or something in it
Failed on the 8th try, in a similar way, although without logs.
The ObjectCacher looks like it's been destroyed al...
Josh Durgin
04:44 PM rbd Bug #4827: librbd: use after free of ceph context or something in it
Sage Weil
03:29 PM rbd Bug #4827: librbd: use after free of ceph context or something in it
The wip-rbd-close-image branch contains a potential fix. Running the test in a loop to see if it'll happen again. Josh Durgin
10:18 AM rbd Bug #4827: librbd: use after free of ceph context or something in it
It didn't reproduce with log_max_recent = 1, but without that setting it happened after just 3 tries.
Unfortunatel...
Josh Durgin
04:58 PM CephFS Bug #4853 (Resolved): ceph-fuse hang on mount getattr
commit:ee553ac279664b7f1b527a0b1b56768134cf5157 Sage Weil
12:43 PM CephFS Bug #4853: ceph-fuse hang on mount getattr
this is not a new race, and is only triggered when a mds session open and request race with an mds restart. not a cu... Sage Weil
10:47 AM CephFS Bug #4853 (Fix Under Review): ceph-fuse hang on mount getattr
fix in wip-up
here is the client-side log that shows we send the getattr twice. we only process the first reply, ...
Sage Weil
09:21 AM CephFS Bug #4853: ceph-fuse hang on mount getattr
Ignore that, wrong bug — sorry. Greg Farnum
09:20 AM CephFS Bug #4853: ceph-fuse hang on mount getattr
/a/teuthology-2013-04-28_21:32:40-fs-next-testing-basic/2662
That's an fsstress run that got hung, I copied the cl...
Greg Farnum
09:02 AM CephFS Bug #4853 (In Progress): ceph-fuse hang on mount getattr
Sage Weil
08:38 AM CephFS Bug #4853 (Resolved): ceph-fuse hang on mount getattr
100% reproducible with this job file... Sage Weil
04:51 PM Bug #4851 (Need More Info): leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil
01:09 PM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
The time skew is just because mon1 was way behind.. the message it received is in sequence with the other sent by mon... Sage Weil
01:01 PM Bug #4851 (In Progress): leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
Sage Weil
09:00 AM Bug #4851: leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
This might be Urgent, but somebody needs to evaluate it. Greg Farnum
06:01 AM Bug #4851 (Resolved): leveldb: hang on leveldb::DBImpl::MakeRoomForWrite(bool)
While testing with the next branch (50e58b9f49382d690f5a22af80f6981f1c12d4c3) I stumbled upon the problem that creati... Wido den Hollander
04:50 PM Revision f3b7db1a (ceph): upgrade: restructure rbd tests
- expand matrix
- include branch: bobtail in first set of tests so that we run the right
version of the test
Sage Weil
04:50 PM Revision 4f2df744 (ceph): rbd: dont' test python on bobtail
The workunit will pull the latest and fail Sage Weil
04:50 PM Revision a9188bfd (ceph): upgrade: fs: ignore 'wrongly marked down'
Sage Weil
04:27 PM Bug #4858: mon: doesn't necessarily call reset() during an election cycle
Sage says it's good! Greg Farnum
02:10 PM Bug #4858 (Fix Under Review): mon: doesn't necessarily call reset() during an election cycle
wip-4858-reset[-bobtail]. Will run through a suite once it's up on gitbuilder. Greg Farnum
01:36 PM Bug #4858: mon: doesn't necessarily call reset() during an election cycle
It's a bit more subtle than I'd initially described it. Greg Farnum
01:18 PM Bug #4858 (Resolved): mon: doesn't necessarily call reset() during an election cycle
We need to call Monitor::reset() at some point during an election in order to guarantee consistency. However, we don'... Greg Farnum
03:57 PM devops Bug #4862 (Resolved): ceph-deploy: install occassionally throws exceptions though installation is...
not often though, hit this with ceph-deploy installs,
this time on centos 6.3,
tamil@ubuntu:~/ceph-deploy-lates...
Tamilarasi muthamizhan
03:40 PM Bug #4837 (In Progress): mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Possibility not true. Worth a quick look even so.
Wido's crash logs didn't really have any new data, but they conf...
Greg Farnum
10:35 AM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Possibility to check later today: peons commit to disk when they receive a propose in a way that they return those va... Greg Farnum
02:16 AM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Mike Dawson wrote:
> Wido,
>
> This sounds quite consistent with the things I am seeing. The assert you saw to st...
Wido den Hollander
02:38 PM devops Bug #4756 (In Progress): mkcephfs doesn't set up same keys as ceph-deploy
John Wilkins
02:26 PM CephFS Bug #4861 (Rejected): Alter Java components to build against Java 1.6 (or 1.7)
The Java packages use -source 1.5 to specify that they should use that version of the API. This is being done for com... Anonymous
01:56 PM devops Bug #4859 (In Progress): ceph-deploy: install fails on RHEL 6.3
We need to configure the epel repository for rhel if it hasn'tbeen already. Anonymous
01:29 PM devops Bug #4859 (Resolved): ceph-deploy: install fails on RHEL 6.3
install fails on RHEL 6.3 with the followign error message,
tamil@ubuntu:~/ceph-deploy-latest/rhel/ceph-deploy$ ./...
Tamilarasi muthamizhan
01:22 PM Bug #4747 (Resolved): Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has n...
Resolving this because the actual bug is broader. Greg Farnum
01:20 PM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
Okay, this is actually #4858 — not calling reset() meant we weren't clearing out the paxos_recovered member, so the G... Greg Farnum
11:00 AM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
Hrm, the store for mon.c has the global versions, but for some reason the feature_set on disk hasn't been updated. Go... Greg Farnum
12:50 PM Bug #3945: osd: dynamically link to leveldb
Stefan: I opened http://gitbuilder.ceph.com and it was right there:
http://gitbuilder.ceph.com/leveldb-deb-x86_64...
Dan Mick
12:38 PM rbd Bug #4857 (Resolved): libceph: define snap context creation function
I created a function to encapsulate the creation of a snapshot
context, for use in rbd. In review, Josh said he tho...
Alex Elder
12:17 PM rbd Feature #4550: Create Qemu+RBD rpm package for RHEL+CentOS 6.3 on ceph.com
For CentOS, I am less concerned about package version but for RHEL6.3, we should use the same version of qemu that sh... Neil Levine
11:30 AM Bug #4675 (Resolved): mon: pg creations don't get queued on mon startup
merged the fix for the mon restart case. commit:6a5be251df0e14ec66fb868ff6a6ef6e08d539c6
there is likely still a ...
Sage Weil
11:16 AM Bug #4675 (Fix Under Review): mon: pg creations don't get queued on mon startup
pushed updated wip-mon-pg Sage Weil
11:18 AM Bug #4849: pg stuck peering
until we see this again Sage Weil
11:02 AM rbd Bug #4774: krbd: don't create /dev entries for backing devices
OK, finally getting to the point of this bug...
I just posted the following patches for review. The
last one act...
Alex Elder
11:00 AM Bug #4856 (Won't Fix): monitor: upgrades produce "client did not provide supported auth type" in log
This is most of the output in the monitor logs when Tamil is running upgrade tests. It apparently isn't inhibiting fu... Greg Farnum
10:47 AM Bug #4836 (Resolved): crush_ops failure
Samuel Just
10:02 AM Bug #4855 (Can't reproduce): peek map assert
From list:
Hey folks,
I'm helping put together a new test/experimental cluster, and hit this today when bringin...
Samuel Just
09:49 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
Is your cluster completely healthy? Gathering a single container's stats is not related to the container's size, so i... Yehuda Sadeh
09:25 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
OK, I actually ran a version that has all that fixed this time :)
Both the text/plain view and stats=false return ...
Faidon Liambotis
09:25 AM Linux kernel client Bug #4854 (Rejected): read more than they should
3.8 kernel module, mount params (read ahead = 0):... Andras Elso
09:21 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
/a/teuthology-2013-04-28_21:32:40-fs-next-testing-basic/2662
That's an fsstress run that got hung, I copied the cl...
Greg Farnum
08:22 AM Revision bf0b4306 (ceph): Fix a README typo
Signed-off-by: François Deppierraz <francois@ctrlaltdel.ch> Francois Deppierraz
04:15 AM Revision cea2ff86 (ceph): mon: Fix leak of context
Use Context::complete() to finish context, it frees the context
after executing Context::finish().
Signed-off-by: Ya...
Yan, Zheng
02:34 AM rgw Feature #2169: rgw: api to control bucket placement
Neil Levine

04/28/2013

10:11 PM Bug #4348: OSD slow request leads to RBD clients stalled/delayed
After upgrade to
ceph version 0.56.4 (63b0f854d1cef490624de5d6cf9039735c7de5ca)
it doesn't behave as before. Works ...
Ivan Kudryavtsev
10:01 PM Revision 20d99c4a (ceph): doc: Removed extra whitespace.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:01 PM Revision 041b0cf9 (ceph): doc: Added rbd-fuse to TOC.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:00 PM Revision 8f48a3d1 (ceph): Added commentary and removed fourth column for now.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:00 PM Revision 4e805a57 (ceph): doc: Removed. Redunant information now.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:59 PM Revision 66127852 (ceph): doc: Added openssh-server mention, corrections, hyperlink fix.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:59 PM Revision 21db055e (ceph): doc: Added openssh-server mention.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:58 PM Revision 9fa6ba79 (ceph): doc: Added manpage link and hidden TOC.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:58 PM Revision dd6e79aa (ceph): doc: Removed installed Chef. This is now in the ceph wiki.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:57 PM Revision 945dac65 (ceph): doc: Removed text for include directive. Wasn't behaving the way I'd ho...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:57 PM Revision 3d9bc469 (ceph): doc: Added ceph-mds to CephFS toc.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:46 PM Bug #4813: pgs stuck creating
ubuntu@teuthology:/a/teuthology-2013-04-27_20:54:49-rados-next-testing-basic/2087 Samuel Just
07:27 PM Revision 45df0b26 (ceph): workunit: use passed refspec rather than checking sha1 again
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Samuel Just
05:28 PM Revision de745dba (ceph): install.upgrade: apt-get install instead of upgrade
Upgrade does not actually upgrade in some cases; use install!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:26 PM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Wido,
This sounds quite consistent with the things I am seeing. The assert you saw to start this bug report is qui...
Mike Dawson
09:46 AM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
So, I'm not sure if this is related, but since I'm experiencing this with the 'next' branch I'm reporting it here for... Wido den Hollander
04:35 PM Revision 1e52fb9b (ceph): install: prefer 'branch' over 'sha1'
The upgrade tasks specify 'branch' in the job file, but the
schedule_suite.sh script sets a sha1 in the overrides. M...
Sage Weil
04:19 PM Revision 1e449d44 (ceph): nfs: debug mds
I've seen a run hang on rmdir on shutdown, and want to see why the MDS didn't
reply.
Sage Weil
04:18 PM Revision a71dd9a3 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
08:51 AM CephFS Bug #4850: ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
have full log.. put a copy in the run dir Sage Weil
08:50 AM CephFS Bug #4850 (Resolved): ceph-fuse: disconnected inode on shutdown with fsstress + mds thrashing
... Sage Weil
08:41 AM Bug #4849 (Resolved): pg stuck peering
... Sage Weil
08:30 AM Bug #4836: crush_ops failure
all of these commands need similar treatment, and i think we can structure it in a reasonably clean and generic way. ... Sage Weil
08:12 AM Feature #4846 (Resolved): builds scripts need to include raring
need to make sure release builds include raring! Sage Weil
07:00 AM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
About deadloop: opened #4845 Denis kaganovich
06:58 AM Bug #4845 (Resolved): mon (ms): deadloop and possible assert(sync_state == SYNC_STATE_CHUNKS)
This is more digged log about problem, described after closing #4811 (and not related to directly).
First I just n...
Denis kaganovich
05:28 AM Revision 44d13a76 (ceph): doc: Fix. ceph, not chef.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
03:55 AM Revision 8315a22c (ceph): upgrade: debug fs jobs
These are hanging; crank up logs to see why. Sage Weil
03:55 AM Revision f1eeec39 (ceph): upgrade: rgw: restructure collection
- use separate facets
- make final swift use client.1 so it doesn't
collide with a previous run
Sage Weil
03:55 AM Revision 17f34a70 (ceph): rgw asdf
Sage Weil
03:55 AM Revision 78823630 (ceph): upgrade: reorganize the basic/rados suite
Use facets instead of duplicating the test content each time. Sage Weil
03:55 AM Revision bc0b50f3 (ceph): upgrade: dbench instead of blogbench
blogbench hangs bobtail ceph-fuse in some
cases, it seems.
Sage Weil
12:59 AM Revision 5327d062 (ceph): ceph-filestore-dump: fix warnings on i386 build
tools/ceph-filestore-dump.cc: In member function ‘int header::get_header()’:
warning: tools/ceph-filestore-dump.cc:45...
Sage Weil

04/27/2013

12:42 PM rbd Bug #3871 (Fix Under Review): krbd: initial header read may be out of date
The following have been posted for review. They are available
in the "review/wip-rbd-cleanup-4" in the ceph-client ...
Alex Elder
08:09 AM rbd Bug #4774 (Fix Under Review): krbd: don't create /dev entries for backing devices
I'm making headway on this now. It mostly is taking the form of
cleaning up code as I walk through how things get s...
Alex Elder
08:04 AM rbd Bug #4833 (Fix Under Review): krbd: fix a bug in resizing a mapping
The following has been posted for review:
[PATCH] rbd: fix a bug in resizing a mapping
It was posted together w...
Alex Elder
03:55 AM Bug #3945: osd: dynamically link to leveldb
Can somebody tell me where the snappy .deb is? i can't find it for squeeze under the gitbuilder Stefan Priebe
01:12 AM Revision 3cc10645 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
12:19 AM Revision 1e6c390a (ceph): tools: add ceph_monstore_tool with getosdmap
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:19 AM Revision e725c3e2 (ceph): PaxosService: use get and put for version_t
Otherwise, we just duplicate the logic for generating the version
key names.
Signed-off-by: Samuel Just <sam.just@in...
Samuel Just
12:19 AM Revision 79280d9f (ceph): OSDMonitor: when adding bucket, delay response if pending map has name
Fixes: #4836
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:12 AM Revision 5744afeb (ceph): upgrade: do not start second radosgw
Use the first one. This verifies bobtail radosgw works against cuttlefish
osds.
Sage Weil
12:04 AM Revision f08c3a50 (ceph): upgrade: mount fs with ceph-fuse for fs tests
Sage Weil

04/26/2013

11:57 PM Revision ab353c71 (ceph): upgrade: run blogbench against ceph-fuse
Otherwise this runs on the local disk, not touching the ceph cluster. Sage Weil
11:52 PM Revision 928e241a (ceph): upgrade: run rados python test on bobtail to avoid polluting cluster wi...
Extra pools from test.sh will make this fail:
2013-04-26T11:06:45.631 INFO:teuthology.task.workunit.client.0.err:tes...
Sage Weil
11:05 PM Revision 50e58b9f (ceph): ceph.spec.in: remove conditional checks on tcmalloc
tcmalloc is available on all supported platforms now.
Signed-off-by: Gary Lowell <gary.lowell@inktank.com>
Gary Lowell
11:05 PM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
I've added the logs from mon2 and mon3.
What I did notice, that now mon1 crashed without anything in the logs. mon...
Wido den Hollander
04:18 PM Bug #4837 (Need More Info): mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
I can't do any more without more logs, unfortunately. :(
In order to increase our odds of getting useful logs, I'v...
Greg Farnum
02:23 PM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, so mon1, id 0, is leader. Then, suddenly, he's probing and goes into syncing. There's no logging here which is ... Greg Farnum
01:41 PM Bug #4837: mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, I see part of what's happening here. The sync infrastructure includes a separate forwarding mechanism, and that... Greg Farnum
01:06 PM Bug #4837 (In Progress): mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Well this is different — the monitor is addressing sync requests to itself! Greg Farnum
12:50 PM Bug #4837 (Resolved): mon: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
I just upgraded 3 monitors from 0.56.4 to 0.60 (next branch) and saw a monitor crash when I ran:
$ ceph osd unset ...
Wido den Hollander
11:04 PM Revision 5c1782a5 (ceph): debian/rules: Fix tcmalloc breakage
Since all currently supported platforms have tcmalloc
available and it is now the default, remove broken check code
t...
Gary Lowell
11:04 PM Revision 6d348a1e (ceph): mon: cache osd epochs
The monitor may get a series of messages from the OSD that prompt it to
send incremental maps (pg_temp updates, failu...
Sage Weil
10:37 PM Revision 1a6b87ea (ceph): ceph.spec.in: put ceph-disk-* et al in correct sbindir
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:29 PM Revision 86337936 (ceph): debian: fix ceph.install
This got out of sync somewhere in cherry-picking all of these patches.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:27 PM Revision 0650fa95 (ceph): monitor: assert out early if we get our own sync_start back
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
10:24 PM Revision 1e6f02b3 (ceph): mon: update assert for looser requirements
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Greg Farnum
09:07 PM Revision ba13173b (ceph): doc: Deleted old index. Generates warnings otherwise.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:07 PM Revision 9a7a0753 (ceph): doc: General purpose pre-flight checklist.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:07 PM Revision 9e775f15 (ceph): doc: Modified Ceph deployment landing page.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:06 PM Revision fb8119ce (ceph): doc: Added general pre-flight checklist for ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:06 PM Revision 3433aa8f (ceph): doc: Removed old ceph-deploy placeholder.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:05 PM Revision 9c0c4c17 (ceph): doc: Removed Chef section. Now appears in new Ceph wiki.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:05 PM Revision c25144e8 (ceph): doc: Added Key Management for ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:04 PM Revision d0d1554a (ceph): doc: Added "Add/Remove Monitors" section for ceph-deploy.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:04 PM Revision f24dbdef (ceph): doc: Added Create a Cluster section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:03 PM Revision b631cc67 (ceph): doc: Added ceph-deploy package management (install | uninstall ) section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:02 PM Revision d85c6904 (ceph): doc: Added new quick start preamble and index.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:01 PM Revision 3ff7eef9 (ceph): doc: Added ceph-deploy preflight.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:01 PM Revision 93656740 (ceph): doc: Added ceph-deploy quick start.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:40 PM Revision 7406981a (ceph): ceph-disk list: say 'unknown cluster $UUID' when cluster is unknown
This makes it clearer that an old osd is in fact old.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked fro...
Sage Weil
08:40 PM Revision 9419dca6 (ceph): ceph-disk: add missing space after comma
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 0080d1df7c7950e051840a543fc4bdabe6c...
Danny Al-Gaaf
08:40 PM Revision 14a348dc (ceph): ceph-disk: fix Redefining name 'uuid' from outer scope
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 058eb923c5b7dab611901fdd1724ce2a7c1...
Danny Al-Gaaf
08:40 PM Revision 7326ea63 (ceph): ceph-disk: define exception type
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 4c6d6442a89adc5b56e99cb4d2ed572f2ad...
Danny Al-Gaaf
08:40 PM Revision 0e47d312 (ceph): ceph-disk: merge twice defined function is_mounted(dev)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit eaf31bf9f90ba9709a57a6870dbafa21142...
Danny Al-Gaaf
08:40 PM Revision ee452ebe (ceph): ceph-disk: fix naming of local variable in is_mounted()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 3dd8b461219e64bb0f7a210dba5a9ab7c64...
Danny Al-Gaaf
08:40 PM Revision 1b86b1c7 (ceph): ceph-disk: fix some (local) variable names
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit c4eb7e6ddd593cd45ab8343da01355be738...
Danny Al-Gaaf
08:40 PM Revision c71fb8d5 (ceph): ceph-disk: CalledProcessError has no output keyword on 2.6
Signed-off-by: Gary Lowell <gary.lowell@inktank.com>
(cherry picked from commit a793853850ee135de14b9237f7023cadcdb8...
Gary Lowell
08:40 PM Revision 0b42b1ed (ceph): Makefile.am: install ceph-* python scripts to /usr/bin directly
Install ceph-* scripts directly to $(prefix)$(sbindir) (which
normaly would be /usr/sbin) instead of moving it around...
Danny Al-Gaaf
08:40 PM Revision bd8bb984 (ceph): ceph-disk: print subprocess.CalledProcessError on error
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 543327b1f2a9efe8083bb196433c4bcf838...
Danny Al-Gaaf
08:40 PM Revision d26a0342 (ceph): ceph-disk: add some more docstrings
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 86e55f5448c4b5b46b74d2d89b01d1e64b1...
Danny Al-Gaaf
08:40 PM Revision 63eb8507 (ceph): ceph-disk: rename some constants to upper case variable names
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 8a999ded088e688fd3f4a7c27127b7c06f0...
Danny Al-Gaaf
08:40 PM Revision ecb34b81 (ceph): ceph-disk: fix naming of a local variable in find_cluster_by_uuid
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 0b5fcfffe6d2f69bd4318cc93ef73195d94...
Danny Al-Gaaf
08:40 PM Revision d714049d (ceph): ceph-disk: rename some local variabels in list_*partitions
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit b7d7e6894c550a7afa8dfb5bfa2bc54b5d3...
Danny Al-Gaaf
08:40 PM Revision 153994cd (ceph): ceph-disk: ignore udevadm settle return code
If we time out, just continue and let the next step fail.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked...
Sage Weil
08:40 PM Revision 0c8efc06 (ceph): ceph-disk: conditionally remove mount path
umount removes it on success; only remove it here if it is still there.
Signed-off-by: Sage Weil <sage@inktank.com>
...
Sage Weil
08:40 PM Revision 9da81e4e (ceph): ceph-disk: reimplement is_partition
Previously we were assuming any device that ended in a digit was a
partition, but this is not at all correct (e.g., /...
Sage Weil
08:40 PM Revision bf3f8702 (ceph): ceph-disk: reimplement list_all_partitions
Use /dev/disk/by-id to list disks and their partitions. This is more
accurate and correct than the previous (as-yet ...
Sage Weil
08:40 PM Revision 24d729c5 (ceph): ceph-disk: implement 'list'
This is based on Sandon's initial patch, but much-modified.
Mounts ceph data volumes temporarily to see what is insi...
Sage Weil
08:40 PM Revision 0182973b (ceph): ceph-disk: handle missing journal_uuid field gracefully
Only lower if we know it's not None.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 562e1716...
Sage Weil
08:40 PM Revision b9f86d96 (ceph): fix: Redefining name 'uuid' from outer scope (line 14)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit d3c60dc8cad1db1d5df1c740bc805aaf9ba...
Danny Al-Gaaf
08:40 PM Revision 01152115 (ceph): ceph-disk: add missing space after >> operator
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 0ada43f79d2b1f9f84367e558c6d1a3e90e...
Danny Al-Gaaf
08:40 PM Revision 9464284f (ceph): ceph-disk: fix except to catch OSError
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 9daf6cfce2d57509d896eae28bb97146a68...
Danny Al-Gaaf
08:40 PM Revision ffe024b8 (ceph): ceph-disk: remove unused variable key from prepare_journal_dev()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 97b4f8d66bef2328fa53f9e508eb38f8b8d...
Danny Al-Gaaf
08:40 PM Revision 329f279c (ceph): ceph-disk: there is no os.path.lstat use os.lstat
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 6d3247b5c02c39a66666a5833106dbc2304...
Danny Al-Gaaf
08:40 PM Revision 690ab6b3 (ceph): ceph-disk: fix adjust_symlink() replace 'canonical' with 'path'
Replace 'canonical' variable with 'path' since canonical doesn't
exist in this function.
Signed-off-by: Danny Al-Gaa...
Danny Al-Gaaf
08:40 PM Revision 1ffc89af (ceph): ceph-disk: fix adjust_symlink() replace 'journal' with 'target'
Replace 'journal' variable with 'target' since journal doesn't
exist in this function.
Signed-off-by: Danny Al-Gaaf ...
Danny Al-Gaaf
08:40 PM Revision e92baf50 (ceph): ceph-disk: cast output of subprocess.Popen() to str()
Cast output of subprocess.Popen() to str() to be able to use
str.split() and str.splitlines() without warnings from p...
Danny Al-Gaaf
08:40 PM Revision 02d48351 (ceph): ceph-disk: re-add python 2.7 dependency comment
FIXME!
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 6d63752c8fde91cdab306d1ca689690b269fe977)
Sage Weil
08:40 PM Revision 0113e533 (ceph): ceph-disk: udevadm settle before partprobe
After changing the partition table, allow the udev event to be
processed before calling partprobe. This helps preven...
Gary Lowell
08:40 PM Revision 970348fc (ceph): ceph-disk: fix indention
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 2d26bcc07162a5176cdbc1748b829e3f396...
Danny Al-Gaaf
08:40 PM Revision b4176baf (ceph): ceph-disk: consolidate ceph-disk-* into a single binary
ceph-disk prepare ...
ceph-disk activate ...
ceph-disk ...
This let's us share code (we were already duplicating a...
Sage Weil
08:40 PM Revision 3cbc0d0c (ceph): ceph-disk: consolidate exceptions
Use a single exception type, and catch it at the top level.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry pick...
Sage Weil
08:40 PM Revision 8901e02d (ceph): ceph-disk: simplify command dispatch
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit f287c6f90af0dfdd41358846b069aa3c54b600b3)
Sage Weil
08:40 PM Revision b807d8ba (ceph): ceph-disk: install and package
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit c2602d749023b24ac22d8cfce6e04889078f14d8)
Con...
Sage Weil
08:40 PM Revision 9c46dfb2 (ceph): ceph-disk: rename local variable shadowing builtin
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 57dde5c8b18ff4ccd53a30bb94119c0ffce...
Danny Al-Gaaf
08:40 PM Revision 0da87db1 (ceph): ceph-disk: remove double defined function get_conf
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit c57daa3c6e03e8974e133d3a2d9bc3d6f06...
Danny Al-Gaaf
08:40 PM Revision 8dd8cbac (ceph): ceph-disk: remove twice defined function mount
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit ea26ea0d81a23aa76076ad5441c3b1aadfb...
Danny Al-Gaaf
08:40 PM Revision bd1036dd (ceph): ceph-disk: remove twice defined identical function unmount
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 4adf088107586be7b356d1e963570cdab23...
Danny Al-Gaaf
08:40 PM Revision 3ec61f85 (ceph): ceph-disk: rename local variable shadowing builtin
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 9bcf5b64f45ab6c4bdedf820ed111319b2d...
Danny Al-Gaaf
08:40 PM Revision 0b4e85fe (ceph): ceph-disk: fix /dev/dm-[0-9] handling list_all_partitions()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 3af7a1ac5bf24bf832d7180002281d6b585...
Danny Al-Gaaf
08:40 PM Revision 6fa6cd85 (ceph): ceph-disk: remove unused variables from list_partitions()
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
(cherry picked from commit 6a8120d4b0c4cfa851d473532eb2366534f...
Danny Al-Gaaf
08:40 PM Revision ea07b0e1 (ceph): ceph-disk-prepare: use os.path.realpath()
My janky symlink resolution is broken in various ways.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked fr...
Sage Weil
08:40 PM Revision d05b4391 (ceph): ceph-disk-prepare: clean up stupid check for a digit
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit f03f62697f170d42b4b62c53d2860ff2f24a2d73)
Sage Weil
08:40 PM Revision e4a52002 (ceph): ceph-disk-prepare: verify device is not mounted before using
Make sure the data and/or journal device(s) are not in use (mounted)
before using them. Make room for additional "in...
Sage Weil
08:40 PM Revision 5ad4120a (ceph): ceph-disk-prepare: verify device is not in use by device-mapper
Be nice and tell the user which devices/mappings are consuming the device,
too.
Signed-off-by: Sage Weil <sage@inkta...
Sage Weil
08:40 PM Revision 35eac085 (ceph): ceph-disk-prepare: move in-use checks to the top, before zap
Move the in-use checks to the very top, before we (say) zap!
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry pic...
Sage Weil
08:40 PM Revision 897413f9 (ceph): ceph-disk-activate: don't override default or configured osd journal path
There is no reason not to rely on the default or obey any configured
value here.
Fixes: #4031
Signed-off-by: Sage We...
Sage Weil
08:40 PM Revision 739b013c (ceph): ceph-disk-activate: rely on default/configured keyring path
No reason to override the default or configured value here.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry pick...
Sage Weil
08:40 PM Revision 7c1edc0c (ceph): Revert "ceph-disk-activate: don't override default or configured osd jo...
This reverts commit 813e9fe2b4291a1c1922ef78f031daa9b78fe53b.
We run --mkfs with the osd disk mounted in a temporary...
Sage Weil
08:40 PM Revision a6ecf928 (ceph): Revert "ceph-disk-activate: rely on default/configured keyring path"
This reverts commit 936b8f20af1d390976097c427b6e92da4b39b218.
This is necessary because we mount the osd in a tempor...
Sage Weil
08:40 PM Revision 568485be (ceph): ceph-disk-activate: abort if target position is already mounted
If the target position is already a mount point, fail to move our mount
over to it. This usually indicates that a di...
Sage Weil
08:40 PM Revision 19a2cf58 (ceph): ceph-disk-activate: identify cluster .conf by fsid
Determine what cluster the disk belongs to by checking the fsid defined
in /etc/ceph/*.conf. Previously we hard-code...
Sage Weil
08:40 PM Revision 455cb325 (ceph): ceph-disk-prepare: 'mkfs -t' instead of 'mkfs --type='
Older mkfs (el6) doesn't like --type=.
Fixes: #4495
Reported-by: Alexandre Maragone <alexandre.maragone@inktank.com>...
Sage Weil
08:40 PM Revision caad1874 (ceph): ceph-disk-prepare: do partprobe after setting final partition type
This is necessary to kick udev into processing the updated partition and
running its rules.
Signed-off-by: Sage Weil...
Sage Weil
08:40 PM Revision 34fba357 (ceph): ceph-disk-activate: use full paths for everything
We are run from udev, which doesn't get a decent PATH.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked fr...
Sage Weil
08:40 PM Revision d7084037 (ceph): ceph-disk-activate: pull mount options from ceph.conf
Signed-off-by: Alexandre Marangone <alexandre.marangone@inktank.com>
(cherry picked from commit e7040f55f01db3de7d5ce...
Alexandre Marangone
08:40 PM Revision 5c5021b4 (ceph): ceph-disk-prepare: add initial support for dm-crypt
Keep keys in /etc/ceph/dmcrypt-keys.
Identify partition instances by the partition UUID. Identify encrypted
partiti...
Sage Weil
08:40 PM Revision 28d11938 (ceph): udev: trigger on dmcrypted osd partitions
Automatically map encrypted journal partitions.
For encrypted OSD partitions, map them, wait for the mapped device t...
Sage Weil
08:40 PM Revision 632be442 (ceph): ceph-disk-prepare: always force mkfs.xfs
Signed-off-by: Alexandre Marangone <alexandre.marangone@inktank.com>
(cherry picked from commit d950d83250db3a179c4b6...
Alexandre Marangone
08:40 PM Revision 405e0ea1 (ceph): debian: fix start of ceph-all
Tolerate failure, and do ceph-all, not ceph-osd-all.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from...
Sage Weil
08:40 PM Revision d1775daf (ceph): ceph-disk-prepare: -f for mkfs.xfs only
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit fecc3c3abf1176f4c7938e161559ea2db59f1cff)
Sage Weil
08:40 PM Revision abdac6fd (ceph): Fix: use absolute path with udev
Avoids the following: udevd[61613]: failed to execute '/lib/udev/bash'
'bash -c 'while [ ! -e /dev/mapper/....
Signe...
Alexandre Marangone
08:40 PM Revision 3441acf3 (ceph): debian: require cryptsetup-bin
This is needed for ceph-disk-prepare's dmcrypt support.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked f...
Sage Weil
08:40 PM Revision 8f7e3e7d (ceph): ceph.spec.in: add new Requires from ceph-disk-prepare
Added new Requires from ceph-disk-prepare: cryptsetup, gptfdisk,
parted and util-linux.
Signed-off-by: Danny Al-Gaaf...
Danny Al-Gaaf
08:40 PM Revision 181ebdee (ceph): debian: put ceph-mds upstart conf in ceph-mds package
Fixes: #3157
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 23ad3a46a0099e263f43e0f0c1df1d21c...
Sage Weil
08:40 PM Revision fa23919e (ceph): ceph-disk-activate: factor mounting out of activate
The activate stuff is generic for any OSD, regardless of whether we want
to mount it or not. Pull that part out.
Si...
Sage Weil
08:40 PM Revision e6d5aa05 (ceph): ceph-disk-activate: add --mark-init INITSYSTEM option
Do not assume we will manage via upstart; let that be passed down via the
command line.
Signed-off-by: Sage Weil <sa...
Sage Weil
08:40 PM Revision aa428017 (ceph): ceph-disk-activate: detect whether PATH is mount or dir
remove in-the-way symlinks in /var/lib/ceph/osd
This is simpler. Just detect what the path is and Do The Right Thin...
Sage Weil
08:40 PM Revision 5e0892fd (ceph): ceph-disk-prepare: refactor to support DIR, DISK, or PARTITION for data...
Lots of code reorganization collapsed into a single commit here.
- detect whether the user gave us a directory, disk...
Sage Weil
08:40 PM Revision 494533a5 (ceph): upstart/ceph-hotplug: tell activate to start via upstart
This will mark the OSD data dir as upstart-managed.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from ...
Sage Weil
08:40 PM Revision 9ea32e5f (ceph): upstart: ceph-hotplug -> ceph-osd-activate
This is a more meaningful name.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit e011ad128e7f3...
Sage Weil
08:40 PM Revision 74b56270 (ceph): ceph-disk-activate: specify full path for blkid, initctl, service
/sbin apparently isn't in the path when udev runs us.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked fro...
Sage Weil
08:40 PM Revision ffb0613e (ceph): ceph-disk-activate: auto detect init system
Look for an option 'init' in ceph.conf. Otherwise, check if we're ubuntu.
If so, use upstart. Otherwise, use sysvin...
Sage Weil
08:40 PM Revision 8b771bf9 (ceph): udev: trigger ceph-disk-activate directly from udev
There is no need to depend on upstart for this.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from comm...
Sage Weil
08:40 PM Revision 656305f6 (ceph): ceph-disk-activate: catch daemon start errors
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 690ae05309db118fb3fe390a48df33355fd068a0)
Sage Weil
08:40 PM Revision e1624e46 (ceph): debian: start/stop ceph-all event on install/uninstall
This helps us avoid the confusing situation with upstart where an individual
daemon job is running (like ceph-osd id=...
Sage Weil
08:40 PM Revision 8c4c53ab (ceph): ceph-disk-prepare: align mkfs, mount config options with mkcephfs
'osd mkfs ...', not 'osd fs mkfs ...'. Sigh. Support both.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry pic...
Sage Weil
08:40 PM Revision 05efb7ab (ceph): init-ceph: consider sysvinit-tagged dirs as local
If there is a 'sysvinit' file in the daemon directory in the default
location (/var/lib/ceph/$type/ceph-$id), conside...
Sage Weil
08:40 PM Revision 39df4c81 (ceph): init-ceph: iterate/locate local sysvinit-tagged directories
Search /var/lib/ceph/$type/ceph-$id and start/stop those daemons if
present and tagged with the sysvinit file.
Signe...
Sage Weil
08:40 PM Revision f43c339d (ceph): upstart/ceph-hotplug: drop -- in ceph-disk-activate args
We would like to transition to
ceph-disk-activate --mount DEV
and away from a generic multi-definition PATH argume...
Sage Weil
08:40 PM Revision f97f49b1 (ceph): ceph-create-keys: create mds bootstrap key
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 809143f16c70483ba5bb429dea812d31b67f2b49)
Sage Weil
08:40 PM Revision 919b0aed (ceph): debian: include /var/lib/ceph/bootstrap-mds in package
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit e80675a0f333c04452d4822fd0eb3c6e92eda3df)
Sage Weil
08:36 PM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
Oh, I actually see a couple of fixes that match this description in subsequent commits that haven't reached gitbuilde... Faidon Liambotis
08:33 PM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
So, I tried master, 3cc106453f79a0a0c332b164e282a35234a85659 with
curl -D - -H "X-Auth-Token: ..." 'http://localhost...
Faidon Liambotis
08:31 PM Revision e0c39c1e (ceph): Merge branch 'wip-4822' into next
Reviewed-by: Sam Just <sam.just@inktank.com> David Zafman
07:42 PM Revision 2211b1d7 (ceph): Fix improperly spaced line.
Warren Usui
07:37 PM Revision ebbdef29 (ceph): monitor: squash signed/unsigned comparison warning
This is a safe range to do comparisons against, and we compare
against the signed rank inside the loop.
Signed-off-b...
Greg Farnum
07:33 PM Revision 56ac098b (ceph): Merge branch 'wip-4760' into next
Yehuda Sadeh
07:32 PM Revision 5fa3cbf5 (ceph): mon: use brute force to find a sync provider if our first one fails
We try and select a random monitor first, but if that fails we should
make sure that nobody's available before assert...
Greg Farnum
07:24 PM Revision a92b4c75 (ceph): Merge branch 'wip-mon-fwd' into next
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
07:10 PM Revision 1670a2bf (ceph): rgw: trivial cleanups post code review
Following code review of #4760.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:10 PM Revision 7144ae86 (ceph): rgw: fix bucket count when stating account
We need to add up the num of buckets and not just set it
as we don't read the entire list of buckets in one operation...
Yehuda Sadeh
07:10 PM Revision 960eac26 (ceph): rgw: fix plain formatter flush
The plain formatter flush needs to append eol if needed, and
not to clear the sections stack.
Signed-off-by: Yehuda ...
Yehuda Sadeh
07:10 PM Revision 2264078a (ceph): rgw: swift list containers can return 204
In order to keep compatibility with swift, if a plain formatter
is being used, we should return 204 when there are no...
Yehuda Sadeh
07:10 PM Revision f2df8762 (ceph): rgw: fix bucket listing when reaching limit
Bucket listing was broken when limit was set.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:10 PM Revision a8b1bfa1 (ceph): rgw: fix list buckets limit
There was an issue when limit was being set, we didn't
break from the iterating loop if limit was reached. Also,
S3 d...
Yehuda Sadeh
07:10 PM Revision c880e957 (ceph): rgw: fix compilation for certain architectures
Casting.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
07:06 PM Revision e4c13afa (ceph): Merge branch 'next'
Get fix for raring builds Dan Mick
07:05 PM Revision 98f532e8 (ceph): Makefile.am: Add -lpthread to fix build on newer ld in Raring Ringtail
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Dan Mick
06:25 PM Revision f21dcdc9 (ceph): ceph config data goes in conf, not config
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
06:25 PM Revision df4105b6 (ceph): ceph config data goes in conf, not config
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
06:12 PM Revision 741f4685 (ceph): mon: fix Monitor::pick_random_mon()
The other arg isn't used, so remove the (broken) handling for that case.
If we re-add it later, model after the MonCl...
Sage Weil
06:10 PM Bug #4815: mon: leveldb grows quickly and without bound
other mons are 36GB, so it's not done yet. but stuck. Sage Weil
06:05 PM Bug #4815: mon: leveldb grows quickly and without bound
the mon.a is getting stuck in leveldb:... Sage Weil
05:29 PM Bug #4815: mon: leveldb grows quickly and without bound
New logs have been uploaded to cephdrop as "mikedawson/ceph-mon.*.log". They show starting up the three monitors. mon... Mike Dawson
04:23 PM Bug #4815: mon: leveldb grows quickly and without bound
can you reproduce with the latest next, capture the mon.a log, and also attach to the process after it stops making p... Sage Weil
05:48 PM Revision cbc3b91c (ceph): mon: mark PaxosServiceMessage forward fields deprecated
These are no longer used; we manage forward state explicitly via the
Monitor sessions instead. Mark them deprecated ...
Sage Weil
05:48 PM Revision 77c068d1 (ceph): mon: fix double-forwarding check
The PaxosServiceMessage fields are no longer filled in. Use Session::proxy_con
instead.
Signed-off-by: Sage Weil <s...
Sage Weil
05:47 PM devops Feature #4766: ceph-deploy: commands should continue to execute the next argument in case of fail...
ceph-deploy commands [new, mon create, osd create,...] exit when any given argument fails. it is either in the beginn... Tamilarasi muthamizhan
05:27 PM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
Greg, checked that and now, hitting this on only one monitor [mon.c on burnupi45].
leaving the test machines burnu...
Tamilarasi muthamizhan
04:40 PM Bug #4747 (In Progress): Upgrade monitors from argonaut->bobtail->next fails w/"Existing store ha...
Looked at this briefly and am having Tamil check it again. From the logs it appears the monitors never formed a quoru... Greg Farnum
02:02 PM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
upgraded the osds and mds as well. but the monitors are stuck up. one of the monitors seems to be up.
ubuntu@burnu...
Tamilarasi muthamizhan
02:01 PM Bug #4747 (New): Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not be...
I am not sure, why this was marked "cant reproduce" but am hitting this on my local cluster [burnupi39, burnupi45]
...
Tamilarasi muthamizhan
05:24 PM Revision e3b602ad (ceph): osd: Fix logic in OSDMap::containing_subtree_is_down()
Check for up OSDs as we walk up the crushmap hierarchy
fixes: #4822
Signed-off-by: David Zafman <david.zafman@inkta...
David Zafman
05:19 PM Revision a2a23ccd (ceph): debian/rules: use multiline search to look for Build-Depends
When Build-Depends was split into multiple lines (in commit
8f5c665744e58d6d51a1e86de55c1399f51cc1c3), the grep for
l...
Dan Mick
05:12 PM Revision f768fbba (ceph): client: re-fix cap releases
Encode cap releases if NOT replay. <facepalm> Thanks, Greg!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:51 PM Revision 5121e56c (ceph): client: don't embed cap releases in clientreplay
If the client is sending replay requests, avoid sending embedded caps,
since the mds already has the client's caps fr...
Sam Lang
04:33 PM rbd Bug #4446: librbd: crash from opensolaris vm
I tried booting in several configurations, and couldn't get it to fail. I used ceph 0.56.4, and qemu 1.0 for ubuntu 1... Josh Durgin
01:55 PM rbd Bug #4446: librbd: crash from opensolaris vm
As an ex-Sun employee, I can point out that this is an *ancient* version of S10; there've been many many updates sinc... Dan Mick
04:20 PM devops Bug #4823 (Resolved): ceph-deploy: install not implemented for RHEL 6.3
Resolved with the following commit:
commit c32a80a20ad2e29bf05bb67a244bbc995a31a606
Author: Gary Lowell <glowell@...
Anonymous
04:16 PM Bug #4810: mon: forwarded messages have weird priorities
We've discussed this and are not sure if we want to change the way prioritization works or not. The observable sympto... Greg Farnum
04:12 PM Bug #4810: mon: forwarded messages have weird priorities
wip-mon-fwd Sage Weil
03:44 PM rbd Bug #4827: librbd: use after free of ceph context or something in it
Segfaults with different backtraces occurred with and without caching enabled. Unfortunately the first core file is c... Josh Durgin
09:21 AM rbd Bug #4827 (Resolved): librbd: use after free of ceph context or something in it
From teuthology:/a/teuthology-2013-04-26_02:29:00-rbd-next-testing-basic/1393/teuthology.log:... Josh Durgin
03:22 PM devops Feature #3255: ceph-disk: allow prepare without activate (for spares)
Sage Weil
03:13 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Not single commit and not THIS bug commit. I got sure stuck (IMHO 100% last 3 of 3, not second) mon, need "kill" twic... Denis kaganovich
11:10 AM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
That commit doesn't touch the monitor code, and I don't believe those osd types are used in the monitor either. What ... Greg Farnum
08:42 AM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Post-problems IMHO solved... Upgrading gcc to 4.8.0 & -fno-aggressive-loop-optimizations (but this is black magic and... Denis kaganovich
02:34 PM Tasks #4844 (Resolved): blueprint: stats infrastructure (collectd, statsd, graphite, ...)
Sage Weil
02:33 PM Tasks #4843: blueprint: crush library, language extensions
2 racks, 2 osds on first rack, 1 on second
hosts have 2 ssds 4 spinning, want 3 replicas split by host primary on ss...
Samuel Just
02:32 PM Tasks #4843 (Resolved): blueprint: crush library, language extensions
Sage Weil
02:30 PM Tasks #4842 (Rejected): blueprint: erasure coded pg infrastructure
Sage Weil
02:29 PM Tasks #4841 (Resolved): blueprint: rados namespaces
Sage Weil
02:24 PM Feature #4214 (Duplicate): osd: optionally tolerate and repair EIO on deep scrub reads
Samuel Just
02:23 PM Fix #4840 (Resolved): mon: transition from old-style allow command to new command descriptions
Sage Weil
02:20 PM Feature #4107 (Duplicate): Usage quota for rados pools
Sage Weil
02:12 PM Cleanup #4828: dan: don't respond to e-mail via your phone in the bathroom
Sage Weil
09:45 AM Cleanup #4828 (Rejected): dan: don't respond to e-mail via your phone in the bathroom
I had an e-mail exchange with Dan this morning about some
problems with gitbuilder that Mark Nelson reported to me.
...
Alex Elder
02:09 PM Feature #4839 (Resolved): api: make new CLI send old version of commands to old monitors during u...
Ian Colle
02:06 PM Feature #4455 (In Progress): api: move '--format' into just another command argument
Dan Mick
02:06 PM rbd Feature #4838 (New): rbd-fuse: use the low level fuse interface
The low level interface will let us parse custom options (i.e. standard ceph ones). Josh Durgin
02:05 PM Bug #4822 (Resolved): After 5 minutes a down OSD is NOT marked out
e3b602adf7527101e4fd198263c8f7c1d4b5d194 David Zafman
01:08 PM rgw Feature #4745 (Fix Under Review): rgw: radosgw-admin command to stat object
Ian Colle
01:08 PM rgw Feature #4573 (Resolved): Create User Quota Blueprint
Ian Colle
01:07 PM rgw Feature #4312 (Fix Under Review): rgw: multisite: log metadata changes
Sage Weil
01:06 PM rgw Feature #3274 (Resolved): rgw: RESTful admin api for user admin
Sage Weil
01:06 PM rgw Feature #4464 (Resolved): rgw: bucket commands and RESTful API
Ian Colle
12:50 PM Bug #4836 (Resolved): crush_ops failure
2013-04-26T02:37:53.631 INFO:teuthology.task.mon_thrash.ceph_manager:quorum is size 2
2013-04-26T02:37:53.632 DEBUG:...
Samuel Just
12:47 PM Bug #4812 (Resolved): mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor to c...
Merged into next in commit:5fa3cbf520f5aeb9e0101c1263f681542d3069a5
Created #4835 to track the other issues I raised.
Greg Farnum
12:47 PM RADOS Feature #4835 (Resolved): Monitor: better handle aborted synchronizations
See #4812. That should not be an assert (graceful shutdowns!), and in that specific case we don't actually want to ex... Greg Farnum
12:34 PM Bug #4824 (Resolved): msgr: crash in submit_message
commit:a92b4c7558d936591ca9d7320042b54a68b2962b Sage Weil
10:34 AM Bug #4824 (In Progress): msgr: crash in submit_message
Sage Weil
12:28 PM rgw Bug #4826 (Resolved): rgw: plain formatter does not flush correctly
Fixed, commit:960eac26004849d6e2fa61cfab6482e9db667c52. Yehuda Sadeh
09:32 AM rgw Bug #4826 (In Progress): rgw: plain formatter does not flush correctly
Ian Colle
12:03 PM rbd Feature #4231: librbd: Java bindings
So I already started work on 'rados-java': https://github.com/wido/rados-java
I'm thinking about combining this to...
Wido den Hollander
11:49 AM rbd Feature #4231: librbd: Java bindings
Possible good task for Joe and/or Noah? Ian Colle
11:52 AM rbd Feature #4834: Recompile/package qemu with new version of librbd to enable asynchronous flushing ...
Gary and Josh to work together on this. Ian Colle
11:51 AM rbd Feature #4834 (Resolved): Recompile/package qemu with new version of librbd to enable asynchronou...
Ian Colle
11:39 AM rbd Bug #4833 (Resolved): krbd: fix a bug in resizing a mapping
When a snapshot context update occurs, rbd_update_mapping_size() is
called to set the capacity of the disk to record...
Alex Elder
11:38 AM rbd Feature #2557: QEMU support for image locking
Need a blueprint to assist in architectural planning before we can estimate or plan this effort. Ian Colle
11:36 AM rbd Feature #4454: openstack: support volume migration in Cinder
Initially copy from one back end to the other. Instead of using volume migrations blueprint, just use backup? Getting... Ian Colle
11:19 AM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
/a/teuthology-2013-04-26_02:29:14-fs-next-testing-basic/1450 Greg Farnum
11:17 AM CephFS Bug #4832 (Resolved): mds: failed auth_unpin assert
... Greg Farnum
11:15 AM Bug #4821 (Resolved): monitor: actually setting an exclusive in _pick_random_mon would break things
oh i see, i was looking at the MonClient version (that this was probably modeled after).
commit:741f46852380c8e756...
Sage Weil
11:08 AM Bug #4821: monitor: actually setting an exclusive in _pick_random_mon would break things
max is monmap->size(); there's no modification if other is specified. I guess it would work if we decremented max, an... Greg Farnum
09:34 AM Bug #4821: monitor: actually setting an exclusive in _pick_random_mon would break things
no i think it's right. if o is set, max is num_mon-1, and we shift o or greater one to the right to still get a unif... Sage Weil
10:44 AM CephFS Bug #4829 (Closed): client: handling part of MClientForward incorrectly?
(In reference to a backwards check for is_replay when doing encode_cap_releases())... Greg Farnum
09:52 AM CephFS Bug #4742 (Resolved): mds: stuck clientreplay request
commit:5121e56c255c079569f02e0ee852e469f38f470e Sage Weil
08:07 AM rbd Feature #4013: rbd: openstack: extend nova boot api to support going from image to volume
Ian Colle
08:06 AM rbd Feature #4017: rbd: openstack: simplify volume booting with new api
Ian Colle
07:59 AM rbd Bug #4803: krbd: memory leaks while testing layered images
The following additional set of patches has been posted for
review. They're available in the "review/wip-rbd-cleanu...
Alex Elder
05:15 AM rbd Bug #4803 (Fix Under Review): krbd: memory leaks while testing layered images
OK, I have some patches ready for review but I think this will
be an ongoing process so I'll probably be bouncing th...
Alex Elder
07:58 AM rbd Bug #4800 (Fix Under Review): krbd: avoid dropping extra reference in rbd_free_disk()
The following has been posted for review:
rbd: avoid dropping extra reference in rbd_free_disk()
Alex Elder
05:10 AM rbd Bug #4800 (In Progress): krbd: avoid dropping extra reference in rbd_free_disk()
Alex Elder
05:09 AM rbd Bug #4800 (Fix Under Review): krbd: avoid dropping extra reference in rbd_free_disk()
(Nevermind. Will be ready for review shortly.) Alex Elder
07:05 AM Revision 89692e09 (ceph): debian/rules: use multiline search to look for Build-Depends
When Build-Depends was split into multiple lines (in commit
8f5c665744e58d6d51a1e86de55c1399f51cc1c3), the grep for
l...
Dan Mick
05:22 AM rbd Bug #4802: krbd: walk through error paths and fix them
I think it may be hard to describe exactly what the problems
of this type are. I do a fairly good job of it in the ...
Alex Elder
05:11 AM rbd Bug #4796 (Fix Under Review): krbd: don't create sysfs entries for snapshots of mapped images

The following has been posted for review:
[PATCH] rbd: don't create sysfs entries for non-mapped snapshots
It...
Alex Elder

04/25/2013

11:47 PM Revision 2146930e (ceph): mon: do not forward other mon's requests to other mons
The request forwarding infrastructure is there for client requests.
However, we (ab)use it for mon's sending MLog mes...
Sage Weil
11:24 PM Revision a5cade1f (ceph): PG: clear want_acting when we leave Primary
This is somewhat annoying actually. Intuitively we want to
clear_primary_state when we leave primary, but when we re...
Samuel Just
10:18 PM Revision 3ce35a67 (ceph): mon: get own entity_inst_t via messenger, not monmap
There are intervals during bootstrap(*) during which we are part of the
monmap, but our name (mon->name) does not mat...
Sage Weil
09:15 PM rgw Bug #4826 (Resolved): rgw: plain formatter does not flush correctly
This came up with the new changes that stream bucket listing. Previously we never ever flushed data while iterating, ... Yehuda Sadeh
08:16 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
IMHO problems prior to this commit. Now I happy to get working f4804849b7644f2c1dfd92404682f510a88e9a23 and going to ... Denis kaganovich
07:35 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Now all wrong, but there are at least this log. Denis kaganovich
05:32 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
hmm, according to the log it is in quorum (and leader) and healthy.. Sage Weil
04:52 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
"killall ceph-mon -w" need twice.
Denis kaganovich
04:50 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
PS First time was HEALTH_OK. Not once restart. Denis kaganovich
04:44 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
This mon now not in quorum (but running). Denis kaganovich
04:26 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Can you attach the new startup log? Sage Weil
04:19 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
And even "killall ceph-mon -w" waiting long (or infinite)... Denis kaganovich
04:17 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Sage Weil wrote:
> commit:3ce35a6743e050bf0de5abd5ad32f522c5664f3d
Hmm. Now starting good, but silent collapsing ...
Denis kaganovich
03:19 PM Bug #4811 (Resolved): incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
commit:3ce35a6743e050bf0de5abd5ad32f522c5664f3d Sage Weil
03:07 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Sage Weil wrote:
> Just to clarify: this failed startup is happening only *after* you did the manual repair (remove ...
Denis kaganovich
01:41 PM Bug #4811 (Need More Info): incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Sage Weil
01:40 PM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Just to clarify: this failed startup is happening only *after* you did the manual repair (remove store.db, replace mo... Sage Weil
02:20 AM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Sorry for flooding (this is my morning):
Last failure (on different node then first 2), between power-on and this ...
Denis kaganovich
01:29 AM Bug #4811: incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
PS About 2 previous failures: I just think about RAM limit. It just was so: busy node, long running, swapoff -a (but ... Denis kaganovich
12:25 AM Bug #4811 (Resolved): incorrect shutdown: mon/MonMap.h: 160: FAILED assert(mon_addr.count(n))
Monitor unable to start after incorrect shutdown. First happened on busy node with swapoff -a (twice), on older versi... Denis kaganovich
08:11 PM Revision b0ba4123 (ceph): Merge pull request #239 from ceph/wip-4760
#4760
Second patch Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
07:49 PM Revision a0acdcf3 (ceph): Use get('field', default) to assign downburst values for vps.
Fixes: #4592
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by: Dan Mick <dan.mick@inktank.com>
Warren Usui
06:52 PM Revision 42ab1f45 (ceph): Merge pull request #246 from ceph/wip-4793
#4793
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
06:36 PM Revision 303e739e (ceph): radosgw: receiving unexpected error code while accessing an non-existin...
This patch fixes a bug in radosgw swift compatibility code,
that is, if a not-owner but authorized user access a non-...
Li Wang
06:34 PM CephFS Bug #4742: mds: stuck clientreplay request
Yeah, we've discussed this some on github around wip-4742 and on irc. :) Greg Farnum
06:31 PM CephFS Bug #4742: mds: stuck clientreplay request
Looks like a client bug, it may add cap releases to the replay requests. (encode_cap_releases() should be called when... Zheng Yan
10:38 AM CephFS Bug #4742: mds: stuck clientreplay request
Logs for two runs, one is stuck in replay from a setattr, the other is stuck in replay from a rename.
Sam Lang
06:17 PM Revision 407ce132 (ceph): PendingReleaseNotes: these are now in the release-notes.rst
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:17 PM Revision c979d65b (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
06:17 PM Revision 4af93dcc (ceph): doc/release-notes: add note about sysvinit script change
See cd7e52cc76878eed0f084f7b9a6cf7c792b716c6.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:13 PM Revision cd7e52cc (ceph): init-ceph: use remote config when starting daemons on remote nodes (-a)
If you use -a to start a remote daemon, assume the remote config is present
instead of pushing the local config. Thi...
Sage Weil
05:21 PM Revision ea54e660 (ceph): Merge branch 'wip-4748-b' into next
Reviewed-by: Greg Farnum <greg@inktank.com> Sage Weil
05:13 PM devops Bug #4825 (Resolved): ceph-deploy: install failed on debian-wheezy
... Tamilarasi muthamizhan
04:52 PM Bug #4824 (Resolved): msgr: crash in submit_message
... Sage Weil
04:30 PM Bug #4813 (Resolved): pgs stuck creating
This was probably fixed by the fix for 4748 Samuel Just
09:01 AM Bug #4813: pgs stuck creating
ubuntu@teuthology:/a/teuthology-2013-04-25_01:00:08-rados-next-testing-basic/584 Samuel Just
09:01 AM Bug #4813 (Resolved): pgs stuck creating
2013-04-25T02:36:57.292 DEBUG:teuthology.misc:with jobid basedir: 584
2013-04-25T02:36:57.292 DEBUG:teuthology.orche...
Samuel Just
04:27 PM devops Bug #4823 (Resolved): ceph-deploy: install not implemented for RHEL 6.3
... Tamilarasi muthamizhan
04:25 PM Bug #3904 (Pending Backport): FAILED assert(want_acting.empty())
Samuel Just
02:14 PM Bug #3904 (Fix Under Review): FAILED assert(want_acting.empty())
Sage's scenario is most likely correct, pushed wip_3904. Samuel Just
04:14 PM Revision fb17d37f (ceph): Revert "turn on debugging for MDS and Client in FS runs"
We want to apply debugging and whitelists, not one or the
other -- whoops!
This reverts commit 60e7fb4152a7f42594d86...
Greg Farnum
04:14 PM Revision ae00c60b (ceph): temporarily add cephfs debugging to overrides
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
04:14 PM Revision 5d5e0a6e (ceph): Revert "turn on debugging for MDS and Client in FS runs"
We want to apply debugging and whitelists, not one or the
other -- whoops!
This reverts commit cb1e8ed954c41840f28f5d...
Greg Farnum
04:14 PM Revision 35cf1220 (ceph): temporarily add cephfs debugging to overrides
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
04:08 PM Bug #4815 (Need More Info): mon: leveldb grows quickly and without bound
It's not entirely clear what's going on here with just the messenger logging. If you can get monitor logging (and fro... Greg Farnum
12:33 PM Bug #4815: mon: leveldb grows quickly and without bound
Actually, I meant dmesg instead of syslog above. Looking at the syslog, ceph-mon was killed by oom-killer:
Killed ...
Mike Dawson
12:17 PM Bug #4815 (Resolved): mon: leveldb grows quickly and without bound
My mon.a process went away without a core dump or indication in the ceph-mon log or syslog of what happened. mon.b an... Mike Dawson
04:01 PM Bug #4822 (Resolved): After 5 minutes a down OSD is NOT marked out

Seeing this message:
tick entire containing rack subtree for osd.0 is down; resetting timer
OSDMap::containing_...
David Zafman
03:14 PM Bug #4812: mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor to connect to. ...
The only way I can see this assert happening is if b randomly selected the previously-chosen monitor (c) or itself 6 ... Greg Farnum
02:50 PM Bug #4812: mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor to connect to. ...
Yep, bootstrap() calls reset_sync(). So c dropped b's sync on the floor, and then b timed out of course. Was it suppo... Greg Farnum
02:08 PM Bug #4812: mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor to connect to. ...
Okay, this is sort of what was supposed to happen, I think. mon c stopped responding to mon b's sync queries, and it ... Greg Farnum
01:31 PM Bug #4812 (In Progress): mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor t...
Greg Farnum
08:56 AM Bug #4812: mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor to connect to. ...
ubuntu@teuthology:/a/teuthology-2013-04-25_01:00:08-rados-next-testing-basic/587/ Samuel Just
08:55 AM Bug #4812 (Resolved): mon/Monitor.cc: 1107: FAILED assert(0 == "Unable to find a new monitor to c...
0> 2013-04-25 05:52:00.052720 7f126a7fc700 -1 mon/Monitor.cc: In function 'void Monitor::sync_timeout(entity_ins... Samuel Just
03:10 PM Bug #3214: osdmaptool's usage is incomplete
The rest of the bug still needs review/update; --test-map-object is indeed in the usage though (I must have been usin... Dan Mick
02:44 PM Bug #4821 (Resolved): monitor: actually setting an exclusive in _pick_random_mon would break things
... Greg Farnum
02:08 PM devops Bug #4820 (Resolved): ceph-deploy : intermittent errors during install
not often, but see this error at the end of install. It would be nice to make this error look better or let the user ... Tamilarasi muthamizhan
02:02 PM Revision d90b0caf (ceph): gen_state_diagram.py: fix function name
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:02 PM Revision 1ee8f390 (ceph): gen_state_diagram.py: fix naming of global variables/constants
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:02 PM Revision d9f8de1e (ceph): gen_state_diagram.py: add some missing spaces around operators
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:02 PM Revision 7cd9d23f (ceph): gen_state_diagram.py: remove unnecessary semicolon
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:02 PM Revision eb3350e4 (ceph): test_mon_config_key.py: fix some more naming of local vars
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:02 PM Revision 74365429 (ceph): test_mon_config_key.py: fix naming of local variable opLOG
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:01 PM Revision 9d3b4fd7 (ceph): test_mon_config_key.py: fix naming of local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
02:00 PM Revision 04075722 (ceph): fix "Instance of 'list' has no 'split' member"
Cast with str() to fix issue.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
01:57 PM Revision c792ea67 (ceph): test_mon_config_key.py: fix naming of local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:56 PM Revision 912bb82c (ceph): test_mon_config_key.py: fix naming of global variables/constants
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:56 PM Revision 1464169a (ceph): test_mon_config_key.py: add missing space after comma
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:56 PM Revision 16c56506 (ceph): test_mon_config_key.py: remove unnecessary semicolon
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:55 PM Revision f601eb90 (ceph): test_mon_config_key.py: fix bad indentation
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:54 PM Revision 9dd5de26 (ceph): perf-watch.py: fix naming of a local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:53 PM Revision 226ff52a (ceph): perf-watch.py: fix naming of local variable
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:53 PM Revision 148710fb (ceph): perf-watch.py: add missing space after comma
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:52 PM Revision dffa9eeb (ceph): perf-watch.py: remove unnecessary semicolons
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de> Danny Al-Gaaf
01:23 PM rgw Feature #4819 (New): rgw: add a test for #4797
need to add a test in the swift test suite that checks for issue #4797. Yehuda Sadeh
01:19 PM rbd Bug #4446: librbd: crash from opensolaris vm
Even without NIS or NFS, I'm guessing it'll get far enough to hit the error. I'll email you a place to upload the image. Josh Durgin
04:44 AM rbd Bug #4446: librbd: crash from opensolaris vm
Thanks for continuing to pursue this.
I can send you the image (about 20GB), but it may have issues booting (depen...
Jeff Moskow
01:19 PM rgw Bug #4760 (Resolved): rgw: list buckets/containers should be streamlined
commit:b0ba41235af901bd7e64588e2a247c6a56ec5cfa Sage Weil
01:15 PM Bug #4793 (Resolved): mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
It passed and Sage merged it into next with commit:42ab1f4561cde4c724849c41a7929c93d89e89d9 Greg Farnum
10:13 AM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Sage reviewed this; unfortunately schedule_suite and teuthology failed somehow so these tests didn't actually run. :( Greg Farnum
01:13 PM rbd Bug #4803: krbd: memory leaks while testing layered images
I've been reviewing the rbd code with an eye toward finding
leaks. I have two small ones that I'll fix, but I have ...
Alex Elder
12:56 PM Bug #4748 (Resolved): mon: failed assert in OSDMonitor::build_incremental
Sage Weil
12:55 PM Bug #4810: mon: forwarded messages have weird priorities
which was problematic because of... Sage Weil
12:53 PM Bug #4810 (In Progress): mon: forwarded messages have weird priorities
actually this was the forwards taking the priority from the client msg. fixed that in wip-4748-b and running tests i... Sage Weil
12:26 PM Bug #4816 (Can't reproduce): Monitor crashed with signal Aborted in MMonSubscribe::~MMonSubscribe()
This crash occurred on a non-leader (b) while the leader (a) was experiencing some kind of a memory leak and all mons... Matthew Roy
11:31 AM rgw Bug #4797 (Resolved): rgw: receiving unexpected error code while accessing an non-existing object...
Done, merged patch by Li Wang to next, commit:303e739e5b34ad1aaedb0025ffc6da1a9e04c320. Yehuda Sadeh
10:33 AM rgw Bug #4797 (In Progress): rgw: receiving unexpected error code while accessing an non-existing obj...
Yehuda Sadeh
09:23 AM rgw Bug #4497: rgw: FAIL: testSlashInName (test.functional.tests.TestContainer)
Oh, I think figured it out. It turns out that we do return sometimes 400 and not 404.
The test itself tries to do ...
Yehuda Sadeh
06:24 AM rbd Bug #2654: Stale rbd volume cannot be unmaped
Hi, thanks for replying. Here's the info:... Leon Keijser
01:02 AM rbd Bug #2700 (Resolved): blkdeviotune method at libvirt doesn`t work on RBD volumes
The patch got accepted into libvirt: http://www.libvirt.org/git/?p=libvirt.git;a=commit;h=e3e866aee0f8b0b125da74e1afc... Wido den Hollander
12:39 AM Revision 6b8f1c6b (ceph): repair_test.py: Additional test cases
Test repair with more than 1 damaged object and with different types of damage
Regression test for bug #4778
Signed-...
David Zafman
12:33 AM Revision f4804849 (ceph): Merge branch 'wip-4778' into next
Reviewed-by: Samuel Just <sam.just@inktank.com> David Zafman
12:32 AM Revision ac3dda21 (ceph): scrub clears inconsistent flag set by deep scrub
Add new num_deep_scrub_errors and num_shallow_scrub_errors to object_stat_sum_t
Show deep-scrub error count when outp...
David Zafman

04/24/2013

11:46 PM Revision ba527c1e (ceph): doc/release-notes: enospc note
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:42 PM Revision 2075ec60 (ceph): doc/release-notes: 0.61 cuttlefish notes
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:20 PM Revision 43225220 (ceph): Merge pull request #242 from ceph/wip-objectcacher-enoent
Reviewed-by: Sage Weil <sage.weil@inktank.com> Josh Durgin
10:54 PM Revision 82d5cd60 (ceph): ObjectCacher: remove all buffers from a non-existent object
Once we're sure an object doesn't exist, we retry all the waiters in
order, and they return -ENOENT immediately. If t...
Josh Durgin
10:40 PM Revision fb8bad31 (ceph): mon: be more careful about making sure we're up-to-date on sync check
We were looking at our own paxos_max_join_drift and using that to
calculate whether we were new enough to join withou...
Greg Farnum
10:40 PM Revision fcaabf1a (ceph): mon: when electing, be sure acked leaders have new enough stores to lead
In general anybody participating in an election should be new enough to
lead thanks to the bootstrap process, but we'...
Greg Farnum
10:07 PM Revision 290b5eb0 (ceph): rgw: fix i386 compile error
error: rgw/rgw_op.cc:665:63: no matching function for call to ‘min(uint64_t, size_t&)’
Signed-off-by: Sage Weil <sag...
Sage Weil
10:05 PM Revision 14f23922 (ceph): FileStore::_split_collection: src or dest may be removed on replay
If the collection is subsequently removed, the _split_collection
might get replayed and find either src or dest remov...
Samuel Just
09:34 PM Revision 3604c982 (ceph): librados: fix calc_snap_set_diff interval calculation
When calculating the [a,b] interval over which a given clone is valid, do
not assume that b == the clone id; that is ...
Sage Weil
09:04 PM Revision 5668e5b5 (ceph): Merge remote-tracking branch 'upstream/wip_2476' into next
Fixes: #2476
Reviewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
08:46 PM devops Feature #4667: ceph-deploy update
Gary's put a test repo up, and I've fleshed out the code to handle adding that if necessary, and it
seems to be work...
Dan Mick
08:20 PM Revision 81a6165c (ceph): PG: call check_recovery_sources in remove_down_peer_info
If we transition out of peering due to affected
prior set, we won't trigger start_peering_interval
and check_recovery...
Samuel Just
07:26 PM Revision a9791dae (ceph): mon: send clients away while sychronizing
When we are out of quorum, we waitlist client messages or (eventually)
send them elsewhere. If we are synchronizing,...
Sage Weil
06:23 PM Revision 12bc9a7a (ceph): mkcephfs: give mon. key 'allow *' mon caps
This will ease the transition from mkcephfs to ceph-deploy by allowing
ceph-create-keys to use the mon. keyring file ...
Sage Weil
05:58 PM Bug #4778 (Resolved): scrub clears inconsistent flag set by deep scrub
ac3dda214d52c10206328a92e4373521200c8863 David Zafman
05:16 PM Revision cce1c91a (ceph): PendingReleaseNotes: note about rbd resize --allow-shrink
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
05:10 PM rbd Bug #4446: librbd: crash from opensolaris vm
Sorry for the delay. I've learned that Solaris sector counts could start at 1 instead of 0, so rbd did at least see t... Josh Durgin
05:00 PM Bug #4810 (Won't Fix): mon: forwarded messages have weird priorities
While testing #4748, i'm seeing MForward messages between monitors getting lost. they are enqueued by read_message, ... Sage Weil
04:24 PM Bug #4784: Two Monitors Concurrently Reporting as Leaders
Oh, yeah. Looks like there were a bunch of backed up messages, and the second leader was having as much trouble with ... Greg Farnum
07:28 AM Bug #4784: Two Monitors Concurrently Reporting as Leaders
Greg, during this state ceph -s hangs for longer than I have waited (several minutes). All RBD volumes are stalled/un... Mike Dawson
04:22 PM rbd Bug #3664 (Resolved): osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
commit:82d5cd601e0fb7cb24dda4ea1f0e9f12e5d18708 Josh Durgin
04:02 PM rbd Bug #3664 (Fix Under Review): osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
Josh Durgin
08:13 AM rbd Bug #3664: osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
ubuntu@teuthology:/a/teuthology-2013-04-23_19:55:59-rbd-next-testing-basic$ less 155/teuthology.log
Sage Weil
08:13 AM rbd Bug #3664: osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
ubuntu@teuthology:/a/teuthology-2013-04-23_19:55:59-rbd-next-testing-basic$ less 148/teuthology.log
Sage Weil
04:13 PM devops Bug #4498 (Resolved): ceph-deploy osd create doesn't set up symlink for single node
commit:3a74cfcda2f37550e8f68d0d5b664151225a9244
Dan Mick
03:56 PM Bug #4793 (Fix Under Review): mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUEST...
wip-4793. Waiting for it to build so I can kick off some teuthology tests. Greg Farnum
02:35 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, given that setup the only way to be participating in an election but too far behind is if we've been alive but ... Greg Farnum
02:17 PM Bug #4793 (In Progress): mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, so mon.a lost quorum leadership for about 13 seconds without noticing; looks like it got stuck waiting for paxo... Greg Farnum
01:12 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Ah, that would make self-abdication a more palatable solution indeed. Greg Farnum
01:03 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
One thought: before the election they do the probe step.. maybe a simple flag in the election that says "i think i'm ... Sage Weil
12:54 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Actually, we can't just send along the versions because then the voters need global state in order to respond to each... Greg Farnum
11:06 AM Bug #4793 (In Progress): mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, discussed this a little more and there was some confusion about whether we were discussing the cluster leader o... Greg Farnum
03:51 PM Revision 14777ec1 (ceph): Merge remote-tracking branch 'gh/next'
Conflicts:
ceph.spec.in
Sage Weil
03:49 PM Revision 31399d17 (ceph): Fix typo of the keystone service-create command
Signed-off-by: leseb <sebastien.han@enovance.com> Sébastien Han
03:49 PM Revision 9abec309 (ceph): rgw: list container only shows stats if needed
Fixes: #4759
Add a new request param 'stats' for the swift list containers
request. If set to 'false' it disables sta...
Yehuda Sadeh
03:46 PM Bug #4703 (Can't reproduce): ceph health hangs when upgrading from bobtail to next branch
this appears to be resolved; unable to reproduce (whereas it used to be pretty frequently triggered). Sage Weil
10:18 AM Bug #4703: ceph health hangs when upgrading from bobtail to next branch
Greg, can you please take a look at this? Ian Colle
03:36 PM Revision c7a0477b (ceph): rbd: fix cli-integration tests for striping change
We don't set the striping feature when we are using backward-compatible
(default) striping now; fix the test accordin...
Sage Weil
03:22 PM Revision 446641aa (ceph): 95-ceph-osd-alt.rules: Fix missing parent parameter
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
03:14 PM rbd Bug #4526 (Can't reproduce): rbd-fsx: ENOTEMPTY
Sage Weil
03:14 PM Bug #4348 (Resolved): OSD slow request leads to RBD clients stalled/delayed
oh, just noticed this is 0.56.2. upgrade to .4 and the stalls will go away. Sage Weil
10:04 AM Bug #4348: OSD slow request leads to RBD clients stalled/delayed
Ivan, are you still seeing this problem? Sage Weil
03:06 PM Bug #4806 (Pending Backport): os/FileStore.cc: In function 'void FileStore::_set_replay_guard() f...
Samuel Just
01:29 PM Bug #4806 (Resolved): os/FileStore.cc: In function 'void FileStore::_set_replay_guard() failure
... Sage Weil
02:40 PM Cleanup #4809 (Resolved): MMonProbe extra fields
Looks to me like we have some unused fields in MMonProbe now:... Greg Farnum
02:35 PM Bug #4785 (Resolved): rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps...
Sage Weil
02:35 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
merged to next, commit:3604c98232615827812099af27ebc3ed2414c8eb Sage Weil
02:30 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Thanks! Diffs completed. Denis kaganovich
01:37 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Perfect, I see the problem now! Can you try wip-4785-b? Sage Weil
01:28 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Sage Weil wrote:
> i mean the output from the command 'rados -p rbd listsnaps rb.0.c558.238e1f29.000000000000'
OK...
Denis kaganovich
01:27 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Or you want output from --snap ... --from-snap ... ?
I in doubts!
Denis kaganovich
01:16 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
i mean the output from the command 'rados -p rbd listsnaps rb.0.c558.238e1f29.000000000000' Sage Weil
01:15 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
HMM. If I right understand, you want console output? IMHO it near same (a bit duplicating) to already attached "foo" ... Denis kaganovich
12:46 PM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Denis: can you attach teh output from the listsnaps command above? Sage Weil
11:32 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
... there was from "--from-snap backup" to active image. To secondary snapshot log differ in not significant details.... Denis kaganovich
11:30 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
can you do 'rados -p rbd listsnaps rb.0.c558.238e1f29.000000000000'?
also are you on irc? that would be quicker t...
Sage Weil
11:11 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Sage Weil wrote:
> Denis: can you run the rbd failing command with --log-file foo --log-max 1 --debug-ms 1 --debug-r...
Denis kaganovich
10:45 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Denis: can you run the rbd failing command with --log-file foo --log-max 1 --debug-ms 1 --debug-rbd 20?
Also, push...
Sage Weil
03:28 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
PS /etc/init.d/ceph restart - on all... Denis kaganovich
03:24 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
Now installing "next" branch on all nodes (as wip-3495 here now):
librados/snap_set_diff.cc: 40: FAILED assert(b =...
Denis kaganovich
02:35 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
git, wip-3495 branch:
# ceph-osd --version
ceph version 0.60-476-gd3752d2 (d3752d2a09f221f8cee6919ce59d102fd7f2f9...
Denis kaganovich
02:06 PM Bug #2476 (Resolved): osd: watch timeout depends on operations to an object
Samuel Just
01:22 PM Bug #2476: osd: watch timeout depends on operations to an object
Samuel Just
01:21 PM Bug #2476 (Pending Backport): osd: watch timeout depends on operations to an object
Samuel Just
01:34 PM Documentation #4807 (Closed): Document PG states, possible causes, and possible remedies
I think it would help people help themselves a lot if we enumerated the various pg states and what could
cause them ...
Dan Mick
01:22 PM Bug #4805 (Pending Backport): ReplicatedPG: pull bug
Samuel Just
12:22 PM Bug #4805 (Fix Under Review): ReplicatedPG: pull bug
Reset needs to check_recovery_sources, have patch. wip_4805 Samuel Just
12:19 PM Bug #4805 (Resolved): ReplicatedPG: pull bug
-7> 2013-04-23 21:03:12.595110 7fee6572c700 10 osd.5 1119 do_waiters -- finish
-6> 2013-04-23 21:03:12.75589...
Samuel Just
01:08 PM Bug #4521: mon: starting a new osd crashes all mon's
Samuel Just wrote:
> Evan: what version of leveldb are you using?
leveldb-1.7.0-2.el6.x86_64
Evan Felix
01:01 PM Bug #4521: mon: starting a new osd crashes all mon's
the original conversion bug is fixed, and the fixer works for those who need it, modulo this leveldb thing. we shoul... Sage Weil
12:50 PM Bug #4521: mon: starting a new osd crashes all mon's
Evan: what version of leveldb are you using? Samuel Just
06:05 AM Bug #4521: mon: starting a new osd crashes all mon's
The issue appears to be with leveldb's state, which is returning 'Invalid argument: not an sstable (bad magic number)... Joao Eduardo Luis
12:47 PM Bug #4748: mon: failed assert in OSDMonitor::build_incremental
Sage Weil
11:31 AM Bug #4748: mon: failed assert in OSDMonitor::build_incremental
last little bit of log:... Sage Weil
11:27 AM Bug #4748: mon: failed assert in OSDMonitor::build_incremental
got logs, ubuntu@teuthology:/a/sage-e1/313 Sage Weil
09:04 AM Bug #4748: mon: failed assert in OSDMonitor::build_incremental
kicked of job sage-e1 to try to reproduce this with logs Sage Weil
11:25 AM devops Bug #4756: mkcephfs doesn't set up same keys as ceph-deploy
The transition doc should be something like 'transitioning an existing cluster from mkcephfs to ceph-deploy', and the... Sage Weil
11:23 AM devops Bug #4756: mkcephfs doesn't set up same keys as ceph-deploy
wip-4756 tested out ok, commit:12bc9a7aa9cb2f47c952dee9abb210dc4eacf470 Sage Weil
09:28 AM devops Bug #4756: mkcephfs doesn't set up same keys as ceph-deploy
Well ceph-create-keys isn't able to add it because it needs those keys, right? ;)
I think a transition document is...
Greg Farnum
09:25 AM devops Bug #4756: mkcephfs doesn't set up same keys as ceph-deploy
I'll update mkcephfs to do this to ease future users' transition to ceph-deploy.
For existing clusters, the transi...
Sage Weil
10:52 AM rbd Bug #2654 (Need More Info): Stale rbd volume cannot be unmaped
Sage Weil
10:52 AM rbd Bug #2654 (In Progress): Stale rbd volume cannot be unmaped
Can you post results from find /sys/bus/rbd/devices -ls and ls -al /dev/rbd* ? Sage Weil
12:06 AM rbd Bug #2654: Stale rbd volume cannot be unmaped
Please consider re-opening this ticket. I am experiencing the same issue, even with the latest kernel version:
<pr...
Leon Keijser
10:47 AM rbd Feature #4804 (Rejected): tgt: switch to aio
Use aio interface for tgt to avoid a workqueue + sync items. Sage Weil
10:09 AM rbd Bug #4522 (Need More Info): RBD utility "showmapped" bug
Sage Weil
10:09 AM rbd Bug #4522: RBD utility "showmapped" bug
Do you still see this?
What 'showmapped' is looking at is /sys/bus/rbd/devices/*... an ls -al of that directory wo...
Sage Weil
09:23 AM Bug #4194 (Can't reproduce): osd, librados: listing objects got premature ENOENT
Sage Weil
09:23 AM Linux kernel client Bug #4524 (Can't reproduce): libceph: bad ptr deref in rbtree for kick_requests
Sage Weil
09:23 AM devops Bug #4520 (Resolved): ceph-disk-prepare intermittently fails on Centos
commit:9eda8e5d5abf0743a2ad484806cfb2018243515f Sage Weil
09:22 AM rbd Bug #4803 (Resolved): krbd: memory leaks while testing layered images
I have a series of small tests I run to test rbd functionality.
I occasionally run them in a loop in my UML environm...
Alex Elder
09:13 AM Bug #4067 (Won't Fix): Argonaut fails to build on fedora18
Sage Weil
09:03 AM rbd Bug #4802: krbd: walk through error paths and fix them
Fixed project. Alex Elder
09:02 AM rbd Bug #4802 (Resolved): krbd: walk through error paths and fix them
I have encountered a few places where the kernel rbd
code does not handle error conditions exactly right.
There app...
Alex Elder
08:57 AM rbd Bug #4800: krbd: avoid dropping extra reference in rbd_free_disk()
This is basically done.
As I look through the code though I see there are other places
where error handling does ...
Alex Elder
07:43 AM Bug #4801 (Duplicate): osd class path broken on fedora 18?
https://bugzilla.redhat.com/show_bug.cgi?id=891993 Sage Weil
03:54 AM Revision a40772be (ceph): osd_types: add last_became_active to pg_stats
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
03:54 AM Revision 1f7ff412 (ceph): ReplicatedPG: timeout watches based on last_became_active
This way a notify on an object with a single defunct watcher
won't necessarily have to wait the full timeout if the p...
Samuel Just
03:51 AM Revision d44cfc52 (ceph): Merge branch 'wip_4552' into next
Fixes: #4552
Reviewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
01:27 AM Revision 297c6714 (ceph): DispatchQueue: track queued message arrival times and expose oldest
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
01:27 AM Revision 49eeaeba (ceph): Messenger: add interface to get oldest queued message arrival time
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
01:27 AM Revision d196b5ba (ceph): OSD: don't report peers down if hbclient_messenger is backed up
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
01:10 AM Revision fd750da9 (ceph): Add changes to make teuthology suites work on vms.
Fixes: #4719
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by: Dan Mick <dan.mick@inktank.com>
Warren Usui
12:24 AM Revision a8e7e9df (ceph): init-ceph: fix (and simplify) pushing ceph.conf to remote unique name
The old code would only do the push once per remote node (due to the
list in $pushed_to) but would reset $unique on e...
Sage Weil
12:23 AM Revision 0cd86dfb (ceph): Merge pull request #237 from ceph/wip-4794
init-ceph: fix (and simplify) pushing ceph.conf to remote unique name Sage Weil
12:17 AM Revision e09efda7 (ceph): Merge pull request #241 from ceph/wip-4798
#4798
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
12:16 AM Revision 48631c11 (ceph): mon: revert part of PaxosService::is_readable() change
In 98e23980f4ab7ba289303f72da06721c84767293 is_readable() was changed to
call is_active(), but that has a check for i...
Sage Weil

04/23/2013

11:30 PM Revision 97c77985 (ceph): Merge branch 'wip-teuthologyfix4693-wusui'
Warren Usui
11:28 PM Revision b7aaa198 (ceph): Check downburst paths. Display an appropriate error message if an
executable downburst cannot be found.
Fixes: #4693
Signed-off-by: Warren Usui <warren.usui@inktank.com>
Reviewed by:...
Warren Usui
11:18 PM Revision 0093d704 (ceph): librbd: fix i386 build
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:11 PM Revision 5349ee30 (ceph): Merge pull request #240 from ceph/wip-4665
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
10:57 PM Revision 857c88e0 (ceph): librbd: add read_iterate2 call with fixed argument type
The existing read_iterate takes a size_t for the length, which is only 4GB
on 32-bit machines. Instead, take a uint6...
Sage Weil
10:45 PM Revision 6c798ed9 (ceph): librbd: implement read not in terms of read_iterate
The read() method returns the bytes read, trimmed to the end of the image;
use the other read() variant to do this (w...
Sage Weil
09:06 PM Revision 95ed73a7 (ceph): mon: drop forwarded requests after an election
On each election, we resend routed requests to the new leader (or
requeue for ourselves). Therefore, if we receive a...
Sage Weil
08:54 PM Bug #4552 (Resolved): osd: temporarily hung box marks down peers
I think the problem was likely caused by a severely backed up heartbeat client dispatch queue. d44cfc524fc0844c6027c... Samuel Just
08:45 PM Revision ab257070 (ceph): mon: requeue routed_requests for self if elected leader
If we have requests that we have forwarded, and are elected leader,
requeue those requests for ourself and queue them...
Sage Weil
08:40 PM Revision 4b07d692 (ceph): mon: track original Connection* for forwarded requests
Keep a reference to the source Connection* for forwarded requests. This
makes the reply path slightly cleaner, and w...
Sage Weil
07:50 PM Revision 526863ee (ceph): remove ext4 from rados thrashing for now
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
07:44 PM Revision 426e3be6 (ceph): Merge pull request #222 from ceph/wip-3495
Reviewed-by: Greg Farnum <greg@inktank.com> Gregory Farnum
07:28 PM Revision 8402107c (ceph): test_filejournal: adjust corrupt entry tests to force header write
The journal no longer assumes corruption if it finds a valid entry
after an inavlid entry. Instead, these tests will...
Samuel Just
07:15 PM rbd Bug #4800 (Resolved): krbd: avoid dropping extra reference in rbd_free_disk()
I found during some failure injection testing that the call to
rbd_free_disk() in the error path of rbd_dev_probe_fi...
Alex Elder
07:04 PM Revision 9374bacc (ceph): Merge pull request #238 from ceph/wip-bobtail-rbd-backports-req-order
Reviewed-by: Sage Weil <sage.weil@inktank.com> Josh Durgin
06:33 PM Revision d86f9b1d (ceph): ObjectCacher: always complete flush_set() callback
This removes the last remnants of
b5e9995f59d363ba00d9cac413d9b754ee44e370. If there's nothing to flush,
immediately ...
Josh Durgin
06:33 PM Revision ee7bf281 (ceph): ObjectCacher: remove NULL checks in flush_set()
Callers will always pass a callback, so assert this and remove the
checks for it being NULL.
Signed-off-by: Josh Dur...
Josh Durgin
06:33 PM Revision 3a61d17b (ceph): ObjectCacher: remove unneeded var from flush_set()
The gather will only have subs if there is something to flush. Remove
the safe variable, which indicates the same thi...
Josh Durgin
06:33 PM Revision fb95b800 (ceph): librados: add async flush interface
Sometimes you don't want flush to block, and can't modify
already scheduled aio_writes. This will be useful for a
lib...
Josh Durgin
06:33 PM Revision f9bcffa2 (ceph): librados: add versions of a couple functions taking explicit snap args
Usually the snapid to read from or the snapcontext to send with a write
are determined implicitly by the IoCtx the op...
Josh Durgin
06:33 PM Revision cbb37fb5 (ceph): librbd: add an is_complete() method to AioCompletions
Mainly this is useful for testing, like flushing and checking that
all pending writes are complete after the flush fi...
Josh Durgin
06:33 PM Revision f2e490cb (ceph): librbd: use the same IoCtx for each request
Before we were duplicating the IoCtx for each new request since they
could have a different snapshot context or read ...
Josh Durgin
06:33 PM Revision 31a45e8e (ceph): librbd: add an async flush
At this point it's a simple wrapper around the ObjectCacher or
librados.
This is needed for QEMU so that its main th...
Josh Durgin
06:33 PM Revision d36c5b5b (ceph): librados: move snapc creation to caller for aio_operate
The common case already has a snapshot context, so avoid duplicating
it (copying a potentially large vector) in IoCtx...
Josh Durgin
06:33 PM Revision 4a1c27c0 (ceph): librados: don't use lockdep for AioCompletionImpl
This is a quick workaround for the next branch. A more complete fix
will be done for the master branch. This does not...
Josh Durgin
06:33 PM Revision 7bc8df1f (ceph): test_stress_watch: remove bogus asserts
There's no reason to check the duration of a watch. The notify will
timeout after 30s on the OSD, but there's no guar...
Josh Durgin
06:33 PM Revision 13ba07a0 (ceph): ObjectCacher: deduplicate final part of flush_set()
Both versions of flush_set() did the same thing. Move it into a
helper called from both.
Signed-off-by: Josh Durgin ...
Josh Durgin
06:33 PM Revision 124f81cc (ceph): WritebackHandler: make read return nothing
The tid returned by reads is ignored, and would make tracking writes
internally more difficult by using the same id-s...
Josh Durgin
06:33 PM Revision 884438fe (ceph): LibrbdWriteback: use a tid_t for tids
An int could be much smaller, leading to overflow and bad behavior.
Signed-off-by: Josh Durgin <josh.durgin@inktank....
Josh Durgin
06:33 PM Revision 7a11c250 (ceph): LibrbdWriteback: removed unused and undefined method
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
(cherry picked from commit 909dfb7d183f54f7583a70c05550bec07856d...
Josh Durgin
06:33 PM Revision 0e2266db (ceph): LibrbdWriteback: complete writes strictly in order
RADOS returns writes to the same object in the same order. The
ObjectCacher relies on this assumption to make sure pr...
Josh Durgin
06:33 PM Revision aa37726b (ceph): rbd: only set STRIPINGV2 feature when needed
Only set the STRIPINGV2 feature if the striping parameters are non-default.
Specifically, fix the case where the pass...
Josh Durgin
06:33 PM Revision 959bfe90 (ceph): osdc/Objecter: unwatch is a mutation, not a read
This was causing librados to unblock after the ACK on unwatch, which meant
that librbd users raced and tried to delet...
Sage Weil
06:33 PM Revision d9636faa (ceph): osd: make watch OSDOp print sanely
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit de4fa95f03b99a55b5713911c364d7e2a4588679)
Sage Weil
06:33 PM Revision 9ea4dac1 (ceph): objecter: separate out linger_read() and linger_mutate()
A watch is a mutation, while a notify is a read. The mutations need to
pass in a proper snap context to be fully cor...
Sage Weil
06:33 PM Revision d8ac6cbf (ceph): objecter: initialize linger op snapid
Since they are write ops now, it must be CEPH_NOSNAP or the OSD
returns EINVAL.
Signed-off-by: Josh Durgin <josh.dur...
Josh Durgin
06:33 PM Revision 9b292199 (ceph): common: add lockers for RWLocks
This makes them easier to use, especially instead of existing mutexes.
Signed-off-by: Josh Durgin <josh.durgin@inkta...
Josh Durgin
06:33 PM Revision 6e6636d5 (ceph): librbd: use rwlocks instead of mutexes for several fields
Image metadata like snapshots, size, and parent is frequently read,
but rarely updated. During flatten, we were depen...
Josh Durgin
06:33 PM Revision 34e9030e (ceph): librbd: make sure racing flattens don't crash
The only way for a parent to disappear is a racing flatten completing,
or possibly in the future the image being forc...
Josh Durgin
06:33 PM Revision 796066b7 (ceph): Merge branch 'wip-4249' into wip-4249-master
Make snap_rollback() only take a read lock on snap_lock, since
it does not modify snapshot-related fields.
Conflicts:...
Josh Durgin
06:33 PM Revision cd989681 (ceph): librbd: fix rollback size
The duplicate calls to get_image_size() and get_snap_size() replaced
by 5806226cf0743bb44eaf7bc815897c6846d43233 unco...
Josh Durgin
06:33 PM Revision f2bcf241 (ceph): test_rbd: move flatten tests back into TestClone
They need the same setup, and it's easy enough to run specific
subtests. Making them a separate subclass accidentally...
Josh Durgin
06:33 PM Revision 1e51be05 (ceph): ObjectCacher: keep track of outstanding reads on an object
Reads always use C_ReadFinish as a callback (and they are the only
user of this callback). Keep an xlist of these for...
Josh Durgin
06:33 PM Revision d9ca1b00 (ceph): ObjectCacher: add a method to clear -ENOENT caching
Clear the exists and complete flags for any objects that have exists
set to false, and force any in-flight reads to r...
Josh Durgin
06:33 PM Revision 1c44b66f (ceph): librbd: invalidate cache when flattening
The cache stores which objects don't exist. Flatten bypasses the cache
when doing its copyups, so when it is done the...
Josh Durgin
06:33 PM Revision 9facdcac (ceph): librbd: optionally wait for a flush before enabling writeback
Older guests may not send flushes properly (i.e. never), so if this is
enabled, rbd_cache=true is safe for them trans...
Josh Durgin
06:33 PM Revision 7bc1596b (ceph): librbd: flush cache when set_snap() is called
If there are writes pending, they should be sent while the image
is still writeable. If the image becomes read-only, ...
Josh Durgin
06:33 PM Revision e237dfc7 (ceph): ObjectCacher: optionally make writex always non-blocking
Add a callback argument to writex, and a finisher to run the
callbacks. Move the check for dirty+tx > max_dirty into ...
Josh Durgin
06:33 PM Revision 3b0c565d (ceph): librbd: make aio_writes to the cache always non-blocking by default
When the ObjectCacher's writex blocks, it affects the thread requesting
the aio, which can cause starvation for other...
Josh Durgin
06:33 PM Revision 0f2e5d36 (ceph): objectcacher: Remove commit_set, use flush_set
commit_set() and flush_set() are identical in functionality,
so use flush_set everywhere and remove commit_set from
t...
Sam Lang
06:33 PM Revision 00dfb3f0 (ceph): ObjectCacher: fix flush_set when no flushing is needed
C_GatherBuilder takes ownership of the Context we pass it. Deleting it
in flush_set after constructing the C_GatherBu...
Josh Durgin
06:31 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
huh, forgot to mention that there is one case in which an out-of-quorum monitor must be elected the (sync) leader in ... Joao Eduardo Luis
05:42 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
Okay, this appears to be happening because the elected leader is too far behind, so it starts syncing and the system ... Greg Farnum
01:53 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-04-23_11:03:00-rados-next-testing-basic/126
with logs
Sage Weil
01:49 PM Bug #4793: mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
No idea what's going on here, but I'm going to start looking into it. Greg Farnum
09:30 AM Bug #4793 (Resolved): mon/Monitor.cc: 1126: FAILED assert(!(sync_role & SYNC_ROLE_REQUESTER))
During the process of attempting to sync a new or behind monitor with 0.60, I have seen mon/Monitor.cc: 1126: FAILED ... Mike Dawson
06:23 PM Revision 1435cb54 (ceph): Merge branch 'next' of github.com:ceph/teuthology into next
Sandon Van Ness
06:22 PM Revision 0b50cb5e (ceph): Increase IPMI attempts to try to get around Flakey IPMI.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
Sandon Van Ness
06:02 PM Revision 7fbe467f (ceph): ceph.conf: enable full debugging on the mon
Sage Weil
05:48 PM Revision 556bb649 (ceph): rgw: stream list buckets (containers) request
Fixes: #4760
Instead of retrieving the entire list of buckets in one
chunk, streamline it. This makes it so that if t...
Yehuda Sadeh
05:35 PM Revision 98cc648c (ceph): Increase IPMI attempts to try to get around Flakey IPMI.
Signed-off-by: Sandon Van Ness <sandon@inktank.com>
Reviewed-by: Sam Lang <sam.lang@inktank.com>
Sandon Van Ness
05:27 PM devops Bug #4498 (Fix Under Review): ceph-deploy osd create doesn't set up symlink for single node
Dan Mick
02:06 PM devops Bug #4498: ceph-deploy osd create doesn't set up symlink for single node
I ran across this too, and have a fix; the problem is just in the log statement. Dan Mick
05:23 PM Bug #4794 (Resolved): init-ceph: fix unique name pushing thing for ceph.conf on remote nodes
Sage Weil
10:03 AM Bug #4794 (Fix Under Review): init-ceph: fix unique name pushing thing for ceph.conf on remote nodes
Sage Weil
09:55 AM Bug #4794 (Resolved): init-ceph: fix unique name pushing thing for ceph.conf on remote nodes
Sage Weil
05:17 PM Bug #4798 (Resolved): mon: message stuck in processing loop
commit:e09efda Sage Weil
04:11 PM Bug #4798 (Fix Under Review): mon: message stuck in processing loop
Sage Weil
02:30 PM Bug #4798: mon: message stuck in processing loop
from the logs it looks like starvation from these messages is preventing any new message processing (and thus quorum)
Sage Weil
02:20 PM Bug #4798: mon: message stuck in processing loop
Okay, but of course the only reason we're seeing this is that the monitors aren't forming a quorum, right? So that's ... Greg Farnum
01:49 PM Bug #4798 (In Progress): mon: message stuck in processing loop
about to test a fix. the problem is that routed_request are resent to the new leader, even if that is us.. so it is p... Sage Weil
01:33 PM Bug #4798: mon: message stuck in processing loop
Is it actually a loop or is the command getting re-sent? Greg Farnum
01:26 PM Bug #4798 (Resolved): mon: message stuck in processing loop
... Sage Weil
05:00 PM Revision ccbc4dbc (ceph): init-ceph: fix (and simplify) pushing ceph.conf to remote unique name
The old code would only do the push once per remote node (due to the
list in $pushed_to) but would reset $unique on e...
Sage Weil
05:00 PM Bug #4784 (Closed): Two Monitors Concurrently Reporting as Leaders
Yeah, leveldb on mon.a went totally out to lunch — it tried to pass through a transaction and never finished, as best... Greg Farnum
02:57 PM Bug #4784 (In Progress): Two Monitors Concurrently Reporting as Leaders
I've got more digging to do to verify my diagnose on the listed times, but so far what I'm seeing looks like the lead... Greg Farnum
01:26 PM Bug #4784 (Need More Info): Two Monitors Concurrently Reporting as Leaders
Got Kevin on irc and am waiting for logs of when this first happens, if possible. I'll go review some of the election... Greg Farnum
04:12 PM rbd Bug #4665 (Resolved): librbd: read_iterate() can overflow its return value
commit:857c88e017f082b6ef2a81a1890baa7d20672a31 Josh Durgin
12:18 PM rbd Bug #4665 (In Progress): librbd: read_iterate() can overflow its return value
Sage Weil
03:53 PM CephFS Feature #4799 (Resolved): Client Security for CephFS
As discussed on the #ceph IRC channel with gregaf and others, I would find some added level of client security in Cep... Mike Kelly
02:11 PM Bug #4749 (Duplicate): osd: failed to recover before timeout
oh..this is a dup of #4798. the mon is stuck, so the pg stats appear to make no progress. Sage Weil
01:52 PM Bug #4749: osd: failed to recover before timeout
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-04-23_11:03:00-rados-next-testing-basic/120 Sage Weil
01:34 PM CephFS Bug #4721 (Resolved): libcephfs tests fail when using ceph-deploy
strange that it works fine on the latest next branch [0.60-624-g426e3be-1precise] ... Tamilarasi muthamizhan
01:10 PM devops Bug #4632 (Resolved): ceph-deploy: osd create command prepares disk but does not activate in centos
commit:7ad63d23d74e5bc45c44a0192ab1f49ceb68ffa7 Sage Weil
12:58 PM Bug #4792: filejournal corrupt tests broken
with commit:8402107c65874262681f27ff6018b0d405af1a94, for those of you following along via email instead of the auto-... Greg Farnum
12:54 PM Bug #4792 (Resolved): filejournal corrupt tests broken
Samuel Just
09:29 AM Bug #4792 (Resolved): filejournal corrupt tests broken
... Sage Weil
12:54 PM rgw Bug #4797 (Resolved): rgw: receiving unexpected error code while accessing an non-existing object...
The problem happens when a user has been granted the swift read-objs permission on the bucket. Yehuda Sadeh
12:54 PM Bug #4791 (Need More Info): osd/ReplicatedPG.cc: 7053: FAILED assert(r >= 0) in scan_range
This may be an ext4 bug, I suggest we ignore it until we see it again on xfs. I've removed ext4 from the rados and r... Samuel Just
09:28 AM Bug #4791 (Can't reproduce): osd/ReplicatedPG.cc: 7053: FAILED assert(r >= 0) in scan_range
... Sage Weil
12:44 PM Bug #3495 (Resolved): ceph-mon crash
Merged into next with commit:426e3be64e851947b288e43bc0ee932ae7f214bb Greg Farnum
12:07 PM rbd Bug #3737 (Resolved): Higher ping-latency observed in qemu with rbd_cache=true during disk-write
Thanks for testing it out everyone. It's now in the bobtail branch too. Josh Durgin
07:09 AM rbd Bug #3737: Higher ping-latency observed in qemu with rbd_cache=true during disk-write
I just tested the Qemu patch with a cherry-pick to Qemu 1.2 and with the wip-bobtail-rbd-backports-req-order branch a... Wido den Hollander
12:06 PM rbd Bug #4551 (Resolved): librbd: rollback broken for clones
Josh Durgin
12:06 PM rbd Bug #4525 (Resolved): hang during librbd python tests
Josh Durgin
12:05 PM rbd Bug #4364 (Resolved): ObjectCacher: inconsistency after flatten
Josh Durgin
12:05 PM rbd Bug #4531 (Resolved): ObjectCacher: read waiters for parent data during copyup get reordered, cau...
Josh Durgin
11:38 AM rbd Bug #4796 (Resolved): krbd: don't create sysfs entries for snapshots of mapped images
When an rbd image gets mapped a device entry gets created
for it under /sys/bus/rbd/devices/<id>/. Inside that
dir...
Alex Elder
10:56 AM Bug #2476: osd: watch timeout depends on operations to an object
This looks okay to me, but Sam doesn't remember it and has gotten nervous so now looking at it is in his queue for la... Greg Farnum
10:55 AM Bug #4521: mon: starting a new osd crashes all mon's
current mon directory Evan Felix
07:16 AM Bug #4521: mon: starting a new osd crashes all mon's
Evan, after a closer inspection I figured that your bug is indeed different from Sage's.
Can you confirm you ran t...
Joao Eduardo Luis
10:29 AM CephFS Bug #4742: mds: stuck clientreplay request
Attaching mds log from mds stuck on clientreplay. Looks like setattr is gets put on the inode waiting list by the lo... Sam Lang
09:23 AM rgw Bug #4755: rgw: assumption of signed char
Adam, I pushed a different fix to wip-4755 branch. Can you test it and make sure that it fixes the issue for you? Yehuda Sadeh
08:53 AM Bug #4785 (Need More Info): rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r-...
Can you confirm what version the OSDs are running? My first guess is they have v0.60 or older code that doesn't have... Sage Weil
03:03 AM Bug #4785: rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps[r->snaps.s...
PS Same without "--snap backup.tmp" - to active image only. Denis kaganovich
03:00 AM Bug #4785 (Resolved): rbd export-diff: librados/snap_set_diff.cc: 40: FAILED assert(b == r->snaps...
Creating backups. Yesterday created snapshots "backup" for every rbd image. Everyday creating snapshot "backup.tmp" a... Denis kaganovich
08:06 AM rbd Bug #4774: krbd: don't create /dev entries for backing devices
I'm unfortunately finding what I fought with last year when
working with the initialization and teardown of rbd devi...
Alex Elder
05:30 AM Revision 7ad63d23 (ceph): ceph-disk: OSD hotplug fixes for Centos
Two fixes for Centos 6.3 and other systems with udev versions
prior to 172. The disk peristant name using the GPT UU...
Gary Lowell
04:03 AM Revision 3dd9574b (ceph): doc: Usage requires --num_osds.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
04:02 AM Revision b71ec9c2 (ceph): doc: Added some detail. Calculating PGs, maps; reorganized a bit.
fixes: #2968
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
03:59 AM Revision aa16700d (ceph): Merge branch 'next'
Sage Weil
03:59 AM Revision bbcba292 (ceph): set 'filestore flush min = 0' for all ffsb jobs
Until we fix #4579 Sage Weil

04/22/2013

11:18 PM Revision f42fc0e4 (ceph): mon: MDSMonitor: tighter leash on cross-proposals to the osdmon
Fixes: #3495
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
11:18 PM Revision b73ef010 (ceph): mon: [MDS]Monitor: remove 'stop_cluster' and 'do_stop()'
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
10:22 PM Revision 25019803 (ceph): Merge pull request #234 from ceph/wip-4758
Fixes #4758.
Reviewed-by: Greg Farnum <greg@inktank.com>
Gregory Farnum
10:20 PM Revision fa77e1e7 (ceph): mon: PaxosService: add request_proposal() to perform cross-proposals
Instead of allowing services to directly use 'propose_pending()' on
other services, we instead add two new functions:...
Joao Eduardo Luis
10:20 PM Revision a634bb17 (ceph): mon: PaxosService: is_writeable() depends on being ready to be written to
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
10:20 PM Revision 98e23980 (ceph): mon: PaxosService: is_readable/writeable() depending on is_active()
Instead of depending on individual conditions.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
10:20 PM Revision b29a5b15 (ceph): mon: PaxosService: consider is_recovering() on is_writeable()
A service is never writeable while it's recovering.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
10:12 PM Revision 59d6953c (ceph): mon: set threshold to periodically stash_full
Set an interval to periodically write a full copy of the map that is lower
than the trim point (which is generally a ...
Sage Weil
10:12 PM Revision b33fae4e (ceph): mon: commit LogSummary on every message
This moves our version pointer up so that we don't re-log (by re-consuming)
log messages to /var/log/ceph/ceph.log on...
Sage Weil
10:11 PM Revision 5792be81 (ceph): Merge pull request #230 from ceph/wip-mon-paxos-fixes
Wip mon paxos fixes
Reviewed-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
10:05 PM Revision c200cdb0 (ceph): Merge pull request #225 from ceph/wip-4543
Fixes #4543
Reviewed-by: Greg Farnum <greg@inktank.com>
Gregory Farnum
10:03 PM Revision 660752a2 (ceph): doc: Added users to Getting Started.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:55 PM Revision 1164345a (ceph): ceph-mon: Attempt to obtain monmap from several possible sources
In order of interest/priority:
- our latest monmap version
- a backup monmap version created during sync start, ...
Joao Eduardo Luis
09:53 PM Revision 9ba32404 (ceph): mon: Monitor: backup monmap prior to starting a store sync
If by fate we end up attempting a store sync after failing at
least one before, we might not have a monmap to read fr...
Joao Eduardo Luis
09:01 PM Documentation #3674 (In Progress): Deployment documentation is confusing
John Wilkins
08:44 PM Revision de5d1da8 (ceph): rgw: don't send tail to gc if copying object to itself
Fixes: #4776
Backport: bobtail
Need to make sure that when copying an object into itself we don't
send the tail to th...
Yehuda Sadeh
08:36 PM Revision cec5282b (ceph): Merge pull request #232 from ceph/wip-4710
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
08:23 PM Bug #2476: osd: watch timeout depends on operations to an object
Greg, can you please review this wip branch? Ian Colle
08:01 PM Revision 86ad464f (ceph): Merge branch 'next'
Sage Weil
08:01 PM Revision 48d89c61 (ceph): ceph-deploy: fix stop command
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:58 PM Revision 70e1e47d (ceph): Merge pull request #233 from ceph/wip-mon-idempotent
Wip mon idempotent
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Sage Weil
07:50 PM Revision 85fd2ca2 (ceph): mon: make 'osd pool rmsnap ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:49 PM Revision 43d62c00 (ceph): mon: make 'osd pool mksnap ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:48 PM Revision 08e3ec11 (ceph): mon: make 'osd blacklist rm ...' idempotent
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:45 PM Bug #4784 (Closed): Two Monitors Concurrently Reporting as Leaders
There appears to be a bug in the new Monitor Paxos code in version 0.59 and 0.60. Over the past several days, I have ... Mike Dawson
07:41 PM Revision 5926ffa5 (ceph): rbd: only set STRIPINGV2 feature when needed
Only set the STRIPINGV2 feature if the striping parameters are non-default.
Specifically, fix the case where the pass...
Sage Weil
07:38 PM Revision 5446218f (ceph): rbd: fix feature display for --info
Only include the feature if it is set!
Backport: bobtail
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:41 PM Revision 568101fa (ceph): rbd: avoid clobbering return value with udevadm settle
Fixes: #4707
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:32 PM Revision 8db9d0a2 (ceph): FileJournal: a valid entry after invalid entry =/=> corrupt
Out of order journal entry writes using aio may cause entry
n+2 to be written prior to n. This does not indicate
cor...
Samuel Just
06:19 PM devops Bug #4769 (Resolved): centos reimaging script should also include ntpd restart
Imager has been updated to run ceph-qa-chef after imaging for CentOS like ubuntu so this should be good in the future. Sandon Van Ness
11:54 AM devops Bug #4769: centos reimaging script should also include ntpd restart
Alrighty. I will work on getting the imager to automatically run this for centOS so its not something you have to thi... Sandon Van Ness
11:45 AM devops Bug #4769: centos reimaging script should also include ntpd restart
oops, i did not run ceph-qa-chef on the newly installed centos systems. Tamilarasi muthamizhan
11:10 AM devops Bug #4769: centos reimaging script should also include ntpd restart
Were you seeing this not happen after running ceph-qa-chef? On ubuntu this is handled on ceph-qa-chef (not imaging) a... Sandon Van Ness
05:53 PM rbd Bug #3664: osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
Nevermind, the logs were saved after all. Hooray! Josh Durgin
05:52 PM rbd Bug #3664: osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
Unfortunately the logs aren't there anymore (they weren't saved when a power failure restarted the machine running te... Josh Durgin
05:16 PM rbd Bug #3664 (In Progress): osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
Looking at this again, hopefully will get it fixed tomorrow. Josh Durgin
05:22 PM rgw Feature #3671: Request for x-amz-grant-full-control support
Merged into master awhile ago, ID eb0f49d4b68062701b842b9cfdde708868769bef caleb miles
05:21 PM rgw Feature #3670: Request for bucket-owner-read and bucket-owner-full-control grants
caleb miles wrote:
> Committed to master awhile ago, ID e345dfe04a64fcd0d37c9e0717b6714038c302ae
caleb miles
05:13 PM rgw Feature #3670 (Resolved): Request for bucket-owner-read and bucket-owner-full-control grants
Committed to master awhile ago, ID eb0f49d4b68062701b842b9cfdde708868769bef caleb miles
05:11 PM rbd Bug #4774 (In Progress): krbd: don't create /dev entries for backing devices
This is what I am now working on; just marking it so. Alex Elder
05:09 PM rbd Bug #3847 (Resolved): rbd: figure out correct byte order for watch version
The following has been committed to the "testing" branch
of the ceph-client git repository:
42c6070 libceph: fix ...
Alex Elder
05:06 PM rbd Feature #4709 (Resolved): krbd: support stripingv2 images that don't require I/O path changes
The following has been committed to the "testing" branch
of the ceph-client git repository:
09186dd rbd: get and ...
Alex Elder
05:04 PM rbd Bug #4773 (Resolved): rbd: have rbd_obj_method_sync() return transfer count
The following have been committed to the "testing" branch of
the ceph-client git repository.
3ad6cbd9 libceph: ad...
Alex Elder
04:54 PM devops Feature #4667: ceph-deploy update
I have a start which installs based on the currently-configured repos (of which, at the moment,
none actually contai...
Dan Mick
03:27 PM devops Feature #4667 (In Progress): ceph-deploy update
Dan Mick
04:53 PM rbd Feature #4724 (Resolved): krbd: handle layered I/O correctly when the child has been resized
The following has been committed to the "testing" branch
of the ceph-client git repository:
64548e0 rbd: enforce ...
Alex Elder
04:39 PM Bug #4783 (Resolved): After repairs finish a new deep-scrub should be avoided

The fix for #4778 needs to initiate a deep-scrub after repairs are complete to clear the PG_STATE_INCONSISTENT. We...
David Zafman
04:14 PM Revision 9b953aa4 (ceph): radosgw: Fix duplicate 'Content-Type' when using 'response-content-type'
Signed-off-by: Sylvain Munaut <s.munaut@whatever-company.com>
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com>
Sylvain Munaut
04:08 PM Revision 4b9a2a39 (ceph): mon: MonmapMonitor: add function to obtain latest monmap
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
04:08 PM Revision 41b874cb (ceph): mon: PaxosService: add 'exists_key/version' helper functions
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
03:47 PM devops Bug #4767 (Pending Backport): ceph-deploy: install should default to picking cuttlefish when cutt...
wip-4767, ready to merge right when cuttlefish is released. Sage Weil
03:39 PM Bug #4579: kclient + ffsb workload makes osds mark themselves down
this is not new behavior, so it is not a cuttlefish blocker. feature #4782 is the current proposed fix. Sage Weil
03:37 PM Feature #4782 (Resolved): osd: build writeback model to replace async flusher
build a model that includes
- dirty bytes value
- dirty files values
- a cost function of bytes and inodes
...
Sage Weil
03:34 PM Bug #4552 (In Progress): osd: temporarily hung box marks down peers
Sage Weil
03:22 PM Bug #4758 (Resolved): monitor: going through all incrementals on startup
Commit:25019803507114e8ab2082d2c44af6588e5aafc2 Greg Farnum
03:01 PM Bug #4758 (Fix Under Review): monitor: going through all incrementals on startup
Sage Weil
03:16 PM rbd Bug #4242: krbd: xfstest 259 failure (FS size near 4TB)
I just hit this again with the current testing branch.
testing e7fce31 rbd: issue a copyup for layered writes
Alex Elder
03:08 PM Bug #4543 (Resolved): mon: corrupted store if monitor dies mid-sync
commit: c200cdb08108ae901c4c6f3625d55da707a38e5a Greg Farnum
11:28 AM Bug #4543 (In Progress): mon: corrupted store if monitor dies mid-sync
Whoops, wrong one before. Greg Farnum
11:28 AM Bug #4543 (Need More Info): mon: corrupted store if monitor dies mid-sync
Greg Farnum
11:28 AM Bug #4543: mon: corrupted store if monitor dies mid-sync
New comments; should be quick to address; have you tested it? Greg Farnum
09:15 AM Bug #4543 (Fix Under Review): mon: corrupted store if monitor dies mid-sync
Revised version and comments on github. Joao Eduardo Luis
02:59 PM rbd Feature #4550 (In Progress): Create Qemu+RBD rpm package for RHEL+CentOS 6.3 on ceph.com
oops, didn't mean to change the status Josh Durgin
02:55 PM rbd Feature #4550: Create Qemu+RBD rpm package for RHEL+CentOS 6.3 on ceph.com
I'd suggest starting with the latest version of qemu-kvm for centos 6 (the c6 branch of https://nazar.karan.org/summa... Josh Durgin
12:24 PM rbd Feature #4550 (In Progress): Create Qemu+RBD rpm package for RHEL+CentOS 6.3 on ceph.com

It looks like I want to grab the source for the qemu-kvm-0.12.1.2-2.295 package that ships with centos 6.3, rebuild...
Anonymous
02:55 PM Bug #4765 (Rejected): monitor: sets global version feature but upgrades might not actually have a...
A-hah! We only need the global versions on those updates which will be involved in syncing during/following the cuttl... Greg Farnum
02:36 PM rgw Bug #4776 (Resolved): S3 copy part corrupt files >512kb
Fixed, commit:de5d1da810732ee48f41e8be18257053d862301b. Merged into next, bobtail. Yehuda Sadeh
09:31 AM rgw Bug #4776 (Need More Info): S3 copy part corrupt files >512kb
Sage Weil
01:38 AM rgw Bug #4776 (Resolved): S3 copy part corrupt files >512kb
We are using radosgw and s3 API and we recently needed to update metadata on some files.
So we used the copy part of...
Guilhem Lettron
01:40 PM rbd Bug #4710 (Resolved): rbd: STRIPINGV2 feature specified by default for format 2 images
Sage Weil
12:46 PM rbd Bug #4710 (Fix Under Review): rbd: STRIPINGV2 feature specified by default for format 2 images
Sage Weil
11:42 AM rbd Bug #4710 (In Progress): rbd: STRIPINGV2 feature specified by default for format 2 images
Sage Weil
11:29 AM rbd Bug #4710: rbd: STRIPINGV2 feature specified by default for format 2 images
Per Josh, this is easy fix, let's get it into Cuttlefish. Ian Colle
01:31 PM Bug #4778 (In Progress): scrub clears inconsistent flag set by deep scrub
David Zafman
10:22 AM Bug #4778: scrub clears inconsistent flag set by deep scrub
Can we fix this without adding a separate deep-scrub inconsistent flag? (and is it feasible to do that before Cuttlef... Greg Farnum
06:45 AM Bug #4778 (Resolved): scrub clears inconsistent flag set by deep scrub
On my 0.56.4 cluster, I have some pgs marked as inconsistent because of an omap inconsistency that .4 is able to dete... Faidon Liambotis
12:46 PM rbd Feature #3419 (Resolved): krbd: copy-up on write to clone
The following have been committed to the ceph-client
"testing" branch.
b15a1df rbd: implement full object parent ...
Alex Elder
10:08 AM rbd Feature #3419: krbd: copy-up on write to clone
The following have been committed to the ceph-client
"testing" branch. Still waiting on reviews for the
last two.
...
Alex Elder
11:41 AM rbd Bug #4707 (Resolved): rbd CLI: bad error code masked by udevadm_settle
commit:568101fa72e29ee960fcf3d704f04edfd50bd072 Sage Weil
11:39 AM rbd Bug #4707 (In Progress): rbd CLI: bad error code masked by udevadm_settle
Sage Weil
11:27 AM rbd Bug #4707: rbd CLI: bad error code masked by udevadm_settle
Let's try to get this into Cuttlefish. Ian Colle
11:35 AM Bug #4736 (Resolved): journal Entry at pos 83251200 valid, there are missing sequence numbers pri...
Created new task for actual solution. Samuel Just
10:19 AM Bug #4736 (In Progress): journal Entry at pos 83251200 valid, there are missing sequence numbers ...
Sage Weil
11:32 AM Feature #4781 (New): Journal entries should record last known committed entry
This can be used to detect more corrupt journal cases. Samuel Just
11:31 AM rbd Bug #4665: librbd: read_iterate() can overflow its return value
Per Josh, this is another easy fix, let's get it into Cuttlefish. Ian Colle
10:57 AM Bug #4757: ceph-disk-prepare will not use all available space with >2TB hard drives
My suspicion is that Tv ran across this bug, and some version of gdisk wanted to reorder
partitions based on starti...
Dan Mick
10:20 AM Bug #4772: (deep?) scrubbing scheduling misses PGs
Yes it was and there are no indications of flapping OSDs that I can see.
I think I found the same pgs being scrubb...
Faidon Liambotis
09:33 AM Bug #4772 (Need More Info): (deep?) scrubbing scheduling misses PGs
Scrubbing skips pgs that are degraded... was the cluster active+clean when you did the scheduling? Sage Weil
10:04 AM rbd Bug #4762 (Resolved): libceph: fix two messenger bugs
The following has been committed to the ceph-client "testing"
branch:
68423cc libceph: fix two messenger bugs
Alex Elder
09:52 AM Bug #4780: RBD-Enabling Discard Trim
This strictly speaking isn't true "Note that this uses the IDE driver. The virtio driver does not support discard." p... John Wilkins
09:48 AM Bug #4780 (Resolved): RBD-Enabling Discard Trim
We need to provide examples for configuring libvirt, since we now support SCSI. Virtio and SCSI should be the main ex... John Wilkins
09:35 AM Bug #4779 (Resolved): The ceph command and crushtool have differing views on valid characters for...
Using osd crush move, I can create a bucket with a '/' in the name.
If I then get a crush map, decompile it, and att...
Mike Bryant
09:25 AM devops Bug #4752 (Resolved): ceph-create-keys doesn't work on upgraded clusters
Further update from Dan indicated that EACCES was returned on authentication error after all. I tested the changes b... Anonymous
09:09 AM rgw Bug #4124 (Resolved): Using "response-content-type" arguments causes duplicated Content-Type in r...
Merged in, commit:9b953aa4100eca5de2319b3c17c54bc2f6b03064 Yehuda Sadeh
05:34 AM rbd Bug #4777 (Resolved): krbd: verify a few things in the zeroing routines
The kernel rbd driver has a function zero_bio_chain() that's
used to zero out the data in a bio list starting at a g...
Alex Elder
05:08 AM Revision 1a8b30ef (ceph): ceph-create-keys: Don't wait if permission denied
If get or create keys returns permssion denied, exit
gracefully instead of retrying.
Signed-off-by: Gary Lowell <ga...
Gary Lowell
03:02 AM rbd Bug #3737: Higher ping-latency observed in qemu with rbd_cache=true during disk-write
Ooops, sorry...,
was a bit misleaded, cause "cache=writeback" was still in the config file.
Oliver.
Oliver Francke

04/21/2013

06:11 PM rgw Support #4775 (Resolved): Why I can created an exsiting bucket
There is a bucket named abcdef in the ceph. It contains 5 objects.
Then I create a new bucket with the same name a...
manx suo
02:56 PM rbd Bug #3847 (Fix Under Review): rbd: figure out correct byte order for watch version
The following has been posted for review:
[PATCH] libceph: fix byte order mismatch
It is available in the "revi...
Alex Elder
02:39 PM rbd Bug #3847 (In Progress): rbd: figure out correct byte order for watch version
libceph: fix byte order mismatch
A WATCH op includes an object version. The version that's supplied
is incorrect...
Alex Elder
02:24 PM rbd Bug #4774 (Resolved): krbd: don't create /dev entries for backing devices
Currently when a layered rbd device gets mapped, the
snapshot device that is its parent gets probed in the
same way...
Alex Elder
02:21 PM rbd Feature #4709 (Fix Under Review): krbd: support stripingv2 images that don't require I/O path cha...
The following has been posted for review and is available
in the "review/wip-stripe-v2" branch of the ceph-client
g...
Alex Elder
06:50 AM rbd Feature #4709 (In Progress): krbd: support stripingv2 images that don't require I/O path changes
Starting work on this. Alex Elder
02:20 PM rbd Bug #4773 (Fix Under Review): rbd: have rbd_obj_method_sync() return transfer count
The following have been posted for review, and are
available in the "review/wip-stripe-v2" branch of
the ceph-clien...
Alex Elder
10:28 AM rbd Bug #4773: rbd: have rbd_obj_method_sync() return transfer count
I've implemented these fixes and will post them for
review after I've done some better testing.
I also made a few...
Alex Elder
07:59 AM rbd Bug #4773 (Resolved): rbd: have rbd_obj_method_sync() return transfer count
Callers of rbd_obj_method_sync() don't know how many bytes of data
got returned by the class method call. As a resu...
Alex Elder
08:54 AM Bug #4757: ceph-disk-prepare will not use all available space with >2TB hard drives
We should also get more details about what the original problem was before just assuming it's fixed. I bet Mark has T... Greg Farnum
06:12 AM CephFS Bug #4753: mds/Locker.cc: 4167: FAILED assert(0)
Additional: I resolve it runtime, changing assert(0) to some lock (IMHO first in this case) on one node and found for... Denis kaganovich

04/20/2013

10:36 PM rbd Feature #4724 (Fix Under Review): krbd: handle layered I/O correctly when the child has been resized
The following has been posted for review:
[PATCH] rbd: enforce parent overlap
Alex Elder
06:10 PM Revision 1fa719d5 (ceph): doc: Aesthetic improvements. Removed unnecessary graphic and overrode m...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:08 PM Revision 3749ffe6 (ceph): doc: Added a scenario to PG troubleshooting.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:06 PM Revision cf915941 (ceph): doc: Changed usage to "bucket-name". Description was okay.
fixes: #4102
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
11:14 AM Documentation #4102 (Resolved): doc: in crush-map-rules, wrong spec for step take
http://ceph.com/docs/next/rados/operations/crush-map/ Should appear in master within a week. John Wilkins
10:32 AM rbd Feature #3418 (Resolved): krbd: write path (layering)
The following have been committed to the "testing" branch
of the ceph-client git repository:
a065a13 libceph: kil...
Alex Elder
07:47 AM Bug #4772 (Can't reproduce): (deep?) scrubbing scheduling misses PGs
I have a 144 OSD (135 in) cluster, partioned in ~10 pools and 16760 pgs in total. The cluster runs Ceph 0.56.4 using ... Faidon Liambotis
01:56 AM Feature #4771 (Rejected): Snippet / included configuration
When managing large systems via Puppet or some other configuration tool it could be very useful to have "snippet" con... Wido den Hollander
01:23 AM Revision 861ac497 (ceph): added ceph.client.admin.keyring on the client to run rbd and rados tests
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
01:14 AM Revision c4f8adca (ceph): Merge branch 'wip-4201' into next
Reviewed-by: Samuel Just <sam.just@inktank.com> David Zafman
01:13 AM Revision 2bbac6e4 (ceph): added extra packages required by ceph-deploy for rbd and rados tests
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
01:11 AM Revision 870f47c7 (ceph): tools/ceph-filestore-dump: Implement remove, export and import
Change local names to be clearer
Break real_log() into common function get_log()
Move infos_oid, biginfo_oid and log_...
David Zafman
12:11 AM Revision 481c532f (ceph): Merge branch 'wip_4662_clean' into next
Reviewed-by: Greg Farnum <greg@inktank.com> Samuel Just
12:10 AM Revision 6ef0f162 (ceph): PG: check for pg change in ~FlushState
Fixes: #4662
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:10 AM Revision 0e155550 (ceph): ReplicatedPG::_applied_recovered_object*: don't queue scrub if deleting
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:10 AM Revision 88d9ee1d (ceph): ReplicatedPG::_finish_mark_all_unfound_lost: only requeue if !deleting
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:10 AM Revision b8cb9d7e (ceph): PG: bail if deleting in _finish_recovery
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
12:10 AM Revision 75cb55b4 (ceph): AsyncReserver: delete context in cancel_reservation
Fixes: #4662
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
12:08 AM Revision 460db089 (ceph): osd: Add flag to force version write in _write_info()
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:08 AM Revision 37d2fe2c (ceph): osd: Make clear_temp() public for use by remove
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
12:08 AM Revision d73b9fbe (ceph): tools/ceph-filestore-dump: Error messages lost because stderr is closed
Use cout instead of cerr for command errors
Use cerr for debug mode because stderr is avail
Output map_epoch in debug...
David Zafman
12:08 AM Revision da39f911 (ceph): osd: Create static PG::_write_log() function
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman

04/19/2013

10:23 PM Revision ad845e61 (ceph): OSDMonitor: pg split is no longer experimental
Fixes: #4711
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
10:16 PM Revision 095dc4f6 (ceph): Merge pull request #228 from alram/next
Fix journal partition creation
Reviewed-by: Sage Weil <sage@inktank.com>
Sage Weil
10:11 PM Revision 56619ab9 (ceph): Fix journal partition creation
With OSD sharing data and journal, the previous code created the
journal partiton from the end of the device. A uint3...
Alexandre Marangone
09:37 PM Revision fe9d3260 (ceph): rbd: fix qa tests to use --allow-shrink
Fixes: #4763
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:34 PM Revision b6b4ebed (ceph): osd: an interval can't go readwrite if its acting is empty
Let's not forget that min_size can be zero.
Fixes: #4159
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked ...
Sage Weil
09:33 PM Revision 055d746c (ceph): mon: restrict pool size to 1..10
See: #4159
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 30b8d653751acb4bc4be5ca611f154e19af...
Sage Weil
09:28 PM Revision f114fdc4 (ceph): Merge pull request #227 from ceph/wip-4574
Reviewed-by: Greg Farnum <greg@inktank.com> Gregory Farnum
08:25 PM Linux kernel client Feature #4770 (Resolved): krbd: consider including write data with layered existence check
Josh suggested we could pass along the data to be written
along with the STAT op sent to the osd for a target object...
Alex Elder
08:08 PM Revision c073bd25 (ceph): init-ceph: do not stop start on first failure
When starting we often loop over many daemon instances. Currently we stop
on the first error and do not try to start...
Sage Weil
08:05 PM Revision d395aa52 (ceph): init-ceph: do not stop start on first failure
When starting we often loop over many daemon instances. Currently we stop
on the first error and do not try to start...
Sage Weil
07:34 PM Bug #4757: ceph-disk-prepare will not use all available space with >2TB hard drives
Really confused by that state; journal should have been partition 2 at the end of the drive, so more is wrong than ju... Dan Mick
03:25 PM Bug #4757 (Resolved): ceph-disk-prepare will not use all available space with >2TB hard drives
Alexandre Marangone
10:51 AM Bug #4757: ceph-disk-prepare will not use all available space with >2TB hard drives
I ran ceph-disk-prepare with the patch for a disk of 3TB and a disk of 10GB. Multiple times, with and without --zap-d... Alexandre Marangone
09:54 AM Bug #4757: ceph-disk-prepare will not use all available space with >2TB hard drives
hrm, that comment came from tv, so who knows what he was seeing. can you do some testing with the change and see if ... Sage Weil
09:50 AM Bug #4757 (Resolved): ceph-disk-prepare will not use all available space with >2TB hard drives
When sharing the journal with the OSD data, ceph-disk-prepare will not use all the available disk space with disks >2... Alexandre Marangone
07:26 PM Revision 9a7d1f51 (ceph): mon: Monitor: fix timechecks get_health clobbering overall status
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
07:20 PM rbd Feature #3419 (Fix Under Review): krbd: copy-up on write to clone
The following has been posted for review.
This set of patches culminates in providing layered
write functionality...
Alex Elder
07:19 PM rbd Bug #4762 (Fix Under Review): libceph: fix two messenger bugs
The following has been posted for review:
[PATCH] libceph: fix two messenger bugs
Alex Elder
01:10 PM rbd Bug #4762 (Resolved): libceph: fix two messenger bugs
While getting copyup functionality working I found two
bugs in the messenger that previously were not triggered.
...
Alex Elder
07:16 PM Revision aa0d5f39 (ceph): mon: fix health monitor calls
- unconditionally call get_health, regardless of formatter *
- return a meaningful health status code
Signed-off-by:...
Sage Weil
07:03 PM Revision be4807f5 (ceph): global: call observers (and start logging) in global_init
Call observers so that the logging infrastructure gets initailized and we
start logging. Otherwise, unless a default...
Sage Weil
06:29 PM Revision 52d8240a (ceph): osd: Add OSD::make_infos_oid() as common function to create oid
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:29 PM Revision 76505c28 (ceph): osd: Create new static function PG::_write_info() for use by PG import
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:29 PM Revision 5ffb3ef4 (ceph): filestore, osd: Fixes to comform to programming guidelines
Signed-off-by: David Zafman <david.zafman@inktank.com> David Zafman
06:26 PM Revision fa89cfd2 (ceph): mon: QuorumService: return health status on get_health()
This allows us to return the appropriate overall health status on
Monitor::get_health().
Fixes: 4574
Signed-off-by:...
Joao Eduardo Luis
06:21 PM Feature #4201 (Resolved): osd: data loss: pg export/import/remove
commit:870f47c7cb24b5da7a7e3a5ba45f140e268c0754 David Zafman
06:06 PM Revision 78c9db88 (ceph): OpRequest: don't maintain history if the OSD is shutting down
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:05 PM Revision 1493e7db (ceph): osd/: optionally track every pg ref
This involves three pieces:
For intrusive_ptr type references, we use TrackedIntPtr instead. This
uses get_with_id ...
Samuel Just
06:05 PM devops Bug #4769 (Resolved): centos reimaging script should also include ntpd restart
The reimaging script, we currently have for centos should include 'restart ntpd' at the end of the script as the ntpd... Tamilarasi muthamizhan
06:00 PM Revision 8fe1b9d5 (ceph): ReplicatedPG: use ReplicatedPGRef for C_OSD_OpApplied
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision f03ba5a2 (ceph): ReplicatedPG: use ReplicatedPGRef for C_OSD_OpCommit
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 4090eff8 (ceph): ReplicatedPG: use ReplicatedPGRef for C_PG_MarkUnfoundLost
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision ec6f71bd (ceph): ReplicatedPG: use the ReplicatedPGRef typedef
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 66c007fb (ceph): common/: add tracked_int_ptr.hpp
TrackedIntPtr acts like intrusive_ptr, but is able to
track a ref id.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
06:00 PM Revision 220c6512 (ceph): ReplicatedPG: add ReplicatedPGRef
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision b021036b (ceph): PG,ReplicatedPG: move intrusive_ptr declarations to top
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision ce647753 (ceph): PG: do not put() in scrub() if pg is deleting
scrub() no longer handles the put, this call
must have been missed.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
06:00 PM Revision 8bd89e12 (ceph): PG: use PGRef in C_PG_ActivateCommitted
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 2f9a35ac (ceph): PG: use PGRef for C_PG_FinishRecovery
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision f45a5413 (ceph): PG: use PGRef for FlushState
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 0b7795ac (ceph): OSD: use PGRef in consume_map
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision c2127a11 (ceph): PG: use PGRef in QueuePeeringEvt
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 1c2b66cf (ceph): OSD: use PGRef in handle_pg_stats_ack
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision c04c3e59 (ceph): OSD: use PGRef in handle_pg_remove
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 07a80ee3 (ceph): FileStore::_do_clone_range: _do_copy_range encodes error in return, not...
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:00 PM Revision 016e975a (ceph): FileStore::_do_copy_range: read(2) might return EINTR
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
05:41 PM devops Bug #4767 (Resolved): ceph-deploy: install should default to picking cuttlefish when cuttlefish i...
currently, ceph-deploy install defaults to bobtail but when cuttlefish is ready, it should default to cuttlefish.
...
Tamilarasi muthamizhan
05:20 PM Revision af5a9b37 (ceph): Merge pull request #224 from ceph/wip-mon-crush
Wip mon crush
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Sage Weil
05:20 PM devops Feature #4766 (Rejected): ceph-deploy: commands should continue to execute the next argument in c...
currently, when trying to create multiple osds using the "osd create" command, the command returns failure when the f... Tamilarasi muthamizhan
05:14 PM Bug #4662 (Resolved): osd/OSD.h: 809: FAILED assert(peering_queue.empty()) on shutdown
481c532ff361b21e044621ac13c8f00ebfb1b3dc Samuel Just
05:06 PM Bug #4747 (Can't reproduce): Upgrade monitors from argonaut->bobtail->next fails w/"Existing stor...
Awesome. I made #4758 for the fast-convert story I mentioned. Greg Farnum
04:46 PM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
It's a manual process so I could have missed something along the way. If I used upgrade instead of dist-upgrade for ... Ken Franklin
01:41 PM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
Shoot; it looks like this is actually just checking the on-disk features CompatSet; it's not iterating through the ac... Greg Farnum
10:48 AM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
I'm not currently working on this, so I'm unassigning it from me (but still watching) in case someone else wants to p... Joao Eduardo Luis
10:00 AM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
It pretty much has to, unless it were given separate logic to figure out which commits "matter", which would be not g... Greg Farnum
09:55 AM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
hmm, could the problem may be that it wants gv values for *everything* in the mon store, not just the recent commits? Sage Weil
09:52 AM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
I was able to recreate this twice. The first time included running functional tests in between each installation ie.... Ken Franklin
09:44 AM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
Greg Farnum wrote:
> I believe this is about the pre-Bobtail change which started adding global ordering values to t...
Joao Eduardo Luis
04:53 PM Bug #4765 (Rejected): monitor: sets global version feature but upgrades might not actually have a...
We don't check on doing a store conversion that we actually have GV values, only that they've been enabled on the mon... Greg Farnum
04:30 PM Revision 5e4b8bc4 (ceph): config: clarify 'mon osd down out subtree limit'
Clarify the description; this is the subtree type that we won't mark out
if it is all down, but anything less than it...
Sage Weil
03:25 PM Bug #4711 (Resolved): mon: remove --enable-experimental-feature on set pg_num
Samuel Just
03:19 PM Bug #4764 (Can't reproduce): ceph -w sometimes does not reflect clean pgs
ceph -s reports all pgs clean, but ceph -w does not include an entry for it.
ceph3/src [wip_4711?] » ./ceph -w
...
Samuel Just
03:08 PM Bug #4749 (In Progress): osd: failed to recover before timeout
Sage Weil
02:39 PM Bug #4699 (Resolved): osd: crash when looking at a map changing pool size from 0 to 2

Cherry-picked changes for bug #4159
commit:80682c88ef71ca4977df83f8d9b82310a76cf93d
commit:aa91dbf11deb02a25f7ff9...
David Zafman
02:38 PM rbd Bug #4763 (Resolved): rbd test scripts should use --allow-shrink flag when resizing rbd img
Sage Weil
01:58 PM rbd Bug #4763 (In Progress): rbd test scripts should use --allow-shrink flag when resizing rbd img
Sage Weil
01:28 PM rbd Bug #4763 (Resolved): rbd test scripts should use --allow-shrink flag when resizing rbd img
The existing rbd test script rbd/copy.sh fails with the recent inclusion of --allow-shrink flag for the resize comman... Tamilarasi muthamizhan
02:30 PM Bug #4574 (Resolved): mon: HEALTH_OK even if data health is HEALTH_WARN
Looks good to me; I tested and merged. commit:f114fdc40a0aac9f38745c50dce18d186e657acd Greg Farnum
12:23 PM Bug #4574 (Fix Under Review): mon: HEALTH_OK even if data health is HEALTH_WARN
Joao Eduardo Luis
12:22 PM Bug #4574: mon: HEALTH_OK even if data health is HEALTH_WARN
proposed fix on wip-4574 Joao Eduardo Luis
01:28 PM Bug #4543 (In Progress): mon: corrupted store if monitor dies mid-sync
Comments on Github; and this is one that we'll definitely need to test before merging. Greg Farnum
09:30 AM Bug #4543 (Fix Under Review): mon: corrupted store if monitor dies mid-sync
wip-4543 has a proposed fix -- haven't tested it yet. Joao Eduardo Luis
01:06 PM Bug #2545 (Resolved): init-ceph: stops if one instance fails to start
commit:d395aa521e8a4b295ed2b08dd7cfb7d9f995fcf7 Sage Weil
12:47 PM Bug #2545: init-ceph: stops if one instance fails to start
Looks good. That was a lot simpler than I expected. Anonymous
09:20 AM Bug #2545: init-ceph: stops if one instance fails to start
Gary, Can you please review wip-sysvinit? Ian Colle
09:19 AM Bug #2545 (Fix Under Review): init-ceph: stops if one instance fails to start
wip-sysvinit Sage Weil
12:25 PM Bug #4748 (In Progress): mon: failed assert in OSDMonitor::build_incremental
Joao Eduardo Luis
09:48 AM Bug #4748: mon: failed assert in OSDMonitor::build_incremental
Possibly related to #4521 Joao Eduardo Luis
12:24 PM Bug #4228: mon uses pick_addresses if invoked with mkfs or without mon addr; fails if no cluster ...
Currently not working on this one, so if anyone wants to pick it up go for it. Otherwise, I'll get back to it as soon... Joao Eduardo Luis
12:04 PM Bug #4676 (Resolved): daemon logs aren't opened until daemonize
commit:be4807f5b88115bc5a553ecee6f42c0c7d7cfbe2 Sage Weil
12:03 PM Bug #4676 (Fix Under Review): daemon logs aren't opened until daemonize
wip-log Sage Weil
11:33 AM Bug #4731: PG: don't write out pg epoch on every map activation
0d6881c8 does seem to do the trick. Not sure yet whether we want this in bobtail. Shouldn't be a problem in cuttlef... Samuel Just
11:29 AM Bug #4009 (Duplicate): osd reports map e6 wrongly marked me down
I think this is 4579 Samuel Just
11:26 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
I opened 3 issues for this problem:: #4759, #4760, #4761. These will make it so that it'll be possible to list contai... Yehuda Sadeh
11:02 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
Corrected by a colleague of mine: Swift does paginate, at 10.000 items. It would help but not that much in my case as... Faidon Liambotis
07:43 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
Swift doesn't seem to paginate this. I haven't looked at Swift's internals for this but I doubt it lists all of my co... Faidon Liambotis
07:32 AM rgw Bug #4754: GET/HEAD on account is extremely slow, times out
Right. There are two issues that play together here. One is that we don't paginate the request, the second one is tha... Yehuda Sadeh
06:22 AM rgw Bug #4754 (Resolved): GET/HEAD on account is extremely slow, times out
Doing a GET or a HEAD on /swift/v1 times out, even after increasing the timeout to 5 minutes. It's hard to know the e... Faidon Liambotis
11:24 AM Bug #4698: osd suicide timed out after 150
This appears to be a filesystem problem with ext4. Samuel Just
08:52 AM Bug #4698: osd suicide timed out after 150
Any update on this? Is it still happening? Ian Colle
11:23 AM rgw Feature #4761 (New): rgw: swift list containers should get stats asynchronously
Yehuda Sadeh
11:23 AM Bug #4686: corrupt or missing osdmap on load_pgs
I have also not been able to reproduce this one. Samuel Just
08:56 AM Bug #4686: corrupt or missing osdmap on load_pgs
Is this still occurring? Still planning fix for Cuttlefish? Ian Colle
11:23 AM Bug #4602: osd/ReplicatedPG.cc: 6487: FAILED assert(latest->is_update())
I haven't seen it since. Samuel Just
08:55 AM Bug #4602: osd/ReplicatedPG.cc: 6487: FAILED assert(latest->is_update())
Sam - any update on this? Are we still seeing this? Still trying to get this into Cuttlefish? Ian Colle
11:23 AM rgw Bug #4760 (Resolved): rgw: list buckets/containers should be streamlined
Yehuda Sadeh
11:22 AM rgw Feature #4759 (Resolved): rgw: option swift list container without container stats
We'd like to be able to dump container list without required to dump stats for each container. Yehuda Sadeh
11:04 AM devops Bug #4752: ceph-create-keys doesn't work on upgraded clusters
ceph CLI currently fails in ceph_tool_common_init and doesn't pass back a failure code that can be interpreted, so re... Dan Mick
09:58 AM devops Bug #4752 (In Progress): ceph-create-keys doesn't work on upgraded clusters
Anonymous
11:02 AM Bug #4620 (Resolved): mon: Paxos proposals take too long to finish when transaction is huge
Greg and Jim Schutt took care of this issue (commit:d8a354d511c96f5a1a25ec907f96e77f047b7c01)
Also, increasing the...
Joao Eduardo Luis
10:48 AM rgw Feature #4715: rgw: Add support for OPTIONS HTTP method
That's actually CORS, which already went into cuttlefish. Yehuda Sadeh
10:45 AM Bug #4758 (Resolved): monitor: going through all incrementals on startup
Apparently the monitor is incrementing through vast numbers of PGMap and OSDMap incrementals in some cases, and that ... Greg Farnum
10:42 AM Bug #3495: ceph-mon crash
Joao Eduardo Luis
09:34 AM Bug #3495: ceph-mon crash
Denis, you appear to be using master [1]; the fix is only available on wip-3495.
[1]:...
Joao Eduardo Luis
09:20 AM Bug #3495: ceph-mon crash
Fixme if I use wrong branch, but:
0> 2013-04-19 19:06:29.120708 7fb556116700 -1 mon/PaxosService.cc: In funct...
Denis kaganovich
07:45 AM Bug #3495: ceph-mon crash
This has been stable for me for >24-hours. I think you've got it. Thanks for all your help! Matthew Roy
10:17 AM CephFS Bug #4105: mds: fix up the Dumper
This has annoyed me a couple more times and I think it's now at the top of the queue, so here we go again. Greg Farnum
10:08 AM CephFS Bug #4746: client: invalidate callback can deadlock
pushed wip-fuse to ceph-client.git Sage Weil
09:46 AM Bug #4521: mon: starting a new osd crashes all mon's
Sage opened a bug for that one here: http://tracker.ceph.com/issues/4748 Joao Eduardo Luis
09:40 AM Bug #4521: mon: starting a new osd crashes all mon's
debug for ms and mon at 20, log attached.
Evan Felix
08:31 AM Bug #4521: mon: starting a new osd crashes all mon's
that store was after i ran the fix(log wip4521.fix_debugA), started the mon, then it crashed.
will run again wit...
Evan Felix
09:42 AM CephFS Bug #4753: mds/Locker.cc: 4167: FAILED assert(0)
You mean file_eval should just short-circuit if it's scanning? That seems like the most sensible place for it, but I'... Greg Farnum
09:31 AM CephFS Bug #4753: mds/Locker.cc: 4167: FAILED assert(0)
yeah, that transition doesn't make sense. i think it should do nothing in the scan state.. Sage Weil
09:05 AM CephFS Bug #4753: mds/Locker.cc: 4167: FAILED assert(0)
file_eval is trying to move ifile from "scan" to "mixed" in order to serve up the client caps, and scatter_mix doesn'... Greg Farnum
09:30 AM devops Bug #4756 (Resolved): mkcephfs doesn't set up same keys as ceph-deploy
Notably, "mon." doesn't get any permissions associated with it, which can also lead to the problems in #4752. Until w... Greg Farnum
09:17 AM rgw Bug #4755: rgw: assumption of signed char
Doh, in the title, s/patch/char/, of course. Adam Borowski
09:16 AM rgw Bug #4755 (Resolved): rgw: assumption of signed char
I'm testing ceph on an armhf based server. During compilation (from Debianized sources you provide), it turns out th... Adam Borowski
08:53 AM rbd Bug #3664: osdc/ObjectCacher.cc: 517: FAILED assert(!i->size())
Josh - any update? Are we still going to get this in Cuttlefish? Ian Colle
06:43 AM Bug #4723: FAILED assert(!db->create_and_open(std::cerr)) after IO Error.
This should probably be closed with can't reproduce. Now that the cluster is healthy I'm not able to produce the same... Matthew Roy
06:16 AM rgw Feature #4613: Allow bucket data to reside in a separate pool to object data
This would be especially useful for us, perhaps even a must. We have 200-250 million files split in a number of conta... Faidon Liambotis
05:35 AM Bug #3609: mon: track down the Monitor's memory consuption sources
It appears that, when starting a monitor, we will o through all the pg and osdmap incremental versions and apply them... Joao Eduardo Luis
05:20 AM Bug #3609: mon: track down the Monitor's memory consuption sources
btw, and as such:... Joao Eduardo Luis
05:18 AM Bug #3609: mon: track down the Monitor's memory consuption sources
starting monitors with tcmalloc noticed the following (both are peons):... Joao Eduardo Luis
02:13 AM CephFS Bug #4601: symlink with size zero
I was looking at the <inode>.<frag>_head* file in the osd that held the directory where the link was stored. As it t... Alexandre Oliva

04/18/2013

11:22 PM Revision 60e7fb41 (ceph): turn on debugging for MDS and Client in FS runs
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
11:21 PM Revision e21fdf81 (ceph): ior-cfuse: remove the binary/ dir that make install creates
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
11:21 PM Revision cb1e8ed9 (ceph): turn on debugging for MDS and Client in FS runs
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:23 PM Revision cd2cabec (ceph): doc: Trimmed toc depth for nicer visual appearance.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:08 PM Revision 44aa696b (ceph): doc: Added new PG troubleshooting use case.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:08 PM Revision 2e3579ed (ceph): doc: Updated title.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
09:07 PM Revision 304a2343 (ceph): doc: Added PG troubleshooting to toctree.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:51 PM Revision a975f9df (ceph): packaging: Add ceph-test debian package
The ceph-test package includes optional test and benchmarking programs.
Conflicts:
debian/control
debian/rules
Gary Lowell
08:51 PM Revision 2382d9b7 (ceph): deb: Add ceph-coverage to ceph-test deb package
Teuthology uses the ceph-coverage script extensively
and expects it to be installed by the ceph task. Add
the script...
Sam Lang
08:30 PM Revision d5139ba1 (ceph): doc: Bifurcating OSD and PG Troubleshooting. Updated hyperlink.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:30 PM Revision 3b8057ac (ceph): doc: Bifurcating OSD and PG Troubleshooting. Added PG troubleshooting doc.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:29 PM Revision 3c4bf83c (ceph): doc: Bifurcating OSD and PG Troubleshooting. Removed PG section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:13 PM rgw Feature #4327 (In Progress): rgw: dr: updated buckets log: create internal API
Yehuda Sadeh
08:07 PM rgw Feature #4573 (In Progress): Create User Quota Blueprint
Yehuda Sadeh
08:07 PM rgw Feature #4745 (In Progress): rgw: radosgw-admin command to stat object
Yehuda Sadeh
06:42 PM Revision 46d8b9f2 (ceph): rgw_bucket: Fix dump_index_check.
Signed-off-by caleb miles <caleb.miles@inktank.com> caleb miles
06:20 PM Revision 0d46dc46 (ceph): mon: make 'osd crush link ...' idempotent
We fixed move in f5ba0fbbe73e11418634bc95e1fc36d17edccf37 but missed this
one.
Signed-off-by: Sage Weil <sage@inktan...
Sage Weil
06:20 PM Revision b0c1001a (ceph): mon: ensure 'osd crush rule ...' commands are idempotent
Ensure that we return 0 for these cases.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:11 PM Revision decdeadf (ceph): Merge branch 'next'
Sage Weil
06:09 PM Revision 5f1898d9 (ceph): rgw_bucket: Fix dump_index_check.
Signed-off-by caleb miles <caleb.miles@inktank.com> caleb miles
05:41 PM Revision 7e4f80b1 (ceph): debian/control: Fix typo in libboost version number
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
05:41 PM Revision f4bc7607 (ceph): build: Add new package dependencies
Add libboost-system-dev (bug #4725).
Add hdparm to rpm installation requirements. The hdparm
command is used to det...
Gary Lowell
05:39 PM Revision efbe2e8b (ceph): Merge branch 'wip-max_size-3637' into next
Reviewed-by: Sage Weil <sage@inktank.com> Greg Farnum
05:38 PM Revision 87634d88 (ceph): mds: journal the projected root xattrs in add_root()
In EMetaBlob::add_root(), we should log the projected root xattrs
instead of original ones to reflect xattr changes.
...
Kuan Kai Chiu
05:38 PM Revision f379ce37 (ceph): mds: fix setting/removing xattrs on root
MDS crashes while journaling dirty root inode in handle_client_setxattr
and handle_client_removexattr. We should use ...
Kuan Kai Chiu
05:23 PM Bug #4521: mon: starting a new osd crashes all mon's
Evan, is that store prior or post applying the fix? It doesn't seem fixed at all.
Also, when you have the chance, ...
Joao Eduardo Luis
09:25 AM Bug #4521: mon: starting a new osd crashes all mon's
I pulled the updates, I compiled and installed. I ran the fix,( see attached log). I started the new mon, and tried... Evan Felix
05:22 PM CephFS Bug #4753 (Resolved): mds/Locker.cc: 4167: FAILED assert(0)
Every mds crashed after some startup checks: "mds/Locker.cc: 4167: FAILED assert(0)":
mds/Locker.cc: 4167: FAILED ...
Denis kaganovich
05:15 PM Revision a3c48351 (ceph): ceph.conf: lower mon disk avail warning threshold
Only wanr when we hit 90% instead of default 70%
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from com...
Sage Weil
05:15 PM Revision 4efed084 (ceph): ceph-deploy: stop daemons, archive, then purge[data]
Purge removes logs, and we want to archive those, so explicitly shut down
all daemons before doing the archiving step...
Sage Weil
05:12 PM CephFS Bug #4746: client: invalidate callback can deadlock
The suggestion from Maxim is to modify fuse to serialize reads and invalidate via a mutex. That ought to do the tric... Sage Weil
09:37 AM CephFS Bug #4746: client: invalidate callback can deadlock
It's not any of our internal locking that are getting stuck; it's the VFS inode mutexes in combination with us. If I ... Greg Farnum
07:31 AM CephFS Bug #4746: client: invalidate callback can deadlock
The invalidate is queued in a separate thread, and when we call the invalidate, we don't have the client lock held. ... Sam Lang
05:06 PM CephFS Bug #4601: symlink with size zero
>I looked a bit in the ceph-osd file holding the directory that contains the symlink, and I can see ^Q in the yes_hea... Greg Farnum
04:57 PM CephFS Bug #1945 (Can't reproduce): blogbench hang on caps
We haven't seen this in a long time (at least, that's marked here), and there's been a ton of work here over the last... Greg Farnum
04:39 PM CephFS Bug #4732: uclient: client/Inode.cc: 126: FAILED assert(cap_refs[c] > 0)
This was in the async invalidate thread, so I'm turning this down. It should probably be investigated alongside/after... Greg Farnum
04:34 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
Okay, pushed the update for more debugging, and am downgrading this to "High" since it only appears under so many fai... Greg Farnum
04:17 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
Also, both of these are the same job as the first incident was: fsstress workunit on ceph-fuse, messenger failure inj... Greg Farnum
04:15 PM CephFS Bug #4565: MDS/client: issue decoding MClientReconnect on MDS
Those machines are cleared out again, of course (d'oh!). Next time we see this we need to gather up everything we can... Greg Farnum
04:03 PM CephFS Bug #4741: MDS: stuck in clientreplay
Interesting; on #4742 it was clearly waiting on a request because it kept saying "still have 1 active replay requests... Greg Farnum
03:57 PM CephFS Bug #4741 (Duplicate): MDS: stuck in clientreplay
This is a duplicate of #4742. It looks like setattr is the culprit. I was able to generate a core file of the mds w... Sam Lang
11:13 AM CephFS Bug #4741: MDS: stuck in clientreplay
Also /a/teuthology-2013-04-18_01:01:07-fs-next-testing-basic/15101 Greg Farnum
03:58 PM CephFS Bug #4721 (Need More Info): libcephfs tests fail when using ceph-deploy
(Trying to track the responsibility flow more clearly.) Greg Farnum
03:19 PM CephFS Bug #4721: libcephfs tests fail when using ceph-deploy
Have you reproduced this, Tamil? Since all the tests are failing I'm pretty sure this is some kind of authentication ... Greg Farnum
03:57 PM CephFS Bug #4742 (In Progress): mds: stuck clientreplay request
Sam Lang
03:57 PM CephFS Bug #4742: mds: stuck clientreplay request
Marked #4741 as a duplicate of this bug. It looks like setattr is the culprit. I was able to generate a core file o... Sam Lang
03:47 PM Revision fd678eab (ceph): debian/control: Fix typo in libboost version number
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
03:45 PM Revision 4b34b0e5 (ceph): mon: PaxosService: fix trim criteria so to avoid constantly trimming
Say a service establishes it will only keep 500 versions once a given
condition X is true. Now say that said conditi...
Joao Eduardo Luis
03:30 PM Revision 69974a4d (ceph): Merge branch 'wip-4725' Add build dependencies (Bug 4725)
Gary Lowell
03:24 PM Revision 86c1ea11 (ceph): build: Add new package dependencies
Add libboost-system-dev (bug #4725).
Add hdparm to rpm installation requirements. The hdparm
command is used to det...
Gary Lowell
02:59 PM devops Bug #4752: ceph-create-keys doesn't work on upgraded clusters
Ah. Well that seems easy enough. Dan Mick
02:31 PM devops Bug #4752: ceph-create-keys doesn't work on upgraded clusters
oops, i dropped this ball.
ceph command was update dto return the error code, so it just need sto check if $! is E...
Sage Weil
02:28 PM devops Bug #4752 (Resolved): ceph-create-keys doesn't work on upgraded clusters
ceph-create-keys requires the "mon." key to have permission to do things to the monitors. Apparently older deployment... Greg Farnum
02:30 PM Bug #4747: Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has not been con...
I believe this is about the pre-Bobtail change which started adding global ordering values to the monitor data store;... Greg Farnum
11:17 AM Bug #4747 (Resolved): Upgrade monitors from argonaut->bobtail->next fails w/"Existing store has n...
Testing upgrade from Argonaut to Bobtail to Next (cuttlefish). I am using Argonaut and bobtail distros on gitbuilder... Ken Franklin
02:17 PM Bug #4743 (Can't reproduce): omap deep scrub finds multiple PGs as inconsistent
I think this was actually caused by one of the journal replay defects from <56.4. I'm marking it can't reproduce unt... Samuel Just
01:32 PM Bug #4743: omap deep scrub finds multiple PGs as inconsistent
the xattrs, however, seem to match Samuel Just
01:32 PM Bug #4743: omap deep scrub finds multiple PGs as inconsistent
osd.133 is missing 3 keys (out of 750k) on object 3.2f2_head/d340c2f2/.dir.10267.612/head//3 Samuel Just
12:54 PM Bug #4743: omap deep scrub finds multiple PGs as inconsistent
I got debug filestore = 20 debug osd = 30 debug ms = 1 (turns out it needs 30, not 20) logs from all three replicas o... Faidon Liambotis
01:57 PM CephFS Bug #4722: kernel BUG at fs/ceph/caps.c:1006 invalid opcode: 0000
I did a checkout of v3.5, and caps.c:1006 is... Greg Farnum
01:37 PM CephFS Bug #4738: libceph: unlink vs. readdir (and other dir orders)
I don't believe locking is implemented yet via the Samba VFS bindings, since we don't have a userspace implementation... Greg Farnum
01:27 PM CephFS Bug #4738: libceph: unlink vs. readdir (and other dir orders)
On top only:
vfs objects = scannedonly ceph
And if i switching to:
vfs objects = scannedonly
or:
vfs objects = c...
Denis kaganovich
12:42 PM rbd Documentation #4751 (Closed): Document Live Migration with RBD
For people migrating to Ceph, some information on migration would be helpful.
Wido "You can do Live Migration with...
John Wilkins
12:36 PM Documentation #4750 (Closed): Improve Unfound Object Documentation
Monitoring OSDs and PGs doesn't cover unfound objects. Add some description there and link to troubleshooting. John Wilkins
12:28 PM Bug #3440: Running OSDs on ZFS on Linux
Tried with the patch and it works for me. Some comments are on Github: https://github.com/zfsonlinux/zfs/pull/1409
...
Wido den Hollander
11:26 AM Bug #4749 (Duplicate): osd: failed to recover before timeout
job was... Sage Weil
11:25 AM Bug #4748 (Resolved): mon: failed assert in OSDMonitor::build_incremental
... Sage Weil
11:03 AM CephFS Bug #3637 (Resolved): client: not issuing caps for with clients doing shared writes
Merged into next in commit:efbe2e8b55ba735673a3fdb925a6304915f333d8 Greg Farnum
09:41 AM Bug #4543: mon: corrupted store if monitor dies mid-sync
Updated the original description with further details. Joao Eduardo Luis
08:58 AM Revision 5a5fdfc6 (ceph): mon: Paxos: increase debug levels for proposal listing
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
08:34 AM devops Bug #4725 (Resolved): ceph package build-depends are incomplete for Ubuntu 12.04 at least
Resolved with the following commit:
commit 86c1ea1156b25e1a7038132a2319cbf6a47c92da
Author: Gary Lowell <glowell@...
Anonymous
01:34 AM Revision a0e457ae (ceph): doc: Removed legacy man page index. Generates warning otherwise.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:34 AM Revision d67793c2 (ceph): doc: Clarified that admin-socket is accessed from same host.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:33 AM Revision da7bf677 (ceph): doc: Updated hyperlinks to new tshooting section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:32 AM Revision fb4cba4b (ceph): doc: Removed this doc. Nothing referenced it, and parent directory echo...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:32 AM Revision f7843174 (ceph): doc: Revised top-level ops page.
Consolidated authentication into high-level operations. Added a
troubleshooting section. Collapsed toc trees to make ...
John Wilkins
01:30 AM Revision 6cf36827 (ceph): doc: Removed link to nowhere. Otherwise generates a warning.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:29 AM Revision 064ec2fb (ceph): doc: Removed top-level tshoot page, and created new index.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:29 AM Bug #3495: ceph-mon crash
Thanks for the update! Joao Eduardo Luis
01:28 AM Revision 0d1e0472 (ceph): doc: Excised community from OSD tshoot, made it stand alone.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:28 AM Revision 23e3fbee (ceph): doc: Moved monitor troubleshooting to troubleshooting section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:27 AM Revision 594580c9 (ceph): doc: Moved troubleshooting OSD to troubleshooting section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:26 AM Revision 78758007 (ceph): doc: Added extraneous rgw settings to rgw conf.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:25 AM Revision 4e6709bf (ceph): doc: Moved memory profiling from operations to troubleshooting.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:25 AM Revision 9e9bd2d8 (ceph): doc: Moved CPU profiling from operations to troubleshooting.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:24 AM Revision f0e3548a (ceph): doc: Set toc depth to 1 level, and added troubleshooting so it appears ...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:23 AM Revision dd7fd2dd (ceph): doc: Moved journal discussion to OSD ref from Ceph config.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:22 AM Revision 9ddc8b90 (ceph): doc: Reordered deployment tools in toc.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:21 AM Revision fd8b4d0a (ceph): doc: Removed logging from config index. Set depth to 1 for clean appear...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:20 AM Revision cd4b242d (ceph): doc: Removed logging. Added references. Reorganized and edited.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:19 AM Revision 22a5cb66 (ceph): doc: Removed. Not in toc, and otherwise generates a warning.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:18 AM Revision 84b0ec28 (ceph): doc: Updated hyperlink.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
01:18 AM Revision 808ad25a (ceph): doc: Removed fragmented logging info. Consolidated into one doc.
Logging was variously described in the ceph configuration document,
a configuration reference, and a section in opera...
John Wilkins
 

Also available in: Atom