Project

General

Profile

Activity

From 01/10/2013 to 02/08/2013

02/08/2013

11:38 PM Bug #4065 (Can't reproduce): Crash of 0.56.2 OSD on Ubuntu 12.04 LTS
Hi.
I am new to this and new to ceph. So please bear with me...
I tried to setup a ceph cluster here at home to t...
Matthias Babisch
11:33 PM Bug #4052: OSD High memory usage (8-12GB) right after start
Simon Frerichs wrote:
> Hi Sage,
>
> i checked two osds one start with this branch each and they crashed.
> I'll...
Sage Weil
11:30 PM Bug #4052: OSD High memory usage (8-12GB) right after start
Hi Sage,
i checked two osds one start with this branch each and they crashed.
I'll do another check later. Our cl...
Simon Frerichs
11:20 PM Bug #4052: OSD High memory usage (8-12GB) right after start
Hi Simon-
Is the osd crashing on startup *every* time? From the trace it looks like there is an invalid xattr set...
Sage Weil
05:13 PM Bug #4052: OSD High memory usage (8-12GB) right after start
i'll add an heap dump soon.
i just restarted another osd with wip_bobtail_f, also crashing:...
Simon Frerichs
02:22 PM Bug #4052: OSD High memory usage (8-12GB) right after start
hi Josh, burnupi57 running on wip-f branch might help.
we have this running from last week for the memory leak testing.
Tamilarasi muthamizhan
02:01 PM Bug #4052: OSD High memory usage (8-12GB) right after start
Yeah, if you can still reproduce it, a heap profile of an osd that's using excessive memory would be great.... Josh Durgin
01:42 PM Bug #4052: OSD High memory usage (8-12GB) right after start
Josh Durgin wrote:
> I haven't been able to reproduce this locally.
do you need more information / log output?
Simon Frerichs
01:39 PM Bug #4052: OSD High memory usage (8-12GB) right after start
I haven't been able to reproduce this locally. Josh Durgin
02:00 AM Bug #4052: OSD High memory usage (8-12GB) right after start
... Simon Frerichs
01:50 AM Bug #4052: OSD High memory usage (8-12GB) right after start
as requested current cluster status:
2013-02-08 10:48:40.125733 mon.0 [INF] pgmap v25369962: 2112 pgs: 1460 acti...
Simon Frerichs
01:43 AM Bug #4052 (Can't reproduce): OSD High memory usage (8-12GB) right after start
Hi,
some of our osds need 8-12GB RAM right after startup.
Sage mentioned wip_bobtail_f might fix it but this bra...
Simon Frerichs
09:58 PM Revision e7bc4b8d (ceph): mds: cap_reconnect_t uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:58 PM Revision f1e08e6c (ceph): mds: add ENCODING to the incompat set
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:58 PM Revision d414875a (ceph): mds: remove very dead commented-out code
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:58 PM Revision 77612336 (ceph): mds: enable SnapInfo, snaplink_t, sr_t dencoder usage
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:51 PM Bug #4064: osd: filestore assert on FORREMOVAL_* collection removal
... Sage Weil
09:50 PM Bug #4064 (Resolved): osd: filestore assert on FORREMOVAL_* collection removal
ubuntu@teuthology:/var/lib/teuthworker/archive/sage-2013-02-08_12:11:07-rados-master-testing-basic/2870$ ... Sage Weil
09:27 PM Revision 38dd59ba (ceph): doc: Removed unnecessary/contradictory options.
fixes: #4058
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
09:18 PM Revision 19c94666 (ceph): doc: Fixed order of option.
fixes: #4046
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
09:17 PM Revision d557bcfb (ceph): mds: ESubtreeMap more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision d727b129 (ceph): mds: ETableClient event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision f82dce8d (ceph): mds: ETableClient more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision ac8f25c0 (ceph): mds: ETableServer event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 8f75db34 (ceph): mds: ETableServer more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 05461e8d (ceph): mds: EUpdate event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 3fb1b219 (ceph): mds: EUpdate more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 4123d011 (ceph): mds: use modern encoding for LogEvent
It's a pretty simple encoding, but if we ever want to encode more than
the event type and the event itself we'll be g...
Greg Farnum
09:17 PM Revision 6a6f75e8 (ceph): mds: inode_load_vec_t now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 5c812213 (ceph): mds: dirfrag_load_vec_t now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 7cbae702 (ceph): mds: mds_load_t now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 07b24cf2 (ceph): mds: SessionMap now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision de9c1a15 (ceph): MDS: EMetaBlob more modernization for encoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision a1ed7418 (ceph): mds: EOpen event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 5db5433e (ceph): mds: EResetJournal event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision b578ef27 (ceph): mds: EResetJournal modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 3f469baa (ceph): mds: ESession event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 709ff3d7 (ceph): mds: ESession more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 8e8ecb47 (ceph): mds: ESessions now uses modern encoding
To facilitate this (since it had no versioning previously), it
gets a new encoding number and LogEvent::decode() sets...
Greg Farnum
09:17 PM Revision 3cea0ca7 (ceph): mds: ESessions more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision a64153f9 (ceph): mds: ESlaveUpdate event now uses modern encoding everywhere
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 637e99fd (ceph): mds: link_rollback more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 85c67fe8 (ceph): mds: rmdir_rollback more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 70bc6afa (ceph): mds: rename_rollback more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 1842f461 (ceph): mds: ESlaveUpdate more modernization for dencoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 7998524d (ceph): mds: ESubtreeMap event now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 8906ae9a (ceph): mds: remove unused EString event
While we're at it, #include LogEvent.h from each of
the log events, some of which didn't include it previously!
Sign...
Greg Farnum
09:17 PM Revision b079733b (ceph): mds: EFragment event uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision a25683f3 (ceph): mds: EImportFinish event uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 83c1a3c4 (ceph): mds: EImportStart event uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 60730271 (ceph): mds: EMetaBlob and its sub-parts use modern encoding now
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision deb0d45c (ceph): mds: EMetaBlob::full_bit more modernization for dencoder
While we're doing so, make the frag stream operator const!
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:17 PM Revision 09a2d66e (ceph): mds: EMetaBlob::remotebit more modernization for dencoder
And set defaults for the default constructor.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:17 PM Revision 196313b4 (ceph): mds: EMetaBlob::nullbit modernization for dencoder
And set defaults in the default constructor.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:17 PM Revision 821b74e9 (ceph): MDS: EMetaBlob::dirlump more modernization for encoder
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 0fe7a086 (ceph): mds: old_rstat_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:17 PM Revision d5a6a251 (ceph): DecayCounter: use modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Sage Weil
09:17 PM Revision 95cee97e (ceph): AnchorServer: use modern encoding for server state
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision d8a7b876 (ceph): CInode: use modern encoding for encode_store
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision eb060bb4 (ceph): CInode: use modern encoding for encode_export/decode_import
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 6e797e00 (ceph): InoTable: use modern encoding for encode_state and decode_state
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision 7bad5078 (ceph): SnapServer: use modern encoding for server_state
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision d21de810 (ceph): mds: ECommitted now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:17 PM Revision f886f31e (ceph): mds: EExport event uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:17 PM Revision ff63530d (ceph): mds: Capability (and sub-structs) now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
06:04 PM Revision f11beb95 (ceph): radosgw-admin: fix cli test
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 1b05b0edbac09d1d7cf0da2e536829df05e48573)
Sage Weil
05:59 PM Revision 3cf3710b (ceph): mon: fix typo in C_Stats
Broken by previous commit.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:21 PM CephFS Bug #4060: mds: vxattr ceph.file.layout.pool doesn't check latest osdmap
Also, I've tested this fix with a basic script:
set -e
mnt=$1
touch ${mnt}/foo.$$
rados mkpool foo.$$
poolid=$...
Sam Lang
05:20 PM CephFS Bug #4060 (Fix Under Review): mds: vxattr ceph.file.layout.pool doesn't check latest osdmap
Pushed a proposed fix to wip-4060. Needs review. Sam Lang
03:18 PM CephFS Bug #4060 (In Progress): mds: vxattr ceph.file.layout.pool doesn't check latest osdmap
Even with add_data_pool I get EINVAL, so I'm reopening this. I've verified that (as before), the osdmap on the mds ... Sam Lang
02:57 PM CephFS Bug #4060 (Rejected): mds: vxattr ceph.file.layout.pool doesn't check latest osdmap
You need to add it to the MDSMap first ("ceph mds add datapool [x]" or something), and the server does at least wait ... Greg Farnum
02:53 PM CephFS Bug #4060 (Resolved): mds: vxattr ceph.file.layout.pool doesn't check latest osdmap
touch foo
create pool d2
setfattr -n ceph.file.layout.pool -v d2 foo
> returns EINVAL
The problem is that the p...
Sam Lang
05:19 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
the bit that looks fishy here is m_flush_mutex.__nusers. can you see what that thread is doing in gdb?
maybe it's...
Sage Weil
04:58 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
... Josh Durgin
04:55 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
A log dump shows nothing, so I'm guessing the log is corrupted such that it keeps logging to more and more memory wit... Josh Durgin
04:49 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
Something like that (or some kind of bug in the logging system that only gets hit with syslog or when not logging) wo... Greg Farnum
04:48 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
Oh, how interesting...I wonder if this is syslog not having enough network bandwidth? Or (in the more general sense) ... Greg Farnum
04:45 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
On wip-f, one osd grew to consume 70% of ram. The heap profiler tells us:... Josh Durgin
06:16 AM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
I've also started to see this and will try to get some heap profiling done to report back.
* ceph version 0.56.1 (...
Wido den Hollander
05:18 PM Revision 2bdf753d (ceph): mon: assert valid context return values
We recognized EAGAIN, ECANCELED, and success only.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Joao Lui...
Sage Weil
05:18 PM Revision 3322e96b (ceph): Merge branch 'next'
Sage Weil
05:09 PM Revision 4837063d (ceph): mon: retry PGStats message on EAGAIN
If we get EAGAIN from a paxos restart/election/whatever, we should
restart the message instead of just blindly acking...
Sage Weil
05:09 PM Revision 17827769 (ceph): mon: handle -EAGAIN in completion contexts
We can get ECANCELED, EAGAIN, or success out of the completion contexts,
but in the EAGAIN case (meaning there was an...
Sage Weil
05:08 PM Bug #4006: osd: repeating 'wrong node' message in log
I am also seeing this message in the radosgw.log file for .56-623. The error appears when restarting rgw and again d... Ken Franklin
05:08 PM Bug #4063 (Duplicate): filer: probe crash on wip-bobtail-osd-msgr branch
Tamilarasi muthamizhan
05:02 PM Bug #4063: filer: probe crash on wip-bobtail-osd-msgr branch
restarting the mds/all daemons in the cluster does not help, still hitting the same issue again.
leaving the clust...
Tamilarasi muthamizhan
04:56 PM Bug #4063: filer: probe crash on wip-bobtail-osd-msgr branch
repasting the core dump... Tamilarasi muthamizhan
04:56 PM Bug #4063 (Duplicate): filer: probe crash on wip-bobtail-osd-msgr branch
ceph version 0.56.2-15-g2ebf4d0 [wip-bobtail-osd-msgr]
test set up: burnupi06, burnupi07
hit this when running ...
Tamilarasi muthamizhan
03:47 PM devops Feature #4062 (Rejected): Add data collection to the gitbuilders
Need to track how long the builds are taking. Anonymous
03:42 PM CephFS Bug #4061 (Can't reproduce): mds crashed at LogEvent::decode
hit this on burnupi60, when upgrading from ceph v0.56-598-gb970d05 to 0.56.2-12-gcc16791 on 4 feb and it seems to be ... Tamilarasi muthamizhan
01:58 PM CephFS Bug #4044 (Rejected): replay failure between MDS and client
Never mind, this turned out to be another encoding issue. Phew! Greg Farnum
01:28 PM Documentation #3432 (In Progress): move explanation for rbd on libvirt to new docs
John Wilkins
01:27 PM Documentation #4058 (Resolved): fstab documentation has invalid or misleading options
Removed extraneous/contradictory options, committed and pushed. Fix should be up shortly. John Wilkins
11:26 AM Documentation #4058 (Resolved): fstab documentation has invalid or misleading options
http://ceph.com/docs/master/cephfs/fstab/ states "the Ceph file system will mount automatically on startup". However... Dan Reif
01:22 PM CephFS Feature #3543: mds: new encoding
Okay, that was an easy bug to fix. Hurray!
(The LogEvent encoding was off a little bit.)
Running it through anoth...
Greg Farnum
01:17 PM Documentation #4046 (Resolved): Typo in ceph.com docs webpage
Fixed and checked in. Should appear shortly. John Wilkins
01:12 PM Bug #4059 (Duplicate): osd: ENOTEMPTY unhandled for remove op
This occurred on wip_bobtail_f in a local vstart 11-osd cluster which I was trying to use to reproduce #4052 by causi... Josh Durgin
11:19 AM rgw Cleanup #4057 (New): Update Admin API spec with comments
Incorporate received comments into admin API specification caleb miles
09:54 AM Bug #3895: librados test hang during mon thrashing
commit:17827769f1fe6d7c4838253fcec3b3a4ad288f41 Sage Weil
09:20 AM Bug #3895 (Resolved): librados test hang during mon thrashing
Sage Weil
09:05 AM Bug #3895: librados test hang during mon thrashing
wip-mon-eagain looks good Joao Eduardo Luis
09:36 AM Bug #4043: osd: validate/scrub collections
Ian Colle
09:22 AM rgw Feature #3973 (New): rgw: Handle requests sent in non-UTC time
Moved to a feature for possible future consideration. Ian Colle
01:18 AM rgw Feature #3973: rgw: Handle requests sent in non-UTC time
Yehuda, i admit that i looks like the client is sending the wrong date, although it would be nice if radosgw could co... Moritz Krinke
09:07 AM Linux kernel client Bug #3997 (Resolved): xfs: insert memory barriers before wake_up_bit()
Our work is done! Thanks! Sage Weil
09:03 AM Linux kernel client Bug #3997: xfs: insert memory barriers before wake_up_bit()
Ben has committed my fix to the upstream XFS tree.
I'm not sure when it will hit Linus' tree, but I
think we can ca...
Alex Elder
08:36 AM rbd Cleanup #4053: ceph: cleanup ceph page vector functions
Apparently for cleanup there is no "need review" so I'm
marking this "Feedback". I've posted a series of patches
t...
Alex Elder
08:30 AM rbd Cleanup #4053 (Resolved): ceph: cleanup ceph page vector functions
This is just documenting some cleanup activity I've done
that I'm about to post for review.
- delete bogus (re)decl...
Alex Elder
08:21 AM rbd Subtask #4007 (Fix Under Review): libceph: support STAT osd operation
A patch implementing this has been posted to the
ceph-devel mailing list for review.
[PATCH] libceph: allow STAT ...
Alex Elder
07:25 AM Revision ec1085e5 (ceph): Merge remote-tracking branch 'gh/wip-bobtail-vxattrs' into bobtail
Sage Weil
07:25 AM Revision 66d77585 (ceph): mon: enforce reweight be between 0..1
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Joao Luis <joao.luis@inktank.com>
(cherry picked from commit...
Sage Weil
07:24 AM Revision 8bab3a1c (ceph): PG: dirty_info on handle_activate_map
We need to make sure the pg epoch is persisted during
activate_map.
Backport: bobtail
Reviewed-by: Sage Weil <sage@i...
Samuel Just
07:24 AM Revision dffa386b (ceph): osd: flush peering queue (consume maps) prior to boot
If the osd itself is behind on many maps during boot, it will get more and
(as part of that) flush the peering wq to ...
Sage Weil
07:20 AM Revision 5a02d6de (ceph): Merge branch 'next'
Sage Weil
06:51 AM Revision 1b05b0ed (ceph): radosgw-admin: fix cli test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:31 AM Revision 2eaa7281 (ceph): keys: renew autobuild.asc key
This expired today. Change it to never expire, like the Ubuntu release
keys.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
06:19 AM Revision f3ba46d3 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
01:16 AM Revision 8a2de334 (ceph): Merge remote-tracking branch 'origin/master' into wip-2941-3
Yehuda Sadeh
12:59 AM Revision 278dfe50 (ceph): rgw: stream get_obj operation
Fixes: #2941
Instead of iterating through the parts one by one when reading
an object, we can now send multiple reque...
Yehuda Sadeh
12:59 AM Revision 3383618d (ceph): throttle: optional non perf counter mode
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com> Yehuda Sadeh
12:34 AM Revision a56eb88c (ceph): Merge to include --machine-type and changes to --summary
Added the ability to support multiple types of machines with
--machine-type added to teuthology-lock when used with -...
Sandon Van Ness
12:16 AM Revision ed2bb387 (ceph): OSD: check pg snap collections on start up
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
12:16 AM Revision 55f85796 (ceph): OSD::load_pgs: first scan colls before initing PGs
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
12:06 AM Revision 75d86e47 (ceph): Made teuthology-lock --summary machine type aware.
Signed-off-by: Sandon Van Ness <sandon@van-ness.com> Sandon Van Ness

02/07/2013

11:38 PM Bug #4051 (Duplicate): osd: inconsistent snapcolls on argonaut
latest run:... Sage Weil
11:04 PM Bug #3895 (Fix Under Review): librados test hang during mon thrashing
tracked this down; see wip-mon-eagain
qa run against rados api tests seems to confirm that this fixes it (previous...
Sage Weil
01:07 PM Bug #3895: librados test hang during mon thrashing
Attached mon logs from a recent run after the rados test seemed to hang for a big (100 mon elections or so). The log... Sam Lang
12:49 PM Bug #3895: librados test hang during mon thrashing
Attached log files for this from hung runs (librados and kernel untar). Sam Lang
10:54 PM Revision f6af1e76 (ceph): rgw: fix bucket_owner assignment
s->bucket_acl may be null, so reverting to old behavior.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
10:54 PM Revision 70532d19 (ceph): rgw: get bucket_owner from policy
We already read the bucket policy, we can get the bucket
owner from there.
Signed-off-by: Yehuda Sadeh <yehuda@inkta...
Yehuda Sadeh
10:54 PM Revision e345dfe0 (ceph): Feature 3667: Support extra canned acls.
Support the bucket-owner-read and bucket-owner-full
canned acls.
Signed-off-by caleb miles <caleb.miles@inktank.com>...
caleb miles
10:27 PM Revision fa47e77a (ceph): ReplicatedPG: check store for temp collection in have_temp_coll
We may not have "created" the temp collection since OSD restart
before removing the PG. have_temp_coll must also loo...
Samuel Just
09:55 PM Revision a18045f0 (ceph): rgw: a tool to fix clobbered bucket info in user's bucket list
This fixes bad entries in user's bucket list that may have occured
due to issue #4039. Syntax:
$ radosgw-admin user...
Yehuda Sadeh
09:55 PM Revision a00c77ab (ceph): rgw: bucket recreation should not clobber bucket info
Fixes: #4039
User's list of buckets is getting modified even if bucket already
exists. This fix removes the newly cre...
Yehuda Sadeh
09:53 PM Revision 47c9f46a (ceph): rgw: a tool to fix clobbered bucket info in user's bucket list
This fixes bad entries in user's bucket list that may have occured
due to issue #4039. Syntax:
$ radosgw-admin user...
Yehuda Sadeh
09:53 PM Revision 6c8d6381 (ceph): rgw: bucket recreation should not clobber bucket info
Fixes: #4039
User's list of buckets is getting modified even if bucket already
exists. This fix removes the newly cre...
Yehuda Sadeh
09:48 PM Bug #4050 (Resolved): recovery assert failure, osd/PG.cc: 6255: FAILED assert(query.query.type ==...
2013-02-07 20:58:49.461754 7f518f18c700 -1 osd/PG.cc: In function 'boost::statechart::result PG::RecoveryState::Repli... Samuel Just
09:26 PM Revision 030bc7c2 (ceph): Added support for multiple types of machines.
Added the ability to support multiple types of machines with
--machine-type added to teuthology-lock when used with -...
Sandon Van Ness
09:26 PM Revision 9cb6c33f (ceph): rgw: a tool to fix clobbered bucket info in user's bucket list
This fixes bad entries in user's bucket list that may have occured
due to issue #4039. Syntax:
$ radosgw-admin user...
Yehuda Sadeh
09:25 PM Revision 9d006ec4 (ceph): rgw: bucket recreation should not clobber bucket info
Fixes: #4039
User's list of buckets is getting modified even if bucket already
exists. This fix removes the newly cre...
Yehuda Sadeh
09:09 PM Revision 78454794 (ceph): Merge branch 'wip-cephtool' into next
Usage/errmsg fixups for the ceph CLI tool
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Dan Mick
09:06 PM Revision 36cf4d0c (ceph): ceph: fix 'pg' error message to direct user toward better input
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
09:06 PM Revision 73872e71 (ceph): ceph: use "config set" consistently in help/error msgs
apparently it was once known as set_config. Fix up everything to
refer to the new name. Also, fix up the help messa...
Dan Mick
09:06 PM Revision eb9d6cac (ceph): osd: fix name of setomapval admin-daemon command
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
09:06 PM Revision c44846e0 (ceph): ceph: ceph mon delete doesn't exist; ceph mon remove is the command
Fix up cli test as well (doc is already correct)
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
09:06 PM Revision 1042060f (ceph): mds: error messages for export_dir said 'migrate_dir'
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:31 PM Feature #3891 (Fix Under Review): osd: move purged_snaps out of info
David Zafman
07:47 PM Revision 5896b971 (ceph): modified the script to run on both argonaut and bobtail.
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
06:44 PM Revision dbce1d0d (ceph): PG: dirty_info on handle_activate_map
We need to make sure the pg epoch is persisted during
activate_map.
Backport: bobtail
Reviewed-by: Sage Weil <sage@i...
Samuel Just
06:25 PM Revision 94323535 (ceph): mds: rename mds_traceless_replies to mds_inject_traceless_reply_probabi...
Sage pointed out we should try for consistent naming on these debug
options, and this option is like our other inject...
Greg Farnum
06:21 PM Revision af95d934 (ceph): osd: flush peering queue (consume maps) prior to boot
If the osd itself is behind on many maps during boot, it will get more and
(as part of that) flush the peering wq to ...
Sage Weil
06:10 PM Documentation #3432: move explanation for rbd on libvirt to new docs
The secondary issue is only without cephx, it's true, but the bigger issue of "we *really*
need this documentation i...
Dan Mick
06:08 PM Documentation #4049 (Resolved): public/cluster network doc should mention that multiple subnets a...
public network and cluster network allow comma-separated (at least) lists of subnets. It is of course assumed
that ...
Dan Mick
05:16 PM rgw Feature #2941 (Resolved): rgw: improve streaming read performance
Merged, commit:8a2de334fed5c56919063bba8c60a3c73bd6534c Yehuda Sadeh
05:11 PM rgw Bug #4048 (Resolved): API mismatch between RGW and Swift
As discussed with Yehuda, when using RadosGW with a delimiter:
curl -H 'x-auth-token: 909e3793e499425fb90364738107da...
Alexandre Marangone
05:08 PM rbd Bug #4047 (Resolved): removing a non-existing rbd image logs error in osd logs
when removing a non-existing rbd image floods osd logs even when the debug is turned off. This can be avoided.
ubu...
Tamilarasi muthamizhan
04:31 PM Documentation #4046 (Resolved): Typo in ceph.com docs webpage
In this section:
http://ceph.com/docs/master/rados/operations/operating/#stopping-a-cluster
the example command:...
Anonymous
04:22 PM rbd Bug #4045 (Resolved): snap unprotect on a snapshot that is already unprotected throws inappropria...
ceph version 0.56.2-7-gc3468f7 (c3468f76a5e68a6426f03e508d8ecf26950fca2a)
Trying to unprotect a snapshot, that is ...
Tamilarasi muthamizhan
04:09 PM Feature #3982 (Resolved): Performance tests on branches that change the way pg info is stored
David Zafman
02:54 PM rgw Feature #3667 (Resolved): rgw: support extra canned acl params
Merged commit:e345dfe04a64fcd0d37c9e0717b6714038c302ae Yehuda Sadeh
02:14 PM CephFS Bug #4044 (Rejected): replay failure between MDS and client
While testing #3543 (but that shouldn't be related to this issue), I restarted the MDS and ran into a case where the ... Greg Farnum
02:11 PM CephFS Feature #3543: mds: new encoding
3Still haven't gotten in on teuthology (soon!), but I did some local upgrade testing. I was able to upgrade from mast... Greg Farnum
01:58 PM Bug #4042: osd crash in recovery state: FAILED assert(0 == "we got a bad state machine event")
Nope. I've looked at it when reporting this issue, but I couldn't find a core file. I'd expected one to be in /, but ... Wido den Hollander
01:51 PM Bug #4042 (Need More Info): osd crash in recovery state: FAILED assert(0 == "we got a bad state m...
Hey Wido- Do you have have the core by chance? Sage Weil
08:40 AM Bug #4042 (Resolved): osd crash in recovery state: FAILED assert(0 == "we got a bad state machine...
I just rebooted a couple of my 0.56.2 nodes and out of 12 OSDs one went down with:... Wido den Hollander
01:55 PM rgw Bug #4039 (Resolved): rgw: bucket info discrepencies
Fixed, commit:9d006ec40ced9d97b590ee07ca9171f0c9bec6e9.
Recovery tool, commit:9cb6c33f0e2281b66cc690a28e08459f2e62ca...
Yehuda Sadeh
11:06 AM rgw Bug #4039 (In Progress): rgw: bucket info discrepencies
Ian Colle
01:49 PM rbd Bug #4003 (Resolved): rbd: EBUSY errors from rbd unmap
closing this. phew! Sage Weil
01:45 PM Bug #4043 (Resolved): osd: validate/scrub collections
check that existent collections is correct.
one option is to just do this during startup (along with some optional...
Sage Weil
11:23 AM Bug #4036: init-ceph: assumes write access to /var/run/ceph
I was mistaken about vstart clusters; it's restarting them just fine. Changed the bug description to more correctly d... Greg Farnum
10:09 AM CephFS Cleanup #1499: mds: clean up directory layouts
I've rebased on top of the wip-mds-encode-rebased branch as wip-1499-mds-layouts, although I notice it's failing some... Greg Farnum
10:09 AM CephFS Bug #1435: mds: loss of layout policies upon mds restart
I've been totally unable to come up with a scenario for how this could happen via code inspection, so I think I'm jus... Greg Farnum
09:54 AM Bug #3995: OSD heartbeat-crashes during startup
All right, I'll try to confirm if I see the problem again.
Thank you.
Artem Grinblat
09:47 AM Bug #3995 (Resolved): OSD heartbeat-crashes during startup
Artem Grinblat wrote:
> Sage, no, as I've said in comment #1, after a couple of restarts the OSD returned to normal....
Sage Weil
09:43 AM Bug #3995: OSD heartbeat-crashes during startup
Sage, no, as I've said in comment #1, after a couple of restarts the OSD returned to normal. Artem Grinblat
09:39 AM Bug #3995 (Need More Info): OSD heartbeat-crashes during startup
Artem, does it do this on every startup? Can you test wip_bobtail_f and see if it resolves the problem?
Thanks!
Sage Weil
09:52 AM rgw Feature #3973: rgw: Handle requests sent in non-UTC time
from RFC 2616:... Yehuda Sadeh
09:39 AM rgw Feature #3973: rgw: Handle requests sent in non-UTC time
Ian Colle
12:26 AM rgw Feature #3973: rgw: Handle requests sent in non-UTC time
Ian, i dont think this is an client issue. Checking the AWS documentation (http://docs.aws.amazon.com/AmazonS3/latest... Moritz Krinke
08:14 AM Bug #4041 (Can't reproduce): mon: Single-Paxos: on Paxos, leader didn't trim old versions
Possibly after being killed at some point, the leader ignored earlier versions when it trimmed its state, such that t... Joao Eduardo Luis
07:44 AM Bug #4040: mon: Single-Paxos: on PGMonitor, FAILED assert(0 == "update_from_paxos: error parsing ...
Triggered again, same symptoms, and it appears as if the issue is a skipped version on the store:
from the origina...
Joao Eduardo Luis
04:25 AM Bug #4040: mon: Single-Paxos: on PGMonitor, FAILED assert(0 == "update_from_paxos: error parsing ...
Also, I suspect this might be causing the same problem described on #4026 Joao Eduardo Luis
04:24 AM Bug #4040: mon: Single-Paxos: on PGMonitor, FAILED assert(0 == "update_from_paxos: error parsing ...
Something got messed up when updating the 'last_committed' version on mon.f, which by the way has fallen some 10 vers... Joao Eduardo Luis
04:01 AM Bug #4040 (Resolved): mon: Single-Paxos: on PGMonitor, FAILED assert(0 == "update_from_paxos: err...
... Joao Eduardo Luis
06:02 AM Revision ed9103aa (ceph): rgw: parse testdir into apache.conf
Also fix up the template to use {{field}} for stuff we don't want to parse.
There is probably a better way...
Signed...
Sage Weil
06:01 AM Revision 75c40fac (ceph): qa: fix iogen script
Wait 10 minutes and then stop.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:44 AM Revision 67bbb9c7 (ceph): osd_recovery: add missing testdir arg
Sage Weil
01:14 AM Revision 561ea14c (ceph): ceph_manager: take int or string to osd_admin_socket
This fixes a failure on dump_stuck. Sage Weil
12:47 AM Revision 46d7dbd3 (ceph): client: trigger the completion in _flush when short-cutting
We missed a shortcut return from _flush() when doing
e9a6694d0151b79c3a3b44cee5df8e3d4dcbfc2c, so _fsync() calls
were...
Greg Farnum

02/06/2013

11:46 PM Bug #3945: osd: dynamically link to leveldb
The current version of leveldb that is being used by ceph is 1.2. The wip-leveldb has version 1.9 which is the lates... Anonymous
09:34 PM Revision 08b82b3e (ceph): mds: add "mds traceless replies" debug option
This option specifies (in the range 0-1) the percentage of modifying
operations that should be responded to without i...
Greg Farnum
07:45 PM Revision 9871cf27 (ceph): logrotate.conf: Silence rgw logrotate some more
Apply the same change as commit d02340d90c9d30d44c962bea7171db3fe3bfba8e to
the radosgw logrotate.conf.
Signed-off-b...
Gary Lowell
07:44 PM Revision d02340d9 (ceph): silence logrotate some more
I was getting email with logrotate error output from “which invoke-rc.d”
on systems without an invoke-rc.d. This pat...
Alexandre Oliva
06:43 PM Revision f81e0952 (ceph): Merge remote-tracking branch 'gh/wip-danny-cleanups'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
06:38 PM Revision 0aea4dba (ceph): Merge remote-tracking branch 'gh/wip-3768'
Sage Weil
06:22 PM Revision c0e1070f (ceph): test: fix Throttle unit test.
A bunch of these are slightly racy so they're enclosed in loops. This
particular one, though, changes the Throttle st...
Greg Farnum
05:06 PM Revision 3fbb5522 (ceph): radosbench: fix missing format value
tdir is substituted in at the end. There is probably a better way to do
this.
Sage Weil
05:04 PM Revision 936f314a (ceph): rgw: fix testdir format on f
Format the path, not filehandle Sage Weil
05:02 PM Revision 1948a02b (ceph): osd: do not spam system log on successful read_log
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
04:48 PM CephFS Bug #4038 (Resolved): ceph-fuse: various hangs
He says it fixed the problem, and it's in master now. (commit: 46d7dbd3472f26926c6d048bfc3c150074bfd283) Greg Farnum
04:32 PM CephFS Bug #4038: ceph-fuse: various hangs
There's a shortcut return in CInode::_flush() that wasn't setting the new completion to done (when called from _fsync... Greg Farnum
04:01 PM CephFS Bug #4038 (Resolved): ceph-fuse: various hangs
... Sage Weil
04:42 PM Revision 3acc4d2c (ceph): rbd-fuse: fix for loop in open_rbd_image()
Remove uninitialized usage of 'int i' as i++ from 'for' loop.
The variale 'i' is never used in this loop and initiali...
Danny Al-Gaaf
04:42 PM Revision b1fc10ef (ceph): messages/MOSDRepScrub.h: initialize member variable in constructor
Initialize chunky and deep bool member variables in the constructor
with false.
Signed-off-by: Danny Al-Gaaf <danny....
Danny Al-Gaaf
04:42 PM Revision db0dbe5d (ceph): msg/Message.h: fix C-style pointer casting
Replace C-style pointer casting with correct static_cast<>().
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
04:42 PM Revision 42682963 (ceph): WorkQueue.h: fix cast
Replace C-style pointer casting with correct static_cast<>().
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
04:42 PM Revision a4042cc3 (ceph): ceph_crypto.cc: remove unused shutdown() outside crypto ifdef's
Fix "out-of-line declaration of a member must be a definition
[-Wout-of-line-declaration]". Remove ceph::crypto::shut...
Danny Al-Gaaf
04:42 PM Revision ad526c0e (ceph): obj_bencher.cc: use vector instead of VLA's
Fix "variable length array of non-POD element type" error. (-Wvla)
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisec...
Danny Al-Gaaf
04:42 PM Revision 0327cbaa (ceph): include/buffer.h: fix operator=
Fix operator=: return "iterator&" instead of 'iterator'. Check if 'this'
equals 'other' before set anything.
Signed-...
Danny Al-Gaaf
04:42 PM Revision d54bd170 (ceph): include/types.h: change operator<< function parameter
Fix "Function parameter 'v' should be passed by reference." from cppchecker.
Use 'const pair<A,B>& v' similar to the ...
Danny Al-Gaaf
04:42 PM Revision 22e48b57 (ceph): include/xlist.h: fix C-style pointer casting
Replace C-style pointer casting with correct static_cast<>().
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
04:41 PM rgw Bug #4039 (Resolved): rgw: bucket info discrepencies
bucket (re)creation ends up clobbering the bucket info stored under user's info. Yehuda Sadeh
03:22 PM Bug #4037 (Resolved): mon: Single-Paxos: on Paxos, FAILED assert(begin->last_committed == last_co...
... Joao Eduardo Luis
01:38 PM CephFS Feature #3626 (Resolved): mds: debug mode to generate traceless replies to clients
Greg Farnum
01:38 PM CephFS Feature #3626: mds: debug mode to generate traceless replies to clients
Merged into master in commit:08b82b3ef6b43283e35fd4e56eb5c78651345bea. Greg Farnum
01:26 PM CephFS Feature #3626 (Fix Under Review): mds: debug mode to generate traceless replies to clients
wip-4036 (commit:4ebba50a15584c89e0c5e4c6e48618055ceb96d8). Testing it now with pjd on a vstart cluster with no trace... Greg Farnum
12:52 PM Bug #4036 (Resolved): init-ceph: assumes write access to /var/run/ceph
I noticed this when using init-ceph on a vstart cluster:... Greg Farnum
12:26 PM rgw Bug #4011 (Resolved): rgw: multipart upload complete does not clean up parts from index
Fixed, commit:b663c097d1e6f41aed9abeadaae80f66fc71f5ec
Recovery tool, commit:2d8faf8e5f15e833e6b556b0f3c4ac92e4a4151...
Yehuda Sadeh
11:49 AM rbd Subtask #4007: libceph: support STAT osd operation
This has turned out to be simple change. It was needed in
rbd as well, and I'll just add support to both under this...
Alex Elder
09:19 AM rbd Subtask #4007: libceph: support STAT osd operation
It wasn't really possible to know this up front but
it looks like this is trivial. I've basically
completed it but...
Alex Elder
11:32 AM rgw Feature #3973 (Need More Info): rgw: Handle requests sent in non-UTC time
Moritz - this seems like an issue with aws-sdk-ruby not reporting time in UTC, rather than our inability to handle a... Ian Colle
11:01 AM CephFS Bug #4035 (Rejected): Ceph doesn't recover from fault on Opensuse (cfuse tests & rbd-cli tests)
I'm not sure if this is exclusive to fs but on an opensuse, single node cluster, when running cfuse and rbd tests a f... Ken Franklin
10:56 AM rbd Bug #3697 (In Progress): rbd copy.sh test failing in nightly
Tamilarasi muthamizhan
10:38 AM Bug #3768 (Resolved): perl is required for logrotate, we need to include Perl as a dependency
commit:0aea4dba040b8caaeb5c4079728078541e5bb2c1 Sage Weil
10:08 AM CephFS Fix #4034 (Resolved): mds: fix replayed ino creation extra_bl
I haven't tested this, but I noticed during code inspection for other things that I believe all our recent fixes for ... Greg Farnum
09:59 AM Bug #4026 (In Progress): mon: Single-Paxos: abort on LogMonitor::update_from_paxos
Joao Eduardo Luis
09:59 AM Bug #4026: mon: Single-Paxos: abort on LogMonitor::update_from_paxos
Haven't been able to reproduce this nor to find an obvious cause for this to have happened.
After inspecting the s...
Joao Eduardo Luis
09:37 AM devops Feature #4032: ceph-disk-prepare should allow the definition of an OSD id
Ah, right. I was thinking we could get into badness over that disagreement, but of course everything checks the real ... Greg Farnum
09:27 AM devops Feature #4032: ceph-disk-prepare should allow the definition of an OSD id
Greg Farnum wrote:
> I don't think we want to do this. The problem is that if we plug in a new OSD that has the same...
Sage Weil
09:21 AM devops Feature #4032: ceph-disk-prepare should allow the definition of an OSD id
I don't think we want to do this. The problem is that if we plug in a new OSD that has the same ID as the previous on... Greg Farnum
05:55 AM devops Feature #4032 (Rejected): ceph-disk-prepare should allow the definition of an OSD id
When replacing disks in existing boxes, sometimes it's useful to keep the existing OSD numbering, rather than start a... Faidon Liambotis
08:56 AM rbd Bug #3958: rbd fsx fails with EBUSY
this is causing several failures on master runs.. something has changed.
latest:
ubuntu@teuthology:/a/sage-2013-...
Sage Weil
08:55 AM Bug #3810 (Resolved): btrfs corrupts file size on 3.7
Sage Weil
08:31 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
The testing I've been doing now has shown no problems
now that teuthology has been updated.
The two other issues ...
Alex Elder
06:16 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
Seems to have done the trick! The kernel_untar_build.sh
task just finished for me without error, and it failed
rel...
Alex Elder
05:06 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
That sounds promising, I hope it works!
This was actually the last thing I was looking at last
night while waitin...
Alex Elder
08:18 AM Bug #3854: mon: clock skew tests failing on master
This was fixed by commit:d74b31b24db647a8b7c80d1552fa6f0b02c54ba4 and commit:c54781618569680898e77e151dd7364f22ac4aa1 Joao Eduardo Luis
07:31 AM Revision ed3c3615 (ceph): nuke: don't try unmount if we're rebooting everything anyway
This can cause issues when unmount hangs. Our automatic runs reboot
everything unconditionally, so this caused a bunc...
Josh Durgin
07:28 AM Revision c6504bab (ceph): nuke: make tmpfs check only umount tmpfs
This would catch things like /tmp/cephtest/mnt.client.0, which are
used by cfuse, rbd, and kclient.
Josh Durgin
07:20 AM rbd Bug #4033 (Fix Under Review): krbd: add barriers near done flag operations
A fix for this has been posted for review.
[PATCH] rbd: add barriers near done flag operations
Alex Elder
06:15 AM rbd Bug #4033 (Resolved): krbd: add barriers near done flag operations
I fixed this problem while investigating the rbd hangs
in http://tracker.ceph.com/issues/4003.
Somehow, I missed ...
Alex Elder
07:19 AM Revision 82273e95 (ceph): rbd: fix rbd image unmount
The testdir param was missing. Avoid this class of errors by unmounting
exactly what we mounted.
Sage Weil
07:01 AM Revision 60990459 (ceph): rbd: set env before running sudo
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:53 AM devops Bug #4031 (Won't Fix): ceph-disk-activate hardcodes journal path, ignores configuration
I'm having my ceph.conf configured to store journals in a different place, like:
[osd]
osd journal = /var/lib/ceph...
Faidon Liambotis
01:56 AM Revision 27fb0e63 (ceph): rgw: a tool to fix buckets with leaked multipart references
Checks specified bucket for the #4011 symptoms, optionally fix
the issue.
sytax:
radosgw-admin bucket check --buck...
Yehuda Sadeh
01:55 AM Revision 50c1775d (ceph): rgw: radosgw-admin object unlink
Add a radosgw-admin option to remove object from bucket index
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
(cher...
Yehuda Sadeh
01:48 AM Revision cc167914 (ceph): rgw: a tool to fix buckets with leaked multipart references
Checks specified bucket for the #4011 symptoms, optionally fix
the issue.
sytax:
radosgw-admin bucket check --buck...
Yehuda Sadeh
01:31 AM Revision 9eff2ee1 (ceph): Merge remote-tracking branch 'gh/wip-osd-commands'
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Joao Luis <joao.luis@inktank.com>
Sage Weil
01:08 AM Revision 4d6964fc (ceph): rgw: radosgw-admin object unlink
Add a radosgw-admin option to remove object from bucket index
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
(cher...
Yehuda Sadeh
12:35 AM Revision 3b635423 (ceph): mon: move list_rules into CrushWrapper method
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:35 AM Revision 9f4d4ac9 (ceph): crush: add list_rules() method
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:14 AM Revision 7f237be2 (ceph): Makefile: Add rgw/logrotate.conf source tarball
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell

02/05/2013

11:56 PM Bug #4030 (Resolved): Missing Fedora 18 release packages
In the docs on OS recommendations:
http://ceph.com/docs/master/install/os-recommendations/
It is mentioned that...
Jens Kristian Søgaard
11:52 PM Revision 16235a7a (ceph): rgw: radosgw-admin object unlink
Add a radosgw-admin option to remove object from bucket index
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
11:52 PM Revision 2d8faf8e (ceph): rgw: a tool to fix buckets with leaked multipart references
Checks specified bucket for the #4011 symptoms, optionally fix
the issue.
sytax:
radosgw-admin bucket check --buck...
Yehuda Sadeh
11:52 PM Revision b663c097 (ceph): rgw: unlink multipart upload parts when completing upload
Fixes: #4011
When completing the multipart upload, we also need to unlink the
parts from the bucket index. Originally...
Yehuda Sadeh
11:43 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
This was backing up qa stuff because the rbd.py qa task wasn't unmounting during cleanup. That bit is now fixed. I ... Sage Weil
10:54 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
new theory:
the reason umount hangs is because nuke is killing the client and osds at the same time. the umount i...
Sage Weil
10:41 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I found that unmount was hanging too. I think somehow the
completion of the I/O is not getting propagated up when
...
Alex Elder
10:33 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
aha:... Sage Weil
10:15 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
btw i am able to reproduce the EBUSY with just... Sage Weil
08:28 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I've added some instrumentation and find that the rbd
client is not dropping its watch at the end of the
kernel_unt...
Alex Elder
12:51 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
The interrupt issue has been fixed, but the other issue
(rbd device can't be unmapped because EBUSY) remains.
I h...
Alex Elder
11:35 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I ran the kernel_untar_build.sh workunit using the
ceph "master" branch and the ceph-client "testing"
branch and go...
Alex Elder
11:13 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I think I found *a* problem, possibly not *the* problem.
This commit:
bc7a62ee5 rbd: prevent open for image ...
Alex Elder
11:04 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I am able to reproduce this problem by running
the kernel_untar_build.sh workunit.
I ran the test using the ceph ...
Alex Elder
08:53 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
> Alex, unless there is another high priority regression, can you
> look at this first?
Yes I will.
Alex Elder
08:52 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
Sam Lang wrote:
> I was able to verify that this happens with an older version of teuthology, one without the change...
Sage Weil
08:41 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I was able to verify that this happens with an older version of teuthology, one without the changes I've made recentl... Sam Lang
05:18 AM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I had the impression this might be a problem that
is holding up completion of the nightly test suite.
But I'm not...
Alex Elder
11:20 PM Revision 99ea3030 (ceph): logrotate.conf: Remove unneeded loop and update new rgw version.
Remove an unneeded for loop from the ceph logrotate.conf, and
update the new rgw logrotate.conf to reload the radosgw...
Gary Lowell
10:54 PM Revision c8eace6f (ceph): rgw: create a separate logrotate file for radosgw
Fixes: #3813
Since radosgw package is separate from the ceph package,
it also needs to have a separate logrotate. The...
Yehuda Sadeh
10:31 PM Revision 43a01c99 (ceph): crush: factor out (trivial) crush_destroy_rule()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:31 PM Revision b9bd482d (ceph): crush: remove_rule() method
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:31 PM Revision a19cdd49 (ceph): osdmap: method to check if a crush ruleset is in use
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:31 PM Revision 2c559a7a (ceph): mon: 'osd crush rule rm <name>'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:31 PM Revision b79067a8 (ceph): qa: add workunits/mon/crush_ops.sh
Test creating, listing, removing crush rules via the mon.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:31 PM Revision c370d85b (ceph): mon: 'osd crush rules list|ls'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:30 PM Revision 9da6290c (ceph): crush: factor out dump_rules from dump
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:30 PM Revision b6036a58 (ceph): mon: 'osd crush dump'
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:30 PM Revision a04d3f0a (ceph): mon: 'osd crush rule create-simple <name> <root> <failure_domain_type>'
Simple command to create simple rules.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:29 PM Revision 1a386d6c (ceph): crush: add_simple_rule() command
Method to create a very generic rule the distributes objects across the
specified failure domain type underneath the ...
Sage Weil
10:26 PM Revision d7ada58a (ceph): crush: fix get_rule_id() return value
There are 0 callers, yay!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:26 PM Revision 3105700d (ceph): mon: 'osd find <osd-id>' command
Simple command to find the ip, host, rack, etc. for an OSD. This is better
than 'ceph osd dump | grep ^osd.NNN\ '.
...
Sage Weil
10:26 PM Revision 4f992ea3 (ceph): crush: add rule_exists()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:20 PM Revision 100e9056 (ceph): misc: Close connections on reboot
When nodes are rebooted, the connections remain open
even after calling reconnect and setting up new ssh
sessions to ...
Sam Lang
10:07 PM Bug #4028 (Duplicate): rbd: qa runs failing to remove image after unmap
sorry, this is actuall a dup of #4003. Sage Weil
09:55 PM Bug #4028: rbd: qa runs failing to remove image after unmap
Is this the fault of the rbd task? What should
be removing the image?
Alex Elder
08:57 PM Bug #4028 (Duplicate): rbd: qa runs failing to remove image after unmap
Pretty consistently reproducible with kernel rbd tasks against master branch, and either master or testing kernel. Sage Weil
09:48 PM Revision b3ffc718 (ceph): Merge branch 'wip-2753-fsync-errors'
Reviewed-by: Sage Weil <sage@inktank.com> Greg Farnum
09:29 PM Revision 72c7bcd5 (ceph): mds: MDSCacheObjectInfo now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:29 PM Revision 49b0be58 (ceph): mds: rename MDSTableServer::_pending to mds_table_pending_t
And move it from MDSTableServer into mdstypes.cc, so we can use it
in ceph-dencoder more gracefully (coming up next!)...
Greg Farnum
09:29 PM Revision 924fb18f (ceph): mds: mds_table_pending_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 78632adc (ceph): mds: modernize SimpleLock on-wire encoding
This is a wire protocol change.
Signed-off-by: Sage Weil <sage@inktank.com>
Greg Farnum
09:29 PM Revision 21901d21 (ceph): mdsmap: uninline encode/decode
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 771204b2 (ceph): mds: move conditional MDSMap encoding into single encode method
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 50ab924a (ceph): mds: MDSMap now uses modern encoding
Signed-off-by: Greg Farnum <greg@inktank.com> Greg Farnum
09:29 PM Revision ad40bdd8 (ceph): MDSMap: mds_info_t now uses modern encoding
We have to update the older-style MDSMap encodings to generate
the previous versions for clients as well.
Signed-off...
Greg Farnum
09:29 PM Revision 90d93d94 (ceph): mds: build dencoder with more stuff
Add libosdc and perfglue/disabled_heap_profiler to the
dencoder, because those are required for the MDS stuff
we're a...
Greg Farnum
09:29 PM Revision c058285d (ceph): mds: uninline Capability encoders
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:29 PM Revision 08124fc7 (ceph): mds: SnapInfo, snaplink_t, sr_t now use modern encoding
This commit doesn't enable the dencoder integration due
to some build and compile issues, but we'll turn it
on later....
Greg Farnum
09:29 PM Revision 3e728931 (ceph): client: rename client/SnapRealm files to avoid automake build conflict
We are about to move the MDS' SnapRealm into its own files, which conflicts.
The MDS is more important, so it wins th...
Greg Farnum
09:29 PM Revision f1baa79c (ceph): mds: move SnapRealm into its own h/cc files
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:29 PM Revision 34f7c715 (ceph): mds: inode_back{trace,pointer}_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 2888830f (ceph): mds: default_file_layout now uses modern encoding
And move implementations into mdstypes.cc from CInode and common/types.
mdstypes.cc sadly lives in libcommon; as seve...
Greg Farnum
09:29 PM Revision a892671a (ceph): mds: frag_info_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com> Greg Farnum
09:29 PM Revision cda5590e (ceph): mds: rename struct default_file_layout to file_layout_policy_t
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 9cf9c54c (ceph): mds: nest_info_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 49eb4d68 (ceph): mds: client_writeable_range_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 87da20bf (ceph): mds: fold byte_range_t into client_writeable_range_t
As part of this, fold byte_range_t into it as a sub-struct
and eliminate its free-standing functions (it's too small
...
Greg Farnum
09:29 PM Revision 843a3521 (ceph): mds: inode_t now uses modern encoding
And we move implementations and the dumper into mdstypes.cc (from
mdstypes.h and common/types.cc).
Signed-off-by: Sa...
Greg Farnum
09:29 PM Revision 7e85a178 (ceph): remove common/types.cc
It no longer has a purpose; the functions it used to host are now
implemented in mds/mdstypes.cc and more properly be...
Greg Farnum
09:29 PM Revision bd4897b8 (ceph): mds: old_inode_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 44865800 (ceph): mds: fnode_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 91555e22 (ceph): mds: move durable Session bits into session_info_t
This keeps the on-disk structure explicitly separate from the in-memory
functional stuff.
Signed-off-by: Sage Weil <...
Greg Farnum
09:29 PM Revision f57f4244 (ceph): mds: session_info_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision d580a581 (ceph): mds: string_snap_t now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision 43468a3b (ceph): osd: remove DecayCounter header
Neither the OSD nor the PG makes any use of this.
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:29 PM Revision ccba2ced (ceph): mds: add CEPH_FEATURE_MDSENC feature bit
This will cover the MDS cluster on upgrade, and determine which encoding
of the MDSMap they use for clients.
Signed-...
Greg Farnum
09:29 PM Revision e35b89ec (ceph): mds: Anchor now uses modern encoding
Signed-off-by: Sage Weil <sage@inktank.com>
Signed-off-by: Greg Farnum <greg@inktank.com>
Greg Farnum
09:28 PM Revision ece1c0f8 (ceph): mon: check correct length of command
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
08:47 PM Bug #3971: can't attach rbd image volume to instance
1) The log shows an attempt to open volume-ade3b6fb-2386-4d10-9472-16cd4f955faa; this isn't the same volume you show ... Khanh Nguyen Dang Quoc
08:41 PM Bug #3971: can't attach rbd image volume to instance
The log shows it trying to access an rbd_header.volume-ade3b6fb-2386-4d10-9472-16cd4f955faa object without looking at... Josh Durgin
08:15 PM Bug #3971: can't attach rbd image volume to instance
1) The log shows an attempt to open volume-ade3b6fb-2386-4d10-9472-16cd4f955faa; this isn't the same volume you show ... Dan Mick
07:48 PM Bug #3971: can't attach rbd image volume to instance
yes sure, restarted all.
Please refer to the attached file for more detail.
Thanks.
Khanh Nguyen Dang Quoc
03:49 PM Bug #3971: can't attach rbd image volume to instance
Did you restart the monitors and osds after you set auth supported = none in the global section of every /etc/ceph/ce... Josh Durgin
08:16 PM Bug #4027 (Resolved): ceph-fuse on opensuse12 has the wrong requirement name for libfuse dependency
Instead of fuse-libs it should require libfuse2. This is likely specific to opensuse, but should double check others... Anonymous
07:13 PM Revision 13e22262 (ceph): Merge pull request #39 from dachary/master
Relax Throttle::_reset_max conditions and associated unit tests Greg Farnum
07:06 PM Revision 64ded02c (ceph): Relax Throttle::_reset_max conditions and associated unit tests
Removes a condition in Throttle::_reset_max by which the waiting queue is only
Signal()ed if the new maximum is lower...
Loïc Dachary
06:29 PM Revision ca2d6459 (ceph): os: default to 'journal aio = true'
Hooray, testing indicates this is a win!
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:20 PM Revision e43a13c0 (ceph): Merge pull request #36 from cmello/master
libexpat dependency Greg Farnum
05:57 PM Revision 26f7db11 (ceph): Merge pull request #38 from alram/master
Fixes in ./docs/radosgw/config.rst John Wilkins
05:33 PM RADOS Feature #3807 (Resolved): crush: simple commands to create common rules
commit:9eff2ee13dc03f245a11c91f4ed7d5bc15c55aef Sage Weil
05:31 PM rgw Bug #4011: rgw: multipart upload complete does not clean up parts from index
actually does not affect bobtail. Will still need to port the fix tool to bobtail. Yehuda Sadeh
12:55 PM rgw Bug #4011 (Resolved): rgw: multipart upload complete does not clean up parts from index
Yehuda Sadeh
05:17 PM Bug #4026 (Resolved): mon: Single-Paxos: abort on LogMonitor::update_from_paxos
While running teuthology with 20+ monitors, the monitor workloadgen with 10 osds, and mon thrasher, we triggered the ... Joao Eduardo Luis
04:38 PM Revision da10b58d (ceph): task/ceph_manager: Fix NoneType config issue
kill_mon is getting a config set to None, which blows
up now due to the check for powercycle. Initialize
the config ...
Sam Lang
04:22 PM CephFS Feature #3626 (In Progress): mds: debug mode to generate traceless replies to clients
Server::set_trace_dist() sets several things on the reply:
*snapbl
*head.is_dentry
*head.is_target
*trace_bl
H...
Greg Farnum
02:35 PM CephFS Bug #1435: mds: loss of layout policies upon mds restart
Wait, never mind. Too excited and didn't look closely enough at the projected node struct! :) Greg Farnum
01:56 PM CephFS Bug #4023 (New): kclient: d_revalidate is abusing d_parent
See Viro's email to linux-fsdevel, http://marc.info/?l=linux-fsdevel&m=135968126020360&w=2 .
We probably need t...
Sage Weil
01:52 PM CephFS Bug #2753 (Resolved): Writes to mounted Ceph FS fail silently if client has no write capability o...
wip-2753-fsync errors merged and pushed in commit:b3ffc718c93b7daa75841778b5d50ea3bc5fcc53 and fsync works properly o... Greg Farnum
01:47 PM CephFS Feature #4022 (New): client: qa: test non-cached operation (force sync mode)
Right now it's possible to run the client without going through the cacher. This isn't tested at all right now. It's ... Greg Farnum
01:47 PM rbd Feature #4021 (Resolved): rbd: openstack: add ability to copy volume to image for rbd
Ian Colle
01:46 PM rbd Subtask #4020 (Resolved): rbd: openstack: simplify volume booting with new api: make image boot b...
Ian Colle
01:44 PM rbd Subtask #4019 (Resolved): rbd: openstack: simplify volume booting with new api: add boot option t...
Ian Colle
01:44 PM rbd Subtask #4018 (Resolved): rbd: openstack: simplify volume booting with new api: modify boot panel...
Ian Colle
01:42 PM rbd Feature #4017 (Resolved): rbd: openstack: simplify volume booting with new api
Ian Colle
01:42 PM rbd Feature #4013 (In Progress): rbd: openstack: extend nova boot api to support going from image to ...
Ian Colle
01:24 PM rbd Feature #4013 (Resolved): rbd: openstack: extend nova boot api to support going from image to volume
Ian Colle
01:41 PM rbd Subtask #4016 (Resolved): rbd: openstack: extend nova boot api: modify libvirt driver to support ...
Ian Colle
01:40 PM rbd Subtask #4015 (Resolved): rbd: openstack: extend nova boot api: add block_dev_mapping_v2 to nova-...
Ian Colle
01:40 PM rbd Subtask #4014 (Resolved): rbd: openstack: extend nova boot api: add block_dev_mapping_v2 to nova-api
Ian Colle
01:13 PM rbd Bug #4012 (Won't Fix): rbd: image creation behaviour has to be uniform across bobtail and argonau...
rbd allows images to be created with size 0 in bobtail, but it fails in argonaut.
similarly,while in bobtail it do...
Tamilarasi muthamizhan
12:52 PM rbd Bug #4010 (Fix Under Review): krbd: turn off interrupts for open/remove locking
Posted for review.
[PATCH] rbd: turn off interrupts for open/remove locking
Alex Elder
12:49 PM rbd Bug #4010 (Resolved): krbd: turn off interrupts for open/remove locking
This fix is done. The problem was discovered while
investigating http://tracker.ceph.com/issues/4003.
This commi...
Alex Elder
11:40 AM Bug #4009 (Duplicate): osd reports map e6 wrongly marked me down
... Tamilarasi muthamizhan
10:37 AM Bug #3683 (In Progress): mon: leak of MMonPaxos
ubuntu@teuthology:/a/teuthology-2013-02-04_20:00:03-regression-bobtail-master-basic/15658 Tamilarasi muthamizhan
10:34 AM devops Feature #4008 (Resolved): ceph-deploy: make sure new version works with old ceph-disk_*
Sage Weil
10:12 AM rbd Bug #3697: rbd copy.sh test failing in nightly
recent log : ubuntu@teuthology:/a/teuthology-2013-02-04_20:00:03-regression-bobtail-master-basic/15773 Tamilarasi muthamizhan
09:49 AM Linux kernel client Bug #3997 (Fix Under Review): xfs: insert memory barriers before wake_up_bit()
The first patch was ACK'd by Dave Chinner.
The second one he explained wasn't needed,
because an atomic increment a...
Alex Elder
07:42 AM rbd Subtask #4007 (Resolved): libceph: support STAT osd operation
In order to do layered writes we need to check whether
an object to be written exists before issuing the write.
Thi...
Alex Elder
06:06 AM Revision 2ebf4d06 (ceph): osd: kill unused addr-based send_map()
Not used, old API, bad.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit e359a862199c8a94cb238...
Sage Weil
06:06 AM Revision bac5b144 (ceph): osd: share incoming maps via Connection*, not addrs
Kill a set of parallel methods that are using the old addr/inst-based
msgr APIs, and instead use Connection handles. ...
Sage Weil
06:06 AM Revision 9ca3a165 (ceph): osd: pass new maps to dead osds via existing Connection
Previously we were sending these maps to dead osds via their old addrs
using a new outgoing connection and setting th...
Sage Weil
06:06 AM Revision 4cb28b6e (ceph): osd: requeue osdmaps on heartbeat connections for cluster connection
If we receive an OSDMap on the cluster connection, requeue it for the
cluster messenger, and process it there where w...
Sage Weil
06:05 AM Revision e4f7ff8c (ceph): msgr: add get_loopback_connection() method
Return the Connection* for ourselves, so we can queue messages for
ourselves.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
06:05 AM Revision 6af5da7a (ceph): mds: handle ceph.*.layout.* setxattr
Allow individual fields of file or dir layouts to be set via setxattr.
Signed-off-by: Sage Weil <sage@inktank.com>
(...
Sage Weil
06:05 AM Revision d386622c (ceph): mds: allow dir layout/policy to be removed via removexattr on ceph.dir....
This lets a user remove a policy that was previously set on a dir.
Signed-off-by: Sage Weil <sage@inktank.com>
(cher...
Sage Weil
06:05 AM Revision 62ed62f5 (ceph): qa: add layout_vxattrs.sh test script
Test virtual xattrs for file and directory layouts.
TODO: create a data pool, add it to the fs, and make sure we can...
Sage Weil
06:05 AM Revision c0af056e (ceph): mdsmap: backported is_data_pool()
This roughly corresponds to mainline commit 99d9e1d.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:20 AM Revision d41b5411 (ceph): Edit endpoint-create in ./doc/radosgw/config.rst
internalurl and adminurl are mandatory. Typo in publicurl. Alexandre Marangone
05:14 AM Revision 6e603301 (ceph): Edit rgw keystone url in ./doc/radosgw/config.rst
Won't work with the public port, it needs to be the admin port. Alexandre Marangone
05:09 AM Revision af8cac11 (ceph): Note on host in ./doc/radosgw/config.rst
Some people have configured host with a FQDN or an IP
which prevents /etc/init.d/radosgw start to launch the daemon.
Alexandre Marangone
12:42 AM Revision 4b4dba30 (ceph): doc: Updated to note bobtail supports RGW + Keystone.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins

02/04/2013

10:49 PM Revision e2e1de27 (ceph): cli test: add pg deep-scrub option to test
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
10:23 PM Bug #4006 (Resolved): osd: repeating 'wrong node' message in log
Two users now (paravoid an xiaoxi in #ceph) have reported seeing repeated "... - wrong node!" messages in the osd log... Sage Weil
10:14 PM Revision eba8697e (ceph): cli test: add pg deep-scrub option to test
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
08:56 PM Revision 4a6924a5 (ceph): install: remove perl dependency
Change the filter in logrotate to use sed instead of perl, and remove the
package dependency on perl.
Signed-off-by:...
Gary Lowell
08:09 PM Revision 0407af46 (ceph): mds: fix client view of dir layout when layout is removed
We weren't handling the case where the projected node has NULL for the
layout properly. Fixes the client's view when...
Sage Weil
08:09 PM Revision 8ce834d3 (ceph): client: note presence of dir layout in inode operator<<
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 84751489ca208964e617516e04556722008ddf67)
Sage Weil
08:09 PM Revision 99824b93 (ceph): client: list only aggregate xattr, but allow setting subfield xattrs
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit ba32ea9454d36072ec5ea3e6483dc3daf9199903)
Sage Weil
08:09 PM Revision 809cff48 (ceph): client: implement ceph.file.* and ceph.dir.* vxattrs
Display ceph.file.* vxattrs on any regular file, and ceph.dir.* vxattrs
on any directory that has a policy set.
Sign...
Sage Weil
08:09 PM Revision 13babca3 (ceph): client: move xattr namespace enforcement into internal method
This captures libcephfs users now too.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit febb96...
Sage Weil
08:09 PM Revision 65ab5174 (ceph): client: allow ceph.* xattrs
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit ad7ebad70bf810fde45067f78f316f130a243b9c)
Sage Weil
07:52 PM rgw Bug #3294: Ceph S3 API test
I researched this error many times by results are so bad,
Thank to "lollipop king", you are very good :D
--
Ta...
tuan ta ba
07:45 PM Revision 804ffc63 (ceph): Add "pg deep-scrub..." missing from ceph usage output
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
David Zafman
07:33 PM Revision 6f3c1cd2 (ceph): rgw_rest: Make fallback uri configurable.
Some HTTP servers, notabily lighttp, do not set SCRIPT_URI, make the fallback
string configurable.
Signed-off-by: ca...
caleb miles
07:28 PM Bug #3979: Ceph 0.56.2 RPM does not install
Gary,
I did that on a freshly kickstarted system already. I'm unsure how much fresher I can get the system without ...
Steven Presser
06:57 PM Bug #3979: Ceph 0.56.2 RPM does not install
Hi Steven -
I looked at the kickstart file and I did not see anything that looked suspicious. At the moment we're...
Anonymous
07:24 PM Revision f57d1b4c (ceph): rgw: fix setting of NULL to string
Fixes: #3777
s->env->get() returns char * and not string and can return NULL.
Also, remove some old unused code.
Sig...
Yehuda Sadeh
07:23 PM Revision 9019fbbe (ceph): rgw: fix setting of NULL to string
Fixes: #3777
s->env->get() returns char * and not string and can return NULL.
Also, remove some old unused code.
Sig...
Yehuda Sadeh
06:50 PM Bug #3768 (Fix Under Review): perl is required for logrotate, we need to include Perl as a depend...
Anonymous
06:50 PM Bug #3736 (Resolved): kernel build: failures starting in 3.8-rc1

The problem that resulting in this bug being opened originally has been solved with the update patch. I've created...
Anonymous
06:45 PM Feature #4005 (New): Add perftools to the kernel debian package script
Currently on the kernel gitbuilder we install a patch to the debian package script in order to build the performance ... Anonymous
06:41 PM Bug #4004 (In Progress): Intermittent kernel build failures
Anonymous
06:39 PM Bug #4004 (Can't reproduce): Intermittent kernel build failures
From time to time the kernel builds will fail in the packaging step with a gzip internal error, usually EINVAL on a w... Anonymous
06:39 PM Revision 2f41f81d (ceph): misc: don't use colon in default run name
LD_LIBRARY_PATH does not work with colons (and backslash does not escape them.) Josh Durgin
06:39 PM Revision 55687240 (ceph): OSD: check for empty command in do_command
Fixes: #3878
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
(...
Samuel Just
06:35 PM Bug #3788 (Resolved): debian source packages are missing
Resolved with the following commits:
commit e3e0b40f1b44e2458e47f31bedaa91408dc294c9
Author: Gary Lowell <gary.lo...
Anonymous
05:53 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
I really can't tell who's got a watch on the header
object. It should be getting removed when the object
gets unma...
Alex Elder
05:02 PM rbd Bug #4003: rbd: EBUSY errors from rbd unmap
There is clearly something that is keeping the rbd image
from getting removed. I reproduced this with just running
...
Alex Elder
04:12 PM rbd Bug #4003 (In Progress): rbd: EBUSY errors from rbd unmap
This sounds familiar, but I'm going to look a little
more closely to see if I can learn why it's happening.
Alex Elder
04:03 PM rbd Bug #4003 (Resolved): rbd: EBUSY errors from rbd unmap
From the teuthology kernel untar task on rbd, we get EBUSY trying to unmap. I'm investigating that this isn't someho... Sam Lang
05:27 PM Revision 60432d9b (ceph): perf_counters.cc: remove twice included header files
Cleanup includes, remove twice included "global/global_init.h" and
"common/ceph_context.h".
Signed-off-by: Danny Al-...
Danny Al-Gaaf
05:27 PM Revision 558b238c (ceph): testmsgr.cc: remove twice included <sys/stat.h>
Cleanup includes, remove twice included <sys/stat.h>.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision c81a9d4a (ceph): ceph-filestore-dump.cc: remove twice included <iostream>
Cleanup includes, remove twice included <iostream>.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision e0acc330 (ceph): xattr_bench.cc: remove twice included <time.h>
Cleanup includes, remove twice included <time.h>.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision 81979909 (ceph): MDS.cc: remove twice included common/errno.h
Cleanup includes, remove twice included common/errno.h.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision c8aeb93d (ceph): small_io_bench*.cc: remove twice included <iostream>
Cleanup includes, remove twice included <iostream>.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision d141f79b (ceph): tp_bench.cc: remove twice included <iostream>
Cleanup includes, remove twice included <iostream>.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision 94210414 (ceph): test_idempotent.cc: remove twice included "os/FileStore.h"
Cleanup includes, remove twice included "os/FileStore.h".
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision b22d641d (ceph): workload_generator.cc: remove twice included "common/debug.h"
Cleanup includes, remove twice included "common/debug.h"
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:27 PM Revision b70d563f (ceph): testxattr.cc: remove twice included <iostream>
Cleanup includes, remove twice included <iostream>.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
05:14 PM Revision 4e29c95d (ceph): mon: enforce reweight be between 0..1
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Joao Luis <joao.luis@inktank.com>
Sage Weil
04:48 PM CephFS Bug #2753 (Fix Under Review): Writes to mounted Ceph FS fail silently if client has no write capa...
Okay, I've checked that the kernel client deals correctly with an fsync (it'll return EPERM). The client branch wip-2... Greg Farnum
03:49 PM CephFS Feature #3540: mds: maintain per-file backpointers on first file object
Comments on Github for this. Greg Farnum
02:21 PM CephFS Feature #3540: mds: maintain per-file backpointers on first file object
I've pushed the additional changes for rename and sentinel to the wip-bt2 branch. Those bits are still untested, but... Sam Lang
10:45 AM CephFS Feature #3540 (In Progress): mds: maintain per-file backpointers on first file object
The initial review happened last week; Sam has some updates for the rename and sentinel object infrastructure now but... Greg Farnum
02:58 PM CephFS Cleanup #1499: mds: clean up directory layouts
there is an old branch that does this. note that this code changed with the wip-vxattrs work, so the rebase needs to ... Sage Weil
02:34 PM CephFS Feature #3953: kclient: get/set layout via virtual xattrs
in testing branch, passing tests. yay! Sage Weil
02:33 PM CephFS Feature #3953 (Resolved): kclient: get/set layout via virtual xattrs
Sage Weil
02:30 PM Bug #3810: btrfs corrupts file size on 3.7
I believe the btrfs patch fixed this issue. I consider this bug closed. Mike Lowe
02:15 PM rgw Cleanup #3777: rgw: audit code for reading NULL env variables
commit:9019fbbe8f84f530b6a8700dfe99dfeb03e0ed3d Yehuda Sadeh
01:41 PM rgw Cleanup #3777 (Resolved): rgw: audit code for reading NULL env variables
Fixes merged into bobtail, next, master. Yehuda Sadeh
01:24 PM rgw Feature #2941: rgw: improve streaming read performance
Bunch of comments on Github for this.
Given some of them it also needs more testing before going into master. :)
Greg Farnum
12:36 PM CephFS Feature #4002 (Resolved): mds: design fsck
Ian Colle
10:40 AM Bug #3787: Ceph OSD crashes on ceph tell osd.x
ushed to bobtail, commit:55687240b2de20185524de07e67f42c3b1ae6592 Sage Weil
10:37 AM Bug #3787: Ceph OSD crashes on ceph tell osd.x
Should this be backported to Bobtail? Ian Colle
10:35 AM CephFS Feature #3242 (New): samba: push plugin upstream
I believe we decided to hold off on putting more effort into this. Greg Farnum
10:34 AM CephFS Feature #3542 (Duplicate): mds: migration path for existing anchors, anchortables, etc.
Closing in favor of #4000 and #4001. Greg Farnum
10:34 AM Bug #3938 (Can't reproduce): ceph-mon crashed on mixed bobtail-argonaut cluster (2 argonaut mons,...
After a couple of days trying to reproduce this issue (and massively failing at it), and given the lack of debug info... Joao Eduardo Luis
10:33 AM CephFS Feature #4001 (Resolved): Implement the migration path from using the AnchorTable to using lookup...
Actually do whatever #4000 specifies. Greg Farnum
10:31 AM CephFS Feature #4000 (Resolved): Design a migration path from using the AnchorTable to using lookup-by-ino
We're currently engaged in work to do lookup-by-ino when we get an ino we don't recognize. However, any old installs ... Greg Farnum
10:27 AM CephFS Feature #3999 (Resolved): update CDir encoding
Either following or as part of #2177, we should update CDir on-disk encoding (and possibly wire encoding) to be versi... Greg Farnum
10:25 AM CephFS Cleanup #3998 (Resolved): mds: split up mdstypes
Right now we have mdstypes and it contains both MDS-exclusive and client-shared structs. Split it up into "metadata_t... Greg Farnum
10:03 AM CephFS Feature #3543: mds: new encoding
Got an early review from Sage; now waiting for a merge review and for test results from the FS suites, which are dela... Greg Farnum
09:59 AM CephFS Bug #3951 (Resolved): ceph-fuse: permissions error on create
Fixed in master commit:cf7c3f7d3fc7b8dc3a08a4fbe4ca1c10f2cb6054 and tested that it solves the problem. Greg Farnum
09:45 AM CephFS Bug #3935 (Need More Info): kclient: Big directory access bugs (multiple), mixed 32- and 64-bit c...
Sage Weil
08:18 AM Linux kernel client Bug #3997: xfs: insert memory barriers before wake_up_bit()
And here is something Sage provided that led me to believe
this could be the source of the problem. I'm not sure ho...
Alex Elder
08:17 AM Linux kernel client Bug #3997: xfs: insert memory barriers before wake_up_bit()
Sorry, I meant to include these in the last one:
[PATCH 1/2] xfs: memory barrier before wake_up_bit()
[PATCH 2/2]...
Alex Elder
08:16 AM Linux kernel client Bug #3997: xfs: insert memory barriers before wake_up_bit()
I have posted two patches to the XFS mailing list for review.
I am also waiting for a build to complete before doing...
Alex Elder
08:03 AM Linux kernel client Bug #3997 (Resolved): xfs: insert memory barriers before wake_up_bit()
I looked at this briefly last week and found what could explain
a hang on an osd node due to a bug in XFS. I ran it...
Alex Elder
04:09 AM Revision 55c1bcf6 (ceph): Add testdir param to get_valgrind_args() calls
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang

02/03/2013

05:38 PM Revision a5ba4f6a (ceph): Merge branch 'wip-misc-fixes'
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
05:28 PM Revision b970d054 (ceph): qa: smalliobenchrbd workunit
Run a bunch of parallel smalliobenchrbd processes.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:09 PM Revision 887e93e7 (ceph): nuke.py: Allow name of job/run to be specified
Nuke will cleanup the base test directory by default, but can
cleanup the test directory for a given run if specified...
Sam Lang
05:09 PM Revision 46d3ff94 (ceph): run.py: Add target name to logging info
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
04:59 PM Revision 4be48a6e (ceph): Merge remote-tracking branch 'gh/wip-rbd-bench'
Conflicts:
ceph.spec.in
debian/ceph-test.install
src/.gitignore
Sage Weil
04:18 PM Revision ada803db (ceph): rbd: fix .format() call with {1} syntax
IndexError: tuple index out of range Sage Weil
03:28 PM Bug #3995: OSD heartbeat-crashes during startup
Good news: the OSD has recovered after a couple of restarts. Artem Grinblat
01:58 PM Bug #3995 (Resolved): OSD heartbeat-crashes during startup
OSD can't start, it does something then crashes with a heartbeat assertion.
Debian, ceph version 0.56.2 (586538e22af...
Artem Grinblat
02:55 PM Bug #3996 (Resolved): mon: 'ceph mon add' results in dubious return message
Disclaimer: this might be only present on the current single-paxos branch (maybe due to some mistaken conflict resolu... Joao Eduardo Luis
05:01 AM Revision fe9fb49e (ceph): ceph_manager: use get() for self.config powercycle checks
I think this is what is going on...
Traceback (most recent call last):
File "/var/lib/teuthworker/teuthology-maste...
Sage Weil
02:11 AM Bug #3948: problems from leveldb static linkage and leveldb downgrade
It's not really urgent, but being able to upgrade to latest argonaut (and if that works for 2-3 days) to latest bobta... Corin Langosch

02/02/2013

08:56 PM Bug #3948: problems from leveldb static linkage and leveldb downgrade
Corin Langosch wrote:
> After the downgrade my cluster is still stable and no osd crashed so far.
>
> What can I ...
Sage Weil
02:55 AM Bug #3948: problems from leveldb static linkage and leveldb downgrade
After the downgrade my cluster is still stable and no osd crashed so far.
What can I do to upgrade to latest argon...
Corin Langosch
05:00 PM Revision 7280980f (ceph): Fixup latest commits that use /tmp/cephtest.
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
04:41 PM Bug #3966 (Resolved): osdthrasher: does tell on osd just after restarting it
fixed in tuethology commit:fadc22c0b9e1755b1d1826fcfe8be71e28574bc9 Sage Weil
04:40 PM Bug #3854 (Resolved): mon: clock skew tests failing on master
Sage Weil
04:40 PM Bug #3994 (Closed): ceph-osd crash under little to no load
... Sage Weil
02:10 PM Bug #3994: ceph-osd crash under little to no load
Also potentially of interest is the kernel log having some btrfs checksum failures:
btrfs csum failed ino 583798 ext...
Matthew Via
02:09 PM Bug #3994: ceph-osd crash under little to no load
It died again, here is the log output:
https://pastee.org/fbgch
Matthew Via
02:07 PM Bug #3994 (Closed): ceph-osd crash under little to no load
One of my osd's crashed a number of times in a row, and was repeatably enough that I had time to set the debugging le... Matthew Via
07:26 AM Revision 606b5c15 (ceph): Merge branch 'wip-rpm-update3'
Patches to ceph.spec.in and addition of rbd-fuse package. Gary Lowell

02/01/2013

11:39 PM Revision bd4f1d5c (ceph): adding task for iogen
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
10:07 PM Revision d9fff40f (ceph): task/chdir-coredump: Use readlink -e
realpath isn't available everywhere, use readlink -e instead.
Signed-off-by: Sam Lang <sam.lang@inktank.com>
Sam Lang
08:07 PM Revision 9a9fe73e (ceph): task/ceph: Fix typo in previous commit
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
07:31 PM Revision 04210c26 (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
07:30 PM Revision d050fe1e (ceph): doc: Minor edits.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:01 PM Revision 9de9ebcf (ceph): nuke: get_testdir_base needs to be imported
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
06:35 PM Revision 0797be3f (ceph): rgw: key indexes are only link to user info
Instead of keeping multiple copies of the user info,
we just treat the key index as a pointer to the actual
user info...
Yehuda Sadeh
05:45 PM Revision edfe5eed (ceph): nuke: Fix cleanup of test dir
Nuke used to remove /tmp/cephtest, now it tries to
remove the test dir, which it may not have the name
for. Instead ...
Sam Lang
05:37 PM Revision 4ebd90eb (ceph): task/ceph: Initialize disk_config maps
The mount_options and fstype maps need to be
initialized properly for later.
Signed-off-by: Sam Lang <sam.lang@inkta...
Sam Lang
04:53 PM Revision 150a3d7d (ceph): misc: Don't include existing partitions in devs
We don't want to include /dev/sda1, etc. in the
list of devices to use.
Signed-off-by: Sam Lang <sam.lang@inktank.com>
Sam Lang
04:33 PM devops Feature #3907 (Fix Under Review): ceph-deploy: be verbose about what is run and what is done (wit...
Sage Weil
04:33 PM devops Feature #3913 (Fix Under Review): ceph-deploy: break mon into create/destroy
Sage Weil
04:33 PM devops Feature #3920 (Fix Under Review): ceph-deploy: support other deb-based distros
Sage Weil
04:32 PM devops Feature #3918 (Fix Under Review): ceph-deploy: osd create HOST:DIR[:JOURNAL]
Sage Weil
04:32 PM devops Feature #3993 (Resolved): upstart/sysvinit: control whether crush position is readjusted on start
Sage Weil
04:16 PM Revision 3806dc5e (ceph): task/ceph: Fix device list
dict.items() returns a tuple, whereas we want
the values().
Signed-off-by: Sam Lang <sam.lang@inktank.com>
Sam Lang
03:13 PM Revision 64e39667 (ceph): misc: get_wwn_id_map() needs to return dict
If we can't find device ids, we need to return
a dict, not a list.
Signed-off-by: Sam Lang <sam.lang@inktank.com>
Sam Lang
02:52 PM rgw Feature #3667 (Fix Under Review): rgw: support extra canned acl params
Ian Colle
02:50 PM rgw Feature #3992 (Resolved): rgw: refactor internal user API for RGW Admin
Ian Colle
02:43 PM rgw Feature #3991 (Resolved): rgw: dr: region mgt changes: define datastructures
Ian Colle
02:42 PM rgw Feature #3990 (Resolved): rgw: dr: implement new version objclass
Ian Colle
02:40 PM rgw Feature #3989 (Resolved): rgw: dr: region mgt changes: radosgw admin changes
Ian Colle
02:38 PM rgw Feature #3988 (Resolved): rgw: dr: region mgt changes: define/implement internal API
Ian Colle
02:36 PM rgw Feature #3987 (In Progress): rgw: dr: region mgt changes: extend json parser with json decoder
Ian Colle
02:36 PM rgw Feature #3987 (Resolved): rgw: dr: region mgt changes: extend json parser with json decoder
Ian Colle
02:31 PM Linux kernel client Feature #3974 (Resolved): libceph: use data length rather than nr_pages
commit 012d5bda1c0f229494c67098d00edfa24c531ea5
Author: Alex Elder <elder@inktank.com>
Date: Thu Jan 31 16:02:00 ...
Alex Elder
02:24 PM Revision dcf99e43 (ceph): nuke: Optionally check console status
Only check the ipmi console status if the ipmi
parameters have been defined in .teuthology.yaml.
Signed-off-by: Sam ...
Sam Lang
02:20 PM Revision ac4ba69d (ceph): misc: Fix get_wwn_id_map() to be optional
Not all plana nodes have symlinks setup when
we check /dev/disk/by-id/wwn-*. Instead of failing
here, just use the /...
Sam Lang
02:18 PM rbd Subtask #3741 (Resolved): krbd: rework request tracking code
commit 9ac90ea3d8dd6ab82f3665a132ca29e6ada56ad8
Author: Alex Elder <elder@inktank.com>
Date: Thu Nov 22 00:00:08 ...
Alex Elder
02:17 PM rbd Feature #3754 (Closed): krbd: use new request tracking code for notify ack
commit 1c8c3c5c571607a188203142020d80aa58e5e280
Author: Alex Elder <elder@inktank.com>
Date: Fri Nov 30 17:53:04 ...
Alex Elder
02:16 PM rbd Tasks #3755: krbd: use new request tracking code for sync object operations
commit 5d08568324f53368f927cc10927b1b105533c044
Author: Alex Elder <elder@inktank.com>
Date: Thu Jan 17 12:25:27 ...
Alex Elder
01:44 PM rbd Tasks #3755 (Resolved): krbd: use new request tracking code for sync object operations
commit 304819b1a49937753ee01aa7ccf8d66547a0be36
Author: Alex Elder <elder@inktank.com>
Date: Sat Jan 19 00:30:28 ...
Alex Elder
02:11 PM rbd Feature #3877 (Closed): krbd: don't wait for notify ack to complete
commit a8a34efcac7a33e7631fe8bf25530bd4be0417f8
Author: Alex Elder <elder@inktank.com>
Date: Thu Jan 17 12:18:46 ...
Alex Elder
01:57 PM devops Feature #3909 (Resolved): ceph-deploy: update install for bobtail/argonaut urls
Dan Mick
01:56 PM devops Feature #3923 (Resolved): ceph-deploy: discover HOST
Dan Mick
01:48 PM Subtask #3986 (Rejected): Send to ceph-dev for review
Ian Colle
01:48 PM Subtask #3985 (Rejected): api: Send to DH for Review
Ian Colle
01:47 PM Feature #3984 (Resolved): api: Send Out DRAFT REST API for Review
Ian Colle
01:46 PM Revision 933cc3c3 (ceph): run.py: Fix argument parsing for --name
With the addition of the --name argument to the
teuthology program (run.py), jobs were failing
because --name was bei...
Sam Lang
01:45 PM Feature #3983 (Resolved): api: create initial DRAFT REST API Design
Ian Colle
01:38 PM rbd Bug #3940 (Resolved): krbd: decrement obj request count when deleting
commit 150fde1984ec8454c163e4f89a50416cd68edbc4
Author: Alex Elder <elder@inktank.com>
Date: Fri Jan 25 17:08:55 ...
Alex Elder
01:38 PM rbd Bug #3937 (Resolved): krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
commit 8d93192992301f8c3a288c8cf4dc8598ac4b8427
Author: Alex Elder <elder@inktank.com>
Date: Fri Jan 25 17:08:55 ...
Alex Elder
01:37 PM rbd Bug #3427 (Resolved): krbd: unmap does not remove block device properly
commit bc7a62ee52cffc735cb8383b6d26648883f1a01e
Author: Alex Elder <elder@inktank.com>
Date: Mon Jan 14 12:43:31 ...
Alex Elder
01:37 PM Linux kernel client Bug #3800 (Resolved): libceph: check compatibility between ceph modules
commit 4f6e0e37c103675df11c8e3d836d64cd24b31734
Author: Alex Elder <elder@inktank.com>
Date: Wed Jan 30 11:13:33 ...
Alex Elder
01:36 PM Linux kernel client Bug #3799 (Resolved): libceph/rbd: bio refs are messed up
commit dfcc01f9f093ea960289e40ca2e73334c708c0f2
Author: Alex Elder <elder@inktank.com>
Date: Wed Jan 30 11:13:33 ...
Alex Elder
01:36 PM Linux kernel client Bug #3798 (Resolved): libceph/rbd: take reference to all bio's in list
commit dfcc01f9f093ea960289e40ca2e73334c708c0f2
Author: Alex Elder <elder@inktank.com>
Date: Wed Jan 30 11:13:33 ...
Alex Elder
01:35 PM Linux kernel client Bug #3976: libceph: add some #ifdef CONFIG_BLOCK in messenger
Sorry, I made a mistake and had to rebase.
commit 1eded6f9903ff388e7af08b2037fc3f3981cdfb2
Author: Alex Elder <el...
Alex Elder
01:32 PM Linux kernel client Bug #3976 (Resolved): libceph: add some #ifdef CONFIG_BLOCK in messenger
commit a88b6b32770dc97b303cda7eade2feade3b945df
Author: Alex Elder <elder@inktank.com>
Date: Thu Jan 31 16:02:01 ...
Alex Elder
01:35 PM Linux kernel client Bug #3875: osd_client: don't use r_num_pages for bio requests
Sorry, I made a mistake and had to rebase.
commit 012d5bda1c0f229494c67098d00edfa24c531ea5
Author: Alex Elder <el...
Alex Elder
01:32 PM Linux kernel client Bug #3875 (Resolved): osd_client: don't use r_num_pages for bio requests
commit 06224afd90f261256b1e0a0db2334f39c21872a9
Author: Alex Elder <elder@inktank.com>
Date: Thu Jan 31 16:02:00 ...
Alex Elder
12:54 PM Feature #3982: Performance tests on branches that change the way pg info is stored
Includes creating teuthology tasks. Ian Colle
12:48 PM Feature #3982: Performance tests on branches that change the way pg info is stored
Need to look at xfs and btrfs (possibly ext4) with small IOs to determine whether to put the fixed sized chunk of the... Ian Colle
12:37 PM Feature #3982 (Resolved): Performance tests on branches that change the way pg info is stored
David Zafman
12:48 PM rbd Bug #1740 (Resolved): krbd: don't return head data when reading from a non-existent snapshot
This was fixed a while ago. Josh Durgin
12:32 PM Feature #3891 (In Progress): osd: move purged_snaps out of info
Ian Colle
12:07 PM rgw Feature #3981 (New): rgw: handle really large buckets
Yehuda Sadeh
11:58 AM rbd Bug #3980 (Won't Fix): rbd image created with size zero on a mixed cluster crashes rbd
creating a rbd image with size 0 is allowed in bobtail but not on argonaut.
on a mixed cluster running argonaut[bu...
Tamilarasi muthamizhan
11:13 AM Bug #3979: Ceph 0.56.2 RPM does not install
Nope, pretty vanilla install, other than the kernel.
I've attached the kickstart file. The cluster is managed by ...
Steven Presser
10:51 AM Bug #3979: Ceph 0.56.2 RPM does not install
Still failing on the chown of /etc/ceph ? Are you by any chance using selinix features, or anything that might cause... Anonymous
10:19 AM Bug #3979: Ceph 0.56.2 RPM does not install
Nope, issue persists on a fresh install of the node. I'm not sure what information would be helpful, but if you let ... Steven Presser
09:44 AM Bug #3979: Ceph 0.56.2 RPM does not install
Nope, Running CentOS 6.3 with a custom kernel (3.6.9-vanilla at the moment). Give me about half an hour and I'll kic... Steven Presser
09:42 AM Bug #3979: Ceph 0.56.2 RPM does not install
Hi Steven -
Are you running Arch Linux ? If so, can you tell me the version, and also the versions of the rpm and...
Anonymous
09:39 AM Bug #3979: Ceph 0.56.2 RPM does not install
Hi,
I installed and upgraded Ceph RPMS on my fresh CentOS VM without issue. I can provide more information if needed.
Anonymous
09:29 AM Bug #3979 (In Progress): Ceph 0.56.2 RPM does not install
Anonymous
09:00 AM Bug #3979 (Rejected): Ceph 0.56.2 RPM does not install
Hey all,
I run a local Ceph mirror for a cluster. I mirrored the 0.56.2 RPMS this morning and went to update my nod...
Steven Presser
10:53 AM rgw Bug #3620 (Resolved): rgw:improve multiple user access keys scalability
Fix merged into master, commit:0797be3f86df8b413256d69e3770ec39ed6e6912. Yehuda Sadeh
09:50 AM Feature #3890 (Resolved): osd: create tool to extract pg info and pg log from filestore
David Zafman
05:51 AM Revision fd1512fc (ceph): Build: Add -n to files and description for rbd-fuse in ceph.sepc.in
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
05:04 AM Revision de01bddb (ceph): Makefile: Install new rdb-fuse.8 man page
Signed-off-by: Gary Lowell <gary.lowell@inktank.com> Gary Lowell
04:35 AM Revision 16cf9dc6 (ceph): build: Add new rbd-fuse package
rdb-fuse is a new facility to map ceph rdb images to files.
Signed-off-by: Gary Lowell <gary.lowell@inktank.com>
Gary Lowell
04:32 AM Revision 7d1e8254 (ceph): Revert "Don't install rbd-fuse binary"
This reverts commit 35e5d74e5c5786bc91df5dc10b5c08c77305df4e.
-> fix build instead
Danny Al-Gaaf
03:29 AM Revision 334568e0 (ceph): rbd-fuse: quick and dirty manpage
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
02:44 AM Revision 91f8c3c8 (ceph): rbd-fuse: quick and dirty manpage
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
12:52 AM Revision 340b1cfe (ceph): ceph-filestore-dump.cc: don't use po::value<string>()->required()
Don't use po::value<string>()->required() since this breaks build on
RHEL/CentOs6. Check if the options are set as in...
Danny Al-Gaaf
12:34 AM Revision 1ee46c5e (ceph): doc: Added more detail to SSD section. Links to performance blogs.
fixes: #3960
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
12:19 AM Revision c6d26efc (ceph): Merge pull request #37 from alram/master
Add important note in doc/radosgw/config.rst Yehuda Sadeh

01/31/2013

11:58 PM Revision 2292fa6a (ceph): Add important note in doc/radosgw/config.rst
For CentOS and similar, FastCgiWrapper is turned on by default.
This causes Apache to spawn radosgw processes.
Alexandre Marangone
10:17 PM Revision 129a6600 (ceph): ceph-filestore-dump.cc: don't use po::value<string>()->required()
Don't use po::value<string>()->required() since this breaks build on
RHEL/CentOs6. Check if the options are set as in...
Danny Al-Gaaf
10:16 PM Revision 4c1d8d08 (ceph): ceph.spec.in: don't move libcephfs_jni files around
Don't move libcephfs_jni files around from %{_libdir} to /usr/lib/jni/
in the buildroot. They should be placed in %{_...
Danny Al-Gaaf
10:16 PM Revision 6e09cb9b (ceph): ceph.spec.in: extend fix for libedit-devel on special SUSE versions
Extend fix for libedit-devel on special SUSE versions, use ncurses
also on src/ocf/Makefile and src/java/Makefile
Si...
Danny Al-Gaaf
10:16 PM Revision 9235271a (ceph): ceph.spec.in: fix file section for ceph-resource-agents
Create needed dirs (/usr/lib/ocf/resource.d/ceph) for the ceph-resource-agents
subpackage.
Signed-off-by: Danny Al-G...
Danny Al-Gaaf
10:16 PM Revision 9b16036e (ceph): ceph.spec.in: move libcephfs_jni.so to ceph-devel
Move libcephfs_jni.so to the ceph-devel package since so-files they
shouldn't be part of the library package.
Signed...
Danny Al-Gaaf
09:06 PM Revision 3f53c3f0 (ceph): Validate format strings for CLS_ERR/CLS_LOG
cls_log needed __attribute__((format(printf..)) to allow the compiler
to crosscheck format strings and arguments. Af...
Dan Mick
08:59 PM Revision fadc22c0 (ceph): ceph_manager: wait for admin socket on restart, use for set_config
Fixes: #3966
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:51 PM rbd Bug #3978 (Resolved): krbd qa: concurrent.sh test leaves something read-only
I don't know what exactly is happening here, but it appears
that after running the "rbd/concurrent.sh" workunit, if
...
Alex Elder
08:01 PM Bug #3948: problems from leveldb static linkage and leveldb downgrade
Well, after downgrading them they seem to work stable again. If it's related to leveldb, then upgrading leveldb as th... Corin Langosch
07:50 PM Bug #3948: problems from leveldb static linkage and leveldb downgrade
Both osd.7 and osd.15 have corrupted leveldb state. It's likely related to downgrading and then upgrading leveldb. Samuel Just
07:40 PM Bug #3948: problems from leveldb static linkage and leveldb downgrade
Hi Sage!
Today I was brave and upgraded two more nodes (one has 1 osd, the other 3 osds). I worked for some time b...
Corin Langosch
07:18 PM Bug #3971: can't attach rbd image volume to instance
Does 'rbd ls volumes' show volume-5529a8cd-28db-4a72-a0f0-f7b2a221cf8d?
-> yes, i can see it
Khanh Nguyen Dang Quoc
07:01 PM Bug #3971: can't attach rbd image volume to instance
+These're all information need to verify:
root@master:~# dpkg -l | grep librbd
ii librbd1 ...
Khanh Nguyen Dang Quoc
01:44 PM Bug #3971: can't attach rbd image volume to instance
Does 'rbd ls volumes' show volume-5529a8cd-28db-4a72-a0f0-f7b2a221cf8d?
If so, could you provide a few more detail...
Josh Durgin
02:20 AM Bug #3971 (Rejected): can't attach rbd image volume to instance
my env:
libvirt-bin: 0.9.13-0ubuntu12.1~cloud0
ceph : 0.56.1
+ i tried disable module apparmor from system.
+...
Khanh Nguyen Dang Quoc
05:57 PM Cleanup #3977 (Resolved): Do a great stream operator const cleanup!
I just spent a little while trying to figure out why the compiler couldn't resolve operator<< (the stream operator) o... Greg Farnum
05:55 PM Revision 97c6619d (ceph): qa: update the rbd/concurrent.sh workunit
A few changes, now that a few rbd problems have been fixed.
First, the more substantive changes:
- Generate a sou...
Alex Elder
05:37 PM Documentation #3960: [Document bug]MON and MDS do not need a ssd for data storage.
John Wilkins:
What do you mean by:
>One way Ceph accelerates filesystem performance is to segregate the storage of ...
Xiaoxi Chen
04:35 PM Documentation #3960 (Resolved): [Document bug]MON and MDS do not need a ssd for data storage.
I removed the reference to monitors, added detail on sequential write throughput, and a link to an example for a CRUS... John Wilkins
05:14 PM Revision 8f9267cf (ceph): thrashosds: note assumption for powercycling
Josh Durgin
02:55 PM Revision 8e566f6f (ceph): marginal/osd_powercycle: OSD powercycle thrashing
Tasks to run while thrashing osds using ipmi to powercycle.
This currently runs in the marginal suite only.
Signed-o...
Sam Lang
02:45 PM Linux kernel client Bug #3976 (Resolved): libceph: add some #ifdef CONFIG_BLOCK in messenger
There are two spots in the messenger code that would
cause a build failure if CONFIG_BLOCK weren't define.
I've a...
Alex Elder
02:23 PM Revision 77e8d801 (ceph): Remove console.py
Handling of ipmi via the console is now done through the
Console class in teuthology/orchestra/remote.py.
Signed-off...
Sam Lang
02:23 PM Revision 8f720454 (ceph): Assign devices to osds using the device wwn
Linux doesn't guarantee device names (/dev/sdb, etc.)
are always mapped to the same disk. Instead of assigning
nomin...
Sam Lang
02:23 PM Revision 58111595 (ceph): Support power cycling osds/nodes through ipmi
This patch defines a RemoteConsole class associated
with each Remote class instance, allowing
power cycling a target ...
Sam Lang
02:23 PM Revision 87b98496 (ceph): add --name option to teuthology
Signed-off-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Josh Durgin <josh.durgin@inktank.com>
Sam Lang
02:23 PM Revision ace4cb07 (ceph): Replace /tmp/cephtest/ with configurable path
Teuthology uses /tmp/cephtest/ as the scratch test directory for
a run. This patch replaces /tmp/cephtest/ everywher...
Sam Lang
02:23 PM Revision 3eb19c81 (ceph): Replace /tmp/cephtest/ with configurable path
Teuthology uses /tmp/cephtest/ as the scratch test directory for
a run. This patch replaces /tmp/cephtest/ everywher...
Sam Lang
02:21 PM rbd Bug #3975 (Rejected): librbd: xfstests 008 failed inside qemu
This one's not a problem. This test pokes random holes in a
file (or maybe fills random spots). And when done it s...
Alex Elder
02:05 PM rbd Bug #3975 (Rejected): librbd: xfstests 008 failed inside qemu
From xfstests output in ubuntu@teuthology:/a/teuthology-2013-01-29_20:00:04-regression-bobtail-master-basic/7794/remo... Josh Durgin
02:12 PM devops Feature #3912 (Fix Under Review): ceph-deploy: break osd into create/destroy
Sage Weil
02:12 PM devops Feature #3923 (Fix Under Review): ceph-deploy: discover HOST
commit:56b996b76f37fb6a7c3ffc812e87a8cbd6f8c3b8 Sage Weil
02:12 PM devops Feature #3909 (Fix Under Review): ceph-deploy: update install for bobtail/argonaut urls
commit:3c1e4d1d73556560e06686843ed1010174b5ffda Sage Weil
02:01 PM Bug #3970 (Resolved): cls_log should be declared with __attribute__(format..) so -Wformat validat...
commit:3f53c3f016ab0db1a33848ac406239dc07204ea2
Dan Mick
01:59 PM Linux kernel client Feature #3974 (Resolved): libceph: use data length rather than nr_pages
While looking at http://tracker.ceph.com/issues/3875 I learned
that the nr_pages field in a ceph message is never re...
Alex Elder
01:58 PM Revision 14730276 (ceph): Fixes for syntax errors found by pyflakes.
This patch includes minor fixes to the teuthology
python code for syntax errors found by running
check-syntax.sh (whi...
Sam Lang
01:56 PM Revision 3390cc30 (ceph): Scripts to use pyflakes to check python syntax.
pyflakes runs a basic syntax checker against python code.
The added check-syntax.sh script and Makefile run pyflakes
...
Sam Lang
01:00 PM Bug #3966: osdthrasher: does tell on osd just after restarting it
pushed fix to master, fadc22c0b9e1755b1d1826fcfe8be71e28574bc9 (teuthology) Samuel Just
12:15 PM Revision c3468f76 (ceph): PGMap: fix -Wsign-compare warning
Fix -Wsign-compare compiler warning:
mon/PGMap.cc: In member function 'void PGMap::apply_incremental
(CephContext*,...
Danny Al-Gaaf
11:22 AM Bug #3906: ceph-mon leaks memory during peering
So, today I upgraded my whole cluster to 0.56.2, then added a bunch more OSDs (from 84 -> 144). At peering time monit... Faidon Liambotis
09:09 AM rgw Feature #3973 (New): rgw: Handle requests sent in non-UTC time
Executing a S3 Request using the following Date Header... Moritz Krinke
07:35 AM Bug #3972 (Resolved): new boost dependency: libboost-program-options
libboost-program-options is now required to build master, this prerequisite is not mentioned in the documentation. caleb miles
06:50 AM Revision 0758faba (ceph): Add ceph-filestore-dump to the packaging
Feature: #3890
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Dan Mick <dan.mick@inktank.com>
David Zafman
05:18 AM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
I disabled scrubbing using
> ceph osd tell \* injectargs '--osd-scrub-min-interval 1000000'
> ceph osd tell \* in...
Sylvain Munaut
12:16 AM Revision 8fd8534b (ceph): osd_types: add recovery counts to object_sum_stats_t
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 4aea19ee60fbe1106bdd71de2d172aa2941e8aab)
Sage Weil
12:16 AM Revision 8ab77bd4 (ceph): osd: track recovery ops in stats
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit a2495f658c6d17f56ea0a2ab1043299a59a7115b)
Sage Weil
12:16 AM Revision 8d2d396c (ceph): mon/PGMap: include timestamp
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 76e9fe5f06411eb0e96753dcd708dd6e43ab2c02)
Sage Weil
12:16 AM Revision 7f149cf6 (ceph): mon/PGMap: report recovery rates
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 208b02a748d97378f312beaa5110d8630c853ced)
Sage Weil
12:16 AM Revision 7fd7a5ee (ceph): mon/PGMap: report IO rates
This does not appear to be very accurate; probably the stat values we're
displaying are not being calculated correctl...
Sage Weil
12:16 AM Revision 5a6b9af9 (ceph): mon: smooth pg stat rates over last N pgmaps
This smooths the recovery and throughput stats over the last N pgmaps,
defaulting to 2.
Signed-off-by: Sage Weil <sa...
Sage Weil

01/30/2013

11:41 PM Revision ab778cb1 (ceph): doc: v0.56.2 release notes
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:21 PM Revision 4a950aa9 (ceph): Move read_log() function to prep for next commit
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
David Zafman
10:21 PM Revision 3c8d7d78 (ceph): osd: create tool to extract pg info and pg log from filestore
New application ceph-filestore-dump created that mounts filstore
and can dump info or log in JSON when an OSD is not ...
David Zafman
08:52 PM Revision a63fac32 (ceph): task: mon_clock_skew_check: use absolute value when comparing mon_skew
The monitors may report either positive or negative clock skews, and by
not using an absolute value we were constantl...
Joao Eduardo Luis
08:52 PM Revision 89e09fa9 (ceph): task: mon_clock_skew_check: mark as ran once if an expected skew was found
... even if we didn't get a clean/finished result from the monitors
This ought to significantly cut the waiting time...
Joao Eduardo Luis
07:50 PM Revision b571f8ee (ceph): PGMap: fix -Wsign-compare warning
Fix -Wsign-compare compiler warning:
mon/PGMap.cc: In member function 'void PGMap::apply_incremental
(CephContext*,...
Danny Al-Gaaf
07:32 PM Revision b0d4dd21 (ceph): test_libcephfs: fix xattr test
Ignore the ceph.*.layout xattrs.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:02 PM Bug #3970: cls_log should be declared with __attribute__(format..) so -Wformat validates the form...
Dan Mick
04:57 PM Bug #3970 (Resolved): cls_log should be declared with __attribute__(format..) so -Wformat validat...
It'll involve some changes to callers to fix all the harmless errors, but may find some significant
ones and avoid a...
Dan Mick
04:49 PM Bug #3938 (In Progress): ceph-mon crashed on mixed bobtail-argonaut cluster (2 argonaut mons, 1 b...
Have a cluster set-up and ready to start trying to reproduce this in the morning. Joao Eduardo Luis
02:13 PM Bug #3938: ceph-mon crashed on mixed bobtail-argonaut cluster (2 argonaut mons, 1 bobtail)
No, didn't have it set up. I could probably reproduce if necessary. Samuel Just
04:32 PM devops Feature #3965: upstart: ulimit -n hardcoded; doesn't use 'max open files' config setting
I guess there are settings in the upstart config files, but they aren't derived from ceph.conf.
I imagine there are w...
Dan Mick
11:09 AM devops Feature #3965 (Rejected): upstart: ulimit -n hardcoded; doesn't use 'max open files' config setting
3900 tweaked the setting of ulimit -n "max open files" on all daemons in the cluster, but,
at present, we only have...
Dan Mick
03:37 PM CephFS Feature #3540 (Fix Under Review): mds: maintain per-file backpointers on first file object
Initial implementation in wip-bt. Needs review. Sam Lang
02:13 PM Bug #3883: osd: leaks memory (possibly triggered by scrubbing) on argonaut
The burnupi57 cluster (wip-f) does not appear to be leaking after all, the osds seem to have leveled off at around 35... Samuel Just
02:10 PM rbd Bug #3937: krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
The patch is reviewed and ready to push to the testing
branch, and I will do that in a day or so.
I'm going to le...
Alex Elder
02:09 PM rgw Feature #3968 (Resolved): https should work for rest-bench
Trying to set the protocol to https by using the --protocol=https flag does not work. ... Kevin Horan
02:08 PM rbd Bug #3940: krbd: decrement obj request count when deleting
Reviewed and ready to push to master. Will do that in a day or so. Alex Elder
02:07 PM rbd Bug #3427: krbd: unmap does not remove block device properly
Reviewed and ready to push to the ceph-client "testing" branch.
I'm going to wait a day or two before pushing this...
Alex Elder
01:34 PM Linux kernel client Bug #3967 (Resolved): libceph: complete linger requests only once
Currently if a linger request gets resubmitted by the osd
client, its callback function (if provided) will get calle...
Alex Elder
01:05 PM Documentation #3960 (In Progress): [Document bug]MON and MDS do not need a ssd for data storage.
You are correct. The machines and processes would only boot a bit faster. The way to accelerate metadata servers is t... John Wilkins
12:12 PM Bug #3966 (Resolved): osdthrasher: does tell on osd just after restarting it
figured out where the thrasher errors are coming from:... Sage Weil
11:31 AM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
...and to answer your other question Alex, there's now a workunit test Sage just added
in c782d2ac531cbb7650968e62f0...
Dan Mick
11:00 AM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
Josh thinks 32-bitness probably doesn't matter, and remembers problems with snapshots that were fixed long ago; I gue... Dan Mick
10:55 AM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
I don't know if Sage tested 32-bit, or if it matters, and no, that script was just a reproduction scenario; as far as... Dan Mick
06:25 AM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
So is this then a request to port whatever it was that
fixed the problem back to 3.2?
If so, how do we prioritize...
Alex Elder
01:10 AM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
added test to suite, commit:c782d2ac531cbb7650968e62f0b24e6136a64359 Sage Weil
12:15 AM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
This works fine on current testing 3.6.0-00210-g8cc17ca Sage Weil
11:16 AM rbd Bug #3961 (Resolved): 32-bit cls_rbd tries cls_log with %d for 64-bit int, segfaults
commit:e253830abac76af03c63239302691f7fac1af381 on next
Dan Mick
09:37 AM rbd Subtask #3741: krbd: rework request tracking code
My testing on this code is nearly complete. However, I'm going
to hold off on pushing this (along with the changes ...
Alex Elder
06:34 AM rbd Subtask #3741: krbd: rework request tracking code
Alex Elder
09:30 AM Linux kernel client Bug #3740 (Resolved): ceph-client: change to be based on 3.8-rc2
I have finished my testing and have now updated the
ceph-client "testing" branch to be based on 3.8-rc5,
with the p...
Alex Elder
06:14 AM Linux kernel client Bug #3740: ceph-client: change to be based on 3.8-rc2
I discussed this with Sage yesterday. We're now up to
Linux 3.8-rc5. Merging our testing branch into v3.5-rc5
pro...
Alex Elder
09:08 AM Revision 0c872491 (ceph): rbd: add rbd_cli_misc with map-snapshot-io.sh
Sage Weil
09:06 AM Revision c782d2ac (ceph): qa: add test for rbd map and snapshots
This tests for the behavior reported in #3964. It passes on the current
code, but fails on 3.2 in squeeze (and 32-bi...
Sage Weil
09:05 AM Revision 6b493502 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
08:56 AM Linux kernel client Bug #3798 (In Progress): libceph/rbd: take reference to all bio's in list
It looks like the extra reference that the osd client requires
of the first bio on the list isn't necessary. Nor wo...
Alex Elder
08:41 AM Linux kernel client Bug #3800: libceph: check compatibility between ceph modules
Sage, I already implemented the fix, and it's pretty trivial,
and it's generally useful. By "won't fix" do you mean...
Alex Elder
07:54 AM Revision 586538e2 (ceph): v0.56.2
Gary Lowell
07:40 AM Linux kernel client Bug #3799 (In Progress): libceph/rbd: bio refs are messed up
Looking at the code here, the osd client isn't really doing
anything with the bio pointer. It is simply a middleman...
Alex Elder
07:34 AM Revision bcb8dfad (ceph): cls_rbd, cls_rgw: use PRI*64 when printing/logging 64-bit values
caused segfaults in 32-bit build
Fixes: #3961
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil ...
Dan Mick
07:32 AM Revision e253830a (ceph): cls_rbd, cls_rgw: use PRI*64 when printing/logging 64-bit values
caused segfaults in 32-bit build
Fixes: #3961
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Reviewed-by: Sage Weil ...
Dan Mick
06:47 AM rbd Bug #3927 (Closed): krbd: I/O errors (ENXIO) during rbd/kernel.sh workunit
It turns out this new behavior is a good thing, we're just
reporting errors now where we apparently did not previous...
Alex Elder
06:47 AM rbd Bug #3745 (Rejected): krbd: individual response errors are ignored
I no longer believe this is a problem. Although there is no
aggregate result value for a collection of osd requests...
Alex Elder
06:36 AM Linux kernel client Bug #3959 (Duplicate): krbd: decrement img_request->obj_request_count when deleting
Found it! http://tracker.ceph.com/issues/3940
already documents this.
Alex Elder
06:35 AM rbd Feature #3877: krbd: don't wait for notify ack to complete
Alex Elder
06:35 AM rbd Tasks #3755: krbd: use new request tracking code for sync object operations
Alex Elder
06:35 AM rbd Feature #3754: krbd: use new request tracking code for notify ack
Alex Elder
03:48 AM Revision 77f57411 (ceph): mds: move lexical_cast and assert re-#include to the top
We should keep the re-#includes immediately following the offender, and
documented.
Signed-off-by: Sage Weil <sage@i...
Sage Weil
03:11 AM Bug #3948: problems from leveldb static linkage and leveldb downgrade
Hi Sage,
does it matter that the OSD is now down for around 1-2 days or will it just pickup any changes made to th...
Corin Langosch
03:00 AM Revision 35e5d74e (ceph): Don't install rbd-fuse binary
fixes packaging warnings
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
02:43 AM Revision 23923ee9 (ceph): mds/Server.cc: fix warring assert.h's
New include boost/lexical_cast.hpp apparently drags in the system
assert.h on quantal and squeeze at least, breaking ...
Dan Mick
02:42 AM Revision 25e9a0be (ceph): mon: require name for 'auth add ...' command
Otherwise we interpret the empty string as 'unknown.'.
Fixes: #3956
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
02:19 AM Bug #3595: ceph-osd and ceph-mds crash on Debian Squeeze
root@cluster:~# ceph-osd
Segmentation fault
root@cluster:~# ceph-osd -h
Segmentation fault
root@cluster:~# ceph-...
Jörg Blank
01:07 AM Revision a731da99 (ceph): Merge remote-tracking branch 'origin/wip-fuse-create-fix'
Reviewed-by: Greg Farnum <greg@inktank.com> Greg Farnum
01:05 AM Revision e9a6694d (ceph): client: return errors to the user if fsync fails
To do so, we allow callers of _flush(Inode) to pass in a Context
as well. This Context is then given to the ObjectCac...
Greg Farnum
01:03 AM RADOS Feature #3807 (Fix Under Review): crush: simple commands to create common rules
see wip-osd-commands Sage Weil
12:49 AM Revision 5a7c5088 (ceph): init-ceph: make ulimit -n be part of daemon command
ulimit -n from 'max open files' was being set only on the machine
running /etc/init.d/ceph. It needs to be added to ...
Dan Mick
12:48 AM Revision 84a024b6 (ceph): init-ceph: make ulimit -n be part of daemon command
ulimit -n from 'max open files' was being set only on the machine
running /etc/init.d/ceph. It needs to be added to ...
Dan Mick
12:34 AM Revision c2e50e58 (ceph): Merge remote-tracking branch 'gh/wip-recovery-stats-b'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
12:26 AM Revision 1564c3a0 (ceph): Merge branch 'wip-vxattr'
Reviewed-by: Sam Lang <sam.lang@inktank.com>
Reviewed-by: Greg Farnum <greg@inktank.com>
Sage Weil
12:25 AM Revision ba32ea94 (ceph): client: list only aggregate xattr, but allow setting subfield xattrs
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:25 AM Revision 84751489 (ceph): client: note presence of dir layout in inode operator<<
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:25 AM Revision 09f28541 (ceph): mds: fix client view of dir layout when layout is removed
We weren't handling the case where the projected node has NULL for the
layout properly. Fixes the client's view when...
Sage Weil
12:25 AM Revision ebebf72f (ceph): mds: handle ceph.*.layout.* setxattr
Allow individual fields of file or dir layouts to be set via setxattr.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:25 AM Revision db31a1f9 (ceph): mds: allow dir layout/policy to be removed via removexattr on ceph.dir....
This lets a user remove a policy that was previously set on a dir.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:25 AM Revision 61fbe27a (ceph): qa: add layout_vxattrs.sh test script
Test virtual xattrs for file and directory layouts.
TODO: create a data pool, add it to the fs, and make sure we can...
Sage Weil
12:24 AM Revision e51299fb (ceph): mds: open mydir after replay
In certain cases, we may replay the journal and not end up with the
dirfrag for mydir open. This is fine--we just ne...
Sage Weil
12:24 AM Revision ad7ebad7 (ceph): client: allow ceph.* xattrs
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
12:24 AM Revision febb9650 (ceph): client: move xattr namespace enforcement into internal method
This captures libcephfs users now too.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:24 AM Revision 3f82912a (ceph): client: implement ceph.file.* and ceph.dir.* vxattrs
Display ceph.file.* vxattrs on any regular file, and ceph.dir.* vxattrs
on any directory that has a policy set.
Sign...
Sage Weil

01/29/2013

11:40 PM rbd Bug #3964: krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd image with sn...
... Dan Mick
11:28 PM rbd Bug #3964 (Won't Fix): krbd: 32-bit, kernel 3.2.0 system can't do O_DIRECT writes to mapped rbd i...
fghaas reported, I reproduced on a precise 32-bit system:
create an image, map, writes work fine, even with dd ofl...
Dan Mick
11:06 PM Bug #3963 (Won't Fix): cls_log should check should_gather before vsnprintf()
1) faster
2) would have allowed workaround for 3961
Dan Mick
11:04 PM Bug #2481 (Won't Fix): ceph tell has almost no error reporting
this should get cleaned up with whatever refactor we do with the api work, but not worth spending time on individuall... Sage Weil
11:01 PM Bug #3577 (Can't reproduce): osd missing reported by osd_recovery.test_incomplete_pgs workload
we fixed several things that could explain this. Sage Weil
11:01 PM Bug #3595 (Need More Info): ceph-osd and ceph-mds crash on Debian Squeeze
Is this still a problem with the bobtail packages? Sage Weil
10:58 PM Bug #2721 (Resolved): Ceph status does not work in 0.48 even if it is still documented
wrong monitor version was running Sage Weil
10:57 PM Bug #2647 (Can't reproduce): osd: old request, waiting for subops
Sage Weil
10:56 PM Bug #2500 (Resolved): osd: unprotected ::decodes in ReplicatedPG::do_osd_ops
cleaned up ages ago Sage Weil
10:55 PM Bug #1197 (Resolved): osd: make inconsistent state durable
this got fixed in commit:2475066c3247774a2ad048a2e32968e47da1b0f5 Sage Weil
10:54 PM Bug #3646 (Resolved): pg_temp with two down/out osds
commit:6122a9f62f9eeae1410d1703fecb8939a35fb03f Sage Weil
10:46 PM rbd Bug #3961 (Resolved): 32-bit cls_rbd tries cls_log with %d for 64-bit int, segfaults
32-bit system: rbd create i -s 1; rbd rm i causes death of osd in cls_log();
presumably this is because of cls_log(%...
Dan Mick
10:42 PM Revision 59ac4d35 (ceph): qa: add rbd/concurrent workunit
This defines a new workunit shell script that performs a bunch of
rbd operations concurrently in order to exercise co...
Alex Elder
10:35 PM Revision 3bc21143 (ceph): ObjectCacher: fix flush_set when no flushing is needed
C_GatherBuilder takes ownership of the Context we pass it. Deleting it
in flush_set after constructing the C_GatherBu...
Josh Durgin
10:10 PM RADOS Feature #3807: crush: simple commands to create common rules
ceph osd crush rule list
ceph osd crush rule create-simple <name> <root> <failure domain>
ceph osd crush rule create-...
Sage Weil
10:04 PM Revision 19f42731 (ceph): peer: fix filtering out of scrub from pg state
Sage Weil
09:59 PM Revision 95677fc5 (ceph): mon: OSDMonitor: only share osdmap with up OSDs
Try to share the map with a randomly picked OSD; if the picked monitor is
not 'up', then try to find the nearest 'up'...
Joao Eduardo Luis
09:59 PM Revision e4d76cb8 (ceph): utime: fix narrowing conversion compiler warning in sleep()
Fix compiler warning:
./include/utime.h: In member function 'void utime_t::sleep()':
./include/utime.h:139:50: warnin...
Danny Al-Gaaf
09:33 PM Documentation #3960 (Resolved): [Document bug]MON and MDS do not need a ssd for data storage.
From :http://ceph.com/docs/master/install/hardware-recommendations/#data-storage
it says:
Since the storage requi...
Xiaoxi Chen
09:17 PM Revision a8964107 (ceph): rgw: fix crash when missing content-type in POST object
Fixes: #3941
This fixes a crash when handling S3 POST request and content type
is not provided.
Signed-off-by: Yehud...
Yehuda Sadeh
08:38 PM Linux kernel client Bug #3959 (Duplicate): krbd: decrement img_request->obj_request_count when deleting
Each image request keeps a count of its object requests.
Adding a object request to or deleting one from an image
r...
Alex Elder
08:34 PM Feature #2472: osd: add opaque 'class <name> <foo>' cap that class can interpret/enforce
Sage Weil
08:34 PM CephFS Bug #1946 (Resolved): snapshot inherits timestamp/size/etc from modified trunk dir upon mds restart
commit:7842bb50c7814cc16c22589bf41df7db1f7492eb Sage Weil
08:33 PM Feature #3890 (Fix Under Review): osd: create tool to extract pg info and pg log from filestore
In final review to merge from wip-3890 branch. David Zafman
08:33 PM Bug #3126 (Can't reproduce): mds crashed bool CDir::check_rstats()
we'll see i this comes up with all of yan's fixes in now. Sage Weil
08:33 PM rbd Bug #3566 (Resolved): log max new = 1 can cause hang on process exit
fixed a few weeks ago, commit:813787af3dbb99e42f481af670c4bb0e254e4432 and a few prior commits Sage Weil
08:32 PM Bug #3125 (Resolved): Assertion Error in peer.py - failure from the nightly run
this is fixed up now, most recent commit was 3772d437dd4c562a6490f84124eb4757e22eca92 Sage Weil
08:26 PM rbd Bug #3958 (Resolved): rbd fsx fails with EBUSY
... Sage Weil
07:41 PM CephFS Bug #3553 (Won't Fix): MDS core dumped running 0.48.2argonaut
if/when see this on bobtail or later, we'll investigate. Sage Weil
07:32 PM Bug #3878 (Rejected): osd: nobackfill flag doesn't work
it works. it just doesn't leave the pg in backfill_wait, as i was expecting. Sage Weil
07:30 PM Bug #3836 (Resolved): osd: common/Mutex.cc: 94: FAILED assert(r == 0) in PG::start_flush()
in bobtail, commit:e6bceeedb0b77d23416560bd951326587470aacb Sage Weil
07:24 PM rgw Bug #3365: Broken metadata (duplicated as CSV)
Sage Weil wrote:
> Aaron Schulz wrote:
> > Ian Colle wrote:
> > > Aaron are you still seeing this?
> >
> > Sorr...
Aaron Schulz
12:31 PM rgw Bug #3365 (Can't reproduce): Broken metadata (duplicated as CSV)
Thanks for trying to reproduce this on Bobtail, Aaron. I'm moving it to Can't Reproduce. Ian Colle
12:26 PM rgw Bug #3365: Broken metadata (duplicated as CSV)
I'm having a hard time reproducing this on bobtail. If I remove the metadata normalization code in the MediaWiki/Clou... Aaron Schulz
07:07 PM Bug #3938: ceph-mon crashed on mixed bobtail-argonaut cluster (2 argonaut mons, 1 bobtail)
is there a core for this? Sage Weil
06:51 PM Revision 7cd4e50d (ceph): client: Wait for caps to flush when flushing metadata.
Embarrassingly, this conditional has been backwards since
I committed it in 818e7939. But we want to do the wait when...
Greg Farnum
06:44 PM Revision 11e1f3ac (ceph): ReplicatedPG: make_snap_collection when moving snap link in snap_trimmer
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry...
Samuel Just
06:43 PM Bug #3957 (Resolved): new #include breaks assert.h (again)
Dan Mick
06:40 PM Bug #3957 (Resolved): new #include breaks assert.h (again)
#include <boost/lexical_cast.hpp> in mds/Server.cc apparently re-includes the system assert.h,
blowing up dout(). F...
Dan Mick
06:42 PM Bug #3956 (Resolved): ceph auth add/del entity name parameter check
commit:25e9a0be63fdad9fd8f7909585c9270a3729dc44 Sage Weil
06:00 PM Bug #3956 (Resolved): ceph auth add/del entity name parameter check
It's currently (as of v0.56.1) possible to run "ceph auth add" without any further parameters. This results in the ad... Alex Moore
05:28 PM Revision 907c709c (ceph): mds: Send created ino in journaled_reply
The MDS avoids sending an early reply if a request
triggered inode allocation (no preallocated inodes yet).
For creat...
Sam Lang
05:27 PM Bug #3955 (Resolved): Configure should explicity check for c++ compiler.
If no c++ compiler is installed, configure fails with a misleading message when checking for boost libraries. Anonymous
05:11 PM Bug #3747 (Closed): PGs stuck in active+remapped
I think this was probably related to the lagging pg peering workqueue.. is there anything to suggest that isn't the c... Sage Weil
05:09 PM Bug #3948 (Need More Info): problems from leveldb static linkage and leveldb downgrade
Corin-
Just restart the osd. And check dmesg for any kernel malfeasance... that is usually what triggers this. A...
Sage Weil
04:51 PM Bug #3900 (Resolved): init-ceph should do ulimit -n's with do_root_cmd
commit:84a024b647c0ac2ee5a91bacdd4b8c966e44175c in next, cherry-pick -x'ed to bobtail
Dan Mick
03:21 PM Bug #3900 (Fix Under Review): init-ceph should do ulimit -n's with do_root_cmd
Dan Mick
04:37 PM Subtask #3840 (Resolved): osd: ack push after apply+commit
as part of #3833 Sage Weil
04:36 PM Feature #3732 (Resolved): osd/mon: report recovery rate (bytes and objects per sec)
commit:c2e50e580d18107162d2d101c5c243c665e56124 Sage Weil
04:33 PM CephFS Feature #3953 (Resolved): kclient: get/set layout via virtual xattrs
Sage Weil
04:32 PM CephFS Feature #1236 (Resolved): libceph: set layout via virtual xattrs (libceph/cfuse)
commit:1564c3a0a3efbde5a326001586238fde8f6648ad for userspace bits.
the kernel bits still need review.. opening se...
Sage Weil
04:18 PM Revision cf7c3f7d (ceph): client: Don't use geteuid/gid for fuse ll_create
Fixes a bug in ll_create where files that already exist at the MDS
don't get the created flag set on reply. This cau...
Sam Lang
03:11 PM rbd Bug #3952 (Resolved): krbd: no need for object header version
The header object watch operation had a sort of half implemented
use of the version of the object. It apparently is...
Alex Elder
03:08 PM rbd Bug #3946 (Resolved): rbd fsx failing in nightly
Just an extra delete in a code path in flush_set that wasn't exercised before. Fixed by commit:3bc21143552b35698c9916... Josh Durgin
02:44 PM rbd Bug #3946: rbd fsx failing in nightly
Reproducing locally seems to confirm this, since there was a recent change to replace commit_set() with flush_set():
...
Josh Durgin
12:06 PM rbd Bug #3946: rbd fsx failing in nightly
I'm guessing these are related to recent objectcacher changes, since they didn't affect runs without caching. The cor... Josh Durgin
02:48 PM rbd Feature #3949 (Resolved): krbd: create test script that exercises concurrent operations
I just committed the test script to the ceph master branch.
The script is located here: qa/workunits/rbd/concurrent...
Alex Elder
09:16 AM rbd Feature #3949: krbd: create test script that exercises concurrent operations
Well the script is really nice. And I just got a new
crash while running it on a real machine (rather than
my UML ...
Alex Elder
08:22 AM rbd Feature #3949 (Resolved): krbd: create test script that exercises concurrent operations
I suggested doing this in http://tracker.ceph.com/issues/3427.
That issue is about a bug where an image unmapping ca...
Alex Elder
01:50 PM rgw Bug #3941 (Resolved): s3tests crash on bobtail
Crash fixed, commit:f41010c44b3a4489525d25cd35084a168dc5f537.
Also, pushed a change to s3-tests.git, setting a requi...
Yehuda Sadeh
01:27 PM Bug #3268: osd: localize reads handling is incorrect
Yes, the OSDs will serve replica reads as things stand. Greg Farnum
01:11 PM Bug #3268: osd: localize reads handling is incorrect
I'm starting on this bug now. Before fixing the flag handling described in the ticket, I want to make sure that the O... Noah Watkins
12:43 PM Bug #3810: btrfs corrupts file size on 3.7
I'm making an attempt. Mike Lowe
12:36 PM Bug #3810: btrfs corrupts file size on 3.7
Mike, Bill: are you able to test Josef's patch? Sage Weil
11:45 AM Revision e805b7d6 (ceph): admin_socket: don't bother remote executing if there is no test
Sage Weil
11:30 AM CephFS Bug #3951: ceph-fuse: permissions error on create
I've got a question in for Sam, but other than that this looks good to me! Greg Farnum
09:37 AM CephFS Bug #3951 (Resolved): ceph-fuse: permissions error on create
Reported by Greg Farnum:
gregf@kai:~/ceph/src [master]$ cd mnt/
gregf@kai:~/ceph/src/mnt$ sudo chown gregf.gregf ...
Sam Lang
11:12 AM Revision c9201d0e (ceph): ReplicatedPG: correctly handle new snap collections on replica
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(cherry...
Samuel Just
11:10 AM rbd Bug #3950: krbd: new assertion failure running concurrent rbd test
OK, I do have the osd request pointer now. It was available
in register R14. And with a little work I can determin...
Alex Elder
10:35 AM rbd Bug #3950: krbd: new assertion failure running concurrent rbd test
The object being operated on is the rbd header image, in
this case named "image.5X5ZNB.rbd". The object request typ...
Alex Elder
10:06 AM rbd Bug #3950: krbd: new assertion failure running concurrent rbd test
Weird. It looks to me like the object request that's
just completing is already done, meaning we got
a callback fr...
Alex Elder
09:19 AM rbd Bug #3950 (Can't reproduce): krbd: new assertion failure running concurrent rbd test
(I think this is a new issue, I haven't investigated it yet.)
I hit an assertion failure while running my new test...
Alex Elder
10:34 AM rbd Bug #3937: krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
I've opened a new issue that has symptoms similar to this
but not identical:
http://tracker.ceph.com/issues/395...
Alex Elder
09:41 AM Bug #3768 (In Progress): perl is required for logrotate, we need to include Perl as a dependency
Putting back to in-progress. The preferred solution is to replace the perl filter line with sed or python and remove... Anonymous
09:38 AM Bug #3930 (Resolved): ceph.spec: udev rule for rbd not in rpms
Branch: refs/heads/master
Home: https://github.com/ceph/ceph
Commit: 0b66994c180b1ce5856a38518423d82fbebc8a2e
...
Anonymous
09:15 AM rbd Bug #3427: krbd: unmap does not remove block device properly
I have opened this to cover developing that test script
http://tracker.ceph.com/issues/3949
Alex Elder
07:53 AM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
...yes, yes it is. I've been working in FUSE so far. *sigh* Well, it needed the fix too. Greg Farnum
07:26 AM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
I don't see wip-2753-fsync-errors in the repo. Also, note that this problem was reported on the cephfs kernel client... Sam Lang
06:49 AM Revision 0b66994c (ceph): ceph.spec.in: package rbd udev rule
Package udev/50-rbd.rules per bug 3930.
Signed-off-by: Gary Lowell <gary.lowell@inktank.com>
Gary Lowell
04:22 AM Revision 1c311949 (ceph): osd_recovery: inject a recovery delay
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:22 AM Revision e33b425d (ceph): osd_recovery: use --no-cleanup for rados bench
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
03:53 AM Revision 3b27c9ec (ceph): osd_backfill: --no-cleanup for rados bench
Sage Weil
03:46 AM Revision a7d15afb (ceph): mon: smooth pg stat rates over last N pgmaps
This smooths the recovery and throughput stats over the last N pgmaps,
defaulting to 2.
Signed-off-by: Sage Weil <sa...
Sage Weil
03:17 AM Revision 0f7a9e56 (ceph): Merge remote-tracking branch 'yan/wip-mds'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
03:03 AM Revision ecda1208 (ceph): doc: fix overly-big fixed-width text in Firefox
Changed font size for ... Ross Turk
03:01 AM Revision d5008602 (ceph): btrfs.yaml: increase osd op thread timeout
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
02:50 AM Revision 4aea19ee (ceph): osd_types: add recovery counts to object_sum_stats_t
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:50 AM Revision a2495f65 (ceph): osd: track recovery ops in stats
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:50 AM Revision 76e9fe5f (ceph): mon/PGMap: include timestamp
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:50 AM Revision 208b02a7 (ceph): mon/PGMap: report recovery rates
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:50 AM Revision 3f6837e0 (ceph): mon/PGMap: report IO rates
This does not appear to be very accurate; probably the stat values we're
displaying are not being calculated correctl...
Sage Weil
02:49 AM Revision 193dbedb (ceph): rbd-fuse: fix warning
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:44 AM Revision 1e24ce22 (ceph): doc: Removed indep, and clarified explanation.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
02:17 AM Revision 0e9c8124 (ceph): mds: add projected rename's subtree bounds to ESubtreeMap
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com> Yan, Zheng
02:17 AM Revision e69e7e5d (ceph): mds: fix 'discover' handling in the rejoin stage
If the MDS is the resolve stage, current MDCache::handle_discover() only handles
'discover' from MDS that it has alre...
Yan, Zheng
02:17 AM Revision abc4c785 (ceph): mds: allow handling slave request in the clientreplay stage
replaying a client request may need to create slave request and the slave
MDS can be also in the clientreplay stage.
...
Yan, Zheng
02:17 AM Revision 58841776 (ceph): mds: mark export bounds for cross authority directory rename
this guarantees that the importing MDS gets directory fragment's
up-to-date fragstat/rstat.
Signed-off-by: Yan, Zhen...
Yan, Zheng
02:17 AM Revision 829aeba6 (ceph): mds: clear inode dirty when slave rename finishes.
The inode is linked to a non-auth directory, so remove it from LogSegment's
dirty inode list.
Signed-off-by: Yan, Zh...
Yan, Zheng
02:17 AM Revision c93cf2d2 (ceph): mds: fix for MDCache::disambiguate_imports
In the resolve stage, if no MDS claims other MDS's disambiguous subtree
import, the subtree's dir_auth is undefined.
...
Yan, Zheng
02:17 AM Revision 0cf5e4e5 (ceph): mds: journal inode's projected parent when doing link rollback
Otherwise the journal entry will revert the effect of any on-going
rename operation for the inode.
Signed-off-by: Ya...
Yan, Zheng
02:17 AM Revision 9a0cfcc5 (ceph): mds: don't journal opened non-auth inode
If we journal opened non-auth inode, during journal replay, the corresponding
entry will add non-auth objects to the ...
Yan, Zheng
02:17 AM Revision 4fc68a48 (ceph): mds: properly clear CDir::STATE_COMPLETE when replaying EImportStart
when replaying EImportStart, we should set/clear directory's COMPLETE
flag according with the flag in the journal ent...
Yan, Zheng
02:17 AM Revision 710bba3a (ceph): mds: move variables special to rename into MDRequest::more
My previous patches add two pointers (ambiguous_auth_inode and
auth_pin_freeze) to class Mutation. They are both used...
Yan, Zheng
02:17 AM Revision f4abf00a (ceph): mds: rejoin remote wrlocks and frozen auth pin
Includes remote wrlocks and frozen authpin in cache rejoin strong message
Signed-off-by: Yan, Zheng <zheng.z.yan@int...
Yan, Zheng
02:17 AM Revision 77946dcd (ceph): mds: fetch missing inodes from disk
The problem of fetching missing inodes from replicas is that replicated inodes
does not have up-to-date rstat and fra...
Yan, Zheng
02:17 AM Revision 9944d9fb (ceph): mds: don't journal non-auth rename source directory
After replaying a slave rename, non-auth directory that we rename out of will
be trimmed. So there is no need to jour...
Yan, Zheng
02:17 AM Revision 1a6626f0 (ceph): mds: preserve non-auth/unlinked objects until slave commit
The MDS should not trim objects in non-auth subtree immediately after
replaying a slave rename. Because the slave ren...
Yan, Zheng
02:17 AM Revision 844cd46c (ceph): mds: fix slave rename rollback
The main issue of old slave rename rollback code is that it assumes
all affected objects are in the cache. The assump...
Yan, Zheng
02:17 AM Revision a42a9187 (ceph): mds: split reslove into two sub-stages
The resolve stage serves to disambiguate the fate of uncommitted slave
updates and resolve subtrees authority. The MD...
Yan, Zheng
02:17 AM Revision 3a66656b (ceph): mds: send resolve messages after all MDS reach resolve stage
Current code sends resolve messages when resolving MDS set changes.
There is no need to send resolve messages when so...
Yan, Zheng
02:17 AM Revision 85294a59 (ceph): mds: always use {push,pop}_projected_linkage to change linkage
Current code skips using {push,pop}_projected_linkage to modify replica
dentry's linkage. This confuses EMetaBlob::ad...
Yan, Zheng
02:17 AM Revision e0aa64d0 (ceph): mds: don't replace existing slave request
The MDS may receive a client request, but find there is an existing
slave request. It means other MDS is handling the...
Yan, Zheng
02:17 AM Revision baa6bd6b (ceph): mds: fix for MDCache::adjust_bounded_subtree_auth
After swallowing extra subtrees, subtree bounds may change, so it
should re-check.
Signed-off-by: Yan, Zheng <zheng....
Yan, Zheng
02:17 AM Revision c9ff21a9 (ceph): mds: fix "had dentry linked to wrong inode" warning
The reason of "had dentry linked to wrong inode" warning is that
Server::_rename_prepare() adds the destdir to the EM...
Yan, Zheng
02:17 AM Revision ce431eb5 (ceph): mds: splits rename force journal check into separate function
the function will be used by later patch that fixes rename rollback
Signed-off-by: Yan, Zheng <zheng.z.yan@intel.com>
Yan, Zheng
02:17 AM Revision fb497135 (ceph): mds: force journal straydn for rename if necessary
rename may overwrite an empty directory inode and move it into stray
directory. MDS who has auth subtree beneath the ...
Yan, Zheng
02:17 AM Revision cd8d9107 (ceph): mds: don't set xlocks on dentries done when early reply rename
_rename_finish() does not send dentry link/unlink message to replicas.
We should prevent dentries that are modified b...
Yan, Zheng
02:15 AM Revision 87d85fa2 (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
01:51 AM Revision e58fe519 (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
01:50 AM Revision b429a3a3 (ceph): doc: Updated to add indep and first n to chooseleaf. Num only used with...
fixes: #3711
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
01:31 AM Revision f41010c4 (ceph): rgw: fix crash when missing content-type in POST object
Fixes: #3941
This fixes a crash when handling S3 POST request and content type
is not provided.
Signed-off-by: Yehud...
Yehuda Sadeh
01:22 AM Revision 26988038 (ceph): Merge branch 'wip-osd-down-out'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
01:14 AM Revision 09522e5a (ceph): rgw: fix crash when missing content-type in POST object
Fixes: #3941
This fixes a crash when handling S3 POST request and content type
is not provided.
Signed-off-by: Yehud...
Yehuda Sadeh
01:13 AM Revision 75f6ba56 (ceph): crush: implement get_children(), get_immediate_parent_id()
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:13 AM Revision 2b8ba7ca (ceph): osdmap: implement subtree_is_down() and containing_subtree_is_down()
Implement two methos to see if an entire subtree is down, and if the
containing parent node of type T of a given node...
Sage Weil
01:13 AM Revision b955a599 (ceph): mon: set limit so that we do not an entire down subtree out
Add new configurable 'mon osd down out subtree limit' so that you can
prevent marking out an entire subtree. If for ...
Sage Weil
01:12 AM Revision 2efdfb41 (ceph): mon: Elector: reset the acked leader when the election finishes and we ...
Failure to do so will mean that we will always ack the same leader during
an election started by another monitor. Th...
Joao Eduardo Luis
01:12 AM Revision 428ddb7d (ceph): Merge remote-tracking branch 'gh/wip-timecheck
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
12:58 AM Revision 81ed1bc7 (ceph): rados: add pool_ops workunit to cephtool test
Josh Durgin
12:53 AM Revision c79f7c6c (ceph): Merge branch 'wip-pool-delete'
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
12:52 AM Revision 97b78924 (ceph): doc: update ceph man page link
It's not the wiki anymore, and the man page needed to be regenerated.
Signed-off-by: Josh Durgin <josh.durgin@inktan...
Josh Durgin
12:52 AM Revision 91a0bc89 (ceph): ceph, rados: update pool delete docs and usage
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin

01/28/2013

11:25 PM Revision 1a6197a7 (ceph): qa: fix mon pool_ops workunit
Use ! for clarity when commands are supposed to fail.
Check a few other cases that should fail, and correct deleting
...
Josh Durgin
10:54 PM Revision 826e5860 (ceph): cram: fix for runs with coverage enabled
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
10:50 PM Bug #3948 (Resolved): problems from leveldb static linkage and leveldb downgrade
Two days ago I upgraded one of my osds to 0.48.3 (see http://tracker.ceph.com/issues/3797) and everything worked fine... Corin Langosch
09:56 PM Revision 014fc6d6 (ceph): utime: fix narrowing conversion compiler warning in sleep()
Fix compiler warning:
./include/utime.h: In member function 'void utime_t::sleep()':
./include/utime.h:139:50: warnin...
Danny Al-Gaaf
09:56 PM Revision fb85c7f6 (ceph): rbd: don't ignore return value of system()
Check for the return value of system() and handle the error if needed
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bi...
Danny Al-Gaaf
09:56 PM Revision f74265b0 (ceph): configure: fix check for fuse_getgroups()
Check for fuse_getgroups() only in case we have found libfuse already.
Moved the check to the check for --with-fuse.
...
Danny Al-Gaaf
09:56 PM Revision 21673e8b (ceph): rbd-fuse: fix usage of conn->want
Fix usage of conn->want and FUSE_CAP_BIG_WRITES. Both need libfuse
version >= 2.8. Encapsulate the related code line ...
Danny Al-Gaaf
09:56 PM Revision 818e9a2c (ceph): rbd-fuse: fix printf format for off_t and size_t
Fix printf format for off_t and size_t to print the same on 32 and 64bit
systems. Use PRI* macros from inttypes.h.
S...
Danny Al-Gaaf
09:51 PM Bug #3930 (In Progress): ceph.spec: udev rule for rbd not in rpms
Anonymous
09:50 PM Bug #3945 (In Progress): osd: dynamically link to leveldb
Anonymous
04:56 PM Bug #3945 (Resolved): osd: dynamically link to leveldb
We hit a problem with quantal that underscored the danger of linking statically to libleveldb. After some discussion... Sage Weil
09:21 PM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
wip-2753-fsync-errors has a patch which makes fsync return an error if the client gets back an error from the Objecte... Greg Farnum
05:32 AM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
Looked at this briefly; I see that the way we do fsyncs is attached to a "FIXME: this could starve" comment, and I be... Greg Farnum
09:18 PM rbd Bug #3947 (Resolved): krbd: read zeroing freed bio?
This happened to me once before but I wasn't sure what
I did. Now I think I do know. This is with the new
request...
Alex Elder
08:52 PM Revision 4edef483 (ceph): Merge branch 'wip-java-api'
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Reviewed-by: Joe Buck <jbbuck@gmail.com>
Reviewed-by: Sage Weil <...
Noah Watkins
08:45 PM Feature #3833: osd: improve recovery throttling
commit:d6db239ce5134a9c410554fb292c54981375c628 Sage Weil
08:20 PM Feature #3833: osd: improve recovery throttling
Commit? Ian Colle
07:32 PM Feature #3833 (Resolved): osd: improve recovery throttling
Sage Weil
07:27 PM Revision 0ded0fdf (ceph): mon: Monitor: rework timecheck code to clarify logic boundaries
The initial timecheck implementation relied on a cleanup function to
clean the state each time we changed epochs (or ...
Joao Eduardo Luis
06:13 PM Revision 3a089420 (ceph): doc: fix rbd create syntax
--dest-pool does not apply to create. Also remove extraneous
whitespace.
Signed-off-by: Josh Durgin <josh.durgin@ink...
Josh Durgin
06:08 PM RADOS Documentation #3830: crush-map.rst: chooseleaf doesn't include 'firstn|indep', and 'aggregates' i...
Can we get something moving on this bug, or give it to John to research? (and btw, firstn|indep has
been addressed u...
Dan Mick
05:20 PM Bug #3906 (Won't Fix): ceph-mon leaks memory during peering
This isn't something that's worth dealing with on the monitor side right now. Sage Weil
05:19 PM Bug #3797 (Duplicate): osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48....
see #3376 Sage Weil
04:43 PM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
The conclusion:
- quantal had a newer libleveldb than we built statically into our debs
- downgrading made compac...
Sage Weil
05:02 PM rbd Bug #3946 (Resolved): rbd fsx failing in nightly
... Sage Weil
04:49 PM Bug #3905 (Can't reproduce): incomplete & stale (lost?) PGs
This appears to be something that was triggered and exacerbated by now-fixed issues. Until we can trigger it, I'm in... Sage Weil
08:04 AM Bug #3905: incomplete & stale (lost?) PGs
Due to some other issues and after a chat with Sage, I restarted all of my osds and this disappeared since. So I'm af... Faidon Liambotis
04:28 PM Bug #3944 (Resolved): ceph tool should prevent --admin-socket
Misremembering, I tried several 'ceph --admin-socket' commands rather than 'ceph --admin-daemon'; the result was that... Dan Mick
04:12 PM Bug #3810: btrfs corrupts file size on 3.7
sent a report to linux-btrfs Sage Weil
03:41 PM Bug #3810: btrfs corrupts file size on 3.7
Ok, this looks like a btrfs bug to me. On osd.3, the write extends the file size to 4194304, but the later stat sees... Sage Weil
10:59 AM Bug #3810: btrfs corrupts file size on 3.7
I ran part of the workload and found an inconsistent pg. I've uploaded ceph.log and logs from the primary and second... Mike Lowe
03:50 PM Documentation #3711: crush-map.rst: choose firstn talks about "N", but does not clearly define wh...
Things I mentioned in comment 4 are still present; I'd like to either change them or update here why we're not. Dan Mick
02:11 PM rbd Bug #3427 (Fix Under Review): krbd: unmap does not remove block device properly
I have posted two patches for review, the second of which
should fix this problem. I have not actually reproduced
...
Alex Elder
12:50 PM devops Feature #3479: ceph-deploy: uninstall
commit:93082e82df56b01c524d0195e20068f6a6c8ca26 Sage Weil
12:49 PM devops Feature #3910: ceph-deploy: uninstall purge
ceph-deploy commit:93082e82df56b01c524d0195e20068f6a6c8ca26
Sage Weil
12:48 PM devops Feature #3341: ceph-disk-activate: Make --mount the default
I made it autodetect whether to mount or not based on whether you pass a directory or block device in. Simpler all a... Sage Weil
12:48 PM rgw Bug #3365: Broken metadata (duplicated as CSV)
Aaron Schulz wrote:
> Ian Colle wrote:
> > Aaron are you still seeing this?
>
> Sorry I need to get the time to ...
Sage Weil
10:24 AM Feature #3890 (In Progress): osd: create tool to extract pg info and pg log from filestore
Ian Colle
10:10 AM rgw Cleanup #3777 (In Progress): rgw: audit code for reading NULL env variables
reopening, see #3941. Yehuda Sadeh
10:09 AM rgw Bug #3941: s3tests crash on bobtail
Yeah, similar to that other issue (#3777)... Yehuda Sadeh
09:21 AM CephFS Feature #3540 (In Progress): mds: maintain per-file backpointers on first file object
Sam Lang
02:18 AM Revision 6bd676ea (ceph): mds: fix end check in Server::handle_client_readdir()
commit 1174dd3188 (don't retry readdir request after issuing caps)
introduced an bug that wrongly marks 'end' in the ...
Yan, Zheng
02:18 AM Revision 5176cb71 (ceph): mds: check deleted directory in Server::rdlock_path_xlock_dentry
Commit b03eab22e4 (mds: forbid creating file in deleted directory)
is not complete, mknod, mkdir and symlink are miss...
Yan, Zheng
02:18 AM Revision 919df3bf (ceph): mds: lock remote inode's primary dentry during rename
commit 1203cd2110 (mds: allow open_remote_ino() to open xlocked dentry)
makes Server::handle_client_rename() xlocks r...
Yan, Zheng
02:18 AM Revision 67144973 (ceph): mds: allow journaling multiple root inodes in EMetaBlob
In some cases (rename, rmdir, subtree map), we may need journal multiple
root inodes (/, mdsdir) in one EMetaBlob. Th...
Yan, Zheng
02:18 AM Revision 6daec530 (ceph): mds: introduce XSYN to SYNC lock state transition
If lock is in XSYN state, Locker::simple_sync() firstly try changing
lock state to EXCL. If it fail to change lock st...
Yan, Zheng
02:18 AM Revision 659d1a39 (ceph): mds: properly set error_dentry for discover reply
If MDCache::handle_discover() receives an 'discover path' request but
can not find the base inode. It should properly...
Yan, Zheng

01/27/2013

06:12 PM Revision c5478161 (ceph): mon: Elector: reset the acked leader when the election finishes and we ...
Failure to do so will mean that we will always ack the same leader during
an election started by another monitor. Th...
Joao Eduardo Luis
03:59 PM Bug #3810: btrfs corrupts file size on 3.7
I can do that, it will take somewhere between 12 and 24 hours to run. Mike Lowe
03:34 PM Bug #3810: btrfs corrupts file size on 3.7
Mike, would it be possible to reproduce this with debug file store = 20? That will tell us if what Ceph thinks it di... Sage Weil
02:10 PM Bug #3810: btrfs corrupts file size on 3.7
I deleted the rbd's with inconsistent pg's, recreated the rbd's, ran rsync with the same data set, made sure no btrfs... Mike Lowe
02:15 PM Revision d74b31b2 (ceph): mon: Monitor: force timecheck cleanup on finish_election()
Fixes: #3854
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
12:58 PM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Ok, everything still looks good :). Last question: should I upgrade my whole cluster to this version or will a new ar... Corin Langosch
12:01 PM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Ok, after around 10 minutes of runtime everything seems normal. Thanks for the fast and great help! :-)
ceph versi...
Corin Langosch
12:00 PM Bug #3797 (Fix Under Review): osd takes 100% cpu after upgrading from 0.48.2argonaut to the lates...
Sage Weil
11:56 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
that fixed it it seems. we could
- update argonaut and bobtail to newer leveldb :/
- link dynamically for quant...
Sage Weil
11:08 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
looks like levedb spinning on background compaction.
his .2 package is quantals, which is leveldb 1.5.. newer than...
Sage Weil
10:56 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Output of gdb /usr/bin/ceph-osd $pid, then 'thread apply all bt' Corin Langosch
10:25 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Hi Sage, here we go. Is it enough data or do you need more? I didn't disable the logging yet... Corin Langosch
10:00 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Hi Corin-
Can you enable 'debug osd = 20' for a bit and attach that log? I think this is related to commit:830b8f...
Sage Weil
08:31 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Just another small update - nothing changed so far. The cluster is still healthy, but the osd is still using 100% of ... Corin Langosch
05:50 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Just a small update - nothing changed so far. The cluster is still healthy, but the osd is still using 100% of one co... Corin Langosch
05:06 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Here's a nice graph to see the difference before/ after upgrade of disk activity....
The cluster is clean, no reco...
Corin Langosch
05:03 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Hi Sage,
sorry for the delay. I just shutdown the osd, upgraded it and started it again. It's again using almost 1...
Corin Langosch
10:29 AM rgw Bug #3941 (Resolved): s3tests crash on bobtail
... Sage Weil
09:28 AM Revision f666c617 (ceph): Revert "librbd: ensure header is up to date after initial read"
Using assert version for linger ops doesn't work with retries,
since the version will change after the first send.
Th...
Josh Durgin
09:28 AM Revision 10053b14 (ceph): librbd: establish watch before reading header
This eliminates a window in which a race could occur when we have an
image open but no watch established. The previou...
Josh Durgin
09:28 AM Revision 76f93751 (ceph): rbd: Don't call ProgressContext's finish() if there's an error.
do_copy was different from the others; call pc.fail() on error and
do not call pc.finish().
Fixes: #3729
Signed-off-...
Dan Mick
09:28 AM Revision a16c6f3d (ceph): rbd: fix bench-write infinite loop
I/O was continously submitted as long as there were few enough ops in
flight. If the number of 'threads' was high, or...
Josh Durgin
08:58 AM Revision 575a5866 (ceph): os/FileStore: only adjust up op queue for btrfs
We only need to adjust up the op queue limits during commit for btrfs,
because the snapshot initiation (async create)...
Sage Weil
08:47 AM Revision c9eb1b0a (ceph): common/HeartbeatMap: fix uninitialized variable
Introduced by me in 132045ce085e8584a3e177af552ee7a5205b13d8. Thank you,
valgrind!
Signed-off-by: Sage Weil <sage@i...
Sage Weil
06:35 AM Revision fa421cf5 (ceph): configure: remove -m4_include(m4/acx_pthread.m4)
Since we use already AC_CONFIG_MACRO_DIR, no need to include m4/acx_pthread.m4
extra.
Signed-off-by: Danny Al-Gaaf <...
Danny Al-Gaaf
06:34 AM Revision 32276e9a (ceph): configure: fix RPM_RELEASE
Use git to get RPM_RELEASE only if this is a git repo
clone and if the git command is available on the system.
Signe...
Danny Al-Gaaf
04:49 AM Revision 341e6760 (ceph): osdmaptool: fix clitests
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:33 AM Revision 54c392e0 (ceph): osd: dump/display pool min_size
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
01:24 AM devops Feature #3479 (Resolved): ceph-deploy: uninstall
Sage Weil
01:24 AM devops Feature #3910 (Resolved): ceph-deploy: uninstall purge
Sage Weil

01/26/2013

09:46 PM Revision 1ba4c80b (ceph): qa/workunits/rbd/copy.sh: use non-deprecated --image-format option
--format is deprecated.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:45 PM Revision bbb86ec7 (ceph): mon: safety interlock for pool deletion
Require that the pool name be passed twice along with an force option
before we irreversibly delete an entire pool of...
Sage Weil
09:26 PM Revision 700bcede (ceph): Revert "mon: implement safety interlock for deleting pools"
This reverts commit c993ac9b1fa4037f4cc2674455728ee38a7c978b.
This is too hard to test. Requiring the pool name twi...
Sage Weil
09:18 PM Revision 6c407943 (ceph): Added libexpat dependency
Cesar Mello
09:13 PM Revision b5f81636 (ceph): osdthrasher: inject pause on a live (on in) osd
Sage Weil
08:58 PM devops Feature #3917 (Fix Under Review): ceph-dir-prepare command
Sage Weil
08:58 PM devops Feature #3915 (Rejected): ceph-disk-prepare: support sysvinit or upstart
init system is a property of the host, not the disk.. doesn't belong in ceph-disk-prepare. Sage Weil
08:57 PM devops Feature #3911 (Fix Under Review): sysvinit: allow daemon enumeration via dirs
Sage Weil
08:57 PM devops Feature #3914 (Fix Under Review): ceph-disk-activate: support sysvinit
Sage Weil
08:54 PM devops Feature #3341 (Rejected): ceph-disk-activate: Make --mount the default
Sage Weil
08:53 PM devops Bug #3898 (Resolved): ceph-deploy: problems with >1 mon
ceph-deploy commit:8067dd0afa19ff7b7ca75f984dedc4213d3a4be8 Sage Weil
05:21 PM rgw Bug #3365: Broken metadata (duplicated as CSV)
Ian Colle wrote:
> Aaron are you still seeing this?
Sorry I need to get the time to try and reproduce this (and o...
Aaron Schulz
12:44 PM rbd Bug #3937 (Fix Under Review): krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
A patch resolving this has been posted for review.
[PATCH 4/4] rbd: don't drop watch requests on completion
Alex Elder
12:43 PM rbd Bug #3940 (Fix Under Review): krbd: decrement obj request count when deleting
A patch resolving this has been posted for review. Alex Elder
08:05 AM rbd Bug #3940 (Resolved): krbd: decrement obj request count when deleting
The obj_request_count value keeps track of how many object requests
are associated with an image request. It is inc...
Alex Elder
07:57 AM rbd Bug #3939 (Duplicate): krbd: circular locking report in sysfs code
I intended to write this up before but don't think I did.
I'm getting a "possible circular locking dependency detect...
Alex Elder
05:27 AM Revision 7daf3724 (ceph): rbd-fuse: Original code from Andreas Bluemle
Signed-off-by: Andreas Bluemle <andreas.bluemle@itxperts.de> Andreas Bluemle
05:27 AM Revision 2a6dcabf (ceph): rbd-fuse: add simple RBD FUSE client
Currently written in C on FUSE hi-level interfaces, so error reporting
could be better. No serious work done for per...
Dan Mick
05:25 AM Revision aec2a474 (ceph): s3/php: update to 1.5? version of API
Something like v1.5 of the Amazon PHP library requires the AmazonS3
constructor to be given an array of parameters ra...
Dan Mick
02:07 AM Revision b2a473be (ceph): workunit for iogen
Signed-off-by: tamil <tamil.muthamizhan@inktank.com> Tamilarasi muthamizhan
01:59 AM Revision b98da75a (ceph): Merge branch 'wip-osd-msgr'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
01:58 AM Revision 17cd549a (ceph): mon: Monitor: timecheck: only output report to dout once
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Joao Eduardo Luis
01:56 AM Revision 13fb1726 (ceph): mon: Monitor: track timecheck round state and report on health
Fixes: #3854
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Joao Eduardo Luis
01:56 AM Revision aa85d914 (ceph): task: mon_clock_skew_check: increase timeout and kick it off only on stop
We were kicking-off the timeout as soon as we started; it's better however
to kick if off only when we are told to st...
Joao Eduardo Luis
01:56 AM Revision 673101c7 (ceph): task: mon_clock_skew_check: distinguish between on-going and finished c...
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
01:24 AM Revision e6bceeed (ceph): sharedptr_registry: remove extaneous Mutex::Locker declaration
For some reason, the lookup() retry loop (for when happened to
race with a removal and grab an invalid WeakPtr) locke...
Samuel Just
01:24 AM Revision 60888caf (ceph): FileStore: ping TPHandle after each operation in _do_transactions
Each completed operation in the transaction proves thread
liveness, a stuck thread should still trigger the timeouts....
Samuel Just
01:24 AM Revision 6b8a673f (ceph): OSD: use TPHandle in peering_wq
Implement _process overload with TPHandle argument and use
that to ping the hb map between pgs and between map epochs...
Samuel Just
01:24 AM Revision aa6d20aa (ceph): WorkQueue: add TPHandle to allow _process to ping the hb map
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 4f653d23999b24fc8c65a5...
Samuel Just
01:23 AM Revision e66a7505 (ceph): ReplicatedPG: handle omap > max_recovery_chunk
span_of fails if len == 0.
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from c...
Samuel Just
01:23 AM Revision 44f0407a (ceph): ReplicatedPG: correctly handle omap key larger than max chunk
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit c3dec3e30a85ecad0090c7...
Samuel Just
01:23 AM Revision 50fd6ac9 (ceph): ReplicatedPG: start scanning omap at omap_recovered_to
Previously, we started scanning omap after omap_recovered_to.
This is a problem since the break in the loop implies t...
Samuel Just
01:23 AM Revision 4b32eecb (ceph): ReplicatedPG: don't finish_recovery_op until the transaction completes
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 62a4b96831c1726043699db86a664dc6a0af8637)
Samuel Just
01:23 AM Revision da34c77b (ceph): ReplicatedPG: ack push only after transaction has completed
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 20278c4f77b890d5b2b95d2ccbeb4fbe106667ac)
Samuel Just
01:23 AM Revision f9381c74 (ceph): ObjectStore: add queue_transactions with oncomplete
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 4d6ba06309b80fb21de7bb5d12d5482e71de5f16)
Samuel Just
01:22 AM Revision e2560554 (ceph): common/HeartbeatMap: inject unhealthy heartbeat for N seconds
This lets us test code that is triggered by an unhealthy heartbeat in a
generic way.
Signed-off-by: Sage Weil <sage@...
Sage Weil
01:22 AM Revision cbe8b5bc (ceph): os/FileStore: add stall injection into filestore op queue
Allow admin to artificially induce a stall in the op queue. Forces the
thread(s) to sleep for N seconds. We pause f...
Sage Weil
01:22 AM Revision beb6ca44 (ceph): osd: do not join cluster if not healthy
If our internal heartbeats are failing, do not send a boot message and try
to join the cluster.
Signed-off-by: Sage ...
Sage Weil
01:22 AM Revision 1ecdfca3 (ceph): osd: hold lock while calling start_boot on startup
This probably doesn't strictly matter because start_boot doesn't need the
lock (currently) and few other threads shou...
Sage Weil
01:22 AM Bug #3938 (Can't reproduce): ceph-mon crashed on mixed bobtail-argonaut cluster (2 argonaut mons,...
7:09:03.310220 7f652087e700 1 mon.a@1(peon).osd e72 e72: 20 osds: 20 up, 20 in ... Samuel Just
01:21 AM Revision e120bf20 (ceph): osd: do not reply to ping if internal heartbeat is not healthy
If we find that our internal threads are stalled, do not reply to ping
requests. If we do this long enough, peers wi...
Sage Weil
01:21 AM Revision 5f396e2b (ceph): osd: reduce op thread heartbeat default 30 -> 15 seconds
If the thread stalls for 15 seconds, let our internal heartbeat fail.
This will let us internally respond more quickl...
Sage Weil
01:17 AM Revision fca288b7 (ceph): osd: improve sub_op flag points
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 73a969366c8bbd105579611320c43e2334907fef)
Sage Weil
01:17 AM Revision f13ddc8a (ceph): osd: refactor ReplicatedPG::do_sub_op
PULL is the only case where we don't wait for active.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked fro...
Sage Weil
01:17 AM Revision d5e00f96 (ceph): osd: make last state for slow requests more informative
Report on the last event string, and pass in important context for the
op event list, including:
- which peers were...
Sage Weil
01:17 AM Revision ab3a110c (ceph): osd: dump op priority queue state via admin socket
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 24d0d7eb0165c8b8f923f2d8896b156bfb5e0e60)
Sage Weil
01:17 AM Revision 43a65d04 (ceph): osd: simplify asok to single callback
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 33efe32151e04beaafd9435d7f86dc2eb046214d)
Sage Weil
01:16 AM Revision d0407986 (ceph): common/PrioritizedQueue: dump state to Formatter
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 514af15e95604bd241d2a98a97b938889c6876db)
Sage Weil
01:16 AM Revision 691fd505 (ceph): common/PrioritizedQueue: add min cost, max tokens per bucket
Two problems.
First, we need to cap the tokens per bucket. Otherwise, a stream of
items at one priority over time w...
Sage Weil
01:16 AM Revision a2b03fe0 (ceph): common/PrioritizedQueue: buckets -> tokens
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit c549a0cf6fae78c8418a3b4b0702fd8a1e4ce482)
Sage Weil
01:16 AM Revision 612d75cd (ceph): note puller's max chunk in pull requests
this lets us calculate a cost value
(cherry picked from commit 128fcfcac7d3fb66ca2c799df521591a98b82e05)
Sage Weil
01:16 AM Revision 2224e413 (ceph): osd: add OpRequest flag point when commit is sent
With writeahead journaling in particular, we can get requests that
stay in the queue for a long time even after the c...
Sage Weil
01:16 AM Revision 5b5ca592 (ceph): osd: set PULL subop cost to size of requested data
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit a1bf8220e545f29b83d965f07b1abfbea06238b3)
Sage Weil
01:16 AM Revision 10651e4f (ceph): osd: use Message::get_cost() function for queueing
The data payload is a decent proxy for cost in most cases, but not all.
Signed-off-by: Sage Weil <sage@inktank.com>
...
Sage Weil
01:16 AM Revision 9735c6b1 (ceph): osd: debug msg prio, cost, latency
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit bec96a234c160bebd9fd295df5b431dc70a2cfb3)
Sage Weil
01:15 AM Revision c48279da (ceph): filestore: filestore_queue_max_ops 500 -> 50
Having a deep queue limits the effectiveness of the priority queues
above by adding additional latency.
Signed-off-b...
Sage Weil
01:15 AM Revision f47b2e8b (ceph): osd: target transaction size 300 -> 30
Small transactions make pg removal nicer to the op queue. It also slows
down PG deletion a bit, which may exacerbate...
Sage Weil
01:15 AM Revision 4947f0ef (ceph): os/FileStore: allow filestore_queue_max_{ops,bytes} to be adjusted at r...
The 'committing' ones too.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit cfe4b8519363f92f84...
Sage Weil
01:14 AM Revision ad6e6c91 (ceph): osd: make osd_max_backfills dynamically adjustable
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 101955a6b8bfdf91f4229f4ecb5d5b3da096e160)
Sage Weil
01:14 AM Revision 939b1855 (ceph): osd: make OSD a config observer
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 9230c863b3dc2bdda12c23202682a84c48f070a1)
Con...
Sage Weil
12:16 AM Revision b49440bc (ceph): doc: Added new, more comprehensive OSD/PG monitoring doc.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:15 AM Revision 5f210505 (ceph): doc: Trimmed some detail and added a x-ref to detailed osd/pg monitorin...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:14 AM Revision 95cfdd46 (ceph): doc: Added osd/pg monitoring section to the index.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
12:14 AM Revision d36a208c (ceph): doc: Added x-ref links.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins

01/25/2013

10:25 PM Revision 89386856 (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
10:24 PM Revision 1af3578e (ceph): doc: fixed description for pg in control section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:48 PM Revision 248835d4 (ceph): doc: wider sidebar, larger font, cleaned tip CSS
The sidebar is now about a hundred pixels wider and the fonts
are larger throughout. This works a lot better when yo...
Ross Turk
08:16 PM Linux kernel client Bug #3860: rbd: problems if watch setup returns ERANGE
Just to close this out...
The fix (not repeating no ERANGE) has been committed:
commit c04306471ad93f1daf60771a...
Alex Elder
06:27 AM Linux kernel client Bug #3860: rbd: problems if watch setup returns ERANGE
Josh rejected this. But since he said that the
change I proposed--to not do the loop--was OK
I suggest this bug sh...
Alex Elder
07:41 PM Revision 037900dc (ceph): sharedptr_registry: remove extaneous Mutex::Locker declaration
For some reason, the lookup() retry loop (for when happened to
race with a removal and grab an invalid WeakPtr) locke...
Samuel Just
06:54 PM Revision 8bd306b9 (ceph): doc: Added Subdomain section.
fixes: #3778
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
05:40 PM Revision 8fef6fa3 (ceph): osd/PG: include map epoch in query results
Currently you can only infer it from the info.history.* fields.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:38 PM Revision e359a862 (ceph): osd: kill unused addr-based send_map()
Not used, old API, bad.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:38 PM Revision 5e2fab54 (ceph): osd: share incoming maps via Connection*, not addrs
Kill a set of parallel methods that are using the old addr/inst-based
msgr APIs, and instead use Connection handles. ...
Sage Weil
05:38 PM Revision 1bc419a7 (ceph): osd: pass new maps to dead osds via existing Connection
Previously we were sending these maps to dead osds via their old addrs
using a new outgoing connection and setting th...
Sage Weil
05:38 PM Revision 76705ace (ceph): osd: requeue osdmaps on heartbeat connections for cluster connection
If we receive an OSDMap on the cluster connection, requeue it for the
cluster messenger, and process it there where w...
Sage Weil
05:38 PM Revision a7059eb3 (ceph): msgr: add get_loopback_connection() method
Return the Connection* for ourselves, so we can queue messages for
ourselves.
Signed-off-by: Sage Weil <sage@inktank...
Sage Weil
05:38 PM CephFS Bug #3935: kclient: Big directory access bugs (multiple), mixed 32- and 64-bit clients
I will be able to reproduce after the Feb,8. Willl do if nobody will reproduce before. Ivan Kudryavtsev
04:33 PM CephFS Bug #3935: kclient: Big directory access bugs (multiple), mixed 32- and 64-bit clients
please set 'debug mds = 10' and upload mds log. To minimize mds log size, please truncate the mds log before executin... Zheng Yan
09:54 AM CephFS Bug #3935: kclient: Big directory access bugs (multiple), mixed 32- and 64-bit clients
I made a mistake during initial post: amount of files in directory is 3.5K, not 35K. It's my netflow for last years, ... Ivan Kudryavtsev
12:11 AM CephFS Bug #3935: kclient: Big directory access bugs (multiple), mixed 32- and 64-bit clients
At #3936 I'm providing some benchmarks to show that IOPS/speed is OK for my installation and my hands are not perform... Ivan Kudryavtsev
04:25 PM Documentation #3222 (Resolved): DOC: Get an Object from a Primary OSD
Added a full exercise toward the end here: http://ceph.com/docs/master/rados/operations/monitoring-osd-pg/ John Wilkins
08:50 AM Documentation #3222 (In Progress): DOC: Get an Object from a Primary OSD
John Wilkins
04:24 PM Documentation #3333 (Resolved): doc: Explain "degraded" more
More extensive discussion here: http://ceph.com/docs/master/rados/operations/monitoring-osd-pg/ John Wilkins
04:24 PM Documentation #3331 (Resolved): doc: Where is my data placed?
Provided an entire exercise toward the end of this document: http://ceph.com/docs/master/rados/operations/monitoring-... John Wilkins
04:22 PM Documentation #3320 (Resolved): doc: What persistency does Ceph guarantee
Added more extensive discussions.
Here: http://ceph.com/docs/master/rados/operations/monitoring-osd-pg/ and
Her...
John Wilkins
03:25 PM rbd Bug #3937: krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
OK, with Josh's help I finally managed to reproduce the
problem intentionally to check my fix.
I'm building it no...
Alex Elder
11:11 AM rbd Bug #3937: krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
I have confirmed that every time a request registered to linger
is re-submitted the osd client will call the callbac...
Alex Elder
08:07 AM rbd Bug #3937: krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
I've decoded the osd request that's been provided to
rbd_osd_req_callback(). Its contents look completely
legitima...
Alex Elder
06:54 AM rbd Bug #3937: krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
Adding two things:
- this occurred during test 190 of the third consecutive pass
of xfstests with this in the teuth...
Alex Elder
05:04 AM rbd Bug #3937 (Resolved): krbd: crash in rbd_assert(osd_req == obj_request->osd_req)
Looking at a crash this morning in the new request code due
to this failed assertion in rbd_osd_req_callback():
...
Alex Elder
03:14 PM rgw Bug #3620: rgw:improve multiple user access keys scalability
Ian Colle
01:51 PM Subtask #3840: osd: ack push after apply+commit
Ian Colle
01:50 PM Feature #3833: osd: improve recovery throttling
Ian Colle
11:48 AM Bug #3836: osd: common/Mutex.cc: 94: FAILED assert(r == 0) in PG::start_flush()
pushed to master, still need to backport Samuel Just
11:40 AM Bug #3836 (Fix Under Review): osd: common/Mutex.cc: 94: FAILED assert(r == 0) in PG::start_flush()
D'oh. sharedptr_registry.hpp has an extaneous Mutex::Locker l(lock) declaration in the retry loop. It only actually... Samuel Just
11:41 AM Documentation #3711 (Resolved): crush-map.rst: choose firstn talks about "N", but does not clearl...
John Wilkins
11:40 AM Documentation #3390 (Resolved): doc: add detail on different bucket algorithms
John Wilkins
11:12 AM rgw Feature #3669 (In Progress): rgw: support acl grants through http headers
Ian Colle
11:09 AM rgw Cleanup #3777 (Resolved): rgw: audit code for reading NULL env variables
Merged into master, commit: b3a2e7e955547a863d29566aab62bcc480e27a65 caleb miles
11:07 AM rgw Feature #3667 (In Progress): rgw: support extra canned acl params
Ian Colle
10:55 AM Bug #3928 (Resolved): osd: peering workqueue tryings to advance through *all* past osdmaps in one...
The timeout should be fixed by e0511f4f4773766d04e845af2d079f82f3177cb6. Samuel Just
10:55 AM rgw Bug #3778 (Resolved): document procedure for enabling subdomain S3 api calls
Added info for subdomain call. John Wilkins
10:33 AM rgw Bug #3778 (In Progress): document procedure for enabling subdomain S3 api calls
John Wilkins
09:54 AM rbd Bug #3936: rbd: Strange dd speed behaviour (server side issue?)
It's pretty likely that this is a server-side behavior rather than a client-side one. Keep that in mind when reproduc... Greg Farnum
12:00 AM rbd Bug #3936: rbd: Strange dd speed behaviour (server side issue?)
rados -p rbd bench 120 write -t 16
shows about 90-110 MB/sec.
Ivan Kudryavtsev
09:52 AM rbd Bug #3654 (Resolved): libvirt: colons in ipv6 monitor addresses are not escaped when sent to qemu
Upstream commit c1509ab47edf61e9f20d11922526b9fca518d238 Josh Durgin
09:34 AM rbd Bug #3927: krbd: I/O errors (ENXIO) during rbd/kernel.sh workunit
Yes, the ENXIO is expected. Assuming it's being propagated out to dd, and the test passes (outputs OK at the end of k... Josh Durgin
05:55 AM rbd Bug #3427: krbd: unmap does not remove block device properly
We had some discussion about the whether an atomic bit
operation for this was sufficient, or whether a memory
barri...
Alex Elder
05:48 AM Revision a6ed62e3 (ceph): common: fix cli tests on usage
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:06 AM Revision 38871e27 (ceph): os/FileStore: only adjust up op queue for btrfs
We only need to adjust up the op queue limits during commit for btrfs,
because the snapshot initiation (async create)...
Sage Weil
05:06 AM Revision 5f9ab930 (ceph): Revert "filestore: disable extra committing queue allowance"
This reverts commit 44dca5c8c5058acf9bc391303dc77893793ce0be.
The allowance is not only added for btrfs as of commit...
Sage Weil
05:00 AM Revision d95b4313 (ceph): adminops.rst: revert changes for as-yet-unimplemented features
See wip-admin-api for the new specification
Fixes: #3724
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
04:40 AM CephFS Bug #1878: ceph.ko doesn't setattr (lchown, utimes) on symlinks
Heh. Funny markup. The numbered list came out of #s used for comments.
Anyway, I've just verified that the issue...
Alexandre Oliva
04:34 AM CephFS Bug #1878: ceph.ko doesn't setattr (lchown, utimes) on symlinks
I've just verified that the problem is still present in 3.7.3, and I have a much simpler reproducer too.
mount -t ...
Alexandre Oliva
03:43 AM Revision bb860e49 (ceph): rados: remove unused "check_stdio" parameter
Signed-off-by: Dan Mick <dan.mick@inktank.com> DanTest MickTest
02:05 AM Bug #3810: btrfs corrupts file size on 3.7
Kernel was 3.7.1
Ran btrfsck on the partitions when the error first occurred with nothing found.
Tried your fix o...
Bill Kenworthy
01:54 AM Revision 234becd3 (ceph): rados: obey op_size for 'get'
Otherwise we try to read the whole object in one go, which doesn't bode
well for large objects (either non-optimal or...
Sage Weil
01:31 AM Revision 3a5c70b8 (ceph): ceph_manager: turn long stall injection off by default
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
01:25 AM Revision 4f653d23 (ceph): WorkQueue: add TPHandle to allow _process to ping the hb map
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
01:25 AM Revision e0511f4f (ceph): OSD: use TPHandle in peering_wq
Implement _process overload with TPHandle argument and use
that to ping the hb map between pgs and between map epochs...
Samuel Just
01:25 AM Revision 0c1cc687 (ceph): FileStore: ping TPHandle after each operation in _do_transactions
Each completed operation in the transaction proves thread
liveness, a stuck thread should still trigger the timeouts....
Samuel Just
12:24 AM Revision 006e7065 (ceph): osd_recovery: fix up incomplete test
- stop rados bench from cleaning up
- flush pg stats
- fix sleep call
One or more of these helped fix this test, don...
Sage Weil
12:23 AM Revision 20af01f2 (ceph): ceph_manager: fix get_num_active_recovered()
The states now have 'backfill' *or* 'recover' in them. Sage Weil
12:20 AM Revision 79d599cf (ceph): java: remove extra whitespace
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins

01/24/2013

11:59 PM rbd Bug #3936: rbd: Strange dd speed behaviour (server side issue?)
I also tried to do:
dd if=/dev/rbd/rbd/test of=/dev/null bs=4M - the same situation.
Ivan Kudryavtsev
11:57 PM rbd Bug #3936 (Rejected): rbd: Strange dd speed behaviour (server side issue?)
I have 3 node/15 osds (5 on each), every on separate drive installation (with SSD cache), journal in RAMFS. XFS as ba... Ivan Kudryavtsev
11:46 PM CephFS Bug #3935 (Can't reproduce): kclient: Big directory access bugs (multiple), mixed 32- and 64-bit ...
I have next directory structure in ceph fs:... Ivan Kudryavtsev
11:21 PM Revision b150e8e3 (ceph): workunit: pass java path as env variable
The libcephfs-java test needs this. Sage Weil
11:13 PM Revision 6f0e1137 (ceph): libcephfs-java test: use provided environment
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:41 PM Bug #3810: btrfs corrupts file size on 3.7
Bill Kenworthy wrote:
> Version was 55.1 when created and the error occurred, now updated to 56.1 (on gentoo) after ...
Sage Weil
09:36 PM Bug #3810: btrfs corrupts file size on 3.7
Version was 55.1 when created and the error occurred, now updated to 56.1 (on gentoo) after error
Its organised as 5...
Bill Kenworthy
08:55 PM Bug #3810: btrfs corrupts file size on 3.7
Bill Kenworthy wrote:
> I have been hit by the same thing ... is there any information you need before I try and fix...
Sage Weil
06:18 PM Bug #3810: btrfs corrupts file size on 3.7
I have been hit by the same thing ... is there any information you need before I try and fix it further.
Ive tried...
Bill Kenworthy
01:35 PM Bug #3810: btrfs corrupts file size on 3.7
How about this object instead:
2013-01-23 18:41:31.336722 osd.7 149.165.228.11:6800/28046 159 : [ERR] 2.202 osd.0: s...
Mike Lowe
01:16 PM Bug #3810: btrfs corrupts file size on 3.7
the going theory is that this is triggered by btrfs scrub. can we confirm this somehow? Sage Weil
11:03 AM Bug #3810: btrfs corrupts file size on 3.7
Samuel Just wrote:
> I need a dump of the xattrs on the d0c18e1d/605.00000000/head//1 object in pg 1.1d on osd 7 and...
Mike Lowe
10:17 AM Bug #3810: btrfs corrupts file size on 3.7
Additional info, btrfs scrubs were done while the osd's were active which may or may not have had a negative effect. ... Mike Lowe
09:31 PM Revision 40ae8cea (ceph): common: only show -d, -f options for daemons
Fixes: #3073
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
09:13 PM Revision 7e7130da (ceph): doc: Syntax fixes.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:58 PM Revision b51bfdf0 (ceph): doc: Updated usage for Bobtail.
fixes: #3831
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
08:57 PM Revision 1d71d052 (ceph): doc: Updated usage for Bobtail.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:55 PM rgw Bug #3724 (Resolved): docs refer to non-implemented features of the radosgw-admin rest api
commit d95b4313de1614fd85265879e6d7ddadd5268af2
Dan Mick
08:45 PM rgw Bug #3724: docs refer to non-implemented features of the radosgw-admin rest api
Since the docs are in wip-admin-api, this amounts to rolling doc/radosgw/admin/adminops.rst back to its state as of 0... Dan Mick
01:41 PM rgw Bug #3724: docs refer to non-implemented features of the radosgw-admin rest api
Sage Weil
01:38 PM rgw Bug #3724: docs refer to non-implemented features of the radosgw-admin rest api
John - any update? Ian Colle
08:54 PM Revision b0a5fe94 (ceph): java: support ceph_get_file_pool_name
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
08:50 PM Revision 6a859bcd (ceph): ceph_manager: use 80/70 as pause_long, pause_check_after defaults
OSD::op_tp suicides after 150.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:47 PM Revision 6b272e0f (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
08:46 PM Revision 42d92b73 (ceph): doc: Added example of ext4 user_xattr mount option.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:43 PM Bug #3885: osd: osd-recovery-incomplete qa test failing
(the above commit is in the teuthology code) Dan Mick
04:28 PM Bug #3885 (Resolved): osd: osd-recovery-incomplete qa test failing
fixed, mostly by commit:20af01f23ba932cb97cb40bba89bff546e10c461, which may fix up some of hte other spurious failure... Sage Weil
11:13 AM Bug #3885 (In Progress): osd: osd-recovery-incomplete qa test failing
Sage Weil
08:30 PM Revision b3a2e7e9 (ceph): rgw_rest: Make fallback uri configurable.
Some HTTP servers, notabily lighttp, do not set SCRIPT_URI, make the fallback
string configurable.
Signed-off-by: ca...
caleb miles
08:29 PM Revision b0f27a8f (ceph): librbd: Allow get_lock_info to fail
If the lock class isn't present, EOPNOTSUPP is returned for lock calls
on newer OSDs, but sadly EIO on older; we need...
Dan Mick
07:33 PM Revision 0c6d5a9d (ceph): java: support fchmod
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
07:33 PM Revision 9cefa969 (ceph): java: add missing chmod unmounted test
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
07:33 PM Revision 487bacdb (ceph): java: fix exception name typo
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
07:33 PM Revision 352652b6 (ceph): libcephfs: document ERANGE rv for get_file_pool_name
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
07:27 PM Revision 4b3bcb92 (ceph): java: support stat()
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
06:52 PM Revision 00cfe1d3 (ceph): common/HeartbeatMap: fix uninitialized variable
Introduced by me in 132045ce085e8584a3e177af552ee7a5205b13d8. Thank you,
valgrind!
Signed-off-by: Sage Weil <sage@i...
Sage Weil
06:41 PM Revision b9f58baa (ceph): libcephfs-java test: jar files are in /usr/local/share/java, it seems
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:35 PM Revision f9f31aae (ceph): wireshark: fix indention
Fix indention.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect.de>
Danny Al-Gaaf
06:35 PM Revision 3e9cc0d4 (ceph): wireshark: fix guint64 print format handling
Use G_GUINT64_FORMAT to handle print format of guint64 correctly.
Signed-off-by: Danny Al-Gaaf <danny.al-gaaf@bisect...
Danny Al-Gaaf
06:08 PM Revision 0f24dca2 (ceph): ceph_manager: use do_rados for rmpool
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
04:54 PM devops Bug #3934: ceph-deploy new should require at least one host name
If no hosts are specified on the command line, a ceph.conf file is created without any monitors listed. No errors or... Anonymous
04:51 PM devops Bug #3934 (Resolved): ceph-deploy new should require at least one host name
Anonymous
04:14 PM devops Bug #3933: ceph-deploy gatherkeys silently fails if no host is specified
If no host is specified and ceph.conf exists gatherkeys will fail, but not report any error. Anonymous
04:12 PM devops Bug #3933 (Resolved): ceph-deploy gatherkeys silently fails if no host is specified
Anonymous
02:52 PM Bug #3930 (Resolved): ceph.spec: udev rule for rbd not in rpms
The udev rule for kernel rbd (udev/50-rbd.rules in ceph.git) should be packaged. It's already in the debs: debian/lib... Josh Durgin
01:41 PM rgw Bug #3778: document procedure for enabling subdomain S3 api calls
Sage Weil
01:39 PM rgw Bug #3778: document procedure for enabling subdomain S3 api calls
Any update? Ian Colle
01:41 PM rgw Bug #3450: WRITE permission only doesn't allow proper multi-part upload
Sage Weil
01:33 PM rgw Bug #3450: WRITE permission only doesn't allow proper multi-part upload
Needs to be part of larger overall discussion about the intent of subusers. Ian Colle
01:41 PM rgw Bug #3706: rgw functional test testSlashInName failed in nightly
Sage Weil
01:38 PM rgw Bug #3706: rgw functional test testSlashInName failed in nightly
Need to see if happens again and then find reproducer. Ian Colle
01:41 PM rgw Feature #2804: rgw: disallow running multiple gateways on the same fastcgi socket
Sage Weil
01:41 PM rgw Feature #3074: radosgw needs --help support
Sage Weil
01:41 PM rgw Bug #2366: rgw: bucket index update rely on pg state
Sage Weil
01:41 PM rgw Bug #2650: rgw: swift key creation overrides subuser access mask
Sage Weil
01:41 PM rgw Bug #1777: rgw: user info modification is not atomic
Sage Weil
01:41 PM rgw Bug #1779: rgw: swift auth returns wrong error code when unexisting user is given
Sage Weil
01:14 PM rgw Bug #1779: rgw: swift auth returns wrong error code when unexisting user is given
Work in course with other swift changes, but not a driver. Ian Colle
01:40 PM rgw Feature #3366: rgw: dr: define management api
Caleb to get out updated document for review. Ian Colle
01:37 PM rgw Bug #3620: rgw:improve multiple user access keys scalability
Caleb to review. Ian Colle
01:36 PM rgw Bug #3682 (Resolved): valgrind errors seen when running rgw tests in nightlies
Increased time in tests and has not occurred. Ian Colle
01:35 PM rgw Bug #3628 (Resolved): rgw: leak of object parts on partial upload
Fixed in bobtail Ian Colle
01:34 PM rgw Bug #3485 (In Progress): rgw: unique user emails not enforced
Ian Colle
01:34 PM Bug #3906: ceph-mon leaks memory during peering
the logs indicate this may be related to failed auth connection attempts spamming the monitor. Sage Weil
11:43 AM Bug #3906: ceph-mon leaks memory during peering
we need to reproduce this on a large internal cluster, with many osds and even more pgs. Sage Weil
09:38 AM Bug #3906: ceph-mon leaks memory during peering
I believe this to be related to #3609 Joao Eduardo Luis
01:32 PM rgw Bug #3073: radosgw-admin: is not a daemon, should not have -d/-f options
commit:40ae8ceab58b4c05e01dc9f7809728a592cc4f0d actaully Sage Weil
01:30 PM rgw Bug #3073 (Resolved): radosgw-admin: is not a daemon, should not have -d/-f options
commit:b878b2c6e9ee41de25faf4dfdd7285dcb01b36e8 Sage Weil
01:26 PM rgw Bug #3073: radosgw-admin: is not a daemon, should not have -d/-f options
Change common init Ian Colle
01:30 PM rgw Bug #3365: Broken metadata (duplicated as CSV)
Aaron are you still seeing this? Ian Colle
01:29 PM rgw Bug #3365 (Need More Info): Broken metadata (duplicated as CSV)
Ian Colle
01:21 PM rgw Feature #2490: rgw-admin: only register watch when needed
Performance improvement. Ian Colle
01:21 PM CephFS Bug #1878: ceph.ko doesn't setattr (lchown, utimes) on symlinks
This is still present in 3.6.11 (I'll know about 3.7.* soon). I suspect this may have to do with failing to mark met... Alexandre Oliva
01:18 PM rgw Bug #2482 (Rejected): rgw: duplicate content-length results in 400
Apache issue. Ian Colle
01:14 PM rgw Bug #1906 (Can't reproduce): rgw: total_time isn't logged consistently
Ian Colle
01:13 PM Documentation #3831 (Resolved): ceph osd crush set command needs correction in the doc
John Wilkins
01:10 PM rgw Bug #1673: rgw: mod_fastcgi needs to be backward compatible
Ian Colle
01:10 PM rgw Bug #1673: rgw: mod_fastcgi needs to be backward compatible
Canonical can not take our changes up stream until we solve this issue. Ian Colle
11:16 AM rgw Cleanup #3929 (New): s3-tests: refactor all test_post_* tests
These tests mostly do the same thing, can be cleaned up, no need to duplicate the same code across all. Yehuda Sadeh
10:58 AM CephFS Feature #3821 (In Progress): qa: run backuppc as part of qa suite
Ekapol Rojpiboonphun wrote:
> Just to make sure that I will be on this along the line of what you might already have...
Sage Weil
10:52 AM CephFS Feature #3821: qa: run backuppc as part of qa suite
Just to make sure that I will be on this along the line of what you might already have in mind. (More details please ... Anonymous
09:56 AM CephFS Feature #3821: qa: run backuppc as part of qa suite
Download/install backuppc and get it into suite. Ian Colle
10:32 AM Bug #3928 (In Progress): osd: peering workqueue tryings to advance through *all* past osdmaps in ...
Samuel Just
10:02 AM Bug #3928 (Resolved): osd: peering workqueue tryings to advance through *all* past osdmaps in one...
Sage Weil
10:10 AM Bug #3905: incomplete & stale (lost?) PGs
Sounds like a combination of crush map and rules that aren't behaving well together — "incomplete" means the PG doesn... Greg Farnum
09:42 AM Bug #3801: Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAILED assert(0 == "...
The olog stuff is fixed in bobtail, and won't be backported to argonaut.
I'm not sure what the root cause of hte h...
Sage Weil
08:42 AM Bug #3854: mon: clock skew tests failing on master
Happened again on QA, reopening while testing a new patch. Joao Eduardo Luis
08:15 AM rbd Bug #3927: krbd: I/O errors (ENXIO) during rbd/kernel.sh workunit
Hey! I just looked at the test, and here's how it ends:
# remove snapshot and detect error from mapped snapshot
...
Alex Elder
08:15 AM rbd Bug #3927: krbd: I/O errors (ENXIO) during rbd/kernel.sh workunit
This is the relevant portion of the yaml file:
- workunit:
clients:
all:
- rbd/map-unmap.sh
...
Alex Elder
08:09 AM rbd Bug #3927 (Closed): krbd: I/O errors (ENXIO) during rbd/kernel.sh workunit
I'm seeing ENXIO errors at what I believe to the "rbd/kernel.sh
teuthology workunit while testing the new request co...
Alex Elder
05:49 AM rbd Feature #3926 (Resolved): krbd: use slab allocation for common data structures
There are some common data structures--like image and object
requests--that are very frequently allocated and would ...
Alex Elder
05:29 AM rbd Bug #3925 (Resolved): krbd: sysfs write lockdep warnings
... Alex Elder
03:42 AM Revision 2f192eaf (ceph): TestRados expects rollback, not snap_rollback
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
02:50 AM Revision 67c77577 (ceph): PendingReleaseNotes: pool removal cli changes
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:49 AM Revision 87fe35f6 (ceph): Merge remote-tracking branch 'gh/wip-rm-pool'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
02:47 AM Revision 64b9dd08 (ceph): Merge remote-tracking branch 'gh/wip-3832-oc-flushrange'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
02:43 AM Revision 9b56f367 (ceph): Merge remote-tracking branch 'gh/wip_heartbeat'
Sage Weil
02:40 AM Revision 62579eef (ceph): Merge branch 'wip-osd-hb'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
01:44 AM Revision ec5a1455 (ceph): ceph_manager: default chance_down to 0.4
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
01:40 AM Revision 566ae533 (ceph): ceph_manager: add filestore and heartbeat stalls
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
01:22 AM Revision 5d66c9ab (ceph): Use ceph git repo instead of github.
This code change is so that instead of pulling the tarball of github
which can be unreliable at times it instead uses...
Sandon Van Ness
12:55 AM Revision d6db239c (ceph): Merge remote-tracking branch 'upstream/wip_push_after_complete'
Reviewed-by: Sage Weil <sage@inktank.com> Samuel Just

01/23/2013

10:00 PM devops Feature #3229 (Resolved): Support clean ceph-fuse fstab automounting
implemented this already; /sbin/mount.fuse.ceph is in bobtail. Sage Weil
09:59 PM devops Feature #3924 (Resolved): ceph-deploy: package it
Sage Weil
09:57 PM devops Feature #3923 (Resolved): ceph-deploy: discover HOST
somewhat similar to new, except we pull the ceph.conf from a remote host. Sage Weil
09:57 PM devops Feature #3922 (Resolved): ceph-deploy: version command
Sage Weil
09:57 PM devops Feature #3921 (Resolved): ceph-deploy: support RPM-based distros
Sage Weil
09:57 PM devops Feature #3920 (Resolved): ceph-deploy: support other deb-based distros
Sage Weil
09:56 PM devops Feature #3919 (Resolved): ceph-deploy: remove upstart dependency
eliminate whatever remaining upstart dependencies are in ceph-deploy, so that upstart and sysvinit are both viable. Sage Weil
09:55 PM devops Feature #3918 (Resolved): ceph-deploy: osd create HOST:DIR[:JOURNAL]
trigger ceph-dir-prepare instead of ceph-disk-prepare. Sage Weil
09:54 PM devops Feature #3917 (Resolved): ceph-dir-prepare command
ceph-dir-prepare <dir> [journal] or similar
somewhat similar to ceph-disk-prepare, but simpler.
- allocate osd ...
Sage Weil
09:54 PM devops Feature #3916 (Resolved): ceph-disk-activate: non-upstart trigger (udev?)
Sage Weil
09:53 PM devops Feature #3915 (Rejected): ceph-disk-prepare: support sysvinit or upstart
Sage Weil
09:53 PM devops Feature #3914 (Resolved): ceph-disk-activate: support sysvinit
Sage Weil
09:52 PM devops Feature #3913 (Resolved): ceph-deploy: break mon into create/destroy
Sage Weil
09:52 PM devops Feature #3912 (Resolved): ceph-deploy: break osd into create/destroy
Actually, we want
ceph-deploy osd prepare HOST:DEV[:JOURNAL]
ceph-deploy osd activate HOST:DEVORDIR
and perh...
Sage Weil
09:52 PM devops Feature #3911 (Resolved): sysvinit: allow daemon enumeration via dirs
Sage Weil
09:52 PM devops Feature #3910 (Resolved): ceph-deploy: uninstall purge
Sage Weil
09:52 PM devops Feature #3909 (Resolved): ceph-deploy: update install for bobtail/argonaut urls
Sage Weil
09:51 PM devops Feature #3907 (Resolved): ceph-deploy: be verbose about what is run and what is done (with -q)
Sage Weil
08:49 PM Revision 8a97eef1 (ceph): ReplicatedPG: handle omap > max_recovery_chunk
span_of fails if len == 0.
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:35 PM Revision c3dec3e3 (ceph): ReplicatedPG: correctly handle omap key larger than max chunk
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
08:15 PM Revision 09c71f2f (ceph): ReplicatedPG: start scanning omap at omap_recovered_to
Previously, we started scanning omap after omap_recovered_to.
This is a problem since the break in the loop implies t...
Samuel Just
08:10 PM Bug #3904: FAILED assert(want_acting.empty())
I have a theory:
reset
started
primary
getinfo
got infos
getlog
calc_acting succeeds, choose_acting fails,...
Sage Weil
02:48 PM Bug #3904 (Resolved): FAILED assert(want_acting.empty())
Ceph 0.56.1 on Ubuntu 12.04, standard ceph.com packages. Multiple OSDs started getting marked down/crashing out, this... Faidon Liambotis
07:50 PM Revision 20278c4f (ceph): ReplicatedPG: ack push only after transaction has completed
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
07:50 PM Revision 62a4b968 (ceph): ReplicatedPG: don't finish_recovery_op until the transaction completes
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
07:50 PM Revision 4d6ba063 (ceph): ObjectStore: add queue_transactions with oncomplete
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
06:48 PM CephFS Bug #3832 (Resolved): client: does not observe O_SYNC
commit:64b9dd088d8f20019d6c1042895676b2ec57077e Sage Weil
06:42 PM Feature #3888 (Resolved): osd: stop heartbeating peers when internal heartbeat fails
Sage Weil
06:42 PM Feature #3888: osd: stop heartbeating peers when internal heartbeat fails
commit:62579eefba057eea200d8a9a3f6b3d8bca29b8b4 Sage Weil
06:31 PM Bug #3906 (Won't Fix): ceph-mon leaks memory during peering
I've done multiple OSD swaps with both 0.55 & 0.56/0.56.1 on a cluster with > 16k PGs. In those, I've noticed multipl... Faidon Liambotis
06:27 PM Bug #3905 (Can't reproduce): incomplete & stale (lost?) PGs
I added a bunch of new OSDs into my Ceph cluster (0.56.1 on Ubuntu 12.04 LTS) about 72h ago. Simultaneously, I marked... Faidon Liambotis
05:14 PM Revision a972fd40 (ceph): mds: fix end check in Server::handle_client_readdir()
commit 1174dd3188 (don't retry readdir request after issuing caps)
introduced an bug that wrongly marks 'end' in the ...
Yan, Zheng
04:49 PM Revision c061e841 (ceph): rados: safety interlock on 'rmpool' command
This is a very easy way for a user to do a lot of damage with no way back.
Make sure they mean it.
Signed-off-by: Sa...
Sage Weil
04:40 PM Revision c993ac9b (ceph): mon: implement safety interlock for deleting pools
This is a very easy way for users to accidentally to a *lot* of damage.
Make it an annoying manual process to actuall...
Sage Weil
02:43 PM Bug #3903 (Resolved): OSDMap::raw_pg_to_pps causes pools to have similar mappings
The pool should be added in a way to ensure that different pools have independent mappings. Samuel Just
02:27 PM Revision 022a5254 (ceph): osd: drop newlines from event descriptions
These produce extra newlines in the log.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Samuel Just <sam.j...
Sage Weil
02:22 PM Revision ebc93a87 (ceph): OSD: do deep_scrub for repair
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
(cherry picked...
Samuel Just
02:22 PM Revision 32527fa3 (ceph): ReplicatedPG: ignore snap link info in scrub if nlinks==0
links==0 implies that the replica did not sent snap link information.
Signed-off-by: Samuel Just <sam.just@inktank.c...
Samuel Just
02:22 PM Revision 13e42265 (ceph): osd/PG: fix osd id in error message on snap collection errors
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 381e25870f26fad144ecc2fb99710498e3a7a1d4)
Sage Weil
02:22 PM Revision e3b6191f (ceph): osd/ReplicatedPG: validate ino when scrubbing snap collections
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 665577a88b98390b9db0f9991836d10ebdd8f4cf)
Sage Weil
02:21 PM Revision 353b7341 (ceph): ReplicatedPG: compare nlinks to snapcolls
nlinks gives us the number of hardlinks to the object.
nlinks should be 1 + snapcolls.size(). This will allow
us to ...
Samuel Just
02:21 PM Revision 33d5cfc8 (ceph): ReplicatedPG/PG: check snap collections during _scan_list
During _scan_list check the snapcollections corresponding to the
object_info attr on the object. Report inconsistenc...
Samuel Just
02:21 PM Revision bea783bd (ceph): osd_types: add nlink and snapcolls fields to ScrubMap::object
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit b85687475fa2ec74e5429d92ee64eda2051a256c)
Samuel Just
02:21 PM Revision 0c48407b (ceph): PG: move auth replica selection to helper in scrub
Signed-off-by: Samuel Just <sam.just@inktank.com>
(cherry picked from commit 39bc65492af1bf1da481a8ea0a70fe7d0b4b17a3)
Samuel Just
02:21 PM Revision c3433ce6 (ceph): mon: note scrub errors in health summary
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 8e33a8b9e1fef757bbd901d55893e9b84ce6f3fc)
Sage Weil
02:21 PM Revision 90c6edd0 (ceph): osd: fix rescrub after repair
We were rescrubbing if INCONSISTENT is set, but that is now persistent.
Add a new scrub_after_recovery flag that is r...
Sage Weil
02:21 PM Revision 0696cf57 (ceph): osd: note must_scrub* flags in PG operator<<
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit d56af797f996ac92bf4e0886d416fd358a2aa08e)
Sage Weil
02:21 PM Revision 1541ffe4 (ceph): osd: based INCONSISTENT pg state on persistent scrub errors
This makes the state persistent across PG peering and OSD restarts.
This has the side-effect that, on recovery, we r...
Sage Weil
02:21 PM Revision 60910125 (ceph): osd: fix scrub scheduling for 0.0
The initial value for pair<utime_t,pg_t> can match pg 0.0, preventing it
from being manually scrubbed. Fix!
Signed-...
Sage Weil
02:21 PM Revision 0961a3a8 (ceph): osd: note last_clean_scrub_stamp, last_scrub_errors
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 389bed5d338cf32ab14c9fc2abbc7bcc386b8a28)
Sage Weil
02:21 PM Revision 8d823045 (ceph): osd: add num_scrub_errors to object_stat_t
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 2475066c3247774a2ad048a2e32968e47da1b0f5)
Sage Weil
02:20 PM Revision 3a1cd6e0 (ceph): osd: add last_clean_scrub_stamp to pg_stat_t, pg_history_t
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit d738328488de831bf090f23e3fa6d25f6fa819df)
Sage Weil
02:20 PM Revision 7e5a899b (ceph): osd: fix object_stat_sum_t dump signedness
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 6f6a41937f1bd05260a8d70b4c4a58ecadb34a2f)
Sage Weil
02:20 PM Revision e252a313 (ceph): osd: change scrub min/max thresholds
The previous 'osd scrub min interval' was mostly meaningless and useless.
Meanwhile, the 'osd scrub max interval' wou...
Sage Weil
02:20 PM Revision 33aa64ee (ceph): osd/PG: remove useless osd_scrub_min_interval check
This was already a no-op: we don't call PG::scrub_sched() unless it has
been osd_scrub_max_interval seconds since we ...
Sage Weil
02:20 PM Revision fdd0c1ec (ceph): osd: move scrub schedule random backoff to seperate helper
Separate this from the load check, which will soon vary dependon on the
PG.
Signed-off-by: Sage Weil <sage@inktank.c...
Sage Weil
02:20 PM Revision 9ffbe268 (ceph): osd/PG: trigger scrub via scrub schedule, must_ flags
When a scrub is requested, flag it and move it to the front of the
scrub schedule instead of immediately queuing it. ...
Sage Weil
02:19 PM Revision cffb1b22 (ceph): osd/PG: introduce flags to indicate explicitly requested scrubs
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 1441095d6babfacd781929e8a54ed2f8a4444467)
Sage Weil
02:19 PM Revision 438e3dfc (ceph): osd/PG: move scrub schedule registration into a helper
Simplifies callers, and will let us easily modify the decision of when
to schedule the PG for scrub.
Signed-off-by: ...
Sage Weil
01:40 PM Revision acb47e4d (ceph): os/FileStore: only flush inline if write is sufficiently large
Honor filestore_flush_min in the inline flush case.
Backport: bobtail
Signed-off-by: Sage Weil <sage@inktank.com>
Re...
Sage Weil
01:40 PM Revision 15a1ced8 (ceph): os/FileStore: fix compile when sync_file_range is missing;
If sync_file_range is not present, we always close inline, and flush
via fdatasync(2).
Fixes compile on ancient plat...
Sage Weil
01:39 PM Revision 9dddb9d8 (ceph): osd: set pg removal transactions based on configurable
Use the osd_target_transaction_size knob, and gracefully tolerate bogus
values (e.g., <= 0).
Signed-off-by: Sage Wei...
Sage Weil
01:38 PM Revision c30d231e (ceph): osd: make pg removal thread more friendly
For a large PG these are saturating the filestore and journal queues. Do
them synchronously to make them more friend...
Sage Weil
01:38 PM Revision b2bc4b95 (ceph): os: move apply_transactions() sync wrapper into ObjectStore
This has nothing to do with the backend implementation.
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked f...
Sage Weil
01:38 PM Revision 6d161b57 (ceph): os: add apply_transaction() variant that takes a sequencer
Also, move the convenience wrappers into the interface and funnel through
a single implementation.
Signed-off-by: Sa...
Sage Weil
12:31 PM Support #3902 (Closed): S3-tests need to cleanup after themselves
On Congress, DHO has hit the max number of users due to s3-tests not cleaning up after execution. Could we have the s... JuanJose Galvez
11:27 AM rbd Tasks #2853 (In Progress): krbd: read path
With my patches for the basic new request code now
out for initial review, I've started working on this
feature. I...
Alex Elder
11:20 AM rbd Subtask #2852 (In Progress): krbd: open parent on open
The many patches have now been posted for review.
Included in that is a small, temporary patch that enables
this ...
Alex Elder
05:21 AM rbd Fix #3665: librbd: deadlock during flatten
possibly here: ... Sage Weil
05:20 AM Revision 657df852 (ceph): os/FileStore: add stall injection into filestore op queue
Allow admin to artificially induce a stall in the op queue. Forces the
thread(s) to sleep for N seconds. We pause f...
Sage Weil
05:20 AM Revision 132045ce (ceph): common/HeartbeatMap: inject unhealthy heartbeat for N seconds
This lets us test code that is triggered by an unhealthy heartbeat in a
generic way.
Signed-off-by: Sage Weil <sage@...
Sage Weil
02:03 AM Revision a4e78652 (ceph): osd: do not join cluster if not healthy
If our internal heartbeats are failing, do not send a boot message and try
to join the cluster.
Signed-off-by: Sage ...
Sage Weil
02:01 AM Revision c406476c (ceph): osd: hold lock while calling start_boot on startup
This probably doesn't strictly matter because start_boot doesn't need the
lock (currently) and few other threads shou...
Sage Weil
01:56 AM Revision ad6b2311 (ceph): osd: do not reply to ping if internal heartbeat is not healthy
If we find that our internal threads are stalled, do not reply to ping
requests. If we do this long enough, peers wi...
Sage Weil
01:53 AM Revision 61eafffc (ceph): osd: reduce op thread heartbeat default 30 -> 15 seconds
If the thread stalls for 15 seconds, let our internal heartbeat fail.
This will let us internally respond more quickl...
Sage Weil
12:54 AM Revision 371e6fbe (ceph): Merge pull request #35 from cholcombe973/master
Making the usage details a little better. Yehuda Sadeh
12:23 AM Bug #3900: init-ceph should do ulimit -n's with do_root_cmd
I think he's right, except it should be do_root_cmd, and I'm not certain if that echoes the result of the command cor... Dan Mick
12:11 AM Bug #3900 (Resolved): init-ceph should do ulimit -n's with do_root_cmd
Chen Xiaoxi points out on ceph-devel:
Here is part of /etc/init.d/ceph script:
case "$command" in
s...
Dan Mick
12:19 AM Revision 0d172b95 (ceph): packaging: add smalliobenchrbd
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
12:13 AM Revision 8eee815f (ceph): Merge remote-tracking branch 'gh/wip-3833-b'
Conflicts:
src/osd/OSD.cc
src/osd/OSD.h
Reviewed-by: Samuel Just <sam.just@inktank.com>
Sage Weil
12:07 AM Revision 9388f941 (ceph): Update src/rgw/rgw_admin.cc
Improved the usage message. Chris Holcombe

01/22/2013

11:58 PM Revision eaf20fa9 (ceph): Merge branch 'wip-3651'
David Zafman
11:57 PM Revision 509a93e8 (ceph): osd: Add digest of omap for deep-scrub
Add ScrubMap encode/decode v4 message with omap digest
Compute digest of header and key/value. Use bufferlist
to ref...
David Zafman
11:57 PM Revision db48caf6 (ceph): osd: debug support for omap deep-scrub
Deep-scrub test support through admin socket
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Sam...
David Zafman
11:57 PM Revision cfb1aa80 (ceph): osd: Add missing unregister_command() in OSD::shutdown()
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <sam.just@inktank.com>
David Zafman
11:48 PM Revision e714c778 (ceph): osd: Testing of deep-scrub omap changes
Fix scrub_test.py and add omap corruption test
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: S...
David Zafman
11:23 PM Revision e328fa6c (ceph): test/bench: add rbd backend to smalliobench
Only supports format 1 images to start, and does not issue flushes, so
it's best used with caching off.
Signed-off-b...
Josh Durgin
11:10 PM Revision 0ee5ec7e (ceph): common/Throttle: fix modeline, whitespace
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:10 PM Revision c3266ad1 (ceph): config: helper to identify internal fields we should be quiet about
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:01 PM Revision 89072fbb (ceph): test/bench: don't alias bl from above
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
11:01 PM Revision c50f5f52 (ceph): test/bench: use uint64_t for uniform distribution
int is too small for rbd image sizes
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
10:55 PM Revision 451cc00a (ceph): doc: Modified usage for upgrade.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
10:47 PM Revision 73a96936 (ceph): osd: improve sub_op flag points
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision 33efe321 (ceph): osd: simplify asok to single callback
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision 24d0d7eb (ceph): osd: dump op priority queue state via admin socket
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision a1137eb3 (ceph): osd: make last state for slow requests more informative
Report on the last event string, and pass in important context for the
op event list, including:
- which peers were...
Sage Weil
10:47 PM Revision 23c02bce (ceph): osd: refactor ReplicatedPG::do_sub_op
PULL is the only case where we don't wait for active.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:47 PM Revision c549a0cf (ceph): common/PrioritizedQueue: buckets -> tokens
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision 6e3363b2 (ceph): common/PrioritizedQueue: add min cost, max tokens per bucket
Two problems.
First, we need to cap the tokens per bucket. Otherwise, a stream of
items at one priority over time w...
Sage Weil
10:47 PM Revision 514af15e (ceph): common/PrioritizedQueue: dump state to Formatter
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision bec96a23 (ceph): osd: debug msg prio, cost, latency
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision e8e0da1a (ceph): osd: use Message::get_cost() function for queueing
The data payload is a decent proxy for cost in most cases, but not all.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:47 PM Revision a1bf8220 (ceph): osd: set PULL subop cost to size of requested data
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision b685f727 (ceph): osd: add OpRequest flag point when commit is sent
With writeahead journaling in particular, we can get requests that
stay in the queue for a long time even after the c...
Sage Weil
10:47 PM Revision 128fcfca (ceph): note puller's max chunk in pull requests
this lets us calculate a cost value Sage Weil
10:47 PM Revision cfe4b851 (ceph): os/FileStore: allow filestore_queue_max_{ops,bytes} to be adjusted at r...
The 'committing' ones too.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
10:47 PM Revision 44dca5c8 (ceph): filestore: disable extra committing queue allowance
The motivation here is if there is a problem draining the op queue
during a sync. For XFS and ext4, this isn't gener...
Sage Weil
10:47 PM Revision 1233e861 (ceph): osd: target transaction size 300 -> 30
Small transactions make pg removal nicer to the op queue. It also slows
down PG deletion a bit, which may exacerbate...
Sage Weil
10:47 PM Revision 40654d6d (ceph): filestore: filestore_queue_max_ops 500 -> 50
Having a deep queue limits the effectiveness of the priority queues
above by adding additional latency.
Signed-off-b...
Sage Weil
10:47 PM Revision 9230c863 (ceph): osd: make OSD a config observer
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:47 PM Revision 101955a6 (ceph): osd: make osd_max_backfills dynamically adjustable
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
09:23 PM Feature #3888 (Fix Under Review): osd: stop heartbeating peers when internal heartbeat fails
wip-osd-hb Sage Weil
03:09 PM Feature #3888: osd: stop heartbeating peers when internal heartbeat fails
backport to bobtail! Sage Weil
08:12 AM Feature #3888 (Resolved): osd: stop heartbeating peers when internal heartbeat fails
if our internal thread heartbeats fail, stop replying to pings from peers. Sage Weil
09:09 PM Revision b6e3edc6 (ceph): test: create /tmp/cephtest/mnt.{id}
The workunit task assumes that a mount exists
at /tmp/cephtest/mnt.{id}
This patch creates the path if it doesn't
exi...
Joe Buck
09:05 PM Revision 6401abf8 (ceph): qa/workunit: Add iozone test script for sync
The iozone-sync.sh script runs iozone testing
various sync flags, O_SYNC, O_DSYNC, O_RSYNC.
Signed-off-by: Sam Lang ...
Sam Lang
09:05 PM Revision 72147fd3 (ceph): objectcacher: Remove commit_set, use flush_set
commit_set() and flush_set() are identical in functionality,
so use flush_set everywhere and remove commit_set from
t...
Sam Lang
08:43 PM Revision 00b11869 (ceph): testing: add workunit to run hadoop internal tests.
This workunit runs the internal tests for our local branch of hadoop-common.
Requires ant be installed on the host ru...
Joe Buck
07:37 PM Bug #3899 (Won't Fix): osd: failed to decode object_info_t
This happened after moving a journal from a file to an ssd, and changing filestore xattr use omap from true to false,... Josh Durgin
07:36 PM Bug #3836: osd: common/Mutex.cc: 94: FAILED assert(r == 0) in PG::start_flush()
ubuntu@teuthology:/a/teuthology-2013-01-22_07:00:04-regression-bobtail-master-basic/3235... Tamilarasi muthamizhan
07:19 PM devops Bug #3898 (Resolved): ceph-deploy: problems with >1 mon
If you try "ceph-deploy new ceph1 ceph2" then it correctly creates the ceph.conf and then spits out "Cluster config e... Greg Farnum
06:25 PM Revision 4a871b55 (ceph): Merge branch 'wip-config'
Reviewed-by: Yehuda Sadeh <yehuda@inktank.com> Sage Weil
06:24 PM Revision 359d0e98 (ceph): config: report on log level changes
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
06:24 PM Revision c5e09517 (ceph): config: clean up output
Report a simple list of key='value', without extra verbosity.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:37 PM CephFS Bug #3404: oops in strlen() from set_request_path_attr()
I'm found the same bug in Bobtail release with NFS kernel server and 3.7.3 kernel
[70205.985665] BUG: unable to ha...
Ivan Kudryavtsev
05:35 PM Bug #3513 (Resolved): rgw log show error
Dan Mick
05:35 PM Bug #3513: rgw log show error
Nope, I had it wrong; the required params are: object *or* all three of date, bucket, and bucket-id.
Message change ...
Dan Mick
02:37 PM Bug #3513: rgw log show error
Actually I guess the && should be || and the || should be && (the old DeMorgan's rule) Dan Mick
02:30 PM Bug #3513: rgw log show error
I experienced this also on ubuntu 12.10 0.56.1-1
root@dlcephgw01:~# radosgw-admin log show --bucket=chris --date...
Chris Holcombe
05:04 PM rgw Bug #3896 (Resolved): rest-bench common/WorkQueue.cc: 54: FAILED assert(_threads.empty())
It seems rest-bench doesn't like to exit cleanly while cleaning up after itself.... I did test at low concurrency bu... Bill Reid
04:31 PM Bug #3895 (Resolved): librados test hang during mon thrashing
ubuntu@teuthology:/var/lib/teuthworker/archive/teuthology-2013-01-21_19:00:03-regression-master-testing-gcov/2929
...
Sage Weil
04:05 PM Feature #3651 (Resolved): osd: deep scrub should hash omap
David Zafman
02:58 PM Bug #3894 (Closed): monclient: --keyring failed despite presence of file
While going over install basics with Gary, we got "ERROR: missing keyring, cannot use cephx for authentication" when ... Greg Farnum
02:40 PM rbd Feature #3877 (Fix Under Review): krbd: don't wait for notify ack to complete
I've posted this code for review. I continue to do testing. Alex Elder
02:39 PM rbd Subtask #3741 (Fix Under Review): krbd: rework request tracking code
I've posted this code for review. I continue to do testing. Alex Elder
02:39 PM rbd Tasks #3755 (Fix Under Review): krbd: use new request tracking code for sync object operations
I've posted this code for review. I continue to do testing. Alex Elder
02:39 PM rbd Feature #3754 (Fix Under Review): krbd: use new request tracking code for notify ack
I've posted this code for review. I continue to do testing. Alex Elder
02:19 PM rbd Feature #3893 (Rejected): krbd: document the new request code
There are bits and pieces of the new request code
documented for the kernel rbd client--in the comments
and in the ...
Alex Elder
02:09 PM CephFS Bug #3832: client: does not observe O_SYNC
Fixed a bug in objectcacher::flush_set. Branch wip-3832-oc-flushrange has been updated, and passes the accompanying ... Sam Lang
01:09 PM Subtask #2659: mon: Single-Paxos: ceph tool -w subscriptions not being updated
Can't recall if this was fixed at some point, or if the root cause was even related.
This must be tested again onc...
Joao Eduardo Luis
01:06 PM Subtask #2622 (Resolved): mon: Single-Paxos: convert existing, old MonitorStore to a brand new Mo...
This was implemented both as an offline tool as well as integrated in ceph-mon. The ceph-mon will attempt to open the... Joao Eduardo Luis
01:02 PM Subtask #3069: mon: Single-Paxos: messaging: log MMonSync messages for offline matching
If we really want to do offline matching, this can be done using just the logs. This could be interesting however fo... Joao Eduardo Luis
12:54 PM Subtask #3843 (Rejected): osd: move purged_snaps out of info
Sage Weil
12:54 PM Subtask #3844 (Rejected): osd: move info and log into leveldb
Sage Weil
12:54 PM Subtask #3842 (Rejected): osd: create tool to extract pg info and pg log from filestore
Sage Weil
12:54 PM Feature #3841 (Rejected): osd: avoid seeks for log and info writes on client writes
broke out subtasksa nd top level features Sage Weil
12:53 PM Feature #3892 (Resolved): osd: move pg info into leveldb
Sage Weil
12:53 PM Feature #3891 (Resolved): osd: move purged_snaps out of info
Sage Weil
12:53 PM Feature #3890 (Resolved): osd: create tool to extract pg info and pg log from filestore
Sage Weil
10:38 AM Feature #2580 (Resolved): perf: investigate poor performance at 10 osds per node
This was probably unique to the burnupi cluster and/or older ceph. Performance is fine on the SC847a now with lots o... Mark Nelson
10:27 AM rbd Bug #3889 (Won't Fix): krbd: handle zero-length requests
I'm pretty sure there are some special zero-length
requests (like flush) that can come down from the
block layer. ...
Alex Elder
07:07 AM Linux kernel client Bug #3887 (Closed): kernel client: small object memory leak
In testing my new request code for rbd (issue 3741 and related)
I tried paying special attention to Linux slab usage...
Alex Elder
05:10 AM Revision 98cc1b83 (ceph): task: mon_clock_skew_check: add option to run at least one timecheck
at-least-once Runs at least once, even if we are told to stop.
(default: True)
at...
Joao Eduardo Luis
04:11 AM Linux kernel client Bug #3886: Futher testing result for the issue "ceph: avoid 32-bit page index overflow"
https://SizableSend.com/0g9dwn/ceph_mds.a.log Mohamed Pakkeer
04:06 AM Linux kernel client Bug #3886 (New): Futher testing result for the issue "ceph: avoid 32-bit page index overflow"
We raised an issue in the following ticket and the ticket has been resolved
http://tracker.newdrea...
Mohamed Pakkeer

01/21/2013

11:09 PM Revision b7cb1b11 (ceph): rados/thrash: 3 monitors, so that we can thrash them
Sage Weil
10:20 PM Feature #3848: osd: gracefully handle cluster network heartbeat failure
One option: do not mark ourselves back up (after being wrongly marked down) unless we are able to successfully ping a... Sage Weil
10:12 PM Bug #3885 (Resolved): osd: osd-recovery-incomplete qa test failing
ubuntu@teuthology:/a/teuthology-2013-01-21_19:00:03-regression-master-testing-gcov$ teuthology-ls --archive-dir . | g... Sage Weil
10:08 PM Feature #3833 (In Progress): osd: improve recovery throttling
Sage Weil
09:59 PM Bug #2655: scrub slows writes more than it should
This ticket predates the chunky scrub work that went into ~0.54 or thereabouts. Sage Weil
09:15 PM Bug #2655 (Resolved): scrub slows writes more than it should
Sage Weil
09:12 PM Bug #2357 (Can't reproduce): mds takes down ceph
Sage Weil
09:11 PM Bug #3854 (Resolved): mon: clock skew tests failing on master
pushed to master Sage Weil
04:45 PM Revision d7d81922 (ceph): config: don't make noise about 'internal_safe_to_start_threads'
This is set on start, and subsequently gets into the changed set.
Once any other config value is injected, it is the ...
Sage Weil
04:22 PM Revision 3399860d (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
04:21 PM Revision 2e39dd5e (ceph): mds: fix default_file_layout constructor
Signed-off-by: Greg Farnum <greg@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Greg Farnum
04:21 PM Revision e461f096 (ceph): mds: fix byte_range_t ctor
I do not think we saw any bugs from this, but anything that involved
capability issues on restart or migrate might ha...
Greg Farnum
01:20 PM Fix #3884 (Resolved): osd: resurrect partially deleted PGs
If a PG is in the process of getting removed and we repeer and discover we want to keep it, we currently block waitin... Sage Weil
12:30 PM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Corin-
Have you tried 0.48.3 again since then? I'd like to get to the bottom of this, if possible... :)
Sage Weil
09:35 AM rbd Bug #3737: Higher ping-latency observed in qemu with rbd_cache=true during disk-write
Hi Josh,
according to our conversation I did some testing.
I started the dd if=/dev... of=/tmp/doof.dat bs=4k cou...
Oliver Francke
12:11 AM Revision c5fe0965 (ceph): osd: calculate initial PG mapping from PG's osdmap
The initial values of up/acting need to be based on the PG's osdmap, not
the OSD's latest. This can cause various co...
Sage Weil
12:11 AM Revision 17160843 (ceph): osd: calculate initial PG mapping from PG's osdmap
The initial values of up/acting need to be based on the PG's osdmap, not
the OSD's latest. This can cause various co...
Sage Weil

01/20/2013

11:01 PM CephFS Feature #1236 (Fix Under Review): libceph: set layout via virtual xattrs (libceph/cfuse)
wip-vxattr (ceph.git) and wip-vxattrs (ceph-client.git). There's a test script that passes on both fuse and kclient.... Sage Weil
10:58 PM CephFS Feature #1236: libceph: set layout via virtual xattrs (libceph/cfuse)
Greg Farnum wrote:
> How large would a simple "layout" xattr actually be in comparison to the shipped inodes? I'm no...
Sage Weil
03:12 PM CephFS Feature #1236: libceph: set layout via virtual xattrs (libceph/cfuse)
How large would a simple "layout" xattr actually be in comparison to the shipped inodes? I'm not sure the size is so ... Greg Farnum
08:26 PM rbd Feature #3877: krbd: don't wait for notify ack to complete
I have implemented this in the new request code.
It will be posted for review along with the rest
of that new code ...
Alex Elder
08:14 PM rbd Feature #3877 (In Progress): krbd: don't wait for notify ack to complete
Ian points out that "I've already implemented this change"
suggests that the status of this issue should at least
b...
Alex Elder
08:26 PM rbd Subtask #3741 (In Progress): krbd: rework request tracking code
Considering this "is actually work that's mostly complete"
I'm (finally) marking it "In Progress."
This code is f...
Alex Elder
08:22 PM rbd Feature #3754 (In Progress): krbd: use new request tracking code for notify ack
I have completed implementing sending synchronous acknowledgement
in response to a watch request notification. It i...
Alex Elder
08:19 PM rbd Tasks #3755 (In Progress): krbd: use new request tracking code for sync object operations
I have completed implementing all of these in the new request
code:
- synchronous object read (for v1 header object...
Alex Elder
04:12 PM Bug #3879 (Resolved): ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
thanks! commit:17160843d0c523359d8fa934418ff2c1f7bffb25 Sage Weil
03:51 PM Bug #3879: ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
Looks good to me. Samuel Just
09:58 AM Bug #3879 (Fix Under Review): ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
wip-3879 Sage Weil
09:06 AM Bug #3879: ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
Output from the following attached:
ceph osd getmap 554 -o 554
Jens Kristian Søgaard
08:46 AM Bug #3879 (In Progress): ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
Jens Kristian Søgaard wrote:
> Output from the following attached:
>
> ceph osd getmap 555 -o 555
> ceph osd get...
Sage Weil
12:49 AM Bug #3879: ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
Output from the following attached:
ceph osd getmap 555 -o 555
ceph osd getmap 556 -o 556
Jens Kristian Søgaard
11:15 AM Bug #3883 (Won't Fix): osd: leaks memory (possibly triggered by scrubbing) on argonaut
100MB/day reported by multiple users, both on 0.48 and 0.56.1.
Some correlation with scrubbing. Possibly specific...
Sage Weil
09:55 AM CephFS Feature #3882: Hide snapshot directory name in mount/mtab
It seems like better (or perhaps just "more important") fix is to restrict access to .snap in the first place.
FWI...
Sage Weil
07:14 AM CephFS Feature #3882 (Rejected): Hide snapshot directory name in mount/mtab
The idea is to avoid users to see what snapshot directory name choosen during mount.
This is useful if we want to...
Ivan Kudryavtsev
09:51 AM CephFS Bug #3881 (Rejected): Wrong ip network to exchange data between kernel ceph and MDS
Ivan Kudryavtsev wrote:
> Hm. It seems that I'm wrong about the way it works. It connects to OSDs via OSD-defined pu...
Sage Weil
09:44 AM CephFS Bug #3881: Wrong ip network to exchange data between kernel ceph and MDS
Hm. It seems that I'm wrong about the way it works. It connects to OSDs via OSD-defined public network. It seems that... Ivan Kudryavtsev
07:03 AM CephFS Bug #3881 (Rejected): Wrong ip network to exchange data between kernel ceph and MDS
I'm using ceph installation with three networks:
1st is Infiniband network for OSD exchange and replication
2nd i...
Ivan Kudryavtsev

01/19/2013

02:24 PM Bug #3879: ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
full log at http://bit.ly/11Hn7BN
Sage Weil
02:04 PM Bug #3879 (Resolved): ./osd/OSDMap.h: 367: FAILED assert(exists(osd))
... Sage Weil
12:58 PM Bug #3878 (Rejected): osd: nobackfill flag doesn't work
on currently master, bobtail Sage Weil
11:43 AM Feature #3833: osd: improve recovery throttling
see wip-3833 for push Sage Weil
08:40 AM rbd Feature #3877 (Closed): krbd: don't wait for notify ack to complete
When we receive notification of a change to an rbd image's header
object we need to refresh our information about th...
Alex Elder
06:36 AM Revision 2491f976 (ceph): workunits/cephtool: add tests for ceph osd pool set/get
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
04:57 AM Revision ea9628fb (ceph): Merge remote-tracking branch 'gh/next'
Sage Weil
03:26 AM Revision 48308954 (ceph): Clarify journal size based on filestore max sync
The docs had the recommended journal size based on the option
"filestore min sync interval" when it should have been
...
Travis Rhoden
02:32 AM Revision aea898db (ceph): ceph: reject negative weights at ceph osd <n> reweight
Check the integer (fixed-point) value to avoid any worries
about floating-point rounding. Add tests for reweight < 0...
Dan Mick
02:32 AM Revision 7d9d7651 (ceph): workunit/cephtool: Use '! cmd' when expecting failure
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
12:55 AM Revision ee4a9f25 (ceph): marginal/mds_thrasher: Add tests for mds thrasher
Adds a basic set of roles for testing the mds thrasher
with 1 active and 1 standby, and a few basic tests that
stress...
Sam Lang
12:40 AM Revision 6008b1d8 (ceph): osdmap: make replica separate in default crush map configurable
Add 'osd crush chooseleaf type' option to control what the default
CRUSH rule separates replicas across. Default to ...
Sage Weil
12:17 AM Revision 8c0d702e (ceph): msg/Pipe: use state_closed atomic_t for _lookup_pipe
We shouldn't look at Pipe::state in SimpleMessenger::_lookup_pipe() without
holding pipe_lock. Instead, use an atomi...
Sage Weil
12:17 AM Revision 5fb77bf1 (ceph): ceph: adjust crush tunables via 'ceph osd crush tunables <profile>'
Make it easy to adjust crush tunables. Create profiles:
legacy: the legacy values
argonaut: the argonaut defaults...
Sage Weil
12:17 AM Revision 373f1671 (ceph): msgr: atomically queue first message with connect_rank
Atomically queue the first message on the new pipe, without dropping
and retaking pipe_lock.
Signed-off-by: Sage Wei...
Sage Weil
12:17 AM Revision ae1882e7 (ceph): msgr: don't queue message on closed pipe
If we have a con that refs a pipe but it is closed, don't use it. If
the ref is still there, it is only because we a...
Sage Weil
12:17 AM Revision 34e2d402 (ceph): msgr: fix race on Pipe removal from hash
When a pipe is faulting and shutting down, we have to drop pipe_lock to
take msgr lock and then remove the entry. Th...
Sage Weil
12:17 AM Revision 8e0359c3 (ceph): msgr: inject delays at inconvenient times
Exercise some rare races by injecting delays before taking locks
via the 'ms inject internal delays' option.
Signed-...
Sage Weil
12:01 AM Revision 0cb760f3 (ceph): OSD: do deep_scrub for repair
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just

01/18/2013

11:45 PM Revision 684a8f8f (ceph): Merge branch 'wip-pg-removal'
Reviewed-by: Samuel Just <sam.just@inktank.com> Sage Weil
11:44 PM Revision f6c69c3f (ceph): os: add apply_transaction() variant that takes a sequencer
Also, move the convenience wrappers into the interface and funnel through
a single implementation.
Signed-off-by: Sa...
Sage Weil
11:44 PM Revision bc994045 (ceph): os: move apply_transactions() sync wrapper into ObjectStore
This has nothing to do with the backend implementation.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
11:44 PM Revision 4712e984 (ceph): osd: make pg removal thread more friendly
For a large PG these are saturating the filestore and journal queues. Do
them synchronously to make them more friend...
Sage Weil
11:44 PM Revision 5e00af40 (ceph): osd: set pg removal transactions based on configurable
Use the osd_target_transaction_size knob, and gracefully tolerate bogus
values (e.g., <= 0).
Signed-off-by: Sage Wei...
Sage Weil
11:33 PM Revision 82f22b38 (ceph): config_opts.h: default osd_recovery_delay_start to 0
This setting was intended to prevent recovery from overwhelming peering traffic
by delaying the recovery_wq until osd...
Samuel Just
11:21 PM Documentation #3711: crush-map.rst: choose firstn talks about "N", but does not clearly define wh...
Dan Mick
11:20 PM Documentation #3711: crush-map.rst: choose firstn talks about "N", but does not clearly define wh...
Sorry, I think this is still wrong; the descriptions of {num} only apply if firstn is supplied, correct? Otherwise {... Dan Mick
11:12 PM Bug #3869: ceph osd pool get doesn't support everything set does
Added tests with commit:2491f976e4cd6eca5c30f7c184038364e4fe1873
Dan Mick
01:22 PM Bug #3869: ceph osd pool get doesn't support everything set does
how about a quick bash test script that gets and sets some of these values? Sage Weil
12:49 PM Bug #3869 (Resolved): ceph osd pool get doesn't support everything set does
commit:1f911fd0616c3fb45d5d36de7947a1914190017b
Dan Mick
12:27 PM Bug #3869 (Fix Under Review): ceph osd pool get doesn't support everything set does
Dan Mick
12:15 PM Bug #3869: ceph osd pool get doesn't support everything set does
This was noted on #ceph overnight. Dan Mick
12:14 PM Bug #3869 (Resolved): ceph osd pool get doesn't support everything set does
...for no apparently good reason. Adding the missing info is easy. Dan Mick
11:11 PM RADOS Bug #3872 (Resolved): You can put negative weights on OSDs
commit:aea898db2b56878b50f09dcbbf52347f4cc5c754
Dan Mick
05:39 PM RADOS Bug #3872: You can put negative weights on OSDs
Dan Mick
04:01 PM RADOS Bug #3872 (Fix Under Review): You can put negative weights on OSDs
Dan Mick
02:32 PM RADOS Bug #3872 (Resolved): You can put negative weights on OSDs
DHO reports that negative weights can be assigned to an OSD. Tested on Alexandria running 0.56-20-g9aecacd-1precise.
...
JuanJose Galvez
09:48 PM Revision 53f22d94 (ceph): task/mds_thrasher: New task for thrashing the mds
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
09:43 PM Revision 4bdcfbff (ceph): client: Respect O_SYNC, O_DSYNC, and O_RSYNC
If the file is opened with O_SYNC, O_DSYNC, or O_RSYNC, we need to
flush cached data (and metadata for O_SYNC) on a w...
Sam Lang
09:31 PM Revision b4e0f7ca (ceph): Merge remote-tracking branch 'gh/wip-client-pool-api'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
09:16 PM Linux kernel client Bug #3875 (Resolved): osd_client: don't use r_num_pages for bio requests
There is an osd request field "r_num_pages" that's used
to record the number of pages supplied with the request.
Fo...
Alex Elder
09:02 PM Revision 609442da (ceph): Merge remote-tracking branch 'gh/wip-scrub-argonaut' into argonaut
Sage Weil
08:42 PM Revision 1f911fd0 (ceph): ceph: allow osd pool get to get everything you can set
osd pool get was missing size, min_size, crash_replay_interval,
and crush_ruleset; they're all easily added.
Fixes: ...
Dan Mick
08:21 PM Revision 045af959 (ceph): qa: remove xfstest 068 from qemu testing
This tests fsfreeze, which sometimes hangs in xfs in linux 3.2
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
08:14 PM Revision 49726dcf (ceph): os/FileStore: only flush inline if write is sufficiently large
Honor filestore_flush_min in the inline flush case.
Backport: bobtail
Signed-off-by: Sage Weil <sage@inktank.com>
Re...
Sage Weil
08:14 PM Revision 8ddb55d3 (ceph): os/FileStore: fix compile when sync_file_range is missing;
If sync_file_range is not present, we always close inline, and flush
via fdatasync(2).
Fixes compile on ancient plat...
Sage Weil
07:05 PM Revision b8d5e286 (ceph): doc/rados/operations/crush: need kernel v3.6 for first round of tunables
Reported-by: rl219 in #ceph on irc.oftc.net
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
06:47 PM Revision dbc38eff (ceph): rbd.py: update scratch and test image sizes
Test 167 was failing due to running out of space on the scratch
file system. The test reserves 21MB in a file, and r...
Alex Elder
06:45 PM Revision 7e8e6491 (ceph): os/: Add CollectionIndex::prep_delete
If an unlink is interupted between removing the file
and updating the subdir attribute, the attribute will
overestima...
Samuel Just
06:35 PM Revision 736966f3 (ceph): java: support get pool id/replication interface
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
06:33 PM Revision 40415d1c (ceph): libcephfs: add pool id/size lookup interface
Adds new interfaces ceph_get_pool_id() and ceph_get_pool_replication()
to libcephfs.
Signed-off-by: Noah Watkins <no...
Noah Watkins
05:35 PM CephFS Feature #1236: libceph: set layout via virtual xattrs (libceph/cfuse)
Translating any ceph.* setxattrs into a sync setxattr and handling it on the MDS seems like an easy win. I can't thi... Sage Weil
01:34 PM CephFS Feature #1236: libceph: set layout via virtual xattrs (libceph/cfuse)
We're still thinking through the implications of the best way to implement this. Nonetheless there are people using h... Greg Farnum
05:01 PM CephFS Bug #3832: client: does not observe O_SYNC
Current status: the iozone-sync.sh test script is causing a segfault (sometimes at hang). Needs more testing! Segf... Sam Lang
04:46 PM Documentation #3808 (In Progress): Block device quick start page need update
John Wilkins
03:58 PM Bug #3768: perl is required for logrotate, we need to include Perl as a dependency
It had sounded to me like the trend was towards eliminating the Perl usage rather than adding it as a dependency. Did... Greg Farnum
03:56 PM Feature #3815 (Duplicate): osd: move pg_info_t back into the xattr; avoid writing pginfo file whe...
Sage Weil
03:49 PM Bug #3870 (Resolved): osd: make pg removal more friendly
commit:684a8f8f84312d4d9c6cdeb8d6d9fad792bd5a6d Sage Weil
01:44 PM Bug #3870 (Resolved): osd: make pg removal more friendly
wip-pg-removal needs cleanup and merge Sage Weil
03:49 PM Bug #3806 (Won't Fix): OSDs stuck in active+degraded after changing replication from 2 to 3
Thanks. I was trying to figure out where the conflict could come from, and actually it does make sense: The single-os... Greg Farnum
03:45 PM Bug #3806: OSDs stuck in active+degraded after changing replication from 2 to 3
Sure, it's attached... Ben Poliakoff
03:40 PM Bug #3806: OSDs stuck in active+degraded after changing replication from 2 to 3
@Josh: Even with the new CRUSH tunables it's still a matter of probability, so if you give it a particularly challeng... Greg Farnum
03:31 PM Bug #3806: OSDs stuck in active+degraded after changing replication from 2 to 3
OK, it looks like I may have simply given CRUSH a challenging assignment, given the resources of the cluster.
I ...
Ben Poliakoff
02:58 PM Bug #3873 (Duplicate): Ceph cli tool allows setting negative weights
Ian Colle
02:54 PM Bug #3873 (Duplicate): Ceph cli tool allows setting negative weights
Setting OSD weights to negative values:... Kyle Bader
02:46 PM Bug #1807 (Can't reproduce): CentOS compile error in perfglue/heap_profiler.cc
Anonymous
02:01 PM CephFS Feature #3570 (Resolved): teuthology: mds thrasher
Sage Weil
02:01 PM rbd Bug #3871 (Resolved): krbd: initial header read may be out of date
Currently krbd uses the version parameter of a watch operation to try to prevent this, but that was never implemented... Josh Durgin
01:55 PM Linux kernel client Bug #3860 (Rejected): rbd: problems if watch setup returns ERANGE
Josh Durgin
01:54 PM Linux kernel client Bug #3860: rbd: problems if watch setup returns ERANGE
ERANGE is never actually returned - it was never implemented (#2592). The real fix for the race it was intended to pr... Josh Durgin
08:08 AM Linux kernel client Bug #3860 (Rejected): rbd: problems if watch setup returns ERANGE
When rbd sets up the watch request for a newly-mapped rbd image
it loops and tries again if the request returns ERAN...
Alex Elder
12:49 PM CephFS Feature #3865 (Duplicate): mds: implement lookup-by-ino based on inode backtraces
#3541. Whoops! Greg Farnum
11:02 AM CephFS Feature #3865 (Duplicate): mds: implement lookup-by-ino based on inode backtraces
Following #3862 and #3863, implement the lookup-by-ino algorithm described in http://www.spinics.net/lists/ceph-devel... Greg Farnum
12:49 PM CephFS Feature #3541: mds: robust ino lookup using file backpointers
We have a design now! Greg Farnum
12:48 PM CephFS Feature #3862 (Duplicate): mds: add file backtraces to data objects
#3540. Whoops! Greg Farnum
10:26 AM CephFS Feature #3862 (Duplicate): mds: add file backtraces to data objects
Add backtraces to each file object, as described at http://www.spinics.net/lists/ceph-devel/msg11872.html. This ticke... Greg Farnum
12:48 PM CephFS Feature #3540: mds: maintain per-file backpointers on first file object
We have a design now! Greg Farnum
11:09 AM CephFS Feature #3727: mds: refactor EMetablob encoding paths
What is this bug about? Greg Farnum
11:08 AM CephFS Feature #3867 (Resolved): optionally do not use an anchor table
Following #3865 and #3866, we should introduce a config option that, when set, does not make use of the Anchor table ... Greg Farnum
11:07 AM CephFS Feature #3866 (New): mds: Add lazily-updated backtraces to hard links
As described in http://www.spinics.net/lists/ceph-devel/msg11872.html, we want hard links to contain lazily-updated b... Greg Farnum
10:55 AM CephFS Feature #3863: implement a tool to lookup inode numbers without holding their path
+1 for just adding the libcephfs function, and a test in test_libcephfs. Sam Lang
10:41 AM CephFS Feature #3863 (Resolved): implement a tool to lookup inode numbers without holding their path
This should just be a small wrapper around Client.cc*, but we need to be able to generate inode lookups without knowi... Greg Farnum
10:41 AM Feature #3769: osd: scrub should verify snap collection existence, membership
In master, sha-1 7b6fe03208c507b55517abe45cdff5c96d91904a
Needs backport when we are happy with the testing (if it's...
Samuel Just
10:15 AM rbd Tasks #3755: krbd: use new request tracking code for sync object operations
The sync header read operation was another one that was needed.
That's basically done too.
All of this will be re...
Alex Elder
10:09 AM rbd Tasks #3755: krbd: use new request tracking code for sync object operations
I have been looking in detail at how the watch requests are
implemented and in the process identified a few potentia...
Alex Elder
10:14 AM Linux kernel client Bug #3751 (Resolved): krbd: fix type of snap_id local variable
... Alex Elder
10:11 AM Bug #3854 (Fix Under Review): mon: clock skew tests failing on master
Joao Eduardo Luis
10:07 AM Bug #3854: mon: clock skew tests failing on master
teuthology's wip-3854 commit:1d8640860441dc27e8342788c1ae17f5c1b3ccc0 fixes this issue. Joao Eduardo Luis
09:00 AM Bug #3816: osd/OSD.cc: 3318: FAILED assert(osd_lock.is_locked())
commit:98a763123240803741ac9f67846b8f405f1b005b
When the osd does a "mark myself back up" it takes care to rebind ...
Sage Weil
08:58 AM rbd Feature #3861 (Resolved): rbd: consider splitting rbd_osd_req_op_create()
When it was out for review, Josh suggested that it might
be better to have separate (type-checking) functions for
b...
Alex Elder
08:25 AM CephFS Bug #3845: mds: standby_for_rank not getting cleared on takeover
+1 clearing it for cosmetic reasons. Sam Lang
08:25 AM Revision 76e715ba (ceph): doc: Added link to rotation section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:25 AM Revision e1741ba6 (ceph): doc: Added hyperlink to log rotation section.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
08:24 AM Revision 612717af (ceph): doc: Added section on log rotation.
fixes: #3776
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
08:07 AM rbd Bug #3859 (Resolved): osd_client: define ceph_osdc_clear_request_linger()
There is a ceph_osdc_set_request_linger() function that
sets a flag on a request and takes an additional reference.
...
Alex Elder
08:04 AM rbd Bug #3858 (Resolved): osd_client: ceph_osdc_wait_request() seems wrong
The only error wait_for_completion_interruptible() will
return is ERESTARTSYS. So if that gets returned inside
cep...
Alex Elder
07:33 AM Revision 48f41468 (ceph): Merge branch 'master' of https://github.com/ceph/ceph
John Wilkins
07:32 AM Revision 83326588 (ceph): doc: Modified index to include mon-osd-interaction.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:31 AM Revision d6fc92df (ceph): doc: Added a section describing mon/osd interaction.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
07:14 AM rbd Feature #1491: qemu: make qemu-img convert fast
This was rejected because feature is not relevant anymore. At the time, when I was looking at it there was some obvio... Yehuda Sadeh
06:43 AM Revision bebdc70b (ceph): build: Add perl installation dependency to rpm and debian packages.
There was already a dependency on python in the debian control file,
a similar dependency was added to the rpm spec f...
Gary Lowell
06:13 AM Revision ff7c971f (ceph): doc: Added an admonishment for SSD write latency.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
06:00 AM Revision 6f28faf9 (ceph): mds: open mydir after replay
In certain cases, we may replay the journal and not end up with the
dirfrag for mydir open. This is fine--we just ne...
Sage Weil
05:51 AM Revision dd7caf5f (ceph): mds: gracefully exit if newer gid replaces us by name
If 'mds enforce unique name' is set, and another MDS with the same name
kicks us out of the MDSMap, gracefully exit i...
Sage Weil
05:45 AM Revision 2e112333 (ceph): mon: enforce unique name in mdsmap
Add 'mds enforce unique name' option, defaulting to true.
If set, when an MDS boots, it will kick any previous mds w...
Sage Weil
05:27 AM Revision ca2d9ac9 (ceph): doc: Updated OSD configuration reference with backfill config options.
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
05:25 AM Revision e330b7ec (ceph): mon: create fail_mds_gid() helper; make 'ceph mds rm ...' more generic
Take a gid or a rank or a name. Use a nicer helper.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
05:05 AM Revision 5a384f48 (ceph): Merge branch 'wip-mds'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
05:00 AM Revision f41b5421 (ceph): add mon_thrash task to kernel and rados thrashers collections
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
04:57 AM Revision 626f6104 (ceph): Add a test for the truncate/osd-commit-reply race
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
04:54 AM Revision cc7bf1bd (ceph): rados: add osd reply delay injection
Sage Weil
01:54 AM Revision d81ac841 (ceph): rbd: fix bench-write infinite loop
I/O was continously submitted as long as there were few enough ops in
flight. If the number of 'threads' was high, or...
Josh Durgin
01:01 AM Revision 233d034d (ceph): Merge branch 'wip-cephx'
Reviewed-by: Josh Durgin <josh.durgin@inktank.com> Sage Weil
12:43 AM devops Feature #2885 (Resolved): doc: mon initial members requirements, functioning, admin steps to take
This was done some time ago. Step 9 here: http://ceph.com/docs/master/rados/deployment/chef/#configure-your-ceph-envi... John Wilkins
12:35 AM Documentation #3062 (Resolved): doc: osd tuning config options
This was completed some time ago. John Wilkins
12:28 AM Documentation #3329 (Resolved): doc: What metrics should be used to set node weight
Discussion was primarily starting with 1TB as a weight of 1.00 with additional consideration for throughput. If this ... John Wilkins
12:27 AM Tasks #3779 (In Progress): update osd config ref as appropriate
John Wilkins
12:26 AM Bug #3776 (Resolved): Need doc describing how to alter our log rotation
John Wilkins
12:09 AM Revision e776b63d (ceph): crushtool: consolidate_whitespace() should eat everything except \n
CRUSH map source with \r (like a DOS text file) failed to compile
with the usual nonuseful message; turns out that ea...
Dan Mick
12:09 AM Revision 60db6e3e (ceph): crushtool: warn usefully about missing output spec
When running with --test, you must request output to CSV files or
specific types of output to --show-X; make the erro...
Dan Mick

01/17/2013

11:41 PM Documentation #3711 (Resolved): crush-map.rst: choose firstn talks about "N", but does not clearl...
John Wilkins
11:41 PM Documentation #3389 (Resolved): doc: crush docs could use a full example crushmap
John Wilkins
11:40 PM Documentation #3709 (Resolved): crush-map.rst: claims 'types' are default, not true (must be spec...
John Wilkins
11:40 PM Documentation #3707 (Resolved): crush-map.rst: syntax error in example
John Wilkins
11:28 PM Feature #3505 (Resolved): default to libnss
This was done for RPMs with the commit listed below. Debians already had the --with-nss flag in the rules file.
...
Anonymous
11:21 PM Bug #2176 (In Progress): dependencies not checked by autoconf
All these are listed as build requirements for the rpm and debian packages. I'll add the missing ones to configure.ac. Anonymous
11:16 PM devops Tasks #3512 (In Progress): Publish our fastcgi packages
The approach is to pick up the latest debian and rpm packages for mod_fastcgi, apply the ceph patch, and build manual... Anonymous
11:13 PM Bug #3736: kernel build: failures starting in 3.8-rc1
The immediate kernel build problems have been solved by recreating the patch that is applied to the debian package bu... Anonymous
11:09 PM Bug #3736: kernel build: failures starting in 3.8-rc1
Branch: refs/heads/master
Home: https://github.com/ceph/autobuild-ceph
Commit: 0ff4f9a9ce82b37288b3bbcc5b5d65b5...
Anonymous
11:12 PM Revision efa595f5 (ceph): doc/rados/operations/authentication: update for cephx sig requirement o...
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:12 PM Revision 50db10dc (ceph): msg/Pipe: require MSG_AUTH feature on server if option is enabled
If we
negotiate cephx AND
are a server AND
cephx require signatures = true
then require the MSG_AUTH feature ...
Sage Weil
11:12 PM Revision 91a573a4 (ceph): mon: enforce 'cephx require signatures' during negotiation
If we are negotiating which auth protocol to use, and the client does not
support the MSG_AUTH feature, and the serve...
Sage Weil
11:11 PM Revision 4a49a09d (ceph): cephx: control signaures for service vs cluster
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
11:01 PM Revision c236a51a (ceph): osdmap: make replica separate in default crush map configurable
Add 'osd crush chooseleaf type' option to control what the default
CRUSH rule separates replicas across. Default to ...
Sage Weil
10:54 PM Bug #3768 (Resolved): perl is required for logrotate, we need to include Perl as a dependency
Branch: refs/heads/master
Home: https://github.com/ceph/ceph
Commit: bebdc70b4254a78d9fe86af9c645e828fd11e2b2
...
Anonymous
10:16 PM Documentation #3831 (In Progress): ceph osd crush set command needs correction in the doc
John Wilkins
10:14 PM CephFS Feature #1236: libceph: set layout via virtual xattrs (libceph/cfuse)
Sage Weil
10:02 PM CephFS Feature #3857: mds: enforce unique mds names in mdsmap
see wip-mds-names Sage Weil
09:36 PM CephFS Feature #3857 (Resolved): mds: enforce unique mds names in mdsmap
Currently mds's are uniquely identified by their addr (i.e., a unique instance of the process). The name is useful on... Sage Weil
08:27 PM Revision cd09be6a (ceph): ceph: pass ceph.conf to osdmaptool
This ensure it sees the chooseleaf option and generates the proper
CRUSH rules.
Sage Weil
06:37 PM rbd Bug #3413 (Resolved): rbd bench-write fails with assert when rbd caching turned on
commit:d81ac8418f9e6bbc9adcc69b2e7cb98dd4db6abb Josh Durgin
01:39 PM rbd Bug #3413 (Fix Under Review): rbd bench-write fails with assert when rbd caching turned on
branch wip-rbd-bench-write Josh Durgin
06:11 PM Revision c6f8010b (ceph): mon: Monitor: drop messages from old timecheck epochs
We were asserting when the message's timecheck epoch (which is mapped to
the election epoch) was older than the curre...
Joao Eduardo Luis
06:08 PM Revision 81e8bb55 (ceph): osdmaptool: more fix cli test
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit b0162fab3d927544885f2b9609b9ab3dc4aaff74)
Sage Weil
06:08 PM Revision 2b5b2657 (ceph): osdmaptool: fix cli test
Signed-off-by: Sage Weil <sage@inktank.com>
(cherry picked from commit 5bd8765c918174aea606069124e43c480c809943)
Sage Weil
06:08 PM Revision f739d123 (ceph): osdmaptool: allow user to specify pool for test-map-object
Fixes: #3820
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Gregory Farnum <greg@in...
Samuel Just
06:07 PM Revision 00759ee0 (ceph): rados.cc: fix rmomapkey usage: val not needed
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <samuel.just@inktank.com>
(cherry pic...
David Zafman
06:07 PM Revision 06b3270f (ceph): librados.hpp: fix omap_get_vals and omap_get_keys comments
We list keys greater than start_after.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <...
Samuel Just
06:07 PM Revision 75072965 (ceph): rados.cc: use omap_get_vals_by_keys in getomapval
Fixes: #3811
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
(...
Samuel Just
06:07 PM Revision a3c2980f (ceph): rados.cc: fix listomapvals usage: key,val are not needed
Fixes: #3812
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
(...
Samuel Just
06:00 PM rgw Feature #3856 (Resolved): rgw: list buckets S3 api should be paginated
The S3 api (unlike swift) does not define marker, max when listing buckets (probably due to the fact that max buckets... Yehuda Sadeh
05:25 PM Bug #3836: osd: common/Mutex.cc: 94: FAILED assert(r == 0) in PG::start_flush()
... Sage Weil
08:52 AM Bug #3836 (Resolved): osd: common/Mutex.cc: 94: FAILED assert(r == 0) in PG::start_flush()
... Sage Weil
04:55 PM Bug #3279: mon/caps: cap comparison in get-or-create is based on a string literal
This effects the chef mon recipe. I am able to correct this error by joining lines 96-99.
[Thu, 17 Jan 2013 16:5...
Kraig Amador
04:44 PM Feature #3850: Add json output for ceph pg dump and ceph osd tree
'pg dump' and 'osd dump' both have 'json' support since argonaut, but argonaut does not support outputting json on 'o... Joao Eduardo Luis
03:20 PM Feature #3850: Add json output for ceph pg dump and ceph osd tree
It already exists for pg dump and osd dump too. osd tree was recent though, maybe it's not in the version he's using? Josh Durgin
03:02 PM Feature #3850 (Closed): Add json output for ceph pg dump and ceph osd tree
Kyle Bader has requested json output for the following commands:
ceph pg dump
ceph osd tree
Sage Comment:
th...
Ian Colle
04:32 PM Feature #3855 (Resolved): Making Scrubs Nicer
As requested from DHO:
Currently scrubs are not very nice, Sage referred to these issues and it would be nice if t...
JuanJose Galvez
04:26 PM Bug #3854 (Resolved): mon: clock skew tests failing on master
... Sage Weil
04:21 PM Feature #3853 (Resolved): qa: include iogen in qa suite
Sage Weil
04:10 PM Bug #3827 (Resolved): crushtool --test: claims to want -o, really wants --output-csv or --show-*
commit:60db6e3e394df1e4110eefa5951657b648b02006
Dan Mick
04:10 PM RADOS Bug #3834 (Resolved): crushtool really really hates \r
commit:e776b63dd5c540a6f49b03b67e72a1f4636a74fd Dan Mick
11:06 AM RADOS Bug #3834: crushtool really really hates \r
Well isspace() would catch newline too, which I think we don't want, so it'd be iswhite(c) && c != '\n', which I'm no... Dan Mick
04:06 PM devops Bug #3852 (Resolved): chef recipes don't try to start OSDs
I wasn't aware the chef recipes were this incomplete, but it appears as though, unless
you're running Crowbar, osd.r...
Dan Mick
04:05 PM devops Bug #3851 (Resolved): chef recipes don't enable upstart
Since upstart management of daemons now explicitly looks for an upstart tag file, Chef
doesn't start the monitors co...
Dan Mick
03:17 PM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
I presume we're planning to backport this to bobtail after it passes some nights of testing? Maybe we should leave th... Greg Farnum
03:03 PM Bug #3785 (Resolved): ceph: default crush rule does not suit multi-OSD deployments
commit:f358cb1d2b0a3a78bf59c4fd085906fcb5541bbe Sage Weil
02:58 PM Feature #3849 (Resolved): Track slow PGs and times OSDs marked down
Kyle Bader:
"Over the weekend of 01/02/13 we encountered an issue that we had not yet
encountered. One of our cephs...
Ian Colle
02:54 PM Feature #3848 (Resolved): osd: gracefully handle cluster network heartbeat failure
From Kyle Bader
"Back in October we had a switch failure on our cluster (backend) network.
This was not noticed b...
Ian Colle
02:24 PM rbd Bug #3847 (Resolved): rbd: figure out correct byte order for watch version
In the process of refactoring rbd code that builds up osd
operations I noticed that for NOTIFY_ACK and WATCH operati...
Alex Elder
01:40 PM Documentation #3846 (Resolved): Debian install has incorrect gitbuilder URL

From http://ceph.com/docs/master/install/debian/ :...
Anonymous
12:32 PM rbd Feature #1491 (Rejected): qemu: make qemu-img convert fast
Yehuda Sadeh
12:28 PM CephFS Bug #3832 (Fix Under Review): client: does not observe O_SYNC
Implemented in wip-3832. Needs review. Sam Lang
12:17 PM CephFS Bug #3845: mds: standby_for_rank not getting cleared on takeover
I dont' think it matters. It's is a fixed lifecycle from standby -> active -> dead, so the leftover standby_ just te... Sage Weil
12:13 PM CephFS Bug #3845: mds: standby_for_rank not getting cleared on takeover
This is a monitor thing; the MDS is only involved in relaying the config setting over on boot-up. Greg Farnum
11:38 AM CephFS Bug #3845 (Closed): mds: standby_for_rank not getting cleared on takeover
This is the mdsmap after mds.a was active and given rank 0, then killed, and another mds (mds.b-s-r0) that had standb... Sam Lang
11:34 AM CephFS Feature #3730: Support replication factor in Hadoop
Sage Weil wrote:
> If there are more such cases, that is a separate bug!
It was a bug I had introduced in wip-cli...
Noah Watkins
09:51 AM CephFS Feature #3730: Support replication factor in Hadoop
Noah Watkins wrote:
> In Client, osdmap is protected by client_lock? If so, new version of branch isn't broken..
...
Sage Weil
08:55 AM CephFS Feature #3730: Support replication factor in Hadoop
In Client, osdmap is protected by client_lock? If so, new version of branch isn't broken.. Noah Watkins
10:45 AM Subtask #3844 (Rejected): osd: move info and log into leveldb
Samuel Just
10:45 AM Subtask #3843 (Rejected): osd: move purged_snaps out of info
the purged_snaps set is really a property of the local pg instance rather than a global property and does not get upd... Samuel Just
10:42 AM Subtask #3842 (Rejected): osd: create tool to extract pg info and pg log from filestore
Once these are moved into leveldb, it will be much more difficult to manually extract these structures. Samuel Just
10:41 AM Feature #3841 (Rejected): osd: avoid seeks for log and info writes on client writes
Probable approach is to move log and info into leveldb. Samuel Just
10:38 AM Subtask #3840 (Resolved): osd: ack push after apply+commit
This will prevent the primary from shoving another push before the first has completed. Alternately, make the number... Samuel Just
10:28 AM Documentation #3839 (Resolved): SSD crushmap example will not compile
The SSD CRUSH map example (http://ceph.com/docs/master/rados/operations/crush-map/#placing-different-pools-on-differe... Alexandre Marangone
10:24 AM CephFS Bug #1435: mds: loss of layout policies upon mds restart
wip-mds-layout2
needs to be rebased reviewed and tested!
Sage Weil
10:13 AM Bug #3835 (Resolved): mon: timecheck: hits FAILED assert(m->epoch == timecheck_epoch) when monito...
pushed to master, commit:c6f8010b1c8e4d54f9fb24b2e4e25ff8a2bde778 Joao Eduardo Luis
09:34 AM Bug #3835 (Fix Under Review): mon: timecheck: hits FAILED assert(m->epoch == timecheck_epoch) whe...
Ian Colle
08:51 AM Bug #3835: mon: timecheck: hits FAILED assert(m->epoch == timecheck_epoch) when monitors are seve...
This issue is fixed on wip-3835, commit:785a2bc3e9271607b1ddf25390056e9dd9c72b21 Joao Eduardo Luis
07:47 AM Bug #3835 (Resolved): mon: timecheck: hits FAILED assert(m->epoch == timecheck_epoch) when monito...
The leader schedules a new 'ping' to the monitors in the quorum as soon as the pings are all sent.
This allows for...
Joao Eduardo Luis
10:04 AM Bug #3820: osdmaptool - user cannot specify pool
85eb8e382a26dfc53df36ae1a473185608b282aa Samuel Just
09:58 AM Bug #3816 (Resolved): osd/OSD.cc: 3318: FAILED assert(osd_lock.is_locked())
Sage Weil
09:50 AM rbd Feature #3838 (New): krbd: use common functions for striping calculations
With the STRIPINGV2 feature bit, format 2 striping has the same parameters as cephfs striping. Re-work the rbd object... Josh Durgin
09:29 AM Linux kernel client Feature #3837 (Resolved): krbd: support format 2 striping
Format 2 images with the STRIPINGV2 feature bit set (created with rbd create --stripe-count X --stripe-unit Y --order... Josh Durgin
09:12 AM rbd Feature #3754: krbd: use new request tracking code for notify ack
Yay! Sage Weil
04:52 AM rbd Feature #3754: krbd: use new request tracking code for notify ack
Yeehah! All tests passed, including the previously-failing
blogbench.sh, fsstress, and two passes through xfstests.
Alex Elder
09:11 AM Bug #2843: filestore: replay failure on xfs
The post-v0.50 version of this bug was just fixed, commit:66eb93b83648b4561b77ee6aab5b484e6dba4771, which is backport... Sage Weil
02:38 AM Bug #2843: filestore: replay failure on xfs
Hi,
We have exactly the same problem on 1 of our osd (bobtail 0.56.1).
[[https://gist.github.com/4555135]]
Wha...
Guilhem Lettron
09:08 AM CephFS Bug #3261 (Rejected): mds crashes in EMetaBlob::replay
Understood. I'm sorry we weren't able to dig in when it happened. When do you get around to retesting we should be ... Sage Weil
02:09 AM CephFS Bug #3261: mds crashes in EMetaBlob::replay
should i test the same btrfs volume with a new ceph? if so i might get to it in the next month. please close with ins... Tobias Florek
05:19 AM Revision b0162fab (ceph): osdmaptool: more fix cli test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:10 AM Revision 5bd8765c (ceph): osdmaptool: fix cli test
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
05:01 AM Revision 98a76312 (ceph): osd: leave osd_lock locked in shutdown()
No callers expect the lock to be dropped.
Fixes: #3816
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
04:48 AM Revision 72db1a59 (ceph): When running teuthology with targets provisionned on OpenStack and kvm,...
Signed-off-by: Loic Dachary <loic@dachary.org> Loïc Dachary
02:04 AM Revision faa62fa8 (ceph): radosgw: increate nofile ulimit in upstart
The default ulimit for open file descriptors per process is 1024,
far too few for radosgw if you have lots of OSDs an...
Kyle Bader
12:59 AM Revision df399da1 (ceph): rgw: copy object should not copy source acls
Fixes: #3802
Backport: argonaut, bobtail
When using the S3 api and x-amz-metadata-directive is
set to COPY we used t...
Yehuda Sadeh
12:25 AM Revision 19ee2311 (ceph): ceph: adjust crush tunables via 'ceph osd crush tunables <profile>'
Make it easy to adjust crush tunables. Create profiles:
legacy: the legacy values
argonaut: the argonaut defaults...
Sage Weil
12:19 AM Revision 85eb8e38 (ceph): osdmaptool: allow user to specify pool for test-map-object
Fixes: #3820
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Gregory Farnum <greg@in...
Samuel Just

01/16/2013

11:52 PM Revision 7b6fe032 (ceph): Merge branch 'wip_snap_scrub'
Reviewed-by: Sage Weil <sage@inktank.com> Samuel Just
11:40 PM Revision 0946a78c (ceph): fix mon clock queue test syntax
Sage Weil
11:30 PM Revision 20b27a1c (ceph): rgw: copy object should not copy source acls
Fixes: #3802
Backport: argonaut, bobtail
When using the S3 api and x-amz-metadata-directive is
set to COPY we used t...
Yehuda Sadeh
11:22 PM Revision 37dbf7d9 (ceph): rgw: copy object should not copy source acls
Fixes: #3802
Backport: argonaut, bobtail
When using the S3 api and x-amz-metadata-directive is
set to COPY we used t...
Yehuda Sadeh
10:42 PM Revision b8568747 (ceph): osd_types: add nlink and snapcolls fields to ScrubMap::object
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:42 PM Revision 57352351 (ceph): ReplicatedPG/PG: check snap collections during _scan_list
During _scan_list check the snapcollections corresponding to the
object_info attr on the object. Report inconsistenc...
Samuel Just
10:42 PM Revision e65ea70e (ceph): ReplicatedPG: compare nlinks to snapcolls
nlinks gives us the number of hardlinks to the object.
nlinks should be 1 + snapcolls.size(). This will allow
us to ...
Samuel Just
10:42 PM Revision 665577a8 (ceph): osd/ReplicatedPG: validate ino when scrubbing snap collections
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:42 PM Revision 381e2587 (ceph): osd/PG: fix osd id in error message on snap collection errors
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
10:42 PM Revision 70c35120 (ceph): ReplicatedPG: ignore snap link info in scrub if nlinks==0
links==0 implies that the replica did not sent snap link information.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Samuel Just
10:42 PM Revision 39bc6549 (ceph): PG: move auth replica selection to helper in scrub
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
10:42 PM Revision 9e44fca1 (ceph): ReplicatedPG: correctly handle new snap collections on replica
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
10:35 PM Revision 88956e31 (ceph): ReplicatedPG: make_snap_collection when moving snap link in snap_trimmer
Backport: bobtail
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Samuel Just
10:33 PM Revision 3f0ad497 (ceph): librados.hpp: fix omap_get_vals and omap_get_keys comments
We list keys greater than start_after.
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <...
Samuel Just
10:33 PM Revision 625c3cb9 (ceph): rados.cc: fix rmomapkey usage: val not needed
Signed-off-by: David Zafman <david.zafman@inktank.com>
Reviewed-by: Samuel Just <samuel.just@inktank.com>
David Zafman
10:33 PM Revision 44c45e52 (ceph): rados.cc: fix listomapvals usage: key,val are not needed
Fixes: #3812
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just
10:33 PM Revision cb5e2be4 (ceph): rados.cc: use omap_get_vals_by_keys in getomapval
Fixes: #3811
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just
09:57 PM Revision 3c67ee36 (ceph): rbd: add test for formatted output from rbd cli
Josh Durgin
09:41 PM RADOS Bug #3834: crushtool really really hates \r
Ha! Sorry about htat. Maybe iswhite() (or wahtever that helper is) would be best here? Sage Weil
09:36 PM RADOS Bug #3834 (Resolved): crushtool really really hates \r
Spent a long time trying to figure out why a crush map wouldn't compile; finally got it to no differences at all, eve... Dan Mick
09:29 PM Revision 333cc0d5 (ceph): Merge branch 'wip-rbd-formatted-output'
Reviewed-by: Dan Mick <dan.mick@inktank.com>
Conflicts:
src/rbd.cc
src/test/cli/rbd/help.t
Josh Durgin
09:23 PM rbd Feature #3754: krbd: use new request tracking code for notify ack
OK, that quick fix wasn't enough.
I had a spinlock protecting the check for something being
complete. But that w...
Alex Elder
08:13 PM rbd Feature #3754: krbd: use new request tracking code for notify ack
Well that's unfortunate. I hit the same problem. I'll
need to take a closer look I guess.
Alex Elder
07:39 PM rbd Feature #3754: krbd: use new request tracking code for notify ack
Seems to be working better. It may end up being an
atomic rather than protecting with a spinlock, but
either way, ...
Alex Elder
03:15 PM rbd Feature #3754: krbd: use new request tracking code for notify ack
I've pretty much implemented this feature but having done
this I'm looking at a crash that happened with this code
...
Alex Elder
09:17 PM Revision b59c27dd (ceph): Merge branch 'master' into wip-scrub
Sage Weil
09:15 PM Revision fb4bb5d7 (ceph): osd: better error message for request on pool that dne
If the request is sent when the pool didn't even exist, say so. This
would have made #3734 a bit easier to track dow...
Sage Weil
09:14 PM Revision 9a1f5742 (ceph): osd: drop newlines from event descriptions
These produce extra newlines in the log.
Signed-off-by: Sage Weil <sage@inktank.com>
Reviewed-by: Samuel Just <sam.j...
Sage Weil
09:14 PM Revision 6934ac3f (ceph): rbd: move Formatter construction to main
Each method that uses a formatter is doing the same thing.
Simplify by constructing and handling errors only once.
Al...
Josh Durgin
09:14 PM Revision 8fea6dee (ceph): rbd: add --pretty-format option
This is the same option the rados and radosgw-admin tool use for more
human-readable json/xml.
Signed-off-by: Josh D...
Josh Durgin
09:14 PM Revision 4e5a07bc (ceph): XMLFormatter: fix pretty printing
It used the wrong indentation level and did not add a newline after
closing a section. dump_stream() did not indent a...
Josh Durgin
09:14 PM Revision d7cdcc0e (ceph): rbd: regenerate man page and cli test
Signed-off-by: Josh Durgin <josh.durgin@inktank.com> Josh Durgin
09:14 PM Revision f6dabc83 (ceph): rbd: always output result for formatted output
When there's nothing, return an empty array.
This way scripts don't have to special case this.
Signed-off-by: Josh D...
Josh Durgin
09:14 PM Revision 0efb9c51 (ceph): test: add cram integration test for formatted output
This can be used with the new teuthology cram task.
Signed-off-by: Josh Durgin <josh.durgin@inktank.com>
Josh Durgin
09:14 PM Revision 84c5d857 (ceph): rbd: support plain/json/xml output formatting
This patch renames the --format option to --image-format, for
specifying the RBD image format, and uses --format to s...
Stratos Psomadakis
09:14 PM Revision 98487b56 (ceph): rbd: fix long lines
Several >80 characters have crept in recently.
The older ones generally don't have very useful history,
so I'm not wo...
Josh Durgin
07:21 PM Revision a586966a (ceph): osd: fix rescrub after repair
We were rescrubbing if INCONSISTENT is set, but that is now persistent.
Add a new scrub_after_recovery flag that is r...
Sage Weil
07:21 PM Revision 8e33a8b9 (ceph): mon: note scrub errors in health summary
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
07:17 PM Revision 476eb24b (ceph): Merge branch 'wip-rpm-update'
Merges work around for odd AS_IF behaviour in configure.ac. Gary Lowell
06:37 PM rgw Bug #3813: radosgw doesn't have a logrotate script
Let's go with /var/log/radosgw and a separate logrotate script. Simpler! Sage Weil
09:06 AM rgw Bug #3813: radosgw doesn't have a logrotate script
Given that radosgw gets installed without ceph, it seems like teh viable optoins are putting the logrotate cofnig in ... Sage Weil
04:14 AM rgw Bug #3813: radosgw doesn't have a logrotate script
Note that the official docs suggest to put "log file = /var/log/ceph/radosgw.log" too. If "ceph" isn't installed, thi... Faidon Liambotis
04:02 AM rgw Bug #3813 (Resolved): radosgw doesn't have a logrotate script
Currently there's no logrotate configuration for radosgw at all. Even if one sets "log file" to /var/log/ceph/somethi... Faidon Liambotis
06:35 PM Feature #3833 (Resolved): osd: improve recovery throttling
Sage Weil
06:24 PM Bug #3810 (Need More Info): btrfs corrupts file size on 3.7
I need a dump of the xattrs on the d0c18e1d/605.00000000/head//1 object in pg 1.1d on osd 7 and osd 0 Samuel Just
05:59 PM CephFS Bug #3832 (Resolved): client: does not observe O_SYNC
if the file was opened with O_SYNC we need to flush the io on every write call. Sage Weil
05:49 PM Bug #3795 (Resolved): loadgen task gets into msgr loop
Sage Weil
05:44 PM rgw Feature #3207 (Resolved): qa: swift functional tests in nightly
Yehuda Sadeh
05:41 PM Revision c1a86ab1 (ceph): configure.ac: fix problem with --enable-cephfs-java
The AS_IF used to cover java related checks via --enable-cephfs-java
didn't work correctly. Use a plain 'if/fi' inste...
Danny Al-Gaaf
05:34 PM CephFS Feature #3730: Support replication factor in Hadoop
Oh right, libcephfs is not built on top of librados. Never mind, that's a whole different discussion we start occasio... Greg Farnum
05:15 PM CephFS Feature #3730: Support replication factor in Hadoop
I don't think libcephfs will give up an instance of the rados client, if that's what you mean by grant access to rado... Noah Watkins
04:33 PM CephFS Feature #3730: Support replication factor in Hadoop
Sorry to back this up a little, but I can't recall — does using libcephfs automatically grant a user access to the RA... Greg Farnum
04:30 PM CephFS Feature #3730: Support replication factor in Hadoop
This interface update is up for review in wip-client-pool-api Noah Watkins
09:52 AM CephFS Feature #3730: Support replication factor in Hadoop
From stand-up, stick with int64_t for userspace, and enforce 32-bit range. Noah Watkins
09:43 AM CephFS Feature #3730: Support replication factor in Hadoop
The move from int32 -> int64 was misguided, and incomplete. At this point it's not really worth the effort to move a... Sage Weil
07:31 AM CephFS Feature #3730: Support replication factor in Hadoop
It looks like in OSDMap there is some mixed usage of int64 and int for pool id, too. In Client::_create pool id is e... Noah Watkins
06:40 AM CephFS Feature #3730: Support replication factor in Hadoop
Can we change the type in libcephfs to uint64? We're the only ones calling ceph_get_file_pool() right now as far as ... Sam Lang
05:33 PM Bug #3820 (Resolved): osdmaptool - user cannot specify pool
Samuel Just
02:24 PM Bug #3820 (Resolved): osdmaptool - user cannot specify pool
Samuel Just
05:23 PM Documentation #3831 (Resolved): ceph osd crush set command needs correction in the doc
ceph osd crush set command has different parameters in different places.
http://ceph.com/docs/master/rados/operat...
Tamilarasi muthamizhan
05:21 PM rgw Bug #3802 (Resolved): x-amz-acl header ignored on copy operation
Fixed, commit:ccfefe3097a51b49885f2ed5d9334e85b497d963. Fix was pushed to both argonaut and bobtail branches. Yehuda Sadeh
11:17 AM rgw Bug #3802: x-amz-acl header ignored on copy operation
ok, affects both argonaut and bobtail. Actual bug is when copying object, if x-amz-metadata-directive is set to COPY ... Yehuda Sadeh
10:01 AM rgw Bug #3802: x-amz-acl header ignored on copy operation
On what version? Yehuda Sadeh
05:16 PM RADOS Documentation #3830 (Closed): crush-map.rst: chooseleaf doesn't include 'firstn|indep', and 'aggr...
1) I think chooseleaf should also include [firstn|indep] like choose does.
2) I'm not certain I understand just wh...
Dan Mick
05:15 PM Bug #3829 (Can't reproduce): new osd added to the cluster is not receiving data
ceph version: 0.56.1 (e4a541624df62ef353e754391cbbb707f54b16f7)
1. Initially , had a cluster[burnupi21,burnupi22,b...
Tamilarasi muthamizhan
04:12 PM CephFS Bug #3828 (Rejected): seeing error: fault, server, going to standby whenever I run a ceph-syn loa...
This is showing up on your MDS, about 15 minutes after a client completes accesses, right? This is associated with th... Greg Farnum
04:01 PM CephFS Bug #3828 (Rejected): seeing error: fault, server, going to standby whenever I run a ceph-syn loa...
while validating bug 520, i saw an interesting error. it may be a red herring, as I am seeing no problem with the wr... Anonymous
03:47 PM CephFS Bug #520 (Closed): mds: change ifile state mix->sync on (many) lookups?
3 Node Cluster:
ceph version 0.56.1 (e4a541624df62ef353e754391cbbb707f54b16f7)
# cat /etc/ceph/ceph.conf
[global]...
Anonymous
02:51 PM CephFS Bug #520: mds: change ifile state mix->sync on (many) lookups?
csyn is now called ceph-syn
and --debug-ms 1 to see those messages go by!
Sage Weil
03:43 PM Revision 1d50affc (ceph): mds: fix usage typo for ceph-mds
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
03:26 PM CephFS Bug #3261: mds crashes in EMetaBlob::replay
This looks like a problem with what's in the journal, but soo much MDS code has changed since then that I don't think... Sage Weil
03:24 PM CephFS Bug #1760 (Resolved): multiple_rsync workunit cannot remove non-empty directory intermittently
this also looks like the tmap problem, commit:e52ebacb73747ef642aabdb3cc3cb2a328687a4c and preceeding 4 commits. Sage Weil
03:23 PM CephFS Bug #2380 (Rejected): kclient: aufs over a cephfs mount fails with Stale NFS file handle
this is a generic problem with lookup by ino, see #3541 and other features Sage Weil
03:23 PM CephFS Bug #2092 (Can't reproduce): BUG at fs/ceph/caps.c:999
commit:561cf283173360c39db19dc735da4a319be68ff6 fixes the multi-mds case. we haven't seen this again for single-mds..... Sage Weil
03:21 PM Bug #3827 (Resolved): crushtool --test: claims to want -o, really wants --output-csv or --show-*
The error message is wrong, apparently, for crushtool's test mode; it looks like it wants either
--output-csv (in wh...
Dan Mick
03:11 PM CephFS Feature #3826 (Resolved): uclient: Be more aggressive about checking for pools we can't write to
Right now the client will happily buffer up writes to a pool that it can't actually write to. #2753 is going to make ... Greg Farnum
03:06 PM CephFS Bug #3746 (Rejected): kclient mmap doesn't zero past EOF
Run against bad code. Greg Farnum
03:03 PM CephFS Bug #2444 (Can't reproduce): null pointer deference in ceph_d_prune inside kvm
Sage Weil
03:00 PM CephFS Bug #2071 (Can't reproduce): kclient: pjd mkfifo failures
Sage Weil
02:59 PM CephFS Bug #1770 (Can't reproduce): directory nonexistent on kernel_untar_build.sh
Sage Weil
02:58 PM CephFS Bug #1749 (Can't reproduce): nonexistent directory in kclient_workunit_kernel_untar_build
Sage Weil
02:55 PM CephFS Bug #1318 (Resolved): directories disappear across multiple rsyncs
commit:e52ebacb73747ef642aabdb3cc3cb2a328687a4c and 4 preceeding patches fix up the TMAP bug that is the likely cause... Sage Weil
02:55 PM CephFS Bug #1511: fsstress failure with 3 active mds
Sam thinks this works now! Adding to QA suite. Greg Farnum
02:50 PM CephFS Bug #3625 (Resolved): client: EEXIST error on multiple clients to create
commit:b4d3bd06d4083d780755f6ef506df1643932fa2f Sage Weil
02:49 PM CephFS Bug #3625: client: EEXIST error on multiple clients to create
Maybe you already handled this? Greg Farnum
02:11 PM CephFS Bug #3625 (Fix Under Review): client: EEXIST error on multiple clients to create
Sam Lang
06:16 AM CephFS Bug #3625: client: EEXIST error on multiple clients to create
The kernel side has been reviewed and tested, but needs to be merged. The fuse side has been tested, but I think it ... Sam Lang
02:48 PM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
we should return an error code on fsync().. that is the quick fix.
a more polite feature will be opened to return ...
Sage Weil
09:19 AM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
This is clearly a bug, bureaucracy or not. It should not be a feature. We can do new development to fix a bug. If you... Ian Colle
02:47 PM Bug #3812 (Resolved): rados.cc listomapvals usage is wrong, <key> <val> are ignored and not needed
Samuel Just
02:47 PM Bug #3811 (Resolved): rados.cc getomapval implementation is broken, should use omap_get_vals_by_keys
Samuel Just
02:46 PM CephFS Bug #3544: ./configure checks CFLAGS for jni.h if --with-hadoop is specified but also needs to ch...
I think this can be closed. There is a bunch of autoconf changes for Java that have or will be merged. Noah Watkins
02:41 PM CephFS Bug #3544: ./configure checks CFLAGS for jni.h if --with-hadoop is specified but also needs to ch...
I just did a ./configure and using CPPFLAGS to indicate where the jni headers were and that worked just fine. Using C... Anonymous
02:45 PM CephFS Bug #3254: mds: Replica inode's parent snaprealms are not open
Multi-mds, currently low priority. Greg Farnum
02:44 PM CephFS Bug #3637 (In Progress): client: not issuing caps for with clients doing shared writes
Sage Weil
02:43 PM CephFS Bug #3637 (Fix Under Review): client: not issuing caps for with clients doing shared writes
Sage Weil
02:40 PM CephFS Bug #3498 (Resolved): mds: mds assert failure during untar_kernel
this was a msgr bug, long since fixed. commit:36c0fd220ef02b1ffd7a3ae0d98e0fdec6b55a5b or thereabouts Sage Weil
02:39 PM CephFS Bug #1666: hadoop: time-related meta-data problems
http://www.mail-archive.com/ceph-devel@vger.kernel.org/msg10334.html
Also wip-mtime-incr in the ceph repo.
Sam Lang
02:38 PM CephFS Bug #2218: CephFS "mismatch between child accounted_rstats and my rstats!"
Greg Farnum
02:32 PM CephFS Feature #3821 (New): qa: run backuppc as part of qa suite
Sage Weil
02:32 PM CephFS Bug #2494 (Can't reproduce): mds: Cannot remove directory despite it being empty.
The dupe inode suggests this is the problem fixed by Yan's tmap fixes. Greg Farnum
02:29 PM CephFS Bug #2019 (Can't reproduce): mds: CInode::filelock stuck in sync->mix
Presumably we'll see this again, but it hasn't turned up in our testing lately and we need more info to debug it. Greg Farnum
02:27 PM CephFS Bug #1811 (Duplicate): 2 pjd chown tests failed on cfuse
Ian Colle
02:22 PM CephFS Bug #1537 (Resolved): cmds 100% when copying lots of files, mds_cache_size and mds_bal_frag
This is an optimization issue, which we'll get to! Sage Weil
02:22 PM Bug #3816: osd/OSD.cc: 3318: FAILED assert(osd_lock.is_locked())
Interesting, but where did this actually get from?
And why didn't it get triggered when I started the OSDs again? ...
Wido den Hollander
01:08 PM Bug #3816 (Fix Under Review): osd/OSD.cc: 3318: FAILED assert(osd_lock.is_locked())
-5678> 2013-01-15 17:18:24.509093 7f5a10cec700 1 accepter.accepter.rebind avoid 6812
-5677> 2013-01-15 17:18:24.5...
Sage Weil
12:43 PM Bug #3816: osd/OSD.cc: 3318: FAILED assert(osd_lock.is_locked())
Like requested on the mailinglist I'm attaching the logfiles from osd.0 to osd.3
There is indeed a osd_map logline...
Wido den Hollander
09:59 AM Bug #3816 (Resolved): osd/OSD.cc: 3318: FAILED assert(osd_lock.is_locked())
... Sage Weil
02:21 PM CephFS Feature #3819 (Resolved): mds: re-add snaptests to qa suite
Sage Weil
02:02 PM CephFS Bug #3818 (Duplicate): kclient: fsx fails in mapread

With the fix in #3681, fsx fails in mapread with bad data. It looks like this is unrelated to the fix, and is a se...
Sam Lang
01:56 PM Bug #3786: osd: scrub is deferred indefinitely if load is high
Fixed by https://github.com/ceph/ceph/commit/299548024acbf8123a4e488424c06e16365fba5a Ian Colle
01:38 PM Bug #3786 (Resolved): osd: scrub is deferred indefinitely if load is high
Sage Weil
01:38 PM Bug #3774 (Resolved): osd: 'ceph osd scrub' and 'ceph pg scrub' are poorly scheduled
Sage Weil
11:38 AM rbd Feature #3817 (Resolved): librbd: make cache write-through until a flush is encountered
Writeback caching is unsafe if higher layers don't send flushes. qemu can be accidentally misconfigured to not send f... Josh Durgin
11:09 AM CephFS Feature #3543 (In Progress): mds: new encoding
Oh, this has been in progress all week. Greg Farnum
10:35 AM CephFS Bug #3773 (Can't reproduce): mds crashed at LogEvent::decode
I have been trying to reproduce this but have not hit it yet.
will reopen the bug, when needed.
Tamilarasi muthamizhan
10:34 AM Bug #3801 (New): Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAILED assert(...
Ian Colle
10:28 AM Bug #3801: Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAILED assert(0 == "...
Sage Weil wrote:
> The osd.40 error means the fs returned EIO on a read operation. Check yoru kern.org.. there is p...
Justin Lott
09:39 AM Bug #3801 (Need More Info): Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAI...
The osd.40 error means the fs returned EIO on a read operation. Check yoru kern.org.. there is probably a bad disk, ... Sage Weil
09:41 AM Feature #3815 (Duplicate): osd: move pg_info_t back into the xattr; avoid writing pginfo file whe...
see wip-pginfo for a hacky prototype.
did some testing, and it looks good:...
Sage Weil
09:39 AM Linux kernel client Bug #3800 (Won't Fix): libceph: check compatibility between ceph modules
Sage Weil
07:03 AM Feature #3805: log: detect dup messages
The one that comes to mind is "no heartbeat from osd.foo since timestamp bar" messages. We could try to identify the... Sam Lang
06:43 AM Revision 2dc2b480 (ceph): mds: use #defines for bits per cap
Hard-coding 0xff in SimpleLock.h is too far away from where we add new cap
bits.
Signed-off-by: Sage Weil <sage@inkt...
Sage Weil
06:04 AM CephFS Bug #3601: client: With multiple clients, file remove doesn't free up space
Yeah its that the lru doesn't have a timeout.
The mds could send an "enable timeout" message to clients once it se...
Sam Lang
03:27 AM Revision 63e33c8a (ceph): osd: send forced scrub/repair through scrub scheduling
This marks a PG for immediate scrub or repair. Adjust the sched_scrub()
code so that we handle these PGs even when s...
Sage Weil
03:26 AM Revision 27ad74b9 (ceph): osd: use helpers to queue a PG in the scrub LRU
Move the duplicated reach into info.history.last_scrub_stamp into a helper
so we can control when we queue the PG for...
Sage Weil
03:25 AM Revision f8a649c0 (ceph): osd/ReplicatedPG: validate ino when scrubbing snap collections
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:25 AM Revision 8fb04813 (ceph): ReplicatedPG: compare nlinks to snapcolls
nlinks gives us the number of hardlinks to the object.
nlinks should be 1 + snapcolls.size(). This will allow
us to ...
Samuel Just
03:24 AM Revision 4affecee (ceph): ReplicatedPG/PG: check snap collections during _scan_list
During _scan_list check the snapcollections corresponding to the
object_info attr on the object. Report inconsistenc...
Samuel Just
03:21 AM Revision 40e0f2db (ceph): byteorder: fix gcc 4.7 warnings
./include/encoding.h: In function 'void encode(int64_t, ceph::bufferlist&, uint64_t)':
./include/encoding.h:101:1: wa...
Sage Weil
03:21 AM Revision dde83262 (ceph): osd_types: add nlink and snapcolls fields to ScrubMap::object
Signed-off-by: Samuel Just <sam.just@inktank.com> Samuel Just
03:21 AM Revision f969f6b3 (ceph): osd_types: bring ScrubMap::object up to the 0.56.1 encoding
We need to introduce some new fields here, so to maintain compatibility
we'll need to first bring the 48.* series up ...
Samuel Just
03:21 AM Revision b6561a2f (ceph): osd: make missing head non-fatal during scrub
If we encounter a scrub without a preceeding head, warn instead of
crashing. Note that this is still something we ca...
Sage Weil
02:00 AM Revision d882d053 (ceph): ReplicatedPG: fix snapdir trimming
The previous logic was both complicated and not correct. Consequently,
we have been tending to drop snapcollection l...
Samuel Just
02:00 AM Revision 015a454a (ceph): osdmap: spread replicas across hosts with default crush map
This is more often the case than not, and we don't have a good way to
magically know what size of cluster the user wi...
Sage Weil
02:00 AM Revision 55b7dd32 (ceph): mon: OSDMonitor: don't output to stdout in plain text if json is specified
Fixes: #3748
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
(che...
Joao Eduardo Luis
02:00 AM Revision 898a4b19 (ceph): Revert "osdmap: spread replicas across hosts with default crush map"
This reverts commit 503917f0049d297218b1247dc0793980c39195b3.
This breaks vstart and teuthology configs. A better f...
Sage Weil
02:00 AM Revision 3293b31b (ceph): OSD: only trim up to the oldest map still in use by a pg
map_cache.cached_lb() provides us with a lower bound across
all pgs for in-use osdmaps. We cannot trim past this sin...
Samuel Just

01/15/2013

10:07 PM Revision c8a9a9a8 (ceph): Add cram task
This runs cram tests, which are an easy way to test output
stays consistent. We already use cram for basic cli tests ...
Josh Durgin
09:39 PM Bug #3811 (Fix Under Review): rados.cc getomapval implementation is broken, should use omap_get_v...
Samuel Just
09:21 PM Bug #3811 (Resolved): rados.cc getomapval implementation is broken, should use omap_get_vals_by_keys
Samuel Just
09:38 PM Bug #3812 (Fix Under Review): rados.cc listomapvals usage is wrong, <key> <val> are ignored and n...
Samuel Just
09:22 PM Bug #3812 (Resolved): rados.cc listomapvals usage is wrong, <key> <val> are ignored and not needed
Samuel Just
08:53 PM CephFS Feature #3728 (Resolved): mds: draft design for lookup by ino
Sage Weil
08:41 PM Revision cf149c8c (ceph): Merge branch 'wip-rpm-update'
Clean-up the handling of ceph java bindings in the rpm specfile and
configure.ac.
Gary Lowell
08:38 PM CephFS Feature #3730: Support replication factor in Hadoop
pool ids are currently exposed via libcephfs from ceph_file_layout, which uses a 32bit integer for pool id. However, ... Noah Watkins
08:34 PM CephFS Feature #3730: Support replication factor in Hadoop
Someone could toss a 'ceph osd pool set size' Hadoop's way, so a static mapping between pg pool size and pool name co... Noah Watkins
07:51 PM rbd Feature #3754: krbd: use new request tracking code for notify ack
I'm not sure yet whether the problem has to do with this
or whether it's in the existing "new request" code. But
I...
Alex Elder
06:23 PM Documentation #3808: Block device quick start page need update
Fixed description formatting. Also, 3784 is in master now (e94b06a19218decaf7d2d7b009bd862040f20285) Dan Mick
04:46 PM Documentation #3808: Block device quick start page need update
The current writeup also assumes that the mount is local to the cluster so it hides (for the beginner) important deta... Ken Franklin
03:38 PM Documentation #3808: Block device quick start page need update
-c and --secret aren't needed if you're using the default ceph.conf and your keyring can be found based on your ceph.... Josh Durgin
03:30 PM Documentation #3808 (Resolved): Block device quick start page need update
The instructions don't match well with the bobtail release.
- should include a note that ceph-common needs to be ins...
Ken Franklin
06:21 PM Feature #3805: log: detect dup messages
I tend to think there aren't very many dups we could usefully compress. It's pretty easy to add a one-string buffer ... Dan Mick
02:25 PM Feature #3805: log: detect dup messages
What kind of dups are we trying to detect?
This sounds to me like a wishlist item that requires much more work to...
Greg Farnum
02:17 PM Feature #3805 (New): log: detect dup messages
If a log message comes through and is a dup of the previous, increment a counter or something and only log it once wi... Sage Weil
05:35 PM CephFS Bug #3254: mds: Replica inode's parent snaprealms are not open
No. So far I'm focus on stabilize basic fs function for multiple MDS setup, completely ignore snapshot. Zheng Yan
03:28 PM CephFS Bug #3254: mds: Replica inode's parent snaprealms are not open
Hmm, did this get fixed by some of Zheng's later patches? I remember things about snaprealms and migration... Greg Farnum
05:33 PM Bug #3810 (Resolved): btrfs corrupts file size on 3.7
After creating a new ceph cluster pg's become inconsistent after using the qemu client. Logs indicate that the prima... Mike Lowe
04:54 PM Bug #3809 (Won't Fix): crush compiler errors are not helpful
Small, or large, errors in the CRUSH input are apparently all treated the same by crushtool -c:
error: parse error a...
Dan Mick
04:44 PM CephFS Feature #3289: ceph-fuse: somehow exert pressure on the VFS to remove dentries from the cache
#3575 should be kept in mind while doing this/instead of this — there's a forget_multi as well. Greg Farnum
04:44 PM CephFS Bug #3601 (New): client: With multiple clients, file remove doesn't free up space
Whoops, didn't mean to change that status. Greg Farnum
04:43 PM CephFS Bug #3601 (Duplicate): client: With multiple clients, file remove doesn't free up space
The LRU actually already exists; check out Client::lru. (Unless I'm misunderstanding something?) So we might want to ... Greg Farnum
04:37 PM CephFS Bug #925: mds: update replica snaprealm on rename
De-prioritizing multi-MDS issues... Greg Farnum
04:34 PM CephFS Bug #1117: mds: rename rollback broken on slaves during replay
De-prioritizing multi-mds issues for now. Greg Farnum
04:27 PM CephFS Bug #1435: mds: loss of layout policies upon mds restart
I'm guessing we want to move this up the queue; will discuss in bug scrub tomorrow! Greg Farnum
04:23 PM CephFS Bug #1511: fsstress failure with 3 active mds
De-prioritizing multi-mds failures at this time. Greg Farnum
04:23 PM CephFS Bug #1535: concurrent creating and removing directories crashes cmds
De-prioritizing multi-MDS bugs at this time. Greg Farnum
03:51 PM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
Fair enough, but if I can just make a suggestion, perhaps you might want to explain these procedures somewhere in the... Florian Haas
03:45 PM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
I agree it's a bug, but given the procedures we have now (ack! changing procedures coming alert!) I don't think we wa... Greg Farnum
03:43 PM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
No, please. A write pretending to succeed while actually not writing data _is_ a bug. The filesystem _not lying to it... Florian Haas
03:33 PM CephFS Bug #2753: Writes to mounted Ceph FS fail silently if client has no write capability on data pool
This is a great suggestion but falls into feature rather than bug-fix category. My initial thought is keeping a list ... Greg Farnum
03:42 PM CephFS Bug #1675 (Can't reproduce): mds: failed rstat assert
The logs are long gone. This will presumably pop up again; it's a pretty common failure mode, but there's nothing in ... Greg Farnum
03:38 PM CephFS Bug #1938: mds: snaptest-2 doesn't pass with 3 MDS system
De-prioritizing all multi-MDS bugs for now. Greg Farnum
03:27 PM CephFS Bug #3267: Multiple active MDSes stall when listing freshly created files
Currently de-prioritizing multi-MDS bugs. Greg Farnum
03:23 PM Bug #3537: Logs can run root out of space and crash ceph cluster (need more aggressive log rotation)
Not an FS bug, and #3775 has a lot more conversation on this subject. Greg Farnum
03:22 PM Bug #3552: After ceph-deploy installation a reboot breaks OSDs
Whoops, not an FS bug!
I've put this in the main Ceph project for now, but it might also belong in devops. We need...
Greg Farnum
03:18 PM CephFS Bug #3625: client: EEXIST error on multiple clients to create
I know you guys did a couple rounds on this one, what's the status? Greg Farnum
02:39 PM Bug #3806: OSDs stuck in active+degraded after changing replication from 2 to 3
Yes, the question is why they're 'getting unlucky'. Josh Durgin
02:22 PM Bug #3806: OSDs stuck in active+degraded after changing replication from 2 to 3
Haven't looked into this, but my guess is a couple PGs are getting unlucky with their replica selection. I assume you... Greg Farnum
02:17 PM Bug #3806 (Won't Fix): OSDs stuck in active+degraded after changing replication from 2 to 3
Small 3 node cluster running 0.56.1-1~bpo60+1 on Debian/Squeeze, with "tuneables" enabled
I recently changed the r...
Ben Poliakoff
02:27 PM RADOS Feature #3807 (Resolved): crush: simple commands to create common rules
These should be in CrushWrapper or similar, and available via crushtool and via some 'ceph osd crush ...' commands.
...
Sage Weil
02:16 PM Feature #3775: log: stop logging in statfs reports usage above some threshold
I agree. If there are lots of log messages at the default levels, that is the problem. I don't think there is much ... Sage Weil
01:59 PM Feature #3775 (Need More Info): log: stop logging in statfs reports usage above some threshold
So I suggest we split this into two issues:
1) the documentation examples show an awfully-high logging value for s...
Dan Mick
12:03 PM Feature #3775: log: stop logging in statfs reports usage above some threshold
so, a couple ideas of what can be done.
if we do set size and frequency (or inform the user how to), then it could...
Anonymous
11:39 AM Feature #3775: log: stop logging in statfs reports usage above some threshold
So a couple of thoughts:
1) changing size in logrotate.conf doesn't help unless we also change frequency
2) with ...
Dan Mick
02:15 PM Documentation #3804 (Resolved): Logging section recommends fairly high levels, doesn't stress how...
3775 introduced the observation that logs can fill very quickly and bury a small root disk.
Our documentation could ...
Dan Mick
02:03 PM rbd Feature #3635: rbd cli: call "udevadm settle" after use of add/remove kernel interface
commit:15bb00cafc31305cacf3c4684a429c2c9ee6f804 in master
Dan Mick
02:03 PM rbd Feature #3635 (Resolved): rbd cli: call "udevadm settle" after use of add/remove kernel interface
Dan Mick
02:02 PM rbd Feature #3784: rbd: issue modprobe when rbd map is called
commit:e94b06a19218decaf7d2d7b009bd862040f20285 in master
Dan Mick
02:01 PM rbd Feature #3784 (Resolved): rbd: issue modprobe when rbd map is called
Dan Mick
01:47 PM Bug #3803 (Resolved): rados parsing error with hostnames in mon_host
nevermind.. this is fixed in v0.48.3argonaut too. Sage Weil
01:45 PM Bug #3803: rados parsing error with hostnames in mon_host
Responed to the upstraem bug. This is fixed in master and bobtail, but not backported to argonaut. Should we? Sage Weil
08:37 AM Bug #3803 (Resolved): rados parsing error with hostnames in mon_host
In /etc/ceph/ceph.conf, if I set hostnames in the mon_host variable and separate them with spaces, the parsing algori... Ian Colle
01:25 PM CephFS Bug #3637: client: not issuing caps for with clients doing shared writes
Sage has a different proposed fix than what's in the branch. Still needs to be tested. Sam Lang
12:50 PM CephFS Bug #3637: client: not issuing caps for with clients doing shared writes
I don't remember where this ended up. Was the proposed fix problematic, or did it never get looked at? Greg Farnum
01:16 PM Bug #3770: OSD crashes on boot
Yeah, I just pushed a work-around branch (which I haven't tested much, so ideally you would try it on a node you can ... Samuel Just
12:08 PM rbd Subtask #3741: krbd: rework request tracking code
I found the source of my trouble, and in the process understood
a little more about some subtlety in bio reference c...
Alex Elder
11:39 AM CephFS Bug #3718: multi-client dbench gets stuck over NFS exported cephfs
This apparently is only a problem under re-export, which I believe we are not focusing on right now. Greg Farnum
11:35 AM CephFS Bug #3553: MDS core dumped running 0.48.2argonaut
Given what we know so far (the Op got sent to the wrong OSD) this is a bug in the Objecter, not the MDS. Or possibly ... Greg Farnum
11:17 AM Bug #3771: ceph does not have startup scripts in Centos
Not an FS bug! :) Greg Farnum
10:17 AM Bug #3771 (In Progress): ceph does not have startup scripts in Centos
Anonymous
11:16 AM Bug #3768: perl is required for logrotate, we need to include Perl as a dependency
Whoops, this was never an FS bug. :) Greg Farnum
10:15 AM Bug #3768 (In Progress): perl is required for logrotate, we need to include Perl as a dependency
Anonymous
10:54 AM Bug #3747: PGs stuck in active+remapped
No I didn't, just the CRUSH rule. Faidon Liambotis
10:46 AM Bug #3747 (Need More Info): PGs stuck in active+remapped
Faidon: did you also change the replication level of pool 3 (.rgw.buckets) ? Samuel Just
10:18 AM Feature #3505 (In Progress): default to libnss
This may already have been done. Will double check. Anonymous
10:16 AM Feature #3733 (In Progress): osd: update leveldb submodule
Anonymous
10:10 AM Bug #3797 (Need More Info): osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest ...
Ian Colle
07:09 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Can you try reupgrading one of the nodes and start it with debug file store = 20? That will tell is what it is writing. Sage Weil
02:49 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
I just downgraded to 0.48.2argonaut and everything seems to be running normally again now:
Before downgrade:
ii ...
Corin Langosch
02:28 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
Here's the output of dstat http://pastie.org/5687470.text
I'm not sure why it is writing so much now, before the ...
Corin Langosch
02:17 AM Bug #3797: osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48.3argonaut
I just noticed the second osd is now consuming 100% cpu too. Before it was properly running for around 15 minutes. Gu... Corin Langosch
02:14 AM Bug #3797 (Duplicate): osd takes 100% cpu after upgrading from 0.48.2argonaut to the latest 0.48....
I just upgraded one of my production servers (2 osds) from 0.48.2argonaut to the latest 0.48.3argonaut and now of the... Corin Langosch
08:33 AM rgw Bug #3802 (Resolved): x-amz-acl header ignored on copy operation
When copying an object the x-amz-acl header is ignored. To replicate; copy a private object and send the 'x-amz-acl' ... JuanJose Galvez
07:43 AM Bug #3801 (Won't Fix): Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAILED a...
0.48.2argonaut
Relevant logs are attached. Core dumps are available if needed....
Justin Lott
07:25 AM Linux kernel client Bug #3800: libceph: check compatibility between ceph modules
You're right, as long as you are using matching
code it's fine.
If it occurred, it's a serious problem. It just
...
Alex Elder
07:17 AM Linux kernel client Bug #3800: libceph: check compatibility between ceph modules
Is this really a problem? It seems like this could only bite someone building mixed versions out of tree. Sage Weil
06:57 AM Linux kernel client Bug #3800 (Resolved): libceph: check compatibility between ceph modules
It's possible for semantic changes to occur in one of the
ceph modules (fs/ceph, net/libceph, or block/rbd) that is
...
Alex Elder
06:58 AM Linux kernel client Bug #3799: libceph/rbd: bio refs are messed up
Because this suggests a semantically-incompatible change
between modules, this should probably be completed first:
...
Alex Elder
06:56 AM Linux kernel client Bug #3799 (Resolved): libceph/rbd: bio refs are messed up
There is an ugly reference counting dance that occurs with bio
pointers in the kernel osd I/O path, and it needs to ...
Alex Elder
06:57 AM Linux kernel client Bug #3798: libceph/rbd: take reference to all bio's in list
The other bug related to this is:
http://tracker.newdream.net/issues/3799
Alex Elder
06:56 AM Linux kernel client Bug #3798 (Resolved): libceph/rbd: take reference to all bio's in list
In a separate bug ("libceph/rbd: bio refs are messed up") I
describe how reference counting of bio's interact betwee...
Alex Elder
03:20 AM Revision d56af797 (ceph): osd: note must_scrub* flags in PG operator<<
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
03:20 AM Revision 26a63df9 (ceph): osd: fix scrub scheduling for 0.0
The initial value for pair<utime_t,pg_t> can match pg 0.0, preventing it
from being manually scrubbed. Fix!
Signed-...
Sage Weil
03:20 AM Revision 2baf1253 (ceph): osd: based INCONSISTENT pg state on persistent scrub errors
This makes the state persistent across PG peering and OSD restarts.
This has the side-effect that, on recovery, we r...
Sage Weil
02:24 AM Revision 16d67c79 (ceph): osd/PG: remove useless osd_scrub_min_interval check
This was already a no-op: we don't call PG::scrub_sched() unless it has
been osd_scrub_max_interval seconds since we ...
Sage Weil
02:24 AM Revision 29954802 (ceph): osd: change scrub min/max thresholds
The previous 'osd scrub min interval' was mostly meaningless and useless.
Meanwhile, the 'osd scrub max interval' wou...
Sage Weil
02:24 AM Revision 6f6a4193 (ceph): osd: fix object_stat_sum_t dump signedness
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:24 AM Revision d7383284 (ceph): osd: add last_clean_scrub_stamp to pg_stat_t, pg_history_t
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:24 AM Revision 2475066c (ceph): osd: add num_scrub_errors to object_stat_t
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:24 AM Revision 389bed5d (ceph): osd: note last_clean_scrub_stamp, last_scrub_errors
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:24 AM Revision 796907e2 (ceph): osd/PG: move scrub schedule registration into a helper
Simplifies callers, and will let us easily modify the decision of when
to schedule the PG for scrub.
Signed-off-by: ...
Sage Weil
02:24 AM Revision 1441095d (ceph): osd/PG: introduce flags to indicate explicitly requested scrubs
Signed-off-by: Sage Weil <sage@inktank.com> Sage Weil
02:24 AM Revision 62ee6e09 (ceph): osd/PG: trigger scrub via scrub schedule, must_ flags
When a scrub is requested, flag it and move it to the front of the
scrub schedule instead of immediately queuing it. ...
Sage Weil
02:24 AM Revision a1481207 (ceph): osd: move scrub schedule random backoff to seperate helper
Separate this from the load check, which will soon vary dependon on the
PG.
Signed-off-by: Sage Weil <sage@inktank.com>
Sage Weil
12:25 AM Revision 123a2dc4 (ceph): rados: adjust socket injection rate down
See #3795. Sage Weil
12:14 AM Revision 71097b7b (ceph): Revert "task/kclient: chmod root to 1777."
This reverts commit f17847e537802671c6f90bd1a0cdaa0e9d1e6f7a. It had
a typo and we hopefully don't need it.
Signed-o...
Greg Farnum

01/14/2013

10:11 PM Revision be0c4b34 (ceph): ac_prog_javah.m4: Use AC_CANONICAL_TARGET instead of AC_CANONICAL_SYSTEM.
Gary Lowell
10:07 PM Bug #3748: ceph osd dump --format=json includes non-JSON line
oh *fine*. :) Dan Mick
10:04 PM Bug #3748: ceph osd dump --format=json includes non-JSON line
Funny you should mention it: that is step #1 (or maybe 2 or 3) for the management API work, IMHO. :) Sage Weil
09:41 PM Bug #3748: ceph osd dump --format=json includes non-JSON line
I sorta think we ought to clean up how the various output channels are used in this code in general. This fixes the ... Dan Mick
09:23 PM Revision e182c1fd (ceph): Merge branch 'wip-java-sync'
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Reviewed-by: Joe Buck <jbbuck@gmail.com>
Noah Watkins
09:11 PM Revision fb8a488e (ceph): java: remove create/release synchronization
The constructor calls create, and finalize() calls release. Since each
of these can only happen once (enforced by Jav...
Noah Watkins
09:11 PM Revision 2b9da45d (ceph): java: remove unnecessary synchronization
The body of ceph_unmount is a call to a synchronized method.
Signed-off-by: Noah Watkins <noahwatkins@gmail.com>
Noah Watkins
09:11 PM Revision 85c10357 (ceph): java: remove all intrinsic locks
Signed-off-by: Noah Watkins <noahwatkins@gmail.com> Noah Watkins
09:11 PM Revision 13cb196e (ceph): java: add fine grained synchronization
Adds r/w lock to protect against some races.
1. Mutual exclusion for mount/unmount prevents races between the two in...
Noah Watkins
08:02 PM rbd Subtask #3741: krbd: rework request tracking code
OK, I ran a test and got a crash. The bio built for
an object request gets handed off to an osd request.
I need to...
Alex Elder
07:32 PM rbd Subtask #3741: krbd: rework request tracking code
I spent the day trying to find the memory leak and finally
found it. The structure being leaked was a bio. It was
...
Alex Elder
06:48 AM rbd Subtask #3741: krbd: rework request tracking code
For some reason my tests started hanging on Friday when
I added memory debug code for catching leaks and reuses.
I ...
Alex Elder
07:49 PM CephFS Bug #3544: ./configure checks CFLAGS for jni.h if --with-hadoop is specified but also needs to ch...
Is this still an issue? Noah Watkins
04:54 PM Bug #3752: fsync-tester script need to be fixed to run in the nightlies
Josh just pinged me that there was a typo in the chmod patch, and nobody's noticed so apparently it still hasn't been... Greg Farnum
04:24 PM Bug #3795: loadgen task gets into msgr loop
I looked a bit more and I see some failures before that, and also some passes after, e.g. teuthology-2013-01-11_07:00... Sage Weil
11:35 AM Bug #3795: loadgen task gets into msgr loop
taking a look again at the nightly runs, looks like this issue has been happening on next branch from 01-01-2013 whic... Tamilarasi muthamizhan
08:13 AM Bug #3795: loadgen task gets into msgr loop
going to see if the recent msgr changes are to blame.. bisecting! Sage Weil
08:04 AM Bug #3795: loadgen task gets into msgr loop
This appears to be a simple cycle:
- objecter has lots of requests outstanding
- there is a fault (msgr failure i...
Sage Weil
03:37 PM Revision 017b6d63 (ceph): Revert "osdmap: spread replicas across hosts with default crush map"
This reverts commit 7ea5d84fa3d0ed3db61eea7eb9fa8dbee53244b6.
This breaks teuthology and vstart both in its current ...
Sage Weil
03:04 PM CephFS Documentation #3796 (Resolved): FUSE mount documentation needs some corrections for v0,56x
The FUSE instructions need to be updated for v0.56 and later
currently:
> http://ceph.com/docs/master/cephfs/fuse...
Anonymous
01:35 PM Bug #3772 (Can't reproduce): osd: osd_disk_threads = 5 seems to hang recovery
I also don't seem to be able to reproduce on bobtail, marking can't reproduce. Samuel Just
12:58 PM Bug #3772 (New): osd: osd_disk_threads = 5 seems to hang recovery
I don't seem to be able to reproduce this on master. Samuel Just
10:37 AM Bug #3772: osd: osd_disk_threads = 5 seems to hang recovery
didn't reproduce with simple test, trying something more complicated. (roles/8882.yaml + osd disk threads : 10, teste... Samuel Just
01:28 PM CephFS Feature #3749 (Resolved): Remove forced synchronization from Java bindings
Noah Watkins
12:57 PM Feature #3769 (Fix Under Review): osd: scrub should verify snap collection existence, membership
wip_snap_scrub Samuel Just
11:55 AM rbd Bug #2871 (Resolved): rbd export command hangs when trying to export an image of size 0 to a loca...
Not certain which recent fix resolved this, but it works now.
Dan Mick
11:32 AM rbd Bug #3585 (Closed): Image import via QEMU-IMG results in a corrupt rbd
Great, glad to hear it's fixed. Josh Durgin
11:09 AM rbd Bug #3427: krbd: unmap does not remove block device properly
Patch posted for review. I'm not sure I'll be able to test
the scenario very well but hopefully it can be seen by
...
Alex Elder
09:56 AM rbd Bug #3427: krbd: unmap does not remove block device properly
Implementing the change I described now. Alex Elder
11:01 AM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
for reference, ubuntu@teuthology:/a/teuthology-2013-01-10_07:00:03-regression-argonaut-master-basic/38145 Tamilarasi muthamizhan
10:50 AM Bug #2691: osd/ReplicatedPG.cc: 5888: FAILED assert(latest->is_update())
This has shown up once in argonaut, probably not worth backporting unless it becomes more of a problem? Samuel Just
09:42 AM Bug #3629 (Resolved): test_mon_workloadgen.cc: 766: FAILED assert(m->fsid == monc.get_fsid())
commit:3610e72e4f9117af712f34a2e12c5e9537a5746f Joao Eduardo Luis
07:00 AM CephFS Bug #2187: pjd chown/00.t failed test 97
Happened again on Friday. Time to add the delay injection to the nightlies?
2013-01-11T07:32:37.489 INFO:teutholo...
Sam Lang
06:52 AM Revision 92a9d9c2 (ceph): ceph.conf: separate replicas across osds
ceph.git master now separates across crush hosts without this setting.
For teuthology clusters, we don't want that (u...
Sage Weil
05:43 AM Bug #3770: OSD crashes on boot
So, my (very basic) understanding of this suggests that the fix is that the trim wouldn't happen in the first place.
...
Faidon Liambotis

01/13/2013

10:11 PM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
Nope.. which leads me to realize that that setting needs to go in teuthology's ceph.conf. Doing that now, and then I... Sage Weil
10:01 PM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
*sigh*
This also looks good to me, and I like it better (should have suggested this the first time around). But no...
Greg Farnum
10:05 PM Bug #3774 (Fix Under Review): osd: 'ceph osd scrub' and 'ceph pg scrub' are poorly scheduled
wip-scrub Sage Weil
10:05 PM Bug #3786 (Fix Under Review): osd: scrub is deferred indefinitely if load is high
wip-scrub Sage Weil
07:04 AM Revision 410906e0 (ceph): mon: OSDMonitor: don't output to stdout in plain text if json is specified
Fixes: #3748
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Joao Eduardo Luis

01/12/2013

11:05 PM Bug #3748 (Resolved): ceph osd dump --format=json includes non-JSON line
commit:410906e04936c935903526f26fb7db16c412a711 Sage Weil
11:03 PM Bug #3795 (Resolved): loadgen task gets into msgr loop
... Sage Weil
11:01 PM Bug #3785 (Fix Under Review): ceph: default crush rule does not suit multi-OSD deployments
der, broke vstart. can you review wip-3785? Sage Weil
08:01 AM CephFS Feature #3749: Remove forced synchronization from Java bindings
In libcephfs mount/unmount race against each other, and the test of the API (e.g. unmount racing against write). In C... Noah Watkins
01:10 AM Revision 7ea5d84f (ceph): osdmap: spread replicas across hosts with default crush map
This is more often the case than not, and we don't have a good way to
magically know what size of cluster the user wi...
Sage Weil
01:09 AM Revision 3610e72e (ceph): mon: OSDMonitor: only share osdmap with up OSDs
Try to share the map with a randomly picked OSD; if the picked monitor is
not 'up', then try to find the nearest 'up'...
Joao Eduardo Luis
12:25 AM Revision 1f721804 (ceph): rbd: Fix tabs
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick

01/11/2013

11:56 PM Revision 34138993 (ceph): doc: Updates to CRUSH paper.
fixes: 3329, 3707, 3711, 3389
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins
10:28 PM Revision 15bb00ca (ceph): rbd: call udevadm settle on map/unmap
When we map/unmap devices, udev gets called to manage device nodes;
this will allow the command to wait for those man...
Dan Mick
10:28 PM Revision e94b06a1 (ceph): rbd: make 'add' modprobe rbd so it has a chance of success
Check for existence of /sys/bus/rbd first to avoid unnecessary calls
Fixes: #3784
Signed-off-by: Dan Mick <dan.mick@...
Dan Mick
08:17 PM Revision 66eb93b8 (ceph): OSD: only trim up to the oldest map still in use by a pg
map_cache.cached_lb() provides us with a lower bound across
all pgs for in-use osdmaps. We cannot trim past this sin...
Samuel Just
08:15 PM Revision 8cf79f25 (ceph): OSD: check for empty command in do_command
Fixes: #3878
Signed-off-by: Samuel Just <sam.just@inktank.com>
Reviewed-by: David Zafman <david.zafman@inktank.com>
Samuel Just
08:09 PM Revision 3e147295 (ceph): Merge pull request #32 from imjustmatthew/imjustmatthew_docs
Correct typo in mon docs 'ceph.com' to 'ceph.conf' John Wilkins
07:59 PM Revision 0f161f1e (ceph): Correct typo in mon docs 'ceph.com' to 'ceph.conf'
Matthew Roy
06:49 PM Revision aeb02061 (ceph): qa/run_xfstests.sh: use cloned xfstests repository
Use our own copy of the xfstests repository rather than hitting
the upstream one repeatedly.
Signed-off-by: Alex Eld...
Alex Elder
06:15 PM Revision 8d0fa15e (ceph): mon: Monitor: only schedule a timecheck after election if we are not alone
Fixes: #3790
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Joao Eduardo Luis
05:51 PM Bug #3785 (Resolved): ceph: default crush rule does not suit multi-OSD deployments
Merged to master in commit:7ea5d84fa3d0ed3db61eea7eb9fa8dbee53244b6 and cherry-picked to bobtail in commit:503917f004... Greg Farnum
05:45 PM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
good question. let's start with bobtail. Sage Weil
05:39 PM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
Looks good to me. What branches do we want to cherry-pick it on. Greg Farnum
05:24 PM Bug #3785 (Fix Under Review): ceph: default crush rule does not suit multi-OSD deployments
wip-3785 Sage Weil
01:59 PM Bug #3785 (New): ceph: default crush rule does not suit multi-OSD deployments
dang! wrong bug. opening this one back up.
sorry all!
Anonymous
12:34 PM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
I think maybe Deb's comments and closure were meant for another bug (perhaps 3789?) Dan Mick
11:34 AM Bug #3785 (Won't Fix): ceph: default crush rule does not suit multi-OSD deployments
This comment should have been in bug 3789
caused by a lack of resources on the system.
have increased the memory fro...
Anonymous
11:32 AM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
This comment should have been in bug 3789
upping the memory on these VMs from 512M to 2G
since it appears it was a...
Anonymous
10:55 AM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
I agree with Ian, I have seen *very bad things* happen when crush choses two OSD on one host, rather than distribute... Anonymous
10:11 AM Bug #3785: ceph: default crush rule does not suit multi-OSD deployments
The issue here is that CRUSH maps which behave well on multi-host deployments behave quite poorly on one or two host ... Greg Farnum
05:46 PM Bug #3752: fsync-tester script need to be fixed to run in the nightlies
Yes, Greg. The test passed in the recent runs. Tamilarasi muthamizhan
05:34 PM Bug #3752 (Resolved): fsync-tester script need to be fixed to run in the nightlies
This appears to be passing now, right Tamil?
Since I'm not seeing anything else breaking I'm inclined to leave the...
Greg Farnum
04:25 PM Bug #3772 (In Progress): osd: osd_disk_threads = 5 seems to hang recovery
Samuel Just
03:53 PM Documentation #3330 (In Progress): doc: How to troubleshoot unbalanced CRUSH
John Wilkins
03:51 PM Documentation #3329 (In Progress): doc: What metrics should be used to set node weight
John Wilkins
02:45 PM CephFS Bug #3793: wrong size reported in some distributions/toolchains
That makes this sounds like a simple fix... we need to swap the frsize and bsize fields. Except that right now we ar... Sage Weil
02:39 PM CephFS Bug #3793: wrong size reported in some distributions/toolchains
I spent a bit of time with gregaf trying to find authoritative sources for what the different values denote. While `... David McBride
01:40 PM CephFS Bug #3793: wrong size reported in some distributions/toolchains
This coreutils commit may have useful data:
http://git.savannah.gnu.org/cgit/coreutils.git/commit/src?id=0863f018f0f...
Greg Farnum
01:38 PM CephFS Bug #3793 (Resolved): wrong size reported in some distributions/toolchains
In ceph_statfs we set f_bsize to be 1MB in order to report very large available spaces. However, nowadays it is appar... Greg Farnum
02:38 PM CephFS Feature #3749: Remove forced synchronization from Java bindings
This needs more thought than just removing synchronization. We'd like to be segfault free in Java, even though you co... Noah Watkins
02:26 PM Bug #3789: OSD core dump and down OSD on CentOS cluster
There is 'ceph health', and a nagios plugin that runs it. A similarly trivial plugin can probably be written for oth... Sage Weil
02:01 PM Bug #3789 (Won't Fix): OSD core dump and down OSD on CentOS cluster
dmesg shows it was a lack of resources.
upping the memory on these VMs from 512M to 2G
since it appears it ...
Anonymous
10:28 AM Bug #3789: OSD core dump and down OSD on CentOS cluster
Deb Barba wrote:
> all core files have similar backtrace.
> again, Sage, looks like you are right, low resources
>...
Anonymous
10:27 AM Bug #3789: OSD core dump and down OSD on CentOS cluster
all core files have similar backtrace.
again, Sage, looks like you are right, low resources
dmesg:
hrtimer: inte...
Anonymous
10:23 AM Bug #3789: OSD core dump and down OSD on CentOS cluster
looks from dmesg, you are right Sage, low on resources
centos1 core# gdb /usr/bin/ceph-osd core.0.26177
Core wa...
Anonymous
10:16 AM Bug #3789: OSD core dump and down OSD on CentOS cluster
backtrace of core.0.14401 from centos3:
Core was generated by `/usr/bin/ceph-osd -i 8 --pid-file /var/run/ceph/osd....
Anonymous
09:37 AM Bug #3789 (Need More Info): OSD core dump and down OSD on CentOS cluster
check dmesg, or VM responsiveness. this triggers when a call to sync(2) takes more than... 2 minutes? i forget how l... Sage Weil
09:13 AM Bug #3789 (Won't Fix): OSD core dump and down OSD on CentOS cluster
Running a CentOS VM cluster. Running v0.56.1
I had written a bit of data, and stopped writing about 4pm yesterday...
Anonymous
02:17 PM rbd Subtask #3741: krbd: rework request tracking code
Unfortunately my system crashed after an hour or so. The
crash was in the network driver, and a little analysis
su...
Alex Elder
10:45 AM rbd Subtask #3741: krbd: rework request tracking code
My full test run isn't complete but I seem to have resolved
whatever problem I was hitting yesterday. I have not ye...
Alex Elder
01:39 PM CephFS Bug #3794 (Resolved): uclient: reports sizes wrong in some cases
This is the counterpart to kernel bug #3793. See Client::statfs, in which we set f_bsize to 1MB but f_frsize to 4KB. ... Greg Farnum
12:22 PM Bug #3787 (Resolved): Ceph OSD crashes on ceph tell osd.x
8cf79f252a1bcea5713065390180a36f31d66dfd Samuel Just
11:12 AM Bug #3787 (Fix Under Review): Ceph OSD crashes on ceph tell osd.x
wip_3787 Samuel Just
09:33 AM Bug #3787: Ceph OSD crashes on ceph tell osd.x
verified this happens on master. should be an easy fix. thanks for the report! Sage Weil
12:17 AM Bug #3787 (Resolved): Ceph OSD crashes on ceph tell osd.x
I recently set up a small test cluster with 2 nodes to test the 0.48.3 -> 0.56.1 upgrade. After Upgrading one of the ... Seb Mel
12:22 PM Bug #3770 (Resolved): OSD crashes on boot
66eb93b83648b4561b77ee6aab5b484e6dba4771 Samuel Just
11:16 AM Bug #3770 (Fix Under Review): OSD crashes on boot
wip_3770 Samuel Just
11:03 AM Bug #3770: OSD crashes on boot
The fault is in OSD::handle_osd_map where we trim old maps. Prior to 0.50, the pgs would have processed up to the cu... Samuel Just
09:59 AM Bug #3770: OSD crashes on boot
I'm seeing this same assert failure when trying to startup 3 of my OSDs. Happy to provide feedback for the debugging ... Mike Dawson
09:43 AM Bug #3770: OSD crashes on boot
sjust said that we're done collecting information and that I could rm the pg directory/log/info, which I did. Unfortu... Faidon Liambotis
09:41 AM Bug #3770: OSD crashes on boot
Ian Colle
12:04 PM Bug #3788: debian source packages are missing
Gary Lowell wrote:
> It looks like the Sources file has been zero length in past releases as well. Still investigat...
Loïc Dachary
12:03 PM Bug #3788: debian source packages are missing
My favorite use case when source packages are available would be... Loïc Dachary
11:33 AM Bug #3788: debian source packages are missing
I think we should build source packages too (in addition to tarballs, etc.). Sage Weil
10:47 AM Bug #3788: debian source packages are missing
We are not currently building debian or rpm source packages. We do put out a source tarball corresponding to the rel... Anonymous
09:56 AM Bug #3788 (In Progress): debian source packages are missing
It looks like the Sources file has been zero length in past releases as well. Still investigating. Anonymous
02:20 AM Bug #3788: debian source packages are missing
Proposed fix at https://github.com/ceph/ceph-build/pull/1 Loïc Dachary
01:44 AM Bug #3788: debian source packages are missing
http://ceph.com/debian/conf/distributions is created from https://github.com/ceph/ceph-build/blob/master/gen_reprepro... Loïc Dachary
01:35 AM Bug #3788 (Resolved): debian source packages are missing
Following the instructions at http://ceph.com/docs/master/install/debian/ to add the ... Loïc Dachary
10:52 AM CephFS Bug #3773: mds crashed at LogEvent::decode
Sure Sage. I was running bonnie from client during upgrade.
I had debug ms=1 set, i will try to reproduce this with...
Tamilarasi muthamizhan
09:41 AM CephFS Bug #3773 (Need More Info): mds crashed at LogEvent::decode
Tamil, I wonder if you can try to reproduce this with mds logging turned up from teh start (debug mds = 20, debug ms ... Sage Weil
10:34 AM Messengers Bug #2569: msgr: connect_rank crash
yes, you are right, Greg. I just wanted to put a note of this somewhere, so chose to update the bug itself :) Tamilarasi muthamizhan
10:23 AM Bug #3748 (Fix Under Review): ceph osd dump --format=json includes non-JSON line
wip-3748 has a fix, commit:0edb53f02231fb83f33d3bc5f58b37b14cd5df82 Joao Eduardo Luis
10:20 AM Bug #3695 (Resolved): monitor crashed after an upgrade in Monitor::timecheck
Ian Colle
10:16 AM Bug #3790 (Resolved): Mon crash after update to ceph version 0.56-209-g310112f
looks good, merged into master. commit:8d0fa15e6aa3847e89de5d5adfca0a863e8da976 Sage Weil
10:06 AM Bug #3790: Mon crash after update to ceph version 0.56-209-g310112f
Had a redundant check on the previous commit; fixed and rebased it and the new commit can be found on wip-3790 commit... Joao Eduardo Luis
10:02 AM Bug #3790: Mon crash after update to ceph version 0.56-209-g310112f
This patch fixes it. Joao Eduardo Luis
09:31 AM Bug #3790 (In Progress): Mon crash after update to ceph version 0.56-209-g310112f
My fault. Forgot a check on win_election().
Any chance you can test 6104629d95207f3dfd3a744d81b011b6a714070e on wi...
Joao Eduardo Luis
09:18 AM Bug #3790: Mon crash after update to ceph version 0.56-209-g310112f
Previous installed version was .56-193. Ken Franklin
09:14 AM Bug #3790 (Resolved): Mon crash after update to ceph version 0.56-209-g310112f
I have a single node cluster on burnupi60 updated each morning to the latest Master branch. After the update this mo... Ken Franklin
09:16 AM Bug #3774 (In Progress): osd: 'ceph osd scrub' and 'ceph pg scrub' are poorly scheduled
Sage Weil
09:16 AM Bug #3774: osd: 'ceph osd scrub' and 'ceph pg scrub' are poorly scheduled
wip-scrub-sched for the argonaut version. should look very similar for master/bobtail. Sage Weil
02:05 AM Revision 310112f7 (ceph): Merge remote-tracking branch 'gh/wip-3633'
Reviewed-by: Sage Weil <sage@inktank.com> Sage Weil
02:04 AM Revision 9e4a3f03 (ceph): Merge remote-tracking branch 'gh/wip-3633'
Sage Weil
02:03 AM Revision 305cb54a (ceph): suites: rados: multimon: add mon clock skews task yaml files
Signed-off-by: Joao Eduardo Luis <jecluis@gmail.com> Joao Eduardo Luis
12:58 AM Revision 2fa5d23b (ceph): test: Hadoop cluster and task config.
Add a 3-node cluster specification and a
task for running wordcount with Hadoop on Ceph.
Signed-off-by: Joe Buck <jb...
Joe Buck
12:44 AM Revision aa40de90 (ceph): messages: add MTimeCheck
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inktank.com>
Joao Eduardo Luis
12:44 AM Revision 684d4ba2 (ceph): mon: Monitor: add timecheck infrastructure to detect clock skews
Fixes: #3633
Fixes: #3695
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Reviewed-by: Sage Weil <sage@inkt...
Joao Eduardo Luis
12:44 AM Revision ff1c254b (ceph): mon: Monitor: reduce indentation level; make code more readable
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
12:44 AM Revision 7a7fff57 (ceph): mon: Monitor: move a couple of if's together on handle_command()
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
12:44 AM Revision bc57c7a9 (ceph): mon: Monitor: use 'else if' on handle_command instead of bunches of 'if'
... when the options are mutually exclusive.
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com>
Joao Eduardo Luis
12:44 AM Revision 58e03ecb (ceph): mon: Monitor: unify 'ceph health' and 'ceph status'; add json output
Signed-off-by: Joao Eduardo Luis <joao.luis@inktank.com> Joao Eduardo Luis
12:03 AM Revision e6f284e9 (ceph): doc: Added -a option. Should work without from server, as described.
fixes: #3750
Signed-off-by: John Wilkins <john.wilkins@inktank.com>
John Wilkins

01/10/2013

11:59 PM Revision de6633f9 (ceph): doc: Normalized to term "drive" rather than disk. Changed "(Manual)" en...
Signed-off-by: John Wilkins <john.wilkins@inktank.com> John Wilkins
11:06 PM Revision 7a8ec194 (ceph): Merge branch 'next'
Samuel Just
09:54 PM Revision 988f3597 (ceph): rados: add truncate support
Signed-off-by: Samuel Just <sam.just@inktank.com>
Revewed-by: Greg Farnum <greg@inktank.com>
Samuel Just
09:04 PM Bug #3786 (Resolved): osd: scrub is deferred indefinitely if load is high
If the load is above the threshold, we will never scrub. For some environments, this is normal (e.g., mixed OSD and ... Sage Weil
08:23 PM rbd Bug #3585: Image import via QEMU-IMG results in a corrupt rbd
This seems to be fixed in QEMU 1.3.0 and Ceph 0.56.1
I've tried QED -> Raw -> Ceph -> Raw then QED -> Ceph -> Raw an...
Matt Anderson
07:56 PM Bug #3785 (Resolved): ceph: default crush rule does not suit multi-OSD deployments
Version: 0.48.2-0ubuntu2~cloud0
Our Ceph deployments typically involve multiple OSDs per host with no disk redunda...
Ian Colle
07:10 PM rbd Feature #3635 (In Progress): rbd cli: call "udevadm settle" after use of add/remove kernel interface
Dan Mick
07:10 PM Revision 44625d44 (ceph): config_opts.h: default osd_recovery_delay_start to 0
This setting was intended to prevent recovery from overwhelming peering traffic
by delaying the recovery_wq until osd...
Samuel Just
07:09 PM rbd Feature #3784 (In Progress): rbd: issue modprobe when rbd map is called
Dan Mick
06:04 PM rbd Feature #3784 (Resolved): rbd: issue modprobe when rbd map is called
rbd map will not work unless the rbd kernel module is loaded, and this must be done manually. Add code to rbd to cau... Dan Mick
07:02 PM Revision 830b8ffa (ceph): ReplicatedPG: fix snapdir trimming
The previous logic was both complicated and not correct. Consequently,
we have been tending to drop snapcollection l...
Samuel Just
06:34 PM Revision 0f42c373 (ceph): ReplicatedPG: fix snapdir trimming
The previous logic was both complicated and not correct. Consequently,
we have been tending to drop snapcollection l...
Samuel Just
06:24 PM Bug #3774: osd: 'ceph osd scrub' and 'ceph pg scrub' are poorly scheduled
Sage Weil
06:14 PM Revision 035caac5 (ceph): Revert "rgw: fix handler leak in handle_request"
This reverts commit eba314a811cd98a79f483dc7a9128fe76c722c78.
Signed-off-by: Yehuda Sadeh <yehuda@inktank.com>
Yehuda Sadeh
06:11 PM rgw Feature #3402 (Fix Under Review): rgw: improve tests for multipart upload
caleb miles
06:10 PM rgw Feature #3634 (Fix Under Review): rgw: improve teuthology radosgw-admin test
caleb miles
06:09 PM Bug #3633 (Resolved): mon: clock drift errors not reported by ceph status
commit:310112f702d14294e6ba48f8af41a306288cba65 Sage Weil
06:09 PM Revision eb997e25 (ceph): Merge pull request #31 from chrisglass/expose_cluster_stats_to_python
Added python wrapper to rados_cluster_stat Greg Farnum
05:59 PM rbd Bug #3518 (Can't reproduce): rbd import file --format 2 creates an image named '--format'
Dan Mick
05:59 PM rbd Bug #3518: rbd import file --format 2 creates an image named '--format'
It seems that this no longer happens as of e6f284e945f45e39c57921149d4551d9e78557a5,
so closing non-reproducible.
Dan Mick
05:06 PM CephFS Bug #3773: mds crashed at LogEvent::decode
Okay, I gathered up a core file, a high-debug MDS log, and the log with the bad event (and the bad event itself) in t... Greg Farnum
02:05 PM CephFS Bug #3773: mds crashed at LogEvent::decode
I'll at least start this off. Greg Farnum
04:54 PM Revision c8f3fd6e (ceph): marginal: Remove broken symlinks
Signed-off-by: Sam Lang <sam.lang@inktank.com> Sam Lang
04:47 PM Messengers Bug #2569: msgr: connect_rank crash
I believe this was caused by some issues which we decided not to backport the fixes for due to their size; Sage can c... Greg Farnum
04:43 PM Messengers Bug #2569: msgr: connect_rank crash
hit this on a mixed cluster running argonaut v0.48.3 and v0.56 [ ceph version 0.56-193-g00898c1]
monitors,mds,osds...
Tamilarasi muthamizhan
04:37 PM rbd Bug #3688 (Won't Fix): rbd allows image of size 0 to be created
I claim that zero-sized images are legal, if not particularly useful in that size...but one might well want to create... Dan Mick
04:15 PM Bug #3770: OSD crashes on boot
root@ms-be1003:/var/lib/ceph/osd/ceph-27# find current/meta/ | tee ~/ceph-osd.27.meta | wc -l
42992
Attached.
Faidon Liambotis
04:02 PM Bug #3770: OSD crashes on boot
root@ms-be1003:/var/lib/ceph/osd/ceph-27/current/4.f9_head# attr -lq $PWD | while read attr; do echo $attr; attr -q -... Faidon Liambotis
02:27 PM Bug #3770 (Need More Info): OSD crashes on boot
From the backtrace:
pgid = {m_pool = 4, m_seed = 249, m_preferred = -1}
Based on the info attr, we try to...
Samuel Just
04:04 PM Bug #3750 (Resolved): Possible Ceph 5-minute quick start guide typo
Documentation described making the call from the server console, which should work as described. Added -a so that it ... John Wilkins
03:52 PM Bug #3780 (Won't Fix): pg_num inappropriately low on new pools
Version: 0.48.2-0ubuntu2~cloud0
On a Ceph cluster with 18 OSDs, new object pools are being created with a pg_num o...
Ian Colle
03:08 PM rgw Bug #3778: document procedure for enabling subdomain S3 api calls
The documentation should note that the
@rgw dns name = {hostname}@
option must be set in the
@[client.radosgw.g...
caleb miles
11:13 AM rgw Bug #3778 (Resolved): document procedure for enabling subdomain S3 api calls
The process for setting up a server that handles subdomain API requests is not documented. If possible we should add ... caleb miles
03:07 PM Documentation #3711 (In Progress): crush-map.rst: choose firstn talks about "N", but does not cle...
John Wilkins
03:05 PM devops Documentation #2886 (In Progress): doc: crush location tricks, ceph.conf, automatic host=
John Wilkins
02:23 PM rbd Subtask #3741: krbd: rework request tracking code
I am leaving shortly for a few hours. In reviewing this
new code I find a few things that make it a little hard
ma...
Alex Elder
01:00 PM rbd Subtask #3741: krbd: rework request tracking code
I did some testing yesterday and found that I got I/O errors
while running xfstests. This was unexpected; I thought...
Alex Elder
01:43 PM Revision 797b3db3 (ceph): Added python wrapper to rados_cluster_stat
The new get_cluster_stats() method on the rados.Rados object calls
the rados_cluster_stat() function in the librados ...
Chris Glass
12:51 PM Bug #2533 (Duplicate): osd: watchers tracked by entity_name_t, not by cookie
Ian Colle
12:48 PM Feature #3769: osd: scrub should verify snap collection existence, membership
Written, just needs to be ported to Bobtail Ian Colle
09:40 AM Feature #3769 (In Progress): osd: scrub should verify snap collection existence, membership
Sage Weil
12:47 PM Bug #3736 (In Progress): kernel build: failures starting in 3.8-rc1
Ian Colle
12:02 PM Bug #3736: kernel build: failures starting in 3.8-rc1
The remaining issue is that the patch we apply to scripts/package/builddeb to build the perf tools is out of date. I... Anonymous
12:45 PM Bug #3702 (New): OSD SIGABRT during startup
Ian Colle
12:40 PM Bug #3617 (Resolved): Ceph doesn't support > 65536 PGs(?) and fails silently
Ian Colle
09:35 AM Bug #3617: Ceph doesn't support > 65536 PGs(?) and fails silently
How's the testing come along, Sage? Greg Farnum
12:39 PM Bug #3695: monitor crashed after an upgrade in Monitor::timecheck
Believed fixed by patch to 3633
684d4ba242b26828bd7927860226bfc8a0cfcc2b
Ian Colle
12:35 PM Bug #3650 (Can't reproduce): osd: crash in Reset state -> start_peering_interval -> on_change -> ...
Looked into the core dump, can't see how this happened. Samuel Just
12:30 PM Bug #3591 (Closed): auth: could not find secret_id=0
Ian Colle
12:30 PM Bug #3591 (Resolved): auth: could not find secret_id=0
Resolved by Sage's fix above. Ian Colle
12:29 PM Bug #3563 (Closed): osd crashed with error "auth: could not find secret_id=2"
Ian Colle
12:29 PM Bug #3563 (Resolved): osd crashed with error "auth: could not find secret_id=2"
Resolved by fix to 3591 Ian Colle
12:20 PM Bug #3467 (Closed): osd: bad state machine event in start_recoverY_ops
Ian Colle
12:20 PM Bug #3467 (Won't Fix): osd: bad state machine event in start_recoverY_ops
If encountered, restart OSD. Ian Colle
12:13 PM Bug #3300: ceph::buffer::end_of_buffer isn't caught
Josh - Is this just a case where the documentation needs to be updated? Ian Colle
11:46 AM Bug #3768: perl is required for logrotate, we need to include Perl as a dependency
The same issue exists with the debian packages. We have an explicit dependency on python, but not on perl. I don't ... Anonymous
10:55 AM Bug #3768: perl is required for logrotate, we need to include Perl as a dependency
Can we check to ensure perl is not used elsewhere?
Are there guidelines that are provided to the developers that spe...
Anonymous
10:06 AM Bug #3768: perl is required for logrotate, we need to include Perl as a dependency
I hate to see a dependency like perl get added for a oneliner perl regex. Is this the only place perl is used? Can ... Sam Lang
09:43 AM Bug #3768: perl is required for logrotate, we need to include Perl as a dependency
backport to bobtail Ian Colle
11:26 AM Tasks #3779 (Resolved): update osd config ref as appropriate
I'm not sure what our update policies on the docs are, but the defaults named in http://ceph.com/docs/master/rados/co... Greg Farnum
11:11 AM rgw Cleanup #3777 (Resolved): rgw: audit code for reading NULL env variables
Similar to the issue that triggered #3735 Yehuda Sadeh
10:25 AM Bug #3647 (Can't reproduce): forgot the auth options for Cephx and added them later: Get msg: 7f...
Sage Weil
10:19 AM rgw Bug #3735 (Closed): rgw: Crashes when using a fastCGI front end that doesn't set SCRIPT_URI
Ian Colle
10:19 AM rgw Bug #3735 (Resolved): rgw: Crashes when using a fastCGI front end that doesn't set SCRIPT_URI
Ian Colle
10:00 AM rgw Bug #3735: rgw: Crashes when using a fastCGI front end that doesn't set SCRIPT_URI
commit:e1da85f286838cdd3a6329840cec748c6a11fd26 Sage Weil
09:57 AM Bug #3747: PGs stuck in active+remapped
Sage Weil wrote:
> commit:f83fcf63a928fdb8ab4d604bdce596c0c4afd854
oops, wrong bug!
Sage Weil
09:45 AM Bug #3747 (Resolved): PGs stuck in active+remapped
commit:f83fcf63a928fdb8ab4d604bdce596c0c4afd854 Sage Weil
09:55 AM CephFS Feature #3621 (Closed): qa: add knfsd reexport tests to qa suite
Ian Colle
09:52 AM CephFS Feature #3621: qa: add knfsd reexport tests to qa suite
commit:aaa03bbcd2549a38f962a61fc63be16cca3a6d90 in teuthology.git Sage Weil
09:34 AM Bug #3776 (Resolved): Need doc describing how to alter our log rotation
If a user has a small to moderate size of root disk, they will probably have to modify the log rotation process for c... Anonymous
09:32 AM Bug #3661 (Resolved): mon: idle/empty osds marked down after 15 min
Sage Weil
08:34 AM Feature #3775: log: stop logging in statfs reports usage above some threshold
Sam,
That is a cool idea. I will open a doc bug for that. Providing instructions for those with smaller root dri...
Anonymous
06:32 AM Feature #3775: log: stop logging in statfs reports usage above some threshold
The easiest solution for this might be to adjust the default logrotate script (src/logrotate.conf) to use the size pa... Sam Lang
03:52 AM Revision 59aad347 (ceph): configure.ac: check for org.junit.rules.ExternalResource
Check for org.junit.rules.ExternalResource if build with
--enable-cephfs-java and --with-debug. Checking for junit4
i...
Danny Al-Gaaf
01:13 AM Revision 12af11a1 (ceph): src/java/Makefile.am: fix default java dir
Fix default javadir in src/java/Makefile.am to $(datadir)/java
since this is the common data dir for java files.
Sig...
Danny Al-Gaaf
01:13 AM Revision 9b167b46 (ceph): ceph.spec.in: fix handling of java files
Fix handling of JAVA (jar) files. Don't move the files around in the install
section since the related Makefile is fi...
Danny Al-Gaaf
01:13 AM Revision f027d025 (ceph): ceph.spec.in: rename libcephfs-java package to cephfs-java
Rename the libcephfs-java package to cephfs-java since the package
contains no (classic) library and RPMLINT complain...
Danny Al-Gaaf
01:13 AM Revision d8c4fc5e (ceph): ceph.spec.in: fix libcephfs-jni package name
Rename libcephfs-jni to libcephfs_jni1 to reflect the SO name/version of
the library and to prevent RPMLINT to compla...
Danny Al-Gaaf
01:13 AM Revision aedbb97f (ceph): configure.ac: remove AC_PROG_RANLIB
Remove already comment out AC_PROG_RANLIB to get rid of warning:
libtoolize: `AC_PROG_RANLIB' is rendered obsolete b...
Danny Al-Gaaf
01:13 AM Revision 61437ee2 (ceph): configure.ac: change junit4 handling
Change handling of --with-debug and junit4. Add a new conditional HAVE_JUNIT4
to be able to build ceph-test package a...
Danny Al-Gaaf
12:11 AM Revision 00898c18 (ceph): rbd: allow copy of zero-length images. Includes simple test.
Fixes: #3765
Signed-off-by: Dan Mick <dan.mick@inktank.com>
Dan Mick
12:10 AM Revision 1c3d6840 (ceph): doc/install/debian.rst: fix typo in link ref; broke doc build
Signed-off-by: Dan Mick <dan.mick@inktank.com> Dan Mick
 

Also available in: Atom