General

Profile

John Spray's activity

From 01/22/2018 to 02/20/2018

02/20/2018

01:30 PM mgr Bug #21225: ceph-mgr: dashboard and zabbix plugin report wrong values
This seems likely to be the same issue as http://tracker.ceph.com/issues/22142, the fix for which will be in 12.2.3 John Spray

02/19/2018

06:34 PM mgr Feature #23042 (Resolved): Prompt ServiceMap removal on clean shutdown
We should have MgrClient explicitly send a message on shutdown, so that DaemonServer can immediately remove it from S... John Spray
03:23 PM mgr Bug #23017 (Fix Under Review): mgr log spamming about down osds
This was also happening at startup even if no OSDs were down, so let's stop reporting it at error level.
https://git...
John Spray
03:21 PM Ceph Revision b6b2b07f (ceph): mgr: quieten logging on missing OSD stats
This is only cause for concern if the OSD is actually up,
and even then it can also happen at startup, so
we shouldn'...
John Spray

02/14/2018

03:19 PM Ceph Revision c15cf4d8 (ceph): mon: dump percent_used PGMap field as float
Formatting this as a string was awkward for anyone
consuming it from other code.
Signed-off-by: John Spray <john.spr...
John Spray

02/13/2018

11:12 AM mgr Bug #22947 (Duplicate): Prometheus exporter module fails due to incorrect PG_STATES
Thanks for the report: the fix for this will be part of the forthcoming 12.2.3 release. John Spray

02/12/2018

01:57 PM mgr Bug #22226: ceph zabbix plugin sends incorrect motinoring info to zabbix server
If it's correlated with a mon going down then I suspect this is the same underlying cause as http://tracker.ceph.com/... John Spray

02/06/2018

05:50 PM Ceph Revision f19f1e1c (ceph): mds: don't report repaired backtraces in damagetable
Fixes: http://tracker.ceph.com/issues/18743
Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from com...
John Spray
05:50 PM Ceph Revision 6e544ac9 (ceph): mds: flush after scrub repairs
Otherwise, if we restart the MDS right after the scrub,
then scrub again, it will see the same inconsistency when
it ...
John Spray
05:50 PM Ceph Revision 798cc37c (ceph): mds: clean up clog damage message when repairing
If we've repaired then we don't need to be at WRN
and we don't need to advise the user to look up
the damage detail.
...
John Spray
05:50 PM Ceph Revision 162e25ec (ceph): mds: cleaner scrub complete log msg
People often are not using a tag with their scrub,
let's not have the ugly "with tag ''" in that case.
Signed-off-by...
John Spray

02/05/2018

08:27 PM mgr Bug #21399 (Closed): ceph-mgr module(s) inaccessible after a reboot
There's no log to look at, and no further reports, so I'm going to close this.
The fix for the 'exe' thing went in...
John Spray
04:46 PM mgr Backport #22921 (In Progress): luminous: dashboard module: 404 for static resouces
https://github.com/ceph/ceph/pull/20325 John Spray
04:38 PM mgr Backport #22921 (Resolved): luminous: dashboard module: 404 for static resouces
https://github.com/ceph/ceph/pull/20325 John Spray
04:19 PM mgr Backport #22919 (Closed): luminous: mgr sends early beacon with no modules reported
Oops, created this backport ticket from the wrong original. Disregard! John Spray
03:12 PM mgr Backport #22919: luminous: mgr sends early beacon with no modules reported
https://github.com/ceph/ceph/pull/20321 John Spray
03:11 PM mgr Backport #22919 (Closed): luminous: mgr sends early beacon with no modules reported
Sends beacon before PyModuleRegistry::init has been called -- if admin quickly does a "mgr module enable" after daemo... John Spray
12:07 PM Dashboard Bug #22851 (Pending Backport): dashboard module: 404 for static resouces
John Spray
12:06 PM mgr Bug #22918 (Resolved): mgr sends early beacon with no modules reported
Sends beacon before PyModuleRegistry::init has been called -- if admin quickly does a "mgr module enable" after daemo... John Spray

02/01/2018

05:31 PM mgr Bug #22880: failed to load ceph_module with Python3
Kefu, do you know if a py3-compiled mgr has been seen to work? I've never tried it. John Spray
11:12 AM website Bug #22850: Luminous/Jewel docs missing table of contents
Confirmed, master has the TOC but luminous doesn't. John Spray
11:11 AM RADOS Bug #22842: (luminous) ceph-disk prepare of simple filestore failed with 'Unable to set partition...
I would suspect that something is strange about the disk (non-GPT partition table perhaps?), and you're getting less-... John Spray

01/31/2018

09:19 PM Ceph Revision d8f9aec7 (ceph): mon: tidy up mds_blacklist_interval
Remove the misleading reference to this from the client
eviction page, it was never the right option to mention
there...
John Spray
09:19 PM Ceph Revision 5dcd2a59 (ceph): mon: s/mds_blacklist_interval/mon_mds_blacklist_interval/
We can rename LEVEL_DEV options at will.
Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from commi...
John Spray
01:20 PM CephFS Bug #22839 (Rejected): MDSAuthCaps (unlike others) still require "allow" at start
This was changed for the OSD and mon caps, but the MDS caps were missed:
https://github.com/ceph/ceph/pull/15991/com...
John Spray
12:11 PM mgr Bug #21999 (Resolved): mgr tests don't indicate failure if exception thrown from serve()
John Spray
12:11 PM mgr Bug #21707 (Resolved): "osd status" command exception if OSD not in pgmap stats
John Spray
12:10 PM mgr Bug #21687 (Resolved): mgr: mark_down of osd without metadata is broken
John Spray
12:10 PM mgr Bug #20887 (Resolved): Services reported with blank hostname by mgr
John Spray
12:09 PM mgr Bug #22361 (Resolved): balancer crush-compat sends "foo" command
John Spray
12:09 PM mgr Backport #22455 (Resolved): luminous: balancer crush-compat sends "foo" command
John Spray
12:07 PM mgr Bug #22096 (Resolved): Authentication failed, did you specify a mgr ID with a valid keyring?
John Spray
12:07 PM mgr Backport #22811 (Resolved): luminous: Authentication failed, did you specify a mgr ID with a vali...
John Spray
12:03 PM RADOS Bug #22142 (Resolved): mon doesn't send health status after paxos service is inactive temporarily
John Spray
12:03 PM RADOS Backport #22421 (Resolved): mon doesn't send health status after paxos service is inactive tempor...
John Spray
11:59 AM mgr Bug #22429 (Resolved): mgr/balancer/upmap_max_iterations must be cast to integer
John Spray
11:59 AM mgr Backport #22453 (Resolved): luminous: mgr/balancer/upmap_max_iterations must be cast to integer
John Spray
11:58 AM mgr Bug #22470 (Resolved): KeyError: ('name',) in balancer rm
John Spray
11:58 AM mgr Backport #22496 (Resolved): luminous: KeyError: ('name',) in balancer rm
John Spray
11:49 AM mgr Bug #22313 (Resolved): Prometheus exporter can't get metrics after update to 12.2.2
John Spray
11:48 AM mgr Backport #22585 (Resolved): luminous: Prometheus exporter can't get metrics after update to 12.2.2
John Spray
11:47 AM mgr Fix #22718 (Resolved): mgr: prometheus: missed osd commit\apply latency metrics.
John Spray
11:47 AM mgr Backport #22760 (Resolved): luminous: mgr: prometheus: missed osd commit\apply latency metrics.
John Spray
11:46 AM mgr Bug #22286 (Resolved): Service daemons never recover from transient outage
John Spray
11:46 AM mgr Backport #22768 (Resolved): luminous: Service daemons never recover from transient outage
John Spray
11:43 AM Dashboard Bug #22557 (Resolved): Dashboard on backup MGRs always redirects to /, breaking reverse proxy sup...
John Spray
11:43 AM mgr Backport #22831 (Resolved): luminous: Dashboard on backup MGRs always redirects to /, breaking re...
John Spray
11:33 AM mgr Backport #22197 (Resolved): luminous: mgr: mark_down of osd without metadata is broken
John Spray
11:32 AM mgr Backport #22216 (Resolved): luminous: "osd status" command exception if OSD not in pgmap stats
John Spray
11:32 AM mgr Backport #21479 (Resolved): luminous: Services reported with blank hostname by mgr
John Spray
11:32 AM mgr Backport #22075 (Resolved): luminous: mgr tests don't indicate failure if exception thrown from s...
John Spray

01/30/2018

10:37 AM mgr Backport #22831 (Fix Under Review): luminous: Dashboard on backup MGRs always redirects to /, bre...
https://github.com/ceph/ceph/pull/20182 John Spray
10:35 AM mgr Backport #22831 (Resolved): luminous: Dashboard on backup MGRs always redirects to /, breaking re...
https://github.com/ceph/ceph/pull/20182 John Spray
10:33 AM mgr Backport #22576 (Resolved): luminous: zabbix throws exception
John Spray
10:33 AM mgr Bug #22514 (Resolved): zabbix throws exception
John Spray
10:28 AM mgr Bug #21981 (Resolved): mgr: dashboard plugin OSD daemons' table the Usage column's value is alway...
John Spray
10:28 AM mgr Backport #22023 (Resolved): luminous: mgr: dashboard plugin OSD daemons' table the Usage column's...
John Spray
10:27 AM mgr Bug #21599 (Resolved): List of filesystems does not get refreshed after a filesystem deletion
John Spray
10:26 AM mgr Backport #22030 (Resolved): luminous: List of filesystems does not get refreshed after a filesyst...
John Spray
10:26 AM mgr Bug #21253 (Resolved): Prometheus crash when update
John Spray
10:25 AM mgr Bug #20899 (Resolved): prometheus module generates invalid output when counter names contain non-...
John Spray
10:15 AM Ceph Bug #22784 (Fix Under Review): Key exported from auth keyring is not imported properly
John Spray
10:15 AM Ceph Bug #22828 (Fix Under Review): build Debian installation packages failure
John Spray

01/29/2018

03:20 PM CephFS Bug #22754: mon: removing tier from an EC base pool is forbidden, even if allow_ec_overwrites is set
As far as I'm aware, nobody has worked on it, so that would be a no. John Spray
02:34 PM mgr Feature #22748 (Fix Under Review): Add units to configuration settings
https://github.com/ceph/ceph/pull/20163 John Spray
02:34 PM mgr Feature #22747 (Fix Under Review): Add units to performance counters
https://github.com/ceph/ceph/pull/20152 John Spray
02:32 PM mgr Bug #22327: MGR dashboard doesn't update OSD's ceph version after updating from 12.2.1 to 12.2.2
I imagine this is probably and underlying ceph-mgr glitch rather than dashboard specific. I'll leave this ticket her... John Spray
02:29 PM Dashboard Bug #22557 (Pending Backport): Dashboard on backup MGRs always redirects to /, breaking reverse p...
John Spray
02:57 AM Ceph Revision 2852a5f5 (ceph): mon: fix mgr using auth_client_required policy
This caused mgr daemons to fail to authenticate
when auth_client_required was set to something
different to auth_clus...
John Spray
02:57 AM Ceph Revision f2a20277 (ceph): mgr: apply auth_service_required to client conns
Previously was using auth_cluster_required for all
connections, which meant that if someone had
disabled client cephx...
John Spray

01/26/2018

01:03 PM Ceph Revision 29b5611d (ceph): doc/dev: add a brief guide to serialization
Contributors encounter this pretty soon when trying
to add any feature to Ceph, so I'd like a central place
to point ...
John Spray

01/25/2018

02:55 PM mgr Backport #21656 (Closed): luminous: crash on DaemonPerfCounters::update
I think this was already covered by the backport commit 2fea47d9710ec1e388db044065bc42b8046dae22 that was done as par... John Spray
02:54 PM mgr Bug #21197 (Resolved): crash on DaemonPerfCounters::update
John Spray

01/24/2018

06:08 PM Ceph Revision 6eb5c636 (ceph): mon: expose can_run flag in `module ls` + mgr dump
Enable people to see can_run failures and the explanatory
messages (telling them about a missing dependency)
before t...
John Spray
06:08 PM Ceph Revision 191cce74 (ceph): doc: note new mgr module error codes
Signed-off-by: John Spray <john.spray@redhat.com> John Spray
06:08 PM Ceph Revision 9999ddf6 (ceph): mgr: refactor get_modules/list_modules
list_modules is really about searching for them
on disk, so it's now probe_modules and private.
Both methods now ret...
John Spray
06:08 PM Ceph Revision b3e6b8e1 (ceph): qa: always dump corrupt messages in rados/upgrade
Previously only a service with debug_ms>=1 would dump
corrupt messages: in an upgrade test we're *alway*
interested i...
John Spray
06:08 PM Ceph Revision 6a8da7ca (ceph): mgr: load all modules (not just active ones)
This is to enable us to learn more about the module
before it is enabled, such as whether its can_run method
return t...
John Spray
06:08 PM Ceph Revision 712ad57d (ceph): mgr: evaluate `can_run` method on modules
...and transmit the result to the monitor in
our beacon.
Fixes: http://tracker.ceph.com/issues/21502
Signed-off-by: ...
John Spray
06:08 PM Ceph Revision 834bc279 (ceph): mgr: load command definitions earlier
...and for all modules, not just the active ones.
This enables us to give better feedback to the user
when they try ...
John Spray
06:08 PM Ceph Revision f95b079c (ceph): qa/mgr: add test for command execution errors
Signed-off-by: John Spray <john.spray@redhat.com> John Spray
06:08 PM Ceph Revision d9a47181 (ceph): mgr: add health checks for failed modules
Signed-off-by: John Spray <john.spray@redhat.com> John Spray

01/23/2018

12:45 PM Ceph Revision 86ee30c3 (ceph): mgr: apply auth_service_required to client conns
Previously was using auth_cluster_required for all
connections, which meant that if someone had
disabled client cephx...
John Spray
12:31 PM mgr Bug #22096: Authentication failed, did you specify a mgr ID with a valid keyring?
Ah -- turns out the mgr code never thought about the possibility that people would be using a ceph CLI with cephx tur... John Spray
10:24 AM mgr Bug #22755 (Fix Under Review): cluster [ERR] Unhandled exception from module 'prometheus' while r...
Already fixed by https://github.com/ceph/ceph/pull/20047 which should merge soon. John Spray
10:23 AM Ceph Revision dd4f3225 (ceph): qa: add new prometheus test to rados/mgr suite
Added in https://github.com/ceph/ceph/pull/19744
Signed-off-by: John Spray <john.spray@redhat.com>
John Spray
10:23 AM Ceph Revision e2c68d5e (ceph): qa: assign prometheus ports during selftest
This was throwing IOError("Port 9283 not free on '::'",)
when trying to serve, since merging https://github.com/ceph/...
John Spray
10:17 AM Ceph Revision e1af6673 (ceph): mgr: remove a few junk lines
Signed-off-by: John Spray <john.spray@redhat.com> John Spray

01/22/2018

10:11 PM CephFS Bug #22754 (Resolved): mon: removing tier from an EC base pool is forbidden, even if allow_ec_ove...
OSDMonitor::_check_remove_tier needs to be made aware that this should be permitted if the base tier is suitable for ... John Spray
02:16 PM RADOS Bug #22746 (Fix Under Review): osd/common: ceph-osd process is terminated by the logratote task
John Spray
02:14 PM CephFS Bug #22741: osdc: "FAILED assert(bh->last_write_tid > tid)" in powercycle-wip-yuri-master-1.19.18...
Assigned to CephFS because it's crashing in the ceph-fuse process (in the absence of a better home for ObjectCacher i... John Spray
02:11 PM Ceph Bug #22735 (Closed): about mon_max_pg_per_osd
The factor osd_max_pg_per_osd_hard_ratio (default 2) is applied to the PG count limit before actually preventing PG c... John Spray
01:44 PM mgr Bug #22096 (Fix Under Review): Authentication failed, did you specify a mgr ID with a valid keyring?
https://github.com/ceph/ceph/pull/20048 John Spray
01:23 PM mgr Bug #22096: Authentication failed, did you specify a mgr ID with a valid keyring?
Looking a few posts back, I see this:... John Spray
01:43 PM Ceph Revision 1e06fe00 (ceph): mon: fix mgr using auth_client_required policy
This caused mgr daemons to fail to authenticate
when auth_client_required was set to something
different to auth_clus...
John Spray
01:38 PM mgr Bug #22682: "PyModuleRegistry.cc: 139: FAILED assert(map.epoch > 0)" in rados-luminous-distro-bas...
As with #22266, this is a rebuild-mondb run so presumably the solution is a test modification to avoid leaving the mg... John Spray
12:14 PM mgr Feature #22748 (Resolved): Add units to configuration settings
Same as http://tracker.ceph.com/issues/22747 but for configuration settings instead of performance counters.
Confi...
John Spray
12:10 PM mgr Feature #22747 (Resolved): Add units to performance counters
Currently, our performance counters don't indicate whether they are just generic counts (e.g. number of requests), or... John Spray
 

Also available in: Atom