John Spray's activity
From 01/22/2018 to 02/20/2018
02/20/2018
- 01:30 PM mgr Bug #21225: ceph-mgr: dashboard and zabbix plugin report wrong values
- This seems likely to be the same issue as http://tracker.ceph.com/issues/22142, the fix for which will be in 12.2.3
02/19/2018
- 06:34 PM mgr Feature #23042 (Resolved): Prompt ServiceMap removal on clean shutdown
- We should have MgrClient explicitly send a message on shutdown, so that DaemonServer can immediately remove it from S...
- 03:23 PM mgr Bug #23017 (Fix Under Review): mgr log spamming about down osds
- This was also happening at startup even if no OSDs were down, so let's stop reporting it at error level.
https://git... - 03:21 PM Ceph Revision b6b2b07f (ceph): mgr: quieten logging on missing OSD stats
- This is only cause for concern if the OSD is actually up,
and even then it can also happen at startup, so
we shouldn'...
02/14/2018
- 03:19 PM Ceph Revision c15cf4d8 (ceph): mon: dump percent_used PGMap field as float
- Formatting this as a string was awkward for anyone
consuming it from other code.
Signed-off-by: John Spray <john.spr...
02/13/2018
- 11:12 AM mgr Bug #22947 (Duplicate): Prometheus exporter module fails due to incorrect PG_STATES
- Thanks for the report: the fix for this will be part of the forthcoming 12.2.3 release.
02/12/2018
- 01:57 PM mgr Bug #22226: ceph zabbix plugin sends incorrect motinoring info to zabbix server
- If it's correlated with a mon going down then I suspect this is the same underlying cause as http://tracker.ceph.com/...
02/06/2018
- 05:50 PM Ceph Revision f19f1e1c (ceph): mds: don't report repaired backtraces in damagetable
- Fixes: http://tracker.ceph.com/issues/18743
Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from com... - 05:50 PM Ceph Revision 6e544ac9 (ceph): mds: flush after scrub repairs
- Otherwise, if we restart the MDS right after the scrub,
then scrub again, it will see the same inconsistency when
it ... - 05:50 PM Ceph Revision 798cc37c (ceph): mds: clean up clog damage message when repairing
- If we've repaired then we don't need to be at WRN
and we don't need to advise the user to look up
the damage detail.
... - 05:50 PM Ceph Revision 162e25ec (ceph): mds: cleaner scrub complete log msg
- People often are not using a tag with their scrub,
let's not have the ugly "with tag ''" in that case.
Signed-off-by...
02/05/2018
- 08:27 PM mgr Bug #21399 (Closed): ceph-mgr module(s) inaccessible after a reboot
- There's no log to look at, and no further reports, so I'm going to close this.
The fix for the 'exe' thing went in... - 04:46 PM mgr Backport #22921 (In Progress): luminous: dashboard module: 404 for static resouces
- https://github.com/ceph/ceph/pull/20325
- 04:38 PM mgr Backport #22921 (Resolved): luminous: dashboard module: 404 for static resouces
- https://github.com/ceph/ceph/pull/20325
- 04:19 PM mgr Backport #22919 (Closed): luminous: mgr sends early beacon with no modules reported
- Oops, created this backport ticket from the wrong original. Disregard!
- 03:12 PM mgr Backport #22919: luminous: mgr sends early beacon with no modules reported
- https://github.com/ceph/ceph/pull/20321
- 03:11 PM mgr Backport #22919 (Closed): luminous: mgr sends early beacon with no modules reported
- Sends beacon before PyModuleRegistry::init has been called -- if admin quickly does a "mgr module enable" after daemo...
- 12:07 PM Dashboard Bug #22851 (Pending Backport): dashboard module: 404 for static resouces
- 12:06 PM mgr Bug #22918 (Resolved): mgr sends early beacon with no modules reported
- Sends beacon before PyModuleRegistry::init has been called -- if admin quickly does a "mgr module enable" after daemo...
02/01/2018
- 05:31 PM mgr Bug #22880: failed to load ceph_module with Python3
- Kefu, do you know if a py3-compiled mgr has been seen to work? I've never tried it.
- 11:12 AM website Bug #22850: Luminous/Jewel docs missing table of contents
- Confirmed, master has the TOC but luminous doesn't.
- 11:11 AM RADOS Bug #22842: (luminous) ceph-disk prepare of simple filestore failed with 'Unable to set partition...
- I would suspect that something is strange about the disk (non-GPT partition table perhaps?), and you're getting less-...
01/31/2018
- 09:19 PM Ceph Revision d8f9aec7 (ceph): mon: tidy up mds_blacklist_interval
- Remove the misleading reference to this from the client
eviction page, it was never the right option to mention
there... - 09:19 PM Ceph Revision 5dcd2a59 (ceph): mon: s/mds_blacklist_interval/mon_mds_blacklist_interval/
- We can rename LEVEL_DEV options at will.
Signed-off-by: John Spray <john.spray@redhat.com>
(cherry picked from commi... - 01:20 PM CephFS Bug #22839 (Rejected): MDSAuthCaps (unlike others) still require "allow" at start
- This was changed for the OSD and mon caps, but the MDS caps were missed:
https://github.com/ceph/ceph/pull/15991/com... - 12:11 PM mgr Bug #21999 (Resolved): mgr tests don't indicate failure if exception thrown from serve()
- 12:11 PM mgr Bug #21707 (Resolved): "osd status" command exception if OSD not in pgmap stats
- 12:10 PM mgr Bug #21687 (Resolved): mgr: mark_down of osd without metadata is broken
- 12:10 PM mgr Bug #20887 (Resolved): Services reported with blank hostname by mgr
- 12:09 PM mgr Bug #22361 (Resolved): balancer crush-compat sends "foo" command
- 12:09 PM mgr Backport #22455 (Resolved): luminous: balancer crush-compat sends "foo" command
- 12:07 PM mgr Bug #22096 (Resolved): Authentication failed, did you specify a mgr ID with a valid keyring?
- 12:07 PM mgr Backport #22811 (Resolved): luminous: Authentication failed, did you specify a mgr ID with a vali...
- 12:03 PM RADOS Bug #22142 (Resolved): mon doesn't send health status after paxos service is inactive temporarily
- 12:03 PM RADOS Backport #22421 (Resolved): mon doesn't send health status after paxos service is inactive tempor...
- 11:59 AM mgr Bug #22429 (Resolved): mgr/balancer/upmap_max_iterations must be cast to integer
- 11:59 AM mgr Backport #22453 (Resolved): luminous: mgr/balancer/upmap_max_iterations must be cast to integer
- 11:58 AM mgr Bug #22470 (Resolved): KeyError: ('name',) in balancer rm
- 11:58 AM mgr Backport #22496 (Resolved): luminous: KeyError: ('name',) in balancer rm
- 11:49 AM mgr Bug #22313 (Resolved): Prometheus exporter can't get metrics after update to 12.2.2
- 11:48 AM mgr Backport #22585 (Resolved): luminous: Prometheus exporter can't get metrics after update to 12.2.2
- 11:47 AM mgr Fix #22718 (Resolved): mgr: prometheus: missed osd commit\apply latency metrics.
- 11:47 AM mgr Backport #22760 (Resolved): luminous: mgr: prometheus: missed osd commit\apply latency metrics.
- 11:46 AM mgr Bug #22286 (Resolved): Service daemons never recover from transient outage
- 11:46 AM mgr Backport #22768 (Resolved): luminous: Service daemons never recover from transient outage
- 11:43 AM Dashboard Bug #22557 (Resolved): Dashboard on backup MGRs always redirects to /, breaking reverse proxy sup...
- 11:43 AM mgr Backport #22831 (Resolved): luminous: Dashboard on backup MGRs always redirects to /, breaking re...
- 11:33 AM mgr Backport #22197 (Resolved): luminous: mgr: mark_down of osd without metadata is broken
- 11:32 AM mgr Backport #22216 (Resolved): luminous: "osd status" command exception if OSD not in pgmap stats
- 11:32 AM mgr Backport #21479 (Resolved): luminous: Services reported with blank hostname by mgr
- 11:32 AM mgr Backport #22075 (Resolved): luminous: mgr tests don't indicate failure if exception thrown from s...
01/30/2018
- 10:37 AM mgr Backport #22831 (Fix Under Review): luminous: Dashboard on backup MGRs always redirects to /, bre...
- https://github.com/ceph/ceph/pull/20182
- 10:35 AM mgr Backport #22831 (Resolved): luminous: Dashboard on backup MGRs always redirects to /, breaking re...
- https://github.com/ceph/ceph/pull/20182
- 10:33 AM mgr Backport #22576 (Resolved): luminous: zabbix throws exception
- 10:33 AM mgr Bug #22514 (Resolved): zabbix throws exception
- 10:28 AM mgr Bug #21981 (Resolved): mgr: dashboard plugin OSD daemons' table the Usage column's value is alway...
- 10:28 AM mgr Backport #22023 (Resolved): luminous: mgr: dashboard plugin OSD daemons' table the Usage column's...
- 10:27 AM mgr Bug #21599 (Resolved): List of filesystems does not get refreshed after a filesystem deletion
- 10:26 AM mgr Backport #22030 (Resolved): luminous: List of filesystems does not get refreshed after a filesyst...
- 10:26 AM mgr Bug #21253 (Resolved): Prometheus crash when update
- 10:25 AM mgr Bug #20899 (Resolved): prometheus module generates invalid output when counter names contain non-...
- 10:15 AM Ceph Bug #22784 (Fix Under Review): Key exported from auth keyring is not imported properly
- 10:15 AM Ceph Bug #22828 (Fix Under Review): build Debian installation packages failure
01/29/2018
- 03:20 PM CephFS Bug #22754: mon: removing tier from an EC base pool is forbidden, even if allow_ec_overwrites is set
- As far as I'm aware, nobody has worked on it, so that would be a no.
- 02:34 PM mgr Feature #22748 (Fix Under Review): Add units to configuration settings
- https://github.com/ceph/ceph/pull/20163
- 02:34 PM mgr Feature #22747 (Fix Under Review): Add units to performance counters
- https://github.com/ceph/ceph/pull/20152
- 02:32 PM mgr Bug #22327: MGR dashboard doesn't update OSD's ceph version after updating from 12.2.1 to 12.2.2
- I imagine this is probably and underlying ceph-mgr glitch rather than dashboard specific. I'll leave this ticket her...
- 02:29 PM Dashboard Bug #22557 (Pending Backport): Dashboard on backup MGRs always redirects to /, breaking reverse p...
- 02:57 AM Ceph Revision 2852a5f5 (ceph): mon: fix mgr using auth_client_required policy
- This caused mgr daemons to fail to authenticate
when auth_client_required was set to something
different to auth_clus... - 02:57 AM Ceph Revision f2a20277 (ceph): mgr: apply auth_service_required to client conns
- Previously was using auth_cluster_required for all
connections, which meant that if someone had
disabled client cephx...
01/26/2018
- 01:03 PM Ceph Revision 29b5611d (ceph): doc/dev: add a brief guide to serialization
- Contributors encounter this pretty soon when trying
to add any feature to Ceph, so I'd like a central place
to point ...
01/25/2018
- 02:55 PM mgr Backport #21656 (Closed): luminous: crash on DaemonPerfCounters::update
- I think this was already covered by the backport commit 2fea47d9710ec1e388db044065bc42b8046dae22 that was done as par...
- 02:54 PM mgr Bug #21197 (Resolved): crash on DaemonPerfCounters::update
01/24/2018
- 06:08 PM Ceph Revision 6eb5c636 (ceph): mon: expose can_run flag in `module ls` + mgr dump
- Enable people to see can_run failures and the explanatory
messages (telling them about a missing dependency)
before t... - 06:08 PM Ceph Revision 191cce74 (ceph): doc: note new mgr module error codes
- Signed-off-by: John Spray <john.spray@redhat.com>
- 06:08 PM Ceph Revision 9999ddf6 (ceph): mgr: refactor get_modules/list_modules
- list_modules is really about searching for them
on disk, so it's now probe_modules and private.
Both methods now ret... - 06:08 PM Ceph Revision b3e6b8e1 (ceph): qa: always dump corrupt messages in rados/upgrade
- Previously only a service with debug_ms>=1 would dump
corrupt messages: in an upgrade test we're *alway*
interested i... - 06:08 PM Ceph Revision 6a8da7ca (ceph): mgr: load all modules (not just active ones)
- This is to enable us to learn more about the module
before it is enabled, such as whether its can_run method
return t... - 06:08 PM Ceph Revision 712ad57d (ceph): mgr: evaluate `can_run` method on modules
- ...and transmit the result to the monitor in
our beacon.
Fixes: http://tracker.ceph.com/issues/21502
Signed-off-by: ... - 06:08 PM Ceph Revision 834bc279 (ceph): mgr: load command definitions earlier
- ...and for all modules, not just the active ones.
This enables us to give better feedback to the user
when they try ... - 06:08 PM Ceph Revision f95b079c (ceph): qa/mgr: add test for command execution errors
- Signed-off-by: John Spray <john.spray@redhat.com>
- 06:08 PM Ceph Revision d9a47181 (ceph): mgr: add health checks for failed modules
- Signed-off-by: John Spray <john.spray@redhat.com>
01/23/2018
- 12:45 PM Ceph Revision 86ee30c3 (ceph): mgr: apply auth_service_required to client conns
- Previously was using auth_cluster_required for all
connections, which meant that if someone had
disabled client cephx... - 12:31 PM mgr Bug #22096: Authentication failed, did you specify a mgr ID with a valid keyring?
- Ah -- turns out the mgr code never thought about the possibility that people would be using a ceph CLI with cephx tur...
- 10:24 AM mgr Bug #22755 (Fix Under Review): cluster [ERR] Unhandled exception from module 'prometheus' while r...
- Already fixed by https://github.com/ceph/ceph/pull/20047 which should merge soon.
- 10:23 AM Ceph Revision dd4f3225 (ceph): qa: add new prometheus test to rados/mgr suite
- Added in https://github.com/ceph/ceph/pull/19744
Signed-off-by: John Spray <john.spray@redhat.com> - 10:23 AM Ceph Revision e2c68d5e (ceph): qa: assign prometheus ports during selftest
- This was throwing IOError("Port 9283 not free on '::'",)
when trying to serve, since merging https://github.com/ceph/... - 10:17 AM Ceph Revision e1af6673 (ceph): mgr: remove a few junk lines
- Signed-off-by: John Spray <john.spray@redhat.com>
01/22/2018
- 10:11 PM CephFS Bug #22754 (Resolved): mon: removing tier from an EC base pool is forbidden, even if allow_ec_ove...
- OSDMonitor::_check_remove_tier needs to be made aware that this should be permitted if the base tier is suitable for ...
- 02:16 PM RADOS Bug #22746 (Fix Under Review): osd/common: ceph-osd process is terminated by the logratote task
- 02:14 PM CephFS Bug #22741: osdc: "FAILED assert(bh->last_write_tid > tid)" in powercycle-wip-yuri-master-1.19.18...
- Assigned to CephFS because it's crashing in the ceph-fuse process (in the absence of a better home for ObjectCacher i...
- 02:11 PM Ceph Bug #22735 (Closed): about mon_max_pg_per_osd
- The factor osd_max_pg_per_osd_hard_ratio (default 2) is applied to the PG count limit before actually preventing PG c...
- 01:44 PM mgr Bug #22096 (Fix Under Review): Authentication failed, did you specify a mgr ID with a valid keyring?
- https://github.com/ceph/ceph/pull/20048
- 01:23 PM mgr Bug #22096: Authentication failed, did you specify a mgr ID with a valid keyring?
- Looking a few posts back, I see this:...
- 01:43 PM Ceph Revision 1e06fe00 (ceph): mon: fix mgr using auth_client_required policy
- This caused mgr daemons to fail to authenticate
when auth_client_required was set to something
different to auth_clus... - 01:38 PM mgr Bug #22682: "PyModuleRegistry.cc: 139: FAILED assert(map.epoch > 0)" in rados-luminous-distro-bas...
- As with #22266, this is a rebuild-mondb run so presumably the solution is a test modification to avoid leaving the mg...
- 12:14 PM mgr Feature #22748 (Resolved): Add units to configuration settings
- Same as http://tracker.ceph.com/issues/22747 but for configuration settings instead of performance counters.
Confi... - 12:10 PM mgr Feature #22747 (Resolved): Add units to performance counters
- Currently, our performance counters don't indicate whether they are just generic counts (e.g. number of requests), or...
Also available in: Atom