Activity
From 07/21/2021 to 08/19/2021
08/19/2021
- 11:51 PM Bug #51818: "ceph orch host add" presents unhelpful error message if target host is missing cephadm
- Likely caused when podman/docker are not present ...
from the mgr log:... - 11:48 PM Bug #51818 (Fix Under Review): "ceph orch host add" presents unhelpful error message if target ho...
- 09:26 PM Bug #51818 (In Progress): "ceph orch host add" presents unhelpful error message if target host is...
- 06:23 PM Bug #52334 (Fix Under Review): cephadm: tcmalloc environment variable is set for all containers
- 06:20 PM Bug #52334 (Resolved): cephadm: tcmalloc environment variable is set for all containers
- The TCMALLOC_MAX_TOTAL_THREAD_CACHE_BYTES environment variable is set for services deployed by cephadm.
This varia... - 02:55 PM Bug #52279 (Fix Under Review): cephadm tests fail due to: error adding seccomp filter rule for sy...
- 01:43 PM Bug #52328 (Resolved): unfinished update messages in ceph -s ..
- Since the update to v16.2.5 had finished we see the following output when doing 'ceph -s':
.....
progress:
... - 05:05 AM Bug #52321 (New): qa/tasks/rook times out: 'check osd count' reached maximum tries (90) after wai...
- ...
- 04:18 AM Bug #52320 (New): unable to get monitor info from DNS SRV with service name: ceph-mon
- ...
08/18/2021
- 09:08 PM Bug #52040 (Fix Under Review): during an apply the host must be online otherwise the apply fails ...
- 02:16 PM Bug #52279: cephadm tests fail due to: error adding seccomp filter rule for syscall bdflush: requ...
- https://sentry.ceph.com/organizations/ceph/issues/14737/?project=2&query=is%3Aunresolved+%22sudo+systemctl+stop+ceph%...
- 10:20 AM Bug #51027 (Pending Backport): monmap drops rebooted mon if deployed via label
- 03:20 AM Feature #52237: use cephadm shell to operate iscsi gateway
- Sure, here's the existing ones from ceph master:
to create gateways: https://github.com/ceph/ceph/blob/master/src/t...
08/17/2021
- 03:26 PM Bug #51027: monmap drops rebooted mon if deployed via label
- If you want to use the label deployment feature: Not that I was able to find. It's a real problem. And it's been al...
- 03:22 PM Bug #51027: monmap drops rebooted mon if deployed via label
- Is there any workaround for this other than redeploying? As David said this is dangerous. We had quite some trouble t...
- 02:27 PM Bug #52040: during an apply the host must be online otherwise the apply fails with a traceback
- more info: When deploying make sure your hosts are all online. If host listed is not reachable, you’ll get traceback...
08/16/2021
- 03:33 PM Bug #52279: cephadm tests fail due to: error adding seccomp filter rule for syscall bdflush: requ...
- https://pulpito.ceph.com/yuriw-2021-08-12_17:49:36-rados-wip-yuri2-testing-2021-08-10-1044-pacific-distro-basic-smithi/
- 08:50 AM Bug #52279: cephadm tests fail due to: error adding seccomp filter rule for syscall bdflush: requ...
- https://github.com/containers/podman/issues/11031
- 08:47 AM Bug #52279: cephadm tests fail due to: error adding seccomp filter rule for syscall bdflush: requ...
- all on centos 8.2 and centos 8.3
- 08:44 AM Bug #52279 (New): cephadm tests fail due to: error adding seccomp filter rule for syscall bdflush...
- ...
- 02:20 PM Bug #52116 (Pending Backport): kubeadm task fails with error execution phase wait-control-plane: ...
- Fixed in https://github.com/ceph/ceph/commit/517b7759b3ab2b84b2a4ddace411e6ac7599eddd
- 12:59 PM Bug #51621 (Duplicate): Multiple "Updating node-exporter deployment"
- 12:53 PM Support #51737 (Resolved): How to restore data after I reinstall host opera system
- please keep the installed packages and cluster versions consistent.
- 12:50 PM Feature #51793 (Closed): cephadm: Grafana: Add switch to enable the Gafana admin account
- 12:49 PM Bug #51817 (Resolved): FAILED tests/test_cephadm.py::TestShell::test_fsid - AttributeError: 'Fake...
- https://github.com/ceph/ceph/pull/42664
- 12:03 PM Feature #52237: use cephadm shell to operate iscsi gateway
- Hi Deepika, can you give me some example commands of how to interact with gwcli ?
08/13/2021
- 07:21 PM Bug #51616 (Pending Backport): Updating node-exporter deployment progress stuck
- 03:48 PM Documentation #48267 (Resolved): orchestrator docs should at least mention keyrings
- 03:48 PM Documentation #48267: orchestrator docs should at least mention keyrings
- see https://docs.ceph.com/en/latest/cephadm/client-setup/ and https://docs.ceph.com/en/latest/cephadm/operations/#cli...
- 03:47 PM Bug #47495 (Resolved): rook: 'ceph orch device ls' does not list devices
- 03:46 PM Feature #47274: cephadm: make the container_image setting available to the cephadm binary indepen...
- I'm not sure this is a good idea since container_image now references a specific digest. That would mean that 'cephad...
- 03:43 PM Bug #46582 (Resolved): cephadm: NFS services should not share the same namespace in a pool
- This is solved indirectly since the namespace always == service_id now.
- 07:53 AM Bug #52064 (Fix Under Review): octopus: cephadm bootstrap --container-init broken in Octopus
08/12/2021
- 06:29 PM Bug #51667 (Fix Under Review): cephadm: host add existing host should be noop
- 02:39 PM Bug #51667 (In Progress): cephadm: host add existing host should be noop
- 04:56 PM Bug #51027: monmap drops rebooted mon if deployed via label
- We can confirm this impacts 16.2.5 clusters. On host failures/reboots, we have to undeploy/redeploy monitors, which i...
- 04:01 PM Documentation #46335 (Resolved): Document "Using cephadm to set up rgw-nfs"
- https://docs.ceph.com/en/latest/mgr/nfs/#create-rgw-export
- 02:40 PM Bug #52040 (In Progress): during an apply the host must be online otherwise the apply fails with ...
- 01:17 PM Bug #52109 (Won't Fix): test_cephadm.sh: Timeout('Port 8443 not free on 127.0.0.1.',)
- test_cephadm.sh deploys two mgrs on the same host first one from bootstrap and the second with cephadm deploy. both m...
- 09:44 AM Bug #46038 (New): cephadm mon start failure: Failed to reset failed state of unit ceph-9342dcfe-a...
- hey Sebastian/orch team! I added cephadm based iscsi tests recently and am observing this failure after rebase, can y...
- 05:36 AM Feature #52237 (Closed): use cephadm shell to operate iscsi gateway
- Right now a use has to attach to iscsi container in order to access iscsi gateway, the right way should be support gw...
08/11/2021
- 05:18 PM Bug #51632 (Pending Backport): cephadm: selinux is not checked against running configuration
- 05:17 PM Bug #51973 (Pending Backport): cephadm: global default ingress container images value
- 02:26 PM Bug #51590 (Pending Backport): cephadm: iscsi: The first gateway defined must be the local machine
08/10/2021
- 09:11 PM Bug #52116: kubeadm task fails with error execution phase wait-control-plane: couldn't initialize...
- /a/sseshasa-2021-08-06_04:49:51-rados-wip-sseshasa2-testing-2021-08-04-1847-pacific-distro-basic-smithi/6323276
- 06:11 PM Feature #51947: cephadm: Redeploy services, on property update (was: Ingress for RGW does not app...
- Ok, you're right, I did not redeploy, just re-applied the updated ingress yaml. I have tested on my newly upgraded 16...
08/09/2021
- 05:46 PM Bug #52116 (Resolved): kubeadm task fails with error execution phase wait-control-plane: couldn't...
- ...
- 01:58 PM Bug #52109 (Won't Fix): test_cephadm.sh: Timeout('Port 8443 not free on 127.0.0.1.',)
- https://sentry.ceph.com/organizations/ceph/issues/1585/?project=2&query=is%3Aunresolved+%22workunit+test+cephadm%22&s...
- 01:53 PM Bug #51601: mgr/dashboard: server does not bind to all addresses anymore
- do we need to do anything on the cephadm side?
08/06/2021
- 10:22 PM Bug #51620 (Pending Backport): Ceph orch upgrade to 16.2.5 fails
- 09:40 AM Bug #52083 (Resolved): cephadm: networks_and_interfaces: duplicated IPs
- From: https://c4b83513adcc4a3a9323-03e51d134fc1c044893e5fb53732499a.ssl.cf1.rackcdn.com/778915/68/check/tripleo-ci-ce...
08/05/2021
- 08:21 PM Bug #51027 (In Progress): monmap drops rebooted mon if deployed via label
- 12:08 PM Feature #51901 (Closed): cephadm: support pulling images from insecure registries
- closing this one in favor of #52065
- 11:18 AM Bug #49287 (Resolved): podman: setting cgroup config for procHooks process caused: Unit libpod-$h...
- Fixed by https://github.com/opencontainers/runc/pull/2614
- 09:56 AM Feature #52065 (Fix Under Review): make cephadm support passing any additional parameter to eithe...
- This is a feature request to make cephadm support passing any existing parameters from docker or podman CLI.
- 08:52 AM Bug #52064 (In Progress): octopus: cephadm bootstrap --container-init broken in Octopus
- 08:46 AM Bug #52064 (Resolved): octopus: cephadm bootstrap --container-init broken in Octopus
- In Octopus, when the user provides the "--container-init" option to "cephadm bootstrap", all containerized daemons in...
08/04/2021
- 09:10 PM Bug #51978: podman version check broken on cent7
- podman 1.x isn't supported with Pacific
https://docs.ceph.com/en/latest/cephadm/compatibility/#cephadm-compatibili... - 05:04 PM Bug #51978: podman version check broken on cent7
- Please try to avoid running cephadm on centos 7: WE had to disable automated QE for it, cause the old kernel wasn't a...
- 12:56 AM Bug #52042 (Resolved): After deployment the example of cephadm shell invocation is overly complex
- If there is only one ceph instance on the host (which is the most likely user scenario), cephadm shell can infer the ...
- 12:52 AM Bug #52041 (New): `orch ps` shows wrong ports for MGR
- Only one mgr is active, so any mgr module that has an associated listening port should only be listed against the act...
- 12:46 AM Bug #52040 (Resolved): during an apply the host must be online otherwise the apply fails with a t...
- If a host is offline during an apply, the process stops with a traceback instead of continuing to the next host.
... - 12:40 AM Bug #52039 (New): cephadm rm-cluster should check whether the given fsid exists
- if the fsid provided is not present in /var/lib/ceph a warning should be printed. The return status should be *succes...
08/03/2021
- 04:48 PM Bug #51794 (Pending Backport): mgr/test_orchestrator: remove pool and namespace from nfs service
08/02/2021
- 06:48 PM Bug #51806: cephadm: stopped contains end up in error state
- I'm thinking this might not be a cephadm specific issue. For one thing, no matter what version I tested this with (I ...
07/31/2021
07/30/2021
- 10:55 PM Bug #51978 (Closed): podman version check broken on cent7
- I tried upgrading a test cluster (built on CentOS 7) from v15.2.13 to v16.2.5 today and ran into this problem:
---... - 07:29 PM Bug #51973 (Fix Under Review): cephadm: global default ingress container images value
- 07:22 PM Bug #51973 (Resolved): cephadm: global default ingress container images value
- All services (ceph/iscsi/ganesha, prometheus, alertmanager, node-exporter and grafana) have a global default containe...
- 07:02 PM Feature #51972 (Resolved): cephadm/ingress: support TLS RGW backend
- As per the documentation (and the code), the ingress service via haproxy doesn't support RGW backend with TLS. [1][2]...
- 06:38 PM Feature #51971 (Resolved): cephadm/ingress: update keepalived container image
- The default keepalived container image is : arcts/keepalived [1]
There's multiple issues here:
- We don't use a... - 02:44 PM Feature #44414 (Fix Under Review): bubble up errors during 'apply' phase to 'cluster warnings'
- 10:59 AM Bug #51902 (Resolved): cephadm adopt fails on clean_cgroup
- 05:04 AM Bug #51616: Updating node-exporter deployment progress stuck
- Thanks, Harry, this woraround worked for me, though I watched some different daemons which stuck during the update pr...
- 01:51 AM Feature #51947: cephadm: Redeploy services, on property update (was: Ingress for RGW does not app...
- Ok looks like you didn't redeploy the service after updating the spec file with the intermediate ca certificate right...
07/29/2021
- 09:51 PM Feature #51947: cephadm: Redeploy services, on property update (was: Ingress for RGW does not app...
- I finished to test with v16.2.5 and I counldn't reproduce the issue....
- 08:30 PM Feature #51947: cephadm: Redeploy services, on property update (was: Ingress for RGW does not app...
- That's weird because the code doesn't do anything special from the ssl_cert value in the spec
https://github.com/c... - 07:06 PM Feature #51901 (Fix Under Review): cephadm: support pulling images from insecure registries
- 06:06 PM Bug #51961 (Resolved): Stuck progress indicators in ceph status output
- If an exception is thrown while cephadm is attempting to apply a service spec in the serve loop, the progress indicat...
- 01:25 PM Bug #51601: mgr/dashboard: server does not bind to all addresses anymore
- I think this is related to the change in the URI setting and hostname/IP address handling.
07/28/2021
- 09:06 PM Bug #51902 (Fix Under Review): cephadm adopt fails on clean_cgroup
- 02:06 PM Bug #51902 (Resolved): cephadm adopt fails on clean_cgroup
- Until recently, the cephadm adopt command was working perfectly.
Now this ends up with a stack trace... - 07:32 PM Feature #51947 (New): cephadm: Redeploy services, on property update (was: Ingress for RGW does n...
- Using v16.2.4, Ubuntu 20.04 hosts for cluster and ingress (haproxy) for RGW instances. Multisite setup with one zone ...
- 04:29 PM Bug #51829 (Resolved): cephadm: deploying cephadm-exporter fails with shutil SameFileError
- 03:01 PM Bug #49633: podman: ERROR (catatonit:2): failed to exec pid1: No such file or directory
- ...
- 01:50 PM Feature #51901 (Closed): cephadm: support pulling images from insecure registries
- For convenience, it would be nice if cephadm could support pulling images from insecure registries with a native opti...
- 04:48 AM Fix #51721 (Resolved): ingress: Fix for virtual_interface_networks not working
07/26/2021
- 06:56 AM Bug #51736: mgr hung forever when execute multiprocessing.pool.ThreadPool accidentally
- Sebastian Wagner wrote:
> you're sure you did not hit #51733 ?
I think that the bug is different from #51733. The...
07/25/2021
- 08:20 PM Bug #51616: Updating node-exporter deployment progress stuck
Workaround (caution: temporarily disruptive), Assuming this is the only reported problem remaining after upgrade o...- 01:43 PM Bug #51298 (Resolved): ceph orch stop mgr should not stop all the mgrs and should give a warning ...
07/24/2021
07/23/2021
- 03:24 PM Bug #51796 (Fix Under Review): cephadm: unable to deploy grafana without mgr/dashboard
- 03:03 PM Bug #51796 (In Progress): cephadm: unable to deploy grafana without mgr/dashboard
- 02:08 PM Bug #51829 (Resolved): cephadm: deploying cephadm-exporter fails with shutil SameFileError
- ...
- 02:06 PM Bug #51818: "ceph orch host add" presents unhelpful error message if target host is missing cephadm
- in any case this is broken. the list (*[]*) should contain the error message. Somehow it got lost, leading to a non-h...
- 01:24 AM Bug #51818: "ceph orch host add" presents unhelpful error message if target host is missing cephadm
- Argh, sorry for the typos there, s/cephadm/ceph orch/ in several places.
- 01:23 AM Bug #51818 (Resolved): "ceph orch host add" presents unhelpful error message if target host is mi...
- When using "ceph orch host add", if the remote host is missing cephadm
and its dependencies then "ceph orch host add... - 11:10 AM Documentation #47637: mgr/cephadm: document how to configure custom TLS certificate for Grafana
- Reconfigure can be done easier with...
- 09:56 AM Bug #51794 (Fix Under Review): mgr/test_orchestrator: remove pool and namespace from nfs service
- 12:10 AM Bug #51817 (Resolved): FAILED tests/test_cephadm.py::TestShell::test_fsid - AttributeError: 'Fake...
- ...
07/22/2021
- 04:27 PM Bug #51806: cephadm: stopped contains end up in error state
- Adding ...
- 03:26 PM Bug #51806 (Need More Info): cephadm: stopped contains end up in error state
- ...
- 03:53 PM Bug #51111: Pacific: CEPHADM_STRAY_DAEMON after deploying iSCSI gateway with cephadm due to tcmu-...
- ceph version 16.2.5 (0883bdea7337b95e4b611c768c0279868462204a) pacific (stable)
- 03:53 PM Bug #51111: Pacific: CEPHADM_STRAY_DAEMON after deploying iSCSI gateway with cephadm due to tcmu-...
- same here as well:...
- 12:04 PM Bug #51796 (Resolved): cephadm: unable to deploy grafana without mgr/dashboard
- ...
- 11:08 AM Bug #51794 (Resolved): mgr/test_orchestrator: remove pool and namespace from nfs service
- ...
- 09:20 AM Feature #51793 (Closed): cephadm: Grafana: Add switch to enable the Gafana admin account
- Right now, users need to replace the Jinja2 template in order to enable the admin account. This has a few downsides. ...
- 12:56 AM Bug #51355 (Resolved): ingress service /var/lib/haproxy/haproxy.cfg
07/21/2021
- 06:57 PM Feature #50815 (Fix Under Review): cephadm: Removing an offline host
- 02:22 PM Bug #51298 (In Progress): ceph orch stop mgr should not stop all the mgrs and should give a warni...
- 12:36 PM Bug #51733: offline host hangs serve loop for 15 mins
- only is happening is host is not gracefully shutdown
- 12:05 PM Bug #51761 (Closed): journald logs are broken up again
- ...
- 11:46 AM Bug #51311: Failed to apply ingress.rgw: IndexError: list index out of range
- # ceph orch ls
NAME PORTS RUNNING REFRESHED AGE PLACEMENT
alertmanager ... - 08:50 AM Bug #51311 (Fix Under Review): Failed to apply ingress.rgw: IndexError: list index out of range
- 08:15 AM Bug #51713 (Duplicate): Cephadm: Timeout waiting for ingress.nfs.foo to start
- 02:46 AM Support #51737: How to restore data after I reinstall host opera system
- Sebastian Wagner wrote:
> I hope you still have a few MONs and MGRs left. Cause then, you can follow https://docs.ce...
Also available in: Atom