Bug #19705

closed

Ubuntu amd64 client can not discover the ubuntu arm64 ceph cluster

Added by Jay Zhu about 7 years ago. Updated about 5 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Tags:
amd64 client can't discover the arm64 cluster
Backport:
jewel,luminous
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

Hi,

After upgrading my cluster from jewel 10.2.3 (which did not have this issue) to jewel 10.2.6, I found that my client (Ubuntu 16.04 amd64) can not discover my Ceph cluster (Ubuntu 16.04 arm64).

My cluster configuration:

Node    username   os                 machine
deploy  cephadmin  ubuntu16.04 amd64  x86 PC(test client)
node1   cephadmin  ubuntu16.04 arm64  ARM dev board
node2   cephadmin  ubuntu16.04 arm64  ARM dev board
node3   cephadmin  ubuntu16.04 arm64  ARM dev board

Running 'ceph -v' on all nodes:

cephadmin@node1:~$ ceph -v
ceph version 10.2.6 (656b5b63ed7c43bd014bcafd81b001959d5f089f)
cephadmin@node2:~$ ceph -v
ceph version 10.2.6 (656b5b63ed7c43bd014bcafd81b001959d5f089f)
cephadmin@node3:~$ ceph -v
ceph version 10.2.6 (656b5b63ed7c43bd014bcafd81b001959d5f089f)
cephadmin@deploy:~$ ceph -v
ceph version 10.2.6 (656b5b63ed7c43bd014bcafd81b001959d5f089f)

Running 'ceph -s' on a cluster node (node3 shown):

cephadmin@node3:~$ ceph status
cluster 61b4956e-692f-4272-8a12-997538381feb
health HEALTH_OK
monmap e1: 3 mons at {node1=172.12.55.209:6789/0,node2=172.12.55.213:6789/0,node3=172.12.55.210:6789/0}
election epoch 4, quorum 0,1,2 node1,node3,node2
osdmap e14: 3 osds: 3 up, 3 in
flags sortbitwise,require_jewel_osds
pgmap v38: 64 pgs, 1 pools, 0 bytes data, 0 objects
101 MB used, 5571 GB / 5571 GB avail
64 active+clean

Running 'ceph -s' on the client node:

cephadmin@deploy:~$ ceph status
2017-04-07 14:07:38.223860 7f2b28816700 0 monclient(hunting): authenticate timed out after 300
2017-04-07 14:07:38.223909 7f2b28816700 0 librados: client.admin authentication error (110) Connection timed out
Error connecting to cluster: TimedOut
cephadmin@deploy:~$
cephadmin@deploy:~$
cephadmin@deploy:~$ rbd create --size 1024 foo -m 172.12.55.209 --image-feature layering
2017-04-07 15:59:21.398973 7fa5f6511100 0 monclient(hunting): authenticate timed out after 300
2017-04-07 15:59:21.399030 7fa5f6511100 0 librados: client.admin authentication error (110) Connection timed out
rbd: couldn't connect to the cluster!

None of the nodes runs a firewall. I have since upgraded to jewel 10.2.7, but the issue remains. I am not sure which version introduced this interoperability problem between an amd64 client and an arm64 cluster, but it was surely somewhere between 10.2.3 and 10.2.6.
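
One way to rule out a plain network problem before suspecting an architecture incompatibility is to check that each monitor address from the monmap accepts TCP connections from the client. The following is only a minimal sketch, assuming Python 3 is available on the deploy node; the helper names (parse_monmap, can_connect, check_monitors) are hypothetical and not part of Ceph.

```python
import re
import socket

# Monmap line copied from the `ceph status` output on node3.
MONMAP_LINE = ("monmap e1: 3 mons at {node1=172.12.55.209:6789/0,"
               "node2=172.12.55.213:6789/0,node3=172.12.55.210:6789/0}")


def parse_monmap(line):
    """Return {name: (ip, port)} for each monitor in a jewel-style monmap line."""
    pattern = re.compile(r"(\w+)=(\d+\.\d+\.\d+\.\d+):(\d+)/\d+")
    return {name: (ip, int(port)) for name, ip, port in pattern.findall(line)}


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, unreachable hosts, and timeouts.
        return False


def check_monitors(line, timeout=3.0):
    """Return {name: bool} saying whether each monitor accepts a TCP connection."""
    return {name: can_connect(ip, port, timeout)
            for name, (ip, port) in parse_monmap(line).items()}
```

Calling check_monitors(MONMAP_LINE) on the deploy node gives a per-monitor reachability map. If every monitor is reachable, the timeout is more likely to come from the messenger or authentication handshake than from the network, although this check alone cannot confirm that.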



Related issues 2 (0 open, 2 closed)

Copied to Ceph - Backport #21795: luminous: Ubuntu amd64 client can not discover the ubuntu arm64 ceph cluster (Resolved, Nathan Cutler)
Copied to Ceph - Backport #21796: jewel: Ubuntu amd64 client can not discover the ubuntu arm64 ceph cluster (Resolved, Nathan Cutler)