Tasks #17851
jewel v10.2.6
% Done: 0%
Description
Workflow¶
- Preparing the release
- Cutting the release
  - Abhishek V. asks Abhishek L. if a point release should be published - YES
  - Abhishek V. gets approval from all leads
    - Yehuda, rgw - YES (February 20, 2017 on the ceph-devel mailing list)
    - John, CephFS - YES (February 22, 2017 on the ceph-devel mailing list)
    - Jason, RBD - YES (February 20, 2017 on the ceph-devel mailing list)
    - Josh, rados - YES (February 21, 2017 on the ceph-devel mailing list)
  - Abhishek L. writes and commits the release notes
  - Abhishek V. informs Yuri that the branch is ready for testing - DONE (February 22, 2017 on the ceph-devel mailing list)
  - Yuri runs additional integration tests - DONE (March 2, 2017 on the ceph-devel mailing list)
    - If Yuri discovers new bugs that need to be backported urgently (i.e. their priority is set to Urgent), the release goes back to being prepared; it was not ready after all
  - Yuri informs Alfredo that the branch is ready for release - DONE
  - Alfredo creates the packages and sets the release tag
Release information¶
- branch to build from: jewel, commit: d9eaab456ff45ae88e83bd633f0c4efb5902bf07
- version: v10.2.6
- type of release: point release
- where to publish the release: http://download.ceph.com/debian-jewel and http://download.ceph.com/rpm-jewel
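For reference, a minimal sketch of how to double-check the build commit locally (this assumes a clone with the upstream repository configured as the ceph remote; the actual packages and the signed v10.2.6 tag are produced by the release tooling, not by these commands):
git fetch ceph
git branch -r --contains d9eaab456ff45ae88e83bd633f0c4efb5902bf07   # should list ceph/jewel
git log -1 d9eaab456ff45ae88e83bd633f0c4efb5902bf07                  # inspect the commit the release is built from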
History
#1 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel-next..jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 10865
- + Pull request 11413
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11508
- |\
- | + utime.h: fix timezone issue in round_to_* funcs.
- + Pull request 11529
- |\
- | + common: Improve linux dcache hash algorithm
- + Pull request 11574
- + Pull request 11606
- + Pull request 11627
- + Pull request 11660
- |\
- | + mon/PGMap: PGs can be stuck more than one thing
- + Pull request 11672
- |\
- | + rgw_rest_s3: apply missed base64 try-catch
- + Pull request 11675
- + Pull request 11735
- + Pull request 11736
- + Pull request 11737
- + Pull request 11743
- + Pull request 11757
- |\
- | + rgw ldap: protect rgw::from_base64 from non-base64 input
- + Pull request 11758
- |\
- | + rgw: fix osd crashes when execute radosgw-admin bi list --max-entries=1 command
- + Pull request 11759
- |\
- | + rgw: json encode/decode index_type, allow modification
- + Pull request 11760
- + Pull request 11852
- + Pull request 11853
- + Pull request 11854
- + Pull request 11855
- |\
- | + rpm: fix permissions for /etc/ceph/rbdmap
- + Pull request 11856
- + Pull request 11857
- + Pull request 11858
- + Pull request 11860
- + Pull request 11861
- + Pull request 11862
- + Pull request 11863
- + Pull request 11864
- + Pull request 11865
- |\
- | + rgw: RGWSimpleRadosReadCR tolerates empty reads
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11867
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11869
- + Pull request 11870
- + Pull request 11871
- |\
- | + librbd: batch ObjectMap updations upon trim
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11873
- + Pull request 11875
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11884
- + ceph-create-keys: wait 10 minutes to get or create the bootstrap key, not forever
- + ceph-create-keys: wait 10 minutes to get or create a key, not forever
- + ceph-create-keys: wait for quorum for ten minutes, not forever
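The listing above is produced by the one-liner at the top of this note: the range ceph/jewel-next..jewel-backports selects the commits present in the jewel-backports integration branch but not yet in jewel-next, i.e. the backports staged for this release. A minimal sketch of the prerequisites and of a simpler variant without the perl formatting (assuming the upstream repository is configured as the ceph remote and jewel-backports exists as a local branch):
git fetch ceph                                                           # refresh ceph/jewel-next
git --no-pager log --oneline --merges ceph/jewel-next..jewel-backports   # one line per merged backport pull request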
#2 Updated by Loïc Dachary over 6 years ago
rbd¶
teuthology-suite --priority 1000 --suite rbd --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2016-11-10_10:23:37-rbd-jewel-backports---basic-smithi/
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=b124ab8b49b6708673caf6b240d73515122f8eb3 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror.sh'
- 'mkdir
Re-running failed tests
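The exact re-run invocation is not recorded here; a sketch of the usual pattern, following the --filter usage shown in note #7 below, is to re-schedule only the failed jobs by their descriptions, for example:
filter='rbd/mirror/{base/install.yaml cluster/{2-node.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml rbd-mirror/one-per-cluster.yaml workloads/rbd-mirror-workunit.yaml}'   # hypothetical description of the failed rbd_mirror job, as shown on the pulpito page
teuthology-suite --priority 1000 --suite rbd --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi --filter="$filter"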
#3 Updated by Loïc Dachary over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2016-11-10_10:25:55-rgw-jewel-backports-distro-basic-smithi
- HTTPConnectionPool(host='smithi075.front.sepia.ceph.com', port=8000): Max retries exceeded with url: /metadata/incremental (Caused by NewConnectionError('<requests.packages.urllib3.connection.HTTPConnection object at 0x7fb10a323dd0>: Failed to establish a new connection: [Errno 111] Connection refused',))
Re-running failed tests
#4 Updated by Loïc Dachary over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 2000)/2000 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2016-11-10_10:27:02-rados-jewel-backports-distro-basic-smithi
- 'cd /home/ubuntu/cephtest && sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper term valgrind --trace-children=no --child-silent-after-fork=yes --num-callers=50 --suppressions=/home/ubuntu/cephtest/valgrind.supp --xml=yes --xml-file=/var/log/ceph/valgrind/mon.b.log --time-stamp=yes --tool=memcheck --leak-check=full --show-reachable=yes ceph-mon -f --cluster ceph -i b'
- 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds
Re-running failed tests
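As an aside on the scheduling command above: --subset $(expr $RANDOM % 2000)/2000 runs only one pseudo-randomly chosen slice out of 2000 of the (very large) rados suite. For illustration, the argument expands like this:
i=$(expr $RANDOM % 2000)   # a slice index between 0 and 1999
echo "--subset $i/2000"    # e.g. --subset 1371/2000: schedule only that 1/2000th of the suite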
#5 Updated by Loïc Dachary over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2016-11-10_10:29:25-fs-jewel-backports-distro-basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=b124ab8b49b6708673caf6b240d73515122f8eb3 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs/test.sh'
- fs/basic/{clusters/fixed-2-ucephfs.yaml debug/mds_client.yaml dirfrag/frag_enable.yaml fs/btrfs.yaml inline/no.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml}
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml validater/valgrind.yaml}
- fs/basic/{clusters/fixed-2-ucephfs.yaml debug/mds_client.yaml dirfrag/frag_enable.yaml fs/btrfs.yaml inline/yes.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml}
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml validater/lockdep.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=b124ab8b49b6708673caf6b240d73515122f8eb3 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs-java/test.sh'
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 3'
- 'mkdir
Re-running failed tests
- fail http://pulpito.ceph.com/loic-2016-11-11_10:57:34-fs-jewel-backports-distro-basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=b124ab8b49b6708673caf6b240d73515122f8eb3 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs/test.sh'
- fs/basic/{clusters/fixed-2-ucephfs.yaml debug/mds_client.yaml dirfrag/frag_enable.yaml fs/btrfs.yaml inline/no.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml}
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml validater/valgrind.yaml}
- fs/basic/{clusters/fixed-2-ucephfs.yaml debug/mds_client.yaml dirfrag/frag_enable.yaml fs/btrfs.yaml inline/yes.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml}
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml validater/lockdep.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=b124ab8b49b6708673caf6b240d73515122f8eb3 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs-java/test.sh'
- 'mkdir
#6 Updated by Loïc Dachary over 6 years ago
powercycle¶
teuthology-suite -v -c jewel-backports --suite-branch jewel -k testing -m smithi -s powercycle -p 1000 --email loic@dachary.org
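For readability, the short options in the powercycle command above map to the long forms used in the other notes; an equivalent sketch of the same run:
teuthology-suite --verbose --ceph jewel-backports --suite-branch jewel --kernel testing --machine-type smithi --suite powercycle --priority 1000 --email loic@dachary.org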
#7 Updated by Loïc Dachary over 6 years ago
Upgrade¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
- fail http://pulpito.ceph.com/loic-2016-11-10_10:31:33-upgrade:jewel-x-jewel-backports-distro-basic-vps
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=jewel TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- 'mkdir
Re-running failed tests
- pass http://pulpito.ceph.com/loic-2016-11-11_10:55:56-upgrade:jewel-x-jewel-backports-distro-basic-vps
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
- fail http://pulpito.ceph.com/loic-2016-11-10_10:32:38-upgrade:hammer-x-jewel-backports-distro-basic-vps
- 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds
Re-running failed tests
- fail http://pulpito.ceph.com/loic-2016-11-11_10:52:59-upgrade:hammer-x-jewel-backports-distro-basic-vps
Re-running the dead test with https://github.com/ceph/ceph-qa-suite/pull/1256 to set require_jewel
filter='upgrade:hammer-x/stress-split/{0-tz-eastern.yaml 0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/{rbd-cls.yaml rbd-import-export.yaml readwrite.yaml snaps-few-objects.yaml} 6-next-mon/monb.yaml 7-workload/{radosbench.yaml rbd_api.yaml} 8-next-mon/monc.yaml 9-workload/{rbd-python.yaml rgw-swift.yaml snaps-many-objects.yaml} distros/ubuntu_14.04.yaml}'
teuthology-suite -k distro --filter="$filter" --verbose --suite upgrade/hammer-x --suite-branch wip-17734-jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
#8 Updated by Loïc Dachary over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
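The trailing machine_types/vps.yaml is an extra config fragment that teuthology-suite merges into every scheduled job; more than one overlay can be appended, which is how ~/shaman.yaml is added in the later notes. A sketch with a hypothetical second overlay:
teuthology-suite -k distro --verbose --suite ceph-disk --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml ~/extra-overrides.yaml   # ~/extra-overrides.yaml is hypothetical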
#9 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel-next..jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11497
- |\
- | + rgw: settle /info implementation across other swift-at-root features.
- | + swift /info implementation.
- | + rgw: add support for the healthcheck feature of Swift API.
- | + rgw: add support for the crossdomain.xml resource of Swift API.
- | + rgw: fix the handling of rgw_swift_url_prefix.
- | + rgw_http_errors: add http error code for 503
- | + rgw: Allow to serve Swift off the URL root
- + Pull request 11508
- |\
- | + utime.h: fix timezone issue in round_to_* funcs.
- + Pull request 11529
- |\
- | + common: Improve linux dcache hash algorithm
- + Pull request 11656
- |\
- | + ceph_volume_client: fix partial auth recovery
- | + ceph_volume_client: check if volume metadata is empty
- | + ceph_volume_client: fix _recover_auth_meta() method
- + Pull request 11672
- |\
- | + rgw_rest_s3: apply missed base64 try-catch
- + Pull request 11758
- |\
- | + rgw: fix osd crashes when execute radosgw-admin bi list --max-entries=1 command
- + Pull request 11759
- |\
- | + rgw: json encode/decode index_type, allow modification
- + Pull request 11865
- |\
- | + rgw: RGWSimpleRadosReadCR tolerates empty reads
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11884
- |\
- | + ceph-create-keys: wait 10 minutes to get or create the bootstrap key, not forever
- | + ceph-create-keys: wait 10 minutes to get or create a key, not forever
- | + ceph-create-keys: wait for quorum for ten minutes, not forever
- + Pull request 11944
- |\
- | + osd: Add config option to disable new scrubs during recovery
- + Pull request 11947
- |\
- | + mon: update mon(peon)'s down_pending_out when osd up
- + Pull request 11953
- |\
- | + test: temporarily disable fork()'ing tests
- + Pull request 11968
- |\
- | + qa/workunits: update test_cache_pool.sh
- | + tools/rados: add --with-clones option to include clones for cache-flush/cache-evict
- | + tools/rados: default to include clone objects when excuting cache-flush-evict-all
- + Pull request 11990
- |\
- | + rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now
- + Pull request 11991
- |\
- | + osd: limit omap data in push op
- + Pull request 11997
- |\
- | + ReplicatedPG::do_update_log_missing: take the pg lock in the callback
- + Pull request 11998
- |\
- | + test: add test for fiemap xfs issue when #extents > 1364
- | + FileStore:: fix fiemap issue in xfs when #extents > 1364
- + Pull request 11999
- |\
- | + mon: MonmapMonitor: drop unnecessary 'goto' statements
- | + mon: MonmapMonitor: return success when monitor will be removed
- + Pull request 12001
- |\
- | + os/filestore/HashIndex: fix list_by_hash_* termination on reaching end
- + Pull request 12033
- |\
- | + mon,ceph-disk: add lockbox permissions to bootstrap-osd
- + Pull request 12043
- |\
- | + rbd-mirror: Add sparse read for sync image
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12045
- |\
- | + rgw_file: fix spurious mount entries w/Linux NFS client
- + Pull request 12067
- |\
- | + OSDMonitor: only reject MOSDBoot based on up_from if inst matches
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12088
- |\
- | + test.sh: Make check for flags more robust
- | + test: Remove extra objectstore_tool call which causes a recovery
- | + test: Handle object removals in a non-racey way
- | + osd: Fix hang on unfound object after mark_unfound_lost is done
- | + osd: Handle recovery read errors
- | + osd: Fix log messages
- | + osd: CLEANUP: Remove unused pending_read member
- | + Revert test: Disable tests due to recovery race
- | + test/osd-scrub-repair.sh: Use test case specific object names to help with diagnostics
- + Pull request 12137
- |\
- | + client: fix stale entries in command table
- + Pull request 12147
- |\
- | + ceph-disk: trigger must ensure device ownership
- | + ceph-disk: systemd unit must run after local-fs.target
- + Pull request 12151
- |\
- | + tests: save 9 characters for asok paths
- + Pull request 12153
- |\
- | + mds: ignore 'session evict' when mds is replaying log
- + Pull request 12154
- |\
- | + mds: use projected path construction for access
- + Pull request 12155
- |\
- | + mds: require MAY_SET_POOL to set pool_ns
- + Pull request 12156
- + rgw: look for region_map in rgw_region_root_pool
- + rgw: region conversion respects pre-existing rgw_region_root_pool
#10 Updated by Loïc Dachary over 6 years ago
rbd¶
teuthology-suite --priority 1000 --suite rbd --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/loic-2016-11-23_21:22:04-rbd-jewel-backports---basic-smithi/
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 6h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror_stress.sh'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror.sh'
- rbd/mirror/{base/install.yaml cluster/{2-node.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml rbd-mirror/one-per-cluster.yaml workloads/rbd-mirror-workunit.yaml}
- rbd/mirror/{base/install.yaml cluster/{2-node.yaml openstack.yaml} fs/xfs.yaml msgr-failures/many.yaml rbd-mirror/one-per-cluster.yaml workloads/rbd-mirror-workunit.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin IMAGE_NAME=client.0.1-clone adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/qemu_rebuild_object_map.sh'
- 'mkdir
Re-running the failed jobs
- fail http://pulpito.ceph.com/loic-2016-11-25_08:54:08-rbd-jewel-backports---basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 6h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror_stress.sh'
- 'adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper term /usr/libexec/qemu-kvm -enable-kvm -nographic -m 4096 -drive file=/home/ubuntu/cephtest/qemu/base.client.0.qcow2,format=qcow2,if=virtio -cdrom /home/ubuntu/cephtest/qemu/client.0.iso -drive file=rbd:rbd/client.0.0-clone:id=0,format=raw,if=virtio,cache=none -drive file=rbd:rbd/client.0.1-clone:id=0,format=raw,if=virtio,cache=none'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror.sh'
- 'mkdir
Re-running failed jobs
Re-running failed jobs on jewel
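Re-running the failed jobs against plain jewel (instead of jewel-backports) is a way to check whether the failures already exist without the staged backports; a sketch of that comparison run, keeping the same suite and filter:
teuthology-suite --priority 1000 --suite rbd --suite-branch jewel --email loic@dachary.org --ceph jewel --machine-type smithi --filter="$filter"   # $filter holds the descriptions of the failed jobs, as in the earlier sketch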
#11 Updated by Loïc Dachary over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/loic-2016-11-23_21:26:42-rgw-jewel-backports-distro-basic-smithi/
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec-profile.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec-cache.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/replicated.yaml}
Re-running failed jobs
#12 Updated by Loïc Dachary over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 2000)/2000 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
#13 Updated by Loïc Dachary over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --subset $(expr $RANDOM % 5)/5 --suite-branch jewel --email loic@dachary.org --ceph jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/loic-2016-11-23_21:37:30-fs-jewel-backports-distro-basic-smithi/
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs-java/test.sh'
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 3'
- 'mkdir
Re-running failed jobs
- fail http://pulpito.front.sepia.ceph.com/loic-2016-11-25_08:58:04-fs-jewel-backports-distro-basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=4920d41d4d8be57c06687d4de029ec2b687a9f09 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs-java/test.sh'
- 'mkdir
#14 Updated by Loïc Dachary over 6 years ago
powercycle¶
teuthology-suite -v -c jewel-backports --suite-branch jewel -k distro -m smithi -s powercycle -p 1000 --email loic@dachary.org
- fail http://pulpito.ceph.com:80/loic-2016-11-23_21:39:07-powercycle-jewel-backports-distro-basic-smithi/
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 2'
Re-running the failed job
#15 Updated by Loïc Dachary over 6 years ago
Upgrade¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
- fail http://pulpito.ceph.com/loic-2016-11-23_21:39:57-upgrade:jewel-x-jewel-backports-distro-basic-vps
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=jewel TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- 'mkdir
Re-running failed jobs
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
- fail http://pulpito.ceph.com/loic-2016-11-23_21:46:45-upgrade:hammer-x-jewel-backports-distro-basic-vps
- File is closed
- upgrade:hammer-x/stress-split-erasure-code/{0-tz-eastern.yaml 0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=jerasure-k=3-m=1.yaml distros/ubuntu_14.04.yaml}
- upgrade:hammer-x/stress-split-erasure-code-x86_64/{0-tz-eastern.yaml 0-x86_64.yaml 0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=isa-k=2-m=1.yaml}
- 'sudo TESTDIR=/home/ubuntu/cephtest bash -c \'sudo ceph osd erasure-code-profile set profile-shec k=2 m=1 c=1 plugin=shec 2>&1 | grep "unsupported by"\''
- upgrade:hammer-x/stress-split-erasure-code/{0-tz-eastern.yaml 0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-no-shec.yaml distros/ubuntu_14.04.yaml}
- upgrade:hammer-x/stress-split-erasure-code/{0-tz-eastern.yaml 0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-no-shec.yaml distros/centos_7.2.yaml}
- File is closed
Re-running failed jobs
#16 Updated by Loïc Dachary over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --suite-branch jewel --ceph jewel-backports --machine-type vps --priority 1000 machine_types/vps.yaml
#17 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel-next..wip-jewel-backports-loic | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 10661
- |\
- | + rgw: Have a flavor of bucket deletion to bypass GC and to trigger object deletions async.
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11497
- |\
- | + rgw: settle /info implementation across other swift-at-root features.
- | + swift /info implementation.
- | + rgw: add support for the healthcheck feature of Swift API.
- | + rgw: add support for the crossdomain.xml resource of Swift API.
- | + rgw: fix the handling of rgw_swift_url_prefix.
- | + rgw_http_errors: add http error code for 503
- | + rgw: Allow to serve Swift off the URL root
- + Pull request 11508
- |\
- | + utime.h: fix timezone issue in round_to_* funcs.
- + Pull request 11672
- |\
- | + rgw_rest_s3: apply missed base64 try-catch
- + Pull request 11759
- |\
- | + rgw: json encode/decode index_type, allow modification
- + Pull request 11865
- |\
- | + rgw: RGWSimpleRadosReadCR tolerates empty reads
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11944
- |\
- | + osd: Add config option to disable new scrubs during recovery
- + Pull request 11947
- |\
- | + mon: update mon(peon)'s down_pending_out when osd up
- + Pull request 11953
- |\
- | + test: temporarily disable fork()'ing tests
- + Pull request 11968
- |\
- | + qa/workunits: update test_cache_pool.sh
- | + tools/rados: add --with-clones option to include clones for cache-flush/cache-evict
- | + tools/rados: default to include clone objects when excuting cache-flush-evict-all
- + Pull request 11990
- |\
- | + rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now
- + Pull request 11991
- |\
- | + osd: limit omap data in push op
- + Pull request 11997
- |\
- | + ReplicatedPG::do_update_log_missing: take the pg lock in the callback
- + Pull request 12033
- |\
- | + mon,ceph-disk: add lockbox permissions to bootstrap-osd
- + Pull request 12043
- |\
- | + rbd-mirror: Add sparse read for sync image
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12045
- |\
- | + rgw_file: fix spurious mount entries w/Linux NFS client
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12088
- |\
- | + test.sh: Make check for flags more robust
- | + test: Remove extra objectstore_tool call which causes a recovery
- | + test: Handle object removals in a non-racey way
- | + osd: Fix hang on unfound object after mark_unfound_lost is done
- | + osd: Handle recovery read errors
- | + osd: Fix log messages
- | + osd: CLEANUP: Remove unused pending_read member
- | + test/osd-scrub-repair.sh: Use test case specific object names to help with diagnostics
- + Pull request 12137
- |\
- | + client: fix stale entries in command table
- + Pull request 12147
- |\
- | + ceph-disk: enable --runtime ceph-osd systemd units
- | + build/ops: restart ceph-osd@.service after 20s instead of 100ms
- | + ceph-disk: trigger must ensure device ownership
- | + ceph-disk: systemd unit must run after local-fs.target
- + Pull request 12151
- |\
- | + tests: save 9 characters for asok paths
- + Pull request 12153
- |\
- | + mds: ignore 'session evict' when mds is replaying log
- + Pull request 12154
- |\
- | + mds: use projected path construction for access
- + Pull request 12155
- |\
- | + mds: require MAY_SET_POOL to set pool_ns
- + Pull request 12156
- |\
- | + rgw: look for region_map in rgw_region_root_pool
- | + rgw: region conversion respects pre-existing rgw_region_root_pool
- + Pull request 12159
- |\
- | + qa/workunits/rbd: check status also in pool dir after asok commands
- | + qa/workunits/rbd: wait for image deleted before checking health
- | + qa/workunits/rbd: wait for image deleted before checking health
- | + qa/workunits/rbd: small fixup and improvements for rbd-mirror tests
- + Pull request 12210
- |\
- | + systemd/ceph-disk: reduce ceph-disk flock contention
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12258
- |\
- | + rgw: add support for the prefix parameter in account listing of Swift API.
- | + rgw: optimize out ctor-copy in RGWListBuckets_ObjStore_SWIFT.
- + Pull request 12291
- |\
- | + msg/simple/Pipe: handle addr decode error
- + Pull request 12296
- |\
- | + build/ops: fix undefined crypto references with --with-xio
- + Pull request 12302
- |\
- | + tests: check hostname --fqdn sanity before running cmake check
- | + tests: check hostname --fqdn sanity before running make check
- + Pull request 12313
- |\
- | + rgw: only set CURLOPT_UPLOAD for PUT/POST requests
- + Pull request 12314
- |\
- | + rgw: RGWBucketSyncStatusManager uses existing async_rados
- + Pull request 12315
- |\
- | + rgw: fix missing master zone for a single zone zonegroup
- + Pull request 12316
- |\
- | + rgw: add recovery procedure for upgrade to older version of jewel
- + Pull request 12320
- |\
- | + rgw: fix for versioned delete_multi_object
- | + rgw:fix for deleting objects name beginning and ending with underscores of one bucket using POST method of AWS's js sdk. Fixes: http://tracker.ceph.com/issues/17888
- + Pull request 12321
- |\
- | + qa/workunits/rbd: use image id when probing for image presence
- + Pull request 12322
- |\
- | + librbd: diffs to clone's first snapshot should include parent diffs
- + Pull request 12323
- |\
- | + librbd: account m_processing when failing request after refresh
- + Pull request 12324
- |\
- | + mds: force client flush snap data before truncating objects
- + Pull request 12325
- |\
- | + ceph_volume_client: set an existing auth ID's default mon caps
- + Merge remote-tracking branch 'ceph/jewel-next' into wip-jewel-backports-loic
- + Pull request 12167
- |\
- | + crush: condition latest tunable encoding on features
- | + crush/CrushWrapper: encode with features
- | + crush/CrushWrapper: drop unused 'lean' encode() argument
- | + osd/osd_types: encode pg_pool_t like hammer if features indicate hammer
- | + osd/osd_types: conditional pg_pool_t encoding
- + Pull request 12067
- |\
- | + OSDMonitor: only reject MOSDBoot based on up_from if inst matches
- |/
- + Pull request 12207
- |\
- | + librados: remove new setxattr overload to avoid breaking the C++ ABI
- + Pull request 12267
- |\
- | + mon: MonmapMonitor: drop unnecessary 'goto' statements
- | + mon: MonmapMonitor: return success when monitor will be removed
- |/
- + Pull request 12001
- + os/filestore/HashIndex: fix list_by_hash_* termination on reaching end
#18 Updated by Loïc Dachary over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --suite-branch jewel --ceph wip-jewel-backports-loic --machine-type vps --priority 1000 machine_types/vps.yaml ~/shaman.yaml
#19 Updated by Loïc Dachary over 6 years ago
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --suite-branch jewel --ceph wip-jewel-backports-loic --machine-type vps --priority 1000 machine_types/vps.yaml ~/shaman.yaml
- fail http://pulpito.ceph.com/loic-2016-12-06_09:38:33-upgrade:hammer-x-wip-jewel-backports-loic-distro-basic-vps
- ceph-objectstore-tool: exp list-pgs failure with status 1
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=95cefea9fd9ab740263bf8bb4796fd864d9afe2b
- 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=centos%2F7%2Fx86_64&ref=firefly
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=firefly
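The 'Failed to fetch package version' errors above mean shaman has no ready build for the requested ref or sha1 on that distro; the same API the harness queries can be checked by hand with the URL from the error message, for example:
curl -s 'https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=firefly'   # an empty result means shaman has no ready build for that ref/distro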
#20 Updated by Loïc Dachary over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports-loic --suite-branch jewel -k distro -m smithi -s powercycle -p 1000 --email loic@dachary.org ~/shaman.yaml
- fail http://pulpito.ceph.com/loic-2016-12-06_09:40:31-powercycle-wip-jewel-backports-loic-distro-basic-smithi
- , 'smithi057.front.sepia.ceph.com': {'msg': 'One or more items failed', 'failed': True, 'changed': False}}
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 0'
#21 Updated by Loïc Dachary over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports-loic --machine-type smithi ~/shaman.yaml
- fail http://pulpito.ceph.com/loic-2016-12-06_09:42:04-fs-wip-jewel-backports-loic-distro-basic-smithi
- }
- saw valgrind issues
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 3'
#22 Updated by Loïc Dachary over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 50)/50 --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports-loic --machine-type smithi ~/shaman.yaml
- fail http://pulpito.ceph.com/loic-2016-12-06_09:43:31-rados-wip-jewel-backports-loic-distro-basic-smithi
- SELinux denials found on ubuntu@smithi013.front.sepia.ceph.com: ['type=AVC msg=audit(1481196479.614:8253): avc: denied { read } for pid=10410 comm="ceph-osd" name="type" dev="nvme0n1p3" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481196520.457:8326): avc: denied { read } for pid=11590 comm="ceph-osd" name="type" dev="nvme0n1p3" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481196479.614:8253): avc: denied { open } for pid=10410 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-1/type" dev="nvme0n1p3" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481196520.457:8326): avc: denied { open } for pid=11590 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-1/type" dev="nvme0n1p3" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file']
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=v0.80.8
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=9d446bd416c52cd785ccf048ca67737ceafcdd7f
- 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds
- saw valgrind issues
- rados/verify/{rados.yaml 1thrash/default.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr/async.yaml msgr-failures/few.yaml tasks/rados_api_tests.yaml validater/valgrind.yaml}
- rados/verify/{rados.yaml 1thrash/default.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr/simple.yaml msgr-failures/few.yaml tasks/rados_cls_all.yaml validater/valgrind.yaml}
#23 Updated by Loïc Dachary over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports-loic --machine-type smithi ~/shaman.yaml
- fail http://pulpito.ceph.com/loic-2016-12-06_09:46:40-rgw-wip-jewel-backports-loic-distro-basic-smithi
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/replicated.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec-cache.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/replicated.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec-profile.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec-profile.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec-cache.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/replicated.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec-cache.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/replicated.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec-profile.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-multi-region.yaml frontend/civetweb.yaml fs/xfs.yaml rgw_pool_type/ec-profile.yaml}
- rgw/singleton/{overrides.yaml xfs.yaml all/radosgw-admin-data-sync.yaml frontend/apache.yaml fs/xfs.yaml rgw_pool_type/ec-cache.yaml}
- HTTPConnectionPool(host='smithi126.front.sepia.ceph.com', port=8000): Max retries exceeded with url: /metadata/incremental (Caused by NewConnectionError('<requests.packages.urllib3.connection.HTTPConnection object at 0x7fa208f19e10>: Failed to establish a new connection: [Errno 111] Connection refused',))
- "SWIFT_TEST_CONFIG_FILE=/home/ubuntu/cephtest/archive/testswift.client.0.conf /home/ubuntu/cephtest/swift/virtualenv/bin/nosetests -w /home/ubuntu/cephtest/swift/test/functional -v -a '!fails_on_rgw'"
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_swift.yaml validater/lockdep.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_swift.yaml validater/lockdep.yaml}
- '/home/ubuntu/cephtest/s3-tests/virtualenv/bin/s3tests-test-readwrite'
- saw valgrind issues
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/replicated.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/replicated.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/replicated.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-profile.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/replicated.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-profile.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/replicated.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{overrides.yaml clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml rgw_pool_type/ec-profile.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
#24 Updated by Loïc Dachary over 6 years ago
rbd¶
teuthology-suite --priority 1000 --suite rbd --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports-loic --machine-type smithi ~/shaman.yaml
- fail http://pulpito.ceph.com/loic-2016-12-06_09:46:56-rbd-wip-jewel-backports-loic---basic-smithi
- 'adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper term /usr/libexec/qemu-kvm -enable-kvm -nographic -m 4096 -drive file=/home/ubuntu/cephtest/qemu/base.client.0.qcow2,format=qcow2,if=virtio -cdrom /home/ubuntu/cephtest/qemu/client.0.iso -drive file=rbd:rbd/client.0.0-clone:id=0,format=raw,if=virtio,cache=writeback -drive file=rbd:rbd/client.0.1-clone:id=0,format=raw,if=virtio,cache=writeback'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=10328195aad09f59d5c2c382bd9241c7418f744e TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/qemu-iotests.sh'
- 'adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper term /usr/libexec/qemu-kvm -enable-kvm -nographic -m 4096 -drive file=/home/ubuntu/cephtest/qemu/base.client.0.qcow2,format=qcow2,if=virtio -cdrom /home/ubuntu/cephtest/qemu/client.0.iso -drive file=rbd:rbd/client.0.0-clone:id=0,format=raw,if=virtio,cache=writethrough -drive file=rbd:rbd/client.0.1-clone:id=0,format=raw,if=virtio,cache=writethrough'
- SELinux denials found on ubuntu@smithi023.front.sepia.ceph.com: ['type=AVC msg=audit(1481252822.592:36797): avc: denied { create } for pid=13771 comm="mandb" name="13771" scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.561:36792): avc: denied { unlink } for pid=13771 comm="mandb" name="index.db" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.561:36792): avc: denied { remove_name } for pid=13771 comm="mandb" name="#index.db#" dev="sda1" ino=29363155 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=dir', 'type=AVC msg=audit(1481252822.462:36790): avc: denied { create } for pid=13771 comm="mandb" name="#index.db#" scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252821.938:36784): avc: denied { create } for pid=13760 comm="logrotate" name="logrotate.status.tmp" scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252821.939:36785): avc: denied { setattr } for pid=13760 comm="logrotate" name="logrotate.status.tmp" dev="sda1" ino=29363169 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.561:36792): avc: denied { rename } for pid=13771 comm="mandb" name="#index.db#" dev="sda1" ino=29363155 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.182:36789): avc: denied { lock } for pid=13771 comm="mandb" path="/var/cache/man/index.db" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.561:36792): avc: denied { add_name } for pid=13771 comm="mandb" name="index.db" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=dir', 'type=AVC msg=audit(1481252821.938:36784): avc: denied { write } for pid=13760 comm="logrotate" path="/var/lib/logrotate.status.tmp" dev="sda1" ino=29363169 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.182:36788): avc: denied { open } for pid=13771 comm="mandb" path="/var/cache/man/index.db" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.026:36786): avc: denied { rename } for pid=13760 comm="logrotate" name="logrotate.status.tmp" dev="sda1" ino=29363169 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.462:36790): avc: denied { add_name } for pid=13771 comm="mandb" name="#index.db#" scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=dir', 'type=AVC msg=audit(1481252822.562:36795): avc: denied { read write } for pid=13771 comm="mandb" path="/var/cache/man/index.db" dev="sda1" ino=29363155 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.561:36792): avc: denied { write } for pid=13771 comm="mandb" 
name="man" dev="sda1" ino=29360274 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=dir', 'type=AVC msg=audit(1481252821.786:36783): avc: denied { read } for pid=13760 comm="logrotate" name="logrotate.status" dev="sda1" ino=29363155 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.561:36793): avc: denied { lock } for pid=13771 comm="mandb" path=2F7661722F63616368652F6D616E2F23696E6465782E646223202864656C6574656429 dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.562:36794): avc: denied { getattr } for pid=13771 comm="mandb" path="/var/cache/man/index.db" dev="sda1" ino=29363155 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252821.786:36783): avc: denied { open } for pid=13760 comm="logrotate" path="/var/lib/logrotate.status" dev="sda1" ino=29363155 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.182:36788): avc: denied { read write } for pid=13771 comm="mandb" name="index.db" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252821.786:36782): avc: denied { getattr } for pid=13760 comm="logrotate" path="/var/lib/logrotate.status" dev="sda1" ino=29363155 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.595:36798): avc: denied { setattr } for pid=13771 comm="mandb" name="13771" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.026:36786): avc: denied { unlink } for pid=13760 comm="logrotate" name="logrotate.status" dev="sda1" ino=29363155 scontext=system_u:system_r:logrotate_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252823.146:36812): avc: denied { read } for pid=13771 comm="mandb" name="man" dev="sda1" ino=29360274 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=dir', 'type=AVC msg=audit(1481252822.157:36787): avc: denied { getattr } for pid=13771 comm="mandb" path="/var/cache/man/index.db" dev="sda1" ino=29361527 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481252822.462:36790): avc: denied { write } for pid=13771 comm="mandb" name="man" dev="sda1" ino=29360274 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=dir', 'type=AVC msg=audit(1481252822.592:36796): avc: denied { open } for pid=13771 comm="mandb" path="/var/cache/man/index.db" dev="sda1" ino=29363155 scontext=system_u:system_r:mandb_t:s0-s0:c0.c1023 tcontext=system_u:object_r:unlabeled_t:s0 tclass=file']
- }, 'changed': False, 'msg': 'Failed to update apt cache.'}}
- 'adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper term /usr/libexec/qemu-kvm -enable-kvm -nographic -m 4096 -drive file=/home/ubuntu/cephtest/qemu/base.client.0.qcow2,format=qcow2,if=virtio -cdrom /home/ubuntu/cephtest/qemu/client.0.iso -drive file=rbd:rbd/client.0.0-clone:id=0,format=raw,if=virtio,cache=none -drive file=rbd:rbd/client.0.1-clone:id=0,format=raw,if=virtio,cache=none'
- rbd/qemu/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} features/defaults.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/qemu_xfstests.yaml}
- rbd/maintenance/{xfs.yaml base/install.yaml clusters/{fixed-3.yaml openstack.yaml} qemu/xfstests.yaml workloads/dynamic_features.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=10328195aad09f59d5c2c382bd9241c7418f744e TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 6h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror_stress.sh'
- rbd/mirror/{base/install.yaml cluster/{2-node.yaml openstack.yaml} fs/xfs.yaml msgr-failures/many.yaml rbd-mirror/one-per-cluster.yaml workloads/rbd-mirror-stress-workunit.yaml}
- rbd/mirror/{base/install.yaml cluster/{2-node.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml rbd-mirror/one-per-cluster.yaml workloads/rbd-mirror-stress-workunit.yaml}
- SELinux denials found on ubuntu@smithi012.front.sepia.ceph.com: ['type=AVC msg=audit(1481253420.151:8286): avc: denied { open } for pid=11385 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253937.158:8443): avc: denied { open } for pid=17743 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253420.151:8286): avc: denied { read } for pid=11385 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253688.221:8387): avc: denied { read } for pid=14751 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253688.221:8387): avc: denied { open } for pid=14751 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253937.158:8443): avc: denied { read } for pid=17743 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253440.886:8326): avc: denied { read } for pid=12332 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481253440.886:8326): avc: denied { open } for pid=12332 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file']
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=10328195aad09f59d5c2c382bd9241c7418f744e TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=125 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd.sh'
- }, 'changed': False, 'msg': 'Failed to update apt cache.'}}
- SELinux denials found on ubuntu@smithi003.front.sepia.ceph.com: ['type=AVC msg=audit(1481244833.591:4250): avc: denied { read } for pid=23876 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481244813.084:4167): avc: denied { read } for pid=23274 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481244833.591:4250): avc: denied { open } for pid=23876 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481244813.084:4167): avc: denied { open } for pid=23274 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file']
- SELinux denials found on ubuntu@smithi003.front.sepia.ceph.com: ['type=AVC msg=audit(1481245626.292:4358): avc: denied { open } for pid=18438 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481245483.209:4162): avc: denied { read } for pid=23241 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481245483.209:4162): avc: denied { open } for pid=23241 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481245626.292:4358): avc: denied { read } for pid=18438 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481245503.709:4257): avc: denied { open } for pid=23893 comm="ceph-osd" path="/var/lib/ceph/osd/ceph-0/type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file', 'type=AVC msg=audit(1481245503.709:4257): avc: denied { read } for pid=23893 comm="ceph-osd" name="type" dev="nvme0n1p2" ino=25 scontext=system_u:system_r:ceph_t:s0 tcontext=unconfined_u:object_r:unlabeled_t:s0 tclass=file']
- }, 'changed': False, 'msg': 'Failed to update apt cache.'}}
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster ceph -i 3'
Re-running failed jobs:
#25 Updated by Loïc Dachary over 6 years ago
- Subject changed from jewel v10.2.5 to jewel v10.2.6
- Target version changed from v10.2.5 to v10.2.6
#26 Updated by Loïc Dachary over 6 years ago
- Description updated (diff)
#27 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 10661
- |\
- | + rgw: Have a flavor of bucket deletion to bypass GC and to trigger object deletions async.
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11497
- |\
- | + rgw: settle /info implementation across other swift-at-root features.
- | + swift /info implementation.
- | + rgw: add support for the healthcheck feature of Swift API.
- | + rgw: add support for the crossdomain.xml resource of Swift API.
- | + rgw: fix the handling of rgw_swift_url_prefix.
- | + rgw_http_errors: add http error code for 503
- | + rgw: Allow to serve Swift off the URL root
- + Pull request 11508
- |\
- | + utime.h: fix timezone issue in round_to_* funcs.
- + Pull request 11656
- |\
- | + ceph_volume_client: fix partial auth recovery
- | + ceph_volume_client: check if volume metadata is empty
- | + ceph_volume_client: fix _recover_auth_meta() method
- + Pull request 11759
- |\
- | + rgw: json encode/decode index_type, allow modification
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11947
- |\
- | + mon: update mon(peon)'s down_pending_out when osd up
- + Pull request 11990
- |\
- | + rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now
- + Pull request 11991
- |\
- | + osd: limit omap data in push op
- + Pull request 11998
- |\
- | + FileStore::_do_fiemap: do not reference fiemap after it is freed
- | + test: add test for fiemap xfs issue when #extents > 1364
- | + FileStore:: fix fiemap issue in xfs when #extents > 1364
- + Pull request 12043
- |\
- | + rbd-mirror: fix sparse read optimization in image sync
- | + rbd-mirror: set SEQUENTIAL and NOCACHE advise flags on image sync
- | + rbd-mirror: Add sparse read for sync image
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12045
- |\
- | + rgw_file: fix spurious mount entries w/Linux NFS client
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12137
- |\
- | + client: fix stale entries in command table
- + Pull request 12147
- |\
- | + ceph-disk: enable --runtime ceph-osd systemd units
- | + build/ops: restart ceph-osd@.service after 20s instead of 100ms
- | + ceph-disk: trigger must ensure device ownership
- | + ceph-disk: systemd unit must run after local-fs.target
- + Pull request 12153
- |\
- | + mds: ignore 'session evict' when mds is replaying log
- + Pull request 12154
- |\
- | + mds: use projected path construction for access
- + Pull request 12155
- |\
- | + mds: require MAY_SET_POOL to set pool_ns
- + Pull request 12156
- |\
- | + rgw: look for region_map in rgw_region_root_pool
- | + rgw: region conversion respects pre-existing rgw_region_root_pool
- + Pull request 12210
- |\
- | + systemd/ceph-disk: reduce ceph-disk flock contention
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12258
- |\
- | + rgw: add support for the prefix parameter in account listing of Swift API.
- | + rgw: optimize out ctor-copy in RGWListBuckets_ObjStore_SWIFT.
- + Pull request 12288
- |\
- | + doc, test: Add schemas for list-inconsistent-* rados command output
- | + test: Update testing for new list-inconsistent-obj output
- | + rados, osd: Improve attrs output of list-inconsistent-obj
- | + osd: Fix logging to help with diagnostics
- | + test: Fix use of wait_for_clean()
- | + common: Change cleanbin() to use base64 encoding, update ceph-objectstore-tool
- | + common: Move cleanbin() function to common/util.cc
- | + test: Add test support for deep-scrub
- | + common: Fix indentation
- | + osd: Handle corrupt attributes in get_object_context()
- | + test/osd-scrub-repair.sh: Use test case specific object names to help with diagnostics
- + Pull request 12291
- |\
- | + msg/simple/Pipe: handle addr decode error
- + Pull request 12302
- |\
- | + tests: check hostname --fqdn sanity before running cmake check
- | + tests: check hostname --fqdn sanity before running make check
- + Pull request 12313
- |\
- | + rgw: only set CURLOPT_UPLOAD for PUT/POST requests
- + Pull request 12314
- |\
- | + rgw: RGWBucketSyncStatusManager uses existing async_rados
- + Pull request 12315
- |\
- | + rgw: fix missing master zone for a single zone zonegroup
- + Pull request 12316
- |\
- | + rgw: add recovery procedure for upgrade to older version of jewel
- + Pull request 12320
- |\
- | + rgw: fix for versioned delete_multi_object
- | + rgw:fix for deleting objects name beginning and ending with underscores of one bucket using POST method of AWS's js sdk. Fixes: http://tracker.ceph.com/issues/17888
- + Pull request 12321
- |\
- | + qa/workunits/rbd: use image id when probing for image presence
- + Pull request 12322
- |\
- | + librbd: diffs to clone's first snapshot should include parent diffs
- + Pull request 12323
- |\
- | + librbd: account m_processing when failing request after refresh
- + Pull request 12324
- |\
- | + mds: force client flush snap data before truncating objects
- + Pull request 12325
- |\
- | + ceph_volume_client: set an existing auth ID's default mon caps
- + Pull request 12380
- |\
- | + os/ObjectStore: properly clone object map when replaying OP_COLL_MOVE_RENAME
- | + os/ObjectStore: properly clear object map when replaying OP_REMOVE
- + Pull request 12397
- |\
- | + rgw: do not abort when accept a CORS request with short origin
- + Pull request 12405
- |\
- | + install-deps.sh: unify indentation in case statement
- | + install-deps.sh: allow building on SLES systems
- | + build/ops: fix /etc/os-release parsing in install-deps.sh
- | + install-deps.sh: initial distro detection based on /etc/os-release
- + Pull request 12416
- |\
- | + msg: don't truncate message sequence to 32-bits
- + Pull request 12419
- |\
- | + rgw: omap_get_all() fixes
- | + rgw/rgw_rados: do not omap_getvals with (u64)-1 max
- + Pull request 12424
- |\
- | + qa/workunits/rbd: removed qemu-iotest case 077
- + Pull request 12425
- |\
- | + rbd-mirror: make 'rbd mirror image resync' work after split-brain
- | + qa/workunits/rbd: test_status_in_pool_dir: explicitly check grep return value
- | + rbd-mirror: split-brain issues should be clearly visible in mirror status
- | + rbd-mirror: fix gmock warnings in bootstrap request unit tests
- + Pull request 12426
- |\
- | + rbd: --max_part and --nbds_max options for nbd map
- + Pull request 12428
- |\
- | + radosgw-admin: 'zone placement modify' doesnt require pool names
- | + radosgw-admin: add 'zonegroup placement default' command
- | + radosgw-admin: fix 'placment' typos
- | + rgw_admin: commands to manage placement targets
- | + rgw-admin: add commands to manage zonegroup placement fields
- | + rgw: use set for zonegroup placement target tags
- + Pull request 12529
- |\
- | + rbd: utilize new API methods for image id and block name prefix
- | + librbd: new API methods to retrieve image id and block name prefix
- + Pull request 12542
- |\
- | + rgw: Replacing '+' with %20 in canonical uri for s3 v4 auth.
- + Pull request 12622
- |\
- | + rgw: log name instead of id for SystemMetaObj on failure
- | + rgw: drop unnecessary spacing in rgw zg init log
- + Pull request 12649
- |\
- | + librbd/diff_iterator: use proper snap to query parent overlap
- + Pull request 12677
- |\
- | + mon: OSDMonitor: trigger an immediate propose if any newly down osd is detected during tick()
- + Pull request 12678
- |\
- | + rgw: ldap: simple_bind() should set ldap option on tldap
- + Pull request 12686
- |\
- | + tests: rbd/test_lock_fence.sh: fix rbdrw.py relative path
- | + qa/tasks/workunit: clear clone dir before retrying checkout
- | + qa/tasks/workunit: retry on ceph.git if checkout fails
- | + qa/workunits: include extension for nose tests
- | + qa/workunits: use relative path instead of wget from git
- | + qa/tasks/workunit.py: add CEPH_BASE env var
- | + qa/tasks/workunit: leave workunits inside git checkout
- + Pull request 12738
- |\
- | + rgw: use explicit flag to cancel RGWCoroutinesManager::run()
- + Pull request 12739
- |\
- | + journal: prevent repetitive error messages after being blacklisted
- | + journal: avoid logging an error when a watch is blacklisted
- + Pull request 12741
- |\
- | + rbd: fix json formatting for image and journal status output
- + Pull request 12745
- |\
- | + tests: use ceph-jewel branch for s3tests
- + Pull request 12753
- |\
- | + librbd: ignore error when object map is already locked by current client
- + Pull request 12754
- |\
- | + rbd-nbd: support partition for rbd-nbd mapped raw block device.
- + Pull request 12755
- |\
- | + rados: optionally support reading omap key from file
- + Pull request 12756
- |\
- | + rbd-nbd: invalid error code for failed to read nbd request messages
- + Pull request 12761
- |\
- | + qa/tasks/admin_socket: subst in repo name
- + Pull request 12764
- |\
- | + rgw: fix decoding of creation_time and last_update.
- + Pull request 12783
- |\
- | + client: don't use special faked-up inode for /..
- + Pull request 12789
- |\
- | + osd: improve error message when FileStore op fails due to EPERM
- + Pull request 12822
- |\
- | + tests: subst repo and branch in qemu test urls
- | + tests: subst branch and repo in qa/tasks/qemu.py
- | + tests: subst repo name in krbd/unmap/tasks/unmap.yaml
- | + tests: subst repo name in qa/tasks/cram.py
- | + cram: support fetching from sha1 branch, tag, commit hash
- + Pull request 12836
- |\
- | + qa/tasks: add test_corrupt_backtrace
- | + mds: check for errors decoding backtraces
- + Pull request 12875
- |\
- | + osd/PG: publish PG stats when backfill-related states change
- + Pull request 12890
- + librados: blacklist_add should wait for latest OSD map
- + librbd: prevent assertion failure when journal IO is blacklisted
- + librbd: ignore blacklist error when releasing exclusive lock
- + librbd: fail immediately if the exclusive lock cannot be acquired
- + librbd: add new lock_get_owners / lock_break_lock API methods
- + librbd: separate break lock logic into standalone state machine
- + librbd: separate locker query into standalone state machine
- + librbd/exclusive_lock/AcquireRequest.cc: init lock_type
- + librbd: API methods to directly acquire and release the exclusive lock
- + Merge branch 'wip-17134-jewel' of https://github.com/dillaman/ceph into wip-18453-jewel
- + rbd-mirror: fix error messages formatting
- + librbd: ignore partial refresh error when acquiring exclusive lock
- + librbd: potential seg fault when blacklisting an image client
- + librbd: potential double-unwatch of watch handle upon error
- + librbd: deadlock when replaying journal during image open
- + librbd: improve image state machine debug log messages
- + librbd: remove unused refresh request logic
- + librbd: interlock image refresh and lock operations
- + librbd: image state machine now has hooks for lock requests
- + librbd: integrate asynchronous image rewatch state machine
- + librbd: helper state machine for asynchronous watch recovery
- + librbd: exclusive lock now supports reacquiring a lost lock
- + librbd: store exclusive lock cookie instead of recalculating
- + librbd: helper state machine to update lock cookie
- + cls_lock: support updating the lock cookie without releasing the lock
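For reference, a hedged, simplified equivalent of the git log one-liner at the top of this update: it only lists the merged pull requests (not the per-commit graph lines), and it assumes a remote named ceph plus a POSIX sed.
# list backport PRs merged into wip-jewel-backports but not yet in ceph/jewel,
# formatted as Redmine textile links (simplified sketch of the perl one-liner above)
git --no-pager log --merges --format='%s' ceph/jewel..wip-jewel-backports |
  sed -n 's|^Merge pull request #\([0-9]\{1,\}\).*|* "Pull request \1":https://github.com/ceph/ceph/pull/\1|p'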
#28 Updated by Loïc Dachary over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 50)/50 --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-12_15:26:07-rados-wip-jewel-backports-distro-basic-smithi/
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=hammer TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- "2017-01-14 09:13:29.649276 osd.3 172.21.15.44:6800/865390 12 : cluster [ERR] 5.0 shard 3: soid 5:a0216fbc:::repair_test_obj:head size 1 != size 223 from auth oi 5:a0216fbc:::repair_test_obj:head(20'1 client.4352.0:1 dirty|data_digest|omap_digest s 223 uv 1 dd 9a3a59aa od ffffffff)" in cluster log
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=v0.80.8
- Command crashed: 'CEPH_CLIENT_ID=0 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage ceph_test_rados --max-ops 4000 --objects 500 --max-in-flight 16 --size 4000000 --min-stride-size 400000 --max-stride-size 800000 --max-seconds 0 --op snap_remove 50 --op snap_create 50 --op rollback 50 --op read 100 --op copy_from 50 --op write 50 --op write_excl 50 --op delete 50 --pool base'
- 'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=9d446bd416c52cd785ccf048ca67737ceafcdd7f
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rados/test_python.sh'
- "wget -q -O /home/ubuntu/cephtest/admin_socket_client.0/objecter_requests -- 'http://git.ceph.com/?p=ceph.git;a=blob_plain;f=src/test/admin_socket/objecter_requests;hb=wip-jewel-backports' && chmod u=rx -- /home/ubuntu/cephtest/admin_socket_client.0/objecter_requests"
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/few.yaml msgr/simple.yaml rados.yaml thrashers/pggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/normal_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/few.yaml msgr/simple.yaml rados.yaml thrashers/default.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/osd-delay.yaml msgr/simple.yaml rados.yaml thrashers/mapgap.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/normal_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/osd-delay.yaml msgr/simple.yaml rados.yaml thrashers/morepggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/osd-delay.yaml msgr/simple.yaml rados.yaml thrashers/pggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/normal_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/osd-delay.yaml msgr/simple.yaml rados.yaml thrashers/default.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/fastclose.yaml msgr/simple.yaml rados.yaml thrashers/mapgap.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/normal_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/fastclose.yaml msgr/simple.yaml rados.yaml thrashers/morepggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/fastclose.yaml msgr/simple.yaml rados.yaml thrashers/pggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/normal_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/fastclose.yaml msgr/simple.yaml rados.yaml thrashers/default.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/few.yaml msgr/async.yaml rados.yaml thrashers/mapgap.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/normal_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/few.yaml msgr/async.yaml rados.yaml thrashers/morepggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml hobj-sort.yaml msgr-failures/few.yaml msgr/async.yaml rados.yaml thrashers/pggrow.yaml workloads/admin_socket_objecter_requests.yaml}
- saw valgrind issues
Re-running failed jobs
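A hedged sketch of how only the failed jobs are typically rescheduled (this assumes a teuthology-suite version that provides --rerun and --rerun-statuses; the exact set of required options may differ):
# reschedule only the jobs that failed in the run linked above
teuthology-suite --machine-type smithi --priority 1000 \
  --rerun loic-2017-01-12_15:26:07-rados-wip-jewel-backports-distro-basic-smithi \
  --rerun-statuses fail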
#29 Updated by Loïc Dachary over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --suite-branch jewel --ceph wip-jewel-backports --machine-type vps --priority 1000
#30 Updated by Loïc Dachary over 6 years ago
Upgrade¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x --suite-branch jewel --ceph wip-jewel-backports --machine-type vps --priority 1000
Re-running dead job
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --suite-branch jewel --ceph wip-jewel-backports --machine-type vps --priority 1000
- fail http://pulpito.ceph.com/loic-2017-01-12_15:38:04-upgrade:hammer-x-wip-jewel-backports-distro-basic-vps
- "sudo yum -y install '' ceph-radosgw"
- 'git clone git://git.ceph.com/ceph-ci.git /home/ubuntu/cephtest/clone.client.0 ; cd -- /home/ubuntu/cephtest/clone.client.0 && git checkout firefly && mv qa/workunits /home/ubuntu/cephtest/workunit.client.0'
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=95cefea9fd9ab740263bf8bb4796fd864d9afe2b
- 'git clone git://git.ceph.com/ceph-ci.git /home/ubuntu/cephtest/clone.client.0 ; cd -- /home/ubuntu/cephtest/clone.client.0 && git checkout hammer && mv qa/workunits /home/ubuntu/cephtest/workunit.client.0'
- upgrade:hammer-x/parallel/{0-cluster/start.yaml 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-workload/{blogbench.yaml ec-rados-default.yaml rados_api.yaml rados_loadgenbig.yaml test_rbd_api.yaml test_rbd_python.yaml} 3-upgrade-sequence/upgrade-all.yaml 4-jewel.yaml 5-final-workload/{blogbench.yaml rados-snaps-few-objects.yaml rados_loadgenmix.yaml rados_mon_thrash.yaml rbd_cls.yaml rbd_import_export.yaml rgw_swift.yaml} distros/ubuntu_14.04.yaml}
- upgrade:hammer-x/stress-split/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/{rbd-cls.yaml rbd-import-export.yaml readwrite.yaml snaps-few-objects.yaml} 6-next-mon/monb.yaml 7-workload/{radosbench.yaml rbd_api.yaml} 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/{rbd-python.yaml rgw-swift.yaml snaps-many-objects.yaml} distros/ubuntu_14.04.yaml}
- upgrade:hammer-x/parallel/{0-cluster/start.yaml 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-workload/{blogbench.yaml ec-rados-default.yaml rados_api.yaml rados_loadgenbig.yaml test_rbd_api.yaml test_rbd_python.yaml} 3-upgrade-sequence/upgrade-osd-mds-mon.yaml 4-jewel.yaml 5-final-workload/{blogbench.yaml rados-snaps-few-objects.yaml rados_loadgenmix.yaml rados_mon_thrash.yaml rbd_cls.yaml rbd_import_export.yaml rgw_swift.yaml} distros/centos_7.2.yaml}
- upgrade:hammer-x/parallel/{0-cluster/start.yaml 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-workload/{blogbench.yaml ec-rados-default.yaml rados_api.yaml rados_loadgenbig.yaml test_rbd_api.yaml test_rbd_python.yaml} 3-upgrade-sequence/upgrade-osd-mds-mon.yaml 4-jewel.yaml 5-final-workload/{blogbench.yaml rados-snaps-few-objects.yaml rados_loadgenmix.yaml rados_mon_thrash.yaml rbd_cls.yaml rbd_import_export.yaml rgw_swift.yaml} distros/ubuntu_14.04.yaml}
- upgrade:hammer-x/stress-split/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/{rbd-cls.yaml rbd-import-export.yaml readwrite.yaml snaps-few-objects.yaml} 6-next-mon/monb.yaml 7-workload/{radosbench.yaml rbd_api.yaml} 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/{rbd-python.yaml rgw-swift.yaml snaps-many-objects.yaml} distros/centos_7.2.yaml}
- upgrade:hammer-x/parallel/{0-cluster/start.yaml 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-workload/{blogbench.yaml ec-rados-default.yaml rados_api.yaml rados_loadgenbig.yaml test_rbd_api.yaml test_rbd_python.yaml} 3-upgrade-sequence/upgrade-all.yaml 4-jewel.yaml 5-final-workload/{blogbench.yaml rados-snaps-few-objects.yaml rados_loadgenmix.yaml rados_mon_thrash.yaml rbd_cls.yaml rbd_import_export.yaml rgw_swift.yaml} distros/centos_7.2.yaml}
Re-running failed tests
#31 Updated by Loïc Dachary over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports --suite-branch jewel -k distro -m smithi -s powercycle -p 1000 --email loic@dachary.org
- fail http://pulpito.ceph.com/loic-2017-01-12_15:39:38-powercycle-wip-jewel-backports-distro-basic-smithi
- "wget
q -O /home/ubuntu/cephtest/admin_socket_client.0/objecter_requests -'http://git.ceph.com/?p=ceph.git;a=blob_plain;f=src/test/admin_socket/objecter_requests;hb=wip-jewel-backports' && chmod u=rx -- /home/ubuntu/cephtest/admin_socket_client.0/objecter_requests" - timed out waiting for admin_socket to appear after osd.0 restart
- "wget
Re-running failed jobs
#32 Updated by Loïc Dachary over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-12_15:40:36-fs-wip-jewel-backports-distro-basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/libcephfs-java/test.sh'
- Test failure: test_session_reject (tasks.cephfs.test_sessionmap.TestSessionMap)
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/fs/test_python.sh'
- fs/basic/{clusters/fixed-2-ucephfs.yaml debug/mds_client.yaml dirfrag/frag_enable.yaml fs/btrfs.yaml inline/no.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_python.yaml}
- fs/basic/{clusters/fixed-2-ucephfs.yaml debug/mds_client.yaml dirfrag/frag_enable.yaml fs/btrfs.yaml inline/yes.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_python.yaml}
- saw valgrind issues
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/libcephfs_interface_tests.yaml validater/valgrind.yaml}
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/cfuse_workunit_suites_dbench.yaml validater/valgrind.yaml}
- fs/verify/{clusters/fixed-2-ucephfs.yaml debug/{mds_client.yaml mon.yaml} dirfrag/frag_enable.yaml fs/btrfs.yaml overrides/whitelist_wrongly_marked_down.yaml tasks/cfuse_workunit_suites_fsstress.yaml validater/valgrind.yaml}
- 'mkdir
Re-running failed jobs
#33 Updated by Loïc Dachary over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-12_15:41:47-rgw-wip-jewel-backports-distro-basic-smithi
- saw valgrind issues
- rgw/verify/{clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/replicated.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec-profile.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/replicated.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/replicated.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec-profile.yaml tasks/rgw_s3tests_multiregion.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/apache.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec-cache.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/replicated.yaml tasks/rgw_swift.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/ec-profile.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- rgw/verify/{clusters/fixed-2.yaml frontend/civetweb.yaml fs/btrfs.yaml msgr-failures/few.yaml overrides.yaml rgw_pool_type/replicated.yaml tasks/rgw_s3tests.yaml validater/valgrind.yaml}
- saw valgrind issues
Re-running failed jobs
- fail http://pulpito.ceph.com/loic-2017-01-23_06:30:03-rgw-wip-jewel-backports-distro-basic-smithi
- three passed, all the rest failed due to "valgrind issues" (see the sketch below for pulling the valgrind reports out of the run archive)
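A hedged sketch of how the valgrind reports for a run like the one above can be located; the archive root and directory layout used here are assumptions and vary between labs and teuthology versions:
# look for valgrind reports that contain real errors rather than suppressed noise
RUN=/a/loic-2017-01-23_06:30:03-rgw-wip-jewel-backports-distro-basic-smithi
find "$RUN" -path '*valgrind*' -name '*.log*' \
  -exec zgrep -l -e 'definitely lost' -e 'Invalid read' -e 'Invalid write' {} +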
#34 Updated by Loïc Dachary over 6 years ago
rbd¶
teuthology-suite --priority 1000 --suite rbd --suite-branch jewel --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-12_15:43:54-rbd-wip-jewel-backports---basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 6h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror_stress.sh'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=61 VALGRIND=memcheck adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=125 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- rbd/librbd/{cache/writeback.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/none.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/none.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- rbd/librbd/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_journaling.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=61 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- rbd/librbd/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/none.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/none.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests_with_defaults.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- rbd/basic/{base/install.yaml cachepool/none.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/many.yaml tasks/rbd_python_api_tests_old_format.yaml}
- rbd/basic/{base/install.yaml cachepool/small.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml tasks/rbd_python_api_tests_old_format.yaml}
- rbd/basic/{base/install.yaml cachepool/small.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/many.yaml tasks/rbd_python_api_tests_old_format.yaml}
- rbd/basic/{base/install.yaml cachepool/none.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml tasks/rbd_python_api_tests_old_format.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=1 VALGRIND=memcheck adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- rbd/librbd/{cache/writethrough.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/none.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/none.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/small.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/on.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writethrough.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/none.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- rbd/librbd/{cache/writeback.yaml cachepool/none.yaml clusters/{fixed-3.yaml openstack.yaml} copy-on-read/off.yaml fs/xfs.yaml msgr-failures/few.yaml workloads/python_api_tests.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_lock_fence.sh'
- rbd/basic/{base/install.yaml cachepool/small.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml tasks/rbd_lock_and_fence.yaml}
- rbd/basic/{base/install.yaml cachepool/none.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/many.yaml tasks/rbd_lock_and_fence.yaml}
- rbd/basic/{base/install.yaml cachepool/none.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/few.yaml tasks/rbd_lock_and_fence.yaml}
- rbd/basic/{base/install.yaml cachepool/small.yaml clusters/{fixed-1.yaml openstack.yaml} fs/xfs.yaml msgr-failures/many.yaml tasks/rbd_lock_and_fence.yaml}
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/qemu-iotests.sh'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.cluster1.client.mirror/rbd/rbd_mirror.sh'
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=87836ae7a2a75a9c7324de4bf91a2e8c27058832 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin RBD_FEATURES=125 VALGRIND=memcheck adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/workunit.client.0/rbd/test_librbd_python.sh'
- 'mkdir
Re-running failed jobs
#35 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 10661
- |\
- | + rgw: Have a flavor of bucket deletion to bypass GC and to trigger object deletions async.
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11497
- |\
- | + rgw: settle /info implementation across other swift-at-root features.
- | + swift /info implementation.
- | + rgw: add support for the healthcheck feature of Swift API.
- | + rgw: add support for the crossdomain.xml resource of Swift API.
- | + rgw: fix the handling of rgw_swift_url_prefix.
- | + rgw_http_errors: add http error code for 503
- | + rgw: Allow to serve Swift off the URL root
- + Pull request 11508
- |\
- | + utime.h: fix timezone issue in round_to_* funcs.
- + Pull request 11759
- |\
- | + rgw: json encode/decode index_type, allow modification
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11947
- |\
- | + mon: update mon(peon)'s down_pending_out when osd up
- + Pull request 11990
- |\
- | + rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now
- + Pull request 11991
- |\
- | + osd: limit omap data in push op
- + Pull request 11998
- |\
- | + FileStore::_do_fiemap: do not reference fiemap after it is freed
- | + test: add test for fiemap xfs issue when #extents > 1364
- | + FileStore:: fix fiemap issue in xfs when #extents > 1364
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12045
- |\
- | + rgw_file: fix spurious mount entries w/Linux NFS client
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12153
- |\
- | + mds: ignore 'session evict' when mds is replaying log
- + Pull request 12156
- |\
- | + rgw: look for region_map in rgw_region_root_pool
- | + rgw: region conversion respects pre-existing rgw_region_root_pool
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12258
- |\
- | + rgw: add support for the prefix parameter in account listing of Swift API.
- | + rgw: optimize out ctor-copy in RGWListBuckets_ObjStore_SWIFT.
- + Pull request 12291
- |\
- | + msg/simple/Pipe: handle addr decode error
- + Pull request 12302
- |\
- | + tests: check hostname --fqdn sanity before running cmake check
- | + tests: check hostname --fqdn sanity before running make check
- + Pull request 12313
- |\
- | + rgw: only set CURLOPT_UPLOAD for PUT/POST requests
- + Pull request 12314
- |\
- | + rgw: RGWBucketSyncStatusManager uses existing async_rados
- + Pull request 12315
- |\
- | + rgw: fix missing master zone for a single zone zonegroup
- + Pull request 12316
- |\
- | + rgw: add recovery procedure for upgrade to older version of jewel
- + Pull request 12320
- |\
- | + rgw: fix for versioned delete_multi_object
- | + rgw:fix for deleting objects name beginning and ending with underscores of one bucket using POST method of AWS's js sdk. Fixes: http://tracker.ceph.com/issues/17888
- + Pull request 12321
- |\
- | + qa/workunits/rbd: use image id when probing for image presence
- + Pull request 12322
- |\
- | + librbd: diffs to clone's first snapshot should include parent diffs
- + Pull request 12323
- |\
- | + librbd: account m_processing when failing request after refresh
- + Pull request 12324
- |\
- | + mds: force client flush snap data before truncating objects
- + Pull request 12380
- |\
- | + os/ObjectStore: properly clone object map when replaying OP_COLL_MOVE_RENAME
- | + os/ObjectStore: properly clear object map when replaying OP_REMOVE
- + Pull request 12397
- |\
- | + rgw: do not abort when accept a CORS request with short origin
- + Pull request 12405
- |\
- | + install-deps.sh: unify indentation in case statement
- | + install-deps.sh: allow building on SLES systems
- | + build/ops: fix /etc/os-release parsing in install-deps.sh
- | + install-deps.sh: initial distro detection based on /etc/os-release
- + Pull request 12416
- |\
- | + msg: don't truncate message sequence to 32-bits
- + Pull request 12419
- |\
- | + rgw: omap_get_all() fixes
- | + rgw/rgw_rados: do not omap_getvals with (u64)-1 max
- + Pull request 12425
- |\
- | + rbd-mirror: make 'rbd mirror image resync' work after split-brain
- | + qa/workunits/rbd: test_status_in_pool_dir: explicitly check grep return value
- | + rbd-mirror: split-brain issues should be clearly visible in mirror status
- | + rbd-mirror: fix gmock warnings in bootstrap request unit tests
- + Pull request 12426
- |\
- | + rbd: --max_part and --nbds_max options for nbd map
- + Pull request 12428
- |\
- | + radosgw-admin: 'zone placement modify' doesnt require pool names
- | + radosgw-admin: add 'zonegroup placement default' command
- | + radosgw-admin: fix 'placment' typos
- | + rgw_admin: commands to manage placement targets
- | + rgw-admin: add commands to manage zonegroup placement fields
- | + rgw: use set for zonegroup placement target tags
- + Pull request 12529
- |\
- | + rbd: utilize new API methods for image id and block name prefix
- | + librbd: new API methods to retrieve image id and block name prefix
- + Pull request 12542
- |\
- | + rgw: Replacing '+' with %20 in canonical uri for s3 v4 auth.
- + Pull request 12622
- |\
- | + rgw: log name instead of id for SystemMetaObj on failure
- | + rgw: drop unnecessary spacing in rgw zg init log
- + Pull request 12649
- |\
- | + librbd/diff_iterator: use proper snap to query parent overlap
- + Pull request 12677
- |\
- | + mon: OSDMonitor: trigger an immediate propose if any newly down osd is detected during tick()
- + Pull request 12678
- |\
- | + rgw: ldap: simple_bind() should set ldap option on tldap
- + Pull request 12738
- |\
- | + rgw: use explicit flag to cancel RGWCoroutinesManager::run()
- + Pull request 12739
- |\
- | + journal: prevent repetitive error messages after being blacklisted
- | + journal: avoid logging an error when a watch is blacklisted
- + Pull request 12741
- |\
- | + rbd: fix json formatting for image and journal status output
- + Pull request 12753
- |\
- | + librbd: ignore error when object map is already locked by current client
- + Pull request 12754
- |\
- | + rbd-nbd: support partition for rbd-nbd mapped raw block device.
- + Pull request 12755
- |\
- | + rados: optionally support reading omap key from file
- + Pull request 12756
- |\
- | + rbd-nbd: invalid error code for failed to read nbd request messages
- + Pull request 12761
- |\
- | + qa/tasks/admin_socket: subst in repo name
- + Pull request 12764
- |\
- | + rgw: fix decoding of creation_time and last_update.
- + Pull request 12783
- |\
- | + client: don't use special faked-up inode for /..
- + Pull request 12789
- |\
- | + osd: improve error message when FileStore op fails due to EPERM
- + Pull request 12822
- |\
- | + tests: subst repo and branch in qemu test urls
- | + tests: subst branch and repo in qa/tasks/qemu.py
- | + tests: subst repo name in krbd/unmap/tasks/unmap.yaml
- | + tests: subst repo name in qa/tasks/cram.py
- | + cram: support fetching from sha1 branch, tag, commit hash
- + Pull request 12875
- |\
- | + osd/PG: publish PG stats when backfill-related states change
- + Pull request 12890
- |\
- | + librados: blacklist_add should wait for latest OSD map
- | + librbd: prevent assertion failure when journal IO is blacklisted
- | + librbd: ignore blacklist error when releasing exclusive lock
- | + librbd: fail immediately if the exclusive lock cannot be acquired
- | + librbd: add new lock_get_owners / lock_break_lock API methods
- | + librbd: separate break lock logic into standalone state machine
- | + librbd: separate locker query into standalone state machine
- | + librbd/exclusive_lock/AcquireRequest.cc: init lock_type
- | + librbd: API methods to directly acquire and release the exclusive lock
- | + Merge branch 'wip-17134-jewel' of https://github.com/dillaman/ceph into wip-18453-jewel
- | + rbd-mirror: fix error messages formatting
- | + librbd: ignore partial refresh error when acquiring exclusive lock
- | + librbd: potential seg fault when blacklisting an image client
- | + librbd: potential double-unwatch of watch handle upon error
- | + librbd: deadlock when replaying journal during image open
- | + librbd: improve image state machine debug log messages
- | + librbd: remove unused refresh request logic
- | + librbd: interlock image refresh and lock operations
- | + librbd: image state machine now has hooks for lock requests
- | + librbd: integrate asynchronous image rewatch state machine
- | + librbd: helper state machine for asynchronous watch recovery
- | + librbd: exclusive lock now supports reacquiring a lost lock
- | + librbd: store exclusive lock cookie instead of recalculating
- | + librbd: helper state machine to update lock cookie
- | + cls_lock: support updating the lock cookie without releasing the lock
- + Pull request 12909
- |\
- | + librbd: block concurrent in-flight object map updates for the same object
- | + librbd: new block guard helper to prevent concurrent IO to blocks
- | + librbd: convert ObjectMap to template for unit testing
- | + librbd: clean up object map update interface
- | + librbd: update in-memory object map after on-disk update committed
- + Pull request 12917
- |\
- | + rgw: Handle multiple listening addreses w/ optional ssl correctly with civetweb.
- | + rgw: s3: secure_port should override port, also apply ssl default right.
- | + rgw: Get civetweb ssl enhancement: wip-listen3 = mg_get_local_addr
- | + rgw: Document that radosgw now supports SSL.
- | + rgw: civetweb/openssl: automagic: load libssl.so and libcrypto.so by soname.
- | + rgw: civetweb/openssl: Load libssl.so and libcrypto.so by soname.
- | + rgw: cmake: remove useless civetweb include path side effect.
- + Pull request 12921
- |\
- | + mds/server: skip unwanted dn in handle_client_readdir
- + Pull request 12978
- |\
- | + ReplicatedBackend: take read locks for clone sources during recovery
- + Pull request 12997
- |\
- | + rgw_rados: add guard assert in add_io()
- | + rgw_rados: sanitize dout print in GWRados::get_obj_iterate_cb
- + Pull request 13001
- |\
- | + rgw: RGWAsyncRadosRequest drops notifier ref on cancel
- | + rgw: remove circular reference in RGWAsyncRadosRequest
- | + rgw: release RGWAioCompletionNotifier refs on destruction
- + Pull request 13025
- |\
- | + Ceph-disk to use correct user in check_journal_req
- + Pull request 13029
- |\
- | + client/Client.cc: prevent segfaulting
- + Pull request 13040
- |\
- | + tests: run fs/thrash on xfs instead of btrfs
- + Pull request 13043
- |\
- | + Doc: Fixes Python Swift client commands
- + Pull request 13045
- |\
- | + mon: do not send duplicated osdmap msg to not sync'ed osd
- + Pull request 13047
- |\
- | + test: Add test support for deep-scrub
- | + osd: Add trigger_scrub admin socket command
- | + test: Add test for keeping deep-scrub information
- | + osd: When deep-scrub errors present upgrade regular scrubs
- + Pull request 13048
- |\
- | + selinux: Allow ceph to manage tmp files
- + Pull request 13049
- |\
- | + doc: document hostname constraints for rados bench
- + Pull request 13050
- |\
- | + qa/distros: centos_7.yaml -> centos.yaml
- | + qa/suites: centos_7.2.yaml -> centos_7.yaml
- | + qa/distros: add centos 7.3
- | + qa/distros: add centos 7 yaml; use that instead
- + Pull request 13062
- |\
- | + msg/simple: clear_pipe when wait() is mopping up pipes
- + Pull request 13104
- + tasks/rbd_fio: unmap rbd devices on cleanup
- + tasks/rbd_fio: don't use sudo unnecessarily
#36 Updated by Loïc Dachary over 6 years ago
Upgrade¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x --ceph wip-jewel-backports --machine-type vps --priority 1000
- cannot be scheduled
- IOError: /home/smithfarm/src/git.ceph.com_ceph-c_wip-jewel-backports/qa/suites/upgrade/jewel-x/point-to-point-x/distros/centos.yaml does not exist (abs /home/smithfarm/src/git.ceph.com_ceph-c_wip-jewel-backports/qa/suites/upgrade/jewel-x/point-to-point-x/distros/centos.yaml)
- added a commit to https://github.com/ceph/ceph/pull/13050
- pushed wip-18406-jewel to Shaman so we can see if this one commit is sufficient or if more are needed
- IOError: /home/smithfarm/src/git.ceph.com_ceph-c_wip-jewel-backports/qa/suites/upgrade/jewel-x/point-to-point-x/distros/centos.yaml does not exist (abs /home/smithfarm/src/git.ceph.com_ceph-c_wip-jewel-backports/qa/suites/upgrade/jewel-x/point-to-point-x/distros/centos.yaml)
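A quick way to confirm what the scheduler will actually find before re-scheduling (path taken verbatim from the IOError above; a sketch, assuming the same checkout):
# List the distro yamls present in the checked-out suite branch; the
# scheduler fails with IOError when the one it wants is missing.
ls /home/smithfarm/src/git.ceph.com_ceph-c_wip-jewel-backports/qa/suites/upgrade/jewel-x/point-to-point-x/distros/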
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --ceph wip-jewel-backports --machine-type vps --priority 1000
- fail http://pulpito.ceph.com/loic-2017-01-26_22:13:59-upgrade:hammer-x-wip-jewel-backports-distro-basic-vps
- known bug http://tracker.ceph.com/issues/18089
- all the upgrade:hammer-x/f-h-x-offline jobs that got run on CentOS
- opened https://github.com/ceph/ceph/pull/13153 to fix
- known bug http://tracker.ceph.com/issues/18089
#37 Updated by Loïc Dachary over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --ceph wip-jewel-backports --machine-type vps --priority 1000
- fail http://pulpito.ceph.com/loic-2017-01-26_22:12:12-ceph-disk-wip-jewel-backports-distro-basic-vps
- infrastructure noise: Downburst failed on ubuntu@vpm051.front.sepia.ceph.com: libvirt: XML-RPC error : Cannot recv data: ssh: connect to host
Re-running the failed job:
- fail http://pulpito.ceph.com/smithfarm-2017-01-27_14:51:25-ceph-disk-wip-jewel-backports-distro-basic-vps/
- known bug http://tracker.ceph.com/issues/18416
- fixed by cherry-picking a patch into https://github.com/ceph/ceph/pull/13050
- known bug http://tracker.ceph.com/issues/18416
#38 Updated by Loïc Dachary over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports -k distro -m smithi -s powercycle -p 1000 --email loic@dachary.org
- fail http://pulpito.ceph.com/loic-2017-01-26_22:11:16-powercycle-wip-jewel-backports-distro-basic-smithi
- three jobs failed, out of 28
Re-running 3 failed jobs:
#39 Updated by Loïc Dachary over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-26_22:04:20-fs-wip-jewel-backports-distro-basic-smithi
- saw valgrind issues (3 failed jobs)
- cephfs_java test failure (1 failed job; can probably be ignored)
Re-running 4 failed jobs:
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-28_11:44:02-fs-wip-jewel-backports-distro-basic-smithi/
- saw valgrind issues (2 failed jobs)
#40 Updated by Loïc Dachary over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-26_22:03:35-rgw-wip-jewel-backports-distro-basic-smithi
- "saw valgrind issues" new bug http://tracker.ceph.com/issues/18744 (the daemons in the notcmalloc build are linked with libtcmalloc) (17 failed jobs)
- clock skew (1 failed job)
Re-running 18 failed jobs:
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-28_11:40:26-rgw-wip-jewel-backports-distro-basic-smithi/ - 11 failed jobs
- "saw valgrind issues" new bug http://tracker.ceph.com/issues/18744 (the daemons in the notcmalloc build are linked with libtcmalloc) (11 failed jobs)
Found and staged pending RGW valgrind-related backports:
- https://github.com/ceph/ceph/pull/13163
- https://github.com/ceph/ceph/pull/13178
- https://github.com/ceph/ceph/pull/13179
Hopefully that will clear up the valgrind issues in the next integration run.
NEWS FLASH: rgw valgrind issues are also appearing in hammer QE testing, but only on CentOS. Hammer testing is done on CentOS 7.3 and Ubuntu 14.04 only, and upon closer examination none of the jobs with valgrind issues ran on Ubuntu 14.04.
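Since http://tracker.ceph.com/issues/18744 hinges on whether the notcmalloc daemons are really linked against libtcmalloc, a quick check on any test node is (a sketch; the packaged binary paths are assumptions):
# A notcmalloc build should show no tcmalloc linkage here.
for bin in /usr/bin/radosgw /usr/bin/ceph-osd; do
    echo "== $bin"
    ldd "$bin" | grep -i tcmalloc || echo "no tcmalloc linkage"
done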
#41 Updated by Loïc Dachary over 6 years ago
rbd¶
teuthology-suite --priority 1000 --suite rbd --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-01-26_22:02:45-rbd-wip-jewel-backports---basic-smithi
- known bug, possible regression?? http://tracker.ceph.com/issues/17695 supposedly already fixed in jewel long ago by https://github.com/ceph/ceph/pull/11644
Re-running the one failed job:
#42 Updated by Loïc Dachary over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 50)/50 --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
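For reference, --subset $(expr $RANDOM % 50)/50 schedules only one randomly chosen 1/50 slice of the full rados matrix; a minimal sketch of what the shell substitution produces:
# teuthology-suite interprets the resulting "<index>/<divisor>" string and
# runs only that slice of the suite.
divisor=50
index=$(expr $RANDOM % $divisor)
echo "--subset ${index}/${divisor}"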
- fail http://pulpito.ceph.com/loic-2017-01-26_22:01:29-rados-wip-jewel-backports-distro-basic-smithi
- known bug http://tracker.ceph.com/issues/18089 - opened https://github.com/ceph/ceph/pull/13170 to drop these two problematic tests
- saw valgrind issues
- rados/verify/{1thrash/default.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr-failures/few.yaml msgr/async.yaml rados.yaml tasks/rados_cls_all.yaml validater/valgrind.yaml}
- rados/verify/{1thrash/none.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr-failures/few.yaml msgr/random.yaml rados.yaml tasks/rados_api_tests.yaml validater/valgrind.yaml}
- "Regular scrub request, losing deep-scrub details" in cluster log - already fixed by adding a commit to https://github.com/ceph/ceph/pull/13047
- new bug http://tracker.ceph.com/issues/18719 - cluster gets stuck in "HEALTH_WARN all OSDs are running jewel or later but the 'require_jewel_osds' osdmap flag is not set" after upgrading from hammer (see the note after this list)
- infrastructure noise: Could not reconnect to ubuntu@smithi016.front.sepia.ceph.com (host failed to come back from 'sudo shutdown -r now')
- clock skew: "mon.0 172.21.15.38:6789/0 5 : cluster [WRN] message from mon.2 was stamped 9.536220s in the future, clocks not synchronized" in cluster log
- first dead job: restart of last upgraded MON never completes
- second dead job: rados task (under thrashosds) never completes; log output suddenly stops and the job is killed 10 hours later
- third dead job: rados task (under thrashosds) fails to complete before the thrashosds timeout expires
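Note on http://tracker.ceph.com/issues/18719 above: the HEALTH_WARN clears once the flag is recorded in the osdmap. A minimal sketch of the manual workaround (assumes admin keyring access on a cluster node):
# Once every OSD in the cluster runs jewel, set the flag; the warning
# disappears after the new osdmap propagates.
ceph osd set require_jewel_osds
ceph health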
Re-running the jobs that stand a chance of succeeding on re-run:
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-28_11:36:51-rados-wip-jewel-backports-distro-basic-smithi/
- saw valgrind issues (1 failed job)
- problem with hammer upgrade rados/upgrade/{hammer-x-singleton/{0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/{rbd-cls.yaml rbd-import-export.yaml readwrite.yaml snaps-few-objects.yaml} 6-next-mon/monb.yaml 7-workload/{radosbench.yaml rbd_api.yaml} 8-next-mon/monc.yaml 9-workload/{ec-rados-plugin=jerasure-k=3-m=1.yaml rbd-python.yaml rgw-swift.yaml snaps-many-objects.yaml test_cache-pool-snaps.yaml}} rados.yaml}
Re-running with the fix from PR#13161
teuthology-suite -k distro --priority 101 --suite rados --email ncutler@suse.com --machine-type smithi --ceph wip-lfn-upgrade-hammer --filter="rados/upgrade/{hammer-x-singleton/{0-cluster/{openstack.yaml start.yaml} 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/{rbd-cls.yaml rbd-import-export.yaml readwrite.yaml snaps-few-objects.yaml} 6-next-mon/monb.yaml 7-workload/{radosbench.yaml rbd_api.yaml} 8-next-mon/monc.yaml 9-workload/{ec-rados-plugin=jerasure-k=3-m=1.yaml rbd-python.yaml rgw-swift.yaml snaps-many-objects.yaml test_cache-pool-snaps.yaml}} rados.yaml}"
Re-running the upgrade jobs to test David Galloway's fix from http://tracker.ceph.com/issues/18089#note-12
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-29_02:56:52-rados-wip-jewel-backports-distro-basic-smithi/
- opened https://github.com/ceph/ceph/pull/13170 to drop these two problematic tests
#43 Updated by Nathan Cutler over 6 years ago
rados baseline (jewel)¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 50)/50 --email loic@dachary.org --ceph jewel --machine-type smithi --ceph-repo http://github.com/ceph/ceph.git --suite-repo http://github.com/ceph/ceph.git
- running http://pulpito.ceph.com:80/smithfarm-2017-01-29_03:27:21-rados-jewel-distro-basic-smithi/
- known bug http://tracker.ceph.com/issues/18089
- David Galloway is helping get the right builds on the chacra hosts
- re-testing the failed jobs at http://tracker.ceph.com/issues/18089
- valgrind issues
- selinux issue
- segfault
- known bug http://tracker.ceph.com/issues/18089
#44 Updated by Nathan Cutler over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
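For readability, the same pipeline split out with comments (a sketch; the regexes are identical to the one-liner above, so the output below is unchanged):
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports |
  perl -p -e '
    s/"/ /g;                                   # blank out double quotes so they cannot break the wiki links
    if (/\w+\s+Merge pull request #(\d+)/) {   # merge commits become "Pull request N" links
      s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|;
    } else {                                   # everything else becomes the commit subject linked to its sha1
      s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|;
    }
    s/\*/+/;                                   # turn the git graph asterisk into a "+" bullet
    s/^/* /;                                   # and prefix each line as a wiki list item
  '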
- + Pull request 10661
- |\
- | + rgw: Have a flavor of bucket deletion to bypass GC and to trigger object deletions async.
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11497
- |\
- | + rgw: settle /info implementation across other swift-at-root features.
- | + swift /info implementation.
- | + rgw: add support for the healthcheck feature of Swift API.
- | + rgw: add support for the crossdomain.xml resource of Swift API.
- | + rgw: fix the handling of rgw_swift_url_prefix.
- | + rgw_http_errors: add http error code for 503
- | + rgw: Allow to serve Swift off the URL root
- + Pull request 11508
- |\
- | + utime.h: fix timezone issue in round_to_* funcs.
- + Pull request 11759
- |\
- | + rgw: json encode/decode index_type, allow modification
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11947
- |\
- | + mon: update mon(peon)'s down_pending_out when osd up
- + Pull request 11990
- |\
- | + rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now
- + Pull request 11991
- |\
- | + osd: limit omap data in push op
- + Pull request 11998
- |\
- | + FileStore::_do_fiemap: do not reference fiemap after it is freed
- | + test: add test for fiemap xfs issue when #extents > 1364
- | + FileStore:: fix fiemap issue in xfs when #extents > 1364
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12045
- |\
- | + rgw_file: fix spurious mount entries w/Linux NFS client
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12153
- |\
- | + mds: ignore 'session evict' when mds is replaying log
- + Pull request 12156
- |\
- | + rgw: look for region_map in rgw_region_root_pool
- | + rgw: region conversion respects pre-existing rgw_region_root_pool
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12258
- |\
- | + rgw: add support for the prefix parameter in account listing of Swift API.
- | + rgw: optimize out ctor-copy in RGWListBuckets_ObjStore_SWIFT.
- + Pull request 12291
- |\
- | + msg/simple/Pipe: handle addr decode error
- + Pull request 12302
- |\
- | + tests: check hostname --fqdn sanity before running cmake check
- | + tests: check hostname --fqdn sanity before running make check
- + Pull request 12313
- |\
- | + rgw: only set CURLOPT_UPLOAD for PUT/POST requests
- + Pull request 12314
- |\
- | + rgw: RGWBucketSyncStatusManager uses existing async_rados
- + Pull request 12315
- |\
- | + rgw: fix missing master zone for a single zone zonegroup
- + Pull request 12316
- |\
- | + rgw: add recovery procedure for upgrade to older version of jewel
- + Pull request 12320
- |\
- | + rgw: fix for versioned delete_multi_object
- | + rgw:fix for deleting objects name beginning and ending with underscores of one bucket using POST method of AWS's js sdk. Fixes: http://tracker.ceph.com/issues/17888
- + Pull request 12322
- |\
- | + librbd: diffs to clone's first snapshot should include parent diffs
- + Pull request 12323
- |\
- | + librbd: account m_processing when failing request after refresh
- + Pull request 12324
- |\
- | + mds: force client flush snap data before truncating objects
- + Pull request 12380
- |\
- | + os/ObjectStore: properly clone object map when replaying OP_COLL_MOVE_RENAME
- | + os/ObjectStore: properly clear object map when replaying OP_REMOVE
- + Pull request 12397
- |\
- | + rgw: do not abort when accept a CORS request with short origin
- + Pull request 12405
- |\
- | + install-deps.sh: unify indentation in case statement
- | + install-deps.sh: allow building on SLES systems
- | + build/ops: fix /etc/os-release parsing in install-deps.sh
- | + install-deps.sh: initial distro detection based on /etc/os-release
- + Pull request 12416
- |\
- | + msg: don't truncate message sequence to 32-bits
- + Pull request 12419
- |\
- | + rgw: omap_get_all() fixes
- | + rgw/rgw_rados: do not omap_getvals with (u64)-1 max
- + Pull request 12425
- |\
- | + rbd-mirror: fix gmock warnings in bootstrap request unit tests
- | + qa/workunits/rbd: test_status_in_pool_dir: explicitly check grep return value
- | + rbd-mirror: make 'rbd mirror image resync' work after split-brain
- | + rbd-mirror: split-brain issues should be clearly visible in mirror status
- | + qa/workunits/rbd: use image id when probing for image presence
- | + qa/workunits/rbd: check status also in pool dir after asok commands
- | + qa/workunits/rbd: wait for image deleted before checking health
- | + qa/workunits/rbd: wait for image deleted before checking health
- | + qa/workunits/rbd: small fixup and improvements for rbd-mirror tests
- + Pull request 12426
- |\
- | + rbd: --max_part and --nbds_max options for nbd map
- + Pull request 12428
- |\
- | + radosgw-admin: 'zone placement modify' doesnt require pool names
- | + radosgw-admin: add 'zonegroup placement default' command
- | + radosgw-admin: fix 'placment' typos
- | + rgw_admin: commands to manage placement targets
- | + rgw-admin: add commands to manage zonegroup placement fields
- | + rgw: use set for zonegroup placement target tags
- + Pull request 12529
- |\
- | + rbd: utilize new API methods for image id and block name prefix
- | + librbd: new API methods to retrieve image id and block name prefix
- + Pull request 12542
- |\
- | + rgw: Replacing '+' with %20 in canonical uri for s3 v4 auth.
- + Pull request 12622
- |\
- | + rgw: log name instead of id for SystemMetaObj on failure
- | + rgw: drop unnecessary spacing in rgw zg init log
- + Pull request 12649
- |\
- | + librbd/diff_iterator: use proper snap to query parent overlap
- + Pull request 12677
- |\
- | + mon: OSDMonitor: trigger an immediate propose if any newly down osd is detected during tick()
- + Pull request 12678
- |\
- | + rgw: ldap: simple_bind() should set ldap option on tldap
- + Pull request 12738
- |\
- | + rgw: use explicit flag to cancel RGWCoroutinesManager::run()
- + Pull request 12739
- |\
- | + journal: prevent repetitive error messages after being blacklisted
- | + journal: avoid logging an error when a watch is blacklisted
- + Pull request 12741
- |\
- | + rbd: fix json formatting for image and journal status output
- + Pull request 12753
- |\
- | + librbd: ignore error when object map is already locked by current client
- + Pull request 12755
- |\
- | + rados: optionally support reading omap key from file
- + Pull request 12756
- |\
- | + rbd-nbd: invalid error code for failed to read nbd request messages
- + Pull request 12761
- |\
- | + qa/tasks/admin_socket: subst in repo name
- + Pull request 12764
- |\
- | + rgw: fix decoding of creation_time and last_update.
- + Pull request 12783
- |\
- | + client: don't use special faked-up inode for /..
- + Pull request 12789
- |\
- | + osd: improve error message when FileStore op fails due to EPERM
- + Pull request 12822
- |\
- | + tests: subst repo and branch in qemu test urls
- | + tests: subst branch and repo in qa/tasks/qemu.py
- | + tests: subst repo name in krbd/unmap/tasks/unmap.yaml
- | + tests: subst repo name in qa/tasks/cram.py
- | + cram: support fetching from sha1 branch, tag, commit hash
- + Pull request 12875
- |\
- | + osd/PG: publish PG stats when backfill-related states change
- + Pull request 12890
- |\
- | + librados: blacklist_add should wait for latest OSD map
- | + librbd: prevent assertion failure when journal IO is blacklisted
- | + librbd: ignore blacklist error when releasing exclusive lock
- | + librbd: fail immediately if the exclusive lock cannot be acquired
- | + librbd: add new lock_get_owners / lock_break_lock API methods
- | + librbd: separate break lock logic into standalone state machine
- | + librbd: separate locker query into standalone state machine
- | + librbd/exclusive_lock/AcquireRequest.cc: init lock_type
- | + librbd: API methods to directly acquire and release the exclusive lock
- | + Merge branch 'wip-17134-jewel' of https://github.com/dillaman/ceph into wip-18453-jewel
- | + rbd-mirror: fix error messages formatting
- | + librbd: ignore partial refresh error when acquiring exclusive lock
- | + librbd: potential seg fault when blacklisting an image client
- | + librbd: potential double-unwatch of watch handle upon error
- | + librbd: deadlock when replaying journal during image open
- | + librbd: improve image state machine debug log messages
- | + librbd: remove unused refresh request logic
- | + librbd: interlock image refresh and lock operations
- | + librbd: image state machine now has hooks for lock requests
- | + librbd: integrate asynchronous image rewatch state machine
- | + librbd: helper state machine for asynchronous watch recovery
- | + librbd: exclusive lock now supports reacquiring a lost lock
- | + librbd: store exclusive lock cookie instead of recalculating
- | + librbd: helper state machine to update lock cookie
- | + cls_lock: support updating the lock cookie without releasing the lock
- + Pull request 12909
- |\
- | + librbd: block concurrent in-flight object map updates for the same object
- | + librbd: new block guard helper to prevent concurrent IO to blocks
- | + librbd: convert ObjectMap to template for unit testing
- | + librbd: clean up object map update interface
- | + librbd: update in-memory object map after on-disk update committed
- + Pull request 12917
- |\
- | + rgw: Handle multiple listening addreses w/ optional ssl correctly with civetweb.
- | + rgw: s3: secure_port should override port, also apply ssl default right.
- | + rgw: Get civetweb ssl enhancement: wip-listen3 = mg_get_local_addr
- | + rgw: Document that radosgw now supports SSL.
- | + rgw: civetweb/openssl: automagic: load libssl.so and libcrypto.so by soname.
- | + rgw: civetweb/openssl: Load libssl.so and libcrypto.so by soname.
- | + rgw: cmake: remove useless civetweb include path side effect.
- + Pull request 12921
- |\
- | + mds/server: skip unwanted dn in handle_client_readdir
- + Pull request 12978
- |\
- | + PrimaryLogPG::try_lock_for_read: give up if missing
- | + ReplicatedBackend: take read locks for clone sources during recovery
- + Pull request 12997
- |\
- | + rgw_rados: add guard assert in add_io()
- | + rgw_rados: sanitize dout print in GWRados::get_obj_iterate_cb
- + Pull request 13001
- |\
- | + rgw: RGWAsyncRadosRequest drops notifier ref on cancel
- | + rgw: remove circular reference in RGWAsyncRadosRequest
- | + rgw: release RGWAioCompletionNotifier refs on destruction
- + Pull request 13025
- |\
- | + Ceph-disk to use correct user in check_journal_req
- + Pull request 13029
- |\
- | + client/Client.cc: prevent segfaulting
- + Pull request 13040
- |\
- | + tests: run fs/thrash on xfs instead of btrfs
- + Pull request 13045
- |\
- | + mon: do not send duplicated osdmap msg to not sync'ed osd
- + Pull request 13047
- |\
- | + test: Update for new error message when doing scrub with deep-scrub errors
- | + test: Add test support for deep-scrub
- | + osd: Add trigger_scrub admin socket command
- | + test: Add test for keeping deep-scrub information
- | + osd: When deep-scrub errors present upgrade regular scrubs
- + Pull request 13048
- |\
- | + selinux: Allow ceph to manage tmp files
- + Pull request 13049
- |\
- | + doc: document hostname constraints for rados bench
- + Pull request 13050
- |\
- | + tests: explicitly use centos 7.3 in distros/supported
- | + qa: fixed distros links
- | + qa/distros: centos_7.yaml -> centos.yaml
- | + qa/suites: centos_7.2.yaml -> centos_7.yaml
- | + qa/distros: add centos 7.3
- | + qa/distros: add centos 7 yaml; use that instead
- + Pull request 13062
- |\
- | + msg/simple: clear_pipe when wait() is mopping up pipes
- + Pull request 13104
- |\
- | + tasks/rbd_fio: unmap rbd devices on cleanup
- | + tasks/rbd_fio: don't use sudo unnecessarily
- + Pull request 13113
- |\
- | + mds: finish clientreplay requests before requesting active state
- + Pull request 13115
- |\
- | + mon/OSDMonitor: set last_force_op_resend on overlay pool too
- + Pull request 13117
- |\
- | + mon/OSDMonitor: only show interesting flags in health warning
- + Pull request 13118
- |\
- | + librados: Memory leaks in object_list_begin and object_list_end
- + Pull request 13119
- |\
- | + qa/tasks: add test_open_ino_errors
- | + mds: propagate error encountered during opening inode by number
- + Pull request 13120
- |\
- | + client: fix Client::handle_cap_flushsnap_ack() crash
- + Pull request 13123
- |\
- | + mon/MDSMonitor: fix iterating over mutated map
- | + mon: use clearer code structure
- + Pull request 13125
- |\
- | + cephfs: fix missing ll_get for ll_walk
- + Pull request 13126
- |\
- | + mds: fix dropping events in standby replay
- + Pull request 13128
- |\
- | + journal: don't hold future lock during assignment
- + Pull request 13129
- |\
- | + rbd: bench-write should return error if io-size >= 4G
- + Pull request 13131
- |\
- | + OSDMonitor: clear jewel+ feature bits when talking to Hammer OSD
- + Pull request 13139
- |\
- | + src/mds: fix MDSMap upgrade decoding
- | + mds: use FSMap::insert to add to standby_daemons
- + Pull request 13143
- |\
- | + radosgw/swift: clean up flush / newline behavior.
- + Pull request 13153
- |\
- | + tests: upgrade: install firefly only on Ubuntu 14.04
- + Pull request 13155
- |\
- | + rbd-mirror: avoid processing new events after stop requested
- + Pull request 13156
- |\
- | + librbd: don't remove an image w/ incompatible features
- + Pull request 13157
- |\
- | + rbd: enabling/disabling rbd feature should report missing dependency
- + Pull request 13161
- |\
- | + tests: add require_jewel_osds before upgrading last hammer node
- + Pull request 13168
- |\
- | + librbd: metadata_set API operation should not change global config setting
- + Pull request 13170
- |\
- | + tests: drop old rados singleton upgrade tests
- + Pull request 13171
- |\
- | + rgw: clear master_zonegroup when reseting RGWPeriodMap
- + Pull request 13173
- |\
- | + rgw: complete versioning enablement after sending it to meta master
- + Pull request 13175
- |\
- | + rgw_admin: read master log shards from master's current period
- | + rgw: allow getting master log shards info on specified period
- | + rgw_admin: get master's period from store's current period info
- + Pull request 13177
- |\
- | + rgw_file: add timed namespace invalidation
- + Pull request 13178
- |\
- | + rgw_rados: sanitize dout print in GWRados::get_obj_iterate_cb
- | + rgw_rados: add guard assert in add_io()
- + Pull request 13179
- |\
- | + rgw: RGWAsyncRadosRequest drops notifier ref on cancel
- | + rgw: remove circular reference in RGWAsyncRadosRequest
- | + rgw: release RGWAioCompletionNotifier refs on destruction
- + Pull request 13180
- |\
- | + rgw: fix off-by-one in RGWDataChangesLog::get_info
- + Pull request 13182
- |\
- | + radosgw-admin: check for name mistmatch in realm set
- | + radosgw-admin: relam set can use input redirection
- | + radosgw-admin: realm set should create a new realm
- + Pull request 13183
- |\
- | + build/ops: add libldap dependency for RGW
- + Pull request 13184
- + systemd: Restart Mon after 10s in case of failure
#45 Updated by Nathan Cutler over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 50)/50 --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-30_11:11:11-rados-wip-jewel-backports-distro-basic-smithi/
- false positive saw valgrind issues
- 764136: false positive (tcmalloc)
- known intermittent bug, succeeded on re-run http://tracker.ceph.com/issues/16236 ("racing read got wrong version")
- 764195
- known intermittent bug http://tracker.ceph.com/issues/16943 Error ENOENT: unrecognized pool '.rgw.control' (CRITICAL:root:IOError) in s3tests-test-readwrite under thrashosds
- 764231: rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/xfs.yaml hobj-sort.yaml msgr-failures/osd-delay.yaml msgr/async.yaml rados.yaml thrashers/morepggrow.yaml workloads/rgw_snaps.yaml}
- problem with the tests, now being fixed in lfn-upgrade-{hammer,infernalis}.yaml
- 764236 "rados/singleton-nomsgr/{all/lfn-upgrade-hammer.yaml rados.yaml}" - "need more than 0 values to unpack" - appears to be a regression caused by smithfarm stupidity in PR#13161, fixed and re-pushed
- 764258 "rados/singleton-nomsgr/{all/lfn-upgrade-infernalis.yaml rados.yaml}" - "'wait_until_healthy' reached maximum tries (150) after waiting for 900 seconds" - ceph osd set require_jewel_osds does not get run - same stupid smithfarm mistake
- false positive saw valgrind issues
Re-running the first three failures:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-30_19:58:05-rados-wip-jewel-backports-distro-basic-smithi/
- false positive saw valgrind issues (tcmalloc) in the same test as before
- low-priority bug http://tracker.ceph.com/issues/18739 FAILED assert(0 == "out of order op")
tcmalloc valgrind failure¶
Re-running the "tcmalloc valgrind" test 10 times:
filter="rados/verify/{1thrash/default.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr-failures/few.yaml msgr/simple.yaml rados.yaml tasks/rados_cls_all.yaml validater/valgrind.yaml}" ./virtualenv/bin/teuthology-suite -k distro --priority 101 --suite rados --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi --num 10 --filter="$filter"
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-30_22:28:13-rados-wip-jewel-backports-distro-basic-smithi/
- reproduces 7 times out of 10
- not a blocker
16236 ("racing read got wrong version")¶
Succeeded on re-run.
s3tests-test-readwrite dies under thrashosds, out of order op¶
Test: rados/thrash/{0-size-min-size-overrides/3-size-2-min-size.yaml 1-pg-log-overrides/short_pg_log.yaml clusters/{fixed-2.yaml openstack.yaml} fs/xfs.yaml hobj-sort.yaml msgr-failures/osd-delay.yaml msgr/async.yaml rados.yaml thrashers/morepggrow.yaml workloads/rgw_snaps.yaml}
This test failed twice in two different ways.
Re-running 10 times:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_06:30:59-rados-wip-jewel-backports-distro-basic-smithi/
- two failed jobs
- eight successful jobs
(11:15:51 AM) owasserm: smithfarm, I looked at the failures and the s3_readwrite looks like a timeout issue (it happens after xfs injected stalls)
Conclusion: not a blocker
lfn-upgrade-{hammer,infernalis}.yaml failures¶
Re-running the last two failures with fixed PR branch (after prolonged trial-and-error):
teuthology-suite -k distro --priority 101 --suite rados --email ncutler@suse.com --ceph wip-lfn-upgrade-hammer --machine-type smithi --filter="rados/singleton-nomsgr/{all/lfn-upgrade-hammer.yaml rados.yaml},rados/singleton-nomsgr/{all/lfn-upgrade-infernalis.yaml rados.yaml}"
teuthology-suite -k distro --priority 101 --suite rados --email ncutler@suse.com --ceph wip-lfn-upgrade-hammer --machine-type vps --filter="rados/singleton-nomsgr/{all/lfn-upgrade-hammer.yaml rados.yaml},rados/singleton-nomsgr/{all/lfn-upgrade-infernalis.yaml rados.yaml}"
And again with --num 8
#46 Updated by Nathan Cutler over 6 years ago
Upgrade¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x --ceph wip-jewel-backports --machine-type vps --priority 1000
- fail http://pulpito.ceph.com/smithfarm-2017-01-30_12:04:19-upgrade:jewel-x-wip-jewel-backports-distro-basic-vps/
- "os_type: centos" without os_version causes jobs to fail on VPS, added another commit to https://github.com/ceph/ceph/pull/13050 to address this
- new bug, probably infrastructure noise http://tracker.ceph.com/issues/18733
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --ceph wip-jewel-backports --machine-type vps --priority 1000
- fail http://pulpito.ceph.com/smithfarm-2017-01-30_12:05:09-upgrade:hammer-x-wip-jewel-backports-distro-basic-vps/
- known bug http://tracker.ceph.com/issues/18089
CONCLUSION: nothing more to do here, needs a new integration branch
#47 Updated by Nathan Cutler over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --ceph wip-jewel-backports --machine-type vps --priority 1000
#48 Updated by Nathan Cutler over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports -k distro -m smithi -s powercycle -p 1000 --email loic@dachary.org
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-30_12:07:01-powercycle-wip-jewel-backports-distro-basic-smithi/
- unknown bug journal FileJournal::write_bl : write_fd failed: (28) No space left on device
- powercycle/osd/{clusters/3osd-1per-target.yaml fs/btrfs.yaml powercycle/default.yaml tasks/cfuse_workunit_kernel_untar_build.yaml}
- /a/smithfarm-2017-01-30_12:07:01-powercycle-wip-jewel-backports-distro-basic-smithi/765012
- unknown bug journal FileJournal::write_bl : write_fd failed: (28) No space left on device
Re-running the failed job 10 times:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_06:47:46-powercycle-wip-jewel-backports-distro-basic-smithi/
- four jobs passed
- four jobs failed
- two jobs (probably) died
Since there was just one failure in the entire suite, and that job only fails about 60% of the time, the overall run is ruled a pass
#49 Updated by Nathan Cutler over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-30_12:08:10-fs-wip-jewel-backports-distro-basic-smithi/
- "saw valgrind issues" (3 failed jobs)
- 84 successful jobs
Since the valgrind issues are apparently benign, ruled a pass
#50 Updated by Nathan Cutler over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-30_12:10:53-rgw-wip-jewel-backports-distro-basic-smithi/
- "saw valgrind issues" new bug http://tracker.ceph.com/issues/18744 (the daemons in the notcmalloc build are linked with libtcmalloc) (18 of 21 failed jobs)
- No summary info found for user: foo
- new bug ERROR: test suite for <module 's3tests.functional' from '/home/ubuntu/cephtest/s3-tests/s3tests/functional/__init__.py'>
- ansible: failed to update apt-cache
Re-running all the failed jobs, expecting a largish number of tcmalloc-related valgrind failures:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_12:35:14-rgw-wip-jewel-backports-distro-basic-smithi/
- "saw valgrind issues" new bug http://tracker.ceph.com/issues/18744 (the daemons in the notcmalloc build are linked with libtcmalloc) (six failed jobs)
- no other failures, ruled a pass
Re-running just "rgw_s3tests.yaml lockdep.yaml" to try to reproduce the s3tests.functional failure:
teuthology-suite -k distro --priority 1000 --suite rgw/verify --ceph wip-jewel-backports --machine-type smithi --email ncutler@suse.com --filter="tasks/rgw_s3tests.yaml validater/lockdep.yaml" --num 5
#51 Updated by Nathan Cutler over 6 years ago
rbd¶
teuthology-suite -k distro --priority 101 --suite rbd --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-01-30_12:16:19-rbd-wip-jewel-backports-distro-basic-smithi/
- newly reported, known intermittent bug http://tracker.ceph.com/issues/18731
- dead job, known bug http://tracker.ceph.com/issues/16263
failed job¶
Re-running the single failed job:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-30_19:52:39-rbd-wip-jewel-backports-distro-basic-smithi/
- newly reported, but known bug http://tracker.ceph.com/issues/18739 FAILED assert(0 == "out of order op")
Re-running it again with --num 10:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-30_22:32:27-rbd-wip-jewel-backports-distro-basic-smithi/
- 9 pass, 1 fail
- known bug, not a blocker http://tracker.ceph.com/issues/18731
dead job¶
Re-running with --num 10:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-30_22:56:59-rbd-wip-jewel-backports-distro-basic-smithi/
- two dead jobs (thrashosds failed to recover before timeout expired)
- eight successful jobs
conclusion¶
Ruled a pass
#52 Updated by Nathan Cutler over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 10661
- |\
- | + rgw: Have a flavor of bucket deletion to bypass GC and to trigger object deletions async.
- + Pull request 11470
- |\
- | + rgw:bucket check remove multipart prefix
- + Pull request 11476
- |\
- | + rgw: RGWCoroutinesManager::run returns status of last cr
- + Pull request 11477
- |\
- | + rgw: fix for assertion in RGWMetaSyncCR
- + Pull request 11497
- |\
- | + rgw: settle /info implementation across other swift-at-root features.
- | + swift /info implementation.
- | + rgw: add support for the healthcheck feature of Swift API.
- | + rgw: add support for the crossdomain.xml resource of Swift API.
- | + rgw: fix the handling of rgw_swift_url_prefix.
- | + rgw_http_errors: add http error code for 503
- | + rgw: Allow to serve Swift off the URL root
- + Pull request 11866
- |\
- | + rgw: clean up RGWShardedOmapCRManager on early return
- + Pull request 11868
- |\
- | + rgw: store oldest mdlog period in rados
- + Pull request 11872
- |\
- | + rgw: delete entries_index in RGWFetchAllMetaCR
- + Pull request 11876
- |\
- | + rgw: fix the field 'total_time' of log entry in log show opt
- + Pull request 11990
- |\
- | + rgw: for the create_bucket api, if the input creation_time is zero, we should set it to 'now
- + Pull request 11991
- |\
- | + osd: limit omap data in push op
- + Pull request 11998
- |\
- | + FileStore::_do_fiemap: do not reference fiemap after it is freed
- | + test: add test for fiemap xfs issue when #extents > 1364
- | + FileStore:: fix fiemap issue in xfs when #extents > 1364
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12045
- |\
- | + rgw_file: fix spurious mount entries w/Linux NFS client
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12156
- |\
- | + rgw: look for region_map in rgw_region_root_pool
- | + rgw: region conversion respects pre-existing rgw_region_root_pool
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12258
- |\
- | + rgw: add support for the prefix parameter in account listing of Swift API.
- | + rgw: optimize out ctor-copy in RGWListBuckets_ObjStore_SWIFT.
- + Pull request 12291
- |\
- | + msg/simple/Pipe: handle addr decode error
- + Pull request 12302
- |\
- | + tests: check hostname --fqdn sanity before running cmake check
- | + tests: check hostname --fqdn sanity before running make check
- + Pull request 12313
- |\
- | + rgw: only set CURLOPT_UPLOAD for PUT/POST requests
- + Pull request 12314
- |\
- | + rgw: RGWBucketSyncStatusManager uses existing async_rados
- + Pull request 12315
- |\
- | + rgw: fix missing master zone for a single zone zonegroup
- + Pull request 12316
- |\
- | + rgw: add recovery procedure for upgrade to older version of jewel
- + Pull request 12380
- |\
- | + os/ObjectStore: properly clone object map when replaying OP_COLL_MOVE_RENAME
- | + os/ObjectStore: properly clear object map when replaying OP_REMOVE
- + Pull request 12397
- |\
- | + rgw: do not abort when accept a CORS request with short origin
- + Pull request 12405
- |\
- | + install-deps.sh: unify indentation in case statement
- | + install-deps.sh: allow building on SLES systems
- | + build/ops: fix /etc/os-release parsing in install-deps.sh
- | + install-deps.sh: initial distro detection based on /etc/os-release
- + Pull request 12419
- |\
- | + rgw: omap_get_all() fixes
- | + rgw/rgw_rados: do not omap_getvals with (u64)-1 max
- + Pull request 12426
- |\
- | + rbd: --max_part and --nbds_max options for nbd map
- + Pull request 12428
- |\
- | + radosgw-admin: 'zone placement modify' doesnt require pool names
- | + radosgw-admin: add 'zonegroup placement default' command
- | + radosgw-admin: fix 'placment' typos
- | + rgw_admin: commands to manage placement targets
- | + rgw-admin: add commands to manage zonegroup placement fields
- | + rgw: use set for zonegroup placement target tags
- + Pull request 12542
- |\
- | + rgw: Replacing '+' with %20 in canonical uri for s3 v4 auth.
- + Pull request 12622
- |\
- | + rgw: log name instead of id for SystemMetaObj on failure
- | + rgw: drop unnecessary spacing in rgw zg init log
- + Pull request 12677
- |\
- | + mon: OSDMonitor: trigger an immediate propose if any newly down osd is detected during tick()
- + Pull request 12678
- |\
- | + rgw: ldap: simple_bind() should set ldap option on tldap
- + Pull request 12738
- |\
- | + rgw: use explicit flag to cancel RGWCoroutinesManager::run()
- + Pull request 12755
- |\
- | + rados: optionally support reading omap key from file
- + Pull request 12764
- |\
- | + rgw: fix decoding of creation_time and last_update.
- + Pull request 12890
- |\
- | + librbd: ensure owner lock is held before purging cache
- | + librados: blacklist_add should wait for latest OSD map
- | + librbd: prevent assertion failure when journal IO is blacklisted
- | + librbd: ignore blacklist error when releasing exclusive lock
- | + librbd: fail immediately if the exclusive lock cannot be acquired
- | + librbd: add new lock_get_owners / lock_break_lock API methods
- | + librbd: separate break lock logic into standalone state machine
- | + librbd: separate locker query into standalone state machine
- | + librbd/exclusive_lock/AcquireRequest.cc: init lock_type
- | + librbd: API methods to directly acquire and release the exclusive lock
- | + rbd-mirror: fix error messages formatting
- | + librbd: ignore partial refresh error when acquiring exclusive lock
- | + librbd: potential seg fault when blacklisting an image client
- | + librbd: potential double-unwatch of watch handle upon error
- | + librbd: deadlock when replaying journal during image open
- | + librbd: improve image state machine debug log messages
- | + librbd: remove unused refresh request logic
- | + librbd: interlock image refresh and lock operations
- | + librbd: image state machine now has hooks for lock requests
- | + librbd: integrate asynchronous image rewatch state machine
- | + librbd: helper state machine for asynchronous watch recovery
- | + librbd: exclusive lock now supports reacquiring a lost lock
- | + librbd: store exclusive lock cookie instead of recalculating
- | + librbd: helper state machine to update lock cookie
- | + cls_lock: support updating the lock cookie without releasing the lock
- + Pull request 12917
- |\
- | + rgw: Handle multiple listening addreses w/ optional ssl correctly with civetweb.
- | + rgw: s3: secure_port should override port, also apply ssl default right.
- | + rgw: Get civetweb ssl enhancement: wip-listen3 = mg_get_local_addr
- | + rgw: Document that radosgw now supports SSL.
- | + rgw: civetweb/openssl: automagic: load libssl.so and libcrypto.so by soname.
- | + rgw: civetweb/openssl: Load libssl.so and libcrypto.so by soname.
- | + rgw: cmake: remove useless civetweb include path side effect.
- + Pull request 12978
- |\
- | + PrimaryLogPG::try_lock_for_read: give up if missing
- | + ReplicatedBackend: take read locks for clone sources during recovery
- + Pull request 12997
- |\
- | + rgw_rados: add guard assert in add_io()
- | + rgw_rados: sanitize dout print in GWRados::get_obj_iterate_cb
- + Pull request 13001
- |\
- | + rgw: RGWAsyncRadosRequest drops notifier ref on cancel
- | + rgw: remove circular reference in RGWAsyncRadosRequest
- | + rgw: release RGWAioCompletionNotifier refs on destruction
- + Pull request 13025
- |\
- | + Ceph-disk to use correct user in check_journal_req
- + Pull request 13047
- |\
- | + test: Update for new error message when doing scrub with deep-scrub errors
- | + test: Add test support for deep-scrub
- | + osd: Add trigger_scrub admin socket command
- | + test: Add test for keeping deep-scrub information
- | + osd: When deep-scrub errors present upgrade regular scrubs
- + Pull request 13048
- |\
- | + selinux: Allow ceph to manage tmp files
- + Pull request 13050
- |\
- | + tests: make distros/all/centos.yaml be a symlink to centos_7.3
- | + tests: explicitly use centos 7.3 in distros/supported
- | + qa: fixed distros links
- | + qa/distros: centos_7.yaml -> centos.yaml
- | + qa/suites: centos_7.2.yaml -> centos_7.yaml
- | + qa/distros: add centos 7.3
- | + qa/distros: add centos 7 yaml; use that instead
- + Pull request 13062
- |\
- | + msg/simple: clear_pipe when wait() is mopping up pipes
- + Pull request 13106
- |\
- | + Revert use the create option during instantiation
- | + use dev option instead of dev-commit
- + Pull request 13113
- |\
- | + mds: finish clientreplay requests before requesting active state
- + Pull request 13118
- |\
- | + librados: Memory leaks in object_list_begin and object_list_end
- + Pull request 13131
- |\
- | + OSDMonitor: clear jewel+ feature bits when talking to Hammer OSD
- + Pull request 13139
- |\
- | + src/mds: fix MDSMap upgrade decoding
- | + mds: use FSMap::insert to add to standby_daemons
- + Pull request 13143
- |\
- | + radosgw/swift: clean up flush / newline behavior.
- + Pull request 13153
- |\
- | + tests: upgrade: install firefly only on Ubuntu 14.04
- + Pull request 13171
- |\
- | + rgw: clear master_zonegroup when reseting RGWPeriodMap
- + Pull request 13173
- |\
- | + rgw: complete versioning enablement after sending it to meta master
- + Pull request 13175
- |\
- | + rgw_admin: read master log shards from master's current period
- | + rgw: allow getting master log shards info on specified period
- | + rgw_admin: get master's period from store's current period info
- + Pull request 13177
- |\
- | + rgw_file: add timed namespace invalidation
- + Pull request 13178
- |\
- | + rgw_rados: sanitize dout print in GWRados::get_obj_iterate_cb
- | + rgw_rados: add guard assert in add_io()
- + Pull request 13179
- |\
- | + rgw: RGWAsyncRadosRequest drops notifier ref on cancel
- | + rgw: remove circular reference in RGWAsyncRadosRequest
- | + rgw: release RGWAioCompletionNotifier refs on destruction
- + Pull request 13180
- |\
- | + rgw: fix off-by-one in RGWDataChangesLog::get_info
- + Pull request 13182
- |\
- | + radosgw-admin: check for name mistmatch in realm set
- | + radosgw-admin: relam set can use input redirection
- | + radosgw-admin: realm set should create a new realm
- + Pull request 13183
- |\
- | + build/ops: add libldap dependency for RGW
- + Pull request 13184
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13187
- + ceph-disk: convert none str to str before printing it
#53 Updated by Nathan Cutler over 6 years ago
rados¶
teuthology-suite -k distro --priority 101 --suite rados --subset $(expr $RANDOM % 50)/50 --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_20:05:35-rados-wip-jewel-backports-distro-basic-smithi/
- infrastructure noise: clock skew
- 770350
- 770354
- 770358
- false positive http://tracker.ceph.com/issues/18744 (Leak_StillReachable in MON, with libtcmalloc frame)
- infrastructure noise: SSH connection to smithi178 was lost
- infrastructure noise: clock skew
Re-running failed and dead jobs:
#54 Updated by Nathan Cutler over 6 years ago
Upgrade¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x --ceph wip-jewel-backports --machine-type vps --priority 1000 --email ncutler@suse.com
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_20:13:18-upgrade:jewel-x-wip-jewel-backports-distro-basic-vps/
- timeout in radosbench
- upgrade:jewel-x/stress-split/{0-cluster/{openstack.yaml start.yaml} 1-jewel-install/jewel.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/{rbd-cls.yaml rbd-import-export.yaml readwrite.yaml snaps-few-objects.yaml} 6-next-mon/monb.yaml 7-workload/{radosbench.yaml rbd_api.yaml} 8-next-mon/monc.yaml 9-workload/{rbd-python.yaml rgw-swift.yaml snaps-many-objects.yaml} distros/ubuntu_14.04.yaml}
- smithfarm note: "I looked at this test and noticed that it installs "jewel" (i.e. latest not-yet-released) on two cluster nodes and one client node, and then upgrades just one cluster node to "-x" which is the integration branch. Shouldn't it install, say, v10.2.0 instead of latest jewel??"
- timeout in radosbench
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --ceph wip-jewel-backports --machine-type vps --priority 1000 --email ncutler@suse.com
In general, both of these runs have improved significantly but still need work:
#55 Updated by Nathan Cutler over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --ceph wip-jewel-backports --machine-type vps --priority 1000 --email ncutler@suse.com
#56 Updated by Nathan Cutler over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports -k distro -m smithi -s powercycle -p 1000 --email ncutler@suse.com
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_20:15:14-powercycle-wip-jewel-backports-distro-basic-smithi/
- one failure, looks similar to the previous run
Ruled a pass
#57 Updated by Nathan Cutler over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_20:16:06-fs-wip-jewel-backports-distro-basic-smithi/
- saw valgrind issues
- probable "out of order op" (just an unverified suspicion)
- cephfs failure (same as before, not a blocker)
- one dead job
Re-running the six failed jobs:
Re-running the one dead job:
- pending
#58 Updated by Nathan Cutler over 6 years ago
rgw¶
teuthology-suite -k distro --priority 101 --suite rgw --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_20:01:05-rgw-wip-jewel-backports-distro-basic-smithi/
- saw valgrind issues (18 failures) most likely http://tracker.ceph.com/issues/18744
Re-running:
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-02-01_11:09:07-rgw-wip-jewel-backports-distro-basic-smithi/
- saw valgrind issues (down to 8 failures) all checked and verified to be http://tracker.ceph.com/issues/18744
Conclusion: since all 8 failures are libtcmalloc-related, Orit says we can merge all the RGW backport PRs.
#59 Updated by Nathan Cutler over 6 years ago
rbd¶
teuthology-suite -k distro --priority 1000 --suite rbd --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.front.sepia.ceph.com:80/smithfarm-2017-01-31_20:17:40-rbd-wip-jewel-backports-distro-basic-smithi/
- known, low-priority bug http://tracker.ceph.com/issues/18739 FAILED assert(0 == "out of order op")
Ruled a pass
#60 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#61 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#62 Updated by Nathan Cutler over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 12044
- |\
- | + rgw: fix for bucket delete racing with mdlog sync
- + Pull request 12079
- |\
- | + rgw: TempURL properly handles accounts created with the implicit tenant.
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12380
- |\
- | + os/ObjectStore: properly clone object map when replaying OP_COLL_MOVE_RENAME
- | + os/ObjectStore: properly clear object map when replaying OP_REMOVE
- + Pull request 12490
- |\
- | + rgw: add 'rgw log http headers' gloss to config-ref.rst
- | + use std::map
- | + rgw: add rgw_log_http_headers option
- + Pull request 12729
- |\
- | + jewel: fix compile error for dencode test case when --with-radosgw=no
- | + jewel: fixed compile error when --with-radosgw=no
- + Pull request 12754
- |\
- | + rbd-nbd: support partition for rbd-nbd mapped raw block device.
- + Pull request 12917
- |\
- | + rgw: Handle multiple listening addreses w/ optional ssl correctly with civetweb.
- | + rgw: s3: secure_port should override port, also apply ssl default right.
- | + rgw: Get civetweb ssl enhancement: wip-listen3 = mg_get_local_addr
- | + rgw: Document that radosgw now supports SSL.
- | + rgw: civetweb/openssl: automagic: load libssl.so and libcrypto.so by soname.
- | + rgw: civetweb/openssl: Load libssl.so and libcrypto.so by soname.
- | + rgw: cmake: remove useless civetweb include path side effect.
- + Pull request 13004
- |\
- | + rgw: RGWCloneMetaLogCoroutine uses RGWMetadataLogInfoCompletion
- | + rgw: expose completion for RGWMetadataLog::get_info_async()
- | + rgw: RGWMetaSyncShardCR drops stack refs on destruction
- | + rgw: librados aio wait_for_safe, not wait_for_complete
- + Pull request 13048
- |\
- | + selinux: Allow ceph to manage tmp files
- + Pull request 13050
- |\
- | + tests: make distros/all/centos.yaml be a symlink to centos_7.3
- | + tests: explicitly use centos 7.3 in distros/supported
- | + qa: fixed distros links
- | + qa/distros: centos_7.yaml -> centos.yaml
- | + qa/suites: centos_7.2.yaml -> centos_7.yaml
- | + qa/distros: add centos 7.3
- | + qa/distros: add centos 7 yaml; use that instead
- + Pull request 13058
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13113
- |\
- | + mds: finish clientreplay requests before requesting active state
- + Pull request 13130
- |\
- | + librbd: allow to open an image without opening parent image
- + Pull request 13131
- |\
- | + OSDMonitor: clear jewel+ feature bits when talking to Hammer OSD
- + Pull request 13139
- |\
- | + src/mds: fix MDSMap upgrade decoding
- | + mds: use FSMap::insert to add to standby_daemons
- + Pull request 13146
- |\
- | + test: Update for new error message when doing scrub with deep-scrub errors
- | + osd: Add trigger_scrub admin socket command
- | + test: Add test for keeping deep-scrub information
- | + osd: When deep-scrub errors present upgrade regular scrubs
- | + tasks/scrub_test.py: Make test deterministic by updating digests
- | + repair_test, scrub_test: Fix whitelists for scrub changes
- | + scrub_test: Fix for list-inconsistent-obj output changes
- | + doc, test: Add schemas for list-inconsistent-* rados command output
- | + test: Update testing for new list-inconsistent-obj output
- | + rados, osd: Improve attrs output of list-inconsistent-obj
- | + osd: Fix logging to help with diagnostics
- | + test: Fix use of wait_for_clean()
- | + common: Change cleanbin() to use base64 encoding, update ceph-objectstore-tool
- | + common: Move cleanbin() function to common/util.cc
- | + test: Add test support for deep-scrub
- | + common: Fix indentation
- | + osd: Handle corrupt attributes in get_object_context()
- | + ReplicatedPG::failed_push: release read lock on failure
- | + test.sh: Make check for flags more robust
- | + test: Remove extra objectstore_tool call which causes a recovery
- | + test: Handle object removals in a non-racey way
- | + osd: Fix hang on unfound object after mark_unfound_lost is done
- | + osd: Handle recovery read errors
- | + osd: Fix log messages
- | + osd: CLEANUP: Remove unused pending_read member
- | + test/osd-scrub-repair.sh: Use test case specific object names to help with diagnostics
- + Pull request 13183
- |\
- | + build/ops: add libldap dependency for RGW
- + Pull request 13184
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13212
- |\
- | + test/osd: add test for fast mark down functionality
- | + msg/async: implement ECONNREFUSED detection
- | + messages/MOSDFailure.h: distinguish between timeout and immediate failure
- | + OSD: Implement ms_handle_refused
- | + msg/simple: add ms_handle_refused callback
- | + AsyncConnection: fix delay state using dispatch_queue
- | + AsyncConnection: need to prepare message when features mismatch
- | + AsyncConnection: continue to read when meeting EINTR
- | + AsyncConnection: release dispatch throttle with fast dispatch message
- | + DispatchQueue: remove pipe words
- | + DispatchQueue: add name to separte different instance
- | + AsyncConnection: add DispathQueue throttle
- | + AsyncConnection: change all exception deliver to DispatchQueue
- | + AsyncConnection: make local message deliver via DispatchQueue
- | + AsyncMessenger: introduce DispatchQueue to separate nonfast message
- | + DispatchQueue: move dispatch_throtter from SimpleMessenger to DispatchQueue
- | + DispatchQueue: Move from msg/simple to msg
- + Pull request 13214
- |\
- | + OSD: allow client throttler to be adjusted on-fly, without restart
- + Pull request 13222
- |\
- | + qa/suites/upgrade/hammer-x: wrap thrash and workloads
- + Pull request 13232
- |\
- | + osd: Increase priority for inactive PGs backfill
- + | Pull request 13233
- |\ \
- | + | librbd: async method to check journal tag owner
- | + | rbd-mirror: check image mirroring state when bootstrapping
- | + | rbd-mirror: async request to test if image is primary
- | + | rbd-mirror: hold owner lock when testing if lock owner
- | /
- + | Pull request 13240
- |\ \
- | + | tests: fix regression in qa/tasks/ceph_master.py
- | + | tests: ignore bogus ceph-objectstore-tool error in ceph_manager
- | /
- + | Pull request 13244
- |\ \
- | + | osdc: cache should ignore error bhs during trim
- | /
- + | Pull request 13254
- |\ \
- | + | radosstriper : protect aio_write API from calls with 0 bytes
- | /
- + | Pull request 13255
- |\ \
- | |/
- | + osd: do not send ENXIO on misdirected op by default
- + Pull request 13261
- |\
- | + mon/OSDMonitor: make 'osd crush move ...' work on osds
- + Pull request 13273
- |\
- | + rgw: add check for update return value
- | + rgw: we need to reinit the zonegroup after assignment to avoid invalid cct and store
- | + rgw: fix init_zg_from_period when default zone is not set as default
- + Pull request 13276
- |\
- | + rgw: be aware abount tenants on cls_user_bucket -> rgw_bucket conversion.
- + Pull request 13341
- + Backport bucket reshard to jewel.
- + rgw_admin: add a few admin commands to the usage
- + rgw_admin: add bi purge command
- + rgw: bucket resharding, adjust logging
- + cls/rgw: bi_list() fix is_truncated returned param
- + rgw_admin: require --yes-i-really-mean-it for bucket reshard
- + rgw_admin: better bucket reshard logging
- + rgw: limit bucket reshard num shards to max possible
- + rgw_admin: fix bi list command
- + rgw_admin: use aio operations for bucket resharding
- + rgw: bucket reshard updates stats
- + cls/rgw: add bucket_update_stats method
- + rgw_admin: reshard also links to new bucket instance
- + rgw: rgw_link_bucket, use correct bucket structure for entry point
- + radosgw-admin: bucket reshard needs --num-shards to be specified
- + cls/rgw: fix bi_list objclass command
- + rgw_admin: bucket rehsrading, initial work
- + rgw: utilities to support raw bucket index operations
- + rgw: use bucket_info.bucket_id instead of marker where needed
- + cls/rgw: utilities to support raw bucket index operations
#63 Updated by Nathan Cutler over 6 years ago
rados¶
teuthology-suite -k distro --priority 101 --suite rados --subset $(expr $RANDOM % 50)/50 --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-13_20:36:56-rados-wip-jewel-backports-distro-basic-smithi/
python ../fail.py $run fail
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=86f3d99a9d10afc8be7a72b4648550687dcf6cf1 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin CEPH_BASE=/home/ubuntu/cephtest/clone.client.0 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/clone.client.0/qa/workunits/mon/crush_ops.sh'
- rados/monthrash/{ceph/ceph.yaml clusters/3-mons.yaml fs/xfs.yaml msgr-failures/mon-delay.yaml msgr/simple.yaml rados.yaml thrashers/many.yaml workloads/rados_mon_workunits.yaml}
- rados/monthrash/{ceph/ceph.yaml clusters/9-mons.yaml fs/xfs.yaml msgr-failures/few.yaml msgr/async.yaml rados.yaml thrashers/one.yaml workloads/rados_mon_workunits.yaml}
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=v0.80.8
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=9d446bd416c52cd785ccf048ca67737ceafcdd7f
- saw valgrind issues
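For context, the --subset argument in the scheduling command above samples a pseudo-random one-fiftieth of the generated rados job matrix, so each scheduling pass exercises a different slice of the suite. A minimal bash sketch of how that fraction comes together (the values shown are illustrative):
# $RANDOM is bash's built-in pseudo-random integer (0..32767); taking it modulo 50
# picks one of 50 buckets, and teuthology-suite then schedules only that 1/50th
# of the jobs it would otherwise generate for the suite.
bucket=$(expr $RANDOM % 50)      # e.g. 37
subset="${bucket}/50"            # e.g. "37/50"
teuthology-suite -k distro --priority 101 --suite rados \
  --subset "$subset" --email ncutler@suse.com \
  --ceph wip-jewel-backports --machine-type smithi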
#64 Updated by Nathan Cutler over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports -k distro -m smithi -s powercycle -p 1000 --email ncutler@suse.com
#65 Updated by Nathan Cutler over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
#66 Updated by Nathan Cutler over 6 years ago
rgw¶
teuthology-suite -k distro --priority 101 --suite rgw --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
#67 Updated by Nathan Cutler over 6 years ago
rbd¶
teuthology-suite -k distro --priority 1000 --suite rbd --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-13_22:42:25-rbd-wip-jewel-backports-distro-basic-smithi/
- 'sudo adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage daemon-helper kill ceph-osd -f --cluster cluster2 -i 1'
- dead job is qemu_xfstest
Re-running failed jobs
#68 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 12239
- |\
- | + jewel: fixed the issue when --disable-server, compilation fails.
- + Pull request 12380
- |\
- | + os/ObjectStore: properly clone object map when replaying OP_COLL_MOVE_RENAME
- | + os/ObjectStore: properly clear object map when replaying OP_REMOVE
- + Pull request 12729
- |\
- | + jewel: fix compile error for dencode test case when --with-radosgw=no
- | + jewel: fixed compile error when --with-radosgw=no
- + Pull request 12754
- |\
- | + rbd-nbd: support partition for rbd-nbd mapped raw block device.
- + Pull request 12917
- |\
- | + rgw: Handle multiple listening addreses w/ optional ssl correctly with civetweb.
- | + rgw: s3: secure_port should override port, also apply ssl default right.
- | + rgw: Get civetweb ssl enhancement: wip-listen3 = mg_get_local_addr
- | + rgw: Document that radosgw now supports SSL.
- | + rgw: civetweb/openssl: automagic: load libssl.so and libcrypto.so by soname.
- | + rgw: civetweb/openssl: Load libssl.so and libcrypto.so by soname.
- | + rgw: cmake: remove useless civetweb include path side effect.
- + Pull request 13004
- |\
- | + rgw: RGWCloneMetaLogCoroutine uses RGWMetadataLogInfoCompletion
- | + rgw: expose completion for RGWMetadataLog::get_info_async()
- | + rgw: RGWMetaSyncShardCR drops stack refs on destruction
- | + rgw: librados aio wait_for_safe, not wait_for_complete
- + Pull request 13048
- |\
- | + selinux: Allow ceph to manage tmp files
- + Pull request 13050
- |\
- | + tests: make distros/all/centos.yaml be a symlink to centos_7.3
- | + tests: explicitly use centos 7.3 in distros/supported
- | + qa: fixed distros links
- | + qa/distros: centos_7.yaml -> centos.yaml
- | + qa/suites: centos_7.2.yaml -> centos_7.yaml
- | + qa/distros: add centos 7.3
- | + qa/distros: add centos 7 yaml; use that instead
- + Pull request 13058
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13130
- |\
- | + librbd: allow to open an image without opening parent image
- + Pull request 13131
- |\
- | + OSDMonitor: clear jewel+ feature bits when talking to Hammer OSD
- + Pull request 13146
- |\
- | + test: Update for new error message when doing scrub with deep-scrub errors
- | + osd: Add trigger_scrub admin socket command
- | + test: Add test for keeping deep-scrub information
- | + osd: When deep-scrub errors present upgrade regular scrubs
- | + tasks/scrub_test.py: Make test deterministic by updating digests
- | + repair_test, scrub_test: Fix whitelists for scrub changes
- | + scrub_test: Fix for list-inconsistent-obj output changes
- | + doc, test: Add schemas for list-inconsistent-* rados command output
- | + test: Update testing for new list-inconsistent-obj output
- | + rados, osd: Improve attrs output of list-inconsistent-obj
- | + osd: Fix logging to help with diagnostics
- | + test: Fix use of wait_for_clean()
- | + common: Change cleanbin() to use base64 encoding, update ceph-objectstore-tool
- | + common: Move cleanbin() function to common/util.cc
- | + test: Add test support for deep-scrub
- | + common: Fix indentation
- | + osd: Handle corrupt attributes in get_object_context()
- | + ReplicatedPG::failed_push: release read lock on failure
- | + test.sh: Make check for flags more robust
- | + test: Remove extra objectstore_tool call which causes a recovery
- | + test: Handle object removals in a non-racey way
- | + osd: Fix hang on unfound object after mark_unfound_lost is done
- | + osd: Handle recovery read errors
- | + osd: Fix log messages
- | + osd: CLEANUP: Remove unused pending_read member
- | + test/osd-scrub-repair.sh: Use test case specific object names to help with diagnostics
- + Pull request 13183
- |\
- | + build/ops: add libldap dependency for RGW
- + Pull request 13184
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13212
- |\
- | + test/osd: add test for fast mark down functionality
- | + msg/async: implement ECONNREFUSED detection
- | + messages/MOSDFailure.h: distinguish between timeout and immediate failure
- | + OSD: Implement ms_handle_refused
- | + msg/simple: add ms_handle_refused callback
- | + AsyncConnection: fix delay state using dispatch_queue
- | + AsyncConnection: need to prepare message when features mismatch
- | + AsyncConnection: continue to read when meeting EINTR
- | + AsyncConnection: release dispatch throttle with fast dispatch message
- | + DispatchQueue: remove pipe words
- | + DispatchQueue: add name to separte different instance
- | + AsyncConnection: add DispathQueue throttle
- | + AsyncConnection: change all exception deliver to DispatchQueue
- | + AsyncConnection: make local message deliver via DispatchQueue
- | + AsyncMessenger: introduce DispatchQueue to separate nonfast message
- | + DispatchQueue: move dispatch_throtter from SimpleMessenger to DispatchQueue
- | + DispatchQueue: Move from msg/simple to msg
- + Pull request 13214
- |\
- | + OSD: allow client throttler to be adjusted on-fly, without restart
- + Pull request 13222
- |\
- | + qa/suites/upgrade/hammer-x: wrap thrash and workloads
- + Pull request 13232
- |\
- | + osd: Increase priority for inactive PGs backfill
- + | Pull request 13233
- |\ \
- | + | librbd: async method to check journal tag owner
- | + | rbd-mirror: check image mirroring state when bootstrapping
- | + | rbd-mirror: async request to test if image is primary
- | + | rbd-mirror: hold owner lock when testing if lock owner
- | /
- + | Pull request 13240
- |\ \
- | + | tests: fix regression in qa/tasks/ceph_master.py
- | + | tests: ignore bogus ceph-objectstore-tool error in ceph_manager
- | /
- + | Pull request 13244
- |\ \
- | + | osdc: cache should ignore error bhs during trim
- | /
- + | Pull request 13254
- |\ \
- | + | radosstriper : protect aio_write API from calls with 0 bytes
- | /
- + | Pull request 13255
- |\ \
- | |/
- | + osd: do not send ENXIO on misdirected op by default
- + Pull request 13273
- |\
- | + rgw: add check for update return value
- | + rgw: we need to reinit the zonegroup after assignment to avoid invalid cct and store
- | + rgw: fix init_zg_from_period when default zone is not set as default
- + Pull request 13276
- |\
- | + rgw: be aware abount tenants on cls_user_bucket -> rgw_bucket conversion.
- + Pull request 13341
- + Backport bucket reshard to jewel.
- + rgw_admin: add a few admin commands to the usage
- + rgw_admin: add bi purge command
- + rgw: bucket resharding, adjust logging
- + cls/rgw: bi_list() fix is_truncated returned param
- + rgw_admin: require --yes-i-really-mean-it for bucket reshard
- + rgw_admin: better bucket reshard logging
- + rgw: limit bucket reshard num shards to max possible
- + rgw_admin: fix bi list command
- + rgw_admin: use aio operations for bucket resharding
- + rgw: bucket reshard updates stats
- + cls/rgw: add bucket_update_stats method
- + rgw_admin: reshard also links to new bucket instance
- + rgw: rgw_link_bucket, use correct bucket structure for entry point
- + radosgw-admin: bucket reshard needs --num-shards to be specified
- + cls/rgw: fix bi_list objclass command
- + rgw_admin: bucket rehsrading, initial work
- + rgw: utilities to support raw bucket index operations
- + rgw: use bucket_info.bucket_id instead of marker where needed
- + cls/rgw: utilities to support raw bucket index operations
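As an aside, the perl filter in the command at the top of this note only rewrites the git log graph into Redmine textile links; a small sketch of its effect on two representative input lines (the commit hashes and branch name here are placeholders, not real objects):
printf '%s\n' \
  '* 1234abcd Merge pull request #13341 from user/branch' \
  '| * 5678ef90 rgw: bucket reshard updates stats' |
perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
# expected output (textile list items carrying the links rendered above):
# * + "Pull request 13341":https://github.com/ceph/ceph/pull/13341
# * | + "rgw: bucket reshard updates stats":https://github.com/ceph/ceph/commit/5678ef90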
#69 Updated by Loïc Dachary over 6 years ago
rados¶
teuthology-suite -k distro --priority 1000 --suite rados --subset $(expr $RANDOM % 50)/50 --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-02-16_09:52:21-rados-wip-jewel-backports-distro-basic-smithi
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=v0.80.8
- "2017-02-16 13:16:40.828313 osd.0 172.21.15.33:6804/12748 8 : cluster [WRN] map e388 wrongly marked me down" in cluster log
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=9d446bd416c52cd785ccf048ca67737ceafcdd7f
- "2017-02-16 13:35:11.781460 osd.3 172.21.15.202:6800/22006 1 : cluster [WRN] map e200 wrongly marked me down" in cluster log
- saw valgrind issues
Re-running failed jobs
- fail http://pulpito.ceph.com/loic-2017-02-20_07:43:57-rados-wip-jewel-backports-distro-basic-smithi
- known bug assert len(unclean) == num_unclean in dump_stuck.py in rados suite
- "2017-02-20 08:15:50.988177 osd.0 172.21.15.126:6800/675602 1 : cluster [WRN] map e420 wrongly marked me down" in cluster log
- "2017-02-20 08:09:21.196329 osd.1 172.21.15.179:6806/27276 1 : cluster [WRN] map e14 wrongly marked me down" in cluster log
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=9d446bd416c52cd785ccf048ca67737ceafcdd7f
- "2017-02-20 08:27:05.421722 osd.0 172.21.15.110:6808/29569 52 : cluster [WRN] map e602 wrongly marked me down" in cluster log
Re-running the failed jobs against plain jewel to check whether these failures are regressions
#70 Updated by Loïc Dachary over 6 years ago
Upgrade jewel point-to-point-x¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x/point-to-point-x --ceph wip-jewel-backports --machine-type vps --priority 1000
#71 Updated by Loïc Dachary over 6 years ago
Upgrade hammer-x¶
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --ceph wip-jewel-backports --machine-type vps --priority 1000
- fail http://pulpito.ceph.com/loic-2017-02-16_09:55:58-upgrade:hammer-x-wip-jewel-backports-distro-basic-vps
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&ref=firefly
- Failed to fetch package version from https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=ubuntu%2F14.04%2Fx86_64&sha1=95cefea9fd9ab740263bf8bb4796fd864d9afe2b
- "sudo TESTDIR=/home/ubuntu/cephtest bash -c 'ceph osd set require_jewel_osds'"
- upgrade:hammer-x/stress-split-erasure-code/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=jerasure-k=3-m=1.yaml distros/centos_7.3.yaml}
- upgrade:hammer-x/stress-split-erasure-code/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=jerasure-k=3-m=1.yaml distros/ubuntu_14.04.yaml}
- 'sudo TESTDIR=/home/ubuntu/cephtest bash -c \'sudo ceph osd erasure-code-profile set profile-shec k=2 m=1 c=1 plugin=shec 2>&1 | grep "unsupported by"\''
- upgrade:hammer-x/stress-split-erasure-code/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-no-shec.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=jerasure-k=3-m=1.yaml distros/ubuntu_14.04.yaml}
- upgrade:hammer-x/stress-split-erasure-code/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-no-shec.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=jerasure-k=3-m=1.yaml distros/centos_7.3.yaml}
Re-running
#72 Updated by Loïc Dachary over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --ceph wip-jewel-backports --machine-type vps --priority 1000
#73 Updated by Loïc Dachary over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports -k distro -m smithi -s powercycle -p 1000 --email loic@dachary.org
- fail http://pulpito.ceph.com/loic-2017-02-16_09:58:01-powercycle-wip-jewel-backports-distro-basic-smithi
- "sudo yum -y install '' ceph-radosgw"
- "2017-02-16 16:43:03.101694 osd.0 172.21.15.4:6800/22904 24 : cluster [ERR] 1.0 deep-scrub 1 errors" in cluster log
#74 Updated by Loïc Dachary over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-02-16_09:58:57-fs-wip-jewel-backports-distro-basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && cd -- /home/ubuntu/cephtest/mnt.0/client.0/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=52902891f10862c107758b1e7f0ed67edd486a89 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster ceph" CEPH_ID="0" PATH=$PATH:/usr/sbin CEPH_BASE=/home/ubuntu/cephtest/clone.client.0 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/clone.client.0/qa/workunits/libcephfs-java/test.sh'
- 'mkdir
Re-running failed jobs
#75 Updated by Loïc Dachary over 6 years ago
rgw¶
teuthology-suite -k distro --priority 1000 --suite rgw --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com/loic-2017-02-16_10:00:14-rgw-wip-jewel-backports-distro-basic-smithi
- "sudo yum -y install '' ceph-radosgw"
- 'sudo apt-get update'
- reached maximum tries (50) after waiting for 300 seconds
- No summary info found for user: foo
Re-running failed jobs
#76 Updated by Loïc Dachary over 6 years ago
rbd¶
teuthology-suite -k distro --priority 1000 --suite rbd --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- running http://pulpito.ceph.com/loic-2017-02-16_10:01:26-rbd-wip-jewel-backports-distro-basic-smithi
- 'mkdir -p -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && cd -- /home/ubuntu/cephtest/mnt.cluster1.mirror/client.mirror/tmp && CEPH_CLI_TEST_DUP_COMMAND=1 CEPH_REF=52902891f10862c107758b1e7f0ed67edd486a89 TESTDIR="/home/ubuntu/cephtest" CEPH_ARGS="--cluster cluster1" CEPH_ID="mirror" PATH=$PATH:/usr/sbin CEPH_BASE=/home/ubuntu/cephtest/clone.cluster1.client.mirror CEPH_ARGS=\'\' RBD_MIRROR_USE_RBD_MIRROR=1 RBD_MIRROR_USE_EXISTING_CLUSTER=1 adjust-ulimits ceph-coverage /home/ubuntu/cephtest/archive/coverage timeout 3h /home/ubuntu/cephtest/clone.cluster1.client.mirror/qa/workunits/rbd/rbd_mirror.sh'
- 'mkdir
Re-running failed jobs
#77 Updated by Loïc Dachary over 6 years ago
git --no-pager log --format='%H %s' --graph ceph/jewel..wip-jewel-backports | perl -p -e 's/"/ /g; if (/\w+\s+Merge pull request #(\d+)/) { s|\w+\s+Merge pull request #(\d+).*|"Pull request $1":https://github.com/ceph/ceph/pull/$1|; } else { s|(\w+)\s+(.*)|"$2":https://github.com/ceph/ceph/commit/$1|; } s/\*/+/; s/^/* /;'
- + Pull request 12917
- |\
- | + rgw: Handle multiple listening addreses w/ optional ssl correctly with civetweb.
- | + rgw: s3: secure_port should override port, also apply ssl default right.
- | + rgw: Get civetweb ssl enhancement: wip-listen3 = mg_get_local_addr
- | + rgw: Document that radosgw now supports SSL.
- | + rgw: civetweb/openssl: automagic: load libssl.so and libcrypto.so by soname.
- | + rgw: civetweb/openssl: Load libssl.so and libcrypto.so by soname.
- | + rgw: cmake: remove useless civetweb include path side effect.
- + Pull request 13048
- |\
- | + selinux: Allow ceph to manage tmp files
- + Pull request 13050
- |\
- | + tests: make distros/all/centos.yaml be a symlink to centos_7.3
- | + tests: explicitly use centos 7.3 in distros/supported
- | + qa: fixed distros links
- | + qa/distros: centos_7.yaml -> centos.yaml
- | + qa/suites: centos_7.2.yaml -> centos_7.yaml
- | + qa/distros: add centos 7.3
- | + qa/distros: add centos 7 yaml; use that instead
- + Pull request 13058
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13131
- |\
- | + OSDMonitor: clear jewel+ feature bits when talking to Hammer OSD
- + Pull request 13184
- |\
- | + systemd: Restart Mon after 10s in case of failure
- + Pull request 13212
- |\
- | + test/osd: add test for fast mark down functionality
- | + msg/async: implement ECONNREFUSED detection
- | + messages/MOSDFailure.h: distinguish between timeout and immediate failure
- | + OSD: Implement ms_handle_refused
- | + msg/simple: add ms_handle_refused callback
- | + AsyncConnection: fix delay state using dispatch_queue
- | + AsyncConnection: need to prepare message when features mismatch
- | + AsyncConnection: continue to read when meeting EINTR
- | + AsyncConnection: release dispatch throttle with fast dispatch message
- | + DispatchQueue: remove pipe words
- | + DispatchQueue: add name to separte different instance
- | + AsyncConnection: add DispathQueue throttle
- | + AsyncConnection: change all exception deliver to DispatchQueue
- | + AsyncConnection: make local message deliver via DispatchQueue
- | + AsyncMessenger: introduce DispatchQueue to separate nonfast message
- | + DispatchQueue: move dispatch_throtter from SimpleMessenger to DispatchQueue
- | + DispatchQueue: Move from msg/simple to msg
- + Pull request 13214
- |\
- | + OSD: allow client throttler to be adjusted on-fly, without restart
- + Pull request 13232
- |\
- | + osd: Increase priority for inactive PGs backfill
- + | Pull request 13240
- |\ \
- | + | tests: fix regression in qa/tasks/ceph_master.py
- | + | tests: ignore bogus ceph-objectstore-tool error in ceph_manager
- | /
- + | Pull request 13244
- |\ \
- | + | osdc: cache should ignore error bhs during trim
- | /
- + | Pull request 13254
- |\ \
- | + | radosstriper : protect aio_write API from calls with 0 bytes
- | /
- + | Pull request 13255
- |\ \
- | |/
- | + osd: do not send ENXIO on misdirected op by default
- + Pull request 13450
- |\
- | + msg/simple/Pipe: support IPv6 QoS.
- + Pull request 13459
- |\
- | + mds: fix incorrect assertion in Server::_dir_is_nonempty()
- + Pull request 13477
- |\
- | + ceph-osd: --flush-journal: sporadic segfaults on exit
- + Pull request 13489
- |\
- | + ceph-disk: Fix getting wrong group name when --setgroup in bluestore
- + Pull request 13496
- + ceph-disk: change get_dmcrypt_key test to support different cluster name
- + ceph-disk: Adding cluster name support for dmcrypt
#78 Updated by Nathan Cutler over 6 years ago
fs¶
teuthology-suite -k distro --priority 1000 --suite fs --email loic@dachary.org --ceph wip-jewel-backports --machine-type smithi
- pass http://pulpito.ceph.com:80/smithfarm-2017-02-20_21:11:08-fs-wip-jewel-backports-distro-basic-smithi/
NOTE: merge https://github.com/ceph/ceph/pull/13459 when/if this run passes, and then ask John to approve 10.2.6 - DONE
#79 Updated by Nathan Cutler over 6 years ago
rados¶
teuthology-suite -k distro --priority 101 --suite rados --subset $(expr $RANDOM % 50)/50 --email ncutler@suse.com --ceph wip-jewel-backports --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-20_21:18:05-rados-wip-jewel-backports-distro-basic-smithi/
- one failure is an instance of http://tracker.ceph.com/issues/18089 and can be ignored
- 840283 (rados/singleton/{all/osd-recovery.yaml fs/xfs.yaml msgr-failures/many.yaml msgr/random.yaml rados.yaml}): runs for 8 hours, no sign of stopping
- 840382 (rados/singleton/{all/dump-stuck.yaml fs/xfs.yaml msgr-failures/many.yaml msgr/random.yaml rados.yaml}): http://tracker.ceph.com/issues/17366
- one transient valgrind failure
Re-running the three other failures:
filter="rados/singleton/{all/osd-recovery.yaml fs/xfs.yaml msgr-failures/many.yaml msgr/random.yaml rados.yaml},rados/singleton/{all/dump-stuck.yaml fs/xfs.yaml msgr-failures/many.yaml msgr/random.yaml rados.yaml},rados/verify/{1thrash/none.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr-failures/few.yaml msgr/random.yaml rados.yaml tasks/rados_cls_all.yaml validater/valgrind.yaml}"
- running http://pulpito.ceph.com:80/smithfarm-2017-02-21_14:15:43-rados-wip-jewel-backports-distro-basic-smithi/
- 843306 (rados/singleton/{all/dump-stuck.yaml fs/xfs.yaml msgr-failures/many.yaml msgr/random.yaml rados.yaml}): http://tracker.ceph.com/issues/17366
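The exact re-run invocation is not captured in this note; presumably it followed the same pattern as the hammer-x re-run further down, i.e. the filter string defined above is passed through --filter while keeping the original scheduling flags (a sketch only):
# Sketch: reschedule just the three named jobs by reusing the $filter string above.
teuthology-suite -k distro --priority 101 --suite rados \
  --email ncutler@suse.com --ceph wip-jewel-backports \
  --machine-type smithi --filter="$filter"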
#80 Updated by Nathan Cutler over 6 years ago
powercycle¶
teuthology-suite -v -c wip-jewel-backports -k distro -m smithi -s powercycle -p 101 --email ncutler@suse.com
#81 Updated by Nathan Cutler over 6 years ago
Upgrade jewel point-to-point-x¶
teuthology-suite -k distro --verbose --suite upgrade/jewel-x/point-to-point-x --ceph wip-jewel-backports --machine-type vps --priority 101 --email ncutler@suse.com
Re-running:
#82 Updated by Nathan Cutler over 6 years ago
Upgrade hammer-x¶
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --ceph wip-jewel-backports --machine-type vps --priority 101 --email ncutler@suse.com
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-20_21:28:11-upgrade:hammer-x-wip-jewel-backports-distro-basic-vps/
- two cases of http://tracker.ceph.com/issues/18089
- one mysterious failure (thrashosds/ceph_manager.py raises exception "Exception: ceph-objectstore-tool: exp list-pgs failure with status 1")
Re-running the list-pgs failure:
filter="stress-split-erasure-code/{0-cluster/{openstack.yaml start.yaml} 0-tz-eastern.yaml 1-hammer-install/hammer.yaml 2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/mona.yaml 5-workload/ec-rados-default.yaml 6-next-mon/monb.yaml 8-finish-upgrade/last-osds-and-monc.yaml 9-workload/ec-rados-plugin=jerasure-k=3-m=1.yaml distros/ubuntu_14.04.yaml}"
teuthology-suite -k distro --verbose --suite upgrade/hammer-x --ceph wip-jewel-backports --machine-type vps --priority 101 --email ncutler@suse.com --filter="$filter"
#83 Updated by Nathan Cutler over 6 years ago
ceph-disk¶
teuthology-suite -k distro --verbose --suite ceph-disk --ceph wip-jewel-backports --machine-type vps --priority 101 --email ncutler@suse.com
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-20_21:30:56-ceph-disk-wip-jewel-backports-distro-basic-vps/
- a bunch of tests fail with: Exception: timeout waiting for osd $SOME_UUID to be up
- a bunch of tests fail with: Because of one of :
#84 Updated by Nathan Cutler over 6 years ago
Release blockers, to be merged urgently once the rados, powercycle, and upgrade runs are green:
- https://github.com/ceph/ceph/pull/13050
- https://github.com/ceph/ceph/pull/13131
- https://github.com/ceph/ceph/pull/13255
After these are merged, ask Josh to approve 10.2.6
#85 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#86 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#87 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#88 Updated by Nathan Cutler over 6 years ago
RADOS ON PR#13131 AND PR#13255¶
teuthology-suite -k distro --priority 101 --suite rados --subset $(expr $RANDOM % 50)/50 --email ncutler@suse.com --ceph pr-13131 --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-21_20:38:05-rados-pr-13131-distro-basic-smithi/
- one failure is http://tracker.ceph.com/issues/18089 and can be ignored
- three jobs are rados/thrash-erasure-code-isa jobs which cannot be scheduled because they want CentOS 7.2 smithis (of which there are none); the fix would be to rebase the branch to pick up PR#13050 and re-run (see the sketch below)
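A rough sketch of that fix, with the caveat that the remote names and exact branch layout are assumptions rather than something recorded in this ticket:
# Assumed workflow: fold the CentOS 7.3 distro yamls from PR#13050 (already in
# wip-jewel-backports) into the pr-13131 test branch, push it so new packages
# get built, then reschedule the unschedulable jobs.
git fetch ceph                         # "ceph" assumed to point at ceph/ceph.git
git checkout pr-13131
git rebase ceph/wip-jewel-backports    # picks up PR#13050's qa/distros changes
git push --force ceph-ci pr-13131      # "ceph-ci" assumed to be the remote that packages are built from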
teuthology-suite -k distro --priority 101 --suite rados --subset $(expr $RANDOM % 50)/50 --email ncutler@suse.com --ceph pr-13255 --machine-type smithi
- fail http://pulpito.ceph.com:80/smithfarm-2017-02-21_20:43:35-rados-pr-13255-distro-basic-smithi/
- one failure is http://tracker.ceph.com/issues/18089 and can be ignored
- three jobs cannot be scheduled because they want CentOS 7.2 smithis which do not exist (missing PR#13050 in this branch)
- one instance of "api_misc: [ FAILED ] LibRadosMiscConnectFailure.ConnectFailure" in rados/verify/{1thrash/none.yaml clusters/{fixed-2.yaml openstack.yaml} fs/btrfs.yaml msgr-failures/few.yaml msgr/async.yaml rados.yaml tasks/rados_api_tests.yaml validater/lockdep.yaml}
Re-running four failed jobs (three are btrfs, one is xfs):
The one failure is in the same test, and looks very similar:
2017-02-22T15:26:29.695 INFO:tasks.workunit.client.0.smithi114.stdout: api_misc: test/librados/misc.cc:71: Failure
2017-02-22T15:26:29.695 INFO:tasks.workunit.client.0.smithi114.stdout: api_misc: Expected: (0) != (rados_connect(cluster)), actual: 0 vs 0
2017-02-22T15:26:29.695 INFO:tasks.workunit.client.0.smithi114.stdout: api_misc: [ FAILED ] LibRadosMiscConnectFailure.ConnectFailure (43 ms)
#89 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#90 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#91 Updated by Nathan Cutler over 6 years ago
- Description updated (diff)
#92 Updated by Yuri Weinstein over 6 years ago
QE VALIDATION (STARTED 2/23/17)¶
(Note: "PASSED / FAILED" in a row indicates that the test is still in progress)
Re-run command lines and filters are captured in http://pad.ceph.com/p/hammer_v10.2.6_QE_validation_notes
command line CEPH_QA_MAIL="ceph-qa@ceph.com"; MACHINE_NAME=smithi; CEPH_BRANCH=jewel; SHA1=d9eaab456ff45ae88e83bd633f0c4efb5902bf07 ; teuthology-suite -v --ceph-repo https://github.com/ceph/ceph.git --suite-repo https://github.com/ceph/ceph.git -c $CEPH_BRANCH -S $SHA1 -m $MACHINE_NAME -s rados --subset 35/50 -k distro -p 100 -e $CEPH_QA_MAIL --suite-branch jewel --dry-run
teuthology-suite -v -c $CEPH_BRANCH -S $SHA1 -m $MACHINE_NAME -r $RERUN --suite-repo https://github.com/ceph/ceph.git --ceph-repo https://github.com/ceph/ceph.git --suite-branch jewel -p 90 -R fail,dead,running
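Spelled out, the re-run command above takes a previous run name via -r and reschedules only its failed, dead, or still-running jobs (-R fail,dead,running); the $RERUN value below is just an example picked from the table that follows:
# Sketch: variables used by the QE validation runs above. RERUN is illustrative;
# in practice it is set to the name of whichever run needs its remaining jobs rescheduled.
CEPH_QA_MAIL="ceph-qa@ceph.com"
MACHINE_NAME=smithi
CEPH_BRANCH=jewel
SHA1=d9eaab456ff45ae88e83bd633f0c4efb5902bf07
RERUN=yuriw-2017-02-23_17:29:38-rados-jewel-distro-basic-smithi   # example run name, taken from the rados row below
teuthology-suite -v -c $CEPH_BRANCH -S $SHA1 -m $MACHINE_NAME -r $RERUN \
  --suite-repo https://github.com/ceph/ceph.git --ceph-repo https://github.com/ceph/ceph.git \
  --suite-branch jewel -p 90 -R fail,dead,running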
| Suite | Runs/Reruns | Notes/Issues |
| rados | http://pulpito.ceph.com/yuriw-2017-02-23_17:29:38-rados-jewel-distro-basic-smithi/ | PASSED; one job hit #18089 (Josh approved); failed job removed by https://github.com/ceph/ceph/pull/13705 |
| | http://pulpito.ceph.com/yuriw-2017-02-24_16:57:35-rados-jewel---basic-smithi/ | |
| rgw | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:08:36-rgw-jewel-distro-basic-smithi/ | PASSED |
| | http://pulpito.ceph.com/yuriw-2017-02-24_00:03:16-rgw-jewel---basic-smithi/ | |
| rbd | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:19:29-rbd-jewel-distro-basic-smithi/ | PASSED |
| | http://pulpito.ceph.com/yuriw-2017-02-24_16:59:22-rbd-jewel---basic-smithi/ | |
| fs | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:23:09-fs-jewel-distro-basic-smithi/ | PASSED |
| | http://pulpito.ceph.com/yuriw-2017-02-24_17:02:22-fs-jewel---basic-smithi/ | |
| kcephfs | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:26:11-kcephfs-jewel-testing-basic-smithi/ | PASSED |
| knfs | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:28:17-knfs-jewel-testing-basic-smithi/ | PASSED |
| rest | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:29:02-rest-jewel-distro-basic-smithi/ | PASSED |
| hadoop | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:29:41-hadoop-jewel-distro-basic-smithi/ | PASSED |
| | http://pulpito.ceph.com/yuriw-2017-02-24_17:24:15-hadoop-jewel---basic-smithi/ | |
| ceph-deploy | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:30:27-ceph-deploy-jewel-distro-basic-vps/ | PASSED |
| ceph-disk | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:31:57-ceph-disk-jewel-distro-basic-vps/ | PASSED |
| upgrade/jewel-x/point-to-point-x | http://pulpito.front.sepia.ceph.com:80/yuriw-2017-02-23_21:35:13-upgrade:jewel-x:point-to-point-x-jewel-distro-basic-vps/ | PASSED |
| powercycle | http://pulpito.front.sepia.ceph.com/yuriw-2017-02-23_21:04:26-powercycle-jewel-testing-basic-smithi/ | PASSED |
| ceph-ansible | http://pulpito.ceph.com/yuriw-2017-02-28_22:57:14-ceph-ansible-jewel-distro-basic-ovh/ | PASSED |
| PASSED / FAILED | | |
#93 Updated by Yuri Weinstein over 6 years ago
- Description updated (diff)
#94 Updated by Yuri Weinstein over 6 years ago
- Description updated (diff)
#95 Updated by Nathan Cutler about 6 years ago
- Status changed from In Progress to Resolved
#96 Updated by Nathan Cutler about 6 years ago
- Release set to jewel