Project

General

Profile

Actions

Bug #6778

closed

log bound mismatch errors seen

Added by Tamilarasi muthamizhan over 10 years ago. Updated over 10 years ago.

Status:
Won't Fix
Priority:
Urgent
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

logs:ubuntu@teuthology:/a/teuthology-2013-11-13_14:42:07-upgrade-parallel-next-testing-basic-vps/97229

from osd.4.log

2013-11-14 01:36:04.305857 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.0 10.214.138.142:6817/2301
6 53 ==== pg_log(0.1a epoch 16 query_epoch 16) v3 ==== 632+0+0 (499347906 0 0) 0x343cb00 con 0x3364580
2013-11-14 01:36:04.305933 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 peering] exit Started/Primary/Peering/GetLog 0.001792 2 0.000035
2013-11-14 01:36:04.305948 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 peering] enter Started/Primary/Peering/GetMissing
2013-11-14 01:36:04.305958 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 peering] exit Started/Primary/Peering/GetMissing 0.000010 0 0.000000
2013-11-14 01:36:04.305968 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 peering] enter Started/Primary/Peering/WaitFlushedPeering
2013-11-14 01:36:04.313792 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.3 10.214.138.146:6801/20814 103 ==== pg_info(1 pgs e16:0.1c(1)) v3 ==== 631+0+0 (3399545596 0 0) 0x345ca80 con 0x33658c0
2013-11-14 01:36:04.313876 7f3f62b84700  5 osd.4 pg_epoch: 16 pg[0.1c( v 10'4 (10'4,10'4] local-les=4 n=4 ec=1 les/c 4/4 3/15/3) [3,4] r=1 lpr=15 pi=1-14/3 lcod 0'0 inactive NOTIFY] exit Started/Stray 0.027670 3 0.000112
2013-11-14 01:36:04.313890 7f3f62b84700  5 osd.4 pg_epoch: 16 pg[0.1c( v 10'4 (10'4,10'4] local-les=4 n=4 ec=1 les/c 4/4 3/15/3) [3,4] r=1 lpr=15 pi=1-14/3 lcod 0'0 inactive NOTIFY] enter Started/ReplicaActive
2013-11-14 01:36:04.313900 7f3f62b84700  5 osd.4 pg_epoch: 16 pg[0.1c( v 10'4 (10'4,10'4] local-les=4 n=4 ec=1 les/c 4/4 3/15/3) [3,4] r=1 lpr=15 pi=1-14/3 lcod 0'0 inactive NOTIFY] enter Started/ReplicaActive/RepNotRecovering
2013-11-14 01:36:04.317123 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.0 10.214.138.142:6817/23016 54 ==== pg_log(0.20 epoch 16 query_epoch 16) v3 ==== 632+0+0 (2979938073 0 0) 0x3399600 con 0x3364580
2013-11-14 01:36:04.317241 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.20( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/9) [0,4] r=1 lpr=15 pi=3-14/4 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 inactive NOTIFY] exit Started/Stray 0.031166 4 0.000317
2013-11-14 01:36:04.317315 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.20( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/9) [0,4] r=1 lpr=15 pi=3-14/4 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 inactive NOTIFY] enter Started/ReplicaActive
2013-11-14 01:36:04.317338 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.20( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/9) [0,4] r=1 lpr=15 pi=3-14/4 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 inactive NOTIFY] enter Started/ReplicaActive/RepNotRecovering
2013-11-14 01:36:04.317387 7f3f63385700  0 log [ERR] : 0.20 log bound mismatch, info (0'0,10'2] actual [10:
'1,10'1]
2013-11-14 01:36:04.328498 7f3f6b1b5700  1 -- 10.214.138.146:6805/20823 --> osd.0 10.214.138.142:6817/23016 -- pg_info(1 pgs e16:0.0) v3 -- ?+0 0x3858e00
2013-11-14 01:36:04.328535 7f3f6b1b5700  1 -- 10.214.138.146:6805/20823 --> osd.3 10.214.138.146:6801/20814 -- pg_info(1 pgs e16:0.c) v3 -- ?+0 0x3684380
2013-11-14 01:36:04.330747 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.3 10.214.138.146:6801/20814 104 ==== pg_info(1 pgs e16:0.c) v3 ==== 588+0+0 (868137421 0 0) 0x3684380 con 0x33658c0
2013-11-14 01:36:04.330923 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.0 10.214.138.142:6817/23016 55 ==== pg_info(1 pgs e16:0.0) v3 ==== 596+0+0 (1929562928 0 0) 0x3858e00 con 0x3364580
2013-11-14 01:36:04.331016 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.3 10.214.138.146:6801/20814 105 ==== pg_info(1 pgs e16:0.c) v3 ==== 588+0+0 (397349786 0 0) 0x37f3180 con 0x33658c0
2013-11-14 01:36:04.331114 7f3f67b8e700  1 -- 10.214.138.146:6805/20823 <== osd.0 10.214.138.142:6817/23016 56 ==== pg_info(1 pgs e16:0.0) v3 ==== 596+0+0 (1565663401 0 0) 0x345d500 con 0x3364580
2013-11-14 01:36:04.333519 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 peering] enter Started/Primary/Peering/WaitFlushedPeering
2013-11-14 01:36:04.333551 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 peering] exit Started/Primary/Peering 0.047196 1 0.000057
2013-11-14 01:36:04.333576 7f3f63385700  5 osd.4 pg_epoch: 16 pg[0.1a( v 10'2 (0'0,10'2] local-les=10 n=2 ec=1 les/c 10/10 9/15/3) [4,0] r=0 lpr=15 pi=9-14/1 (log bound mismatch, actual=[10'1,10'1]) lcod 0'0 mlcod 0'0 inactive] enter Started/Primary/Active
2013-11-14 01:36:04.333607 7f3f63385700  0 log [ERR] : 0.1a log bound mismatch, info (0'0,10'2] actual [10'1,10'1]

ubuntu@teuthology:/a/teuthology-2013-11-13_14:42:07-upgrade-parallel-next-testing-basic-vps/97229$ cat config.yaml 
archive_path: /var/lib/teuthworker/archive/teuthology-2013-11-13_14:42:07-upgrade-parallel-next-testing-basic-vps/97229
description: upgrade-parallel/stress-split/{0-cluster/start.yaml 1-dumpling-install/dumpling.yaml
  2-partial-upgrade/firsthalf.yaml 3-thrash/default.yaml 4-mon/more.yaml 5-workload/radosbench.yaml
  6-next-mon/monb.yaml 7-workload/rados_api_tests.yaml distro/debian_7.0.yaml}
email: null
job_id: '97229'
kernel:
  kdb: true
  sha1: 68174f0c97e7c0561aa844059569e3cbf0a43de1
last_in_suite: false
machine_type: vps
name: teuthology-2013-11-13_14:42:07-upgrade-parallel-next-testing-basic-vps
nuke-on-error: true
os_type: debian
os_version: '7.0'
overrides:
  admin_socket:
    branch: next
  ceph:
    conf:
      mon:
        debug mon: 20
        debug ms: 1
        debug paxos: 20
      osd:
        debug ms: 1
        debug osd: 5
    log-whitelist:
    - slow request
    - wrongly marked me down
    - objects unfound and apparently lost
    sha1: aef3378bd721ff4b73ad3a7a8b07e5f6e2e578f8
  ceph-deploy:
    branch:
      dev: next
    conf:
      client:
        log file: /var/log/ceph/ceph-$name.$pid.log
      mon:
        debug mon: 1
        debug ms: 20
        debug paxos: 20
  install:
    ceph:
      sha1: aef3378bd721ff4b73ad3a7a8b07e5f6e2e578f8
  s3tests:
    branch: master
  workunit:
    sha1: aef3378bd721ff4b73ad3a7a8b07e5f6e2e578f8
owner: scheduled_teuthology@teuthology
roles:
- - mon.a
  - mon.b
  - mds.a
  - osd.0
  - osd.1
  - osd.2
- - osd.3
  - osd.4
  - osd.5
  - client.0
  - mon.c
targets:
  ubuntu@vpm067.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCjf92b4a0WtyfJtk8hsaEHEGdH8mLjWbHXrPW5Cu47IbpjIt3DGnjC2dwJo+FZJGsqWB+eI5JYgy1f3WS8N90X7WIqM1Clv97PbaB6L1J0TPfauUuzwO7TqxZ82dPYdxFxRKw5+TGpf9C0XklCl2QgOsW1FWaVtzI4IlGzp721/M/8/UVZ7jriZcyVViWs7Xdhrt8l+HPFHUju/nlvTDFt5cOJetfD1NEx8++yn2a8G6zSCgYD5cVqmHDThEqzLkkVTWQCGDcePMYuGRBuk+6Yb7fc8vgzLy8k7UVibbtGhK7hvUJWqCrvWYuM5Vf3yUoyynLo4m/ltypibgEEFPML
  ubuntu@vpm069.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCoNVg07fLldgVxIfvrYevGIssQB90EQOa1ZwkE9qmHEVVerWF2kMAROLdSGEaETlZGzsKv8CRF1wotnZFh8LSPlP/AB8v3NSsq9r8rkoTP6jnD7xaq42R+DTsxAKmGPwzgcrNxbUFkpEh/NaDH1GFwDD6dKwtLQ/csZSTXkiOZfyuGX8Svt14fjD+h2o+C/v3lndmEyI7iBYzJ7Ji7jV9E7Fub9pJpBqlXJYQsIt5IaWqRSiFx2wakSIh2+3eJU2x5z/sRZDMQ+MS7enX5pZpMFNASdF8bfPCl4tarDDtyNQgUg4aGdjjPePBgzvPfwxtLsNl/RPH1n+tKVCp6t4Fx
tasks:
- internal.lock_machines:
  - 2
  - vps
- internal.save_config: null
- internal.check_lock: null
- internal.connect: null
- internal.check_conflict: null
- internal.check_ceph_data: null
- internal.vm_setup: null
- internal.base: null
- internal.archive: null
- internal.coredump: null
- internal.sudo: null
- internal.syslog: null
- internal.timer: null
- chef: null
- clock.check: null
- install:
    branch: dumpling
- ceph: null
- install.upgrade:
    osd.0: null
- ceph.restart:
    daemons:
    - osd.0
    - osd.1
    - osd.2
- thrashosds:
    chance_pgnum_grow: 1
    chance_pgpnum_fix: 1
    timeout: 1200
- ceph.restart:
    daemons:
    - mon.a
    wait-for-healthy: false
    wait-for-osds-up: true
- radosbench:
    clients:
    - client.0
    time: 1800
- ceph.restart:
    daemons:
    - mon.b
    wait-for-healthy: false
    wait-for-osds-up: true
- ceph.wait_for_mon_quorum:
  - a
  - b
- workunit:
    branch: dumpling
    clients:
      client.0:
      - rados/test.sh
teuthology_branch: master
verbose: true

Actions #1

Updated by Samuel Just over 10 years ago

  • Status changed from New to Won't Fix

Added to whitelist.

Actions

Also available in: Atom PDF