Project

General

Profile

Actions

Bug #7177

closed

osd crashed at OSD::advance_pg on master branch

Added by Tamilarasi muthamizhan over 10 years ago. Updated over 10 years ago.

Status:
Resolved
Priority:
Urgent
Assignee:
Category:
-
Target version:
-
% Done:

0%

Source:
Q/A
Tags:
Backport:
Regression:
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

logs: :/home/ubuntu/bug_osd_crash

hit this issue when trying to run the rest/test.py script,

here is the config file that i used,


roles:
- [mon.a, mon.b, mds.a, osd.0, osd.1]
- [client.0]

targets:
  ubuntu@mira057.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQC+IR8k7Hrnnl7zZUj9Kyb5GrlCodfuyvxkpokRLWZGLbjPOzd3gdszhLCWa0F2FVzl/2upKr9VfMzoVYF5Q3eKn7sQJ1AmDdvHINKM6hYnm2ruKxzLCjK11wdr5Gt/WFQ3g6U5YFjIX19cLVLhrPwj0aM+27cTv+6KZrl56dPwRj7vVnyB7CIUmc1NpbD/LN+Oan+DISnWNvSUrdq0e70owvuZv2uHgWOJstErLD/arxQ97A1AdxLcfi8sAA12Gu3if4t+Aq+6KmZorrxQimni06b7vWr9EC5NDuxcOm5ReGkyy45ED4QK7yVCmzmQFpRX+X1ZLbQq71zcQf7E3EW7
  ubuntu@mira074.front.sepia.ceph.com: ssh-rsa AAAAB3NzaC1yc2EAAAADAQABAAABAQCuzTGrQ9CaYud6lfhAXosbRh8/P1xCeTQfxuj5QYWYJf079r2b4IPlhW+rOc2ZfK5HkOatZH0+eV6eMZREYMLZNn8n+S3jQclWpyyoI6U0B0TP65ByYRtI2f+wvab5TGBWXHasGLNQh7zzxadhLWMVQ9AT/7c5oJTEHe1+BRIfvR0dBpK/cCrOlVjwcGUYkZn6s/My216zbVVuENHXa62NJBAlmNEWJsJHRh9IEDB+Cl+PmD+qD5zAWgJr2e2OtOWh+9v8v6YWOyO3KEhg/BKKxmBevkdcKZTcybjARDjU2IMu9nyeOhH1F+8xQQJ7dDRQ5TA7DYH6lKO9iEHD8YFr

tasks:
- chef:
- install:
    branch: master
- ceph:
    log-whitelist:
    - wrongly marked me down
- rest-api: [client.0]
- workunit:
    clients:
      client.0:
         - rest/test.py

     0> 2014-01-17 16:05:49.755198 7fbb1dd10700 -1 ./osd/OSDMap.h: In function 'void OSDMap::pg_to_up_acting_osds(pg_t, std::vector<int>&, std::vector<int>&) const' thread 7fbb1dd10700 time 2014-01-17 16:0
5:49.727141
./osd/OSDMap.h: 577: FAILED assert(acting.empty() || acting_primary == acting.front())

 ceph version 0.75-293-g6e93132 (6e931326b080f6f8d306c7e9e9b23268e3e8b814)
 1: (OSD::advance_pg(unsigned int, PG*, ThreadPool::TPHandle&, PG::RecoveryCtx*, std::set<boost::intrusive_ptr<PG>, std::less<boost::intrusive_ptr<PG> >, std::allocator<boost::intrusive_ptr<PG> > >*)+0x62d
) [0x63126d]
 2: (OSD::process_peering_events(std::list<PG*, std::allocator<PG*> > const&, ThreadPool::TPHandle&)+0x1d3) [0x631453]
 3: (OSD::PeeringWQ::_process(std::list<PG*, std::allocator<PG*> > const&, ThreadPool::TPHandle&)+0x12) [0x67a372]
 4: (ThreadPool::worker(ThreadPool::WorkThread*)+0x4e6) [0x997a76]
 5: (ThreadPool::WorkThread::entry()+0x10) [0x999880]
 6: (()+0x7e9a) [0x7fbb323aae9a]
 7: (clone()+0x6d) [0x7fbb3096b3fd]
 NOTE: a copy of the executable, or `objdump -rdS <executable>` is needed to interpret this.

Actions #1

Updated by Greg Farnum over 10 years ago

  • Status changed from New to 4

Unfortunately I can't make much sense of the core dump.
Fortunately just by looking at the code involved I'm pretty sure I know what the problem is, and I've pushed a fix to wip-osdmap-7177, and I'll submit a pull request as soon as the gitbuilders are happy with it.

Actions #2

Updated by Sage Weil over 10 years ago

  • Status changed from 4 to Resolved
Actions

Also available in: Atom PDF