Project

General

Profile

Actions

Bug #50355

open

Monitor node crashing while creating new clay pool

Added by Pietari Hyvärinen about 3 years ago. Updated over 2 years ago.

Status:
New
Priority:
Normal
Assignee:
-
Category:
EC Pools
Target version:
-
% Done:

0%

Source:
Community (user)
Tags:
Backport:
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Component(RADOS):
Pull request ID:
Crash signature (v1):
Crash signature (v2):

Description

# ceph osd erasure-code-profile set CLAY8-4-11 plugin=clay k=8 m=4 d=11 crush-failure-domain=host
# ceph osd pool create clay8-4-11 erasure CLAY8-4-11

(meanwhile 1/5 mons goes down.)

load: clay pool 'clay8-4-11' created

# ceph crash info 2021-04-14T12:21:16.238842Z_19631c22-0260-4d8d-a3f7-d9a163522bc9
{
    "assert_condition": "(*__errno_location ()) == 4",
    "assert_file": "/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.0/rpm/el8/BUILD/ceph-16.2.0/src/common/fork_function.h",
    "assert_func": "int fork_function(int, std::ostream&, std::function<signed char()>)",
    "assert_line": 38,
    "assert_msg": "/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.0/rpm/el8/BUILD/ceph-16.2.0/src/common/fork_function.h: In function 'int fork_function(int, std::ostream&, std::function<signed char()>)' thread 7fe067c7e700 time 2021-04-14T12:21:16.228050+0000\n/home/jenkins-build/build/workspace/ceph-build/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos8/DIST/centos8/MACHINE_SIZE/gigantic/release/16.2.0/rpm/el8/BUILD/ceph-16.2.0/src/common/fork_function.h: 38: FAILED ceph_assert((*__errno_location ()) == 4)\n",
    "assert_thread_name": "fn_monstore",
    "backtrace": [
        "/lib64/libpthread.so.0(+0x12b20) [0x7fe075a60b20]",
        "gsignal()",
        "abort()",
        "(ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x1a9) [0x7fe077f66399]",
        "/usr/lib64/ceph/libceph-common.so.2(+0x276562) [0x7fe077f66562]",
        "(CrushTester::test_with_fork(int)+0x796) [0x7fe078512e46]",
        "(OSDMonitor::prepare_new_pool(std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >&, int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, unsigned int, unsigned int, unsigned int, unsigned long, unsigned long, float, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, unsigned int, unsigned long, OSDMonitor::FastReadType, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&, std::ostream*)+0x42c) [0x558965d0176c]",
        "(OSDMonitor::prepare_command_impl(boost::intrusive_ptr<MonOpRequest>, std::map<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, boost::variant<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, bool, long, double, std::vector<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > >, std::vector<long, std::allocator<long> >, std::vector<double, std::allocator<double> > >, std::less<void>, std::allocator<std::pair<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const, boost::variant<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, bool, long, double, std::vector<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >, std::allocator<std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > > >, std::vector<long, std::allocator<long> >, std::vector<double, std::allocator<double> > > > > > const&)+0x184fa) [0x558965d2994a]",
        "(OSDMonitor::prepare_command(boost::intrusive_ptr<MonOpRequest>)+0xf4) [0x558965d34e44]",
        "(OSDMonitor::prepare_update(boost::intrusive_ptr<MonOpRequest>)+0x373) [0x558965d390b3]",
        "(PaxosService::dispatch(boost::intrusive_ptr<MonOpRequest>)+0xa6d) [0x558965cba8ed]",
        "(PaxosService::C_RetryMessage::_finish(int)+0x64) [0x558965c0ed34]",
        "(C_MonOp::finish(int)+0x49) [0x558965bad9e9]",
        "(Context::complete(int)+0xd) [0x558965baaf6d]",
        "(void finish_contexts<std::__cxx11::list<Context*, std::allocator<Context*> > >(ceph::common::CephContext*, std::__cxx11::list<Context*, std::allocator<Context*> >&, int)+0xa5) [0x558965bd6a55]",
        "(PaxosService::_active()+0x1cb) [0x558965cbb7fb]",
        "(Context::complete(int)+0xd) [0x558965baaf6d]",
        "(void finish_contexts<std::__cxx11::list<Context*, std::allocator<Context*> > >(ceph::common::CephContext*, std::__cxx11::list<Context*, std::allocator<Context*> >&, int)+0xa5) [0x558965bd6a55]",
        "(Paxos::finish_round()+0x169) [0x558965cb20e9]",
        "(Paxos::commit_finish()+0x8c0) [0x558965cb4c10]",
        "(C_Committed::finish(int)+0x45) [0x558965cb8845]",
        "(Context::complete(int)+0xd) [0x558965baaf6d]",
        "(MonitorDBStore::C_DoTransaction::finish(int)+0x98) [0x558965cb8568]",
        "(Context::complete(int)+0xd) [0x558965baaf6d]",
        "(Finisher::finisher_thread_entry()+0x1a5) [0x7fe078005485]",
        "/lib64/libpthread.so.0(+0x814a) [0x7fe075a5614a]",
        "clone()" 
    ],
    "ceph_version": "16.2.0",
    "crash_id": "2021-04-14T12:21:16.238842Z_19631c22-0260-4d8d-a3f7-d9a163522bc9",
    "entity_name": "mon.devel-mon0",
    "os_id": "centos",
    "os_name": "CentOS Linux",
    "os_version": "8",
    "os_version_id": "8",
    "process_name": "ceph-mon",
    "stack_sig": "cab9fae193e00d677d17df21977fd8ff289735b590a62f5d80d50397bfbd9a84",
    "timestamp": "2021-04-14T12:21:16.238842Z",
    "utsname_hostname": "devel-mon0.novalocal",
    "utsname_machine": "x86_64",
    "utsname_release": "4.18.0-147.3.1.el8_1.x86_64",
    "utsname_sysname": "Linux",
    "utsname_version": "#1 SMP Fri Jan 3 23:55:26 UTC 2020" 
}
Actions #1

Updated by Pietari Hyvärinen about 3 years ago

Maybe related #46266.

jerasure pool creation is successfull ( without mon crash).

Actions #2

Updated by Loïc Dachary about 3 years ago

  • Target version changed from v16.2.1 to v16.2.2
Actions #3

Updated by Sage Weil almost 3 years ago

  • Project changed from Ceph to RADOS
  • Category deleted (Monitor)
Actions #4

Updated by Loïc Dachary almost 3 years ago

  • Target version deleted (v16.2.2)
Actions #5

Updated by Neha Ojha over 2 years ago

  • Category set to EC Pools
Actions

Also available in: Atom PDF