Bug #57956: Ceph monitors in crash loop - Ceph - Ceph

Actions

Copy link

Bug #57956

open

Ceph monitors in crash loop

Added by liu jun over 1 year ago. Updated 11 months ago.

Status:

New

Priority:

Normal

Assignee:

Category:

Monitor

Target version:

% Done:

Source:

Tags:

mon

Backport:

Regression:

Severity:

1 - critical

Reviewed:

Affected Versions:

ceph-qa-suite:

Pull request ID:

Crash signature (v1):

Crash signature (v2):

Description

Creating a pool causes mon to restart

This is the detailed question：https://github.com/rook/rook/issues/10110

https://github.com/rook/rook/issues/11242
https://github.com/rook/rook/issues/11081

Environment:

OS : Fedora CoreOS 35.20220327.3.0
Kernel : 5.16.16-200.fc35.x86_64
Cloud provider or hardware configuration: Dell FC640/MX740c
Rook version: 1.8.8/1.9.0
Storage backend version : 16.2.7
Kubernetes version : 1.23.5
Kubernetes cluster type: Baremetal (self-managed/vanilla)
Storage backend status : Keeps timings out when mons are crashing and updated with number of mons down

Actions

Copy link

Updated by liu jun over 1 year ago

liu jun wrote:

Creating a pool causes mon to restart

This is the detailed question：https://github.com/rook/rook/issues/10110

https://github.com/rook/rook/issues/11242
https://github.com/rook/rook/issues/11081

Environment:

OS : Fedora CoreOS 35.20220327.3.0
Kernel : 5.16.16-200.fc35.x86_64
Cloud provider or hardware configuration: Dell FC640/MX740c
Rook version: 1.8.8/1.9.0
Storage backend version : 16.2.7
Kubernetes version : 1.23.5
Kubernetes cluster type: Baremetal (self-managed/vanilla)
Storage backend status : Keeps timings out when mons are crashing and updated with number of mons down

Create 3 mons normally, check the mon node process after initialization, you will find that ms_dispatch and fn_monstore will have cpu 100% problem, and pg pool is being created at this time. Quickly kill these two processes ceph cluster is normal.

Actions

Copy link

Updated by Ilya Dryomov 11 months ago

Target version deleted (~~v17.2.6~~)

Actions

Copy link

Also available in: Atom PDF

Project

General

Profile

Ceph

Custom queries

Bug #57956

Ceph monitors in crash loop

Updated by liu jun over 1 year ago

Updated by Ilya Dryomov 11 months ago