Project

General

Profile

Bug #40018

crash in io_context thread when lots of connections abort

Added by Ali Maredia 6 months ago. Updated 3 months ago.

Status:
Pending Backport
Priority:
High
Target version:
-
Start date:
05/23/2019
Due date:
% Done:

0%

Source:
Tags:
Backport:
luminous, mimic, nautilus
Regression:
No
Severity:
3 - minor
Reviewed:
Affected Versions:
ceph-qa-suite:
Pull request ID:
Crash signature:

Description

During a small percentage of runs of the full rgw:verify suite the radosgw crashes in the middle of a run of the java s3-tests suite.

See .txt files with snippits of teuthology logs and the link to the logs themselves.

java_s3_tests_crash_log_1.txt View (238 KB) Ali Maredia, 05/23/2019 09:47 PM

java_s3_tests_crash_log_2.txt View (100 KB) Ali Maredia, 05/23/2019 09:47 PM


Related issues

Copied to rgw - Backport #41569: nautilus: crash in io_context thread when lots of connections abort Resolved
Copied to rgw - Backport #41570: mimic: crash in io_context thread when lots of connections abort Resolved
Copied to rgw - Backport #41571: luminous: crash in io_context thread when lots of connections abort New

History

#2 Updated by Matt Benjamin 6 months ago

  • Status changed from New to Triaged
  • Assignee set to Casey Bodley

#3 Updated by Abhishek Lekshmanan 5 months ago

  • Priority changed from Normal to High

#4 Updated by Abhishek Lekshmanan 5 months ago

it looks like the io_context::run() ec overload variant is deprecated. https://www.boost.org/doc/libs/1_66_0/doc/html/boost_asio/reference/io_context/run/overload2.html

Following down the stacktrace, it seems like the ec value is ignored in boost::asio::detail::scheduler_operation::complete

#5 Updated by Casey Bodley 5 months ago

  • Subject changed from RGW is crashing after some Java S3 tests to crash in io_context thread when lots of connections abort

#6 Updated by Abhishek Lekshmanan 4 months ago

  • Status changed from Triaged to In Progress
  • Assignee changed from Casey Bodley to Abhishek Lekshmanan

#7 Updated by Nathan Cutler 3 months ago

  • Status changed from In Progress to Pending Backport
  • Backport set to mimic, nautilus
  • Pull request ID set to 29967

#8 Updated by Nathan Cutler 3 months ago

  • Copied to Backport #41569: nautilus: crash in io_context thread when lots of connections abort added

#9 Updated by Nathan Cutler 3 months ago

  • Copied to Backport #41570: mimic: crash in io_context thread when lots of connections abort added

#10 Updated by Nathan Cutler 3 months ago

  • Backport changed from mimic, nautilus to luminous, mimic, nautilus

#11 Updated by Nathan Cutler 3 months ago

  • Copied to Backport #41571: luminous: crash in io_context thread when lots of connections abort added

Also available in: Atom PDF