Project

General

Profile

Bug #38452

Updated by Patrick Donnelly about 5 years ago

Here is the stack trace, it was caused by a Postgresql trying to unlink a file in the log archive. 

 <pre> 
  

  ceph version 13.2.4 (b10be4d44915a4d78a8e06aa31919e74927b142e) mimic (stable) 
  1: (()+0x4058c0) [0x5588549cd8c0] 
  2: (()+0x12890) [0x7fdf788d3890] 
  3: (gsignal()+0xc7) [0x7fdf779c6e97] 
  4: (abort()+0x141) [0x7fdf779c8801] 
  5: (ceph::__ceph_assert_fail(char const*, char const*, int, char const*)+0x250) [0x7fdf78fbd530] 
  6: (()+0x26d5a7) [0x7fdf78fbd5a7] 
  7: (Server::_unlink_local(boost::intrusive_ptr<MDRequestImpl>&, CDentry*, CDentry*)+0x15f8) [0x5588547482e8] 
  8: (Server::handle_client_unlink(boost::intrusive_ptr<MDRequestImpl>&)+0x961) [0x558854748cd1] 
  9: (Server::handle_client_request(MClientRequest*)+0x49b) [0x55885476305b] 
  10: (Server::dispatch(Message*)+0x2db) [0x558854766d1b] 
  11: (MDSRank::handle_deferrable_message(Message*)+0x434) [0x5588546da1e4] 
  12: (MDSRank::_dispatch(Message*, bool)+0x89b) [0x5588546e7a1b] 
  13: (MDSRank::retry_dispatch(Message*)+0x12) [0x5588546e8012] 
  14: (MDSInternalContextBase::complete(int)+0x67) [0x55885494df87] 
  15: (MDSRank::_advance_queues()+0xf1) [0x5588546e69c1] 
  16: (MDSRank::ProgressThread::entry()+0x43) [0x5588546e7043] 
  17: (()+0x76db) [0x7fdf788c86db] 
  18: (clone()+0x3f) [0x7fdf77aa988f] 
 </pre> 

 Attached a debug-ms=10 file, however I can't reproduce as I fixed it by killing the ceph-fuse client on our Postgres server et restarting the MDS.

Back