General

Profile

Justin Lott

Issues

open closed Total
Assigned issues 0 0 0
Reported issues 0 3 3

Activity

01/16/2013

10:28 AM Ceph Bug #3801: Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAILED assert(0 == "hit suicide timeout")
Sage Weil wrote:
> The osd.40 error means the fs returned EIO on a read operation. Check yoru kern.org.. there is p...
Justin Lott

01/15/2013

07:43 AM Ceph Bug #3801 (Won't Fix): Cascading OSD failures beginning with common/HeartbeatMap.cc: 78: FAILED assert(0 == "hit suicide timeout")
0.48.2argonaut
Relevant logs are attached. Core dumps are available if needed....
Justin Lott

01/07/2013

12:16 PM Ceph Bug #3702: OSD SIGABRT during startup
Dan Mick wrote:
> Is this related to rbd, or should it be in category 'ceph'?
Ah, yes, it should. Thank you for c...
Justin Lott

01/04/2013

06:06 PM Ceph Bug #3702: OSD SIGABRT during startup
Sage Weil wrote:
> Was the monitor also running 0.48.2argonaut when osd.131 originally crashed? Or something else?
...
Justin Lott

01/02/2013

12:39 PM Ceph Bug #3702: OSD SIGABRT during startup
Attempting to start osd.131 (which was down due to the above noted problems) today resulted in quorum loss. Essential... Justin Lott

12/31/2012

09:06 AM Ceph Bug #3702 (Can't reproduce): OSD SIGABRT during startup
After conversion of OSD's from btrfs to XFS, some OSD's SIGABRT during their first startup on XFS:
2012-12-29 05:0...
Justin Lott

12/28/2012

12:08 PM rbd Bug #3692: OSD's abort with "./common/Mutex.h: 89: FAILED assert(nlock == 0)"
Chronology of events (UTC) in the latest example of this happening, in case it's relevant:
15:50:46 mon.b is s...
Justin Lott
12:01 PM rbd Bug #3692 (Won't Fix): OSD's abort with "./common/Mutex.h: 89: FAILED assert(nlock == 0)"
I've seen this happen twice:
- Reboot a node running a number of OSD's
- Within a short period of time, seemingly...
Justin Lott

Also available in: Atom