Subject: Re: kern/25285: i386 MP panic: TLB IPI rendezvous failed (mask 1)
To: None <dokas@cs.umn.edu>
From: Paul Dokas <dokas@cs.umn.edu>
List: current-users
Date: 06/07/2004 10:43:23
On Sun, 6 Jun 2004 20:59:16 -0500, Paul Dokas <dokas@cs.umn.edu> wrote:
> 
> The quad Xeon that I put this patch onto panic'd sometime on Friday night.
> I'll put the detail here tomorrow after I've had a change to visit the
> computer (I don't have a serial console on it yet).

I've looked over the machine and here's what I found.

For CPU 0, the backtrace is exactly as in my previous emails.  CPU 2
and CPU 4 are both idle (for some reason it numbers the CPUs 0, 2, 4, 6)

However, CPU 6 shows something interesting.  Here's the backtrace:

  acquire()
  spinlock_acquire_count()
  _kernel_lock_acquire_count()
  mi_switch()
  ltsleep()
  rf_RaidIOThread()

I do have 4 136GB disks in a striped raidset.  My guess is that this is
the cause of the problem for me and probably falls into the "well, don't
do that" category.  Although, it would be really nice to be able to
use SMP and RaidFrame at the same time.

In the meanwhile, I'm going to stop using RaidFrame.


Paul
-- 
Paul Dokas                                         dokas at cs.umn.edu
======================================================================
Don Juan Matus:  "an enigma wrapped in mystery wrapped in a tortilla."