Port-amd64 archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: sometimes resumeable freeze



Following up,

It turns out that this issue is caused by the 8259 PICs not being masked
in IOAPIC mode, causing spurious interrupts in conjunction with real
ones.  I should get a PR filed soon.

On Tue, Apr 12, 2011 at 01:17:32AM +0000, Jonathan A. Kollasch wrote:
> Hi,
> 
> My quad core AMD box has an issue where it will lock up under load.
> 
> Often, when the machine is otherwise unresponsive, I'll press
> Num Lock on the pckbd(4) and it will awaken at the point in time
> it went comatose.  Eventually, pressing num lock doesn't bring it
> back and I have to force a reboot.
> 
> If I `cpuctl offline` ¾ of the CPUs the machine is acceptably stable.
> The machine is also acceptably stable under Linux with all cores
> enabled and performing a similar workload (build.sh release).
> 
> acpicpu(4) is present in the kernel and I have verified that C1E
> is not enabled.  The machine was significantly less stable when
> C1E was enabled.
> 
> I've tried to wire in a Conventional PCI SERR# trigger, but
> the chipset doesn't propagate that into an NMI and I lack
> details on the chipset required to enable that if possible.
> 
> Can anyone think of ways to debug this issue?
> 
>       Jonathan Kollasch


Home | Main Index | Thread Index | Old Index