Port-sparc archive

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]

Re: SuperSPARCvs HyperSPARC and SS20 stability



Hi Martin,

I think I have two different issues.

1) SS20 + HyperSparc + NetBSD 10: that caused system freezes, unusable system
2) SS20 + SuperSparc + NetBSD 10: "hangs of processes"

in addition, I noticed that I never experienced 1) in the past and it is not reproduceable using the same processors on the SS10 with NetBSD 9.4, neither did I experience 2) on 9.x using HyperSparcs.

Martin Husemann wrote:

It causes userland processes (using pthreads) to do basically busy loops in
some mutex operation and never make any process. Once you are in this
state, the process never recovers. I think I could always (manually)
kill them, but right now I am not 100% sure of that.

What do you mean by "busy loop" ? would you see 100% usage on the stuck process (or more processes?).

In my case the processes seem to be parked, waiting or otherwise stalled, I start building a package and after a while CPU drops to 0%.

Depending on the process it might or might not be Ctrl-C or SIGTERM, but it always ends with kill -9! so the system always recovers nicely there

I have never seen them hang the whole machine (i.e. ddb on the console
still works).

The process hang I speak about does not hang. Only the Combination SS20-HyperSparc-NetBSD10 did really crash. So my guess that is another problem.. Either a strange CPU/ROM/BusSpeed issue where the SS10 and SS20 differ


My reliable way to trigger this issue is: unpack a sparc userland from the
autobuilds on any sparc64 SMP machine, chroot to it and run the ATF test
suite. It will reliably stop making progress quickly when enough RUMP servers

Is there a simpler way I could try to run on a small space, limited machine like mines? can I get the ATF test suite somehow "alone"? as a port maybe?


Riccardo


Home | Main Index | Thread Index | Old Index