Subject: Re: xen network issues
To: Manuel Bouyer <bouyer@antioche.eu.org>
From: Johan Ihren <johani@johani.org>
List: port-xen
Date: 03/01/2006 02:46:31
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi Manuel,
>> * 100% interrupt rate
>> * oodles of "sip0: FIFO ring overrun" on one server
>> * "fxp0: device timeout" on the other server
>> * oodles of "nfs_timer: ignoring error 64" on all domUs
>>
>> What I did *not* find was any massive network traffic. I.e. no raging
>> storms that I could see.
>
> Hi,
> how old are your dom0 and domU kernels ? An issue have been fixed
> recently
> (virtual interfaces not checking the ethernet addresses in packets)
> which could cause the kind of issue you're seeing.
> The fix has been pulled up to the netbsd-3 and netbsd-3-0 branches.
All kernels were original 3.0 machines, only modified to allow the
dom0s to deal with larger numbers of vnd devices, i.e. without that
fix (that I had not noticed).
I will rebuild and see if I can trigger this again. However, as I've
had zero trouble in all my prior testing I guess I will not easily
verify if this was indeed my problem, but will have to wait until the
next time I have live students (nothing beats students for finding
problems). I will let you know whether that works out or if I end up
in trouble again...
But I agree the (fixed) problem sounds like a plausible explanation
to the problems I saw. Especially as I run a larger number of
simultaneous domUs than most (10+) and hence would probably be harder
hit by that problem.
Thanks,
Johan
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.0 (Darwin)
iD8DBQFEBP0AKJmr+nqSTbYRAkYOAKCkA+j56Momk8vFL1uIEmswVrteewCeK1mV
aIhSpf6KznorW3YuM5CZ1mo=
=0ZuH
-----END PGP SIGNATURE-----