Subject: Re: xen network issues
To: Manuel Bouyer <bouyer@antioche.eu.org>
From: Johan Ihren <johani@johani.org>
List: port-xen
Date: 03/01/2006 02:46:31
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Manuel,

>> * 100% interrupt rate
>> * oodles of "sip0: FIFO ring overrun" on one server
>> * "fxp0: device timeout" on the other server
>> * oodles of "nfs_timer: ignoring error 64" on all domUs
>>
>> What I did *not* find was any massive network traffic. I.e. no raging
>> storms that I could see.
>
> Hi,
> how old are your dom0 and domU kernels ? An issue have been fixed  
> recently
> (virtual interfaces not checking the ethernet addresses in packets)
> which could cause the kind of issue you're seeing.
> The fix has been pulled up to the netbsd-3 and netbsd-3-0 branches.

All kernels were original 3.0 machines, only modified to allow the  
dom0s to deal with larger numbers of vnd devices, i.e. without that  
fix (that I had not noticed).

I will rebuild and see if I can trigger this again. However, as I've  
had zero trouble in all my prior testing I guess I will not easily  
verify if this was indeed my problem, but will have to wait until the  
next time I have live students (nothing beats students for finding  
problems). I will let you know whether that works out or if I end up  
in trouble again...

But I agree the (fixed) problem sounds like a plausible explanation  
to the problems I saw. Especially as I run a larger number of  
simultaneous domUs than most (10+) and hence would probably be harder  
hit by that problem.

Thanks,

Johan

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.0 (Darwin)

iD8DBQFEBP0AKJmr+nqSTbYRAkYOAKCkA+j56Momk8vFL1uIEmswVrteewCeK1mV
aIhSpf6KznorW3YuM5CZ1mo=
=0ZuH
-----END PGP SIGNATURE-----