NetBSD-Bugs archive
Re: port-amd64/52596 (Another netbsd-8 panic)
The following reply was made to PR port-amd64/52596; it has been noted by GNATS.
From: Kamil Rytarowski <n54%gmx.com@localhost>
To: gnats-bugs%NetBSD.org@localhost
Cc:
Subject: Re: port-amd64/52596 (Another netbsd-8 panic)
Date: Mon, 9 Oct 2017 21:21:13 +0200
Message-ID: <f2f55b8f-b697-c125-ad36-40318b8c93bb%gmx.com@localhost>
References: <pr-port-amd64-52596%gnats.netbsd.org@localhost>
<20171006053940.1CB755CD8%yenn.ulegend.net@localhost>
<20171009130002.6603D7A2B0%mollari.NetBSD.org@localhost>
<20171009135258.GA11341%yenn.ulegend.net@localhost>
In-Reply-To: <20171009135258.GA11341%yenn.ulegend.net@localhost>
On 09.10.2017 15:52, Dominik Bialy wrote:
> On Mon, Oct 09, 2017 at 01:00:02PM +0000, Kamil Rytarowski wrote:
>> The following reply was made to PR port-amd64/52596; it has been noted by GNATS.
>>
>> From: Kamil Rytarowski <n54%gmx.com@localhost>
>> To: gnats-bugs%NetBSD.org@localhost
>> Cc:
>> Subject: Re: port-amd64/52596 (Another netbsd-8 panic)
>> Date: Mon, 9 Oct 2017 14:58:28 +0200
>>
>> On 09.10.2017 14:40, Dominik Bialy wrote:
>> > The following reply was made to PR port-amd64/52596; it has been noted by GNATS.
>> >
>> > From: Dominik Bialy <dmb%yenn.ulegend.net@localhost>
>> > To: coypu%sdf.org@localhost
>> > Cc: Dominik Bialy <dmb%yenn.ulegend.net@localhost>, gnats-bugs%NetBSD.org@localhost
>> > Subject: Re: port-amd64/52596 (Another netbsd-8 panic)
>> > Date: Mon, 9 Oct 2017 14:37:47 +0200
>> >
>> > On Mon, Oct 09, 2017 at 10:13:34AM +0000, coypu%sdf.org@localhost wrote:
>> > > On Mon, Oct 09, 2017 at 10:01:47AM +0200, Dominik Bialy wrote:
>> > > > Current sysctls are:
>> > > >
>> > > > yenn# sysctl machdep.xsave_features
>> > > > machdep.xsave_features = 0
>> > > > yenn# sysctl machdep.fpu_save
>> > > > machdep.fpu_save = 1
>> > > >
>> > > > I'll try applying the patch today and building the kernel.
>> > >
>> > > sorry, I misread the code, it shouldn't make a functional difference
>> > > either way.
>> > >
>> > > do you have a coredump in /var/crash?
>> > > can you:
>> > > gunzip netbsd.3.core.gz
>> > > gunzip netbsd.3.gz
>> > > crash -M netbsd.3.core -N netbsd.3
>> > >
>> > > crash> dmesg
>> > > (only to confirm it died at the same spot)
>> > > crash> examine x86_xsave_features
>> > > crash> bt
>> >
>> > I found one coredump from Sep 23 (sources were
>> > dated around Sep 15.)
>> >
>> > fatal privileged instruction fault in supervisor mode
>> > trap type 0 code 0 rip 0xffffffff80224a52 cs 0x8 rflags 0x10016 cr2 0x75ba90c36d60 ilevel 0x8 rsp 0xfffffe804057bea8
>> > curlwp 0xfffffe81318c2720 pid 391.2 lowest kstack 0xfffffe80405792c0
>> > panic: trap
>> > cpu1: Begin traceback...
>> > vpanic() at netbsd:vpanic+0x140
>> > snprintf() at netbsd:snprintf
>> > startlwp() at netbsd:startlwp
>> > alltraps() at netbsd:alltraps+0x96
>> > fpudna() at netbsd:fpudna+0x61
>> > cpu1: End traceback...
>> >
>> > dumping to dev 18,1 (offset=132519, size=1032011):
>> > dump
>> > crash> examine x86_xsave_features
>> > x86_xsave_features: 160b78a0
>>
>> Looks like trash..
>>
>> Please try:
>> examine x86_fpu_save_size
>> examine x86_fpu_save
>> examine i386_nocpuid_cpus
>>
>> (checking if the stack has been damaged)
>
> yenn# crash -M netbsd.6.core -N netbsd.6
> Crash version 8.0_BETA, image version 8.0_BETA.
> System panicked: trap
> Backtrace from time of crash is available.
> crash> examine x86_fpu_save_size
> x86_fpu_save_size: 200
> crash> examine x86_fpu_save
> x86_fpu_save: 1
> crash> examine i386_nocpuid_cpus
> i386_nocpuid_cpus: 1
> crash>
>
So something overwrites x86_xsave_features with trash.
A valid value would look like this:
$ sysctl machdep.xsave_features
machdep.xsave_features = 7
Unless I am missing something, the only place that sets this value is in:
/src/sys/arch/x86/x86/identcpu.c: cpu_probe_fpu(struct cpu_info *ci)
x86_xsave_features = (uint64_t)descs[3] << 32 | descs[0];
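To make the assignment above concrete, here is a minimal user-space sketch (not the kernel code) of how the two registers returned by CPUID leaf 0xd combine into the 64-bit mask, together with a rough plausibility check. The helper names xsave_mask() and xsave_mask_plausible() are made up for this illustration. The check assumes that a valid XSAVE feature mask always has bit 0 (x87 state) set, and that crash printed the dumped value in hexadecimal.

```c
#include <stdint.h>

/*
 * Hypothetical helper: combine descs[0] (EAX, low 32 bits) and
 * descs[3] (EDX, high 32 bits) the same way the assignment does.
 */
static uint64_t
xsave_mask(const uint32_t descs[4])
{
	return (uint64_t)descs[3] << 32 | descs[0];
}

/*
 * Hypothetical check: 0 means xsave is not in use; any other value
 * is expected to have bit 0 (x87 state) set.
 */
static int
xsave_mask_plausible(uint64_t mask)
{
	return mask == 0 || (mask & 1) != 0;
}
```

With descs[0] = 0x7 and descs[3] = 0, xsave_mask() yields 7 (x87, SSE and AVX state), which the check accepts. The value from the core dump, 0x160b78a0 if read as hex, has bit 0 clear and is rejected, which fits the reading that it is not something CPUID produced.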
It would be easier to track this down with a reproducer and temporary
asserts, but I expect that we are restricted to reading the code.
A possible hand-made assert is to add a panic() like this:
	/* Get features and maximum size of the save area */
	x86_cpuid(0xd, descs);
	if (descs[2] > 512)
		x86_fpu_save_size = descs[2];
+	panic("Oops how did we get here!\n");
#ifdef XEN
	/* Don't use xsave, force fxsave with x86_xsave_features = 0. */
#else
	x86_xsave_features = (uint64_t)descs[3] << 32 | descs[0];
#endif
Once it fires, we will need the stack trace.
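The panic() above only fires if that code in cpu_probe_fpu() is actually reached. If the variable is instead clobbered by a stray write from somewhere else, another kind of temporary assert might help: keep a shadow copy written at probe time and compare against it where the value is used. Below is a minimal user-space model of that idea. Every name in it is hypothetical, and none of it is existing kernel code.

```c
#include <stdint.h>

/* Stand-ins for the kernel variable and its shadow copy (both hypothetical). */
static uint64_t model_xsave_features;
static uint64_t model_xsave_shadow;

/* Record the probed value in both places, as the probe code would do once. */
static void
model_probe(uint64_t probed)
{
	model_xsave_features = probed;
	/* Store the complement so that zeroing both variables is also caught. */
	model_xsave_shadow = ~probed;
}

/* Returns 1 while the variable still matches its shadow, 0 once it differs. */
static int
model_check(void)
{
	return model_xsave_features == ~model_xsave_shadow;
}
```

In a kernel, the equivalent of model_check() would presumably become a KASSERT() or panic() at the point of use, so that the backtrace shows when the mismatch is first noticed. That does not name the writer, but it narrows down when the overwrite happens.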
>>
>> > crash> bt
>> > _KERNEL_OPT_NARCNET() at 0
>> > _KERNEL_OPT_ACPI_SCANPCI() at _KERNEL_OPT_ACPI_SCANPCI+0x7
>> > vpanic() at vpanic+0x149
>> > snprintf() at snprintf
>> > startlwp() at startlwp
>> > calltrap() at calltrap+0x11
>> > fpudna() at fpudna+0x61
>> > crash>