Port-macppc archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index][Old Index]
Re: TSTILE lockups on disk dump/backups
> On Dec 21, 2024, at 9:21 AM, Martin Husemann <martin%duskware.de@localhost> wrote:
>
> On Sat, Dec 21, 2024 at 02:20:23AM -0600, Don Lee wrote:
>> Any thought about how I should deal with this? My kernel debugging
>> skills are weak. Is there anything I can do to produce useful debug
>> information next time this happens?
>
> A tstile just means the process/kernel thread is blocked for a longer
> time waiting for "something". In this context that would typically be
> either a disk buffer ("buf") read from disk (the source you are backing
> up) or a network buffer ("mbuf") handed to the network device output
> queue.
>
> This would point to a bug in the disk driver or the network driver.
Wherever it is, I didn’t see it on NetBSD 8.2. That may mean that it’s a new bug, or it might be that something in the timing of 8.2 just didn’t trigger it.
It’s a macppc system with 1 GB of memory. I wonder if NetBSD 9.4 needs more memory...
> The other option would be your system runing out of kernel address space,
> so it can not allocate either type of buffer. This would show up in
> the output of vmstat -s (or you are running against limits for either
> type of buffer, but I would expect that to fail differently).
> Can you start new processes in the state when it hangs? Like the vmstat -s
> mentioned above, or "crash" to look at kernel state? I am not sure "crash"
> would work on macppc and 9.x good enough.
> Martin
I should be able to do a vmstat -s next time it hangs. It seems that most processes run OK while the system's stuck, as long as I don’t touch the the filesystem(s) the wrong way. If I don’t notice the hang, the dump processes “pile up” as they get launched by cron. Eventually, I can’t log in, but early in the process, I can log in and do stuff.
Many thanks,
-dgl-
Home |
Main Index |
Thread Index |
Old Index