I am trying to determine the cause of a stopped process in Linux. This is a telecommunications application running under a rather heavy load. There is a separate process for each of the 8 T1 intervals. Each so often one of the processes will be very immune - perhaps 50 seconds before the event is noted in the log with a very busy process.
Most likely, some system resource will be short. The obvious thing - CPU usage - looks fine.
Which linux utilities are best suited for finding and analyzing these kinds of things and as unobtrusive as possible, since this is a highly loaded system? It would seem that this requires processes, not systems. Maybe constant monitoring of / proc / pid / XX? Top would not seem too useful here.
John h
source
share