mirror of
https://github.com/fish-shell/fish-shell
synced 2025-01-16 23:14:04 +00:00
Switch to wait_by_process when waitpid
without WNOHANG returns nothing
By exclusively waiting by pgrp, we can fail to reap processes that change their own pgrp then either crash or close their fds. If we wind up in a situation where `waitpid(2)` returns 0 or ECHLD even though we did not specify `WNOHANG` but we still have unreaped child processes, wait on them by pid. Closes #5596.
This commit is contained in:
parent
93c0d3f4a5
commit
4dfaa33d95
1 changed files with 13 additions and 5 deletions
18
src/proc.cpp
18
src/proc.cpp
|
@ -511,6 +511,7 @@ static bool process_mark_finished_children(bool block_on_fg) {
|
|||
|
||||
if (pid > 0) {
|
||||
// A child process has been reaped
|
||||
debug(4, "Reaped PID %d", pid);
|
||||
handle_child_status(pid, status);
|
||||
|
||||
// Always set WNOHANG (that is, don't hang). Otherwise we might wait on a non-stopped job
|
||||
|
@ -520,15 +521,22 @@ static bool process_mark_finished_children(bool block_on_fg) {
|
|||
} else if (pid == 0 || errno == ECHILD) {
|
||||
// No killed/dead children in this particular process group
|
||||
if (!wait_by_process) {
|
||||
if ((options & WNOHANG) == 0) {
|
||||
// This normally implies that the job has completed, but if we try to wait
|
||||
// on a job that includes a process that changed its own group before we
|
||||
// enter `waitpid`, we will be waiting forever. See #5596 for such a case.
|
||||
wait_by_process = true;
|
||||
continue;
|
||||
}
|
||||
break;
|
||||
}
|
||||
} else {
|
||||
// pid < 0 indicates an error. One likely failure is ECHILD (no children), which is
|
||||
// not an error and is ignored. The other likely failure is EINTR, which means we
|
||||
// got a signal, which is considered an error. We absolutely do not break or return
|
||||
// on error, as we need to iterate over all constructed jobs but we only call
|
||||
// waitpid for one pgrp at a time. We do bypass future waits in case of error,
|
||||
// however.
|
||||
// not an error and is ignored in the branch above. The other likely failure is
|
||||
// EINTR, which means we got a signal, which is considered an error. We absolutely
|
||||
// do not break or return on error, as we need to iterate over all constructed jobs
|
||||
// but we only call waitpid for one pgrp at a time. We do bypass future waits in
|
||||
// case of error, however.
|
||||
has_error = true;
|
||||
|
||||
// Do not audibly complain on interrupt (see #5293)
|
||||
|
|
Loading…
Reference in a new issue