process-lib

Installation
Usage
Documentation
Error Handling
Gotcha's
Design

Installation

FetchContent

The easiest way to use process-lib is with the CMake build system. If you're using CMake 3.11 or later you can use the FetchContent API to use process-lib in your project.

include(FetchContent)

FetchContent_Declare(
  PROCESS_LIB
  GIT_REPOSITORY https://github.com/DaanDeMeyer/process-lib.git
  GIT_TAG        origin/master
)

FetchContent_GetProperties(PROCESS_LIB)
if(NOT PROCESS_LIB_POPULATED)
  FetchContent_Populate(PROCESS_LIB)
  add_subdirectory(${PROCESS_LIB_SOURCE_DIR} ${PROCESS_LIB_BINARY_DIR})
endif()

target_link_libraries(executable process)

Git Submodule

If you can't use CMake 3.11 or higher, you can add process-lib as a git submodule instead:

mkdir third-party
cd third-party
git submodule add https://github.com/DaanDeMeyer/process-lib.git
# Optionally checkout a specific commit
cd process-lib
git checkout 41f90d0

Commit the result and you can call add_subdirectory with the process-lib directory:

add_subdirectory(third-party/process-lib)

target_link_libraries(executable process)

Vendor

If you're not using git you can download a zip/tar of the source code from Github and manually put the code in a third-party directory. To update you overwrite the process-lib directory with the contents of an updated zip/tar from Github.

After unzipping the source code to third-party/process-lib you can then add the directory with add_subdirectory in CMake.

add_subdirectory(third-party/process-lib)

target_link_libraries(executable process)

CMake Options

process-lib supports the following CMake options:

PROCESS_LIB_BUILD_CPP_WRAPPER (ON|OFF): Build the C++ wrapper (default: OFF)
PROCESS_LIB_BUILD_TESTS (ON|OFF): Build tests (default: OFF)
PROCESS_LIB_BUILD_EXAMPLES (ON|OFF): Build examples (default: OFF)

Options can be configured before calling add_subdirectory as follows:

set(PROCESS_LIB_BUILD_CPP_WRAPPER ON CACHE BOOL FORCE)

Usage

See examples/cmake-help.c for an example that uses process-lib to print the cmake CLI --help output. examples/cmake-help.cpp does the same but with the C++ API.

Documentation

API documentation can be found in process.h. Documentation for the C++ wrapper can be found in process.hpp (which mostly refers to process.h).

Error Handling

There are lots of things that can go wrong when working with child processes. process-lib tries to unify the different platform errors as much as possible but this is an ongoing effort. In particular, the Windows Win32 documentation mostly does not specify what errors a function can throw. As a result, when an error occurs on Windows process-lib will usually return PROCESS_LIB_UNKNOWN_ERROR. However, process-lib also provides the function process_system_error which gives the user the actual system error. Use this function to retrieve the actual system error and file an issue with the system error and the process-lib function that returned it. This way we can incrementally find all remaining unknown errors and add them to process-lib.

Gotcha's

While process_terminate allows the child process to perform cleanup it is up to the child process to correctly clean up after itself. process-lib only sends a termination signal to the child process itself. The child process is responsible for cleaning up its own child processes and other resources in its signal handler.
When using process_kill the child process does not receive a chance to perform cleanup which could result in resources being leaked. Chief among these leaks is that the child process will not be able to stop its own child processes. Always let a child process exit normally or try to stop it with process_terminate before switching to process_kill.
(POSIX) On POSIX a parent process is required to wait on a child process (using process_wait) after it has exited before all resources related to that process can be freed by the kernel. If the parent doesn't wait on a child process after it exits, the child process becomes a zombie process.
(Windows) process_kill is not guaranteed to kill a child process on Windows. For more information, read the Remarks section in the documentation of the TerminateProcess function that process-lib uses to kill child processes on Windows.
(Windows) Immediately terminating a process after starting it on Windows might result in an error window with error code 0xc0000142 popping up. The error code indicates that the process was terminated before it was fully initialized. This problem shouldn't pop up with normal use of the library since most of the time you'll want to read/write to the process or wait until it exits normally.

If someone runs into this problem, I mitigated it in process-lib's tests by waiting a few milliseconds using process_wait before terminating the child process.
File descriptors/handles created by process-lib can leak to child processes not spawned by process-lib if the application is multithreaded. This is not the case on systems that support the pipe2 system call (Linux 2.6+ and newer BSD's). See Avoiding resource leaks for more information.
(POSIX) On POSIX, file descriptors above the file descriptor resource limit (obtained with sysconf(_SC_OPEN_MAX)) and without the FD_CLOEXEC flag set are leaked into child processes created by process-lib.

Note that in multithreaded applications immediately setting the FD_CLOEXEC with fcntl after creating a file descriptor can still not be sufficient to avoid leaks. See Avoiding resource leaks for more information.
(Windows < Vista) File descriptors that are not marked not inheritable with SetHandleInformation will leak into process-lib child processes.

Note that the same FD_CLOEXEC caveat as mentioned above applies. In multithreaded applications there is a split moment after calling CreatePipe but before calling SetHandleInformation that a handle can still be inherited by process-lib child processes. See Avoiding resource leaks for more information.

Design

process-lib is designed to be a minimal wrapper around the platform-specific API's for starting a process, interacting with its standard streams and finally terminating it.

Opaque pointer

process-lib uses a process struct to store information between calls to library functions. This struct is forward-declared in process.h with each platform-specific implementation hidden in the source files for that platform.

This struct is typedefed to process_type and exposed to the user as an opaque pointer (process_type *).

typedef struct process process_type;

The process.h header only contains a forward-declaration of the process struct. We provide an implementation in the source files of each supported platform where we store platform-specific members such as pipe handles, process id's, ....

To enable the user to allocate memory for the opaque pointer we provide the process_size function that returns the required size for the opaque pointer on each platform. This function can be used as follows:

process_type *process = malloc(process_size());

Advantages:

Fewer includes required in process.h

Because we only have a forward declaration of the process struct in process.h we don't need any platform-specific includes (such as windows.h) in the header file to define the data types of all the members that the struct contains.
No leaking of implementation details

Including the process.h header only gives the user access to the forward declaration of the process struct and not its implementation which means its impossible to access the internals of the struct outside of the library. This allows us to change its implementation between versions without having to worry about breaking user code.

Disadvantages:

No simple allocation on the stack

Because including process.h does not give the compiler access to the full definition of the process struct it is unable to allocate its implementation on the stack since it doesn't know its size. Allocating on the stack is still possible but requires functions such as alloca which is harder compared to just writing
```
process_type process;
```
Not possible to allocate on the heap without help from the library

Because the compiler doesn't know the size of the process struct the user can't easily allocate the process struct on the heap because it can't figure out its size with sizeof.

We already mentioned that we solve this problem by providing the process_size function that returns the size of the process struct.

Memory allocation

process-lib aims to do as few dynamic memory allocations as possible in its own code (not counting allocations that happen in system calls). As of this moment, dynamic memory allocation is only done on Windows:

When converting the array of program arguments to a single string as required by the CreateProcess function.
When converting UTF-8 strings to UTF-16 (Windows Unicode functions take UTF-16 strings as arguments).

I have not found a way to avoid allocating memory while keeping a uniform cross-platform API for both POSIX and Windows. (Windows CreateProcessW requires a single UTF-16 string of arguments delimited by spaces while POSIX execvp requires an array of UTF-8 string arguments). Since process-lib's API takes child process arguments as an array of UTF-8 strings we have to allocate memory to convert the array into a single UTF-16 string as required by Windows.

process-lib uses the standard malloc and free functions to allocate and free memory. However, providing support for custom allocators should be straightforward. If you need them, please open an issue.

(POSIX) Waiting on child process with timeout

I did not find a counterpart for the Windows WaitForSingleObject function which can be used to wait until a process exits or the provided timeout expires. POSIX has a similar function called waitpid but this function does not support specifying a timeout value.

To support waiting with a timeout value on POSIX, each process is put in its own process group with the same id as the process id with a call to setpgid after forking the process. When calling the process_wait function, a timeout process is forked which we put in the same process group as the process we want to wait for with the same setpgid function and puts itself to sleep for the requested amount of time (timeout value) before exiting. We then call the waitpid function in the main process but instead of passing the process id of the process we want to wait for we pass the negative value of the process id -process->pid. Passing a negative value for the process id to waitpid instructs it to wait for all processes in the process group of the absolute value of the passed negative value. In our case it will wait for both the timeout process we started and the process we actually want to wait for. If waitpid returns the process id of the timeout process we know the timeout value has been exceeded. If waitpid returns the process id of the process we want to wait for we know it has exited before the timeout process and that the timeout value has not been exceeded.

This solution was inspired by this Stack Overflow answer.

(POSIX) Check if execve call was succesful

process-lib uses a fork-exec model to start new child processes on POSIX systems. A problem that occured is that process-lib needs to differentiate between errors that happened before the exec call (which are errors from process-lib) and errors after the exec call (which are errors from the child process itself). To do this we create an extra pipe in the parent procces with the FD_CLOEXEC flag set and write any errors before and from exec to that pipe. If we then read from the error pipe after forking the read call will either read 0 which means exec was called and the write endpoint was closed (because of the FD_CLOEXEC flag) or it reads a single integer (errno) which indicates an error occured before or during exec.

This solution was inspired by this Stack Overflow answer.

Avoiding resource leaks

On POSIX systems, by default file descriptors are inherited by child processes when calling execve. To prevent unintended leaking of file descriptors to child processes, POSIX provides a function fcntl which can be used to set the FD_CLOEXEC flag on file descriptors which instructs the underlying system to close them when execve (or one of its variants) is called. However, using fcntl introduces a race condition since any process created in another thread after a file descriptor is created (for example using pipe) but before fcntl is called to set FD_CLOEXEC on the file descriptor will still inherit that file descriptor.

To get around this race condition process-lib uses the pipe2 function (when it is available) which takes the O_CLOEXEC flag as an argument. This ensures the file descriptors of the created pipe are closed when execve is called. Similar system calls that take the O_CLOEXEC flag exist for other system calls that create file descriptors. If pipe2 is not available (for example on Darwin) process-lib falls back to calling fcntl to set FD_CLOEXEC immediately after creating a pipe.

Darwin does not support the FD_CLOEXEC flag on any of its system calls but instead provides an extra flag for the posix_spawn API (a wrapper around fork/exec) named POSIX_SPAWN_CLOEXEC_DEFAULT that instructs posix_spawn to close all open file descriptors in the child process created by posix_spawn. However, posix_spawn doesn't support changing the working directory of the child process. A solution to get around this was implemented in process-lib but it was deemed too complex and brittle so it was removed.

While using pipe2 prevents file descriptors created by process-lib from leaking into other child processes, file descriptors created outside of process-lib without the FD_CLOEXEC flag set will still leak into process-lib child processes. To mostly get around this after forking and redirecting the standard streams (stdin, stdout, stderr) of the child process we close all file descriptors (except the standard streams) up to _SC_OPEN_MAX (obtained with sysconf) in the child process. _SC_OPEN_MAX describes the maximum number of files that a process can have open at any time. As a result, trying to close every file descriptor up to this number closes all file descriptors of the child process which includes file descriptors that were leaked into the child process. However, an application can manually lower the resource limit at any time (for example with setrlimit(RLIMIT_NOFILE)), which can lead to open file descriptors with a value above the new resource limit if they were created before the resource limit was lowered. These file descriptors will not be closed in the child process since only the file descriptors up to the latest resource limit are closed. Of course, this only happens if the application manually lowers the resource limit.

On Windows the same race condition occurs. The CreatePipe function receives a flag as part of its arguments that specifies if the returned handles can be inherited by child processes or not. The CreateProcess function also takes a flag indicating whether it should inherit handles or not. Inheritance for endpoints of a single pipe can be configured after the CreatePipe call using the function SetHandleInformation. The race condition occurs after calling CreatePipe (allowing inheritance) but before calling SetHandleInformation in one thread and calling CreateProcess (configured to inherit pipes) in another thread. This would unintentionally leak handles into a child process. We try to mitigate this in two ways:

We call SetHandleInformation after CreatePipe for the handles that should not be inherited by any process to lower the chance of them accidentally being inherited (just like with fnctl if µipe2 is not available). This only works for half of the endpoints created (the ones intended to be used by the parent process) since the endpoints intended to be used by the child actually need to be inherited by their corresponding child process.
Windows Vista added the STARTUPINFOEXW structure in which we can put a list of handles that should be inherited. Only these handles are inherited by the child process. This again (just like Darwin posix_spawn) only stops process-lib's processes from inheriting unintended handles. Other code in an application that calls CreateProcess without passing a STARTUPINFOEXW struct containing the handles it should inherit can still unintentionally inherit handles meant for a process-lib child process. process-lib uses the STARTUPINFOEXW struct if it is available.

gitter-badger / process-lib Goto Github PK

process-lib's Introduction