Schedule an R function or formula to run after a specified period of
time. Similar to JavaScript’s setTimeout
function. Like
JavaScript, R is single-threaded so there’s no guarantee that the
operation will run exactly at the requested time, only that at least
that much time will elapse.
To avoid bugs due to reentrancy, by default, scheduled operations
only run when there is no other R code present on the execution stack;
i.e., when R is sitting at the top-level prompt. You can force past-due
operations to run at a time of your choosing by calling
later::run_now()
.
The mechanism used by this package is inspired by Simon Urbanek’s background package and similar code in Rhttpd.
::install_github("r-lib/later") remotes
Pass a function (in this case, delayed by 5 seconds):
::later(function() {
laterprint("Got here!")
5) },
Or a formula (in this case, run as soon as control returns to the top-level):
::later(~print("Got here!")) later
It is also possible to have a function run based on when file descriptors are ready for reading or writing, at some indeterminate time in the future.
Below, a logical vector is printed indicating which of file descriptors 21 or 22 were ready, subject to a timeout of 1s. Instead of just printing the result, the supplied function can also do something more useful such as reading from the descriptor.
::later_fd(print, c(21L, 22L), timeout = 1) later
This is useful in particular for asynchronous or streaming data
transfer over the network / internet, so that reads can be made from TCP
sockets as soon as data is available. later::later_fd()
pairs well with functions such as curl::multi_fdset()
that
return the relevant file descriptors to be monitored .
You can also call later::later
from C++ code in your own
packages, to cause your own C-style functions to be called back. This is
safe to call from either the main R thread or a different thread; in
both cases, your callback will be invoked from the main R thread.
later::later
is accessible from later_api.h
and its prototype looks like this:
void later(void (*func)(void*), void* data, double secs)
The first argument is a pointer to a function that takes one
void*
argument and returns void. The second argument is a
void*
that will be passed to the function when it’s called
back. And the third argument is the number of seconds to wait (at a
minimum) before invoking.
later::later_fd
is also accessible from
later_api.h
and its prototype looks like this:
void later_fd(void (*func)(int *, void *), void *data, int num_fds, struct pollfd *fds, double secs)
The first argument is a pointer to a function that takes two
arguments: the first being an int*
array provided by
later_fd()
when called back, and the second being a
void*
. The int*
array will be the length of
num_fds
and contain the values 0
,
1
or NA_INTEGER
to indicate the readiness of
each file descriptor, or an error condition respectively. The second
argument data
is passed to the void*
argument
of the function when it’s called back. The third is the total number of
file descriptors being passed, the fourth a pointer to an array of
stuct pollfds
, and the fifth the number of seconds to wait
until timing out.
To use the C++ interface, you’ll need to add later
to
your DESCRIPTION
file under both LinkingTo
and
Imports
, and also make sure that your
NAMESPACE
file has an import(later)
entry.
Finally, this package also offers a higher-level C++ helper class to
make it easier to execute tasks on a background thread. It is also
available from later_api.h
and its public/protected
interface looks like this:
class BackgroundTask {
public:
();
BackgroundTaskvirtual ~BackgroundTask();
// Start executing the task
void begin();
protected:
// The task to be executed on the background thread.
// Neither the R runtime nor any R data structures may be
// touched from the background thread; any values that need
// to be passed into or out of the Execute method must be
// included as fields on the Task subclass object.
virtual void execute() = 0;
// A short task that runs on the main R thread after the
// background task has completed. It's safe to access the
// R runtime and R data structures from here.
virtual void complete() = 0;
}
Create your own subclass, implementing a custom constructor plus the
execute
and complete
methods.
It’s critical that the code in your execute
method not
mutate any R data structures, call any R code, or cause any R
allocations, as it will execute in a background thread where such
operations are unsafe. You can, however, perform such operations in the
constructor (assuming you perform construction only from the main R
thread) and complete
method. Pass values between the
constructor and methods using fields.
#include <Rcpp.h>
#include <later_api.h>
class MyTask : public later::BackgroundTask {
public:
MyTask(Rcpp::NumericVector vec) :
inputVals(Rcpp::as<std::vector<double> >(vec)) {
}
protected:
void execute() {
double sum = 0;
for (std::vector<double>::const_iterator it = inputVals.begin();
it != inputVals.end();
it++) {
sum += *it;
}
result = sum / inputVals.size();
}
void complete() {
Rprintf("Result is %f\n", result);
}
private:
std::vector<double> inputVals;
double result;
};
To run the task, new
up your subclass and call
begin()
, e.g. (new MyTask(vec))->begin()
.
There’s no need to keep track of the pointer; the task object will
delete itself when the task is complete.
// [[Rcpp::export]]
asyncMean(Rcpp::NumericVector data) {
void MyTask(data))->begin();
(new }
It’s not very useful to execute tasks on background threads if you can’t get access to the results back in R. The promises package complements later by providing a “promise” abstraction.