DESIGN DISCUSSION: How to execute a DAG of IO and compute tasks? 2 Tokio threadpools? Or Tokio + Rayon?

Andrew Lamb at InfluxData wrote [a blog post](https://thenewstack.io/using-rustlangs-async-tokio-runtime-for-cpu-bound-tasks/) (in 2022) making compelling arguments for scheduling CPU-bound tasks using Tokio. The essential "trick" is to use two Tokio threadpools: one for IO, and another for CPU-bound tasks (so that CPU-bound tasks don't block IO tasks).

For `hypergrib`, it might be nice to be able to use Rust's `async` API to naturally express a directed acyclic graph (DAG) of tasks. For example:

```mermaid
graph TD
    L1[Load GRIB message 1] --> D1[Decode msg 1] --> M[Merge into final array]
    L2[Load GRIB message 2] --> D2[Decode msg 2] --> M[Merge into final array]
```

Andrew Lamb's blog post suggests using two Tokio threadpools. Andrew's implementation involves ~750 lines of [custom Rust code](https://github.com/influxdata/influxdb3_core/blob/main/executor/src/lib.rs) (including tests).

If we really wanted to avoid using Rayon (and use two Tokio threadpools) then I think we could do it by "just" creating two Tokio threadpools. Something like:

```rust
use tokio::runtime::Runtime;

// Create the runtime
let cpu_runtime  = Runtime::new().unwrap();

// Execute the future, blocking the current thread until completion
cpu_handle = cpu_runtime.spawn(cpu_main);

let io_runtime = Runtime::new().unwrap();
io_handle = io_runtime.spawn(io_main);

cpu_handle.await??;
io_handle.await??;
```

(Although I'm really not sure if that'll work! And I'm not sure how to pass `Futures` between the two runtimes?)

On ballance, I think I prefer [Alice Ryhl's recommendation](https://ryhl.io/blog/async-what-is-blocking/) of using Tokio with Rayon, and using a `tokio::sync::oneshot::channel` to pass things between Tokio and Rayon. I'm 99% sure this'll still allow us to construct a DAG of tasks. And feels like it'll result in less code in `hypergrib`. And, crucially, we may have tasks that run a long time (seconds?), but Andrew Lamb suggests that, even when using two tokio threadpools, tasks in the CPU threadpool still shouldn't block for more than something like 100ms. But it does add a pretty heavyweight dependency (Rayon).

## Further reading

- [Andrew Lamb](https://thenewstack.io/author/andrew-lamb/): [Using Rustlang’s Async Tokio Runtime for CPU-Bound Tasks](https://thenewstack.io/using-rustlangs-async-tokio-runtime-for-cpu-bound-tasks/)
    - The latest IOx code: https://github.com/influxdata/influxdb3_core/blob/main/executor/src/lib.rs
- Alice Ryhl: [What is blocking?](https://ryhl.io/blog/async-what-is-blocking/)
- Tokio docs: [CPU-bound tasks and blocking code](https://docs.rs/tokio/latest/tokio/#cpu-bound-tasks-and-blocking-code)
- Stack Overflow: [How to create a dedicated threadpool for CPU-intensive work in Tokio?](https://stackoverflow.com/questions/61752896/how-to-create-a-dedicated-threadpool-for-cpu-intensive-work-in-tokio)
- Reddit: [Tokio for CPU intensive work](https://www.reddit.com/r/rust/comments/xk0yph/tokio_for_cpu_intensive_work/)
- [Status quo stories: Niklaus Builds a Hydrodynamics Simulator](https://rust-lang.github.io/wg-async/vision/submitted_stories/status_quo/niklaus_simulates_hydrodynamics.html#-status-quo-stories-niklaus-builds-a-hydrodynamics-simulator) (Which actually suggests that Rust's `async` API isn't a great choice for CPU-intensive tasks)
- Stack Overflow: [How can I create a Tokio runtime inside another Tokio runtime without getting the error "Cannot start a runtime from within a runtime"?](https://stackoverflow.com/questions/62536566/how-can-i-create-a-tokio-runtime-inside-another-tokio-runtime-without-getting-th)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DESIGN DISCUSSION: How to execute a DAG of IO and compute tasks? 2 Tokio threadpools? Or Tokio + Rayon? #10

Further reading

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

DESIGN DISCUSSION: How to execute a DAG of IO and compute tasks? 2 Tokio threadpools? Or Tokio + Rayon? #10

Description

Further reading

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions