bevy/crates/bevy_pbr at dfdc9f8369a0e7dd7e6f78a950f687a5c3822c39 - Mirrors/bevy

mirror of https://github.com/bevyengine/bevy synced 2025-01-04 17:28:56 +00:00

History

James Liu a1a81e5721 Parallelize extract_meshes (#9966)	2023-10-01 09:44:03 +00:00
src	Parallelize extract_meshes (#9966)	2023-10-01 09:44:03 +00:00
Cargo.toml	Parallelize extract_meshes (#9966)	2023-10-01 09:44:03 +00:00

# Objective
`extract_meshes` can easily be one of the most expensive operations in
the blocking extract schedule for 3D apps. It also has no fundamentally
serialized parts and can easily be run across multiple threads. Let's
speed it up by parallelizing it!

## Solution
Use the `ThreadLocal<Cell<Vec<T>>>` approach from #7348 together with
`Query::par_iter` to build a set of thread-local queues, then collect
them after going wide.
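
As a rough illustration of the thread-local queue idea, here is a minimal std-only sketch. It is not the code from this PR: it swaps in `std::thread::scope` and fixed-size chunks for `Query::par_iter`, and a per-worker `Cell<Vec<T>>` for the `ThreadLocal` container. `Extracted`, `extract_parallel`, and the `(id, visible)` input shape are hypothetical names chosen for the example.

```rust
use std::cell::Cell;
use std::thread;

/// Hypothetical stand-in for a per-entity extracted record.
#[derive(Debug, Clone, PartialEq)]
pub struct Extracted {
    pub id: u32,
    pub scale: f32,
}

/// Extracts the visible items in parallel (illustrative only).
/// Each worker fills its own queue without locking, and the queues
/// are merged once every worker has finished.
pub fn extract_parallel(items: &[(u32, bool)], workers: usize) -> Vec<Extracted> {
    let workers = workers.max(1);
    // Ceiling division so every item lands in exactly one chunk.
    let chunk_len = ((items.len() + workers - 1) / workers).max(1);

    // "Go wide": one scoped thread per chunk.
    let queues: Vec<Vec<Extracted>> = thread::scope(|scope| {
        let handles: Vec<_> = items
            .chunks(chunk_len)
            .map(|chunk| {
                scope.spawn(move || {
                    // Per-worker queue. The Cell mirrors the take/push/set
                    // pattern used with `Cell<Vec<T>>`: take the Vec out,
                    // mutate it, then put it back.
                    let queue: Cell<Vec<Extracted>> = Cell::new(Vec::new());
                    for &(id, visible) in chunk {
                        if !visible {
                            continue;
                        }
                        let mut local = queue.take();
                        local.push(Extracted { id, scale: 1.0 });
                        queue.set(local);
                    }
                    queue.into_inner()
                })
            })
            .collect();

        handles
            .into_iter()
            .map(|handle| handle.join().expect("worker thread panicked"))
            .collect()
    });

    // Collect the per-worker queues into a single Vec on the calling thread.
    let total: usize = queues.iter().map(Vec::len).sum();
    let mut merged = Vec::with_capacity(total);
    for mut queue in queues {
        merged.append(&mut queue);
    }
    merged
}
```

Calling `extract_parallel(&items, 4)` on a slice of `(id, visible)` pairs returns one merged `Vec` holding only the visible entries. The point of the design is that workers never contend on a shared collection; the only serial step is the final merge.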

## Performance
Measured using `cargo run --profile stress-test --features trace_tracy --example many_cubes`.
In the trace below, yellow is this PR and red is main.

`extract_meshes`:


![image](https://github.com/bevyengine/bevy/assets/3137680/9d45aa2e-3cfa-4fad-9c08-53498b51a73b)

On average, `extract_meshes` drops from 1.2ms to 770us, a 41.6% improvement.

Note: this does not yet include #9950's changes, so the speedup may be
even larger once that is merged.
