|
|
|
@ -27,19 +27,19 @@ Run with `-h` for more options.
|
|
|
|
|
* Doesn't dump huge amounts of trash onto each stack frame
|
|
|
|
|
|
|
|
|
|
## Performance
|
|
|
|
|
[https://github.com/sharkdp/hyperfine](hyperfine) reports a **700-800%** speedup over `v1`.
|
|
|
|
|
[hyperfine](https://github.com/sharkdp/hyperfine) reports a **700-800%** speedup over `v1`.
|
|
|
|
|
It's easy to see why.
|
|
|
|
|
|
|
|
|
|
### V1 flamegraph
|
|
|
|
|
V1 uses a pesudo-array adaptor to perform filesystem reads, seeks, and writes. This causes a massive syscall overhead.
|
|
|
|
|
data:image/s3,"s3://crabby-images/51be8/51be85ec2cdd548cba6c3baba2289061e5eda0f6" alt=""
|
|
|
|
|
data:image/s3,"s3://crabby-images/4799c/4799c2ca13324f263892475daaa689498e77e0df" alt=""
|
|
|
|
|
|
|
|
|
|
### V2 flamegraph
|
|
|
|
|
Whereas V2 uses a single `mmap()`.
|
|
|
|
|
data:image/s3,"s3://crabby-images/9848a/9848aab15140db2f4d9a94eaa171dfdb15d2397a" alt=""
|
|
|
|
|
data:image/s3,"s3://crabby-images/47796/4779675b883d0339b039135e5e197a68c34a09c2" alt=""
|
|
|
|
|
|
|
|
|
|
## Memory usage
|
|
|
|
|
The [https://www.systutorials.com/docs/linux/man/1-memusage/](memusage) graph for =v1= shows extremely inefficient stack usage.
|
|
|
|
|
The [memusage](https://www.systutorials.com/docs/linux/man/1-memusage/) graph for =v1= shows extremely inefficient stack usage.
|
|
|
|
|
data:image/s3,"s3://crabby-images/f6d3d/f6d3d6e12b024095ac34e4ae76a88ad716dee846" alt=""
|
|
|
|
|
( the green is supposed to be a line, not a bar )
|
|
|
|
|
This is due to how the unshuffler buffers RNG results.
|
|
|
|
|