Version 3 (modified by 5 years ago) ( diff ) | ,
---|
Latency tuning guidelines
Many of our tutorials and applications depend on reliabile, low latency processing.
This is a rough checklist of steps for creating an image (e.g. baseline_1804_lowlatency) that has minimized deterministic latency.
factors affecting this:
- maximum stable processor clock speed
- we cannot depend on boost states (intel turbo boost), as they can't be maintained under all load conditions, or for many cores.
- it is possible to fix a small number of cores to a high boost by disabling others, but this usually requires bios support
- number of cpus
- systems with multiple cpus can introduce latency with communication between them. Single cpu systems are much simpler to optimize
- NUMA issues: with multiple cpus, you must pay close attention to which memory is allocated to which cpu, as well as where pcie devices are attached.
- Choosing node to use:
- from above, pick a node with a single cpu, and the maximum clock available
- install a low latency kernel
- usage of tuned-adm
- tuned is a perfomance optimization project that wraps many configuration methods
Monitoring tools
- htop
Note:
See TracWiki
for help on using the wiki.