3 | | Many of our tutorials and applications depend on reliabile, low latency processing. This is a rough checklist of steps for creating an image (e.g. baseline_1804_lowlatency) that has minimized deterministic latency. |
| 3 | Many of our tutorials and applications depend on reliabile, low latency processing. |
| 4 | |
| 5 | This is a rough checklist of steps for creating an image (e.g. baseline_1804_lowlatency) that has minimized deterministic latency. |
| 6 | |
| 7 | factors affecting this: |
| 8 | * maximum stable processor clock speed |
| 9 | * we cannot depend on boost states (intel turbo boost), as they can't be maintained under all load conditions, or for many cores. |
| 10 | * it is possible to fix a small number of cores to a high boost by disabling others, but this usually requires bios support |
| 11 | * number of cpus |
| 12 | * systems with multiple cpus can introduce latency with communication between them. Single cpu systems are much simpler to optimize |
| 13 | * NUMA issues: with multiple cpus, you must pay close attention to which memory is allocated to which cpu, as well as where pcie devices are attached. |
| 14 | |
| 15 | |
| 16 | 1. Choosing node to use: |
| 17 | * from above, pick a node with a single cpu, and the maximum clock available |
| 18 | 1. install a low latency kernel |
| 19 | 1. usage of tuned-adm |
| 20 | * tuned is a perfomance optimization project that wraps many configuration methods |