100x Slower Code due to False Sharing

Explores potential performance degradation of an embarrassingly parallel problem due to false sharing by benchmarking various implementations of a basic kinematic problem. Examples available on Github: https://github.com/KeithJH/kinematics... Chapter Timestamps: 00:00 - What is False Sharing 02:02 - Baseline Implementation 02:33 - Local Atomic Points 03:39 - Global Atomic Points 05:40 - Better Atomic Points 06:49 - Aligned Points 08:08 - Aligned Point 08:26 - Thread Local 08:50 - Array of Arrays 09:14 - Summary Music: Untitled by ‪@keiferjh‬ Examples are similar to those explored in "4x Code Performance with SIMD" (   • 4x Code Performance with SIMD  ) and "Increasing code performance with LTO" (   • Increasing code performance with LTO  )