CPU

Performance Optimization, SIMD and Cache

Why Intel Added Cache Partitioning

http://danluu.com/intel-cat/

Simple ARM NEON optimized sin, cos, log and exp

http://gruntthepeon.free.fr/ssemath/neon_mathfun.html

Horizontal SSE Stable Sort Indice Generation

http://conorstokes.github.io/optimization/2015/09/19/horizontal-sse-stable-sort-indice-generation/

Memory, Cache, CPU optimization links

https://gist.github.com/ocornut/cb980ea183e848685a36

"Memory Optimization" by Christer Ericson, GDC 2003

http://realtimecollisiondetection.net/pubs/GDC03_Ericson_Memory_Optimization.ppt

GPU Pro Tip: Lerp Faster in C++

http://devblogs.nvidia.com/parallelforall/lerp-faster-cuda/

SIMD-friendly raytracing techniques

I’ve Relapsed Into Raytracing Again

http://www.joshbarczak.com/blog/?p=787

NVIDIA Tegra X1

NVIDIA Tegra X1 Preview & Architecture Analysis

http://www.anandtech.com/show/8811/nvidia-tegra-x1-preview

Live: NVIDIA’s CES Press Event

http://blogs.nvidia.com/blog/2015/01/04/live-nvidia-ces/

"Why modern CPUs are starving and what can be done about it" by Francesc Alted

http://www.blosc.org/docs/StarvingCPUs-CISE-2010.pdf

"Native code performance on modern CPUs: A changing landscape" @ Build 2014

http://view.officeapps.live.com/op/view.aspx?src=http%3a%2f%2fvideo.ch9.ms%2fsessions%2fbuild%2f2014%2f4-587.pptx

AMD’s Jaguar Microarchitecture

http://www.realworldtech.com/jaguar/

An Introduction to x86_64 Assembly Language

http://hackeradam17.com/2014/03/18/an-introduction-to-x86_64-assembly-language/