global memory | The Supercomputing Blog

Posts tagged ‘global memory’

CUDA – Tutorial 5 – Performance of atomics

Atomic operations are often essential for multithreaded programs, especially when different threads need to access or modify the same data. Conventional multicore CPUs generally use a test-and-set instruction to manage which thread controls which data. CUDA has a much more expansive set of atomic operations. With CUDA, you can effectively perform a test-and-set using the atomicInc() instruction. However, you can also use atomic operations to actually manipulate the data itself, without the need for a lock variable. Continue reading ‘CUDA – Tutorial 5 – Performance of atomics’ »

Posted by admin on December 4, 2009 at 8:38 pm under CUDA.
Tags: Atomic, Atomic Function, Atomic operation, CUDA, global memory, GPGPU, memory access, nVidia, Performance, shared memory, Tutorial
Comments Off on CUDA – Tutorial 5 – Performance of atomics.

The Supercomputing Blog

Posts tagged ‘global memory’

CUDA – Tutorial 5 – Performance of atomics

Pages

Categories

Recent Posts