Kernel | The Supercomputing Blog

Posts tagged ‘Kernel’

CUDA – Tutorial 2 – The Kernel

Welcome to the second tutorial in how to write high performance CUDA based applications. This tutorial will cover the basics of how to write a kernel, and how to organize threads, blocks, and grids. For this tutorial, we will complete the previous tutorial by writing a kernel function. The goal of this application is very simple. The idea is to take two arrays of floating point numbers, and perform an operation on them and store the result in a third floating point array. We will then study how fast the code executes on a CUDA device, and compare it to a traditional CPU. The data analysis will take place toward the end of the article. Continue reading ‘CUDA – Tutorial 2 – The Kernel’ »

Posted by admin on July 11, 2009 at 12:59 am under CUDA.
Tags: Basic, CUDA, Experiment, HPC, Kernel, Tutorial
Comments Off on CUDA – Tutorial 2 – The Kernel.

The Supercomputing Blog

Posts tagged ‘Kernel’

CUDA – Tutorial 2 – The Kernel

Pages

Categories

Recent Posts