NVIDIA CUDA SDK - Linear Algebra




FFT Ocean Simulation: This sample simulates an ocean heightfield using CUFFT and renders the result using OpenGL.


Separable Convolution: This sample implements a separable convolution filter of a 2D signal with a Gaussian kernel.
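As a rough illustration of what "separable" means here, the sketch below applies a 1D kernel along the rows and then along the columns, which gives the same result as a full 2D convolution with the outer product of the kernel with itself. It is a sequential CPU sketch in C++, not the sample's CUDA code. The function names (`convolve1D`, `separableConvolve`) and the zero-padding at the image border are assumptions made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Convolve a row-major image with a 1D kernel of odd length, either along
// rows (alongRows == true) or along columns. Samples outside the image are
// treated as zero; that border policy is an assumption of this sketch.
std::vector<float> convolve1D(const std::vector<float>& src, int width, int height,
                              const std::vector<float>& kernel, bool alongRows) {
    assert(kernel.size() % 2 == 1);
    assert(src.size() == static_cast<std::size_t>(width) * height);
    const int radius = static_cast<int>(kernel.size()) / 2;
    std::vector<float> dst(src.size(), 0.0f);
    for (int y = 0; y < height; ++y) {
        for (int x = 0; x < width; ++x) {
            float sum = 0.0f;
            for (int k = -radius; k <= radius; ++k) {
                // Source coordinate for kernel tap k (true convolution: x - k).
                const int sx = alongRows ? x - k : x;
                const int sy = alongRows ? y : y - k;
                if (sx >= 0 && sx < width && sy >= 0 && sy < height)
                    sum += src[sy * width + sx] * kernel[radius + k];
            }
            dst[y * width + x] = sum;
        }
    }
    return dst;
}

// Two 1D passes replace one 2D pass: roughly 2 * kernelLength multiplies
// per pixel instead of kernelLength * kernelLength.
std::vector<float> separableConvolve(const std::vector<float>& src, int width, int height,
                                     const std::vector<float>& kernel) {
    const std::vector<float> rowsDone = convolve1D(src, width, height, kernel, true);
    return convolve1D(rowsDone, width, height, kernel, false);
}
```

Convolving an image that contains a single 1 with the kernel {0.25, 0.5, 0.25} reproduces the 3x3 outer product of that kernel around the impulse, which is a quick way to check the two passes.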


Texture-based Separable Convolution: A texture-based implementation of a separable 2D convolution with a Gaussian kernel. Used for performance comparison against convolutionSeparable.


FFT-Based 2D Convolution: This sample demonstrates how 2D convolutions with very large kernel sizes can be implemented efficiently using FFTs.


Matrix Transpose: An efficient matrix transpose.


Scalar Product: This sample calculates scalar products of a given set of input vector pairs.


Fast Walsh Transform: Naturally (Hadamard) ordered Fast Walsh Transform for batched vectors of arbitrary eligible (power-of-two) lengths.
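For readers unfamiliar with the transform, the following is a minimal sequential sketch in C++ of an in-place, unnormalized Fast Walsh Transform in natural (Hadamard) order for a single vector. It is not the sample's CUDA kernel and does not handle batching. The function name `fwtHadamard` is hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// In-place, unnormalized Walsh-Hadamard transform in natural (Hadamard) order.
// The length must be a power of two, matching the "eligible" lengths above.
void fwtHadamard(std::vector<float>& data) {
    const std::size_t n = data.size();
    assert(n != 0 && (n & (n - 1)) == 0);
    // log2(n) butterfly stages; the stride doubles at every stage.
    for (std::size_t stride = 1; stride < n; stride *= 2) {
        for (std::size_t base = 0; base < n; base += 2 * stride) {
            for (std::size_t i = base; i < base + stride; ++i) {
                const float a = data[i];
                const float b = data[i + stride];
                data[i] = a + b;           // sum branch of the butterfly
                data[i + stride] = a - b;  // difference branch
            }
        }
    }
}
```

Because the unnormalized transform is its own inverse up to a factor of n, applying `fwtHadamard` twice should return the input scaled by the vector length.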


Eigenvalues: The computation of all or a subset of all eigenvalues is an important problem in linear algebra, statistics, physics, and many other fields. This sample demonstrates a parallel implementation of a bisection algorithm for the computation of all eigenvalues of a symmetric tridiagonal matrix of arbitrary size with CUDA.
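The bisection idea can be sketched on the CPU as follows. This is a sequential C++ illustration under stated assumptions, not the sample's parallel CUDA implementation: it counts eigenvalues below a shift with a Sturm sequence, brackets the spectrum with Gershgorin bounds, and bisects once per eigenvalue. The function names, the `tol` parameter, the iteration cap, and the small-pivot guard `kTiny` are all choices made for this example.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Number of eigenvalues below x for the symmetric tridiagonal matrix with
// diagonal d (n entries) and off-diagonal e (n - 1 entries), counted as the
// number of negative pivots in the Sturm sequence.
int countEigenvaluesBelow(const std::vector<double>& d, const std::vector<double>& e,
                          double x) {
    const double kTiny = 1e-300;  // guard against division by zero (assumption)
    int count = 0;
    double q = 1.0;
    for (std::size_t i = 0; i < d.size(); ++i) {
        const double off = (i == 0) ? 0.0 : e[i - 1];
        q = d[i] - x - off * off / q;
        if (std::fabs(q) < kTiny) q = -kTiny;
        if (q < 0.0) ++count;
    }
    return count;
}

// All eigenvalues in ascending order, each located by bisection.
std::vector<double> eigenvaluesByBisection(const std::vector<double>& d,
                                           const std::vector<double>& e,
                                           double tol = 1e-12) {
    const std::size_t n = d.size();
    assert(n > 0 && e.size() + 1 == n);
    // Gershgorin circles give an interval that contains every eigenvalue.
    double lower = d[0], upper = d[0];
    for (std::size_t i = 0; i < n; ++i) {
        const double r = (i > 0 ? std::fabs(e[i - 1]) : 0.0) +
                         (i + 1 < n ? std::fabs(e[i]) : 0.0);
        lower = std::min(lower, d[i] - r);
        upper = std::max(upper, d[i] + r);
    }
    std::vector<double> eig(n);
    for (std::size_t k = 0; k < n; ++k) {
        double lo = lower, hi = upper;
        for (int iter = 0; iter < 200 && hi - lo > tol; ++iter) {
            const double mid = 0.5 * (lo + hi);
            // More than k eigenvalues below mid means eigenvalue k is below mid.
            if (countEigenvaluesBelow(d, e, mid) > static_cast<int>(k)) hi = mid;
            else lo = mid;
        }
        eig[k] = 0.5 * (lo + hi);
    }
    return eig;
}
```

Each eigenvalue's interval is refined independently of the others, which is the property that makes bisection a natural fit for a parallel implementation.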


Matrix Multiplication (Driver Version): This sample implements matrix multiplication using the CUDA driver API. It has been written for clarity of exposition to illustrate various CUDA programming principles, not with the goal of providing the most performant generic kernel for matrix multiplication. CUBLAS provides high-performance matrix multiplication.


Simple CUBLAS: An example of using CUBLAS.


Matrix Multiplication: This sample implements matrix multiplication exactly as presented in Chapter 6 of the programming guide. It has been written for clarity of exposition to illustrate various CUDA programming principles, not with the goal of providing the most performant generic kernel for matrix multiplication. CUBLAS provides high-performance matrix multiplication.
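To give a feel for the blocking idea often used in GPU matrix multiplication, here is a sequential C++ sketch that computes C = A * B tile by tile. It is a CPU analogue written for this page, not the sample's kernel or the programming guide's code. The function name `matMulTiled`, the row-major layout, and the default tile size of 16 are assumptions.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// C (M x N) = A (M x K) * B (K x N), all row-major. The loops walk over
// square tiles so that each tile of A and B is reused while it is "hot",
// loosely mirroring how a GPU thread block could stage tiles in fast memory.
std::vector<float> matMulTiled(const std::vector<float>& A, const std::vector<float>& B,
                               int M, int K, int N, int tile = 16) {
    assert(tile > 0);
    assert(A.size() == static_cast<std::size_t>(M) * K);
    assert(B.size() == static_cast<std::size_t>(K) * N);
    std::vector<float> C(static_cast<std::size_t>(M) * N, 0.0f);
    for (int i0 = 0; i0 < M; i0 += tile) {
        for (int j0 = 0; j0 < N; j0 += tile) {
            for (int k0 = 0; k0 < K; k0 += tile) {
                const int iEnd = std::min(i0 + tile, M);
                const int jEnd = std::min(j0 + tile, N);
                const int kEnd = std::min(k0 + tile, K);
                // Accumulate this tile's partial products into C.
                for (int i = i0; i < iEnd; ++i) {
                    for (int j = j0; j < jEnd; ++j) {
                        float sum = C[i * N + j];
                        for (int k = k0; k < kEnd; ++k)
                            sum += A[i * K + k] * B[k * N + j];
                        C[i * N + j] = sum;
                    }
                }
            }
        }
    }
    return C;
}
```

The tile size only changes the order of the work, not the result (up to floating-point rounding), so a small case can be checked against a hand computation with any tile size.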

Last Update: 06/15/2009