Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
ConfigForLinux		ConfigForLinux
CudaBlas		CudaBlas
CudaDNN		CudaDNN
CudaFFT		CudaFFT
CudaRand		CudaRand
CudaSolve		CudaSolve
CudaSparse		CudaSparse
ManagedCUDA		ManagedCUDA
NPP		NPP
NVRTC		NVRTC
NvJpeg		NvJpeg
Samples		Samples
Scripts		Scripts
.gitattributes		.gitattributes
.gitignore		.gitignore
CudaDNN.netCore.sln		CudaDNN.netCore.sln
CudaDNN.sln		CudaDNN.sln
GlobalAssemblyInfo.cs		GlobalAssemblyInfo.cs
GlobalAssemblyInfo.proj		GlobalAssemblyInfo.proj
LICENSE.txt		LICENSE.txt
ManagedCUDA.netCore.sln		ManagedCUDA.netCore.sln
ManagedCUDA.sln		ManagedCUDA.sln
README.md		README.md

Repository files navigation

Donate a beer to help me to keep managedCuda up to date :)

or

managedCuda

ManagedCUDA aims an easy integration of NVidia's CUDA in .net applications written in C#, Visual Basic or any other .net language.

For this it includes:

A complete wrapper for the CUDA Driver API, version 11.1 (a 1:1 representation of cuda.h in C#)
Based on this, wrapper classes for CUDA context, kernel, device variable, etc.
Wrapper for graphics interop with DirectX and OpenGL, respectively SlimDX and OpenTK
CUDA vector types like int2, float3 etc. with ToString() methods and operators (+, –, *, /)
Define your own types: CudaDeviceVariable accepts any user defined type if it is a value type, i.e. a struct in C#
Includes CUDA libraries: CUFFT, CURAND, CUSPARSE, CUBLAS, CUSOLVER, NPP, NvJPEG and NVRTC
Compatibility for .net Framework and .net Core >3.x.
Native Linux support for .net Core 3.x: Automatically switches the native library names.
Access device memory directly per element using [] operator:

CudaDeviceVariable<float> devVar = new CudaDeviceVariable<float>(64);
devVar[0] = 1.0f;
devVar[1] = 2.0f;
float hostVar1 = devVar[0];
float hostVar2 = devVar[1];

Implicit converter operators: Allocate and initialize device or host arrays in only one line of code:

float3[] array_host = new float3[100];
for (int i = 0; i < 100; i++)
{
	array_host[i] = new float3(i, i+1, i+2);
}
//alloc device memory and copy data:
CudaDeviceVariable<float3> array_device = array_host;
//alloc host array and copy data: 
float3[] array_host2 = array_device;

NPPs extension methods for CudaDeviceVariable. Add a reference to the NPP library and include the ManagedCuda.NPP.NPPsExtensions namespace:

Random rand = new Random();
int length = 256;

//init some ramdom values
double[] randoms = new double[length];
for (int i = 0; i < length; i++)
{
	randoms[i] = rand.NextDouble();
}

//Alloc device memory
CudaDeviceVariable<double> a = randoms;
CudaDeviceVariable<double> b = new CudaDeviceVariable<double>(length);
b.Set(10.0); //NPPs method
int size = a.MeanGetBufferSize(); //NPPs method
//Alloc temporary memory for NPPs mean method
CudaDeviceVariable<byte> buffer = new CudaDeviceVariable<byte>(size);
CudaDeviceVariable<double> mean = new CudaDeviceVariable<double>(1);

a.Mul(b); //NPPs method
a.DivC(10.0); //NPPs method
a.Mean(mean, buffer); //NPPs method

//Copy data back to host
double m = mean;
double[] res = a;

//Clean up
mean.Dispose();
buffer.Dispose();
b.Dispose();
a.Dispose();

The new feature 'per thread default stream' is available as a compiler directive of the managedCuda main library: Compile the library with the option "_PerThreadDefaultStream" to enable it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

managedCuda

About

Releases

Packages

Languages

License

dinggeonly/managedCuda

Folders and files

Latest commit

History

Repository files navigation

managedCuda

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages