GPU memory -> device memory
CPU memory -> host memory
In CUDA C, data transfers are carried out with cudaMemcpy, passing the cudaMemcpyHostToDevice or cudaMemcpyDeviceToHost direction flag. Device memory is allocated with cudaMalloc and freed with cudaFree.
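For comparison, PyCUDA's driver-level API exposes the same explicit allocate/copy/free pattern. A minimal sketch (the array size and variable names are illustrative):

import numpy as np
import pycuda.autoinit          # creates a CUDA context on import
import pycuda.driver as cuda

host_data = np.arange(1024, dtype=np.float32)

device_ptr = cuda.mem_alloc(host_data.nbytes)  # cf. cudaMalloc
cuda.memcpy_htod(device_ptr, host_data)        # cf. cudaMemcpy with cudaMemcpyHostToDevice
# ... kernel launches would go here ...
result = np.empty_like(host_data)
cuda.memcpy_dtoh(result, device_ptr)           # cf. cudaMemcpy with cudaMemcpyDeviceToHost
device_ptr.free()                              # cf. cudaFree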
PyCUDA hides the boilerplate of memory allocation, deallocation, and transfer behind its gpuarray class, which also frees device memory automatically when the object's lifetime ends.
How to transfer data between the host and the GPU?
Hold the data in host memory as a NumPy array, e.g. host_data.
Transfer it to the GPU with device_data = gpuarray.to_gpu(host_data).
After the computation, retrieve the result with device_data.get() (get() is a method of the GPUArray object, not a module-level function); see the sketch below.
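A minimal round-trip sketch, assuming a simple doubling computation on the device (the array contents are illustrative):

import numpy as np
import pycuda.autoinit
import pycuda.gpuarray as gpuarray

host_data = np.array([1, 2, 3, 4], dtype=np.float32)
device_data = gpuarray.to_gpu(host_data)   # host -> device transfer
device_data = 2 * device_data              # arithmetic runs on the GPU
output = device_data.get()                 # device -> host transfer
print(output)                              # [2. 4. 6. 8.]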
In PyCUDA, GPU code is often compiled at runtime with the NVIDIA nvcc compiler and then called from Python. This can cause an unexpected slowdown, usually the first time a program or GPU operation is run in a given Python session.
Pointwise operations can be implemented with inline CUDA C via an ElementwiseKernel, as in the example below:
gpu_2x_ker = ElementwiseKernel(
    "float *in, float *out",
    "out[i] = 2*in[i];",
    "gpu_2x_ker")
This CUDA C source is compiled by the external nvcc compiler at runtime (on first use) and the resulting kernel is then launched via PyCUDA.
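A runnable sketch around this kernel; the random input data here is illustrative:

import numpy as np
import pycuda.autoinit
import pycuda.gpuarray as gpuarray
from pycuda.elementwise import ElementwiseKernel

# Kernel definition as above
gpu_2x_ker = ElementwiseKernel(
    "float *in, float *out",
    "out[i] = 2*in[i];",
    "gpu_2x_ker")

host_data = np.float32(np.random.random(50))
device_in = gpuarray.to_gpu(host_data)       # host -> device
device_out = gpuarray.empty_like(device_in)  # uninitialized device buffer

gpu_2x_ker(device_in, device_out)   # first call triggers the nvcc compile
print(np.allclose(device_out.get(), 2 * host_data))  # prints True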