GPU Glossary

What is a Thread Block Grid?

When a CUDA kernel is launched, it creates a collection of threads known as a thread block grid. Grids can be one-, two-, or three-dimensional, and they are made up of thread blocks.
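As an illustration of how a grid's shape relates to its thread blocks, here is a minimal host-side sketch in plain C++. It is not CUDA code: `Dim3` is a hypothetical stand-in for CUDA's built-in `dim3` type, and `ceil_div`, `grid_for`, and `block_count` are names invented for this example. It assumes the common pattern of choosing a fixed thread block shape and then rounding the grid up so that it covers the whole problem.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for CUDA's built-in dim3 type, so this sketch
// compiles with an ordinary C++ compiler and needs no GPU.
struct Dim3 {
    std::uint32_t x, y, z;
};

// Ceiling division: the number of size-`b` chunks needed to cover `a` items.
constexpr std::uint32_t ceil_div(std::uint32_t a, std::uint32_t b) {
    return a / b + (a % b != 0 ? 1u : 0u);
}

// Choose grid dimensions (measured in thread blocks) so that the grid covers
// `extent` elements along each axis for a given thread block shape.
// Unused axes are expressed as 1, which yields a 1D or 2D grid.
inline Dim3 grid_for(Dim3 extent, Dim3 block) {
    assert(block.x > 0 && block.y > 0 && block.z > 0);
    return Dim3{ceil_div(extent.x, block.x),
                ceil_div(extent.y, block.y),
                ceil_div(extent.z, block.z)};
}

// Total number of thread blocks in the grid.
inline std::uint64_t block_count(Dim3 grid) {
    return static_cast<std::uint64_t>(grid.x) * grid.y * grid.z;
}
```

For example, covering 1,000 elements with 256-thread blocks gives a one-dimensional grid of 4 blocks, and covering a 1920 x 1080 image with 16 x 16 blocks gives a two-dimensional grid of 120 x 68 blocks. Because the grid is rounded up, the last block along an axis may hold threads with no element to process, which is why kernels typically bounds-check their indices.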

The matching level of the memory hierarchy is global memory.

Thread blocks are effectively independent units of computation. They execute concurrently, that is, in no guaranteed order: anywhere from fully sequentially, in the case of a GPU with a single Streaming Multiprocessor, to fully in parallel, when run on a GPU with sufficient resources to run them all simultaneously.
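The practical consequence of this independence can be sketched on the host in plain C++. This is a simulation under stated assumptions, not how a GPU schedules work: `run_block` and `run_grid` are hypothetical names, each "block" sums only its own slice of the input and writes only its own output slot, and `order` is assumed to be a permutation of the block indices. Under those assumptions the result should not depend on the order in which blocks run.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <vector>

// Work done by one hypothetical "thread block": sum its own slice of the
// input and write the result to its own output slot. It reads no other
// block's output, so it does not matter when the other blocks run.
inline void run_block(std::size_t block_idx, std::size_t block_size,
                      const std::vector<int>& in, std::vector<long>& out) {
    assert(block_idx < out.size());
    const std::size_t begin = std::min(block_idx * block_size, in.size());
    const std::size_t end = std::min(begin + block_size, in.size());
    out[block_idx] = std::accumulate(
        in.begin() + static_cast<std::ptrdiff_t>(begin),
        in.begin() + static_cast<std::ptrdiff_t>(end), 0L);
}

// Host-side stand-in for a scheduler: run the blocks one at a time in the
// given order. A real GPU may run them in any order, or many at once.
inline std::vector<long> run_grid(const std::vector<std::size_t>& order,
                                  std::size_t block_size,
                                  const std::vector<int>& in) {
    std::vector<long> out(order.size(), 0);
    for (std::size_t block_idx : order) {
        run_block(block_idx, block_size, in, out);
    }
    return out;
}
```

Calling `run_grid` with the orders `{0, 1, 2}`, `{2, 1, 0}`, and `{1, 2, 0}` on the same input produces the same output vector each time. If one block instead depended on a value written by another block, the result could change with the order, which is the kind of dependency the execution model above gives no ordering guarantee for.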
