Skip to content

tomchengchitang/gcn-assembly-study

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

35 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Brief

This repository collects some projects for my GCN assembly learning roads.

How to Build

Clone this repository, cd to it, and then

mkdir build
cd build
cmake .. -DCMAKE_CXX_COMPILER=hipcc -DCMAKE_C_COMPILER=hipcc -DCMAKE_PREFIX_PATH=/opt/rocm/lib/cmake
make -j

Introduction for each lesson

Lesson 0

  • How to set up the host part to launch assembly kernel and use 1 thread to set a value with specified value in assembly code.
  • Set up buffer resource descriptor

Lesson 1

  • How to set up buffer resource descriptor
  • Set exec mask to make lanes active/inactive

Lesson 2

  • Implement relu and leaky relu in both hip and assembly code

Lesson 3

  • Demonstrate how to use reduction method to get the maximum value via multiple threads in GPU
  • Please refer to my diagram to understand the assembly algorithm if you are new to assembly.

IMG_1912

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • C++ 36.5%
  • Python 31.5%
  • Assembly 28.5%
  • CMake 3.2%
  • Shell 0.3%