Harshal Raut hrshl212
High Performance Computing | Machine Learning | CFD

2 followers · 0 following

Johns Hopkins University
Baltimore, Maryland

hrshl212/README.md

👋 Hi, I’m Harshal Raut
👀 I’m interested in High Performance Computing, Machine Learning and CFD
📫 How to reach me: hrshl212@gmail.com

Popular repositories

TD3-libtorch Public

TD3 reinforcement learning algorithm implemented with libtorch in a simple environment

Makefile 3

Preconditioned-Conjugate-Gradient-Method-in-CUDA Public

Preconditioned conjugate gradient method with an ILU preconditioner, implemented in CUDA

Cuda 2

Transformer-Attention-Mechanism-in-CUDA Public

Custom CUDA kernel for the transformer attention mechanism, integrated with PyTorch

Cuda 2

Matrix_transpose_using_GPU Public

Matrix transpose on the GPU, first with each thread transposing a single matrix element and then with unrolling as well.

Cuda 1

Optimized-matrix-multiplication-in-CUDA Public

Different ways to implement matrix-matrix multiplication in CUDA, from a basic implementation to one that uses the tensor cores of NVIDIA A100 GPUs

Cuda 1

Custom-CUDA-kernels-with-Neural-Network-Implementation Public

Custom CUDA kernels for a linear layer, softmax, and ReLU, integrated with Python to build a neural network

Python 1