Parallel computing starter project for building GPU and CPU kernels in CUDA and C++ and calling them from Python through PyBind11, without a single line of CMake
The C++ & CUDA Starter Kit for Python Developers is aimed at high-performance computing work in which algorithms are prototyped in Python and then ported to C++ and CUDA. It removes the usual headaches of configuring build tools for heterogeneous code and hardware, so developers can focus on optimizing their algorithms rather than on build configuration.
The project provides a pre-configured environment for fast development and testing. It supports both CPU and GPU implementations, so the same project can use OpenMP and CUDA as the computation requires.
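The prototype-then-port workflow described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the project's actual code: the module name `starter_kit_ext` and the function name `reduce_sum` are hypothetical placeholders for whatever the compiled PyBind11 extension exposes. The idea is to keep a pure-Python reference implementation, load the compiled kernel when it is available, and check the two against each other.

```python
import importlib
import math


def reduce_sum_reference(values):
    """Pure-Python reference implementation, written first as the prototype."""
    total = 0.0
    for v in values:
        total += v
    return total


def load_kernel(module_name="starter_kit_ext", func_name="reduce_sum"):
    """Return the compiled kernel if it can be imported, else the reference.

    Both default names are hypothetical; substitute the names that the
    compiled extension actually exports.
    """
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Extension not built (or not installed): fall back to the prototype.
        return reduce_sum_reference
    return getattr(module, func_name, reduce_sum_reference)


def matches_reference(kernel, values, rel_tol=1e-6):
    """Compare a kernel's result with the reference within a tolerance."""
    expected = reduce_sum_reference(values)
    return math.isclose(kernel(values), expected, rel_tol=rel_tol, abs_tol=1e-9)


if __name__ == "__main__":
    kernel = load_kernel()
    data = [0.25 * i for i in range(1000)]
    print("kernel matches reference:", matches_reference(kernel, data))
```

Comparing with a tolerance rather than exact equality is deliberate: a parallel C++ or CUDA reduction usually adds floating-point values in a different order than a sequential Python loop, so small rounding differences are expected.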
setup.py and requirements-{cpu,gpu}.txt replace the cumbersome build configuration typically associated with such projects.
tasks.json is pre-configured to support debugging for both CPU and GPU code.