1. GENERAL LEARNING RESOURCES
1.1. Official resources and APIs
- CUDA C Programming Guide (pdf)
- CUDA Runtime API (pdf)
- CUDA Best Practices Guide (pdf)
- CUDA Driver API
1.2. Blogs
- CUDA Programming
- Parallel for all and a list of posts on disqus
- Solarian programmer
1.3. Slides and courseware
- Official slides
- Qwiklab's CUDA lab (among other interesting tutorials)
- An excellent collection of course material by the university of Illinois
- Yong Cao's (Virginia Tech) presentation of the CUDA programming model
- A very good collection of lectures notes on the website of the university of Standford
- More slides with a very good tutorial on reduction and scan
- Rice university lecture notes
1.4. Books
- CUDA Application Design and Development by Rob Farberg which is an excellent book
- CUDA by Example by Jason Sanders and Edward Kandrot
- The CUDA Handbook by Nicholas Wilt
- More suggestions can be found here by NVIDIA
2. CUDA ACCELERATED LIBRARIES
A collection of general-purpose libraries and links to the official websites:
3. SPECIAL TOPICS
3.1. Thrust
- First steps
- Thurst wiki on github - a good reference
3.2. Matrix multiplication
3.3. Advanced Topics
- Optimisation for your CUDA code: Performance assessment, Memory, Instructions, CUDA and concurrent execution, and more from this web page of the Penn State University, Institute for CyberScience.
- M. Harris, S. Sengupta and J.D. Owens, "Parallel Prefix Sum (Scan) with CUDA," GPU Gems 3, by NVIDIA.
No comments:
Post a Comment