%0 Journal Article %T Design and Implementation of Matrix Multiplication on GPU
GPU上的矩阵乘法的设计与实现 %A LIANG Juan-Juan %A REN Kai-Xin %A GUO Li-Cai %A LIU Yan-Jun %A
梁娟娟 %A 任开新 %A 郭利财 %A 刘燕君 %J 计算机系统应用 %D 2011 %I %X Matrix multiplication is a basic operation in scientific computing.Efficient implementation of matrix multiplication can speed up many applications.In this paper,we implement an efficient matrix multiplication on GPU using NVIDIA's CUDA.The experiment shows that our implementation is as fast as the implementation in CUBLAS,and the speed of our implementation can reach the peak speed's 97%,on Geforce GTX260. %K matrix multiplication %K GPU %K CUDA
矩阵乘法 %K GPU %K CUDA %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D4F6864C950C88FFCE5B6C948A639E39&aid=2913A3BD7E7D147DE28C2E102B5E408E&yid=9377ED8094509821&vid=A04140E723CB732E&iid=CA4FD0336C81A37A&sid=4609832E4B5C797B&eid=51F97E23F233E582&journal_id=1003-3254&journal_name=计算机系统应用&referenced_num=0&reference_num=4