%0 Journal Article
%T High Performance Massive Data Computing Framework Based on Hadoop Cluster
基于Hadoop的高性能海量数据处理平台研究
%A 翟岩龙
%A 罗 壮
%A 杨 凯
%A 徐晟晨
%J 计算机科学
%D 2013
%I
%X HPC of massive data presents tremendous value. However, cloud systems still lack HPC computing power.This study improved the HPC ability of cloud computing technology by adding GPU to the cloud system. The proposed platform is based on Hadoop MapReduce programming model, and it defines some OpenMP like directives to annotate MapReduce program. The annotated code will try to be executed in parallel. A GPUClassloader was designed to convert annotated java code regions to CUDA code. With JNI,generated CUDA code and run on the GPUs. The computing resups of GPUs can be transferred back to the map function, in the end, the map function finishes the rest computing. The platform can support the user to complete CPU, GPU collaborative large-scale data parallel processing programming conveniently.
%K CPU/GPU collaborative computing
%K Hadoop
%K Massive data processing
%K HPC
CPU/UPU协同计算,Hadoop,海量数据处理,高性能计算
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=EEFFAAE7C668ACD18700B90D21241931&yid=FF7AA908D58E97FA&vid=1371F55DA51B6E64&iid=38B194292C032A66&sid=8C83C265AD318E34&eid=89F76E117E9BDB76&journal_id=1002-137X&journal_name=计算机科学&referenced_num=0&reference_num=0