%0 Journal Article %T Determine priority of ETL workflow activities and their parallel execution
ETL工作流活动优先级的确定及并行实现* %A HUANG Jue-ming %A XI Jian-qing %A
黄觉明 %A 奚建清 %J 计算机应用研究 %D 2010 %I %X The process of ETL could be treated as a data-centric workflow. This paper discussed the execution of the ETL workflow and proposed an algorithm to determine the priority of the activities in the ETL workflow, threads were created for the activities that share the same priority and were not dependent on each other. The activities were put in the parallel execution environment, which could improve the execution efficiency of the ETL workflow. The result of the experiment shows that the acceleration ratio of the parallel algorithm and the serial algorithm could be approaching the ideal value, as long as the data records involving is large enough. The acceleration ratio rises as the number of the involved data records increases. %K data warehouse %K ETL(extraction %K transformation %K loading) workflow %K execution priority %K parallel execution
数据仓库 %K 抽取、转换和加载工作流 %K 执行优先级 %K 并行执行 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=CC2537DEDFC56F9C46848FD5160831CB&yid=140ECF96957D60B2&vid=DB817633AA4F79B9&iid=0B39A22176CE99FB&sid=E5D85F291CED2DA6&eid=CEFF71AEB051114C&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=1&reference_num=9