全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Determine priority of ETL workflow activities and their parallel execution
ETL工作流活动优先级的确定及并行实现*

Keywords: data warehouse,ETL(extraction,transformation,loading) workflow,execution priority,parallel execution
数据仓库
,抽取、转换和加载工作流,执行优先级,并行执行

Full-Text   Cite this paper   Add to My Lib

Abstract:

The process of ETL could be treated as a data-centric workflow. This paper discussed the execution of the ETL workflow and proposed an algorithm to determine the priority of the activities in the ETL workflow, threads were created for the activities that share the same priority and were not dependent on each other. The activities were put in the parallel execution environment, which could improve the execution efficiency of the ETL workflow. The result of the experiment shows that the acceleration ratio of the parallel algorithm and the serial algorithm could be approaching the ideal value, as long as the data records involving is large enough. The acceleration ratio rises as the number of the involved data records increases.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133