OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

中国科学生命科学 2013

生命科学信息工程设施以及在中国的实现

DOI: 10.1360/052012-439, PP. 80-88

朱伟民, 朱云平, 杨啸林

Keywords: 生命组学,生物信息学,数据密集型科学,中国生物信息学,工程设施

Full-Text Cite this paper Add to My Lib

Abstract:

以组学数据为代表的生命科学数据呈指数增长.与高能物理、气象、地质、地理和环境科学等其他数据密集型学科一样,现代生命科学已经进入了高度信息化的时代——第四范式时代.国家跨组学信息工程大设施(ChinaInformationEngineeringInfrastructureforPan-OmicsStudies,CIEIPOS)已经成为推动中国生命科学进一步发展、并使海量数据转化成知识与应用的必不可少的国家生命科学基础设施.本文介绍国内外生物数据收集、管理与利用的现状,提出建设CIEIPOS生物信息“集散地”的重要性与迫切性,阐述实现数据整合、搜索与可视化的挑战与可能方案.CIEIPOS的另外一个重要功能是支持对组学数据的管理、分析、挖掘与利用,这使得CIEIPOS不同于传统的国际生物信息中心,如美国国家生物信息技术中心(NationalCenterforBiotechnologyInformation)与欧洲生物信息学研究所(EuropeanBioinformaticsInstitute).本文以质谱平台产出的高通量蛋白质组数据为例,说明组学数据分析的复杂性.通过对跨组学数据在不同时空的模拟分析,进一步说明CIEIPOS的实际应用对计算机硬件与网络的要求.

References

[1]	1 Schadt E E, Linderman M D, Sorenson J, et al. Computational solutions to large-scale data management and analysis. Nat Rev Genet, 2010, 11: 647-657
[2]	2 Smith A, Balazinska M, Baru C, et al. Biology and data-intensive scientific discovery in the beginning of the 21st century. OMICS, 2011, 15: 209-212？？
[3]	3 Kolker E, Stewart E, Ozdemir V. Opportunities and challenges for the life sciences community. OMICS, 2012, 16: 136-147
[4]	4 Crosswell L, Thornton J. ELIXIR: a distributed infrastructure for European biological data. Trends Biotechnol, 2012, 30: 241-242？？
[5]	5 Bu D C, Yu K T, Sun S L, et al. NONCODE v3.0: integrative annotation of long noncoding RNAs. Nucleic Acids Res, 2012, 40: Database issue, D210-D215
[6]	6 Wei L P, Yu J. Bioinformatics in China: a personal perspective. PLoS Comput Biol, 2008, 4: e1000020？？
[7]	7 Zdobnov E M, Lopez R, Apweiler R, et al. The EBI SRS server—recent developments. Bioinformatics, 2002, 18: 368-373？？
[8]	8 Saltz J H, Oster S, Hastings S L, et al. Integrating heterogeneous rules-engine technologies with caGrid. AMIA AnnuSymp Proc, 2007, 11: 1099
[9]	9 Smedley D, Haider S, Ballester B, et al. BioMart—biological queries made easy. BMC Genomics, 2009, 14: 22
[10]	10 Livne O E, Schultz N D, Narus S P. Federated querying architecture with clinical & translational health IT application. J Med Syst, 2011, 35: 1211-1224？？
[11]	11 van Vlymen J, de Lusignan S. A system of metadata to control the process of query, aggregating, cleaning and analysing large datasets of primary care data. Inform Prim Care, 2005, 13: 281-291
[12]	12 Shah P K, Perez-Iratxeta C, Bork P, et al. Information extraction from full text scientific articles: where are the keywords? BMC Bioinformatics, 2003, 4: 20
[13]	13 Gehlenborg N, O''Donoghue S I, Baliga N S, et al. Visualization of omics data for systems biology. Nat Meth, 2010, 7: S56-S68？？
[14]	14 Iragne F, Nikolski M, Mathieu B, et al. ProViz: protein interaction visualization and exploration. Bioinformatics, 2005, 21: 272-274？？
[15]	15 Zhou T T. Computational reconstruction of metabolic networks from KEGG. Methods Mol Biol, 2013, 930: 235-249？？
[16]	16 Funahashi A, Matsuoka Y, Jouraku A, et al. CellDesigner 3.5: a versatile modeling tool for biochemical networks. Proc IEEE, 2008, 96: 1254-1265
[17]	17 Leinonen R, Akhtar R, Birney E, et al. Improvements to services at the European Nucleotide Archive. Nucleic Acids Res, 2010, 38: Database issue, D39-D45？？
[18]	18 ENCODE Project Consortium. An integrated encyclopedia of DNA elements in the human genome. Nature, 2012, 489: 57-74？？
[19]	19 Kuehn B M. 1000 Genomes Project finds substantial genetic variation among populations. JAMA, 2012, 308: 2322-2325？？
[20]	20 Flicek P, Ahmed I, Amode M R, et al. Ensembl 2013. Nucleic Acids Res, 2013, 41: D48-D55？？
[21]	21 Meyer L R, Zweig A S, Hinrichs A S, et al. The UCSC Genome Browser database: extensions and updates 2013. Nucleic Acids Res, 2013, 41: D64-D69？？
[22]	22 Vizcaíno J A, C？té R G, Csordas A, et al. The Proteomics Identifications (PRIDE) database and associated tools: status in 2013. Nucleic Acids Res, 2013, 41: D1063-D1069？？
[23]	23 Ji L, Barrett T, Ayanbule O, et al. NCBI Peptidome: a new repository for mass spectrometry proteomics data. Nucleic Acids Res, 2010, 38: Database issue, D731-D735？？
[24]	24 Vizcaíno J A, Foster J M, Martens L. Proteomics data repositories: providing a safe haven for your data and acting as a springboard for further research. J Proteomics, 2010, 73: 2136-2146？？
[25]	25 Dowell R D, Jokerst R M, Day A, et al. The distributed annotation system. BMC Bioinformatics, 2001, 2: 7？？
[26]	26 Boeckmann B, Bairoch A, Apweiler R, et al. The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003. Nucleic Acids Res, 2003, 31: 365-370？？
[27]	27 Hassanien A E, Milanova M, Smolinski T, et al. Computional intelligence in solving bioinformatics problems: reviews, perspectives, and challenges. Comp Intel in Biomed & Bioinform, SCI, 2008, 151: 3-47 ？？
[28]	28 Taylor R C. An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinfor-matics. BMC Bioinformatics, 2010, 11: S1？？
[29]	29 Dean J, Ghemawat S. MapReduce: simplified data processing on large clusters. Proceedings of the 6th Symposium on OSDI, 2004, 137-150

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133