..

计算机科学与系统生物学杂志

Management of High-Throughput DNA Sequencing Projects: Alpheus

Abstract

Neil A. Miller, Stephen F. Kingsmore, Andrew D. Farmer, Raymond J. Langley, Joann Mudge, John A. Crow, Alvaro J. Gonzalez, Faye D. Schilkey, Ryan J. Kim, Jennifer van Velkinburgh, Gregory D. May, C. Forrest Black, M. Kathy Myers, John P. Utsey, Nicholas S. Frost, Selene M. Virk, David J. Sugarbaker, Raphael Bueno, Stephen R. Gullans, Susan M. Baxter, St

High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricultural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage significant volumes of high-dimensional and inter-related data and analysis. is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multigigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche-454 (pyrosequencing) and Applied Biosystem’s SOLiD (sequencing-by-ligation). enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls dynamically based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximizing the identification of true positives. also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.

免责声明: 此摘要通过人工智能工具翻译,尚未经过审核或验证

分享此文章

索引于

相关链接

arrow_upward arrow_upward