IB-CAS  > 系统与进化植物学国家重点实验室
Are Differences in Genomic Data Sets due to True Biological Variants or Errors in Genome Assembly: An Example from Two Chloroplast Genomes
Wu, Zhiqiang; Tembrock, Luke R.; Ge, Song
2015
发表期刊PLOS ONE
ISSN1932-6203
卷号10期号:2
摘要DNA sequencing has been revolutionized by the development of high-throughput sequencing technologies. Plummeting costs and the massive throughput capacities of second and third generation sequencing platforms have transformed many fields of biological research. Concurrently, new data processing pipelines made rapid de novo genome assemblies possible. However, high quality data are critically important for all investigations in the genomic era. We used chloroplast genomes of one Oryza species (O. australiensis) to compare differences in sequence quality: one genome (GU592209) was obtained through Illumina sequencing and reference-guided assembly and the other genome (KJ830774) was obtained via target enrichment libraries and shotgun sequencing. Based on the whole genome alignment, GU592209 was more similar to the reference genome (O. sativa: AY522330) with 99.2% sequence identity (SI value) compared with the 98.8% SI values in the KJ830774 genome; whereas the opposite result was obtained when the SI values in coding and noncoding regions of GU592209 and KJ830774 were compared. Additionally, the junctions of two single copies and repeat copies in the chloroplast genome exhibited differences. Phylogenetic analyses were conducted using these sequences, and the different data sets yielded dissimilar topologies: phylogenetic replacements of the two individuals were remarkably different based on whole genome sequencing or SNP data and insertions and deletions (indels) data. Thus, we concluded that the genomic composition of GU592209 was heterogeneous in coding and non-coding regions. These findings should impel biologists to carefully consider the quality of sequencing and assembly when working with next-generation data.
学科领域Multidisciplinary Sciences
DOI10.1371/journal.pone.0118019
收录类别SCI
语种英语
WOS关键词PHYLOGENETIC ANALYSIS ; GENE ORGANIZATION ; DNA ; SEQUENCE ; INDELS ; DIVERSIFICATION ; IDENTIFICATION ; TRANSFORMATION ; CHALLENGES ; EVOLUTION
WOS研究方向Science Citation Index Expanded (SCI-EXPANDED)
WOS记录号WOS:000349444900251
出版者PUBLIC LIBRARY SCIENCE
文献子类Article
出版地SAN FRANCISCO
资助机构National Natural Science Foundation of China [30990240]
作者邮箱wu.zhiqiang.1020@gmail.com
作品OA属性Green Published, Green Submitted, gold
引用统计
被引频次:43[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://ir.ibcas.ac.cn/handle/2S10CLM1/26022
专题系统与进化植物学国家重点实验室
作者单位1.Chinese Acad Sci, Inst Bot, State Key Lab Systemat & Evolutionary Bot, Beijing, Peoples R China
2.Wu, Zhiqiang; Tembrock, Luke R.] Colorado State Univ, Dept Biol, Ft Collins, CO 80523 USA
推荐引用方式
GB/T 7714
Wu, Zhiqiang,Tembrock, Luke R.,Ge, Song. Are Differences in Genomic Data Sets due to True Biological Variants or Errors in Genome Assembly: An Example from Two Chloroplast Genomes[J]. PLOS ONE,2015,10(2).
APA Wu, Zhiqiang,Tembrock, Luke R.,&Ge, Song.(2015).Are Differences in Genomic Data Sets due to True Biological Variants or Errors in Genome Assembly: An Example from Two Chloroplast Genomes.PLOS ONE,10(2).
MLA Wu, Zhiqiang,et al."Are Differences in Genomic Data Sets due to True Biological Variants or Errors in Genome Assembly: An Example from Two Chloroplast Genomes".PLOS ONE 10.2(2015).
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
Wu-2015-Are Differen(1144KB)期刊论文出版稿开放获取CC BY-NC-SA浏览 请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Wu, Zhiqiang]的文章
[Tembrock, Luke R.]的文章
[Ge, Song]的文章
百度学术
百度学术中相似的文章
[Wu, Zhiqiang]的文章
[Tembrock, Luke R.]的文章
[Ge, Song]的文章
必应学术
必应学术中相似的文章
[Wu, Zhiqiang]的文章
[Tembrock, Luke R.]的文章
[Ge, Song]的文章
相关权益政策
暂无数据
收藏/分享
文件名: Wu-2015-Are Differences in Genomic Data Sets d.pdf
格式: Adobe PDF
此文件暂不支持浏览
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。