必读文献
2016-06-24 10:35
316 查看
基因组组装
(四倍体陆地棉基因组 - 南农&诺禾 – Illumina2000 – PE100 – 245X - SOAPdenovo组装 – 17w BAC - 遗传图谱)Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement(NB 41)Supplementary Text and Figures (1,578 KB)
基因大小2.5G,contig N50 34 kb,scaffold N50 1.6 Mb。
Short-insert paired-end (180, 300, and 500 bp) and large-insert mate-pair libraries (2, 5, 10 kb)
All libraries were sequenced at 2 × 100 bp on an Illumina HiSeq 2000 platform
Three BAC libraries with average insert sizes of 160 kb, 152 kb and 100 kb, respectively, were sequenced at both ends using the Sanger sequencing method.
Genome assembly, scaffolding and gap-closing.
All sequences were assembled using the SOAPdenovo package12. A de Bruijn graph was built using a K-mer size of 63. After removing tips, merging bubbles and concatenating the tiny repeats, contigs were built from the simplified de Bruijn graph. Paired-end short reads were then aligned back onto the contigs to construct the linkage relationship for contigs. Scaffolds were assembled based on these paired-end links and gaps in the scaffolds were filled by Gapcloser. BAC-ends were mapped to the assembly using BWA-SW software12. Further scaffolding was then conducted, based on links between BAC-ends.
野生大豆泛基因组阐明遗传多样性与重要农艺性状
(大豆 - 农科院作科所)De novo assembly of soybean wild relatives for pan-genome analysis of diversity and agronomic traits(NB 41)
Supplementary Text and Figures (10,233 KB)
De novo assembly.
First, we generated a 17-mer depth distribution of short-insert paired-end reads using Meryl50 and applied GCE51 to estimate the genome sizes of individual G. soja accessions. Reads were preprocessed by ALLPATHS-LG52 error correction module to remove base calling errors. We also used ErrorCorrection in SOAPdenovo11 package to connect 180-bp library pair end reads and to generate longer sequences for assembly. Reads of 180-bp and 500-bp library were used for contig building, and all pair-end reads libraries were used to provide links for scaffold construction. GapCloser (v1.12) from SOAPdenovo11 package was used for gap filling within assembled scaffolds using all pair-end reads. Finally, scaffold sequences, which can be aligned to bacterial genomes with identity ≥95% and e-value ≤1e-5, were filtered.
(金丝猴基因组 - 中科院动物所&诺禾 – Illumina2000 - PE100)Whole-genome sequencing of the snub-nosed monkey provides insights into folivory and evolutionary history(NG 29)
基因组组装软件评估文章
1. Bao, S., et al. (2011). "Evaluation of next-generation sequencing software in mapping and assembly." J Hum Genet.2. Vezzi, F., et al. (2012). "Reevaluating assembly evaluations with feature response curves: GAGE and assemblathons." PLoS One 7(12): e52210.
3. Salzberg, S. L., et al. (2012). "GAGE: A critical evaluation of genome assemblies and assembly algorithms." Genome Res 22(3): 557-567.
4. Zhang, W., et al. (2011). "A practical comparison of de novo genome assembly software tools for next-generation sequencing technologies." PLoS One 6(3): e17915.
5. Narzisi, G. and B. Mishra (2011). "Comparing de novo genome assembly: the long and short of it." PLoS One 6(4): e19175.
6. Lin, Y., et al. (2011). "Comparative Studies of de novo Assembly Tools for Next-generation Sequencing Technologies." Bioinformatics.
7. Finotello, F., et al. (2011). "Comparative analysis of algorithms for whole-genome assembly of pyrosequencing data." Brief Bioinform.
8. Earl, D. A., et al. (2011). "Assemblathon 1: A competitive assessment of de novo short read assembly methods." Genome Res.
三代组装
Assembly and diploid architecture of an individual human genome via single-molecule technologiesFalcon讲得非常详细,有详细的配置文件和方法。
Long-read sequence assembly of the gorilla genome
使用Falcon组装,没找到具体的配置参数。
De novo assembly of Dekkera bruxellensis: a multi technology approach using short and long-read sequencing and optical mapping
多种组装软件的比较,其中包括Falcon
其他
全基因组测序项目发表文章档次的影响因素相关文章推荐
- 深度学习(5)
- js数组排序
- PHPStorm&PHPstudy环境配置
- GDAL/OGR 调试方式编译
- 二叉树深度和宽度
- WebRTC结构
- 简单科普下hosts文件原理与制作
- weka的java使用(3)——特征选择
- ERROR 2002 (HY000): Can't connect to local MySQL server through socket '/tmp/mysql.sock' (2)
- java基础讲解之集合鼻祖--- Collection
- Mysql用户以及权限
- 配置hadoop2.X的namenode HA及Yarn HA
- 操作系统面试—进程同步(二)
- 深度学习(4)
- 动态规划、记忆化搜索、Dijkstra算法的总结
- [Android Tips] 20. Android Studio Tips
- Broadcat监视电量变化
- QT QTreeWidget 选中某行并设置背景色高亮
- VMware虚拟机窗口设置
- Linux下必须知道的11个网络命令