MOJ ISSN: 2374-6920MOJPB

Proteomics & Bioinformatics
Research Article
Volume 4 Issue 5 - 2016
Whole Metagenome Analysis of Extreme Environment Samples by Quasi-Linear Whole Genome Amplification
Beena PS*, Rajadurai CP, Aswathy S, Tintu Joy, Saju Michel, Sam Santhosh and Dhinoth Kumar Bangarusamy
SciGenom Labs Pvt Ltd, India
Received: October 25, 2016| Published: December 20, 2016
*Corresponding author: Beena P.S, SciGenom Labs Pvt Ltd, 43A, SDF 3rd Floor, CSEZ, Kakkanad, Kerala, India, Tel: +919446971288; Fax: 04842413398; Email: ;
Citation: Beena PS, Rajadurai CP, Aswathy S, Joy T, Michel S, et al. (2016) Whole Metagenome Analysis of Extreme Environment Samples by Quasi-Linear Whole Genome Amplification. MOJ Proteomics Bioinform 4(5): 00137. DOI: 10.15406/mojpb.2016.04.00137

Abstract

Microbiomes are gaining a lot of importance owing to its impact in the human health and other life processes. The significance of microbes in the deep sea is often undermined but it holds the key for the environmental changes that affects our planet. Recently scientists are investigating the diversity of microbial life in the sea to learn its role in the ocean processes, its interactions with other marine life, and its importance to humans. Emergence of cutting edge genomics and molecular biology tools has propelled the field of metagenomics to study the microbial populations without the need to culture them. In spite of these growth studies of extreme environmental samples with very low nucleic acid concentration remains still a major challenge. A whole genome amplification method with less bias is required to get over this problem. Commercially available single cell whole genome amplification kits don’t perform well with the low concentration of samples. A minimum of 100ng of DNA is the input requirement for these kits and some of the extreme environment samples can only yield 1-10ng of DNA. Our aim was to devise an efficient and cost effective workflow that could work with very low concentration of metagenomic DNA. We have adopted a quasi-linear whole genome amplification methodology, Multiple Annealing and Looping Based Amplification Cycles (MALBAC) that was earlier used in the single cell genomic studies. MALBAC methodology was carried out using 4ng of the deep sea sediment DNA and it was compared against the NEB Next Ultra DNA Library prep kit using 125ng of deep sea sediment DNA. The data generated were analysed and compared for the microbial diversity and gene function annotations. The results obtained through MALBAC methodology were comparable to that of the commercial kit based procedure utilizing nearly 125ng of DNA. The MALBAC based whole metagenome analysis of deep sea sample from Arabian Sea revealed the immense microbial diversity of unexplored extreme environment. The study also identified the functional genes coding for valuable enzymes and proteins that could be used in the biotech industry. This study has showed that MALBAC is a cost effective procedure for extreme environment samples containing very low concentration of DNA.

Keywords: MALBAC; Deep sea; Amplification; Metagenome; Extreme environment

Abbreviations

MALBAC: Multiple Annealing and Looping Based Amplification Cycles; DNA: Deoxyribonucleic acid; MDA: Multiple Displacement Amplification Method; NGS: Next Generation Sequencing

Introduction

The deep sea is an extreme unique environment with low temperature and high pressure. Similar to the ocean surface, several microbes living in the deep sea have specialized genomic features associated with their adaptation to extreme conditions. Deep sea is such an ecosystem in which 99% remains unexplored compared to other ecosystems studied. This genomic divergence can lead to difficulty in categorization, characterization and in turn exploration of biological treasure of deep sea for the benefit of human kind. Difficulty in culturing majority of microbes in an ecosystem prevents the potential study of the specific system. There are many molecular methods to explore these diverse potential. Cultivable organisms and its potential could be studied without any fail if the samples are available. But uncultivable microbes remain as a question mark. Culture independent techniques based on nucleic acid extraction from the ecosystem provide information on community structure and diversity. Hence microbial diversity in deep sea has the capability to tolerate nutrient poor environments, high concentration of salts and temperature variations. The potential enzymes or metabolites produced by these organisms would be much more stable and useful than those produced by terrestrial organisms.

The microbiome studies like Earth microbiome [1] and Human gut microbiome [2] the entire DNA of a sample is extracted and sequenced to determine its taxonomy and to annotate the genes present in sample and overall functionalities and pathways present in the specific environment. For a whole metagenome study the most critical aspect is isolation of quality and quantity DNA from the environmental sample. The sample features differ from place to place, as the salinity and other mineral content differs, and the extraction protocols could not be standardized for each and every location. The success of whole genome metagenome sequencing relies on the quality and amount of DNA obtained from the sample. Even though there are many extraction protocols that promises high yield of DNA, there are cases of extreme environments from which we fail to isolate quality and quantity DNA for high through put sequencing. Here we come up with an amplification procedure which is generally used for single cell genomics which could get rid of the problem of low quality and quantity of DNA. The first metagenomic studies attempted using high-throughput sequencing method was massively parallel 454 pyrosequencing [3]. Three other technologies commonly applied to environmental sampling are the Ion Torrent Personal Genome Machine, the Illumina MiSeq or HiSeq and the Applied Biosystems SOLiD system [4].

In the late 1970s, Carl Woese proposed the use of ribosomal RNA genes as molecular markers for life classification [5]. This idea in conjunction with the Sanger automated sequencing [6] method revolutionized the study and classification of microorganisms. Later, advances in molecular techniques were applied to microbial diversity studies to a “new uncultured world” of microbial communities. Some of these techniques, which had a remarkable impact, were the polymerase chain reaction (PCR), rRNA genes cloning and sequencing. Polymerase chain reaction (PCR) based amplification have sequence dependent bias due to the exponential amplification with random primers [7-9]. Multiple displacement w29 DNA polymerase with random primers under isothermal conditions is widely applied in single-cell genomics projects [10-12]. But amplification bias still exists in Multiple Displacement Amplification method (MDA). An alternative method of whole genome amplification, so-called multiple annealing and looping-based amplification cycles (MALBAC), emerged recently [13,14].

 Combining advantage features of MDA and tweaked PCR, MALBAC substantially reduces experimental bias related to non-linear amplification. Amplification with MALBAC is initiated with a group of random primers that can evenly hybridize to the templates at 0ËšC. MALBAC primers are 35 nucleotides long [13] which have a common 27-nucleotide sequence and 8 variable nucleotides. The common nucleotide sequence is GTG AGT GAT GGT TGA GGT AGT GTG GAG. The 8 variable nucleotides anneal randomly to the single stranded genomic DNA molecule. After a 5-cycle initial reaction, specific DNA polymerase derived from Bacillus stearothermophilus (Bst polymerase) with strand-displacement activity were used to generate semi-amplicons at 65 ËšC. The same primers having complementary ends were then used to generate full amplicons after the annealing at 94 ËšC. As the common sequence of random primer on terminal can form loop-like amplicons, enough quantity of DNA for sequencing will be obtained after 20 cycle regular PCR amplification. By this approach, MALBAC method can prevent a lot of random amplification bias.

We report on assessment of the ability of MALBAC protocol for generating good quality and quantity DNA for whole metagenomic sequencing approach to produce DNA profiles that can give taxonomic classification and gene annotation as a whole genome metagenome sequencing approach from high quality and quantity DNA. This methodology could be adopted for extreme environmental samples from which DNA isolation is critical.

Materials and Methods

Sample collection

Sediment sample collected onboard on the FORV (Fishery and Oceanographic Research Vessel) “Sagar Sampada” the cruise no. 321 (during 03/12/2013 to 14/12/2013) along southwest coast of India. Sediment samples collected using a Smith McIntyre grab (having a bite area of 0.2 m2) from 9⁰57'N, 75⁰42'E, continental shelf region of Arabian Sea at a depth of 102 m depth on 5/12/2013 (Figure 1).

Figure 1: Location in Arabian sea where the deep sea sediment samples were collected.

DNA isolation and MALBAC PCR

DNA was extracted from 50mg sediment using Ultra clean Soil DNA Isolation Kit (MO-BIO, USA) following the manufacturer's recommendations. The quality and quantity of the extracted DNA was checked using agarose gel (1%) stained with sybr safe and Qubit dsDNA HS assay kit (Invitrogen, USA) respectively followed by the MALBAC procedure.

50µl of a linear pre-amplification mix containing 36µl H2O, 5µl 10x standard Taq reaction buffer (NEB, USA), 2µl 10 mM dNTP, 2µl 50 mM MgSO4, and 2µl of 5 mM of each MALBAC Primers was added to 4ng of deep sea sediment DNA. The reaction mixture was incubated at 94 °C for 3 min and immediately quenched on ice. 1µl of Bst polymerase (NEB,USA) was added to the reaction and run at 10 °C for 45 sec; 20 °C for 45 sec; 30 °C for 45 sec; 40 °C for 45 sec; 50 °C for 45 sec; 65 °C for 2 min; 94 °C for 20 sec, and immediately quenched on ice. Another 1 µl of Bst polymerase (NEB, USA) was added to the reaction and run at 10 °C for 45 sec; 20 °C for 45 sec; 30 °C for 45 sec; 40 °C for 45 sec; 50°C for 45 sec; 65 °C for 2 min; 94 °C for 20 sec; 58 °C for 20 sec and immediately quenched on ice. The second pre-amplification step was repeated 5 times and an aliquot of 10µl from this mixture was further subjected to PCR amplification using 5µl 10X standard taq reaction buffer (NEB,USA), 1µl 10 mM dNTP, 3.35µl 50 mM MgSO4, 2µl Bst polymerase (NEB, USA), 3µl 10 uM MALBAC PCR primer (5’GTGAGTGATGGTTGAGGTAGTGTGGAG3’) and 25.65µl of H2O and incubated for 20 cycles at 94 °C for 20 sec; 59 °C for 20 sec; 65 °C for 1 min and 72°C for 2 min. Final PCR reactions were then cleaned up using Invitrogen Pure Link custom PCR purification columns (Invitrogen, USA)and quantified using the Qubit HS DNA kit (Invitrogen, USA).

Simultaneously 125 ng of the deep sea sediment DNA was processed using NEB Next Ultra DNA Library prep kit using the procedure recommended by the manufacturer. Libraries obtained from this procedure were compared against the MALBAC libraries in order to study the efficiency and sensitivity of MALBAC procedure on low concentration samples.

Primers used:
MALBAC1 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNTCG 3’
MALBAC2 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNAGT 3’
MALBAC3 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNGAC 3’
MALBAC4 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNTCA 3’
MALBAC5 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNCAT 3’
MALBAC6 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNGAT 3’
MALBAC7 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNTAG 3’
MALBAC8 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG NNNNNGCT 3’
MALBAC PCR 5’GTG AGT GAT GGT TGA GGT AGT GTG GAG 3’

 Library preparation

 The DNA sample (DS) from deep sea sediment was divided in to two parts. One part of the sample with ~120 ng concentration (DS1) was sheared on Covaris M220 (Covaris, USA) instrument using the following program: 2 min 10% duty cycle, intensity 5, 200cycles/burst and under frequency sweep. Sequencing library was then made using the NEBNext® Ultra™ DNA Library Prep Kit for Illumina® according to the manufacturer’s protocol. Another part of the sample (DS2) with 4 ng concentration was subjected to MALBAC procedure and 120ng of the resulting product was used to prepare library using NEBNext® Ultra™ DNA Library Prep Kit for Illumina® following the manufacturer's instruction. Final libraries were quantified with the Qubit HS DNA kit, visualized on an Agilent Bio analyzer 2100, and the libraries were sequenced from both ends for 150 cycles on Illumina HiSeq 2500 using rue Seq SBS V3 kit (Figure 2).

Figure 2: Work flow from sample to Data.

Quality trimming and alignment of sequence

The raw reads of samples (DS1 and DS2) were quality checked using FastQC tool [15] and the reads with Phred quality ≥Q20 were retained for further downstream analysis. The adapters were removed from the sample reads using cutadapt, version 1.8 [16]. MALBAC primers were removed from raw reads of DS2 using an in-house perl program. Prior to assembly, all reads were normalized to the lowest read count (1.92 million reads) among the two samples (DS1 & DS2), in order to avoid any assembly bias. Pre-processed sequences were the assembled with RayMeta [17] using a k-mer size of 31. The contigs with less than 150 bp length were filtered and protein coding regions (Figure 3) were predicted using Glimmer-MG v0.3.2 [18] program. We performed BLASTX of all the predicted genes against the protein database using the BLAST version 2.2.29+ [19] with an e value of 1e-5. The functional analysis of all hits were analysed using the KEGG [20] and SEED [21] options provided in the MEGAN software [22].

Figure 3: Overall bioinformatics workflow of metagenomics studies on DS1 and DS2 sample.

Results and Discussion

The focus of metagenomics is to assess the frequency of taxa and gene functions of microbes within an environment through DNA sequence data. The austerity of these evaluation lies on how accurately the frequency of the originating microbial population within the community can be captured. Next-generation sequencing (NGS) provides us exciting possibilities to achieve rapid and cost-effective identification of both cultivable and uncultivable microorganisms in the sample. The accuracy of the NGS data relies on the quality and quantity of the Nucleic acid content derived from the samples [23]. The concentration of the nucleic acid in the sample plays a significant role in determining the specificity of the analysis and hence it is a challenge to analyse the extreme environment samples. In this study we have showed that MALBAC procedure accurately reflect the true composition and frequency of the microbial population and hence can be effectively used in the case of extreme environment samples with low nucleic acid content.

Sizeable number of studies exploring the microbial population of environments have focused on DNA profiling [24,25], which requires efficient and unbiased DNA extraction and amplification procedures. These requirements are more critical in the case of NGS based metagenomic analysis of complex and extreme environments [23].

In this study we wanted to explore the efficiency of MALBAC procedure and its compatibility with the NGS applications to analyse the low concentration of nucleic acid obtained from the deep sea sediment sample (DS). In order to assess the specificity and sensitivity of the procedure we compared the MALBAC mediated library preparation (DS2) with a direct procedure (DS1) that doesn't involve the genome amplification step. The 150 base paired reads from whole metagenome sequenced libraries were analyzed after filtering out low quality reads and trimming out the adaptors and primers. The reads were then normalized and assembled to contigs. The number of contigs and the N50 (Longest contig length of DS2 was 920 compared to 517 of DS1) was found to be more for MALBAC based procedure compared to the direct method indicating the quality of assembly and the efficient capture of the microbial population in the sample [26].

The contig size length distribution differs for DS1 and DS2 (Figure 4 & 5). MALBAC based sample has less contig length than the direct methodology, eventhough the longest contig was obtained in DS2. Microbial ecology has many tools for assessing species diversity. Rarefaction curves are used to estimate the coverage obtained from sampling (Figure 6). No significant difference observed between the number of leaves in taxonomy of DS1 and DS2.

Figure 4: Length distribution of all contigs of sample DS1 after assembly.
Figure 5: Length distribution of all contigs of sample DS2 after assembly
Figure 6: Rarefaction curve of sample DS1 and DS2.

The microbial community of the sediment was dominated by Proteobacteria, followed by Planctomycetes, Actinobacteria, Firmicutes, Bacteroidetes, Cyanobacteria, Chloroflexi, Nitrospirae, Acidobacteria etc. The major representatives are found to be similar in both of the samples with slightly varied abundance (Figure 7) Microbial diversity in both of the methodology shows the presence of Shewanella frigidimarina Antartic bacterial species which has the ability to produce Omega 3 fatty acid like Eicosapentaenoic acid which has medicinal properties. Other major representative in both methodologies is Vibrio rumoiensis, psychrophilic bacteria, which has high catalase activity.

Figure 7: Phylum representations of DS1 and DS2.

One goal of sequencing-based metagenomic community analysis is the quantitative taxonomic assessment of microbial community compositions. In particular, relative quantification of taxons is of high relevance for metagenomic community comparison. Hence finding the relative abundance of species is very important in determining the metagenome of an extreme environment. Figure 8 shows the relative abundance of species in direct method and MALBAC approach. Even though quantitatively DS2 abundance is less, qualitatively all the major representations are present in DS2 which shows the usability of the approach in samples with low concentration. 

Figure 8: Relative abundance of species distributed among DS1 and DS2 Sample.

Comparison of KEGG functional classification results obtained from KEGG mapper (Figure 9) indicated that DS1 and DS2 were able to detect the prominent metabolic pathways in spite of the lower read abundance in DS2. The datasets were further annotated with SEED (Figure 10) indicating a similar trend for 12 metabolic pathways. Results showed that at the same sequencing depth, abundance of annotated genes in DS1 dataset were generally more than DS2. Similar to the findings about soil [27], and marine microbial communities [28], protein metabolism, carbohydrates, amino acids and derivatives, as well as miscellaneous were the four most abundant categories in the global metabolism of microbial communities, suggesting the dominant roles of these categories in the microorganisms. These results indicated that MALBAC based procedure was able to detect almost all the metabolic pathways captured by the direct method, although there was a difference in the number of reads representing these pathways. BLASTX results were utilized for annotating Vibrio sps functional genes (camphor resistance protein CrcB, N- acetyl glutamate synthase, Valyl t RNA synthase, carboxyl phosphatase synthase, chitinase, recombinase RecA) and , Shewanella sps fucntional genes (heavy metal efflux pump, CRISPR associated CsyZ family protein etc) (Table 1).

 

Assembly

 

DS1 Sample

 

DS2 Sample

 

Total raw reads

 

2,684,044

 

18,820,122

 

Normalized read count

 

964,712

 

964,712

 

Average read length (bp)

 

150

 

150

 

Total number of contigs

27,639

93,163

 

N50

 

517

 

920

 

Longest Contig

 

623

 

920

Table 1: Assembly summary for Illumina data of DS1 and DS2 samples. It represents raw reads obtained, normalized read count, average read length obtained in bp, total number of contigs obtained from data, N50 and longest contig representation of DS1 and DS2 sample.

Figure 9: KEGG method based gene functional classification of DS1 and DS2 Sample. Functional classification with reference to number of reads.
Figure 10: SEED method based gene functional classification of DS1 and DS2 Sample. Gene function with reference to number of reads

Whole-genome amplification strategy is indispensable for precious samples from extreme environments. Majority of the commercial kits in market focus on human samples requires template in excess of 10 ng and utilize a clonal amplification step via random primers that can result in a bias [29-31]. There are several multiple displacement amplification procedures (MDA) that has come out recently which uses random priming and the strand-displacing phi 29 polymerase under isothermal condition [32], to provide improvements over the conventional PCR-based methods. These methods still exhibit considerable bias due to nonlinear amplification.

In order to overcome these obstacles we tried Multiple Annealing and Looping Based Amplification Cycles (MALBAC) [13] for the whole genome amplification of an extreme environmental sample and compared it with the high concentration DNA library preparation methodology to assess the uniformity, reproducibility and specificity. This methodology introduces quasi-linear preamplification to reduce the bias associated with nonlinear amplification. Picograms of DNA fragments (~10 to 100kb) from each individual in a population serve as templates for amplification with MALBAC. The amplification is initiated with a pool of random primers, each having a common 27-nucleotide sequence and 8 variable nucleotides that can evenly hybridize to the templates at 0 °C. At an elevated temperature of 65 °C, DNA polymerases with strand displacement activity are used to generate semi-amplicons with variable lengths (0.5-1.5kb), which are then melted off from the template at 94 °C. Amplification of these semi-amplicons gives full amplicons which have complementary ends. The temperature is cycled to 58 °C to allow the looping of full amplicons, which prevents further amplification and cross hybridizations. Five cycles of preamplification procedure are followed by exponential amplification of these full amplicons by PCR in order to generate micrograms of DNA required for next generation sequencing [13].

Conclusion

MALBAC strategy renders the specific amplification of target template and uniformity of coverage with very little bias even with extreme amplification. Except for the difference in read counts, all of the major phylum and species that are captured by the direct methodology were also represented by MALBAC based protocol. Gene functional classification study also indicates that the MALBAC protocol can perform well as the direct protocol and its reproducibility is reliable. This protocol is also cost effective and enables the use of standard reaction volumes. Hence MALBAC protocol can be utilized as a methodology for amplification of low quantity precious DNA from extreme environments for high through put sequencing and in turn help us to explore the uncultured potential of extreme environments for novel enzymes proteins and other metabolites.

Acknowledgements

We thank Dr. Sarita G Bhat, PI, and Centre for Marine Living Resources & Ecology- Ministry of Earth Sciences project (No. MOES/10-MLR/2/2007), Department of Biotechnology, Cochin University of Science and technology and Sagar Sampada Cruise no. 321 participants especially Dr. Raghul Zubin for providing samples for present study.

Conflict of Interest

“The study was funded by SciGenom Labs Pvt Ltd, Research and Development division. The authors declare that they have no competing interests financially and non-financially. All authors of manuscript are employed by SciGenom Labs Pvt Ltd and have worked from processing of sample until analysis of result. This does not alter the author’s adherence to all policies on sharing data and materials”.

References

  1. Gilbert JA, Jansson JK, Knight R (2014) The Earth Microbiome project: successes and aspirations. BMC Biol 12(1): 69.
  2. Bull MJ, Plummer NT (2014) Part 1: the human gut microbiome in health and disease. Integr Med (Encinitas) 13(6): 17.
  3. Poinar HN, Schwarz C, Qi J, Shapiro B, Macphee RD, et al. (2006) Metagenomics to Paleogenomics: Large-Scale Sequencing of Mammoth DNA. Science 311(5759): 392-394.
  4. Rodrigue S, Materna AC, Timberlake SC, Blackburn MC, Malmstrom RR, et al. (2010) Unlocking Short Read Sequencing for Metagenomics. PLoS One 5 (7): e11840.
  5. Woese CR, Fox GE (1977) Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proc Natl Acad Sci U S A 74(11): 5088-5090.
  6. Sanger F, Nicklen S, Coulson AR (1977) DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 74(12): 5463-5467.
  7. Telenius H, Carter NP, Bebb CE, Nordenskjöld M, Ponder BA, (1992) Degenerate oligonucleotide-primed PCR: general amplification of target DNA by a single degenerate primer. Genomics 13(3): 718-725.
  8. Zhang L, Cui X, Schmitt K, Hubert R, Navidi W, et al. (1992) Whole genome amplification from a single cell: implications for genetic analysis. Proc Natl Acad Sci U S A 89(13): 5847-5851.
  9. Dietmaier W, Hartmann A, Wallinger S, Heinmöller E, Kerner T, et al. (1999) Multiple mutation analyses in single tumor cells with improved whole genome amplification. Am J Pathol 154(1): 83-95.
  10. Dean FB, Nelson JR, Giesler TL, Lasken RS (2001) Rapid amplification of plasmid and phage DNA using phi29 DNA polymerase and multiply-primed rolling circle amplification. Genome Res 11(6): 1095-1099.
  11. Dean FB1, Hosono S, Fang L, Wu X, Faruqi AF, et al. (2002) Comprehensive human genome amplification using multiple displacement amplification. Proc Natl Acad Sci U S A 99(8): 5261-5266.
  12. Lasken RS (2009) Genomic DNA amplification by the multiple displacement amplification (MDA) method. Biochem Soc Trans 37(2): 450.
  13. Zong C, Lu S, Chapman AR, Xie XS (2012) Genome-wide detection of single-nucleotide and copy-number variations of a single human cell. Science 338(6114): 1622-1626.
  14. Lasken, Roger S (2013) Single-cell sequencing in its prime. Nat Biotechnol 31(3): 211-212.
  15. Andrews S (2010) FastQC: a quality control tool for high throughput sequence data.
  16. Marcel M (2011) Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet .journal 17(1): 1-3.
  17. Boisvert S, Raymond F, Godzaridis E, Laviolette F, Corbeil J (2012) Ray Meta: scalable de novo metagenome assembly and profiling. Genome Biol 13(12): R122.
  18. Kelley DR, Liu B, Delcher AL, Pop M, Salzberg SL (2012) Gene prediction with Glimmer for metagenomic sequences augmented by classification and clustering. Nucleic Acids Res 40(1): e9-e9.
  19. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215(3): 403-410.
  20. Kanehisa M, Goto S (2000) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res 28(1): 27-30.
  21. Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, et al. (2014) The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 42: D206-D214.
  22. Huson DH, Auch AF, Qi J, Schuster SC (2007) MEGAN analysis of metagenomic data. Genome Res 17(3): 377-386.
  23. Cruaud P, Vigneron A, Lucchetti-Miganeh C, Ciron PE, Godfroy A, et al. (2014) Influence of DNA extraction method, 16S rRNA targeted hypervariable regions, and sample origin on microbial diversity detected by 454 pyrosequencing in marine chemosynthetic ecosystems. Appl Environ Microbiol 80(15): 4626-4639.
  24. Rappé MS, Giovannoni SJ (2003) The uncultured microbial majority. Annu Rev Microbiol 57(1): 369-394.
  25. Miller JR, Koren S, Sutton G (2010) Assembly algorithms for next-generation sequencing data. Genomics 95(6): 315-327.
  26. Gilbert JA, Field D, Huang Y, Edwards R, Li W, et al. (2008) Detection of large numbers of novel sequences in the metatranscriptomes of complex marine microbial communities. PloS One 3(8): e3042.
  27. Urich T, Lanzén A, Qi J, Huson DH, Schleper C, et al. (2008) Simultaneous assessment of soil microbial community structure and function through analysis of the meta-transcriptome. PloS One 3(6): e2527-e2527.
  28. Teske, Andreas, and Ketil B. Sørensen (2008) Uncultured archaea in deep marine subsurface sediments: have we caught them all? The ISME J 2(1): 3-18.
  29. Zhang L, Cui X, Schmitt K, Hubert R, Navidi W, et al. (1992)Whole genome amplification from a single cell: implications for genetic analysis. Proc Natl Acad Sci U S A 89(13): 5847-5851.
  30. Telenius H, Carter NP, Bebb CE, Nordenskjöld M, Ponder BA, et al. (1992) egenerate oligonucleotide-primed PCR: general amplification of target DNA by a single degenerate primer. Genomics 13(3): 718-725.
  31. Dietmaier W, Hartmann A, Wallinger S, Heinmöller E, Kerner T, et al. (1999) Multiple mutation analyses in single tumor cells with improved whole genome amplification. Am J Pathol 154(1): 83-95.
  32. Dean FB, Nelson JR, Giesler TL, Lasken RS (2001) Rapid amplification of plasmid and phage DNA using phi29 DNA polymerase and multiply-primed rolling circle amplification. Genome Res 11(6): 1095-1099.
© 2014-2016 MedCrave Group, All rights reserved. No part of this content may be reproduced or transmitted in any form or by any means as per the standard guidelines of fair use.
Creative Commons License Open Access by MedCrave Group is licensed under a Creative Commons Attribution 4.0 International License.
Based on a work at http://medcraveonline.com
Best viewed in Mozilla Firefox | Google Chrome | Above IE 7.0 version | Opera |Privacy Policy