SPADA Docs

important to include in the inclusivity database only sequences (complete or partial) that contain that gene of interest.

The number of bacterial and viral genomes in GenBank continues to climb (Figure 6). The low cost of generating short-read sequences with next-generation sequencing has led to increased production of draft microbial genomes consisting of multiple contigs. Although complete, finished genomes can be generated at nominal additional cost by combining these contigs with long-read sequences from platforms such as PacBio or Oxford Nanopore, the percentage of available genomes that are complete, finished genomes rather than draft genomes has declined over time (Figure 6). At the same time, for smaller genomes (e.g., viruses), the percentage of full-length genomes may be increasing over time. Along with the expected exponential increase in database size will come a growing demand on the computational resources needed to handle these larger databases.


