AOAC SPADA VNGS - Final
The domain for VNGS shall be defined and include the term Biothreat Agent Next Generation
Sequences.

(m) Stable URIs and versioning

Stable URIs for the terms, concepts, and versioning of VNGS shall be maintained by the sequence
provider.

(n) Raw Sequence Data

All raw sequence data shall be available with each VNGS. The possible sequence formats are FASTQ
(26, 27, 28), FAST5 (29), and pod5 (30). FAST5 files may be converted to FASTQ if a
human-readable format is required.

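As an illustration of the FASTQ layout named above, the following is a minimal sketch of a record reader. It assumes the common four-line record layout (header, sequence, separator, quality) with no line wrapping; the function name `read_fastq` is hypothetical and is not part of this standard.

```python
import io
from typing import Iterator, TextIO, Tuple


def read_fastq(handle: TextIO) -> Iterator[Tuple[str, str, str]]:
    """Yield (read_id, sequence, quality) tuples from a FASTQ text stream.

    Assumption: each record occupies exactly four lines (no wrapped sequences).
    """
    while True:
        header = handle.readline()
        if not header:
            return  # end of stream
        if not header.strip():
            continue  # tolerate blank lines between records
        seq = handle.readline().rstrip("\n")
        plus = handle.readline()
        qual = handle.readline().rstrip("\n")
        if not header.startswith("@") or not plus.startswith("+"):
            raise ValueError("malformed FASTQ record")
        if len(seq) != len(qual):
            raise ValueError("sequence and quality lengths differ")
        yield header[1:].strip(), seq, qual


if __name__ == "__main__":
    example = io.StringIO("@read1\nACGT\n+\nIIII\n")
    for read_id, seq, qual in read_fastq(example):
        print(read_id, seq, qual)
```

The length check reflects the FASTQ requirement that every base has exactly one quality character.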
(o) Aligned Sequence Data

Aligned sequences shall be included as BAM (Binary Alignment/Map) formatted files (31, 32).

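A submitted file can be given a quick plausibility check before it is accepted as BAM. The sketch below relies on two properties of the format: BAM files are BGZF-compressed, which is gzip-compatible, and the decompressed stream begins with the four bytes `BAM\x01`. The helper name `looks_like_bam` is hypothetical, and the check does not validate the header or alignment records.

```python
import gzip
import io
import zlib
from typing import BinaryIO

BAM_MAGIC = b"BAM\x01"  # first four bytes of the decompressed BAM stream


def looks_like_bam(stream: BinaryIO) -> bool:
    """Return True if the stream is gzip-compatible and starts with the BAM magic."""
    try:
        with gzip.GzipFile(fileobj=stream) as gz:
            return gz.read(4) == BAM_MAGIC
    except (OSError, EOFError, zlib.error):
        # Not gzip data, truncated, or corrupt: treat as "not BAM".
        return False


if __name__ == "__main__":
    fake_bam = io.BytesIO(gzip.compress(BAM_MAGIC + b"\x00" * 8))
    print(looks_like_bam(fake_bam))
```

For real files, the same function can be called on a handle opened with `open(path, "rb")`.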
(p) Annotation Formats

Annotation formats shall include Browser Extensible Data (BED) Format (33), Wiggle Track Format
(WIG) (34), General Feature Format (GFF3) (35), Variant Call Format (VCF) (36), Gene Transfer Format
(GTF) (37), Genome Variation Format (GVF) (38), and/or Synthetic Biology Open Language (SBOL) (39).

(q) Sequence Instrument Quality Metrics

(1) Base quality score.—Statistical algorithms used for base calling shall be known, verified, and
converted to a Q score (26, 27). The average base quality score shall be Q > 20. The single-base
quality score for the targeted region shall be Q > 30.
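The thresholds above can be illustrated with a short sketch. A Phred quality score relates to the base-calling error probability P by Q = -10 log10(P), so Q20 corresponds to a 1 in 100 error rate and Q30 to 1 in 1000. The code assumes Phred+33 ASCII encoding of the FASTQ quality string and an arithmetic mean of per-base Q scores; neither choice is specified by this standard, and the function names and the half-open target interval are hypothetical.

```python
import math
from typing import List


def phred_from_error(p_error: float) -> float:
    """Convert a base-calling error probability to a Phred Q score."""
    return -10.0 * math.log10(p_error)


def decode_phred33(quality: str) -> List[int]:
    """Decode a FASTQ quality string, assuming Phred+33 ASCII encoding."""
    return [ord(ch) - 33 for ch in quality]


def passes_thresholds(quality: str, target_start: int, target_end: int,
                      min_mean: float = 20.0, min_target: float = 30.0) -> bool:
    """Check mean Q > min_mean and every base in [target_start, target_end) > min_target."""
    scores = decode_phred33(quality)
    if not scores:
        raise ValueError("empty quality string")
    mean_q = sum(scores) / len(scores)  # assumption: arithmetic mean of Q scores
    target = scores[target_start:target_end]
    return mean_q > min_mean and all(q > min_target for q in target)


if __name__ == "__main__":
    # 'I' decodes to Q40 and '5' decodes to Q20 under Phred+33.
    print(passes_thresholds("IIII5555", 0, 4))
    print(passes_thresholds("IIII5555", 4, 8))
```

Averaging Q scores directly is only one convention; averaging the underlying error probabilities and converting the result back to Q gives a stricter value, so the method used should be stated alongside any reported metric.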
AOAC Draft Standard – Version 09282022; Public Comment Revisions