last update
2026-April-29
VirJenDB
v1.0
Data Ingestion and Standardization
VirJenDB integrates sequence and metadata records from multiple sources through a standardized ingestion workflow designed to improve consistency, interoperability, and downstream analysis.
Metadata Standardization
During ingestion, metadata records are harmonized through:
- Reformatting dates to the ISO 8601 standard
- Splitting multi-value fields into separate structured fields
- Correcting typographical and formatting errors
- Normalizing field names and values across source databases
These steps support consistent indexing and downstream annotation workflows.
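As an illustration only, the harmonization steps listed above could be sketched in Python roughly as follows. This is not VirJenDB's actual implementation: the field names (`collection_date`, `host`), the accepted date layouts, the separator characters, and every function name here are assumptions chosen for the example.

```python
import re
from datetime import datetime

# Hypothetical input date layouts; the layouts VirJenDB actually handles are not documented here.
DATE_FORMATS = ("%Y-%m-%d", "%d-%b-%Y", "%Y-%B-%d")

# Hypothetical mapping from source-specific field names to one shared name.
FIELD_ALIASES = {"Host": "host", "host_name": "host"}


def to_iso_date(value):
    """Return the date as ISO 8601 (YYYY-MM-DD), or None if no known layout matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(value.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None


def split_multi_value(value, separators=";|"):
    """Split a delimited string into a list of trimmed, non-empty items."""
    parts = re.split("[" + re.escape(separators) + "]", value)
    return [p.strip() for p in parts if p.strip()]


def clean_text(value):
    """Collapse runs of whitespace and trim both ends."""
    return re.sub(r"\s+", " ", value).strip()


def standardize(record):
    """Apply field-name normalization, text cleanup, date reformatting, and splitting."""
    out = {}
    for key, value in record.items():
        # Use a known alias if there is one, else lower-case and snake_case the key.
        name = FIELD_ALIASES.get(key, key.strip().lower().replace(" ", "_"))
        value = clean_text(value)
        if name == "collection_date":
            out[name] = to_iso_date(value)
        elif name == "host":
            out[name] = split_multi_value(value)
        else:
            out[name] = value
    return out


raw = {
    "Collection Date": "29-Apr-2026",
    "Host": "Homo sapiens; Mus musculus",
    "Isolate ": "  strain   A ",
}
standardized = standardize(raw)
```

In this sketch an unparseable date becomes `None` instead of being guessed, so that ambiguous values can be reviewed separately; whether the real workflow behaves this way is not stated in the source.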
Integrated Sources
Source datasets incorporated through this workflow are described on the Data Sources page.
Related Workflows
Several subsequent workflows build on the standardized, ingested data:
- Host Annotation
- Taxonomy Mapping
- vOTU Clustering