last update
2026-April-29
VirJenDB
v1.0
Host Taxonomy Mapping
Host taxonomy fields in VirJenDB are standardized to the GTDB v226 framework.
Mapping Strategy
NCBI TaxIDs were mapped to GTDB taxonomy names using the GTDB–NCBI mapping resource.
For each NCBI taxon:
- GTDB classifications were mapped
- Most frequently represented classification selected
Additional Sources
Host taxonomy was incorporated from:
- PhiSpy prophage hosts via BV-BRC TaxIDs
- IMG/VR v4.0 isolation host annotations (GTDB v207)
- PHD host metadata (NCBI + GTDB v220)
Coverage
Mapped host taxonomy fields were integrated for approximately 1.47 million phage sequences.
Included mapped fields include:
- Host NCBI TaxID
- Host NCBI Species Name
- GTDB taxonomy fields