Data Download

 

All hits (37 MB compressed, 160 MB decompressed)
File format (after un-zipping):
Lines that start with space contain virus sequence accession. All other lines contain information about hits, one hit per line, in format: blast score, blast e-value, percent identity, aligned length of query, eukaryote organism name, eukaryote genome accession.

 

Tabular blast output, zip-compressed

dsDNA (453 MB)
dsRNA (1 MB)
ssDNA (3 MB)
ssRNA (2 MB)
other (933 MB)

 

To obtain those outputs, we used the following programs and databases:
GenomeSync database
Genome Search Toolkits
BLAST+
tantan