<p>This repository contains data used in Muscatt et al. 2022 (preprint).</p>
<p><br></p>
<p>c1.ntw.gz = vConTACT2 network output file</p>
<p><br></p>
<p>DNAP_plus_ref.faa.gz = fasta amino acid file containing DNA polymerase protein sequences used for phylogenetic tree</p>
<p><br></p>
<p>DNAP_tree.gz = tree file for jumbo phage phylogenetic tree based on DNA polymerase gene</p>
<p><br></p>
<p>edges.csv.gz = edges for drawing vConTACT2 network</p>
<p><br></p>
<p>gene_to_genome.csv.gz = viral gene to genome index used as input to vConTACT2</p>
<p><br></p>
<p>genome_by_genome.csv.gz = viral cluster statuses outputted by vConTACT2</p>
<p><br></p>
<p>MAG_bins.fna.gz = fasta nucleotide file containing 285 microbial MAG bins previously assembled by Sharrar et al. 2020</p>
<p><br></p>
<p>nodes.csv.gz = nodes for drawing vConTACT2 network</p>
<p><br></p>
<p>rpS3.fna.gz = fasta nucleotide file containing 1516 rpS3 sequences</p>
<p><br></p>
<p>vOTU_prediction.csv.gz = stats on viral scaffold prediction</p>
<p><br></p>
<p>vOTU_proteins.faa.gz = fasta amino acid file containing XXX dsDNA vOTU genes</p>
<p><br></p>
<p>vOTUs.fna.gz = fasta nucleotide file containing 10,196 </p>
<p><br></p>
<p>dsDNA vOTUs</p>
<p><br></p>
<h4><br></h4>
<h4>The Supplementary table titles are as follows:</h4>
<p>Table S1: Assembly statistics for sample libraries.</p>
<p>Table S2: DNA vOTU raw reads.</p>
<p>Table S3: Sources of hits to viral sequences in PhageClouds database.</p>
<p>Table S4: Microbial OTU normalised coverage. </p>
<p>Table S5: Raw read bacterial class taxonomy.</p>
<p>Table S6: Annotations of viral-encoded auxiliary metabolic genes.</p>
<p>Table S7: Microbial MAG normalised coverage.</p>
<p>Table S8: Annotations of viral genes under positive selection.</p>
<p><br></p>
<p><br></p>
<p><br></p>