Benchmarking hybrid assemblies of Giardia and prediction of widespread intra-isolate structural variation

dc.contributor.authorPollo, Stephen M J
dc.contributor.authorReiling, Sarah J
dc.contributor.authorWit, Janneke
dc.contributor.authorWorkentine, Matthew L
dc.contributor.authorGuy, Rebecca A
dc.contributor.authorBatoff, G. W
dc.contributor.authorYee, Janet
dc.contributor.authorDixon, Brent R
dc.contributor.authorWasmuth, James D
dc.date.accessioned2020-03-01T01:02:45Z
dc.date.available2020-03-01T01:02:45Z
dc.date.issued2020-02-28
dc.date.updated2020-03-01T01:02:45Z
dc.description.abstractAbstract Background Currently available short read genome assemblies of the tetraploid protozoan parasite Giardia intestinalis are highly fragmented, highlighting the need for improved genome assemblies at a reasonable cost. Long nanopore reads are well suited to resolve repetitive genomic regions resulting in better quality assemblies of eukaryotic genomes. Subsequent addition of highly accurate short reads to long-read assemblies further improves assembly quality. Using this hybrid approach, we assembled genomes for three Giardia isolates, two with published assemblies and one novel, to evaluate the improvement in genome quality gained from long reads. We then used the long reads to predict structural variants to examine this previously unexplored source of genetic variation in Giardia. Methods With MinION reads for each isolate, we assembled genomes using several assemblers specializing in long reads. Assembly metrics, gene finding, and whole genome alignments to the reference genomes enabled direct comparison to evaluate the performance of the nanopore reads. Further improvements from adding Illumina reads to the long-read assemblies were evaluated using gene finding. Structural variants were predicted from alignments of the long reads to the best hybrid genome for each isolate and enrichment of key genes was analyzed using random genome sampling and calculation of percentiles to find thresholds of significance. Results Our hybrid assembly method generated reference quality genomes for each isolate. Consistent with previous findings based on SNPs, examination of heterozygosity using the structural variants found that Giardia BGS was considerably more heterozygous than the other isolates that are from Assemblage A. Further, each isolate was shown to contain structural variant regions enriched for variant-specific surface proteins, a key class of virulence factor in Giardia. Conclusions The ability to generate reference quality genomes from a single MinION run and a multiplexed MiSeq run enables future large-scale comparative genomic studies within the genus Giardia. Further, prediction of structural variants from long reads allows for more in-depth analyses of major sources of genetic variation within and between Giardia isolates that could have effects on both pathogenicity and host range.
dc.identifier.citationParasites & Vectors. 2020 Feb 28;13(1):108
dc.identifier.doihttps://doi.org/10.1186/s13071-020-3968-8
dc.identifier.urihttp://hdl.handle.net/1880/111704
dc.identifier.urihttps://doi.org/10.11575/PRISM/44435
dc.language.rfc3066en
dc.rights.holderThe Author(s)
dc.titleBenchmarking hybrid assemblies of Giardia and prediction of widespread intra-isolate structural variation
dc.typeJournal Article
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
13071_2020_Article_3968.pdf
Size:
2.54 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: