Biological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota

dc.contributor.authorMoossavi, Shirin
dc.contributor.authorAtakora, Faisal
dc.contributor.authorFehr, Kelsey
dc.contributor.authorKhafipour, Ehsan
dc.date.accessioned2020-09-20T00:03:18Z
dc.date.available2020-09-20T00:03:18Z
dc.date.issued2020-09-18
dc.date.updated2020-09-20T00:03:18Z
dc.description.abstractAbstract Background In recent years, the microbiome field has undergone a shift from clustering-based methods of operational taxonomic unit (OTU) designation based on sequence similarity to denoising algorithms that identify exact amplicon sequence variants (ASVs), and methods to identify contaminating bacterial DNA sequences from low biomass samples have been developed. Although these methods improve accuracy when analyzing mock communities, their impact on real samples and downstream analysis of biological associations is less clear. Results Here, we re-processed our recently published milk microbiota data using Qiime1 to identify OTUs, and Qiime2 to identify ASVs, with or without contaminant removal using decontam. Qiime2 resolved the mock community more accurately, primarily because Qiime1 failed to detect Lactobacillus. Qiime2 also considerably reduced the average number of ASVs detected in human milk samples (364 ± 145 OTUs vs. 170 ± 73 ASVs, p < 0.001). Compared to the richness, the estimated diversity measures had a similar range using both methods albeit statistically different (inverse Simpson index: 14.3 ± 8.5 vs. 15.6 ± 8.7, p = 0.031) and there was strong consistency and agreement for the relative abundances of the most abundant bacterial taxa, including Staphylococcaceae and Streptococcaceae. One notable exception was Oxalobacteriaceae, which was overrepresented using Qiime1 regardless of contaminant removal. Downstream statistical analyses were not impacted by the choice of algorithm in terms of the direction, strength, and significance of associations of host factors with bacterial diversity and overall community composition. Conclusion Overall, the biological observations and conclusions were robust to the choice of the sequencing processing methods and contaminant removal.
dc.identifier.citationBMC Microbiology. 2020 Sep 18;20(1):290
dc.identifier.doihttps://doi.org/10.1186/s12866-020-01949-7
dc.identifier.urihttp://hdl.handle.net/1880/112554
dc.identifier.urihttps://doi.org/10.11575/PRISM/44857
dc.language.rfc3066en
dc.rights.holderThe Author(s)
dc.titleBiological observations in microbiota analysis are robust to the choice of 16S rRNA gene sequencing processing algorithm: case study on human milk microbiota
dc.typeJournal Article
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
12866_2020_Article_1949.pdf
Size:
1.98 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
0 B
Format:
Item-specific license agreed upon to submission
Description: