Assembly, stability, and dynamics of the infant gut microbiome are linked to bacterial strains and functions in mother’s milk
The establishment of the gut microbiome in early life is critical for healthy infant development. Although human milk is recommended as the sole source of nutrition for the human infant, little is known about how variation in milk composition, and especially the milk microbiome, shapes the microbial communities in the infant gut. Here, we quantified the similarity between the maternal milk and the infant gut microbiome using 507 metagenomic samples collected from 195 mother-infant pairs at 1, 3, and 6 months postpartum. We found that:
- the microbial taxonomic overlap between milk and the infant gut was driven by bifidobacteria, and in particular by B. longum. Infant stool samples dominated by B. longum also showed higher temporal stability compared to samples dominated by other species.
- We identified two instances of strain sharing between maternal milk and the infant gut, one involving a commensal (B. longum) and one a pathobiont (K. pneumoniae).
- In addition, strain sharing between unrelated infants was higher among infants born at the same hospital compared to infants born in different hospitals, suggesting a potential role of the hospital environment in shaping the infant gut microbiome composition.
- The infant gut microbiome at 1 month compared to 6 months of age was enriched in metabolic pathways associated with de-novo molecule biosynthesis, suggesting that early colonisers might be more versatile and metabolically independent compared to later colonizers.
- Lastly, we found a significant overlap in antimicrobial resistance genes carriage between the mother’s milk and their infant's gut microbiome.
Taken together, our results suggest that the human milk microbiome has an important role in the assembly, composition, and stability of the infant gut microbiome.
If you use the data, or find this work useful, please cite:
Assembly, stability, and dynamics of the infant gut microbiome are linked to bacterial strains and functions in mother’s milk
Mattea Allert† , Pamela Ferretti† , Kelsey E. Johnson, Timothy Heisel, Sara Gonia, Dan Knights, David A. Fields, Frank W. Albert, Ellen W. Demerath, Cheryl A. Gale, and Ran Blekhman. († equal contribution)
For more information on the tools used and their references, please check the Methods section of the paper.
This project requires R version 4.2
The required libraries are indicated at the beginning of each script.
The data analyses steps described below require the output of the following publicly available tools: MetaPhlAn4, StrainPhlAn4, HUMAnN3 and DeepARG. The masterfiles resulting from these steps are provided as Supplementary Tables in the paper.
The raw metagenomic sequences and the associated metadata were deposited and are available on NCBI Sequence Read Archive (SRA) under the BioProject accession number PRJNA1019702. Comprehensive metadata are available in the Supplementary Material.
Overview of the structure of the cohort and its relevant metadata, including delivery mode, breastfeeding and antibiotics intake (pre- and post-partum).
bin/Rmarkdown src/metadata_stats/metadata_stats.Rmd figures/metadata_stats/metadata_stats.html
Here we looked at the intra and inter-sample taxonomic composition at the species-level for both milk and infant stool samples. Shannon diversity values were generated with the MetaPhlAn4 utility script calculate_diversity.R
.
bin/Rmarkdown src/alpha_diversity/alpha_diversity.Rmd figures/alpha_diversity/alpha_diversity.html
bin/Rmarkdown src/beta_diversity/beta_diversity.Rmd figures/beta_diversity/beta_diversity.html
To investigate the microbial composition of milk and infant gut at the species-level we used MetaPhlAn4. Installation and usage commands are available here. From the resulting taxonomic profiles, we generated the species-level heatmaps:
bin/Rmarkdown src/species_composition/species_composition.Rmd figures/species_composition/species_composition.html
Here we identify the predominance group for each sample. We broadly defined 4 predominance groups: samples dominated by B. longum, by B. breve, B. bifidum and samples dominated by non bifidobacteria species (most frequently E. coli). We also focused on the prevalence and mean relative abundance of the above listed Bifidobacteria in relation to the breastfeeding practice in the infants at 1 and 6 months of age.
bin/Rmarkdown src/groups_stability/bifido_groups.Rmd figures/groups_stability/bifido_groups.html
In this section we investigated the functional potential of the maternal milk and infant gut microbiomes using HUMAnN3. Installation and usage commands are available here. As the most prevalent pathways identified in the infant stool samples were associated with de-novo biosynthesis of molecules, we further explore the abundance of pathways associated with the biosynthesis of essential amino acids. We also looked at the pathways shared between the maternal milk and the infant gut.
bin/Rmarkdown src/functional_profiling/heatmap_pathways.Rmd figures/functional_profiling/heatmap_pathways.html
bin/Rmarkdown src/functional_profiling/functional_analysis_biosynthesis_essentialAA.Rmd figures/functional_profiling/functional_analysis_biosynthesis_essentialAA.html
We then investigate the functional potential similarities in mother-infant pairs.
bin/Rmarkdown src/functional_profiling/functional_analysis_couples.Rmd figures/functional_profiling/functional_analysis_couples.html
We then looked at the strain-level composition of the maternal breast milk and infant gut. We used StrainPhlAn4 to identify strains shared between a mother and her infant, as well as between unrelated infants. We also looked at how persistent over time were the identified strains:
bin/Rmarkdown src/strain_analysis/strain_sharing_persistence.Rmd figures/strains/strain_sharing_persistence.html
and then we leveraged the multi-hospital structure of the cohort to assess wether infants born at the same hospital (and in the same hospital and same year) shared more strains than infants born across different hospitals.
bin/Rmarkdown src/strain_analysis/strain_sharing_hospital_perInfant.Rmd figures/strains/strain_sharing_hospital_perInfant.html
Last, we investigated the carriage of antimicrobial resistance genes (ARGs) in milk and infant stools using DeepARG. To avoid false positives, we considered only ARGs associated with well-defined ARG classes (excluding multi-drug and undefined ARGs classes), and with an identity threshold >95% (See Methods for more details). First, we describe the major classes identified in each sample type and collection timepoint. We then investigate the correlation between the ARGs found in milk compared to those found in the stools, as well as the correlation between the ARGs found at 1M versus those found at 6M. Finally, we looked at ARGs sharing between the mother's milk and her infant's gut.
bin/Rmarkdown src/ARG_analysis/ARG_analysis.Rmd figures/ARG_analysis/ARG_analysis.html
bin/Rmarkdown src/ARG_analysis/ARG_analysis_couples.Rmd figures/ARG_analysis/ARG_analysis_couples.html
A subset of the whole dataset can be used to first get familiar with the code and the tools listed above. Please refer to the original papers and tutorials for the tools' computational requirements and running time.
Please refer to the Discussion section of the paper for the limitations of this study.