German China

USA: Gene Flow New Method Identifies Important Bacteria Groups

Editor: MA Alexander Stark

Identifying species among plants and animals has been a full-time occupation for some biologists, but the task is even more daunting for the myriad microbes that inhabit the planet. Now, MIT researchers have developed a simple measurement of gene flow that can define ecologically important populations among bacteria and archaea, including pinpointing populations associated with human diseases.

Related Vendor

A microscopic image of mixed bacteria from environmental samples.
A microscopic image of mixed bacteria from environmental samples.
(Bild: David VanInsberghe)

Cambridge/USA — Biologists often call a group of plants or animals a species if the group is reproductively isolated from others — that is, individuals in the group can reproduce with each other, but they can’t reproduce with others. As a result, members of a species share a set of genes that differs from other species. Much of evolutionary theory centers on species and populations, the representatives of a species in a particular area.

But microbes defy the classic species concept for plants and animals, Martin Polz, a professor of civil and environmental engineering at MIT explains. Microbes tend to reproduce asexually, simply splitting themselves in two rather than combining their genes with other individuals to produce offspring. Microbes are also notorious for taking up DNA from environmental sources, such as viruses, he says. Viruses could transfer DNA into microbial cells and that DNA can be incorporated into their genomes. These processes make it difficult to sort coexisting microbes into distinct populations based on their genetic makeup. “If we can’t identify those populations in microbes, we can’t one-to-one apply all this rich ecological and evolutionary theory that has been developed for plants and animals to microbes,” says Polz.

If researchers want to measure an ecosystem’s resilience in the face of environmental change, for instance, they might look at how populations within species change over time. “If we don’t know what a species is, it’s very difficult to measure and assess these types of perturbations,” he adds.

A Yardstick for Gene Flow

Martin and his colleagues decided to look for another way to define ecologically meaningful populations in microbes. Led by microbiology graduate student Philip Arevalo, the researchers developed a metric of gene flow that they called Popcogent (Populations as Clusters Of Gene Transfer). Popcogent measures recent gene flow or gene transfer between closely related genomes. In general, microbial genomes that have exchanged DNA recently should share longer and more frequent stretches of identical DNA than if individuals were just reproducing by splitting their DNA in two. Without this sort of recent exchange, the researchers suggested, the length of these shared stretches of identical DNA would shorten as mutations insert new “letters” into the stretch.

Two microbial strains that are not genetically identical to each other but share sizable “chunks” of identical DNA are probably exchanging more genetic material with each other than with other strains. This gene flow measurement can define distinct microbial populations, as the researchers discovered in their tests of three different kinds of bacteria.

In Vibrio bacteria, for instance, closely related populations may share some core gene sequences, but they appear completely isolated from each other when viewed through this measurement of recent gene flow, Polz and colleagues found. Polz says that the Popcogent method may work better at defining microbial populations than previous studies because it focuses on recent gene flow among closely related organisms, rather than including gene flow events that may have happened thousands of years in the past. The method also suggests that while microbes are constantly taking in different DNA from their environment that might obscure patterns of gene flow, “it may be that this divergent DNA is really removed by selection from populations very quickly,” says Polz.

The Reverse Ecology Approach

Microbiology graduate student David VanInsberghe then suggested a “reverse ecology” approach that could identify regions of the genome in these newly defined populations that show “selective sweeps” — places where DNA variation is reduced or eliminated, likely as a result of strong natural selection for a particular beneficial genetic variant.

By identifying specific sweeps within populations, and mapping the distribution of these populations, the method can reveal possible adaptations that drive microbes to inhabit a particular environment or host — without any prior knowledge of their environment. When the researchers tested this approach in the gut bacterium Ruminococcus gnavus, they uncovered separate populations of the microbe associated with healthy people and patients with Crohn’s disease.

Polz says the reverse ecology method is likely to be applied in the near future to studying the full diversity of the bacteria that inhabit the human body. There was a lot of interest in sequencing closely related organisms within the human microbiome and looking for health and disease associations, and the datasets were growing, he added. He hopes to use the approach to examine the “flexible genome” of microbes. Strains of E. coli bacteria, for instance, share about 40 % of their genes in a “core genome,” while the other 60 % — the flexible part — varies between strains. In the scientist's view, it’s one of the biggest questions in microbiology: Why are these genomes so diverse in gene content? Once it is possible to define populations as evolutionary units, the researchers can interpret gene frequencies in these populations in light of evolutionary processes.

Reference: Cell; Polz, Martin F. et al.: "A reverse ecology approach based on a biological definition of microbial populations" DOI: 10.1016/j.cell.2019.06.033