Machine learning uncovers ‘genes of importance’ in agriculture and medicine

Machine learning can pinpoint “genes of importance” that help crops to grow with less fertilizer, according to a new study published in Nature Communications. It can also predict additional traits in plants and disease outcomes in animals, illustrating its applications beyond agriculture.

Using genomic data to predict outcomes in agriculture and medicine is both a promise and challenge for systems biology. Researchers have been working to determine how to best use the vast amount of genomic data available to predict how organisms respond to changes in nutrition, toxins, and pathogen exposure — which in turn would inform crop improvement, disease prognosis, epidemiology, and public health. However, accurately predicting such complex outcomes in agriculture and medicine from genome-scale information remains a significant challenge.

In the Nature Communications study, NYU researchers and collaborators in the U.S. and Taiwan tackled this challenge using machine learning, a type of artificial intelligence used to detect patterns in data.

“We show that focusing on genes whose expression patterns are evolutionarily conserved across species enhances our ability to learn and predict ‘genes of importance’ to growth performance for staple crops, as well as disease outcomes in animals,” explained Gloria Coruzzi, Carroll & Milton Petrie Professor in NYU’s Department of Biology and Center for Genomics and Systems Biology and the paper’s senior author.

“Our approach exploits the natural variation of genome-wide expression and related phenotypes within or across species,” added Chia-Yi Cheng of NYU’s Center for Genomics and Systems Biology and National Taiwan University, the lead author of this study. “We show that paring down our genomic input to genes whose expression patterns are conserved within and across species is a biologically principled way to reduce dimensionality of the genomic data, which significantly improves the ability of our machine learning models to identify which genes are important to a trait.”

As a proof-of-concept, the researchers demonstrated that genes whose responsiveness to nitrogen are evolutionarily conserved between two diverse plant species — Arabidopsis, a small flowering plant widely used as a model organism in plant biology, and varieties of corn, America’s largest crop — significantly improved the ability of machine learning models to predict genes of importance for how efficiently plants use nitrogen. Nitrogen is a crucial nutrient for plants and the main component of fertilizer; crops that use nitrogen more efficiently grow better and require less fertilizer, which has economic and environmental benefits.

The researchers conducted experiments that validated eight master transcription factors as genes of importance to nitrogen use efficiency. They showed that altered gene expression in Arabidopsis or corn could increase plant growth in low nitrogen soils, which they tested both in the lab at NYU and in cornfields at the University of Illinois.

“Now that we can more accurately predict which corn hybrids are better at using nitrogen fertilizer in the field, we can rapidly improve this trait. Increasing nitrogen use efficiency in corn and other crops offers three key benefits by lowering farmer costs, reducing environmental pollution, and mitigating greenhouse gas emissions from agriculture,” said study author Stephen Moose, Alexander Professor of Crop Sciences at the University of Illinois at Urbana-Champaign.

Moreover, the researchers proved that this evolutionarily informed machine learning approach can be applied to other traits and species by predicting additional traits in plants, including biomass and yield in both Arabidopsis and corn. They also showed that this approach can predict genes of importance to drought resistance in another staple crop, rice, as well as disease outcomes in animals through studying mouse models.

“Because we showed that our evolutionarily informed pipeline can also be applied in animals, this underlines its potential to uncover genes of importance for any physiological or clinical traits of interest across biology, agriculture, or medicine,” said Coruzzi.

“Many key traits of agronomic or clinical importance are genetically complex and hence it’s difficult to pin down their control and inheritance. Our success proves that big data and systems level thinking can make these notoriously difficult challenges tractable,” said study author Ying Li, faculty in the Department of Horticulture and Landscape Architecture at Purdue University.

Story Source:

Materials provided by New York University. Note: Content may be edited for style and length.

Note: This article have been indexed to our site. We do not claim legitimacy, ownership or copyright of any of the content above. To see the article at original source Click Here

Related Posts
Best iPhones in 2021: Which iPhone model is right for you? thumbnail

Best iPhones in 2021: Which iPhone model is right for you?

The original iPhone completely changed the tech world, and since then, each new release has had a major impact on smartphones in general. There are plenty of reasons people swear by them — whether it be their tight integration with the Apple ecosystem, their easy-to-use interface, or their own great features. But within the Apple…
Read More
A remote control for functional materials thumbnail

A remote control for functional materials

An intense mid-infrared laser pulse hits a ferroelectric LiNbO3 crystal and kicks atomic vibrations only in a short depth below the surface, emphasized by the bright tetrahedra. Through anharmonic coupling, this strong vibration launches a polarization wave, also called polariton, which propagates throughout the remaining depth of the crystal to modulate the ferroelectric polarization. Credit:…
Read More
Multimodal chromatin profiling using nanobody-based single-cell CUT&Tag thumbnail

Multimodal chromatin profiling using nanobody-based single-cell CUT&Tag

MainCell identity and the underlying gene expression programs are determined through the action of epigenetic modalities including transcription factor binding1, modifications of histones2, chromatin remodeling3, DNA methylation4, genome architecture5 and long non-coding RNAs6. Together, effects of these factors determine the regulatory logic behind cell state transitions during development and in disease. Changes in the chromatin
Read More
Paleo diet vs keto: The difference explained thumbnail

Paleo diet vs keto: The difference explained

Home References (Image credit: Getty Images) When it comes to the paleo diet vs keto, which one is best? It’s an understatement to say that the popularity of these diets has skyrocketed in recent years. Many people are enticed by their claims of impressive health benefits and want to try them out. But, it may…
Read More
Index Of News
Consider making some contribution to keep us going. We are donation based team who works to bring the best content to the readers. Every donation matters.
Donate Now

Subscription Form

Liking our Index Of News so far? Would you like to subscribe to receive news updates daily?

Total
0
Share