New Antibiotics Are Desperately Needed: Machine Learning could Help

As the threat of antibiotic resistance looms, microbiologists aren’t the only ones thinking up new solutions. James Zou, PhD, assistant professor of biomedical data science at Stanford, has applied machine learning to create an algorithm that generates thousands of entirely new virtual DNA sequences with the intent of one day creating antimicrobial proteins.

The algorithm, called Feedback GAN, essentially acts as a mass producer of different DNA snippets. And while these sequence attempts are somewhat random, the algorithm isn't working blindly. It’s basing the new possible peptides, or small groups of amino acids, on previous research that lays out the DNA sequences most likely to align with antimicrobial properties.

For now, these templates, which don't exist in nature, are theoretical, generated on a computer. But in the face of rising concerns about microbe resistance, Zou said it's critical to think about solutions that don't already exist.

"We chose to pursue antimicrobial proteins because it's a very important, high-impact problem that's also a relatively tractable problem for the algorithm," Zou said. "There are existing tools that we incorporate into our system that evaluate if a new sequence is likely to have the properties of a successful antimicrobial protein."

Feedback GAN builds on that, working to incorporate just the right balance of random chance and precision.

A paper describing the algorithm was published online Feb. 11 in Nature Machine Learning. Anvita Gupta, a student in computer science, is the first author; Zou is the senior author.

Self-refining

Gupta and Zou's algorithm doesn't just churn out new combinations of DNA. It also actively refines itself, learning what works and what doesn’t through a feedback loop: After the algorithm spits out a wide range of DNA sequences, it runs a trial-and-error learning process that sifts through the peptide suggestions. Based on their resemblance to other known antimicrobial peptides, the “good” ones get fed back into the algorithm to inform future DNA sequences generated from the code, and to get refined themselves.

"There's a built-in arbiter and, by having this feedback loop, the system learns to model newly generated sequences after those that are deemed likely to have antimicrobial properties," Zou said. "So the idea is both individual peptide sequences and the generation of the sequences get better and better."

Zou has also considered another core component of hypothetical proteins: protein folding. Proteins contort into very specific structures linked to their functions. An algorithm could create the perfect sequence, but unless it can fold up, it's useless - like the cogs of a clock strewn on a table.

Zou can tweak the algorithm so that instead of analyzing a propensity for antimicrobial properties, it determines the likelihood of correct folding.

"We can actually do these two things in parallel where we look at antimicrobial properties of one sequence and folding likelihood of another," said Zou. "We run both so that we’re optimizing either the antimicrobial properties or its ability to fold."

Next, Zou hopes to merge the two variations of the algorithm to create peptide sequences that are optimized for both their microbe-killing abilities and their ability to fold into a genuine protein.

Demo, instructions and code for FBGAN are available at https://github.com/av1659/fbgan.

Anvita Gupta, James Zou.
Feedback GAN for DNA optimizes protein functions.
Nature Machine Intelligence, 1, 105-111 (2019). doi: 10.1038/s42256-019-0017-4.

Most Popular Now

Personalized Breast Cancer Prevention No…

A new telemedicine service for personalised breast cancer prevention has launched at preventcancer.co.uk. It allows women aged 30 to 75 across the UK to understand their risk of developing breast...

New App may Help Caregivers of People Ge…

A new study by investigators from Mass General Brigham showed that a new app they created can help improve the quality of life for caregivers of patients undergoing bone marrow...

An App to Detect Heart Attacks and Strok…

A potentially lifesaving new smartphone app can help people determine if they are suffering heart attacks or strokes and should seek medical attention, a clinical study suggests. The ECHAS app (Emergency...

A Machine Learning Tool for Diagnosing, …

Scientists aiming to advance cancer diagnostics have developed a machine learning tool that is able to identify metabolism-related molecular profile differences between patients with colorectal cancer and healthy people. The analysis...

Fine-Tuned LLMs Boost Error Detection in…

A type of artificial intelligence (AI) called fine-tuned large language models (LLMs) greatly enhances error detection in radiology reports, according to a new study published in Radiology, a journal of...

DeepSeek-R1 Offers Promising Potential t…

A joint research team from The Hong Kong University of Science and Technology and The Hong Kong University of Science and Technology (Guangzhou) has published a perspective article in MedComm...

Deep Learning can Predict Lung Cancer Ri…

A deep learning model was able to predict future lung cancer risk from a single low-dose chest CT scan, according to new research published at the ATS 2025 International Conference...