PEERspectives: Reviewing the Enzyme Engineering Database (EnzEngDB)

Machine learning for protein engineering needs infrastructure for standardized sequence–function data. EnzEngDB aims to provide it. In this episode of PEERspectives, Le Yuan (PostDoc, NSF Molecule Maker Lab Institute) explores EnzEngDB, a new database platform linking enzyme sequences, mutations, reactions, and experimental performance data. The platform enables storage, visualization, search, and sharing of standardized sequence–function data for protein engineering and machine learning. It also includes an LLM-based pipeline that extracts enzyme engineering data from scientific literature, expanding the database and supporting data-driven enzyme design. PUBLICATION Long Y, Abbasinejad F, Li FZ, et al. Enzyme Engineering Database (EnzEngDB): a platform for sharing and interpreting sequence-function relationships across protein engineering campaigns. Nucleic Acids Res. 2026;54(D1):D564-D571. Doi:10.1093/nar/gkaf1142 ABSTRACT The discovery and engineering of new enzymes is important across the bioeconomy, with diverse applications from foods to pharmaceuticals, sensors to agriculture. However, enzyme engineering, in particular machine learning-guided engineering, is hampered by a lack of data. Currently there exists no database designed to capture and interpret datasets created in this domain, nor are there easy analysis and visualisation tools. We developed the Enzyme Engineering Database to provide a centralized resource and an online analysis tool to consolidate sequence-function data from enzyme engineering campaigns, thereby making three contributions: (i) a database into which researchers can deposit public data, (ii) visualisation and analysis tools for protein engineers to analyse their own data or compare enzyme variants to other engineering campaigns, and (iii) a gold-standard dataset for benchmarking automated extraction along with the first large language model extraction pipeline specific for enzyme engineering campaigns. The Enzyme Engineering Database is accessible at http://enzengdb.org/. KEYWORDS EnzEngDB, Enzyme Engineering Database, enzyme engineering, protein engineering, directed evolution, machine learning biology, AI in biology, bioinformatics, enzyme database, sequence-function relationships, sequence-function mapping, protein design, enzyme design, synthetic biology, biocatalysis, computational biology, biotechnology, protein variants, enzyme optimization, protein machine learning, data-driven protein engineering, protein fitness landscape, scientific databases, literature mining, large language models, LLM biology, enzyme evolution, bioengineering, molecular biology, Nucleic Acids Research, Frances Arnold, machine learning for proteins, AI-driven enzyme engineering, protein sequence analysis, biological data sharing, scientific data management, enzyme activity prediction, protein engineering datasets, bioinformatics tools, computational protein engineering, NSF Molecule Maker Lab Institute (NSF MMLI), MMLI, AI-driven molecular discovery, programmable biomolecules

Nobel Prize lecture: Demis Hassabis, Nobel Prize in Chemistry 2024

Nobel Prize lecture: Demis Hassabis, Nobel Prize in Chemistry 2024

PEERspectives: Computational Design of Metallohydrolases (Lit Review)

PEERspectives: Computational Design of Metallohydrolases (Lit Review)

Genomics with Deep Learning: A Concise Overview | AISC

Genomics with Deep Learning: A Concise Overview | AISC

Moody Gardens Penguin Cam LIVE | Penguin Habitat Stream at the Aquarium in Galveston, Texas

Moody Gardens Penguin Cam LIVE | Penguin Habitat Stream at the Aquarium in Galveston, Texas

AlphaFold - The Most Useful Thing AI Has Ever Done

AlphaFold - The Most Useful Thing AI Has Ever Done

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

Inside the Mind of Anthropic CEO Dario Amodei | The Circuit | Extended Interview

How US Air Force B 52 Pilot Performed an Emergency Takeoff at Full Speed

How US Air Force B 52 Pilot Performed an Emergency Takeoff at Full Speed

Judge Can’t Stop Laughing At Sovereign Citizen’s Courtroom Meltdown!!!

Judge Can’t Stop Laughing At Sovereign Citizen’s Courtroom Meltdown!!!

When an audition changed TV forever

When an audition changed TV forever

How AI Cracked the Protein Folding Code and Won a Nobel Prize

How AI Cracked the Protein Folding Code and Won a Nobel Prize

Watch Ukrainian Drones OBLITERATE a Russian Jet

Watch Ukrainian Drones OBLITERATE a Russian Jet

The Problem With Fingerprint Analysis

The Problem With Fingerprint Analysis

How to Introduce Yourself — and Get Hired | Rebecca Okamoto | TED

How to Introduce Yourself — and Get Hired | Rebecca Okamoto | TED

MIT Just Revealed the AI Bubble's Fatal Flaw

MIT Just Revealed the AI Bubble's Fatal Flaw

FURIOUS World Leaders BAN IVANKA and SEND WARNING!!!!

FURIOUS World Leaders BAN IVANKA and SEND WARNING!!!!

The Successor to CRISPR May Be Even More World Changing

The Successor to CRISPR May Be Even More World Changing

Liquid Neon Balls in Zero Gravity Abstract Background video | Footage | Screensaver

Liquid Neon Balls in Zero Gravity Abstract Background video | Footage | Screensaver

Biggest Breakthroughs in Biology and Neuroscience: 2025

Biggest Breakthroughs in Biology and Neuroscience: 2025

Maggie Haberman & Jonathan Swan - On “Regime Change” & Inside The Trump Presidency | The Daily Show

Maggie Haberman & Jonathan Swan - On “Regime Change” & Inside The Trump Presidency | The Daily Show

How to Use BLAST for Finding and Aligning DNA or Protein Sequences

How to Use BLAST for Finding and Aligning DNA or Protein Sequences