EnzChemRED

  • Proteins & Proteomes
  • Database
EnzChemRED is a training and benchmarking dataset that supports the development of natural language processing (NLP) methods, such as (large) language models, that can assist enzyme curation. EnzChemRED consists of 1,210 expert-curated PubMed abstracts in which enzymes and the chemical reactions they catalyze are annotated using identifiers from the UniProt Knowledgebase (UniProtKB) and the ontology of Chemical Entities of Biological Interest (ChEBI).

Developed by the Swiss-Prot group and partners at the National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), and Dalian University of Technology; supported by the SIB Swiss Institute of Bioinformatics.

This resource is released under the Creative Commons Attribution 4.0 International license (CC BY 4.0).

You might also be interested in

      • Text mining & Machine learning
      • Database
      • Software tool

    SIBiLS

    Personalized information retrieval from the literature
      • Text mining & Machine learning
      • Software tool

    Variomes

    Search engine to support the curation of genomic variants
      • Systems Biology
      • Proteins & Proteomes
      • Database
      • Software tool

    MetaNetX

    Metabolic network repository & analysis
      • Glycomics
      • Software tool

    GlycoDigest

    In silico digestion of glycans by exoglycosidases
      • Proteins & Proteomes
      • Lipidomics
      • Database

    SwissLipids

    Knowledge resource for lipids
      • Proteins & Proteomes
      • Systems Biology
      • Metabolomics
      • Database

    Rhea

    Expert-curated database of biochemical reactions
      • Proteins & Proteomes
      • Software tool

    PeptideCutter

    Potential cleavage sites in a protein
      • Proteins & Proteomes
      • Database

    ENZYME

    Enzyme nomenclature database
      • Systems Biology
      • Metabolomics
      • Proteins & Proteomes
      • Database

    Rhea SPARQL endpoint

    SPARQL access to the Rhea knowledgebase
      • Glycomics
      • Systems Biology
      • Software tool

    HMO-Glycologue

    Simulator of Human Milk Oligosaccharide synthesis