Introduction BioCyc Features Purchase Subscription BioCyc Website
BioCyc logo

Introduction to BioCyc Genome and Metabolic Pathway Web Portal

The BioCyc web portal from SRI International contains integrated genome and metabolic-pathway databases for more than 20,000 sequenced organisms, including 70 curated databases. BioCyc encyclopedias are unique in integrating a diverse range of data and providing a high level of curation. BioCyc Pathway/Genome Databases (PGDBs) are organized into tiers according to the amount of manual updating they have received:

  • Tier 1 PGDBs received intensive manual curation efforts, and are updated frequently
  • Tier 2 PGDBs were computationally generated and have received moderate manual curation
  • Tier 3 PGDBs were computationally generated and have received no manual curation

For a list of available databases, click here.

PGDBs describe the genome of an organism, as well as its biochemical pathways; for a small fraction of organisms regulatory networks are also present. New releases occur three times per year, often including new databases as well as updated data. Two members of the BioCyc collection, the EcoCyc and MetaCyc databases, are derived from more than three decades of literature-based curation of genome and pathway data. Other members of the BioCyc database collection include HumanCyc and BsubCyc.

The Pathway Tools software combines unparalleled breadth and user friendliness and includes a unique set of visualization tools to speed comprehension of PGDB data. For a full list of analysis and visualization tools please refer to the Pathway Tools Data Sheet or view the BioCyc overview video.

BioCyc Features

Pathway/Genome Web Portal for 20,000 Organisms

BioCyc has 20,000 databases including 70 curated databases. For a listing of databases, click here.

Each BioCyc PGDB describes the genome of an organism, as well as its biochemical pathways and, for a small fraction of organisms, its regulatory network.

Manually Curated Databases

  • EcoCyc: The data in this Escherichia coli K-12 MG1655 database have been gathered during two decades of literature-based curation from more than 43,000 articles. EcoCyc describes the metabolic, transport, and regulatory machinery of E. coli. EcoCyc curators have authored the equivalent of 3,600 textbook-pages of mini-review summaries for E. coli genes and pathways.
  • MetaCyc: Contains 3,006 metabolic pathways and 17,800 biochemical reactions from all domains of life. MetaCyc data and commentary were gathered from 71,000 publications to provide a comprehensive metabolic encyclopedia whose mini-review summaries encompass the equivalent of 10,079 textbook pages.

Computationally Generated Databases with some Manual Curation

  • YeastCyc: This highly curated metabolic network for Saccharomyces cerevisiae was curated from 47,000 publications.
  • HumanCyc: This database was derived from a computational pathway analysis of the human genome, followed by literature-based curation of human pathways and enzymes.
  • BsubCyc: The curated metabolic and regulatory networks for this Bacillus subtilis database cite 7,000 publications. Included are 180 regulatory genes controlling 1,200 regulated genes.
  • Mycobacterium tuberculosis, Corynebacterium glutamicum, Peptoclostridium difficile, Bacillus anthracis, Francisella tularensis, Helicobacter pylori, Vibrio cholerae, and others.

Computationally Generated Databases

Eukaryotic databases in BioCyc include Rattus norvegicus, Mus musculus, Caenorhabditis elegans, Drosophila melanogaster, and Arabidopsis thaliana.

All PGDBs include the genome, predicted metabolic pathways, and predicted pathway hole fillers (genes coding for missing enzymes in metabolic pathways). Bacterial PGDBs include predicted operons. The exact types of data present in different databases will vary. Many databases also include:

  • Protein subcellular locations, enzyme kinetics data, protein features, Gene Ontology terms, predicted Pfam domains
  • Curated regulatory information including promoters, operons, transcription-factor binding sites
  • Reaction atom mappings (map atoms from reactants to products)
  • Ortholog relationships to other BioCyc genomes, organism phenotypes
  • Database links

For an overview of BioCyc, visit http://bioinformatics.ai.sri.com/ptools/webinars/2020webinars/2020INTROBIOCYC01.mp4

A sample of the most popular BioCyc informatics tools:

  • Comprehensive search options (Genes, Proteins, RNAs, Compounds, Reactions, Pathways, Articles and more)
  • Omics Data Analysis
  • Genome Browser
  • Metabolic-map and regulatory-network viewers
  • Run Metabolic Model
  • Metabolic route search
  • BLAST and sequence pattern searches
  • Browse Pathway Ontology
  • Comparative Analysis
  • SmartTables

Curated Genome and Metabolic Pathway Data Derived from 130,000 Publications

BioCyc curators summarize and synthesize information from thousands of publications, saving scientists time in literature research, and integrating data for large-scale computational analyses.

BioCyc flow figure

BioCyc data include:

  • Genomes
  • Gene essentiality data
  • Regulatory networks
  • Gene Ontology terms
  • Orthologs
  • Protein features (e.g., enzyme active sites)
  • Protein functions
  • Metabolic pathways
  • Metabolites
  • Growth media and growth observations

Extensive Genome Informatics and Comparative Genomics Tools

BioCyc provides a powerful and comprehensive set of features for querying, visualization, and analysis of BioCyc data and user data. Some of our more popular tools are listed here.

Genome Browser Image

Genome Browser (above)

Genome browser depiction of a region of the E. coli chromosome.

Gene colors indicate operon organization. Promoters and terminators are depicted when known. Pseudogenes are marked with X’s.

Comparative Genome Browser Image

Comparative Genome Browser

Aligns genome regions around orthologous genes.

Regulatory Overview Image

Regulatory Network Viewer

The Regulatory Overview depicts the transcriptional regulatory network in a PGDB. Here, the E. coli regulatory network is highlighted to show genes that regulate the gntR gene (blue lines), and the genes that are regulated by gntR (purple lines). The inner two rings are populated by transcription factors and sigma factors; the outer ring contains other genes.

Omics Data Analysis Tools

Omics Viewer

Cellular Omics Viewer

The BioCyc Cellular Omics Viewer enables the user to paint omics datasets onto organism-specific metabolic network diagrams . Scientists can interpret high-throughput datasets in a pathway context, including animation of time-course or comparative datasets. Reaction lines can be colored with gene expression, proteomics, or reaction flux data; compound nodes can be colored with metabolomics data. Multi-omics data can be analyzed by coloring data onto reactions and metabolites simultaneously. Omics pop-ups graph omics data values using bar graphs, heat maps, or X-Y plots.

Omics Dashboard

Omics Dashboard

The BioCyc Omics Dashboard enables the user to rapidly survey how all cellular systems are responding to a given stimulus. Scientists can quickly view the response of genes within one or more specific systems of interest, and can gauge the relative activity levels of different cellular systems. The Dashboard also enables comparison of expression levels of a cellular system with those of its known regulators. The Dashboard consists of a set of panels, each representing a system of cellular function. Each panel shows a graph depicting omics data for each of a set of subsystems. Multiple timepoints or experimental conditions are plotted as separate data series within the graphs. Clicking on a panel drills down into its component subsystems.

Advanced Database Access

  • Extensive query tools available under Tools > Search menu
  • Author advanced queries: The Structured Advanced Query Form enables intuitive construction of database queries of SQL power using a Web-based interface
  • Extensive web services APIs provided
  • SmartTables enable table-based data analyses

Purchase BioCyc Subscription

A subscription is required for ongoing access to BioCyc.

Does your institution already have access? View our current list of institutional subscribers.

BioCyc Subscription FAQ