67
The Pathway/Genome Navigator (These slides are a guide as you experiment with the Navigator)

The Pathway/Genome Navigator

  • Upload
    jamar

  • View
    59

  • Download
    0

Embed Size (px)

DESCRIPTION

The Pathway/Genome Navigator. (These slides are a guide as you experiment with the Navigator). Pathway/Genome Navigator – topics. Getting started with the Navigator Organism pages Queries in general Object displays and queries Miscellaneous commands User preferences Lab exercises. - PowerPoint PPT Presentation

Citation preview

Page 1: The Pathway/Genome Navigator

The Pathway/Genome Navigator

(These slides are a guide as you experiment with the Navigator)

Page 2: The Pathway/Genome Navigator

Pathway/Genome Navigator – topics

Getting started with the NavigatorOrganism pagesQueries in generalObject displays and queriesMiscellaneous commandsUser preferencesLab exercises

Page 3: The Pathway/Genome Navigator

Introduction

Navigator runs both on the desktop and on the web

The desktop version generally has more capabilities, but each has unique features

Page 4: The Pathway/Genome Navigator

Desktop Mode -- Introduction

Page 5: The Pathway/Genome Navigator

Desktop Layout

One Large Window

Several Panes: Display pane Command menu LISP listener

Page 6: The Pathway/Genome Navigator

Desktop Menus

Main command menu

Single-choice menu

Multiple-choice menu (e.g. after a search)

Aborting out of menus Click Cancel or No Select Click outside the menu Type ^z

Page 7: The Pathway/Genome Navigator

Using the Mouse

Left mouse button: to invoke specific commands and for hypertext navigation

Right mouse button: to bring up menus of additional operations (for example, when editing a frame)

Middle mouse button: for very specialized uses (you probably won’t use it)

Mouse documentation line (shows what you’re over, what you can do)

Page 8: The Pathway/Genome Navigator

Organism Pages

All Organisms Page Organism grouping Summary of organisms

Single organism page Summary of organism stats

Notion of current organism Command mode queries Comparative analyses Clicking through links – organism continuity

Page 9: The Pathway/Genome Navigator

Queries with multiple answers

Results in form of a menu to: select one some all

Answer List Next Answer

Page 10: The Pathway/Genome Navigator

Indirect Queries

Related objects

Objects are clickable

Objects are color-coded by type

Page 11: The Pathway/Genome Navigator

History List

Backward history

Forward history

Select from history

Page 12: The Pathway/Genome Navigator

Writing Complex Queries

Web Search Advanced

Write queries in LISP Must understand features of schema

class names slot names

Pathway tools site has example searches Definitely learnable Can place results on the answer list

Page 13: The Pathway/Genome Navigator

Overview of Object Displays

Page 14: The Pathway/Genome Navigator

Shared Display Characteristics

Gene-Reaction schematic

Citations and comments

Database Links

Classes

Page 15: The Pathway/Genome Navigator

Gene-Reaction Schematic

Drawn in reaction, protein, and gene windows

Representations (ArgB) Genes are boxes on the right Proteins are circles in the middle – numbers show complexes Reactions in box on left, with E.C. number if available

Allows navigation between genes, proteins, rxnsLinks proteins with shared reactions

ArgDLinks members of protein complexes

Pol III – extreme example

Page 16: The Pathway/Genome Navigator

Citations and Comments

Citations in mnemonic form

Click on citation – go to citations at bottom of page

Click there, go to PubMed ref, if available

Page 17: The Pathway/Genome Navigator

Database Links

Unification links (info about the protein elsewhere)

PDB PIR RefSeq UniProt Gene page: For coli, we added links to coli-specific sites

Relationship links: PDB-Homolog-P34554

Page 18: The Pathway/Genome Navigator

Class Hierarchies

Reactions Enzyme-nomenclature system (full EC system in MetaCyc

only)

Proteins Gene Ontology terms are assigned to proteins Can also assign MultiFun terms

Compounds

Pathways

Page 19: The Pathway/Genome Navigator

Menu Categories

PathwayReactionProtein(RNA)GeneCompound

Page 20: The Pathway/Genome Navigator

Pathway Mode Commands

Search by pathway name

Search by substring

Search by class

Search by substrates (can pick role in pathway)

Page 21: The Pathway/Genome Navigator

What’s in a pathway frame? Go to arginine biosynthesis I (from ArgD)

Intermediates and reactions

Can toggle level of detail

Feedback regulation can be shown

Locations of mapped genes

Genetic regulation schematic

Note presence of comments, citations, class hierarchy

Page 22: The Pathway/Genome Navigator

Reaction Mode Commands

Search by reaction name

Search by E.C. #

Search by class (another E.C. interface)

Search by pathway

Search by substrates

Page 23: The Pathway/Genome Navigator

What’s in a reaction frame?

Search by EC for 2.6.1.11 (pick one)

Picture of reaction with clickable compounds

Pathways the reaction is involved in

Place in class hierarchy

Enzymes carrying out reaction (note schematic)

Page 24: The Pathway/Genome Navigator

Protein Mode Commands

Search by protein nameSearch by substringSearch by pathwaySearch by organism (MetaCyc)Search by UniProt AccSearch by GO termSearch by MultiFun termSearch by Weight, pISearch by modulation of activity

Page 25: The Pathway/Genome Navigator

What’s in a protein frame?

Sample frame (ArgD)

Synonyms, general features, comments

Unification links, gene-reaction schematic

GO terms

Enzymatic reaction frames – how this protein carries out that reaction (bridging the two)

Evidence codes

Page 26: The Pathway/Genome Navigator

Gene Mode Commands

Search by gene name (can also put in TU IDs)

Search by substring

Get gene by class

Basically the same for RNAs

Page 27: The Pathway/Genome Navigator

What’s in a gene frame?

Sample frame (argC)

Synonyms, classification (GO), link to browser

Unification links, gene-reaction schematic

Regulation schematic

Gene local context and TUs

Page 28: The Pathway/Genome Navigator

What’s in a TU frame?

Sample frame (argCBH)

Genes in context, with TFs

Promoter with start site and citations

TF binding sites, with citations

Regulatory interactions (ilvL attenuator in TU524)

Page 29: The Pathway/Genome Navigator

Compound Mode Commands

Search compound by name

Search compound by substring

Search by SMILES (structure)

Search by class

Advanced search

Page 30: The Pathway/Genome Navigator

The SMILES Language

Simplified Molecular Input Line Entry SystemFormal language for describing chemical

structuresUsed within the Pathway Tools in a substructure

searchCase is significant (lowercase for aromatic rings)Examples:

formate C(=O)Omalate OC(=O)CC(O)C(O)=O

For more information, see the Help facility

Page 31: The Pathway/Genome Navigator

What’s in a compound frame?

Sample (N-acetylglutamyl-phosphate)

Synonyms, empirical formula, MW, links

Structure (you can add these in editors)

SMILES code

Pathways and reactions involving this compound

Page 32: The Pathway/Genome Navigator

Miscellaneous Commands

History commandsAnswer-List commandsClone window commandFix window and unfix window commandsOther commands:

Print to file (makes a postscript) Help Preferences Exit

Page 33: The Pathway/Genome Navigator

User Preferences

ColorLayoutCompound windowReaction windowPathway windowHistory/Answer listReverting and saving user preferences

Page 34: The Pathway/Genome Navigator

Lab Exercises – Desktop Version

Page 35: The Pathway/Genome Navigator

Lab Exercises

Set up personal preferences for:

Color

Layout (set number of windows to 2)Save new preferencesPlay with settings for Compound, Reaction,

Pathway, and Overview windows.Choose settings for History/Answer List

preferences

Page 36: The Pathway/Genome Navigator

Desktop Lab Exercises

Retrieve compounds containing a formate groupRetrieve compounds adenine and uracil using

class query Retrieve reaction with EC# 5.3.1.9Retrieve all reactions in the class

sulfurtransferasesRetrieve all reactions involved in proline

biosynthesisRetrieve all reactions where glutamate appears on

left sidegenes coding for enzymes involved in the

degradation of short-chain fatty acids

Page 37: The Pathway/Genome Navigator

Desktop Lab Exercises

Retrieve all enzymes involved in purine biosynthesis

Retrieve all kinasesDisplay region spanning from 10 % - 20 % of E.

coli chromosome Display chromosomal region around gene aroADisplay a map showing all chaperone genes

Page 38: The Pathway/Genome Navigator

Desktop Lab Exercises

Retrieve all chaperone genesRetrieve gene aroAFind the glutamine biosynthesis pathway by

issuing each of the three types of queries in Pathway mode.

Page 39: The Pathway/Genome Navigator

Desktop Lab Exercises

Clone windowNavigate in the cloned windowSet preferences so Navigator displays 2 windowsNavigate by clicking on live objectsFix WindowNavigate in unfixed windowFix second window and then click on live object

Page 40: The Pathway/Genome Navigator

The Web Interface: BioCyc

Ron Caspi

Page 41: The Pathway/Genome Navigator

Help is One Click Away!

Page 42: The Pathway/Genome Navigator

The Web Account System

Page 43: The Pathway/Genome Navigator

The Web Account System

Creating a web account enables you to:

Save Object Groups

Define page formatting preferences

Define Overview layout preferences

Save organism groups for comparative analysis

Page 44: The Pathway/Genome Navigator

Save Organism Groups with Web Accounts

Note the My Lists tab on the multi-organism selector for comparative analyses.

When you perform comparative analyses, you can easily save groups of organisms for re-using at a later time.

Page 45: The Pathway/Genome Navigator

Define a Favorite Database with Web Accounts

If you create a web

account, you can define

a favorite database that

will be opened by

default when you login

Page 46: The Pathway/Genome Navigator

Searching the Database

Page 47: The Pathway/Genome Navigator

Why the Need for Dedicated Search Tools

Search BioCyc for “L-arginine”

2080 results

Need to have specific tools for finding exactly

what we search for.

Page 48: The Pathway/Genome Navigator

BioCyc Searches

Multiple searches available for finding information in different ways

The easiest searches to use are fairly coarse

Start by selecting database to search

Simplest search: Quick Search At upper right of most pages

Page 49: The Pathway/Genome Navigator

Selecting the Database

You can only search one database at a time*!

* With the exception of Google searches

Click on word “change” under Search menu or under Quick

Search button

In resulting selector, choose a PGDB

Start typing a word in organism name

Click on letter to navigate to organisms starting with that letter

Click a frequently used PGDB Select by Taxonomy

All subsequent searches will apply to that database

Page 50: The Pathway/Genome Navigator

The Quick Search Box What can you type here:

Gene names (dnaA ) Compound name (L-lysine) Pathway name (peptidoglycan biosynthesis) Reaction name (lysine decarboxylase) Protein name (peptidase) EC number (1.3.1.26) Organism name (Escherichia coli) Frame ID (CPLX-8024) GO term (0006086) Links to other databases (O33998) An exact term using the format (Peptidase D search:exact) Limited term (hydrogen type:compound)

What doesn’t work: Exact text using the Google format (“peptidase D”)

Page 51: The Pathway/Genome Navigator

Quick Search Results

Results are divided into multiple categories

Page 52: The Pathway/Genome Navigator

Examples of searches performed by users of the BioCyc website:

Successful Unsuccessful Ascorbate pheV EC 3.4.17.5 Transmembrane helix

6 Sigma factor 3.4.24.B11 Polysulfide reductase ABC cobalt transporter Entner-Doudoroff pathway affinity of DnaACyanobacteria DnaA

A simple auto-correction mechanism tries to correct typos. For example, searching for “sacrosine” will find “sarcosine”.

Page 53: The Pathway/Genome Navigator

Quick Gene Search

Useful when only interested in genes.

For example, compare the results when searching for “dnaA” by using the Quick Search and Gene Search buttons.

Page 54: The Pathway/Genome Navigator

The Search Menu

Search Menu Object-specific searches Advanced search Ontologies search Google search BLAST search Search of full-text articles (EcoCyc only)

Page 55: The Pathway/Genome Navigator

Google This Site

The BioCyc site is indexed by Google

You can launch a Google text search from:

1. Search → Google This Site

2. The alternative searches box that appears on Quick Search results pages

Page 56: The Pathway/Genome Navigator

Object-Specific Searches

The first four items in the search menu provide a medium-level search interface against single types of objects

Use of filtering

Click on triangles at the left to expand or hide filters

Note that if a filter is hidden it will not be used in a

search

Page 57: The Pathway/Genome Navigator

Compound Search

List All buttons – quick way to get complete listsExamples for compound searching:

Page 58: The Pathway/Genome Navigator

Search Genes/Proteins/RNAs

List All buttons – quick way to get complete listsExtensive filtering options

Page 59: The Pathway/Genome Navigator

Search Pathways

Page 60: The Pathway/Genome Navigator

Advanced Search

The BioVelo query languageSAQP: Structured Advanced Query Page

Permits the definition of complex searches without mastering BioVelo.

To learn more about the advanced query interface, see online

documentation.

Page 61: The Pathway/Genome Navigator

Sequence Search by BLAST

Nothing unusual here – a regular BLAST interface that permits BLASTing sequences against BioCyc PGDBs.

The results are linked to the PGDB gene/protein pages

Page 62: The Pathway/Genome Navigator

The desktop version of Pathway Tools allows definition of growth media, gene knockout growth information, and growth data for phenotype microarray plates.

Growth Media and Phenotype

Page 63: The Pathway/Genome Navigator

EcoCyc-Specific Searches: Growth MediaSearch for growth media based on:

namecompounds presentcompounds not presentobserved growth

Page 64: The Pathway/Genome Navigator

EcoCyc-Specific Searches: Textpresso

Mining E. coli literature poses special challenges – because almost every molecular biology paper references E. coli

The solution – EcoCyc Textpresso! An E. coli only collection of literature

30,000 full-text articles and 6,500 abstracts.

Full text literature searches

Results presented at bottom of page

Page 65: The Pathway/Genome Navigator

Web Version Lab Exercises

Page 66: The Pathway/Genome Navigator

Web Lab Exercises

In EcoCyc:

Retrieve gene aroARetrieve all kinasesRetrieve all chaperone genes using two different

approachesRetrieve all membrane proteins that cite an article

by GrossRetrieve all genes whose product interacts with

ppGpp as a regulator or cofactor

Page 67: The Pathway/Genome Navigator

Web Lab Exercises

In MetaCyc:

Retrieve reaction with EC# 5.3.1.9Retrieve compounds adenine and uracil using

ontology search Retrieve all reactions involving glutamate as a

substrateFind genes coding for enzymes involved in the

degradation of short-chain fatty acidsFind all pathways involved in degradation of

arginine